Nothing Special   »   [go: up one dir, main page]

skip to main content
10.1109/JCDL57899.2023.00035acmconferencesArticle/Chapter ViewAbstractPublication PagesjcdlConference Proceedingsconference-collections
research-article

FastCat Catalogues: Interactive Entity-Based Exploratory Analysis of Archival Documents

Published: 04 September 2024 Publication History

Abstract

We describe FastCat Catalogues, a web application that supports researchers studying archival material, such as historians, in exploring and quantitatively analysing the data (transcripts) of archival documents. The application was designed based on real information needs provided by a large group of researchers, makes use of JSON technology, and is configurable for use over any type of archival documents whose contents have been transcribed and exported in JSON format. The supported functionalities include a) source- or record-specific entity browsing, b) source-independent entity browsing, c) data filtering, d) inspection of provenance information, e) data aggregation and visualisation in charts, f) table and chart data export for further (external) analysis. The application is provided as open source and is currently used by historians in maritime history research.

References

[1]
Vassilis Christophides, Vasilis Efthymiou, Themis Palpanas, George Papadakis, and Kostas Stefanidis. 2020. An overview of end-to-end entity resolution for big data. ACM Computing Surveys (CSUR) 53, 6 (2020), 1--42.
[2]
Apostolos Delis. 2020. Seafaring Lives at the crossroads of Mediterranean maritime history. International Journal of Maritime History 32, 2 (2020), 464--478.
[3]
Pavlos Fafalios, Kostas Petrakis, Georgios Samaritakis, Korina Doerr, Athina Kritsotaki, Yannis Tzitzikas, and Martin Doerr. 2021. FAST CAT: collaborative data entry and curation for semantic interoperability in digital humanities. Journal on Computing and Cultural Heritage (JOCCH) 14, 4 (2021), 1--20.
[4]
Pavlos Fafalios, Georgios Samaritakis, Kostas Petrakis, Korina Doerr, Athina Kritsotaki, Anastasia Axaridou, and Martin Doerr. 2022. Building and Exploring a Semantic Network of Maritime History Data. In Mediterranean Seafarers in Transition. Brill, 509--535.
[5]
Ashleigh Hawkins. 2022. Archives, linked data and the digital humanities: increasing access to digitised and born-digital archives via the semantic web. Archival Science 22, 3 (2022), 319--344.
[6]
Vangelis Kritsotakis, Yannis Roussakis, Theodore Patkos, and Maria Theodoridou. 2018. Assistive Query Building for Semantic Data. In SEMANTICS Posters&Demos.
[7]
Richard Marciano, Victoria Lemieux, Mark Hedges, Maria Esteva, William Underwood, Michael Kurtz, and Mark Conrad. 2018. Archival records and training in the age of big data. In Re-Envisioning the MLS: Perspectives on the future of library and information science education. Emerald Publishing Limited.
[8]
Albert Meroño-Peñuela, Ashkan Ashkpour, Marieke Van Erp, Kees Mandemakers, Leen Breure, Andrea Scharnhorst, Stefan Schlobach, and Frank Van Harmelen. 2015. Semantic technologies for historical research: A survey. Semantic Web 6, 6 (2015), 539--564.
[9]
Dominic Oldman, Martin Doerr, and Stefan Gradmann. 2016. Zen and the art of Linked Data: new strategies for a Semantic Web of humanist knowledge. (2016).
[10]
Dominic Oldman and Diana Tanase. 2018. Reshaping the knowledge graph by connecting researchers, data and practices in ResearchSpace. In International Semantic Web Conference. Springer, 325--340.
[11]
Kostas Petrakis, Georgios Samaritakis, Thomas Kalesios, Enric Garcia i Domingo, Apostolos Delis, Yannis Tzitzikas, Martin Doerr, and Pavlos Fafalios. 2020. Digitizing, Curating and Visualizing Archival Sources of Maritime History: the case of ship logbooks of the nineteenth and twentieth centuries. Drassana: revista del Museu Marítim 28 (2020), 60--87.
[12]
Kim Pham, Fernando Reyes, and Jeff Rynhart. 2020. Building a Library Search Infrastructure with Elasticsearch. Code4Lib Journal 48 (2020).
[13]
Marc J Ventresca and John W Mohr. 2017. Archival research methods. The Blackwell companion to organizations (2017), 805--828.

Index Terms

  1. FastCat Catalogues: Interactive Entity-Based Exploratory Analysis of Archival Documents

      Recommendations

      Comments

      Please enable JavaScript to view thecomments powered by Disqus.

      Information & Contributors

      Information

      Published In

      cover image ACM Conferences
      JCDL '23: Proceedings of the 2023 ACM/IEEE Joint Conference on Digital Libraries
      June 2024
      352 pages
      ISBN:9798350399318

      Sponsors

      Publisher

      IEEE Press

      Publication History

      Published: 04 September 2024

      Check for updates

      Author Tags

      1. archival research
      2. archival data search
      3. exploratory data analysis
      4. archival data browsing
      5. entity-based archival search

      Qualifiers

      • Research-article

      Conference

      JCDL '23
      Sponsor:
      JCDL '23: 2023 ACM/IEEE Joint Conference on Digital Libraries
      June 26 - 30, 2024
      New Mexico, Santa Fe, USA

      Acceptance Rates

      Overall Acceptance Rate 415 of 1,482 submissions, 28%

      Upcoming Conference

      JCDL '24
      The 2024 ACM/IEEE Joint Conference on Digital Libraries
      December 16 - 20, 2024
      Hong Kong , China

      Contributors

      Other Metrics

      Bibliometrics & Citations

      Bibliometrics

      Article Metrics

      • 0
        Total Citations
      • 2
        Total Downloads
      • Downloads (Last 12 months)2
      • Downloads (Last 6 weeks)2
      Reflects downloads up to 01 Oct 2024

      Other Metrics

      Citations

      View Options

      Get Access

      Login options

      View options

      PDF

      View or Download as a PDF file.

      PDF

      eReader

      View online with eReader.

      eReader

      Media

      Figures

      Other

      Tables

      Share

      Share

      Share this Publication link

      Share on social media