Nothing Special   »   [go: up one dir, main page]

skip to main content
10.1145/3197026.3203880acmconferencesArticle/Chapter ViewAbstractPublication PagesjcdlConference Proceedingsconference-collections
poster
Public Access

ArchiveNow: Simplified, Extensible, Multi-Archive Preservation

Published: 23 May 2018 Publication History

Abstract

ArchiveNow is a Python module for preserving web pages in on-demand web archives. This module allows a user to submit a URI of a web page for archiving at several configured web archives. Once the web page is captured, ArchiveNow provides the user with links to the archived copies of the web page. ArchiveNow is initially configured to use four archives but is easily configurable to add or remove other archives. In addition to pushing web pages to public archives, ArchiveNow, through the use of Wget and Squidwarc, allows users to generate local WARC files, enabling them to create their own personal and private archives.

References

[1]
Mohamed Aturban . 2017. Archivenow - A Tool To Push Web Resources Into Web Archives. https://github.com/oduwsdl/archivenow. (February . 2017).
[2]
John Berlin . 2017. Squidwarc - A high fidelity archival crawler that uses Chrome or Chrome Headless. https://github.com/N0taN3rd/Squidwarc. (July . 2017).
[3]
Free Software Foundation . 2013. GNU Wget - Introduction to GNU Wget. https://www.gnu.org/software/wget/. (2013).
[4]
International Internet Preservation Consortium (IIPC) . 2005. OpenWayback. https://github.com/iipc/openwayback/wiki. (October . 2005).
[5]
ISO 28500 . 2009. WARC (Web ARChive) file format. http://www.digitalpreservation.gov/formats/fdd/fdd000236.shtml. (August . 2009).
[6]
Mat Kelly, Michael L. Nelson, and Michele C. Weigle . 2014. Mink: Integrating the Live and Archived Web Viewing Experience Using Web Browsers and Memento . In Proceedings of JCDL. 469--470.
[7]
Mat Kelly and Michele C Weigle . 2012. WARCreate: Create Wayback-Consumable WARC Files from Any Webpage Proceedings of JCDL. 437--438.
[8]
Ilya Kreymer . 2013. PyWb - Web Archiving Tools for All. https://github.com/ikreymer/pywb. (December . 2013).
[9]
Ilya Kreymer . 2015. Webrecorder - a web archiving platform and service for all. https://webrecorder.io. (2015).
[10]
Ben Welsh . 2016. PastPages. https://github.com/pastpages. (2016).

Cited By

View all
  • (2022)Representing COVID-19 information in collaborative knowledge graphs: The case of WikidataSemantic Web10.3233/SW-21044413:2(233-264)Online publication date: 3-Feb-2022
  • (2022)A Chromium-Based Memento-Aware Web BrowserLinking Theory and Practice of Digital Libraries10.1007/978-3-031-16802-4_12(147-160)Online publication date: 15-Sep-2022
  • (2021)HypercaneACM SIGWEB Newsletter10.1145/3473044.34730472021:Summer(1-14)Online publication date: 12-Oct-2021

Index Terms

  1. ArchiveNow: Simplified, Extensible, Multi-Archive Preservation

      Recommendations

      Comments

      Please enable JavaScript to view thecomments powered by Disqus.

      Information & Contributors

      Information

      Published In

      cover image ACM Conferences
      JCDL '18: Proceedings of the 18th ACM/IEEE on Joint Conference on Digital Libraries
      May 2018
      453 pages
      ISBN:9781450351782
      DOI:10.1145/3197026
      Permission to make digital or hard copies of part or all of this work for personal or classroom use is granted without fee provided that copies are not made or distributed for profit or commercial advantage and that copies bear this notice and the full citation on the first page. Copyrights for third-party components of this work must be honored. For all other uses, contact the Owner/Author.

      Sponsors

      Publisher

      Association for Computing Machinery

      New York, NY, United States

      Publication History

      Published: 23 May 2018

      Check for updates

      Author Tags

      1. memento
      2. warc
      3. web archiving

      Qualifiers

      • Poster

      Funding Sources

      Conference

      JCDL '18
      Sponsor:

      Acceptance Rates

      JCDL '18 Paper Acceptance Rate 26 of 71 submissions, 37%;
      Overall Acceptance Rate 415 of 1,482 submissions, 28%

      Upcoming Conference

      JCDL '24
      The 2024 ACM/IEEE Joint Conference on Digital Libraries
      December 16 - 20, 2024
      Hong Kong , China

      Contributors

      Other Metrics

      Bibliometrics & Citations

      Bibliometrics

      Article Metrics

      • Downloads (Last 12 months)59
      • Downloads (Last 6 weeks)17
      Reflects downloads up to 20 Nov 2024

      Other Metrics

      Citations

      Cited By

      View all
      • (2022)Representing COVID-19 information in collaborative knowledge graphs: The case of WikidataSemantic Web10.3233/SW-21044413:2(233-264)Online publication date: 3-Feb-2022
      • (2022)A Chromium-Based Memento-Aware Web BrowserLinking Theory and Practice of Digital Libraries10.1007/978-3-031-16802-4_12(147-160)Online publication date: 15-Sep-2022
      • (2021)HypercaneACM SIGWEB Newsletter10.1145/3473044.34730472021:Summer(1-14)Online publication date: 12-Oct-2021

      View Options

      View options

      PDF

      View or Download as a PDF file.

      PDF

      eReader

      View online with eReader.

      eReader

      Login options

      Media

      Figures

      Other

      Tables

      Share

      Share

      Share this Publication link

      Share on social media