- The Digital Preservation Coalition
- UK
- anjackson.net
- https://orcid.org/0000-0001-8168-0797
- @anjacks0n
- @anj@digipres.club
the-turing-way Public
Forked from the-turing-way/the-turing-way
Host repository for The Turing Way: a how to guide for reproducible data science
TeX Other Updated Oct 23, 2024
digipres.github.io Public
Forked from digipres/digipres.github.io
Auto-generated static web site digipres.org
Jupyter Notebook Updated Oct 19, 2024
wikipedia-sopa-blackout Public
A web archive of the Wikipedia homepage during the 2012 SOPA Blackout
HTML Updated Aug 22, 2024
digipres-notebook Public
Open notebook for digital preservation stuff hosted by GitBook
1 Updated May 24, 2024
awesome-digital-preservation Public
Forked from digipres/awesome-digital-preservation
Carefully curated list of awesome digital preservation resources.
JavaScript Creative Commons Zero v1.0 Universal Updated May 24, 2024
datasette-lite Public
Forked from GLAM-Workbench/datasette-lite
Datasette running in your browser using WebAssembly and Pyodide
HTML Apache License 2.0 Updated Jan 30, 2024
awesome-web-archiving Public
Forked from iipc/awesome-web-archiving
An Awesome List for getting started with web archiving
Creative Commons Zero v1.0 Universal Updated Jan 18, 2024
ipywardley Public
Bringing Wardley Map magic to Jupyter notebooks
outbackcdx Public
Forked from nla/outbackcdx
Web archive index server based on RocksDB
Java Apache License 2.0 Updated Oct 5, 2023
warcit Public
Forked from webrecorder/warcit
Convert Directories, Files and ZIP Files to Web Archives (WARC)
Python Apache License 2.0 Updated Oct 3, 2023
golem Public
Experimental crawler using Scrapy and Selenium
ukwa-monitor Public
Forked from ukwa/ukwa-monitor
Dashboard and monitoring system for the UK Web Archive
Python Updated Jul 3, 2023
browsertrix-crawler Public
Forked from webrecorder/browsertrix-crawler
Run a high-fidelity browser-based crawler in a single Docker container
JavaScript GNU Affero General Public License v3.0 Updated May 25, 2023
sphinx-comments Public
Forked from executablebooks/sphinx-comments
hypothes.is interaction layer with Sphinx
Python MIT License Updated May 3, 2023
rclone-trials Public
Experimenting with Rclone and how it works with HDFS
Shell Updated Apr 28, 2023
timewarp Public
Making it easier to browse the past.
cdx-db Public
Generating Parquet files containing CDX data for SQL queries
using-ffmpeg Public
Containerised ffmpeg and example Jupyter notebooks.
Jupyter Notebook GNU Affero General Public License v3.0 Updated Dec 8, 2022
scrapy-url-frontier Public
A Scrapy module for URL Frontier integration
ukwa-manage Public
Forked from ukwa/ukwa-manage
Shepherding our web archives from crawl to access.
Jupyter Notebook Apache License 2.0 Updated Sep 22, 2022
ebook-test-manifests Public
Forked from UniversalViewer/ebook-test-manifests
HTML Updated Sep 22, 2022
ukwa-services Public
Forked from ukwa/ukwa-services
Deployment configuration for all UKWA services stacks.
Python Apache License 2.0 Updated Jul 18, 2022