OpenCatalog, curated by FLOSSK

Archiving & digital preservation

Trustworthy long-term stewardship: OAIS-style processing, PREMIS/METS, BagIt transfer, finding aids, GLAM publishing, institutional and data repositories, and web archiving (WARC). Prefer tools with clear standards alignment and documented exit paths from proprietary hosts.

Tools in this category (16)

End-to-end digital preservation workflow: ingest, virus scan, normalization, METS/PREMIS metadata, AIP storage, and DIP access packages.

digital-preservation, oais, premis, mets, workflow

Multisite web platform for scholarly and cultural collections: linked open data, resource templates, modules, and IIIF-friendly patterns.

collections, exhibits, glam, web-publishing, metadata

Institutional repository platform for research outputs, theses, and datasets, with OAI-PMH harvesting and the Angular UI of DSpace 7+.

repository, open-access, oai-pmh, research, university
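
The OAI-PMH harvesting mentioned for this repository platform can be illustrated with a minimal sketch. It builds a `ListIdentifiers` request URL and pulls record identifiers and the resumption token out of a response. The base URL, the `SAMPLE` response, and the helper names `build_request` and `parse_list_identifiers` are assumptions made for illustration; only the verb, the `metadataPrefix` argument, and the XML namespace come from the OAI-PMH 2.0 protocol.

```python
from urllib.parse import urlencode
import xml.etree.ElementTree as ET

# Namespace defined by the OAI-PMH 2.0 protocol.
OAI_NS = {"oai": "http://www.openarchives.org/OAI/2.0/"}


def build_request(base_url, verb, **params):
    """Return an OAI-PMH request URL for the given verb and arguments."""
    query = {"verb": verb, **params}
    return f"{base_url}?{urlencode(query)}"


def parse_list_identifiers(xml_text):
    """Return (identifiers, resumption_token) from a ListIdentifiers response."""
    root = ET.fromstring(xml_text)
    ids = [el.text for el in root.findall(".//oai:header/oai:identifier", OAI_NS)]
    token_el = root.find(".//oai:resumptionToken", OAI_NS)
    # An empty or absent token means the list is complete.
    token = token_el.text if token_el is not None and token_el.text else None
    return ids, token


# Hypothetical, heavily trimmed response used only to exercise the parser.
SAMPLE = """<OAI-PMH xmlns="http://www.openarchives.org/OAI/2.0/">
  <ListIdentifiers>
    <header><identifier>oai:example.org:1</identifier></header>
    <header><identifier>oai:example.org:2</identifier></header>
    <resumptionToken>page-2</resumptionToken>
  </ListIdentifiers>
</OAI-PMH>"""

if __name__ == "__main__":
    # Hypothetical endpoint; substitute the repository's real OAI base URL.
    print(build_request("https://repo.example.org/oai", "ListIdentifiers",
                        metadataPrefix="oai_dc"))
    print(parse_list_identifiers(SAMPLE))
```

A harvester would repeat the request with `resumptionToken=<token>` until no token is returned.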

Linked data–capable digital object repository (API-X, versioning, fixity) often paired with Samvera/Hyrax for scholarly preservation.

repository, linked-data, digital-preservation, api

Samvera Rails application providing deposit, workflow, discovery, and admin UI on top of Fedora repositories.

repository, samvera, fedora, rails, digital-library

Turnkey research data management repository (Zenodo lineage): records, DOIs, OAI-PMH, permissions, and customizable deposit forms.

repository, research-data, doi, open-science

Java desktop GUI from the Library of Congress for building valid BagIt packages with human-friendly validation feedback.

bagit, gui, transfer, packaging
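
For readers unfamiliar with what a BagIt package contains, the following is a minimal sketch using only the Python standard library. It is not how the desktop tool above works internally. It writes the `bagit.txt` declaration, a `data/` payload directory, and a `manifest-sha256.txt` payload manifest, then re-checks the manifest. The function names `make_bag` and `validate_bag` are hypothetical, and the check is deliberately simplified: it ignores tag manifests and does not flag payload files missing from the manifest.

```python
import hashlib
import shutil
from pathlib import Path


def sha256_of(path):
    """Return the SHA-256 hex digest of a file, read in chunks."""
    digest = hashlib.sha256()
    with open(path, "rb") as fh:
        for chunk in iter(lambda: fh.read(65536), b""):
            digest.update(chunk)
    return digest.hexdigest()


def make_bag(source_dir, bag_dir):
    """Copy source_dir into bag_dir/data and write the BagIt tag files."""
    bag = Path(bag_dir)
    data = bag / "data"
    shutil.copytree(source_dir, data)  # also creates bag_dir if needed
    (bag / "bagit.txt").write_text(
        "BagIt-Version: 1.0\nTag-File-Character-Encoding: UTF-8\n",
        encoding="utf-8",
    )
    lines = []
    for path in sorted(p for p in data.rglob("*") if p.is_file()):
        rel = path.relative_to(bag).as_posix()  # e.g. data/report.txt
        lines.append(f"{sha256_of(path)}  {rel}\n")
    (bag / "manifest-sha256.txt").write_text("".join(lines), encoding="utf-8")
    return bag


def validate_bag(bag_dir):
    """Return payload paths that are missing or whose checksum differs."""
    bag = Path(bag_dir)
    failures = []
    manifest = (bag / "manifest-sha256.txt").read_text(encoding="utf-8")
    for line in manifest.splitlines():
        expected, rel = line.split(maxsplit=1)
        target = bag / rel
        if not target.is_file() or sha256_of(target) != expected:
            failures.append(rel)
    return failures
```

An empty list from `validate_bag` means every file listed in the manifest is present and unchanged.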

Artefactual command-line tool to schedule and report checksum audits against storage locations; pairs with Archivematica storage.

fixity, checksum, audit, storage, preservation
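
The idea behind a checksum (fixity) audit can be sketched in a few lines. This is a generic illustration, not the implementation of the tool listed above: it records a baseline of digests for a directory and later reports which files are unchanged, changed, missing, or unexpected. The names `record_baseline` and `audit`, and the shape of the report, are assumptions chosen for the example.

```python
import hashlib
from pathlib import Path


def file_digest(path, algorithm="sha256"):
    """Return the hex digest of a file using the named hashlib algorithm."""
    digest = hashlib.new(algorithm)
    with open(path, "rb") as fh:
        for chunk in iter(lambda: fh.read(65536), b""):
            digest.update(chunk)
    return digest.hexdigest()


def record_baseline(root):
    """Map each file's path (relative to root) to its current digest."""
    root = Path(root)
    return {
        p.relative_to(root).as_posix(): file_digest(p)
        for p in sorted(root.rglob("*"))
        if p.is_file()
    }


def audit(root, baseline):
    """Compare the files under root against a previously recorded baseline."""
    current = record_baseline(root)
    report = {"ok": [], "changed": [], "missing": [], "unexpected": []}
    for rel, expected in baseline.items():
        if rel not in current:
            report["missing"].append(rel)
        elif current[rel] != expected:
            report["changed"].append(rel)
        else:
            report["ok"].append(rel)
    # Files present now that were never recorded in the baseline.
    report["unexpected"] = sorted(set(current) - set(baseline))
    return report
```

In practice the baseline would be stored separately from the audited storage, so that corruption of one does not silently affect the other.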

High-performance Python web archive replay stack (WARC) used by Webrecorder and many institutions for Wayback-style access.

web-archiving, warc, wayback, crawl, replay

Extensible, web-scale, archival-quality crawler produced by the Internet Archive for capturing sites into WARC files.

web-archiving, crawler, warc, harvesting
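
To make the WARC files mentioned above less abstract, here is a small sketch of the record layout: a version line, named header fields, a blank line, the content block, and two trailing CRLFs. It serializes a single uncompressed `resource` record and parses it back. The helper names `build_resource_record` and `parse_record` are hypothetical, and real crawlers and replay tools rely on dedicated WARC libraries, gzip-compress each record, and write several record types per capture.

```python
import uuid
from datetime import datetime, timezone


def build_resource_record(target_uri, payload, content_type="text/plain"):
    """Serialize one uncompressed WARC 'resource' record as bytes."""
    headers = [
        ("WARC-Type", "resource"),
        ("WARC-Record-ID", f"<urn:uuid:{uuid.uuid4()}>"),
        ("WARC-Date", datetime.now(timezone.utc).strftime("%Y-%m-%dT%H:%M:%SZ")),
        ("WARC-Target-URI", target_uri),
        ("Content-Type", content_type),
        ("Content-Length", str(len(payload))),  # length of the block in bytes
    ]
    head = "WARC/1.0\r\n" + "".join(f"{k}: {v}\r\n" for k, v in headers) + "\r\n"
    # A record ends with two CRLFs after the content block.
    return head.encode("utf-8") + payload + b"\r\n\r\n"


def parse_record(raw):
    """Split a single record into (version, header fields, content block)."""
    head, _, rest = raw.partition(b"\r\n\r\n")
    lines = head.decode("utf-8").split("\r\n")
    version = lines[0]
    fields = dict(line.split(": ", 1) for line in lines[1:])
    length = int(fields["Content-Length"])
    return version, fields, rest[:length]


if __name__ == "__main__":
    record = build_resource_record("https://example.org/", b"hello archive")
    version, fields, body = parse_record(record)
    print(version, fields["WARC-Type"], body)
```

Using `Content-Length` to slice the block, rather than searching for a delimiter, is what lets a record carry arbitrary binary content.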

Harvard-led research data repository: datasets, files, citations, DOI workflows, and granular permissions for institutions.

research-data, repository, doi, datasets, university