High-performance Python web archive replay stack (WARC) used by Webrecorder and many institutions for Wayback-style access.
web-archivingwarcwaybackcrawlreplay
Filter by platform, license text, maturity, maintenance cadence, and editorial tags like privacy-focused or self-hosted. Search matches names, summaries, tags, and use cases.
2 tools match your filters
High-performance Python web archive replay stack (WARC) used by Webrecorder and many institutions for Wayback-style access.
Extensible, web-scale, archival-quality crawler produced by the Internet Archive for capturing sites into WARC files.