Skip to main content

Awesome list dedicated to digital and data preservation tools, sources, services and so on.

36
GitHub Stars
96
Curated Resources
19
Categories
1 hour ago
Last Refreshed
Web ArchivingSocial NetworksOther Digital ObjectsPreservation Frameworks & PlatformsPreservation Storage & ReplicationFile Format Identification & ValidationFixity & PackagingEmulation & VirtualizationAudio-Visual & Media PreservationDigital Forensics & IngestMetadata Standards & ToolsPreservation Planning, Risk & PolicyTraining, Education & Best PracticesConferences, Events & CommunitiesStandards and SpecificationsOrganizationsKnowledge BasesMajor Digital ArchivesRelated Lists

Use this list with your AI agent

Add the Context Awesome MCP server to Claude, Cursor, or any MCP client, then ask:

"Show me public data api resources from awesome-digital-preservation"

Installation instructions →

What's inside

Other Digital Objects

  • apibackuperPublic Data API

    Backs up API calls with a Python library and CLI.

  • filegetterOnline Storage

    Collects files from public data sources using URL patterns.

  • spcrawlerSpecific CMS

    Backs up SharePoint public installations via API.

  • Telegram-ArchiveMessengers & Chats

    Docker-based tool to export and archive complete Telegram chat histories including media, preserving messages in structured formats.

  • tgarcMessengers & Chats

    Archives Telegram JSON.

  • wparcSpecific CMS

    Archives WordPress API data and files.

Web Archiving

  • ArchiveBoxCapture Operators & Services

    Self-hosted web archiving for URLs, feeds, and bookmarks.

  • Archive-ItCapture Operators & Services

    Subscription web archiving service from the Internet Archive.

  • ArchivenowCapture Operators & Services

    Command-line tool to push resources into web archives.

  • ArchiveSparkAnalysis & Data Processing

    Spark framework for processing web archives.

  • Archives Unleashed Toolkit (AUT)Analysis & Data Processing

    Toolkit for analyzing web archives.

  • ArchiveWeb.pageReplay & Access

    Browser extension for high-fidelity web archiving.

Preservation Frameworks & Platforms

  • Archivematica

    OAIS-aligned preservation workflow system.

  • DSpace

    Repository platform for research outputs and collections.

  • DuraCloud

    Hosted preservation storage and access platform.

  • Fedora

    Modular repository platform for digital assets.

  • Islandora

    Fedora-based repository framework for cultural heritage.

  • Preservica

    Commercial digital preservation platform.

Preservation Storage & Replication

  • Archivematica Storage Service

    Storage management component for Archivematica.

  • iRODS

    Data grid for preservation storage, policy, and replication.

  • LOCKSS

    Distributed preservation network for replicated content.

Knowledge Bases

Related Lists

Fixity & Packaging

  • Bagger

    GUI tool for creating BagIt packages.

  • BagIt

    Packaging format for transferring digital content.

  • bagit-python

    Python library for creating and validating BagIt packages.

  • Fixity

    Fixity checking and monitoring tool.

Digital Forensics & Ingest

Showing a sample of 96 resources. View the full list on GitHub →