awesome-digitization
github.com/ndlrf-rnd/awesome-digitization ↗A list of list of resources related to heritage digitization priograms tech
2
GitHub Stars
58
Curated Resources
10
Categories
18 hours ago
Last Refreshed
Home materialsPipelinesSegmentation and scene readingOCRNoticeable digitization projectsDatasetsImages similarityText content NER/Topic/Subject/ContextRelated areas overview materialsRelated tech at circuits level
Use this list with your AI agent
Add the Context Awesome MCP server to Claude, Cursor, or any MCP client, then ask:
"Show me segmentation and scene reading resources from awesome-digitization"
Installation instructions →What's inside
Segmentation and scene reading
- Aletheia
- Chandna, Swati. Published on 01/09/2019 Automatic Layout Analysis and Visual Exploration of Multidimensional Datasets with Applications in the Digital Humanities
- Full whitepapers list
- Graph-based tables segmentation
- https://www.groundai.com/project/combining-visual-and-textual-features-for-semantic-segmentation-of-historical-newspapers/1
- ICDAR 2019
OCR
Home materials
- Conceptual high-level schema of human operators feedback loop and math models lifecycles for generic digitization pipelines
- Комплекс программных решений и рабочий процесс по оцифровке изданийю Февраль 2020, Российская Государственная Библиотека. (ЛИР, ОИЗ)
Russian Digital Library digitization concept slides (on Russian).
Text content NER/Topic/Subject/Context
Pipelines
- dhSegment: A generic deep-learning approach for document segmentation
pipeline description.
- Europeana.eu
EMPOWERING DIGITAL CHANGE.
- Europeana strategy 2020-2025 - EMPOWERING DIGITAL CHANGE.
EMPOWERING DIGITAL CHANGE.
- GitHub src
- Issue 13: OCR. EuropeanaTech Insight is a multimedia publication about R&D developments by the EuropeanaTech Community. Gregory Markus. Posted on Wednesday July 31, 2019, Europeana PRO - Tech
About PIVAJ Europeana newspapers article extraction pipeline.
- Marcus Bitzl and Ralf Eichinger, DB/MDZ/IWA, 04.04.2019. Software-Development at the MDZ
Related areas overview materials
Datasets
Noticeable digitization projects
Showing a sample of 58 resources. View the full list on GitHub →