Interdisciplinary ML‑powered platform for exploring historical periodical media.

📚 Corpus: Aggregates an unprecedented multilingual archive of newspapers and radio across time and borders.
🎯 Vision: Enables a semantic-enriched workflow for representation, exploration, and historical research across modalities like print and audio.
💡 Outputs:
- Web App & Datalab platforms for exploratory analysis, search and programmatic access
- NLP resources: Language identificatino, OCR quality assessment, Named Entity Recognition, Named Entity Linking, topic models
- Historical insights under the theme of media influences.
🧑‍🤝‍🧑 Hugging Face Organization hosts multilingual NER, NEL, OCR‑quality assessment models, and Spaces for named entity processing