Interdisciplinary ML‑powered platform for exploring historical periodical media.
- 📚 Corpus: Aggregates an unprecedented multilingual archive of newspapers and radio across time and borders.
- 🎯 Vision: Enables a semantic-enriched workflow for representation, exploration, and historical research across modalities like print and audio.
- 💡 Outputs:
- Web App & Datalab platforms for exploratory analysis, search and programmatic access
- NLP resources: Language identificatino, OCR quality assessment, Named Entity Recognition, Named Entity Linking, topic models
- Historical insights under the theme of media influences.
- 🧑🤝🧑 Hugging Face Organization hosts multilingual NER, NEL, OCR‑quality assessment models, and Spaces for named entity processing