π Research Scientist
π Based in France
π emanuelaboros.github.io
I'm passionate about natural language processing, historical document analysis, and building robust tools for multilingual, noisy, and low-resource data. My work lies at the intersection of machine learning and digital humanities β with a focus on named entity recognition, entity linking, and event extraction in complex, historical datasets.
- Named Entity Recognition (NER), Entity Linking (EL) & Event Extraction (EE)
- Historical data and ephemera
- Multilingual and low-resource NLP
- Benchmarking & model evaluation
- OCR, HTR
- post-correction & text normalization
| Role | Organization |
|---|---|
| π§βπΌ Owner | LRU-classrooms, dh-epfl-students, dhlab-epfl, hipe-eval |
| π©βπ¬ Member & Collaborator | NewsEye, EMBEDDIA, swiss-ai, impresso |
| π€ Outside Collaborator | paris-saclay-cds, ramp-data, ramp-kits, C2DH, epfLLM, eth-easl |
π« Reach me via emanuelaboros.github.io
π¬ Iβm always open to collaboration on open-source NLP, OCR, and digital heritage projects.