- ongoing: DPhil in Linguistics, University of Oxford
- 2019: MSt in Slavonic Studies, University of Oxford
- 2015-2017: Graduate Program in Linguistics (70% of MA credits), Università di Pavia
- 2015: BA in Russian and English, Università di Pavia
- 2020: Guest Researcher, Department of Linguistics and Scandinavian Studies, Universitetet i Oslo (Norway)
- Feb 2023-: Research Software Engineer, Digital Scholarship @ Oxford (DiSc, University of Oxford)
- Jan 2022-: Research Associate in Corpus-Based Digital Humanities, Living with Machines, The Alan Turing Institute (London)
- 2020-2021: Research Assistant, International Multimodal Communication Centre (IMCC), Oxford School of Global and Area Studies (University of Oxford)
- 2020: Research Assistant, ReadOxford Research Group, Department of Experimental Psychology (University of Oxford)
- 2018-2020: IT/AV Support Assistant, Somerville College (University of Oxford)
- 2014-2019: Supervisor & Assistant Manager, Le Pain Quotidien (London & Oxford)
- 2015: Trainee Assistant Curator/Cataloguer, British Library (London)
Other Scientific and Academic Activities
- 2021-now: Fellow at RROx (UK Reproducibility Network)
- 2022: Organizer of the Open Humanities Seminar Series (OHSS)
- 2020-2023: Editorial Assistant for the Journal of Open Humanities Data (JOHD)
- 2019-now: Linguistic annotation for the Tromsø Old Russian and Old Church Slavonic Treebank (TOROT) project
- Computer languages
  - Solid Python, R and Bash.
  - Intermediate Ruby.
- Operating systems
  - [In order of preference] macOS, Linux (Ubuntu, ChromeOS), Windows.
- Cloud skills
  - Microsoft Azure.
- Data handling and NLP
  - Extensive experience with Gensim, spaCy, NLTK, scikit-learn, pandas, NumPy, BeautifulSoup.
- Web Development
  - Solid HTML/CSS, XML.
- Version Control
  - Solid Git knowledge.
  - GitFlow and trunk-based systems (CI).
- Data Visualisation
  - [Python] Extensive experience with Matplotlib, Seaborn and Plotly (& Plotly Dash for web app deployment).
  - [R] Base R plotting, ggplot2.
- Italian (native)
- English (C2)
- Russian (C1)
- Serbian, Croatian and Bosnian (reading C1, speaking and listening B2)
- Spanish (DELE B2)
- German (Goethe-Zertifikat B1)
- Norwegian (Bokmål) (A2-B1)
- Good reading knowledge of French and Bulgarian.
- Basic reading skills in Korean and Te Reo Māori.
- Working knowledge of Ancient Greek, Latin, Hittite, Old Church Slavonic.
- Software sustainability. Collaborative software development (GitFlow). Corpus building and linguistic annotation. Linguistic data analysis. Development of digital and computational tools for historical languages. Teaching (Linguistics and Digital Humanities). Copyediting. TeX.
Machines in the media: semantic change in the lexical field of mechanization in 19th-century British newspapers
Deep Impact: A Study on the Impact of Data Papers and Datasets in the Humanities and Social Sciences
The Language of Mechanisation (Circle-U prize)
Plenary talk at Conference on interdisciplinary and transdisciplinary research for sustainable development, UCLouvain, Louvain-la-Neuve (Belgium)
Machines in the media: semantic change in the lexicon of mechanization in 19th-century British newspapers.
Conference talk at The 2nd International Workshop on Natural Language Processing for Digital Humanities (NLP4DH), Taipei, Taiwan
Evaluating Language Models for Ancient Greek: Design, Challenges, and Future Directions.
Conference talk at International Conference on Historical Linguistics 2022 (ICHL25), University of Oxford (UK)
Historic machines from ‘prams’ to ‘Parliament’: new avenues for collaborative linguistic research.
Conference talk at DH Benelux 2022: RE-MIX. Creation and alteration in DH, University of Luxembourg
Open data and data publishing in the humanities and social sciences.
Discussion group at Humanities and Data Science Discussion Group, The Alan Turing Institute (London, UK)
Le Journal of Open Humanities Data (JOHD): enjeux et défis dans la publication de data papers pour les sciences humaines.
Round table at DHNord2021 "Le data paper: une nouvelle forme de publication scientifique en SHS", held online
Tackling lack of linguistic data with HTR: A specialized model for the transcription of Serbian Church Slavonic manuscripts.
Conference talk at El’ Manuscript Conference 2021, Freiburg, DE (held online)
Same question, different annotation depths: early Slavonic dative absolute in deeply versus shallowly annotated treebanks.
Conference talk at Grammar & Corpora International Conference, Kraków (Poland)
Exploiting Cross-Dialectal Gold Syntax for Low-Resource Historical Languages: Towards a Generic Parser for Pre-Modern Slavic
Conference talk at Computational Humanities Research Workshop, Amsterdam (The Netherlands)
Participial free adjuncts in early Slavic treebanks: The discourse properties of the dative absolute.
Conference talk at Slavic Linguistics Society (SLS) Conference, Bloomington, IN, USA (held virtually)
Challenges in historical corpus analyses of discourse relations: a case study in Old Church Slavonic.
Conference talk at Corpora & Discourse International Conference, University of Sussex (UK)