I am a PhD candidate working in the Impresso II – Media Monitoring of the Past - Beyond Borders project under the supervision of Simon Clematide and Rico Sennrich, with external supervision by Mrinmaya Sachan. My research focuses on multilingual embedding models and the development of explainable cross-lingual semantic search in historical texts.
Education
- September 2023 - Now: PhD candidate at the Department of Computational Linguistics, University of Zurich
- 2020 - 2023: MSc in Computing and Economics (90) and Data Science (30) at the University of Zurich
- 2017 - 2020: BSc in Artificial Intelligence and Computer Science (180) at the University of Sheffield
Publications
- Andrianos Michail, Simon Clematide, and Rico Sennrich. 2025. Examining Multilingual Embedding Models Cross-Lingually Through LLM-Generated Adversarial Examples. To appear in EMNLP 2025 Findings.
- Juri Opitz, Lucas Möller, Andrianos Michail, Sebastian Padó, and Simon Clematide. 2025. Interpretable Text Embeddings and Text Similarity Explanation: A Survey. To appear in EMNLP 2025 main.
- Hongji Li, Andrianos Michail, Reto Gubelmann, Simon Clematide, and Juri Opitz. 2025. Sentence Smith: Controllable Edits for Evaluating Text Embeddings. To appear in EMNLP 2025 main.
- Andrianos Michail, Juri Opitz, Yining Wang, Robin Meister, Rico Sennrich, and Simon Clematide. 2025. Cheap Character Noise for OCR-Robust Multilingual Embeddings. In Findings of the Association for Computational Linguistics: ACL 2025, pages 11705–11716, Vienna, Austria. Association for Computational Linguistics.
- Andrianos Michail, Corina Julia Raclé, Juri Opitz, and Simon Clematide. 2025. Adapting Multilingual Embedding Models to Historical Luxembourgish. To appear in Proceedings of the 9th Workshop on Computational Linguistics for Cultural Heritage, Social Sciences, Humanities and Literature (LaTeCH 2025). Association for Computational Linguistics.
- Andrianos Michail, Simon Clematide, and Juri Opitz. 2025. PARAPHRASUS: A Comprehensive Benchmark for Evaluating Paraphrase Detection Models. In Proceedings of the 31st International Conference on Computational Linguistics (COLING 2025), pages 8749–8762, Abu Dhabi, UAE. Association for Computational Linguistics.
- Andrianos Michail*, Pascal Severin Andermatt*, and Tobias Fankhauser. 2024. SimpleText Best of Labs in CLEF-2023: Scientific Text Simplification Using Multi-prompt Minimum Bayes Risk Decoding. In Experimental IR Meets Multilinguality, Multimodality, and Interaction. Proceedings of the International Conference of the Cross-Language Evaluation Forum for European Languages (CLEF 2024), pages 227–253, Grenoble, France, September 9–12, 2024. CLEF Association. Cham: Springer Nature Switzerland.
- Ahmet Yavuz Uluslu*, Andrianos Michail*, and Simon Clematide. 2024. Utilizing large language models to identify evidence of suicidality risk through analysis of emotionally charged posts. In Proceedings of the 9th Workshop on Computational Linguistics and Clinical Psychology (CLPsych 2024), pages 264–269, St. Julians, Malta. Association for Computational Linguistics.
- Andrianos Michail, Stefanos Konstantinou, and Simon Clematide. 2023. UZH_CLyp at SemEval-2023 Task 9: Head-First Fine-Tuning and ChatGPT Data Generation for Cross-Lingual Learning in Tweet Intimacy Prediction. In Proceedings of the 17th International Workshop on Semantic Evaluation (SemEval-2023), pages 1021–1029, Toronto, Canada. Association for Computational Linguistics.
Teaching
| HS 2025 | |
| FS 2025 | |
| HS 2024 | |
| FS 2024 | |
| HS 2023 | TA for Machine Learning for Natural Language Processing I |
| FS 2023 | TA for Machine Learning for Natural Language Processing II |
| HS 2022 | TA for Machine Learning for Natural Language Processing I |
| FS 2022 | TA for Machine Learning for Natural Language Processing II |
| HS 2021 | TA for Machine Learning for Natural Language Processing I |
| HS 2021 | |
| FS 2021 | TA for Informatics II |