Caswell, I., Nielsen, E., Luo, J., Cherry, C., Kovacs, G., Shemtov, H., Talukdar, P., Tewari, D., Doumbouya, M., Diane, D., Diane, B. M., Farabado, S., Ferrante, E., Guasoni, A., Keita, M., Debbarma, S., Kuzhuget, A., Anugraha, D., Shulthan Habibi, M. R., … Eng, J. (2025). SMOL: Professionally Translated Parallel Data for 115 Under-represented Languages 1103–1123. https://doi.org/10.18653/v1/2025.wmt-1.85
Ding, C., Yin, Y., Jäger, L., & Wilcox, E. (2025). Modeling Bottom-up Information Quality during Language Processing (C. Christodoulopoulos, T. Chakraborty, C. Rose, & V. Peng, Eds.; pp. 11709–11721). Association for Computational Linguistics. https://doi.org/10.18653/v1/2025.emnlp-main.592
Doneva, Simona
Doneva, S., Hubarava, H., Härvelid, P., Zürrer, W., Bugajska, J., Hild, B., Brüschweiler, D., Schneider, G., Ellendorff, T., & Ineichen, B. (2025). PreClinIE: An Annotated Corpus for Information Extraction in Preclinical Studies (D. Demner-Fushman, S. Ananiadou, M. Miwa, & J. Tsujii, Eds.; pp. 74–87). Association for Computational Linguistics. https://doi.org/10.18653/v1/2025.bionlp-1.8
Enevoldsen, K., Chung, I., Kerboua, I., Kardos, M., Mathur, A., Stap, D., Gala, J., Siblini, W., Krzemiński, D., Winata, G. I., Sturua, S., Utpala, S., Ciancone, M., Schaeffer, M., Sequeira, G., Misra, D., Dhakal, S., Rystrøm, J., Solomatin, R., … Muennighoff, N. (2025, February 19). MMTEB: Massive Multilingual Text Embedding Benchmark International Conference on Learning Representations 2025, Singapore. https://arxiv.org/abs/2502.13595
Fischer, D. P., & Volk, M. (2025). Name Consistency in LLM-based Machine Translation of Historical Texts (P. Bouillon, J. Gerlach, S. Girletti, L. Volkart, R. Rubino, R. Sennrich, A. C. Farinha, M. Gaido, J. Daems, H. Moniz, & S. Szoc, Eds.; XX; pp. 204–219). Association for Computational Linguistics. https://aclanthology.org/2025.mtsummit-1.16/