2026

Quantifying the Impact of Translation Errors on Multilingual LLM Evaluation
Klaudia-Doris Thellmann, Bernhard Stadler, et al. ACL 2026
Diagnosing Translated Benchmarks: An Automated Quality Assurance Study of the EU20 Benchmark Suite
Klaudia-Doris Thellmann, Bernhard Stadler, et al. LREC 2026

2025

Teuken-7B-Base & Teuken-7B-Instruct: Towards European LLMs
Mehdi Ali, Michael Fromm, Klaudia-Doris Thellmann, et al. EACL 2025
Towards Multilingual LLM Evaluation for European Languages
Klaudia-Doris Thellmann, Bernhard Stadler, Michael Fromm, et al. arXiv 2025

2024

Investigating Multilingual Instruction-Tuning: Do Polyglot Models Demand for Multilingual Instructions?
Alexander Arno Weber, Klaudia-Doris Thellmann, Jan Ebert, et al. EMNLP 2024
Tokenizer Choice For LLM Training: Negligible or Crucial?
Mehdi Ali, Michael Fromm, Klaudia-Doris Thellmann, et al. Findings of NAACL 2024