Publications

2026

Quantifying the Impact of Translation Errors on Multilingual LLM Evaluation

Klaudia-Doris Thellmann, Bernhard Stadler, et al. — ACL 2026

arXiv: forthcoming

Diagnosing Translated Benchmarks: An Automated Quality Assurance Study of the EU20 Benchmark Suite

Klaudia-Doris Thellmann, Bernhard Stadler, et al. — LREC 2026

Teuken-7B-Base & Teuken-7B-Instruct: Towards European LLMs

Mehdi Ali, Michael Fromm, Klaudia-Doris Thellmann, et al. — EACL 2025

Towards Multilingual LLM Evaluation for European Languages

Klaudia-Doris Thellmann, Bernhard Stadler, Michael Fromm, et al. — arXiv 2025

Investigating Multilingual Instruction-Tuning: Do Polyglot Models Demand for Multilingual Instructions?

Alexander Arno Weber, Klaudia-Doris Thellmann, Jan Ebert, et al. — EMNLP 2024

Tokenizer Choice For LLM Training: Negligible or Crucial?

Mehdi Ali, Michael Fromm, Klaudia-Doris Thellmann, et al. — Findings of NAACL 2024