<?xml version="1.0" encoding="utf-8" standalone="yes"?><rss version="2.0" xmlns:atom="http://www.w3.org/2005/Atom" xmlns:content="http://purl.org/rss/1.0/modules/content/"><channel><title>Blog on Klaudia-Doris Thellmann</title><link>https://klaudiath.github.io/blog/</link><description>Recent content in Blog on Klaudia-Doris Thellmann</description><generator>Hugo -- 0.155.3</generator><language>en-us</language><lastBuildDate>Fri, 15 May 2026 00:00:00 +0000</lastBuildDate><atom:link href="https://klaudiath.github.io/blog/index.xml" rel="self" type="application/rss+xml"/><item><title>ACL 2026 preview: Translation errors and multilingual LLM evaluation</title><link>https://klaudiath.github.io/blog/acl-2026-tqe-paper/</link><pubDate>Fri, 15 May 2026 00:00:00 +0000</pubDate><guid>https://klaudiath.github.io/blog/acl-2026-tqe-paper/</guid><description>&lt;p&gt;In July 2026, I will present our TQE paper at ACL. This draft is a placeholder for the final conference post.&lt;/p&gt;</description></item><item><title>LREC 2026: Diagnosing translated benchmarks</title><link>https://klaudiath.github.io/blog/lrec-2026-eu20-benchmark-qa/</link><pubDate>Fri, 15 May 2026 00:00:00 +0000</pubDate><guid>https://klaudiath.github.io/blog/lrec-2026-eu20-benchmark-qa/</guid><description>&lt;p&gt;In May 2026, I presented our work &lt;strong&gt;&amp;ldquo;Diagnosing Translated Benchmarks: An Automated Quality Assurance Study of the EU20 Benchmark Suite&amp;rdquo;&lt;/strong&gt; at LREC in Palma, Mallorca. What struck me most across the conference was a recurring theme: multilingual evaluation is becoming less about simply translating English benchmarks, and more about diagnosing whether our evaluation data is valid in the first place.&lt;/p&gt;
&lt;img src="images/lrec_poster.png" alt="Presenting the paper at LREC" class="blog-image-medium"&gt;
&lt;p&gt;This post collects the main idea of our paper, the most relevant work I saw at the conference, and a few personal takeaways for researchers working on multilingual LLM evaluation.&lt;/p&gt;</description></item></channel></rss>