Translation Quality Assurance for Multilingual LLM Evaluation
Diagnosing translated benchmarks and quantifying how translation errors affect multilingual LLM evaluation.