Evaluating Large Language Models for Hausa and Fongbe Machine Translation: Benchmarks, Failures, and Metric Reliability


This is a companion discussion topic for the original entry at https://arxiv.org/abs/2606.22269