Meta reportedly gamed the LM Arena benchmark for Llama 4 by submitting multiple specialized varia..., Sonic AI
“Meta reportedly gamed the LM Arena benchmark for Llama 4 by submitting multiple specialized variants to maximize its score, which did not reflect its general real-world competitiveness.”