Edwin Chen claims that some top-ranked models on the LMSYS leaderboard are actually worse in many..., Sonic AI
“Edwin Chen claims that some top-ranked models on the LMSYS leaderboard are actually worse in many ways than they were six months ago due to being over-optimized for the flawed benchmark.”