“In a comparative test, Opus 4.7 failed to generate a functional version of the game F-Zero, producing a 'janky' result with no AI competitors.”