“Two days after Google's Gemini 3 Deep Think scored 45% on the ARC-AGI-2 benchmark, Poetiq released results showing a score of 54%.”