OpenAI has observed that for visual reasoning tasks, the test-time scaling slope is significantly..., Sonic AI
“OpenAI has observed that for visual reasoning tasks, the test-time scaling slope is significantly steeper when models use tools compared to when they do not.”