“The ARC-AGI 3 benchmark will not provide any explicit instructions, requiring the test-taker to infer the goal by interacting with the environment.”