In the Time Horizons benchmark, there is a large variation in human baseline completion times for..., Sonic AI
“In the Time Horizons benchmark, there is a large variation in human baseline completion times for the same task, often differing by a factor of 3x even among selected experts.”
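One way to make the "factor of 3x" figure concrete is to compute, for a single task, the ratio of the slowest to the fastest human baseline time. The sketch below is only an illustration: the function name `spread_ratio` and the sample times are hypothetical, not taken from the benchmark, and the benchmark's own aggregation method is not stated in the quote. The geometric mean is included as one plausible summary for times that vary multiplicatively, which is an assumption.

```python
from statistics import geometric_mean


def spread_ratio(times_minutes):
    """Return slowest / fastest completion time for one task (hypothetical helper)."""
    if len(times_minutes) < 2:
        raise ValueError("need at least two baseline times")
    if min(times_minutes) <= 0:
        raise ValueError("completion times must be positive")
    return max(times_minutes) / min(times_minutes)


# Made-up illustration values in minutes, NOT benchmark data.
example_times = [20.0, 35.0, 60.0]

ratio = spread_ratio(example_times)      # 60 / 20 = 3.0
center = geometric_mean(example_times)   # multiplicative "typical" time

print(f"slowest/fastest ratio: {ratio:.1f}x")
print(f"geometric mean time: {center:.1f} min")
```

With a spread this large, a single baseliner's time says little about a task's difficulty, so any per-task estimate depends heavily on which baseliners were sampled and how their times are combined.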