For the Time Horizons benchmark, human completion times were empirically measured for approximate..., Sonic AI
“For the Time Horizons benchmark, human completion times were empirically measured for approximately two-thirds of the tasks, while the times for the remaining one-third were estimated.”