To control for environmental variance in evaluations, Physical Intelligence runs its new models a..., Sonic AI
“To control for environmental variance in evaluations, Physical Intelligence runs its new models against a baseline model concurrently, using the same human operator, and focuses on relative performance rather than absolute scores over time.”