“Capabilities previously requiring test-time compute can now be performed by models in a single pass due to distillation.”