“Over time, capabilities that initially required inference-time compute to elicit are distilled directly into the model.”