“Performance on the SWE-bench benchmark for software engineering tasks has improved from 1% two and a half years ago to 76% in recent quarters.”