On the SWE-bench benchmark, AI-generated solutions that pass automated tests are merged by human ..., Sonic AI
“On the SWE-bench benchmark, AI-generated solutions that pass automated tests are merged by human maintainers at about half the rate of human-written "golden" solutions.”