On the Terminal Bench 2 coding benchmark, which separates performance by model and agent harness,..., Sonic AI

Use with Claude or ChatGPT

On the Terminal Bench 2 coding benchmark, which separates performance by model and agent harness,..., Sonic AI