The "METR eval" benchmark indicates that AI models are doubling their ability to perform long, c..., Sonic AI
“The "METR eval" benchmark indicates that AI models are doubling their ability to perform long, complex human tasks approximately every seven months.”
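To make the arithmetic behind a seven-month doubling time concrete, here is a minimal sketch. It assumes the doubling period stays constant at exactly seven months, which is a simplification of the "approximately" in the quote. The function names and the constant are hypothetical and chosen only for illustration; they do not come from the benchmark itself.

```python
import math

# Assumption: a fixed doubling period of 7 months, taken from the quoted claim.
DOUBLING_MONTHS = 7.0


def growth_factor(months: float, doubling_months: float = DOUBLING_MONTHS) -> float:
    """Multiplicative growth after `months`, assuming a constant doubling period."""
    return 2.0 ** (months / doubling_months)


def months_to_reach(factor: float, doubling_months: float = DOUBLING_MONTHS) -> float:
    """Months needed to grow by `factor`, under the same constant-doubling assumption."""
    if factor <= 0:
        raise ValueError("factor must be positive")
    return doubling_months * math.log2(factor)


if __name__ == "__main__":
    for months in (7, 12, 14, 24):
        print(f"after {months:>2} months: x{growth_factor(months):.2f}")
    print(f"months to reach x10: {months_to_reach(10):.1f}")
```

Under this assumption, one year corresponds to roughly a 3.3x increase and a tenfold increase takes a little under two years. These are extrapolations of the stated rate, not measured results.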