The AYA project by Cohere4AI was the largest data collection effort in machine learning history, ..., Sonic AI
“The AYA project by Cohere4AI was the largest data collection effort in machine learning history, involving thousands of native speakers of over 100 languages, with the resulting dataset being open-sourced.”