“The overwhelming majority of the training data Cohere currently generates for its new models is synthetic.”