“While AI training is in the late stages for internet text, it is in the very early stages for non-text data like video, images, and audio.”