“GitHub hosts 400 million public software repositories, providing a vast source of training data for AI models.”