“Robots can be effectively trained using synthetic video data, achieving performance comparable to training on real-world video.”