“Physical Intelligence uses a transformer-based architecture and leverages pre-trained vision-language models for its robotics foundation model.”