TD
The Dwarkesh Patel Podcast
Dwarkesh Patel
Summary
The Dwarkesh Patel Podcast is hosted by Dwarkesh Patel covering Continual Learning, Reinforcement Learning (RL), Sample Efficiency, and AGI.
1episodes
14total claims
12topics covered
1 episodes
What does the next training paradigm look like?
›Jun 26, 2026
The current AI training paradigm, Reinforcement Learning from Verifiable Reward (RLVR), is limited because it struggles with complex, real-world domains that cannot be easily simulated or 'ground o...
Continual LearningReinforcement Learning (RL)Sample EfficiencyAGI+11 more