Karpathy argues against the "year of agents" hype, positing that solving fundamental challenges like continual learning, multimodality, and cognitive deficits will require a decade of research and engineering. He views the path to capable agents as a long, difficult, but ultimately tractable problem.
Karpathy distinguishes between AI trained on human internet data ("ghosts") and intelligence shaped by evolution ("animals"). He argues that because the optimization processes are fundamentally different, direct analogies to human or animal brains are often misleading and that we are creating a new form of digital intelligence.
Karpathy delivers a strong critique of reinforcement learning (RL), calling it "terrible" and inefficient for complex tasks. He uses the analogy of "sucking supervision through a straw" to describe how RL inefficiently applies a single final reward across a long sequence of actions, a process no human would use for learning.
A central theme is the need to separate an AI's reasoning ability (the "cognitive core") from its memorized knowledge. Karpathy suggests that LLMs' powerful memorization is a double-edged sword, often distracting them from generalizable problem-solving, and proposes research to create smaller, knowledge-agnostic reasoning engines.
Drawing from his experience building NanoChat, Karpathy explains that current AI coding tools are excellent for autocomplete and boilerplate tasks but fail at novel, architecturally complex, or intellectually intense programming. They lack the context and understanding to contribute to frontier research code.
Karpathy discusses the challenge of "model collapse," where training models on their own synthetic outputs leads to a loss of diversity and entropy. The generated data, while looking good on a case-by-case basis, occupies a tiny manifold of the true data distribution, leading to degraded performance over time.
Keep pulling the thread on Andrej Karpathy.