AI Agents are a Decade-Long Trend: He consistently states that achieving truly capable, human-level agents is a 10-year endeavor, not an overnight success, characterizing the current era as the 'decade of agents.' [10, 26, 33, 38]
Strong Critique of Reinforcement Learning: He views standard RL as a highly inefficient and 'terrible' learning paradigm for complex tasks, arguing it extracts too little signal from experience. [7, 43, 60]
Intelligence Should Be Separated from Knowledge: He advocates for a research direction focused on isolating a 'cognitive core' from the vast memorized knowledge in LLMs, believing this knowledge may be holding back true reasoning capabilities. [11, 44]
Pragmatic Optimism on Open Source: He views the current 6-8 month lag of open-source models behind frontier models as a 'healthy' dynamic for the industry and speculates that distributed, open collaboration could eventually outperform centralized labs. [13, 16, 18]
The Primacy of Digital over Physical AI: He predicts progress in digital AI agents will vastly outpace robotics because manipulating bits is millions of times easier and cheaper than manipulating atoms. [15]
▶ The Decade of Agents (Feb–Mar 2026)
Karpathy consistently frames the current era not as the 'year of agents' but the 'decade of agents.' This theme encompasses his prediction of a 10-year maturation timeline [33, 38], his own shift, since December 2023, to delegating nearly all of his coding to agents [50], and his hands-on work building agents for home automation and auto-research. [47, 49]
This long-term framing suggests that the primary investment and innovation opportunities will be in building the infrastructure, tooling, and applications for agentic workflows, rather than expecting a single 'AGI' breakthrough to immediately change everything.
▶ Critique of Current AI Paradigms (Feb 2026)
Karpathy expresses significant skepticism about the foundations of current AI development. He criticizes the inefficiency of reinforcement learning ('sucking supervision through a straw') [7], the low quality of internet pre-training data ('total garbage') [5], the lack of diversity in synthetic data ('silently collapsed') [4], and the over-reliance on memorization. [11]
His focus on these fundamental flaws suggests he believes true progress is bottlenecked by methodology, not just scale, creating opportunities for startups or researchers with novel approaches to data generation, training, and model architecture.
▶ The Cognitive Core vs. Memorized Knowledge (Feb–Mar 2026)
A key research direction proposed by Karpathy involves separating an AI's learned knowledge from its core reasoning ability. He argues that the vast memorized data from pre-training may hinder true intelligence [11] and proposes isolating a 'cognitive core' that could be as small as one billion parameters. [9, 44]
This pursuit of smaller, more efficient reasoning engines over massive knowledge databases could disrupt the current compute-heavy landscape, potentially lowering barriers to entry and shifting the competitive advantage from data hoarders to those who can build the most effective 'cognitive core'.
▶ Recursive Self-Improvement and the Future of AI Development (Mar 2026)
Karpathy identifies recursive self-improvement—using LLMs to improve the next generation of LLMs—as the central goal of all frontier labs. [17] He extends this concept to his own work with an 'auto-researcher' agent that improves model hyperparameters overnight [49] and envisions a future where a distributed 'swarm of agents' could collectively improve models, potentially outpacing centralized labs. [13]
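The details of Karpathy's auto-researcher are not public; as a minimal sketch, an "overnight" loop of this kind can be reduced to propose-evaluate-keep-best. The sketch below uses random search with a synthetic stand-in for the real training run (`train_and_eval` is an assumption, not his actual setup); in practice an LLM agent would propose the next configurations instead of a random sampler.

```python
import math
import random

def train_and_eval(config):
    """Stand-in for a real training run: a synthetic loss surface
    with an optimum near lr=1e-3 and batch_size=64. Lower is better."""
    lr_penalty = abs(math.log10(config["lr"]) - math.log10(1e-3))
    bs_penalty = 0.1 * abs(math.log2(config["batch_size"]) - math.log2(64))
    return lr_penalty + bs_penalty

def auto_research(n_trials=50, seed=0):
    """Overnight loop: propose a config, evaluate it, keep the best so far."""
    rng = random.Random(seed)  # fixed seed for reproducible runs
    best_config, best_loss = None, float("inf")
    for _ in range(n_trials):
        config = {
            "lr": 10 ** rng.uniform(-5, -1),        # log-uniform learning rate
            "batch_size": 2 ** rng.randint(4, 9),   # powers of two, 16..512
        }
        loss = train_and_eval(config)
        if loss < best_loss:
            best_config, best_loss = config, loss
    return best_config, best_loss
```

The compounding-advantage point follows directly: each completed sweep feeds better configurations (or, with an LLM proposer, better proposals) into the next generation of runs, so the loop itself improves over time.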
This theme highlights that the process of building AI is itself becoming the product. Companies that can create the most effective feedback loops for AI-driven development will likely gain a compounding advantage that is difficult for competitors to overcome.