a16z Podcast • Mar 17, 2026 • 46:48 • Interview

Why Scale Will Not Solve AGI | Vishal Misra - The a16z Show

From a16z Podcast

Vishal Misra

Executive Summary

According to Misra, Large Language Models built on the Transformer architecture can be shown mathematically to perform precise Bayesian inference when predicting the next token, which explains their capacity for in-context learning.
Despite these capabilities, current LLMs are fundamentally limited by their architecture: they operate on correlation, not causation, and they lack plasticity because their weights are frozen after training.
The speaker strongly refutes claims of LLM consciousness, arguing they are silicon-based systems optimizing for next-token prediction, not survival, and their behavior is a reflection of training data rather than genuine understanding.
Future progress toward AGI requires moving beyond scaling current models and focusing on developing new architectures that incorporate mechanisms for causality and continuous learning (plasticity).

Concerns Raised

Current LLM architectures are fundamentally limited to correlation and cannot perform causal reasoning.
The lack of plasticity (frozen weights) prevents LLMs from retaining learning across interactions.
Simply scaling up existing models is an insufficient path to AGI.
Public and even expert discourse is prone to anthropomorphizing LLMs and speculating about consciousness without basis.

Opportunities Identified

Developing new AI architectures that explicitly incorporate mechanisms for causality.
Creating models with true plasticity that can learn continuously from experience.
Applying formal frameworks like Judea Pearl's causal hierarchy to build more robust AI.
Using the mathematical understanding of LLMs as Bayesian engines to improve their predictability and performance.

Key Themes

LLMs as Bayesian Inference Engines

The core thesis is that LLMs, particularly those using the Transformer architecture, function as sophisticated Bayesian inference engines. According to the speaker, this behavior was first observed empirically and later established mathematically, with a 'Bayesian wind tunnel' experiment showing that the model computes the precise Bayesian posterior for a given task.

The result gives a formal mathematical framework for how LLMs perform in-context learning and update their 'beliefs' from new information in a prompt, moving the explanation from magic to mathematics.
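
To make "computing the Bayesian posterior" concrete, here is a minimal sketch of the kind of reference calculation a model's next-token probabilities could be compared against. It is a toy illustration, not the experiment described in the episode: the binary token stream, the Beta(1, 1) prior, and the function name `posterior_predictive` are all assumptions chosen for simplicity.

```python
from fractions import Fraction


def posterior_predictive(tokens, alpha=1, beta=1):
    """Exact P(next token == 1 | tokens) for a Beta-Bernoulli model.

    `tokens` is a list of 0/1 observations seen so far in the "prompt".
    `alpha` and `beta` are the (assumed) Beta prior pseudo-counts.
    """
    ones = sum(tokens)
    zeros = len(tokens) - ones
    return Fraction(alpha + ones, alpha + beta + ones + zeros)


# Each extra token in the context shifts the prediction, which is the
# Bayesian reading of in-context learning: no weights change, only the
# evidence being conditioned on.
context = [1, 1, 0, 1]
for n in range(len(context) + 1):
    print(n, posterior_predictive(context[:n]))
```

With an empty context the prediction is the prior mean of 1/2, and after the four tokens above it is 2/3. A model that behaves as an exact Bayesian predictor on this task would be expected to match these values at every prefix.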

The Architectural Limits of Current AI

The discussion highlights that an AI's core capabilities are a function of its architecture. While Transformers excel at Bayesian updating, they are fundamentally limited to finding correlations in data and cannot perform causal reasoning. Furthermore, their 'frozen' weights post-training prevent true plasticity and lifelong learning.

The implication is that simply scaling up current models (more data, more parameters) will not overcome these inherent limitations or lead to AGI; new architectural paradigms are necessary.
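
The frozen-weights point can be illustrated with a small sketch. It is an assumption-laden toy, not a description of any real LLM: the two classes, the Beta-style pseudo-counts standing in for "weights", and the `end_session` method are hypothetical names invented for this example.

```python
from fractions import Fraction


class FrozenPredictor:
    """Adapts within a context, but its stored parameters never change."""

    def __init__(self, alpha=1, beta=1):
        self._prior = (alpha, beta)  # stands in for weights fixed after training

    def predict(self, context):
        """P(next token == 1 | context) using the stored prior plus the context."""
        a, b = self._prior
        ones = sum(context)
        return Fraction(a + ones, a + b + len(context))


class PlasticPredictor(FrozenPredictor):
    """Same predictor, but folds each finished session into its parameters."""

    def end_session(self, context):
        a, b = self._prior
        ones = sum(context)
        self._prior = (a + ones, b + len(context) - ones)


session = [1, 1, 1, 1]
frozen, plastic = FrozenPredictor(), PlasticPredictor()

# Both adapt identically while the session is still in the context window.
print(frozen.predict(session), plastic.predict(session))

# Only the plastic predictor carries the lesson into a fresh, empty context.
plastic.end_session(session)
print(frozen.predict([]), plastic.predict([]))
```

Within the session both predictors give 5/6. In a new session the frozen predictor is back at 1/2, while the plastic one starts from 5/6, which is the retention across interactions that the summary says current architectures lack.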

Causality and Plasticity as the Next Frontiers

The speaker identifies two key areas for future AI research: causality and plasticity. To advance, AI needs to move from correlation to causation, building internal models to simulate outcomes, potentially using frameworks like Judea Pearl's. It also needs plasticity to enable continuous learning from new experiences, similar to a biological brain.

This pinpoints the specific, fundamental research directions required to build more capable and generalizable AI systems that can reason about the world rather than just mimicking patterns in text.
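
The gap between correlation and causation can be shown with a small worked example in the spirit of Judea Pearl's framework. This is a sketch under stated assumptions, not material from the episode: the three binary variables, the probability tables, and the function names are invented for illustration. A hidden confounder Z drives both X and Y, while X has no effect on Y at all.

```python
# Assumed toy structural model: Z -> X and Z -> Y, with no arrow X -> Y.
P_Z = {0: 0.5, 1: 0.5}            # P(Z = z)
P_X1_GIVEN_Z = {0: 0.1, 1: 0.9}   # P(X = 1 | Z = z)
P_Y1_GIVEN_Z = {0: 0.2, 1: 0.8}   # P(Y = 1 | Z = z), independent of X


def p_x_given_z(x, z):
    """P(X = x | Z = z) for binary x."""
    return P_X1_GIVEN_Z[z] if x == 1 else 1 - P_X1_GIVEN_Z[z]


def p_y1_given_x_z(x, z):
    """P(Y = 1 | X = x, Z = z); x is ignored because X does not cause Y."""
    return P_Y1_GIVEN_Z[z]


def observational(x):
    """P(Y = 1 | X = x): what pure pattern-matching on data would report."""
    joint = {z: P_Z[z] * p_x_given_z(x, z) for z in P_Z}
    total = sum(joint.values())
    return sum(p_y1_given_x_z(x, z) * joint[z] / total for z in P_Z)


def interventional(x):
    """P(Y = 1 | do(X = x)) via back-door adjustment over Z."""
    return sum(p_y1_given_x_z(x, z) * P_Z[z] for z in P_Z)


for x in (0, 1):
    print(x, round(observational(x), 2), round(interventional(x), 2))
```

Observationally, Y looks strongly tied to X (about 0.26 when X = 0 versus 0.74 when X = 1), yet forcing X to either value leaves P(Y = 1) at 0.5. A system that only learns the observational association would predict that changing X changes Y, which is the failure mode the speaker attributes to correlation-only models.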

Debunking AI Consciousness

The speaker directly challenges the notion that LLMs could be conscious, dismissing it as anthropomorphism. He emphasizes that models like Anthropic's Claude are matrix multiplication systems driven by the objective of next-token prediction, not a biological imperative for survival, and lack any inner monologue or subjective experience.

This provides a grounding counter-narrative to the hype and speculation surrounding AI consciousness, focusing the conversation on the technical realities of the systems' design and objective functions.

Topics

Large Language Models (LLMs), Transformer Architecture, Bayesian Inference, In-Context Learning, Causality vs Correlation, Artificial General Intelligence (AGI), AI Architecture, Model Limitations, Plasticity in AI, Judea Pearl, Do Calculus, Retrieval-Augmented Generation (RAG), Next-Token Prediction, Mamba Architecture, AI Consciousness

Processed Apr 6, 2026 yt-dlp + mlx-whisper + Gemini