Dwarkesh Podcast• Mar 13, 2026• 2:30:44Interview

Dylan Patel — The single biggest bottleneck to scaling AI compute

From Dwarkesh Podcast

Dylan Patel•CEO, Semi Analysis

Executive Summary

Hyperscalers (Amazon, Meta, Google, Microsoft) and their suppliers are investing nearly a trillion dollars in AI infrastructure, with much of the capital expenditure being for long-lead items years into the future.
A fierce compute race is underway between AI labs.
OpenAI's aggressive, multi-provider strategy has given it an advantage over Anthropic, whose conservative approach has created a compute bottleneck that now threatens its rapid revenue growth.
The entire semiconductor supply chain is facing unprecedented strain, from TSMC's wafer allocation and soaring memory prices to a future bottleneck in ASML's production of EUV lithography tools, which will ultimately cap the pace of the AI buildout.
The economics of AI compute are volatile, with high short-term rental prices for GPUs, but a predicted decline in the long run.
This dynamic is forcing AI model vendors to raise prices, which is expected to significantly increase their margins.

12 quotes

Concerns Raised

Long-term semiconductor equipment (ASML EUV) production is the ultimate bottleneck for the entire AI buildout.
Anthropic's conservative compute strategy has created a significant growth bottleneck, forcing it to seek lower-quality or higher-cost capacity.
TSMC's finite wafer capacity is creating intense competition between hyperscalers and traditional customers like Apple.
China's long-term push for a fully indigenous semiconductor supply chain could shift the geopolitical balance by 2035.

Opportunities Identified

Massive, sustained capital investment in data centers and AI hardware by hyperscalers.
NVIDIA and other AI accelerator providers have significant pricing power and long-term supply contracts.
Memory vendors (e.g., SK Hynix) are poised for significant margin expansion due to high-bandwidth memory (HBM) demand.
AI model vendors like Anthropic are expected to see significant margin improvement as they are forced to raise prices to cover high compute costs.

Key Themes

The AI Infrastructure Arms Race

Major tech companies are engaged in a historic capital expenditure cycle, projected to reach nearly a trillion dollars across the supply chain. This spending is not just for immediate needs but is a long-term strategic play, involving prepayments for data centers, power infrastructure, and semiconductor manufacturing capacity for as far out as 2029.

The sheer scale and long-term nature of this investment signal a fundamental, multi-year buildout of AI infrastructure, creating sustained demand across the entire tech and energy ecosystem and reshaping capital allocation priorities for the world's largest companies.

AI Lab Compute Scramble

Leading AI labs like Anthropic and OpenAI are in a desperate race to secure gigawatts of compute capacity to support both model training and explosive inference demand. Anthropic's conservative strategy has left it compute-constrained and scrambling for capacity, while OpenAI's more aggressive, diversified approach has secured it a more robust supply pipeline.

Access to compute has become the primary bottleneck for AI model revenue growth and innovation. A lab's ability to secure long-term, cost-effective compute contracts is a direct determinant of its competitive position and viability.

Semiconductor Supply Chain Bottlenecks

The AI boom is creating critical chokepoints throughout the semiconductor supply chain. TSMC is struggling to allocate wafer capacity among competing AI and mobile customers, memory vendors are raising prices due to HBM demand, and the ultimate long-term constraint is projected to be ASML's limited production capacity for essential EUV lithography tools.

These physical constraints, not just capital, will dictate the pace of AI advancement. Understanding these bottlenecks is crucial for forecasting compute availability, hardware costs, and the strategic positioning of nations and corporations.

The Economics of AI Compute

The market for AI compute is defined by high costs, with a gigawatt of capacity renting for approximately $10 billion per year. While the total cost of ownership for GPUs is lower than current rental prices, scarcity drives spot prices to extreme highs. This pressure is forcing AI labs to increase their own prices, which is expected to dramatically improve their currently thin gross margins.

The financial viability of the entire AI software layer depends on managing the staggering cost of compute. The interplay between hardware costs, rental prices, and model pricing will determine the profitability and accessibility of advanced AI services.

Geopolitical Tech Competition

The analysis highlights a widening compute gap between US-based AI labs and their Chinese counterparts, driven by the massive infrastructure investment in the West. While China is aggressively pursuing an indigenous semiconductor supply chain, with predictions of domestic EUV capability by 2030, it currently lacks the scale to compete with the AI buildout in the US.

The concentration of advanced AI compute and semiconductor manufacturing outside of China represents a significant geopolitical lever. The long-term race for semiconductor independence will be a defining factor in global technological leadership and economic power.

Get started free

Topics

AI Infrastructure Capital Expenditures (CapEx)Data Centers Semiconductors Supply Chain Bottlenecks NVIDIA TSMC ASML Google Anthropic OpenAI Compute Capacity EUV Lithography Geopolitics AI Economics

Processed Apr 3, 2026 yt-dlp + mlx-whisper + Gemini