June 15, 2026

What are the top operators and VCs saying about the Chip market?

17 episodes13 podcastsMar 24, 2025 – Jun 12, 2026

The AI chip market is characterized by "unfathomable" demand driving a semiconductor super-cycle, with supply struggling to keep pace [21, 29]. While NVIDIA is the dominant market leader, its position is viewed as a defensible moat primarily due to its incumbency as the default solution [8, 16]. However, operators and analysts see significant vulnerabilities. Cerebras CEO Andrew Feldman predicts NVIDIA's market share will decline to **50-60%** within five years, citing the fundamental inefficiency of its GPU architecture for AI inference workloads [2, 26]. This inefficiency stems from a memory bandwidth bottleneck that separates compute from off-chip memory, resulting in utilization rates as low as 5-7% for certain AI tasks [2, 4, 25]. This performance gap creates a significant opening for competitors, particularly in the vast inference market where NVIDIA's CUDA software moat is considered less formidable .

In response to these architectural limitations, a new wave of competitors is emerging with fundamentally different designs rather than incremental improvements . Companies like Cerebras are pursuing a wafer-scale architecture that co-locates vast amounts of fast on-chip SRAM with compute cores, directly addressing the data movement problem that plagues GPU designs [2, 17, 20]. This trend extends to the predicted rise of ASICs, which can be tuned for specific workloads and place the model directly on the chip for greater efficiency . Customers are already diversifying, with hyperscalers developing their own chips to gain pricing leverage over NVIDIA and others striking deals with competitors like AMD [13, 15, 23]. However, there is a notable tension in this outlook; while operators like Feldman have raised over **$1 billion in pre-IPO funding** to scale new architectures , finance professor Aswath Damodaran believes it is too late for new entrants to compete in chip architecture, arguing incumbents have an insurmountable lead .

Go deeper

Search this topic across 400+ expert conversations on Sonic.

Search →

The economics of the AI hardware stack are defined by staggering capital intensity, which many believe will dictate where value accrues. With chip tape-outs costing over $30M and data centers for frontier models requiring tens of billions of dollars in chips, the barriers to entry are immense [1, 27]. This leads to a key investment thesis articulated by multiple sources: over a five-year horizon, chip and hardware providers are expected to capture more enduring value than AI model providers due to the deep technical expertise and capital required to compete [2, 3, 10, 12]. Consequently, the crucial metric for evaluating hardware is shifting from raw performance to economic efficiency, best measured in **dollars per token** [1, 22]. This economic pressure justifies the high cost for frontier labs to build specialized software teams to move away from established ecosystems like CUDA in pursuit of better cost-performance .

Systemic bottlenecks across the supply chain are the primary constraints on AI growth, though opinions differ on their longevity. The industry faces a severe shortage of AI talent, limited semiconductor fabrication capacity at foundries like TSMC, and delays in data center construction [6, 30]. Some parts of the supply chain have enormous backlogs and significant pricing power . Broadcom executive Charlie Kawwas predicts these core bottlenecks will be resolved within **two to three years** as new fabs come online . Others are less optimistic, arguing that the primary constraint exists far upstream in components and fabrication, meaning that simply designing more chips will not solve the fundamental compute shortage . Looking further ahead, some operators predict the industry's reliance on the transformer architecture will wane within 3-5 years, a shift that would profoundly alter hardware requirements and potentially re-shuffle the competitive landscape [2, 5].

What the sources say

Points of agreement

•The demand for AI chips is described as "unfathomable," with demand from end-users far outpacing the supply from hardware builders.
•Chip and hardware companies are expected to capture more long-term value than AI model providers due to higher capital intensity and barriers to entry.
•NVIDIA's GPU architecture is dominant but has fundamental inefficiencies for AI workloads, particularly related to memory bandwidth, creating openings for competitors.
•The primary constraints on AI growth are systemic bottlenecks, including semiconductor fab capacity, talent shortages, and data center construction delays.

Points of disagreement

•One view is that NVIDIA's market share will fall to 50-60% within five years, while another asserts it's too late for new entrants to compete with incumbents.
•Some experts predict current supply bottlenecks for chips, memory, and power will be resolved in 2-3 years, while others believe the core constraints are upstream at fabs like TSMC and will persist.
•The industry's reliance on the transformer architecture is expected to wane within 3-5 years, with a potential rise in ASICs that put the model directly on the chip.

Sources

Reiner Pope of MatX on accelerating AI with transformer-optimized chips (A Cheeky Pint, Feb 26, 2026)

This source emphasizes that the staggering capital cost of AI compute makes cost-performance, measured in "dollars per token," the key battleground in the chip market.

Andrew Feldman, Cerebras Co-Founder and CEO: The AI Chip Wars & The Plan to Break Nvidia's Dominance (20VC with Harry Stebbings, Mar 24, 2025)

This source argues that NVIDIA's GPU architecture is inefficient for AI, predicts its market share will fall, and posits that chip providers will capture more long-term value than model providers.

Cerebras CEO, Andrew Feldman on Why Raise $1BN and Delay the IPO & Why NVIDIA’s Worried About Growth (20VC with Harry Stebbings, Oct 6, 2025)

This source highlights the unprecedented market demand, systemic bottlenecks like talent and fab capacity, and the openings for competitors created by NVIDIA's technical limitations.

Inside Broadcom: Building the Infrastructure the World Runs On | Charlie Kawwas (A Bit Personal, Feb 12, 2026)

This source provides an optimistic outlook, predicting that major AI-related supply bottlenecks in chip capacity, memory, and power will be resolved within two to three years.

America’s Semiconductor Boom is Real (Asianometry, Oct 16, 2025)

This source describes how "unfathomable" AI demand is driving a semiconductor super-cycle and creating immense pressure on the manufacturing supply chain.

The IPO Comeback: Why Tech Giants Are Finally Going Public | All-In Liquidity IPO Panel (All-In Podcast, Jun 6, 2026)

This source explains that solving the fundamental data movement bottleneck with novel architectures, like Cerebras's wafer-scale chip, is key to competing with incumbents like NVIDIA.