June 17, 2026
What are top managers and allocators saying about semiconductors, AI compute buildout, and how is positioning shifting?
Managers and allocators describe the current AI compute buildout as an investment "super cycle" where demand far outstrips the available supply of semiconductors, data centers, and electricity [1, 20]. The scale of this cycle is projected to be an order of magnitude larger than the cloud era, with some analysts estimating **$10 trillion in total capital expenditure** [9, 11]. This spending has propelled the semiconductor industry to a $1 trillion annual revenue run rate years ahead of schedule and is seen as the primary driver of current market and economic growth [9, 25]. While some compare the boom's magnitude to the dot-com era, they note it is more fundamentally sound due to the staggering profitability of the leading infrastructure companies . However, others caution that the capex is more widespread than in the dot-com period, suggesting a future correction could have a broader macroeconomic impact . The primary near-term risk cited is a potential slowdown in enterprise adoption if companies fail to realize a clear and timely return on their massive investments .
The primary limiting factor on AI's growth is a series of cascading supply chain bottlenecks that are expected to persist for years [2, 13, 15]. While initial constraints centered on GPUs, where NVIDIA maintains over **98% market share** for non-Google AI workloads , the shortages have broadened significantly. Severe constraints are now reported in high-bandwidth memory (HBM), advanced packaging (CoWOS), and other critical components, with some supply commitments extending years into the future [2, 14, 18, 19]. Some industry executives and analysts believe this supply-demand mismatch will last for at least a decade [3, 7]. More recently, the primary bottleneck is shifting from chip manufacturing to physical infrastructure, specifically **electrical power and data center availability**, which now constrains even the largest hyperscalers [14, 17, 21]. This has led to a decommoditization of the hardware industry, creating significant pricing power for key suppliers .
Go deeper
Search this topic across 400+ expert conversations on Sonic.
Demand drivers are evolving from cloud-based experimentation to on-premise enterprise production, motivated by data security and sovereignty needs [3, 7, 19]. A more profound shift is underway from generative AI for content creation to "Agentic AI" for autonomous task execution [3, 19]. This paradigm of persistent, agent-based computing is expected to dramatically increase the computational footprint of every user and is driving a new architectural debate . Some allocators predict this will cause a fundamental flip in server design from being GPU-heavy to **CPU-heavy**, potentially shifting the balance of power among semiconductor firms [14, 29]. This new architecture, combined with more advanced "reasoning" models, is also expected to drive a fivefold increase in the amount of memory required per user [22, 26, 28].
In response to these dynamics, investor positioning is shifting. Value is currently accruing to the infrastructure and semiconductor layers, a reversal from the cloud era where software captured the most value . This has driven the semiconductor sector's weight in the S&P 500 from 3% to **17%** over the last decade [5, 10]. Capital allocation frameworks are moving beyond a singular focus on NVIDIA to a broader "sellers of shortage vs. buyers of shortage" thesis . This view favors component suppliers like memory and power companies, which are seen as having expanding profitability, over hyperscalers who face margin compression from their massive capex . The investment mantra is evolving from "follow the GPU" to "follow the gigawatts," identifying electrical power as the next key area for investment . While momentum remains strong, the rally's narrowness is a noted concern .
What the sources say
Points of agreement
- •Demand for AI compute and related hardware far exceeds the available supply, creating a multi-year super cycle and an era of scarcity.
- •The investment boom is broadening beyond GPU leaders to benefit the entire semiconductor ecosystem, including memory chip makers and other companies like Intel.
- •Key supply chain bottlenecks are concentrated in advanced components, particularly high-bandwidth memory (HBM) and advanced semiconductor packaging.
- •Enterprises are increasingly shifting AI workloads from cloud-based testing to on-premise production environments to ensure data security and sovereignty.
Points of disagreement
- •While many identify memory and chips as the primary bottleneck, some argue the main constraint is shifting to electrical power and physical data center availability.
- •Some sources see a fundamentally sound boom driven by profitable giants, while others warn of a bubble with broader macroeconomic risks than the dot-com era.
- •One view is that value is accruing to the entire infrastructure layer, while a more nuanced take is that 'sellers of shortage' (e.g., memory, power) are capturing value at the expense of 'buyers of shortage' (hyperscalers).
- •The current GPU-dominant server architecture may be upended by the rise of 'agentic AI,' which could cause a flip to CPU-heavy configurations to handle persistent, task-oriented workloads.
Sources
Inside Coatue's AI Public Market Update With CIO Jaimin Rangwalla
This source introduces the investment thesis of shifting focus from GPUs to gigawatts, identifying electrical power as the main bottleneck and differentiating between profitable 'sellers of shortage' and margin-compressed 'buyers of shortage'.
Nvidia CEO Jensen Huang & Dell CEO Michael Dell on Agentic AI, Memory Demand and China |...
This source details the shift from cloud testing to on-premise enterprise AI and introduces 'agentic AI' as the next paradigm driving a decade-long infrastructure buildout for both GPUs and CPUs.
The Architects of Value: Mark Edelstone and Colin Stewart on the Economics of Silicon Valley
This source quantifies the AI investment cycle as an order of magnitude larger than the cloud era, with value currently accruing to the semiconductor and infrastructure layers rather than software.
Aswath Damodaran: The AI Boom Is Headed For A Reckoning
This source provides a cautionary view, comparing the widespread AI capital expenditure to the dot-com era and suggesting a future correction could have a more severe macroeconomic impact.
AI Semiconductor Landscape feat. Dylan Patel | BG2 w/ Bill Gurley & Brad Gerstner
This source analyzes NVIDIA's market dominance and strategy while arguing that the primary bottleneck for AI expansion is shifting from chip supply to physical infrastructure like power and data centers.
The Supply and Demand of AI Tokens | Dylan Patel Interview
This source explains how insatiable AI compute demand is creating severe shortages in critical hardware like high-bandwidth memory, making physical supply constraints the primary limiting factor on AI's growth.
Related questions
As the primary bottleneck shifts from chips to power, which companies in the energy and data center infrastructure sectors are best positioned to become the new 'sellers of shortage'?
→What are the key metrics enterprises are using to measure ROI on on-premise AI investments, and what is the risk of a capex slowdown if these returns are not realized quickly?
→What are the specific technical and economic drivers behind the predicted architectural shift to CPU-dominant servers for 'agentic AI,' and what is the likely timeline for this transition?
→How are geopolitical factors, particularly US-China tech relations and Taiwan's role in manufacturing, impacting long-term capital allocation for semiconductor fabrication and supply chain diversification?
→Ask your own research questions
Search and synthesize across 400+ expert conversations in real time.
Try: “What are top managers and allocators saying about semiconductors, AI compute buildout, and how is positioning shifting?”
Search this on Sonic →