The Montgomery Summit• Mar 16, 2026• 25:15Fireside ChatFireside Chat

Fireside Chat with Together.ai

From The Montgomery Summit · 2026

Vipul Ved Prakash•Co-Founder & CEO, Together.ai

Executive Summary

The demand for AI compute is exploding, creating a severe and worsening global shortage of GPUs and data center capacity that is now the primary bottleneck for industry growth.
AI-native companies are experiencing unprecedented hyper-growth (e.g., 6-10x YoY for Together.ai, Anthropic doubling revenue in 3 months), signaling a fundamental economic shift that defies traditional software valuation metrics.
The AI market has reached an inflection point where inference workloads now exceed 50% of all GPU usage, marking a transition from an R&D phase to a widespread, revenue-generating deployment of AI services.
AI model capabilities are advancing at a breakneck pace, particularly in software engineering, where performance on key benchmarks has jumped from 1% to 76% in two years, effectively industrializing the field of coding.

12 quotes

Concerns Raised

The severe and worsening shortage of GPU and data center capacity is the main bottleneck for industry growth.
The business growth of even the largest AI companies is ultimately constrained by available compute.
Public market investors may misunderstand the new economics of AI, incorrectly punishing companies for necessary infrastructure investments.

Opportunities Identified

Building infrastructure platforms like Together.ai to serve the massive demand for efficient AI inference and training.
Developing AI-native applications that can achieve hyper-growth by leveraging increasingly capable open-source models.
The emergence of new AI modalities like voice and physical AI (robotics) will create new waves of demand for compute.
Investing in the AI supply chain as the demand for inference-optimized infrastructure continues to outpace supply.

Key Themes

The Great Compute Shortage

The discussion highlights a severe and worsening global shortage of GPU clusters and data center capacity. Demand for AI inference and training now exceeds the entire supply chain's ability to deliver, forcing even major players like Microsoft to throttle sales commitments.

This compute scarcity is the primary bottleneck for the entire AI industry's growth, impacting everything from model development to application deployment and revenue generation for both hyperscalers and AI-native startups.

Unprecedented Growth and Economic Transformation

The speakers describe a period of hyper-growth for AI-native companies that defies traditional SaaS metrics. With companies like Anthropic doubling revenue in a single quarter and Together.ai growing 6-10x year-over-year, the economic scale of the AI industry is expanding at an unprecedented rate.

This explosive growth signals a fundamental economic shift, suggesting that AI-native companies will rapidly become dominant market players and that traditional valuation models are inadequate for this new generation of technology businesses.

The Shift from Training to Inference

A critical inflection point has been reached where AI inference (the use of models to generate results) now consumes over 50% of GPU workloads, surpassing training. This signifies a maturation of the market from R&D to widespread, revenue-generating deployment of AI services.

This shift validates the massive investment in AI infrastructure, as it's now being directly monetized through applications. It also changes the technical challenges, prioritizing efficiency, latency, and scalable deployment over pure model training.

The Rise of Open-Source and a Multipolar AI World

The conversation emphasizes a significant shift towards open-source models as the foundation for new AI applications. It's noted that the four most consequential recent LLMs are from Chinese developers, indicating a decentralization of AI innovation beyond a few US-based labs.

This trend empowers developers with more choice, better economics, and greater control, fostering a more competitive and diverse ecosystem while also introducing new geopolitical dimensions to the AI supply chain.

The Industrialization of Software Development

AI is fundamentally transforming software development from an "artisanal" craft into an industrialized process. With models rapidly improving on coding benchmarks (from 1% to 76% on SWE-bench), they are moving from being co-pilots to autonomous code generators.

This transformation promises massive productivity gains and changes the role of human developers to a higher level of abstraction and co-design. It also creates a massive new market for AI-powered development tools and platforms.

Get started free

Topics

AI Infrastructure GPU Shortage Compute Capacity Data Centers Together.ai Open-Source AI LLMs AI Inference AI Training Generative AI AI Economics Venture Capital Software Engineering SWE-bench Chinese AI Models

Processed Apr 19, 2026 yt-dlp + mlx-whisper + Gemini