Keep pulling the thread on Tuhin Srivastava.
The discussion highlights a severe, ongoing shortage of AI compute, particularly high-end GPUs. This scarcity makes owning or securing long-term compute capacity the most critical strategic asset for AI companies, influencing everything from financing strategies to competitive positioning.
While frontier models from OpenAI and Anthropic set the capability bar, the majority of production workloads are shifting to custom models. Companies are using post-training and reinforcement learning on open-source foundations to create specialized models that are more performant and cost-effective for specific tasks.
The speaker posits that AI inference is the ultimate, enduring market, as even the advent of AGI would simply create more demand for inference. The inference software layer is shown to be incredibly sticky, with high net dollar retention and low churn, as it becomes deeply embedded in customer products.
Despite the hype, the vast majority of the potential enterprise market for AI has not yet come online. The speaker estimates 99% of the market, measured by inference count, is still ahead, suggesting the current boom is just the beginning of a much larger wave of adoption.
True defensibility for AI application companies lies not in having a slightly better model, but in deep integration into user workflows and access to unique user feedback signals. Companies like Abridge in healthcare exemplify this by building a moat around the data and interactions that are difficult for large model providers to replicate.