“DeepInfra predicts that 80% of all AI compute resources will eventually be dedicated to inference rather than training.”