“The biggest hurdle to productionizing LLM-based applications is reliability, stemming from the non-deterministic nature of model outputs.”