LangChain's focus has evolved from simple LLM chaining to orchestrating complex, stateful AI agents using its LangGraph framework, which models agents as state machines.
A key challenge in building reliable AI agents is the performance degradation of LLMs when given too many tools; a best practice is to limit an agent to approximately five tools at any given step.
Reliability is the biggest hurdle to productionizing AI applications: because LLM outputs are non-deterministic, teams need robust evaluation frameworks that go beyond simple 'vibe checks'.
Tool-calling is the most critical LLM feature for developers, but its performance is inconsistent across different models and even different hosting providers for the same open-source model.
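The state-machine framing above can be made concrete with a small, dependency-free sketch: nodes are functions that update a shared state, and a per-node router picks the next node. This mirrors the spirit of LangGraph's graph-of-nodes design but does not use its real API; all node names and the routing logic here are illustrative assumptions.

```python
# Dependency-free sketch of an agent modeled as a state machine, in the
# spirit of LangGraph (the real StateGraph API differs; node names and
# routing logic here are illustrative assumptions).

from typing import Callable, Dict, Tuple

State = Dict[str, object]  # shared state passed between nodes
END = "__end__"

def plan(state: State) -> State:
    # Decide whether a tool call is needed for this query.
    state["needs_tool"] = "search" in str(state["query"]).lower()
    return state

def call_tool(state: State) -> State:
    # Stand-in for invoking an external tool.
    state["tool_result"] = f"results for: {state['query']}"
    return state

def respond(state: State) -> State:
    state["answer"] = state.get("tool_result", "answered from model knowledge")
    return state

# Graph: node name -> (node function, router choosing the next node or END).
GRAPH: Dict[str, Tuple[Callable, Callable]] = {
    "plan": (plan, lambda s: "call_tool" if s["needs_tool"] else "respond"),
    "call_tool": (call_tool, lambda s: "respond"),
    "respond": (respond, lambda s: END),
}

def run(graph, entry: str, state: State) -> State:
    node = entry
    while node != END:
        fn, route = graph[node]
        state = fn(state)
        node = route(state)
    return state

print(run(GRAPH, "plan", {"query": "search for LangGraph docs"})["answer"])
```

Explicit edges between typed states are what make such agents easier to test and debug than a single free-running loop: every transition is inspectable.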
Concerns Raised
The non-deterministic nature of LLMs is the biggest hurdle to production reliability.
Simple agent loops perform poorly as the number of available tools increases.
Tool-calling performance is inconsistent across different LLM models and hosting providers.
Defining concrete evaluation criteria for LLM applications is extremely challenging.
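The evaluation concern can be made tangible with a tiny harness: rather than eyeballing outputs, score each test case against an explicit pass/fail check and track the aggregate pass rate. `EvalCase` and the checks below are illustrative assumptions, not the API of any real evaluation framework.

```python
# Toy evaluation harness: replaces "vibe checks" with explicit, per-case
# pass/fail criteria. EvalCase and the check functions are illustrative
# assumptions, not part of any real framework.

from dataclasses import dataclass
from typing import Callable, List

@dataclass
class EvalCase:
    prompt: str
    check: Callable[[str], bool]  # a concrete criterion, not a vibe
    label: str

def evaluate(app: Callable[[str], str], cases: List[EvalCase]) -> float:
    """Run every case through the app and return the pass rate."""
    passed = sum(1 for c in cases if c.check(app(c.prompt)))
    return passed / len(cases)

# Stand-in for the LLM application under test.
def app(prompt: str) -> str:
    return "Paris is the capital of France."

cases = [
    EvalCase("capital of France?", lambda out: "Paris" in out, "factual"),
    EvalCase("capital of France?", lambda out: len(out) < 200, "concise"),
]
print(evaluate(app, cases))
```

Deterministic checks over non-deterministic outputs are the point: the model may phrase answers differently on each run, but the criteria stay fixed and comparable across model versions.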
Opportunities Identified
Using graph-based architectures like LangGraph to build more reliable and complex agents.
Leveraging smaller, cheaper LLMs for specific tasks like routing to optimize cost and performance.
The continued and significant decline in LLM inference costs enables more complex applications.
The emergence of capable mixed-modality models (text, image, audio) opens up new application frontiers.
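The routing opportunity, combined with the roughly-five-tools guideline, suggests a two-stage design: a cheap first pass narrows a large tool registry to a handful of candidates, and only then does the expensive agent step see them. The keyword scorer below is a stand-in for a call to a small, cheap router model; the registry contents and function names are illustrative assumptions.

```python
# Two-stage routing sketch: a cheap first pass narrows a large tool registry
# down to at most MAX_TOOLS candidates before the main agent step runs.
# The keyword scorer stands in for a call to a small, cheap LLM router;
# registry contents and names are illustrative assumptions.

from typing import Dict, List

MAX_TOOLS = 5  # keep the agent's tool menu small at each step

REGISTRY: Dict[str, List[str]] = {
    # tool name -> keywords a router might associate with it
    "web_search": ["search", "find", "lookup"],
    "calculator": ["sum", "multiply", "math"],
    "calendar": ["schedule", "meeting", "date"],
    "email": ["send", "mail", "reply"],
    "file_reader": ["open", "read", "file"],
    "translator": ["translate", "language"],
    "weather": ["forecast", "rain", "temperature"],
}

def route_tools(query: str, registry=REGISTRY, k: int = MAX_TOOLS) -> List[str]:
    """Score tools by keyword overlap with the query; return the top k."""
    words = set(query.lower().split())
    scored = sorted(
        registry,
        key=lambda name: -len(words & set(registry[name])),
    )
    # Keep only tools with at least one matching keyword, capped at k.
    return [n for n in scored if words & set(registry[n])][:k]

print(route_tools("search the web and read the file"))
```

In production the scorer would be a single call to a small, inexpensive model, so the costly agent step never sees more than a handful of relevant tools, addressing both the cost and the tool-overload concerns at once.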