Unsupervised Learning Notify me• Mar 25, 2025• 45:00Interview

Inside OpenAI's New Agent Development Tools

From Unsupervised Learning

Nikunj Handa & Steve Coffey(guest)

Get the full transcript next time Unsupervised Learning releases an episode

Summary, key quotes, top claims, and the searchable transcript - emailed automatically. No card needed.

Executive Summary

Continue your research

Keep pulling the thread on Nikunj Handa & Steve Coffey.

The Evolution of AI Agents Simplifying the Developer Experience

12 quotes

Concerns Raised

The high complexity of previous APIs (like Assistants API) created a barrier to entry for developers.
AI has not yet delivered a step-change in the speed of scientific research, an early hope for the technology.
The process of orchestrating models, prompt engineering, and maintaining evaluation sets is still extremely difficult for application builders.

Opportunities Identified

Building applications that better leverage the underutilized capabilities of current foundation models through sophisticated orchestration.
Developing domain-specific agents in verticals like medicine, climate tech, and travel.
Extending the runtime of AI agents from minutes to hours or days is expected to unlock significant new capabilities.
Creating third-party AI infrastructure companies that provide specialized environments and tools for agentic workflows.

Key Themes

Research Findings12

Unify GTM used OpenAI's Computer Use model during its alpha phase to automate research for climate tech startups by having an agent open Google Maps, activate Street View, and visually check for the expansion of electric vehicle charging networks.

An OpenAI representative believes AI agents are simultaneously overhyped in general discussion but underhyped in terms of the value created by companies that successfully implement them for complex tasks like Deep Research or full workflow automation.

An OpenAI representative predicts that the most important skill for AI application builders over the next one to two years will be orchestration: the ability to rapidly integrate tools, data, and multiple LLM calls, and then quickly evaluate and improve the system.

An OpenAI representative notes that AI has not yet produced a step-change in the speed of scientific research, which was a primary hope for the O-series models.

An OpenAI representative predicts that the rate of AI model progress will be greater in the next year than it was in the last year.

In 2025, AI agent products like OpenAI's Deep Research have shifted to a multi-step "chain of thought" process, where the model can access the web, reconsider its approach, and call tools multiple times in parallel during its reasoning process.

OpenAI released the Agents SDK in response to developers already creating multi-agent systems, or "swarms," to solve specific business problems.

Extending the runtime of AI agent models like Deep Research from minutes to hours or days is expected to yield significantly more powerful results.

OpenAI is developing reinforcement fine-tuning techniques that will allow developers to create their own domain-specific tasks and graders to train models on correct tool-calling paths.

OpenAI's Computer Use models are being successfully applied in the medical domain to automate manual, multi-application workflows in legacy systems that lack APIs.

Browserbase and YC startup Scrappybara are identified as key platform players providing hosted virtual machine environments optimized for OpenAI's Computer Use models.

The Arc web browser is developing a native integration, potentially called Dia, that allows users to give an instruction in a tab, which then executes a task in the background, similar to OpenAI's Operator.

Topics

AI Agents OpenAI Agents SDK Responses API Assistants API Developer Experience API Design Multi-agent Systems Swarms Chain of Thought Tool Use Computer Use Models Model Orchestration AI Infrastructure Fine-tuning Scientific Research RAG Prompt Engineering

Processed Apr 3, 2026Daily intelligence brief → yt-dlp + mlx-whisper + Gemini

Inside OpenAI's New Agent Development Tools

Continue your research

Concerns Raised

Opportunities Identified

Key Themes

The Evolution of AI Agents

Simplifying the Developer Experience

Orchestration as the Key Differentiator

The Emerging AI Infrastructure Ecosystem

Research Findings12

Topics