Surge, a bootstrapped human data startup, has surpassed $1 billion in annual revenue by providing high-quality data for training AI models to clients like OpenAI, Google, and Anthropic.
CEO Edwin Chen argues that high-quality data embraces human creativity and subjectivity, asserting that a small amount of rich human data is more valuable than millions of synthetic or low-quality data points.
Chen is highly critical of popular AI benchmarks like the LMSYS Chatbot Arena, calling it a "giant plague on AI" that incentivizes superficial qualities like verbosity rather than true capability.
The future of AI training will rely on increasingly complex and diverse reinforcement learning (RL) environments, for which Chen believes there is "no ceiling" on useful richness, and human feedback will remain essential even as models become superhuman.
12 quotes
Concerns Raised
The AI industry's reliance on flawed and gameable benchmarks like LMSYS Chatbot Arena is leading to misleading evaluations of model quality.
Many data labeling companies are simply 'body shops' that scale up mediocrity by focusing on checklists instead of true data quality.
Silicon Valley's startup culture often prioritizes fundraising for status over building a sustainable business.
Opportunities Identified
There is a massive, growing demand for high-quality, nuanced human data to train and evaluate frontier AI models.
The shift towards training agents in complex, rich reinforcement learning environments creates a new frontier for data generation.
The limitations of purely synthetic data create a durable need for human intelligence to guide and refine AI training.
Educating the market on what constitutes high-quality data can create a significant competitive advantage.