Tom McGrath notes that generating ground truth for hallucination detection at test time would be ..., Sonic AI
“Tom McGrath notes that generating ground truth for hallucination detection at test time would be prohibitively expensive, costing hundreds of thousands of dollars and taking several seconds per call using a system like Gemini 2.5 with web search.”