Jodi explains that the `truthful QA` benchmark is designed to measure a specific type of hallucin..., Sonic AI
“Jodi explains that the `truthful QA` benchmark is designed to measure a specific type of hallucination related to misconceptions and misinformation common in English-speaking communities, and is not a general-purpose hallucination metric.”