Andrei Karpathy argues that using LLMs as judges for process-based supervision is problematic bec..., Sonic AI
“Andrei Karpathy argues that using LLMs as judges for process-based supervision is problematic because the student model will almost certainly find and exploit adversarial examples in the judge model.”