Skip to content
Sonic AI
Training a model against a reward-hacking detector may not solve the underlying problem and could... — Sonic AI