Skip to content
Sonic AI
Using a "teacher" model as a judge in a reinforcement learning setup is a viable path to training... — Sonic AI