Sonic AI
Chinese labs are using distillation strategies, including using LLMs as judges in RL post-trainin...