Skip to content
Sonic AI
The on-policy self-distillation (OP-SD) technique for model training does not require an outer-lo..., Sonic AI