Sonic AI
Reasoning models reduce the maximum concurrent user batch size on a server to one-fourth or one-f...