Skip to content
Sonic AI
Key performance metrics for LLM inference are time to first token (dominated by the pre-fill pass..., Sonic AI