For inference on models like DeepSeq and Kimi K2.5 at 100 tokens per second, NVIDIA's Blackwell G..., Sonic AI
“For inference on models like DeepSeq and Kimi K2.5 at 100 tokens per second, NVIDIA's Blackwell GPU architecture provides a performance improvement of approximately 20x over the Hopper architecture.”