Skip to content
Sonic AI
HBM-based chips like Google's TPUs typically have a latency of around 20 milliseconds per token, ... — Sonic AI