“OpenAI's Jalapeño chip is an ASIC designed for the specific task of serving inference for LLMs, similar to Google's TPUs.”