“ElevenLabs prices its text-to-speech services by text tokens and its voice agent and transcription services per minute.”
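
As a rough illustration of the two billing bases described above, the following Python sketch adds a usage-based text charge to per-minute charges. The `Rates` class, the `estimate_cost` function, and every rate value are hypothetical placeholders chosen for the example; they are not ElevenLabs' actual prices or API.

```python
from dataclasses import dataclass


@dataclass(frozen=True)
class Rates:
    """Hypothetical placeholder rates (not ElevenLabs' real prices)."""
    per_text_unit: float             # cost per billed text unit (text-to-speech)
    per_agent_minute: float          # cost per minute of voice agent usage
    per_transcription_minute: float  # cost per minute of transcription


def estimate_cost(
    rates: Rates,
    text_units: int = 0,
    agent_minutes: float = 0.0,
    transcription_minutes: float = 0.0,
) -> float:
    """Sum the text-based charge and the two per-minute charges."""
    if min(text_units, agent_minutes, transcription_minutes) < 0:
        raise ValueError("usage quantities must be non-negative")
    return (
        text_units * rates.per_text_unit
        + agent_minutes * rates.per_agent_minute
        + transcription_minutes * rates.per_transcription_minute
    )


if __name__ == "__main__":
    # Assumed example rates; substitute the figures from a real price list.
    example_rates = Rates(
        per_text_unit=0.5,
        per_agent_minute=2.0,
        per_transcription_minute=1.0,
    )
    print(estimate_cost(example_rates, text_units=10, agent_minutes=3.0))
```

Keeping the rates in a separate object makes it easy to swap in real figures without touching the arithmetic.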