Humanness Index™ · Provider

xAI

Best ranked model: #2 Grok TTS
Humanness: 94

Standings as of Jul 28, 2026, 01:54 UTC

Models on the Index: 2
Languages: 20
Price / 1M chars: $15

xAI models on the Humanness Index™

Rank	Model	Humanness	Latency	Languages	Price / 1M chars
#2	Grok TTS	94	460 ms	20	$15
#6	Grok TTS (Streaming)	86	285 ms	20	$15

Compare against the full Humanness Index™ rankings

About xAI

xAI built its voice stack fully in house, from voice activity detection to the audio models themselves, for Grok Voice, the assistant that ships on Grok mobile apps, Tesla vehicles, and Starlink customer support. The Grok Voice Agent API opened the stack to developers in December 2025, and standalone TTS and STT APIs followed in April 2026.

Sources: x.ai, x.ai

The Grok voice stack

Grok TTS offers five expressive voices (Ara, Eve, Leo, Rex, and Sal, with Eve as the default) across 20 languages, with inline speech tags like [laugh], [sigh], and [whisper] for delivery control. REST requests accept up to 15,000 characters, and a WebSocket streaming variant accepts unbounded input. Both variants currently sit at the top of the Humanness Index™.

Sources: docs.x.ai

xAI stats

Languages: 20¹
Price / 1M chars: $15²

docs.x.ai/developers/model-capabilities/audio/voice (checked 2026-06-10)
x.ai/news/grok-stt-and-tts-apis (checked 2026-06-10) $15.00 per 1M characters per the launch post (x.ai/api/voice; docs.x.ai/developers/models/text-to-speech). Secondary coverage reported $4.20 per 1M; the launch post figure is used.