MiniMax models on the Humanness Index™
About MiniMax
MiniMax is the Shanghai-based AI lab behind the MiniMax speech line, served through its hosted t2a_v2 API with HD and Turbo tiers. It is widely regarded as the strongest text-to-speech provider for Chinese, and its recent generations have brought English accuracy and rhythm up alongside that strength.
Sources: minimax.io
Speech generations
The speech line moved fast through 2025, with the Speech-02 series arriving in April and Speech 2.5 following in August. The current generation supports more than 40 languages and clones a voice from roughly six to ten seconds of reference audio, using a learnable speaker encoder that needs no transcript. The clips on this Index were generated with the Speech 2.5 generation, Turbo tier.
Sources: minimax.io, platform.minimax.io
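For readers who want to picture what a request to a t2a_v2-style endpoint might look like, here is a minimal sketch that only assembles a request body and sends nothing over the network. The model identifiers, the field names (voice_setting, audio_setting, voice_id) and the helper build_t2a_request are all assumptions made for illustration. They are not taken from this page and should be checked against the provider's own documentation.

```python
import json

# Hypothetical mapping from tier to model identifier.
# These strings are assumptions, not confirmed by this page.
ASSUMED_MODELS = {
    "turbo": "speech-2.5-turbo-preview",
    "hd": "speech-2.5-hd-preview",
}


def build_t2a_request(text, tier="turbo", voice_id="example-voice"):
    """Assemble a JSON-serializable body for a t2a_v2-style request.

    The field names below are assumed for illustration only.
    """
    key = tier.lower()
    if key not in ASSUMED_MODELS:
        raise ValueError(f"unknown tier: {tier!r}")
    if not text.strip():
        raise ValueError("text must not be empty")
    return {
        "model": ASSUMED_MODELS[key],
        "text": text,
        "voice_setting": {"voice_id": voice_id},  # assumed field name
        "audio_setting": {"format": "mp3"},       # assumed field name
    }


if __name__ == "__main__":
    body = build_t2a_request("Hello from the arena.", tier="Turbo")
    print(json.dumps(body, indent=2))
```

Keeping the tier lookup in one table means switching between the HD and Turbo tiers is a one-argument change, which is handy when comparing clips from both.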
How human does your model really sound?
The benchmark is open source. Suggest a model, read the methodology, or ask us to put your voice in the arena.