The Humanness Index™

The open benchmark for how human voice AI sounds. Built and operated by Vapi.

Code is Apache-2.0. Standings data is CC BY 4.0. Audio clips and source voices are licensed recordings, all rights reserved. Provider logomarks belong to their respective owners and are used nominatively. “The Humanness Index™” name and logo are Vapi trademarks; see TRADEMARKS.md.

Humanness Index™ · TTS model

Flash v2.5

by ElevenLabs

Flash v2.5 is the multilingual member of ElevenLabs' fastest family, generating speech in about 75 ms across 32 languages.

Rank: #7
Humanness: 72
Likely rank: #3–16
Blind votes: 100

Standings as of Jun 13, 2026, 00:15 UTC

A real arena clip: a cloned source voice reading a customer support prompt at phone quality.

Flash v2.5 key stats

Latency (measured): 197 ms [1]
Languages: 32 [2]
Price / 1M chars: $50 [3]
Released: December 18, 2024 [4]

  1. Vapi streaming benchmark (50 trials per model) (checked 2026-06-10). Median of 50 sequential live streaming trials, June 2026; includes network RTT from the benchmark machine.
  2. elevenlabs.io/docs/overview/models (checked 2026-06-10). Low-latency tier default (Turbo/Flash v2.5); Turbo v2 and Flash v2 are English-only, Multilingual v2 lists 29, and Eleven v3 lists 70+, encoded per model.
  3. elevenlabs.io/pricing/api (checked 2026-06-10). ElevenAPI pay-as-you-go: Flash/Turbo $0.05 per 1k characters = $50 per 1M. Eleven v3 and Multilingual v2 bill $0.10 per 1k ($100 per 1M), encoded per model.
  4. elevenlabs.io/blog/meet-flash (checked 2026-06-10).
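
The per-1k to per-1M conversion in the pricing note is simple arithmetic, and the short sketch below spells it out. The function and constant names are illustrative assumptions, not part of any ElevenLabs SDK; the two rates are the pay-as-you-go figures quoted in the pricing note.

```python
def cost_usd(num_chars: int, usd_per_1k_chars: float) -> float:
    """Pay-as-you-go cost in USD for synthesizing num_chars characters."""
    return num_chars / 1000 * usd_per_1k_chars


# Rates as quoted in the pricing note (checked 2026-06-10); names are hypothetical.
FLASH_USD_PER_1K = 0.05  # Flash / Turbo tier
V3_USD_PER_1K = 0.10     # Eleven v3 / Multilingual v2

flash_per_million = cost_usd(1_000_000, FLASH_USD_PER_1K)
v3_per_million = cost_usd(1_000_000, V3_USD_PER_1K)

print(f"Flash v2.5: ${flash_per_million:.2f} per 1M chars")
print(f"Eleven v3:  ${v3_per_million:.2f} per 1M chars")
```

The same helper gives a rough budget for any workload, for example `cost_usd(250_000, FLASH_USD_PER_1K)` for a quarter of a million characters.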

Background

Flash v2.5 is the multilingual member of ElevenLabs' fastest family, generating speech in about 75 ms across 32 languages by ElevenLabs' own figure (our streaming benchmark, which includes network round-trip time, measured a median of 197 ms to first audio). Announced in December 2024, it is the model ElevenLabs recommends for real-time agents beyond English, and it bills at half the credit cost per character of the flagship Eleven v3. That mix makes it the default choice for latency-sensitive, cost-sensitive voice deployments on the ElevenLabs platform.

Sources: elevenlabs.io

At a glance

Flash v2.5 pairs the Flash latency tier with 32 languages. In our 50-trial streaming benchmark it returned first audio in a median of 197 ms, the fastest ElevenLabs result on the Index.

Sources: elevenlabs.io
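
For readers who want to reproduce a figure of this kind, here is a minimal sketch of a median time-to-first-audio measurement over sequential trials. It is not the Index's benchmark harness. `fake_stream` is a hypothetical stand-in for a real streaming TTS request, and all function names are assumptions made for illustration.

```python
import statistics
import time
from typing import Callable, Iterable, Iterator


def time_to_first_chunk_ms(open_stream: Callable[[], Iterable[bytes]]) -> float:
    """Milliseconds from starting the request until the first non-empty chunk arrives."""
    start = time.perf_counter()
    for chunk in open_stream():
        if chunk:
            return (time.perf_counter() - start) * 1000.0
    raise RuntimeError("stream ended without producing audio")


def median_first_audio_ms(open_stream: Callable[[], Iterable[bytes]], trials: int = 50) -> float:
    """Run the trials one after another and return the median latency in ms."""
    samples = [time_to_first_chunk_ms(open_stream) for _ in range(trials)]
    return statistics.median(samples)


def fake_stream() -> Iterator[bytes]:
    """Hypothetical stand-in for a streaming TTS call: waits, then yields audio bytes."""
    time.sleep(0.005)  # simulated network plus model delay
    yield b"\x00" * 320
    yield b"\x00" * 320


result = median_first_audio_ms(fake_stream, trials=5)
print(f"median time to first audio: {result:.1f} ms")
```

Using the median rather than the mean keeps one slow trial from skewing the result. Because the timer starts before the request is issued, network round-trip time is part of the measurement.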

Position in the rankings

Standings as of Jun 13, 2026, 00:15 UTC

| Rank | Provider | Model | Humanness | Latency |
|---|---|---|---|---|
| #5 | ElevenLabs | Eleven v3 | 76 | 758 ms |
| #6 | ElevenLabs | Turbo v2.5 | 75 | 265 ms |
| #7 | ElevenLabs | Flash v2.5 | 72 | 197 ms |
| #8 | MiniMax | Speech 2.5 | 71 | 325 ms |
| #9 | Inworld | TTS-2 | 66 | 288 ms |

Frequently asked questions

How is Flash v2.5 tested on the Humanness Index™?
Listeners hear Flash v2.5 against another model in a blind head-to-head round. Both voices read the same customer support prompt in the same cloned source voice, and listeners pick whichever sounds more human. Its Humanness score derives purely from those votes.

Which languages does Flash v2.5 support?
Flash v2.5 generates speech in 32 languages, making it the multilingual member of the fastest ElevenLabs family. The English-only sibling is Flash v2.
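
To illustrate why a model with a modest number of blind votes carries a wide uncertainty band, here is a small sketch that tallies head-to-head votes into a win rate with a 95% Wilson score interval. This is an assumption-laden illustration only. The page does not describe the Index's actual scoring model, the vote list is invented, and a Humanness score should not be read as a raw win rate.

```python
import math


def wilson_interval(wins: int, total: int, z: float = 1.96) -> tuple[float, float]:
    """Wilson score interval for a win rate; z = 1.96 gives roughly 95% coverage."""
    if total <= 0:
        raise ValueError("total must be positive")
    p = wins / total
    denom = 1 + z * z / total
    center = (p + z * z / (2 * total)) / denom
    half = z * math.sqrt(p * (1 - p) / total + z * z / (4 * total * total)) / denom
    return center - half, center + half


# Hypothetical ballots: "A" means the listener picked model A as more human.
votes = ["A"] * 60 + ["B"] * 40

wins_a = votes.count("A")
low, high = wilson_interval(wins_a, len(votes))
print(f"win rate {wins_a / len(votes):.2f}, 95% interval {low:.3f} to {high:.3f}")
```

With only 100 ballots the interval spans nearly twenty percentage points, which is one plausible reason a benchmark would report a range of likely ranks next to a single rank.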

Keep exploring

All ElevenLabs models on the Index:
Turbo v2: Rank #14 · Humanness 63
Turbo v2.5: Rank #6 · Humanness 75
Flash v2: Rank #17 · Humanness 57
Eleven v3: Rank #5 · Humanness 76
Multilingual v2: Rank #16 · Humanness 61

How human does your model really sound?

The benchmark is open source. Suggest a model, read the methodology, or ask us to put your voice in the arena.