Humanness Index™ · TTS model

Cartesia Sonic 3.5

by Cartesia

Sonic 3.5 is Cartesia's current flagship, released in May 2026.

Rank: #12
Humanness: 70
Likely rank: #8–15
Blind votes: 1,050

Standings as of Jul 28, 2026, 02:12 UTC

A real arena clip: a cloned source voice reading a customer support prompt at phone quality.

Sonic 3.5 key stats

Latency (measured): 128 ms¹
Languages: 42²
Price / 1M chars: $50³
Released: May 4, 2026⁴

¹ Vapi streaming benchmark (50 trials per model; checked 2026-06-10): median of 50 sequential live streaming trials, June 2026; includes network RTT from the benchmark machine.
² docs.cartesia.ai (checked 2026-06-10): applies to Sonic 3 and 3.5; earlier Sonic and Sonic 2 shipped 15, encoded per model.
³ cartesia.ai/pricing (checked 2026-06-10): 1 credit per character (docs.cartesia.ai/pricing); the entry self-serve Pro plan is $5/mo for 100K credits, a $50 per 1M effective rate; larger plans drop to $37-39 per 1M. Same credit rate for every Sonic model.
⁴ docs.cartesia.ai/build-with-cartesia/tts-models/latest (checked 2026-06-10): snapshot release.
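The $50 per 1M figure follows from the plan price and credit allowance quoted in the pricing note. A minimal sketch of that arithmetic, assuming 1 credit per character as stated there; the function name and parameters are illustrative and not part of any Cartesia tooling:

```python
def effective_rate_per_million(plan_price_usd: float,
                               credits_included: int,
                               credits_per_char: float = 1.0) -> float:
    """Return the effective USD cost per 1M characters for a plan.

    Assumes the whole credit allowance is spent on TTS characters.
    """
    chars_included = credits_included / credits_per_char
    return plan_price_usd * 1_000_000 / chars_included


# Entry Pro plan from the pricing note: $5/mo for 100K credits.
pro_rate = effective_rate_per_million(plan_price_usd=5.0, credits_included=100_000)
print(f"${pro_rate:.2f} per 1M characters")
```

The same function can be pointed at a larger plan's price and allowance to check the lower per-1M rates quoted for those plans.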

Background

Sonic 3.5 is Cartesia's current flagship, released in May 2026. Cartesia positions it as its most natural and fastest model, with sub-90 ms latency and native support for 42 languages. It is tuned for production agent transcripts: it reads order numbers, emails, and confirmation codes correctly without preprocessing, and it resolves heteronyms such as "read" and "bow" from the surrounding words.

Sources: docs.cartesia.ai

At a glance

Sonic 3.5 handles alphanumerics and heteronyms without preprocessing, supports 42 languages, and carries a published sub-90 ms latency claim. In our 50-trial streaming benchmark it returned first audio in a median of 128 ms, the fastest measured time among current-generation models on the Index.

Sources: docs.cartesia.ai
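The measured figure is a median time to first audio over sequential streaming trials. The sketch below shows one way such a measurement could be structured. It is not the benchmark's actual harness: `stream_factory` is a hypothetical callable that opens a stream and yields audio chunks, and `fake_stream` is a local stand-in so the example runs without any network or vendor SDK.

```python
import statistics
import time
from typing import Callable, Iterable, Iterator


def time_to_first_audio_ms(stream_factory: Callable[[], Iterable[bytes]]) -> float:
    """Open one stream and return milliseconds until the first non-empty chunk."""
    start = time.perf_counter()
    for chunk in stream_factory():
        if chunk:  # ignore empty keep-alive chunks, if any
            return (time.perf_counter() - start) * 1000.0
    raise RuntimeError("stream ended before any audio arrived")


def median_first_audio_ms(stream_factory: Callable[[], Iterable[bytes]],
                          trials: int = 50) -> float:
    """Run the trials one after another and return the median latency in ms."""
    samples = [time_to_first_audio_ms(stream_factory) for _ in range(trials)]
    return statistics.median(samples)


def fake_stream() -> Iterator[bytes]:
    """Stand-in for a real TTS stream (assumption): ~5 ms delay, then audio."""
    time.sleep(0.005)
    yield b"\x00" * 320
    yield b"\x00" * 320


result = median_first_audio_ms(fake_stream, trials=5)
print(f"median time to first audio: {result:.1f} ms")
```

Because the timer starts before the stream is opened, a real run would include network round-trip time from the measuring machine. That matches the benchmark note, and it is one reason a measured number can sit above a vendor's published latency.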

Position in the rankings

Standings as of Jul 28, 2026, 02:12 UTC

Rank	Provider	Model	Humanness	Latency
#10	ElevenLabs	Turbo v2	75	302 ms
#11	Inworld	TTS-2	71	288 ms
#12	Cartesia	Sonic 3.5	70	128 ms
#13	MiniMax	Speech 2 Turbo	70	315 ms
#14	ElevenLabs	Flash v2.5	68	197 ms

Frequently asked questions

How is Sonic 3.5 tested on the Humanness Index™?

Listeners hear Sonic 3.5 against another model in a blind head-to-head round, with both voices reading the same customer support prompt from the same cloned source voice, and they pick whichever sounds more human. Its Humanness score derives purely from those votes.

How fast is Sonic 3.5?

Cartesia publishes sub-90 ms latency. In our 50-trial streaming benchmark it returned first audio in a median of 128 ms, including network time from the benchmark machine.
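The page does not state how votes are converted into a Humanness score. Purely as an illustration of reducing blind pairwise votes to a per-model number, the sketch below computes a simple win share on a 0 to 100 scale. This is an assumed stand-in, not the Index's formula, and the model names and votes are made-up toy data.

```python
from collections import Counter
from typing import Dict, Iterable, Tuple


def win_share_scores(votes: Iterable[Tuple[str, str]]) -> Dict[str, float]:
    """Turn (winner, loser) vote pairs into a 0-100 win share per model.

    Illustrative only: a real leaderboard may weight opponents differently.
    """
    wins: Counter = Counter()
    rounds: Counter = Counter()
    for winner, loser in votes:
        wins[winner] += 1
        rounds[winner] += 1
        rounds[loser] += 1
    return {model: 100.0 * wins[model] / rounds[model] for model in rounds}


# Toy votes (hypothetical): each tuple is one blind round, winner first.
toy_votes = [
    ("model_a", "model_b"),
    ("model_a", "model_c"),
    ("model_b", "model_c"),
    ("model_c", "model_a"),
]
scores = win_share_scores(toy_votes)
for model, score in sorted(scores.items()):
    print(f"{model}: {score:.1f}")
```

A raw win share ignores opponent strength, so rating methods that account for who was beaten are a common alternative for head-to-head data.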

Keep exploring

Cartesia: All Cartesia models on the Index
Sonic: Rank #20 · Humanness 0
Sonic 2: Rank #15 · Humanness 67
Sonic 3: Rank #16 · Humanness 65
