Humanness Index™ · TTS model

Cartesia Sonic 3

by Cartesia

Sonic 3 landed in October 2025 as a major step for Cartesia's state space model architecture.

Rank: #16
Humanness: 65
Likely rank: #12–16
Blind votes: 1,048

Standings as of Jul 28, 2026, 02:12 UTC

A real arena clip: a cloned source voice reading a customer support prompt at phone quality.

Sonic 3 key stats

Latency (measured): 166 ms¹
Languages: 42²
Price / 1M chars: $50³
Voice cloning: instant, about 10 s of audio⁴
Released: October 27, 2025⁵

¹ Vapi streaming benchmark (50 trials per model) (checked 2026-06-10). Median of 50 sequential live streaming trials, June 2026; includes network RTT from the benchmark machine.
² docs.cartesia.ai (checked 2026-06-10). Applies to Sonic 3 and 3.5; the earlier Sonic and Sonic 2 shipped with 15 languages, encoded per model.
³ cartesia.ai/pricing (checked 2026-06-10). 1 credit per character (docs.cartesia.ai/pricing); the entry self-serve Pro plan is $5/mo for 100K credits, an effective rate of $50 per 1M characters; larger plans drop to $37–39 per 1M. The credit rate is the same for every Sonic model.
⁴ docs.cartesia.ai/build-with-cartesia/tts-models/latest (checked 2026-06-10)
⁵ vapi.ai/blog/Cartesia-free-sonic-3-TTS (checked 2026-06-10)
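
The $50 per 1M figure follows directly from the plan numbers in the pricing note: $5 buys 100K credits at 1 credit per character. A minimal sketch of that arithmetic is below; the function and parameter names are illustrative and not part of any Cartesia API.

```python
def effective_rate_per_million(plan_price_usd: float,
                               plan_credits: int,
                               credits_per_char: float = 1.0) -> float:
    """Return the effective USD price per 1M characters for a credit plan."""
    if plan_credits <= 0 or credits_per_char <= 0:
        raise ValueError("plan_credits and credits_per_char must be positive")
    # Number of characters the plan's credits can pay for.
    chars_covered = plan_credits / credits_per_char
    # Scale the plan price up to one million characters.
    return plan_price_usd * 1_000_000 / chars_covered


# Entry Pro plan from the pricing note: $5/mo for 100K credits.
pro_rate = effective_rate_per_million(5.0, 100_000)
print(f"${pro_rate:.2f} per 1M characters")
```

The same function can be pointed at a larger plan's price and credit allowance to compare effective rates.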

Background

Sonic 3 landed in October 2025 as a major step for Cartesia's state space model architecture. It brought sub-100 ms latency, expanded coverage from 15 to 42 languages (including nine Indian languages), and added expressive controls for emotion and even laughter. Instant voice cloning works from about 10 seconds of audio, and the model ships with SOC 2, HIPAA, and PCI compliance for regulated phone work.

Sources: cartesia.ai, vapi.ai

At a glance

Sonic 3 offers emotion and laughter tags, 42 languages, and 10-second voice cloning. In our 50-trial streaming benchmark, Sonic 3 returned first audio in a median of 166 ms, including network time.

Sources: docs.cartesia.ai
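
To make the latency figure concrete, here is a rough sketch of measuring median time to first audio over sequential streaming trials. It is not the benchmark's actual harness: `start_stream` stands in for whatever client call opens a streaming TTS request, and `fake_stream` is a hypothetical stub so the example runs without a network.

```python
import statistics
import time
from typing import Callable, Iterable, Iterator


def time_to_first_audio_ms(start_stream: Callable[[str], Iterable[bytes]],
                           text: str) -> float:
    """Milliseconds from issuing the request to the first non-empty audio chunk."""
    started = time.perf_counter()
    for chunk in start_stream(text):
        if chunk:  # ignore empty keep-alive chunks
            return (time.perf_counter() - started) * 1000.0
    raise RuntimeError("stream ended without producing audio")


def median_first_audio_ms(start_stream: Callable[[str], Iterable[bytes]],
                          text: str,
                          trials: int = 50) -> float:
    """Run the trials one after another and return the median latency."""
    samples = [time_to_first_audio_ms(start_stream, text) for _ in range(trials)]
    return statistics.median(samples)


def fake_stream(text: str) -> Iterator[bytes]:
    """Hypothetical stand-in for a real streaming TTS client."""
    time.sleep(0.002)  # pretend network plus model delay
    yield b"\x00" * 320
    yield b"\x00" * 320


median_ms = median_first_audio_ms(fake_stream, "Thanks for calling.", trials=5)
print(f"median time to first audio: {median_ms:.1f} ms")
```

Because the timer starts before the request is issued, a real run would include network round-trip time, as the 166 ms figure does. The median is used rather than the mean so that a single slow trial does not skew the result.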

Position in the rankings

Standings as of Jul 28, 2026, 02:12 UTC

Rank	Provider	Model	Humanness	Latency
#14	ElevenLabs	Flash v2.5	68	197 ms
#15	Cartesia	Sonic 2	67	159 ms
#16	Cartesia	Sonic 3	65	166 ms
#17	Smallest.ai	Lightning v3.1	45	420 ms
#18	Neuphonic	neu_hq	41	276 ms

Frequently asked questions

How is Sonic 3 tested on the Humanness Index™?

Listeners hear Sonic 3 against another model in a blind head-to-head round. Both voices read the same customer support prompt from the same cloned source voice, and listeners pick whichever sounds more human. Its Humanness score derives purely from those votes.

Which languages does Sonic 3 support?

Sonic 3 expanded Cartesia's coverage from 15 to 42 languages, including nine Indian languages.
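
For intuition on how blind head-to-head votes can turn into a per-model score, the sketch below tallies each model's share of the rounds it won, on a 0 to 100 scale. This is only an illustration: the Index's actual scoring formula is not described here, and the model names and votes are made up.

```python
from collections import Counter
from typing import Dict, Iterable, Tuple

Vote = Tuple[str, str, str]  # (model_a, model_b, winner)


def win_share(votes: Iterable[Vote]) -> Dict[str, float]:
    """Percentage of head-to-head rounds each model won (0 to 100)."""
    wins: Counter = Counter()
    rounds: Counter = Counter()
    for model_a, model_b, winner in votes:
        if winner not in (model_a, model_b):
            raise ValueError(f"winner {winner!r} did not take part in the round")
        rounds[model_a] += 1
        rounds[model_b] += 1
        wins[winner] += 1
    return {model: 100.0 * wins[model] / rounds[model] for model in rounds}


# Hypothetical votes, not Index data.
example_votes = [
    ("model_a", "model_b", "model_a"),
    ("model_a", "model_b", "model_b"),
    ("model_a", "model_c", "model_a"),
    ("model_b", "model_c", "model_b"),
]
scores = win_share(example_votes)
for model, score in sorted(scores.items()):
    print(f"{model}: {score:.1f}")
```

A plain win share ignores how strong each opponent was. Rating systems built on pairwise votes usually correct for that, which is one reason a published score may differ from a raw win rate.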

Keep exploring

Cartesia: all Cartesia models on the Index
Sonic: Rank #20 · Humanness 0
Sonic 2: Rank #15 · Humanness 67
Sonic 3.5: Rank #12 · Humanness 70

