Skip to content
The Humanness Index™
Built by VapiGitHub

The Humanness Index™

The open benchmark for how human voice AI sounds. Built and operated by Vapi.

MethodologyGitHubContactvapi.ai

Code is Apache-2.0. Standings data is CC BY 4.0. Audio clips and source voices are licensed recordings, all rights reserved. Provider logomarks belong to their respective owners and are used nominatively. “The Humanness Index™” name and logo are Vapi trademarks; see TRADEMARKS.md.

  1. Humanness Index™
  2. Canopy Labs
  3. Orpheus

Humanness Index™ · TTS model

Canopy Labs

Orpheus

by Canopy Labs

Orpheus is Canopy Labs' open source speech LLM, released in March 2025 under the Apache 2.0 license.

Rank
#4
Humanness
78
Likely rank
#3–15
Blind votes
96

Standings as of Jun 13, 2026, 00:15 UTC

LowerHigher

A real arena clip: a cloned source voice reading a customer support prompt at phone quality.

Orpheus key stats

Latency (measured)
—Not measured: no publicly reachable API at benchmark time. The Index never shows vendor latency estimates.
Languages
English1
Price / 1M chars
Open source2
Voice cloning
zero-shot3
Open source
Apache 2.04
Released
March 18, 20255
  1. github.com/canopyai/Orpheus-TTS (checked 2026-06-10) Flagship trained on English; a multilingual research preview followed in April 2025.
  2. github.com/canopyai/Orpheus-TTS (checked 2026-06-10) Apache 2.0 code; weights under the Llama 3.2 Community License. No commercial per-character price.
  3. github.com/canopyai/Orpheus-TTS (checked 2026-06-10)
  4. github.com/canopyai/Orpheus-TTS (checked 2026-06-10) Code Apache 2.0; weights under the Llama 3.2 Community License.
  5. canopylabs.ai/model-releases (checked 2026-06-10) Multilingual research preview followed in 2025-04.

Background

Orpheus is Canopy Labs' open source speech LLM, released in March 2025 under the Apache 2.0 license. It is built on a Llama 3.2 3B backbone trained on more than 100,000 hours of English speech, and it showed that an open model can compete with closed source rivals on prosody and empathy. It supports zero shot voice cloning, tag based emotion control, and real time streaming at roughly 200 ms time to first audio on optimized hosts. Teams can self host it or run it through managed providers such as Baseten.

Sources: github.com, canopylabs.ai

At a glance

The open weights entry on the Index: a Llama 3.2 3B backbone, zero shot cloning, emotion tags, and promised 1B, 400M, and 150M variants. Canopy Labs selected Baseten as its preferred inference provider in May 2025, with an FP8 implementation for production serving. It has no hosted first party API, so the Index shows no measured latency.

Sources: baseten.co

Position in the rankings

Standings as of Jun 13, 2026, 00:15 UTC

RankProviderModelHumannessLatency
#2xAIxAIGrok TTS (Streaming)98285 ms
#3CartesiaCartesiaSonic 3.582128 ms
#4Canopy LabsCanopy LabsOrpheus78—
#5ElevenLabsElevenLabsEleven v376758 ms
#6ElevenLabsElevenLabsTurbo v2.575265 ms

See the full Humanness Index™ rankings

Frequently asked questions

How is Orpheus tested on the Humanness Index™?
Listeners hear Orpheus against another model in a blind head to head round, both voices reading the same customer support prompt from the same cloned source voice, and they pick whichever sounds more human. Its Humanness score derives purely from those votes.
Is Orpheus really open source?
Yes. The code is Apache 2.0 and the weights are published under the Llama 3.2 Community License, so teams can self host it or run it through managed providers such as Baseten. It has no commercial per character price.

Keep exploring

Canopy LabsCanopy LabsAll Canopy Labs models on the Index

Back to the Humanness Index™

How human does your model really sound?

The benchmark is open source. Suggest a model, read the methodology, or ask us to put your voice in the arena.

Add your modelStar on GitHub