Models

Every voice model Onepin can route to, planned and validated through the same pipeline.

ElevenLabs logo
ElevenLabs

Multilingual v2

Commercial

Studio-grade multilingual speech with wide locale coverage and full SSML. Best-in-class Chinese and Cantonese.

Voices
300+ voices + cloning
Languages
+2634 regions supported
Pricing
$18-50 / 1M chars
Custom pronunciation
Yes
Controllability

Stability, similarity, style exaggeration and speed (0.7 to 1.2).

ElevenLabs logo
ElevenLabs

Eleven v3

Commercial

Alpha flagship with inline audio-tag emotion ([whispering], [laughing]) across 70+ languages. The most expressive ElevenLabs model.

Voices
300+ voices + cloning
Languages
+5361 regions supported
Pricing
$300 / 1M chars
Custom pronunciation
Yes
Controllability

Audio tags ([whispering], [laughing], [excited]) plus stability, similarity and style exaggeration.

ElevenLabs logo
ElevenLabs

Turbo v2.5

Commercial

Balanced quality and speed with audio markup tags and instant cloning; sub-250ms latency for interactive use.

Voices
300+ voices + instant cloning
Languages
+2937 regions supported
Pricing
$15-25 / 1M chars
Custom pronunciation
Yes
Controllability

Stability, similarity, style and speed.

ElevenLabs logo
ElevenLabs

Flash v2.5

Commercial

Lowest-latency ElevenLabs model (~75ms) for real-time agents, with strong Chinese and Cantonese and 300+ voices.

Voices
300+ voices + cloning
Languages
+2937 regions supported
Pricing
$15 / 1M chars
Custom pronunciation
Yes
Controllability

Stability, similarity and speed (limited expression).

Cartesia logo
Cartesia

Sonic-2

Commercial

Ultra-low-latency speech on state-space models, with sub-100ms streaming and on-device deployments for real-time agents.

Voices
Curated library + cloning
Languages
+715 regions supported
Pricing
$25-50 / 1M chars
Custom pronunciation
Yes
Controllability

Speed, emotion, pause/pacing, volume and voice mixing.

Cartesia logo
Cartesia

Sonic Turbo

Commercial

The fastest Sonic tier, with pacing tags and cross-lingual cloning tuned for high-throughput, latency-critical apps.

Voices
Marketplace + cross-lingual cloning
Languages
+715 regions supported
Pricing
$25 / 1M chars
Custom pronunciation
Yes
Controllability

Speed and pause/pacing tags with faster, lighter cloning.

Rime Labs logo
Rime Labs

Arcana

Commercial

Expressive, production-grade conversational speech with deterministic phoneme control, built for audio at scale.

Voices
Production voice library
Languages
3 regions supported
Pricing
~$25-40 / 1M chars
Custom pronunciation
Yes
Controllability

Phoneme tags, emotion via Arcana tags and pause control.

Rime Labs logo
Rime Labs

Mist V2

Commercial

Low-latency conversational model with 300+ voices and deterministic IVR pronunciation for high-volume agents.

Voices
300+ voices + cloning
Languages
3 regions supported
Pricing
~$25-40 / 1M chars
Custom pronunciation
Yes
Controllability

Speed, deterministic IVR pronunciation and pause control.

Naver Clova logo
Naver Clova

Voice

Commercial

Korea's market-leading TTS, with premium Korean voices plus English, Japanese and Chinese, avatar video and cloning.

Voices
Multiple branded voices
Languages
4 regions supported
Pricing
~$30 / 1M chars
Custom pronunciation
Yes
Controllability

Emotion (where supported), speed, pitch, volume and pause.

Respeecher logo
Respeecher

Custom voice cloning

Commercial

Studio-grade, ethically-sourced voice cloning used by EA, Sony and Lucasfilm; speech-to-speech that captures a real performance.

Voices
Library + bespoke clones
Languages
+210 regions supported
Pricing
Custom / per-project
Custom pronunciation
Yes
Controllability

Pitch, emotion and formant driven by a reference performance (speech-to-speech), plus de-noising.