19

models

Speech Generation

The best AI text-to-speech and voice synthesis models, all available on Segmind via a single pay-per-use API. This collection includes models from ElevenLabs, Google Gemini TTS, Chatterbox, and more — covering everything from natural conversational voices to expressive character speech and multilingual narration. Whether you need realistic voiceovers for video content, interactive voice agents, podcast production, e-learning narration, or audiobook creation, these models deliver production-quality audio from plain text. Key models include ElevenLabs Turbo for ultra-low latency streaming TTS, Gemini 2.5 Flash TTS and Gemini 2.5 Pro TTS for high-fidelity multilingual speech, and Chatterbox Turbo for rapid, expressive voice generation. The collection also includes voice cloning, voice design, dialogue generation with timestamps, and speech-to-speech conversion models for complete voice production workflows. On Segmind, generate professional audio with a single API call and chain TTS models with video generation or lipsync tools in Workflows to automate complete multimedia content pipelines.

Text To Audio
Average Pricing$0.07

Sam Audio Large

13.1s
Text To Audio
Average Pricing$0.01

Gemini TTS 2.5 Flash

18.8s
Text To Audio
Average Pricing$0.02

Gemini TTS 2.5 Pro

25.6s
Text To Audio
Average Pricing$0.02

Chatterbox Turbo TTS

13.2s
Text To Audio
Average Pricing$0.02

Elevenlabs Dialogue

7.1s
Text To Audio
Average Pricing$0.02

VeenaMax TTS

13.0s
Text To Audio
Average Pricing$0.05

Veena TTS

45.1s
Text To Audio
Average Pricing$0.02

Chatterbox TTS

17.6s
Text To Audio
Average Pricing$0.09

Lyria 2

27.2s
Text To Audio
Average Pricing$0.04

Ace Step Music

11.9s
Text To Audio
Average Pricing$0.07

Dia (Text to Speech)

89.7s
Text To Audio
Average Pricing$0.07

Minimax Music-01

44.3s
Text To Audio
Average Pricing$0.12

3B Orpheus TTS (0.1)

117.6s
Text To Audio
Average Pricing$0.04

Meta MusicGen Medium

22.3s
Text To Audio
Average Pricing$0.01

MyShell Text To Speech

7.0s
Text To Audio
Average Pricing$0.01

Openvoice

10.0s
Text To Audio
Average Pricing$0.25

ElevenLabs Dubbing

93.1s
Text To Audio
Average Pricing$0.03

Elevenlabs Sound Generation

7.9s
Text To Audio
Average Pricing$0.1

Elevenlabs Text To Speech

12.3s