Chatterbox Turbo TTS Serverless API

Ultra-fast, human-quality TTS with emotional expression.

~13.00s

 1import requests
 2import json
 3
 4url = "https://api.segmind.com/v1/chatterbox-turbo-tts"
 5headers = {
 6    "x-api-key": "YOUR_API_KEY",
 7    "Content-Type": "application/json"
 8}
 9
10data = {
11    "text": "Introducing the AI-enhanced Skyline T-800. [laugh] It's got integrated AI for superior performance. Want to explore pricing?",
12    "reference_audio": "https://segmind-resources.s3.amazonaws.com/output/70e9a9dd-e0f4-450f-a590-872996b44a01-chatterbox-turbo-input.mp3",
13    "temperature": 0.7,
14    "seed": 42,
15    "min_p": 0,
16    "top_p": 0.9,
17    "top_k": 500,
18    "repetition_penalty": 1.3,
19    "norm_loudness": true
20}
21
22response = requests.post(url, headers=headers, json=data)
23
24if response.status_code == 200:
25    result = response.json()
26    print(json.dumps(result, indent=2))
27else:
28    print(f"Error: {response.status_code}")
29    print(response.text)

 1import requests
 2import json
 3
 4url = "https://api.segmind.com/v1/chatterbox-turbo-tts"
 5headers = {
 6    "x-api-key": "YOUR_API_KEY",
 7    "Content-Type": "application/json"
 8}
 9
10data = {
11    "text": "Introducing the AI-enhanced Skyline T-800. [laugh] It's got integrated AI for superior performance. Want to explore pricing?",
12    "reference_audio": "https://segmind-resources.s3.amazonaws.com/output/70e9a9dd-e0f4-450f-a590-872996b44a01-chatterbox-turbo-input.mp3",
13    "temperature": 0.7,
14    "seed": 42,
15    "min_p": 0,
16    "top_p": 0.9,
17    "top_k": 500,
18    "repetition_penalty": 1.3,
19    "norm_loudness": true
20}
21
22response = requests.post(url, headers=headers, json=data)
23
24if response.status_code == 200:
25    result = response.json()
26    print(json.dumps(result, indent=2))
27else:
28    print(f"Error: {response.status_code}")
29    print(response.text)

API Endpoint

POSThttps://api.segmind.com/v1/chatterbox-turbo-tts

Parameters

reference_audiorequired

string (uri)

Provide a reference audio URL for voice matching. Use a public link for seamless access.

textrequired

string

Enter text to synthesize into speech. Use sound tags like [chuckle] for expression.

min_poptional

number

Set minimum token probability. Higher values filter out unlikely outputs.

Default: 0Range: 0 - 1

norm_loudnessoptional

boolean

Normalize audio to standard loudness. Recommended for consistent volume across outputs.

Default: true

repetition_penaltyoptional

number

Penalty for repeated words. Higher values reduce redundancy in speech.

Default: 1.3Range: 1 - 2

seedoptional

integer

Set random seed for reproducible results. Zero for random, specific value for consistency.

Default: 42

temperatureoptional

number

Adjust creativity and expressiveness in speech. Use lower for consistency, higher for diversity.

Default: 0.7Range: 0.05 - 2

top_koptional

integer

Limit to top K likely tokens. Lower for concise, higher for diverse expressions.

Default: 500Range: 0 - 1000

top_poptional

number

Control randomness via cumulative probability tokens. Lower for predictable, higher for varied outputs.

Default: 0.9Range: 0 - 1

Response Type

Returns: Audio

Common Error Codes

The API returns standard HTTP status codes. Detailed error messages are provided in the response body.

400

Bad Request

Invalid parameters or request format

401

Unauthorized

Missing or invalid API key

403

Forbidden

Insufficient permissions

404

Not Found

Model or endpoint not found

406

Insufficient Credits

Not enough credits to process request

429

Rate Limited

Too many requests

500

Server Error

Internal server error

502

Bad Gateway

Service temporarily unavailable

504

Timeout

Request timed out