Kling 3.0 Standard Text-to-Video Serverless API

Kling 3.0 creates stunning 1080p cinematic videos from simple text prompts with realistic motion and audio.

~174.80s
$0.504 - $8.40 per generation
 1import requests
 2import json
 3
 4url = "https://api.segmind.com/v1/kling-3-standard-text2video"
 5headers = {
 6    "x-api-key": "YOUR_API_KEY",
 7    "Content-Type": "application/json"
 8}
 9
10data = {
11    "prompt": "Slow cinematic push-in through an empty ancient temple. Fog drifts lazily through the valley below. Golden light catches dust particles floating between stone pillars.",
12    "duration": "5",
13    "cfg_scale": 0.5,
14    "aspect_ratio": "16:9",
15    "generate_audio": true,
16    "negative_prompt": "blur, distort, and low quality"
17}
18
19response = requests.post(url, headers=headers, json=data)
20
21if response.status_code == 200:
22    result = response.json()
23    print(json.dumps(result, indent=2))
24else:
25    print(f"Error: {response.status_code}")
26    print(response.text)

API Endpoint

POSThttps://api.segmind.com/v1/kling-3-standard-text2video

Parameters

promptrequired
string

Describe the video content. Be detailed about actions, camera movements, lighting.

aspect_ratiooptional
string

Aspect ratio of the generated video.

Default: "16:9"
Allowed values :
"16:9""9:16""1:1"
cfg_scaleoptional
number

Prompt adherence strength. Higher values follow the prompt more closely.

Default: 0.5Range: 0 - 1
durationoptional
string

Length of the output video in seconds.

Default: "5"
Allowed values (13 total):
"3""4""5""6""7""8""9""10""11""12"+3 more
generate_audiooptional
boolean

Generate synchronized audio. Supports Chinese and English.

Default: true
multi_promptoptional
object[]

Multi-shot video generation. Each segment has its own prompt and duration. Max 5 segments.

Array items:
promptrequired
string

Prompt for this video segment. Max 512 characters.

durationoptional
string

Duration for this segment in seconds.

Default: "5"
negative_promptoptional
string

Specify elements to avoid. E.g. blur, distort, low quality.

Default: "blur, distort, and low quality"
shot_typeoptional
string

Camera shot type.

Default: "customize"
voice_idsoptional
string[]

Voice IDs for audio. Reference in prompt as <<<voice_1>>>, <<<voice_2>>>. Max 2. Get voice IDs from the Kling Create Voice endpoint: https://www.segmind.com/models/kling-create-voice

Response Type

Returns: Video

Common Error Codes

The API returns standard HTTP status codes. Detailed error messages are provided in the response body.

400

Bad Request

Invalid parameters or request format

401

Unauthorized

Missing or invalid API key

403

Forbidden

Insufficient permissions

404

Not Found

Model or endpoint not found

406

Insufficient Credits

Not enough credits to process request

429

Rate Limited

Too many requests

500

Server Error

Internal server error

502

Bad Gateway

Service temporarily unavailable

504

Timeout

Request timed out