Higgsfield Speech 2 Video
Transform images and audio into dynamic, lip-synced videos for engaging digital content.
1import requests
2import json
3
4url = "https://api.segmind.com/v1/higgsfield-speech2video"
5headers = {
6 "x-api-key": "YOUR_API_KEY",
7 "Content-Type": "application/json"
8}
9
10data = {
11 "input_image": "https://segmind-resources.s3.amazonaws.com/input/03cea2dd-87e9-41d7-9932-fbe45d4b2dd5-434b7481-1ddb-43da-a2df-10928effc900.png",
12 "input_audio": "https://segmind-resources.s3.amazonaws.com/input/a846542c-c555-43ae-bdb0-8795ef78e0bb-8fe7c335-9e7f-4729-8230-b3eabc2af49c.wav",
13 "prompt": "Generate an educational video with clear articulation, gentle hand gestures, and warm facial expressions appropriate for teaching content. All transitions needs to be super realistic and smooth.",
14 "quality": "high",
15 "enhance_prompt": false,
16 "seed": 42,
17 "duration": 10
18}
19
20response = requests.post(url, headers=headers, json=data)
21
22if response.status_code == 200:
23 result = response.json()
24 print(json.dumps(result, indent=2))
25else:
26 print(f"Error: {response.status_code}")
27 print(response.text) 1import requests
2import json
3
4url = "https://api.segmind.com/v1/higgsfield-speech2video"
5headers = {
6 "x-api-key": "YOUR_API_KEY",
7 "Content-Type": "application/json"
8}
9
10data = {
11 "input_image": "https://segmind-resources.s3.amazonaws.com/input/03cea2dd-87e9-41d7-9932-fbe45d4b2dd5-434b7481-1ddb-43da-a2df-10928effc900.png",
12 "input_audio": "https://segmind-resources.s3.amazonaws.com/input/a846542c-c555-43ae-bdb0-8795ef78e0bb-8fe7c335-9e7f-4729-8230-b3eabc2af49c.wav",
13 "prompt": "Generate an educational video with clear articulation, gentle hand gestures, and warm facial expressions appropriate for teaching content. All transitions needs to be super realistic and smooth.",
14 "quality": "high",
15 "enhance_prompt": false,
16 "seed": 42,
17 "duration": 10
18}
19
20response = requests.post(url, headers=headers, json=data)
21
22if response.status_code == 200:
23 result = response.json()
24 print(json.dumps(result, indent=2))
25else:
26 print(f"Error: {response.status_code}")
27 print(response.text)API Endpoint
https://api.segmind.com/v1/higgsfield-speech2videoParameters
input_audiorequiredstring (uri)URL for the audio guiding avatar speech. Use articulate speech for clear lip-sync results.
input_imagerequiredstring (uri)Provide a URL of the image to drive animation. Use a clear, high-quality image for best results.
"https://segmind-resources.s3.amazonaws.com/input/03cea2dd-87e9-41d7-9932-fbe45d4b2dd5-434b7481-1ddb-43da-a2df-10928effc900.png"promptrequiredstringDescribe the video output scenario. Create an engaging, emotional prompt for vibrant expressions.
"Generate a captivating avatar video with fluent dialogue and lively facial gestures."durationoptionalintegerDecide video length in seconds. Choose longer durations for in-depth content.
1051015enhance_promptoptionalbooleanAutomatically refine your prompt. Enable to achieve a balanced expression across the video.
falsequalityoptionalstringChoose video quality preference. 'High' is best for detailed videos, while 'mid' helps with speed.
"high""high""mid"seedoptionalintegerSet a seed number for consistent outputs. Use different seeds for variation, 42 is common.
42Range: 1 - 1000000Response Type
Returns: Video
Common Error Codes
The API returns standard HTTP status codes. Detailed error messages are provided in the response body.
Bad Request
Invalid parameters or request format
Unauthorized
Missing or invalid API key
Forbidden
Insufficient permissions
Not Found
Model or endpoint not found
Insufficient Credits
Not enough credits to process request
Rate Limited
Too many requests
Server Error
Internal server error
Bad Gateway
Service temporarily unavailable
Timeout
Request timed out