Veo 3 Fast
Veo 3 Fast rapidly creates high-quality, 8-second videos with synchronized audio for diverse content needs.
API
If you're looking for an API, you can choose from your desired programming language.
1
2
3
4
5
6
7
8
9
10
11
12
13
14
15
16
17
18
19
20
21
22
23
24
25
26
27
28
29
30
31
32
import requests
import base64
# Use this function to convert an image file from the filesystem to base64
def image_file_to_base64(image_path):
with open(image_path, 'rb') as f:
image_data = f.read()
return base64.b64encode(image_data).decode('utf-8')
# Use this function to fetch an image from a URL and convert it to base64
def image_url_to_base64(image_url):
response = requests.get(image_url)
image_data = response.content
return base64.b64encode(image_data).decode('utf-8')
# Use this function to convert a list of image URLs to base64
def image_urls_to_base64(image_urls):
return [image_url_to_base64(url) for url in image_urls]
api_key = "YOUR_API_KEY"
url = "https://api.segmind.com/v1/veo-3-fast"
# Request payload
data = {
"prompt": "A serene forest landscape at dawn, mist hovering over the trees, a deer slowly wandering through the frame, softened by early light.",
"generate_audio": False
}
headers = {'x-api-key': api_key}
response = requests.post(url, json=data, headers=headers)
print(response.content) # The response is the generated image
Attributes
Defines the video theme or style. Use clear storytelling for narratives or vivid imagery for artistic shots.
Provide the URL of the image to be relit. Use a URL from cloud storage like S3.
Generate Audio.
To keep track of your credit usage, you can inspect the response headers of each API call. The x-remaining-credits property will indicate the number of remaining credits in your account. Ensure you monitor this value to avoid any disruptions in your API usage.
Resources to get you started
Everything you need to know to get the most out of Veo 3 Fast
Veo 3 Fast – Effective Usage Guide
Veo 3 Fast is designed to generate crisp 8-second, 720p video clips with synchronized audio and realistic lip sync—using five times fewer credits than its predecessor. Follow these best practices and parameter suggestions to get the most out of the model across different scenarios.
1. Core Best Practices
- •Clear & Concise Prompts
Keep your text prompt under 100 words. Describe setting, mood, key subjects, and actions.
Example:“A vintage bicycle leaning against a brick wall at sunset, warm golden light, soft shadows, gentle breeze moving leaves.”
- •Iterative Refinement
Start broad, then add details in subsequent runs. Split complex sequences into 2–3 segments, then stitch them in post. - •Leverage Frames-to-Video
Upload 3–5 reference frames when you need consistent composition or character poses. The model uses them to guide motion flow.
2. Parameter Suggestions by Use Case
A. Social Media Teasers
- •Mode: Text-to-Video
- •Prompt Style: High-energy, hashtag-ready language (e.g., “#FreshBrew coffee pouring in slow motion, steam rising, morning light through a café window.”)
- •Audio: Built-in upbeat track or supply a 5–8 s custom jingle
- •Iterations: 1–2 drafts, choose the most dynamic framing
B. Marketing & Ads
- •Mode: Text-to-Video (or Frames-to-Video with product shots)
- •Prompt Style: Brand-centric, call-to-action phrases (e.g., “Smartphone gliding into frame, screen lighting up with notifications, sleek metallic finish, dynamic camera pan.”)
- •Audio: Custom VO or licensed track for lip-sync
- •Details: Emphasize product colors/materials; upload 3 reference images of your product if using Frames-to-Video
C. Educational Snippets
- •Mode: Text-to-Video
- •Prompt Style: Instructional and step-by-step (e.g., “A scientist in a white lab coat mixing colorful liquids in a beaker, clear labels, calm voice-over explaining each step.”)
- •Audio: Narration track (upload a script)
- •Tip: Use simpler backgrounds to keep focus on the action
D. Prototype Animations
- •Mode: Frames-to-Video
- •Prompt Style: Storyboard-style directives (“Frame 1: astronaut floating in zero gravity. Frame 2: helmet visor reflecting Earth.”)
- •Reference Frames: 4–5 hand-drawn or rough sketches
- •Outcome: Rapidly iterate pacing and composition before full production
3. Advanced Tips
- •Ambient Details: Include weather, time of day, or emotions (“rain-swept alley at midnight, neon reflections, suspenseful mood”).
- •Audio Customization: Upload any MP3 or select from built-in soundscapes—lip sync adapts automatically.
- •Quality Trade-Offs: For ultra-complex textures, consider using full-scale Veo 3; Veo 3 Fast optimizes for speed and credit efficiency.
Default Prompt Parameter
\{
"prompt": "A serene forest landscape at dawn, mist hovering over the trees, a deer slowly wandering through the frame, softened by early light."
\}
Other Popular Models
Discover other models you might be interested in.
faceswap-v2
Take a picture/gif and replace the face in it with a face of your choice. You only need one image of the desired face. No dataset, no training

insta-depth
InstantID aims to generate customized images with various poses or styles from only a single reference ID image while ensuring high fidelity

codeformer
CodeFormer is a robust face restoration algorithm for old photos or AI-generated faces.

sd2.1-faceswapper
Take a picture/gif and replace the face in it with a face of your choice. You only need one image of the desired face. No dataset, no training
