OVI Image To Video

Ovi I2V generates synchronized video and audio from text prompts, creating engaging multimedia content effortlessly.

API

If you're looking for an API, you can choose from your desired programming language.

POST

import requests

api_key = "YOUR_API_KEY"
url = "https://api.segmind.com/v1/ovi-i2v"

# Prepare data and files
data = {}
files = {}

data['seed'] = None
# For parameter "image", you can send a raw file or a URI:
# files['image'] = open('IMAGE_PATH', 'rb')  # To send a file
data['image'] = 'https://segmind-resources.s3.amazonaws.com/input/b4562d69-c97e-4b64-887c-7dea6d8fbe13-209f6f09-7fc6-47c0-8b6e-128ae75db915-wan-720-ip.png'  # To send a URI
data['prompt'] = "A young woman wearing a white blouse and dark blazer sits in a warmly lit podcast studio, speaking animatedly into a professional microphone. She wears large studio headphones and gestures with enthusiasm while smiling. Behind her, beige acoustic panels and soft backlighting create a cozy, modern atmosphere. A laptop and coffee mug rest on the desk in front of her as she records. She speaks confidently into the mic, articulating, <S>AI is not replacing us—it’s amplifying our creativity.<E>. She leans slightly forward, eyes bright with engagement. <AUDCAP>Clear female voice discussing technology and innovation, with soft studio ambience.<ENDAUDCAP>"
data['audio_negative_prompt'] = "robotic, muffled, echo, distorted"
data['video_negative_prompt'] = "jitter, bad hands, blur, distortion"

headers = {'x-api-key': api_key}

# If no files, send as JSON
if files:
    response = requests.post(url, data=data, files=files, headers=headers)
else:
    response = requests.post(url, json=data, headers=headers)
print(response.content)  # The response is the generated image

RESPONSE

video/mp4

HTTP Response Codes

200 - OKImage Generated

401 - UnauthorizedUser authentication failed

404 - Not FoundThe requested URL does not exist

405 - Method Not AllowedThe requested HTTP method is not allowed

406 - Not AcceptableNot enough credits

500 - Server ErrorServer had some issue with processing

Attributes

seedint ( default: 1 )

Random seed. Set for reproducible generation.

imagestr *

Input image to generate video from.

promptstr *

Prompt for generated video.

audio_negative_promptstr ( default: robotic, muffled, echo, distorted )

Negative prompt for audio generation.

video_negative_promptstr ( default: jitter, bad hands, blur, distortion )

Negative prompt for video generation.

To keep track of your credit usage, you can inspect the response headers of each API call. The x-remaining-credits property will indicate the number of remaining credits in your account. Ensure you monitor this value to avoid any disruptions in your API usage.

Resources to get you started

Everything you need to know to get the most out of OVI Image To Video

Ovi I2V: Effective Usage Guide

Ovi I2V is a state-of-the-art text-to-video-and-audio generation model that turns simple prompts into synchronized 5-second clips (24 FPS) with matching sound. Follow this guide to maximize output quality, streamline your workflow, and experiment with parameter settings for different creative goals.

1. Crafting Your Prompt

•Begin with a clear visual scene description: subjects, background, lighting, and motion verbs.
•Append <AUDCAP> tags around audio cues: speaker tone, ambient noises, music style.
•Example:

“A barista expertly pours latte art in a sunlit café. <AUDCAP>Soft jazz piano with distant chatter and coffee machine whir <ENDAUDCAP>.”

2. Key Parameters & Recommended Settings

•prompt (required): Detailed text+audio description.
•image (required): High-resolution reference yields sharper video.
•
seed:
- •1–100 for exploratory, varied outputs.
- •101–1000 for reproducible, similar sequences.
- •Default: 42.
•video_negative_prompt (advanced): “shaking, pixelation, glitch, noise” to avoid artifacts.
•audio_negative_prompt (advanced): “static, interference, harsh, buzzing” for clean sound.

3. Aspect Ratios & Framing

•16:9 for widescreen presentations and YouTube content.
•9:16 for vertical social-media reels (TikTok, Instagram Stories).
•1:1 for square posts (Instagram feed, ads).

4. Use-Case Parameter Recipes

•
Social Media Short
- •seed: 12
- •aspect_ratio: 9:16
- •video_neg: “shaking, blur”
- •audio_neg: “static, echo”
- •Prompt: Emphasize dynamic motion and catchy sound effects.
•
Educational Explainer
- •seed: 42
- •aspect_ratio: 16:9
- •video_neg: “pixelation, glitch”
- •audio_neg: “interference, harsh”
- •Prompt: Add clear narration tone and subtle classroom ambient audio.
•
Product Demo Ad
- •seed: 256
- •aspect_ratio: 1:1
- •video_neg: “noise, glitch”
- •audio_neg: “buzzing, echo”
- •Prompt: Focus on close-ups, upbeat background music, voice-over.

5. Best Practices & Tips

•Vary seeds to explore styles, then lock a seed for consistency.
•Use negative prompts to eliminate unwanted artifacts in both video and audio.
•Keep audio cues aligned with visual actions for tight sync.
•For complex scenes, break prompts into shorter sentences and merge them logically.
•Review outputs, adjust prompt specificity, and iteratively refine tags and negative lists.

With these guidelines and parameter recipes, you can leverage Ovi I2V to produce polished, engaging multimedia clips tailored to any creative or business need. Happy generating!

Other Popular Models

Discover other models you might be interested in.

face-to-many

Turn a face into 3D, emoji, pixel art, video game, claymation or toy

sdxl1.0-txt2img

The SDXL model is the official upgrade to the v1.5 model. The model is released as open-source software

sd1.5-majicmix

The most versatile photorealistic model that blends various models to achieve the amazing realistic images.

sd2.1-faceswapper

Take a picture/gif and replace the face in it with a face of your choice. You only need one image of the desired face. No dataset, no training

OVI Image To Video

API

Attributes

Resources to get you started

Ovi I2V: Image-to-Video-and-Audio Generation Model

What is Ovi I2V?

Key Features

Best Use Cases

Prompt Tips

FAQs

Ovi I2V: Effective Usage Guide

1. Crafting Your Prompt

2. Key Parameters & Recommended Settings

3. Aspect Ratios & Framing

4. Use-Case Parameter Recipes

5. Best Practices & Tips

FAQs

Other Popular Models

face-to-many

sdxl1.0-txt2img

sd1.5-majicmix

sd2.1-faceswapper

Cookie settings

OVI Image To Video

API

Attributes

Resources to get you started

Ovi I2V: Image-to-Video-and-Audio Generation Model

What is Ovi I2V?

Key Features

Best Use Cases

Prompt Tips

FAQs

Ovi I2V: Effective Usage Guide

1. Crafting Your Prompt

2. Key Parameters & Recommended Settings

3. Aspect Ratios & Framing

4. Use-Case Parameter Recipes

5. Best Practices & Tips

FAQs

What information is logged when I use the model playground

What are the different model types available on Segmind?

Are there any rate limits on the api calls?

Other Popular Models

face-to-many

sdxl1.0-txt2img

sd1.5-majicmix

sd2.1-faceswapper

Cookie settings