OVI Image To Video

Ovi I2V generates synchronized video and audio from text prompts, creating engaging multimedia content effortlessly.


API

If you're looking for an API, you can choose from your desired programming language.

POST
1 2 3 4 5 6 7 8 9 10 11 12 13 14 15 16 17 18 19 20 21 22 23 24 25 import requests api_key = "YOUR_API_KEY" url = "https://api.segmind.com/v1/ovi-i2v" # Prepare data and files data = {} files = {} data['seed'] = None # For parameter "image", you can send a raw file or a URI: # files['image'] = open('IMAGE_PATH', 'rb') # To send a file data['image'] = 'https://segmind-resources.s3.amazonaws.com/input/b4562d69-c97e-4b64-887c-7dea6d8fbe13-209f6f09-7fc6-47c0-8b6e-128ae75db915-wan-720-ip.png' # To send a URI data['prompt'] = "A young woman wearing a white blouse and dark blazer sits in a warmly lit podcast studio, speaking animatedly into a professional microphone. She wears large studio headphones and gestures with enthusiasm while smiling. Behind her, beige acoustic panels and soft backlighting create a cozy, modern atmosphere. A laptop and coffee mug rest on the desk in front of her as she records. She speaks confidently into the mic, articulating, <S>AI is not replacing us—it’s amplifying our creativity.<E>. She leans slightly forward, eyes bright with engagement. <AUDCAP>Clear female voice discussing technology and innovation, with soft studio ambience.<ENDAUDCAP>" data['audio_negative_prompt'] = "robotic, muffled, echo, distorted" data['video_negative_prompt'] = "jitter, bad hands, blur, distortion" headers = {'x-api-key': api_key} # If no files, send as JSON if files: response = requests.post(url, data=data, files=files, headers=headers) else: response = requests.post(url, json=data, headers=headers) print(response.content) # The response is the generated image
RESPONSE
video/mp4
HTTP Response Codes
200 - OKImage Generated
401 - UnauthorizedUser authentication failed
404 - Not FoundThe requested URL does not exist
405 - Method Not AllowedThe requested HTTP method is not allowed
406 - Not AcceptableNot enough credits
500 - Server ErrorServer had some issue with processing

Attributes


seedint ( default: 1 )

Random seed. Set for reproducible generation.


imagestr *

Input image to generate video from.


promptstr *

Prompt for generated video.


audio_negative_promptstr ( default: robotic, muffled, echo, distorted )

Negative prompt for audio generation.


video_negative_promptstr ( default: jitter, bad hands, blur, distortion )

Negative prompt for video generation.

To keep track of your credit usage, you can inspect the response headers of each API call. The x-remaining-credits property will indicate the number of remaining credits in your account. Ensure you monitor this value to avoid any disruptions in your API usage.

Resources to get you started

Everything you need to know to get the most out of OVI Image To Video

Ovi I2V: Effective Usage Guide

Ovi I2V is a state-of-the-art text-to-video-and-audio generation model that turns simple prompts into synchronized 5-second clips (24 FPS) with matching sound. Follow this guide to maximize output quality, streamline your workflow, and experiment with parameter settings for different creative goals.

1. Crafting Your Prompt

  • Begin with a clear visual scene description: subjects, background, lighting, and motion verbs.
  • Append <AUDCAP> tags around audio cues: speaker tone, ambient noises, music style.
  • Example:

    “A barista expertly pours latte art in a sunlit café. <AUDCAP>Soft jazz piano with distant chatter and coffee machine whir <ENDAUDCAP>.”

2. Key Parameters & Recommended Settings

  • prompt (required): Detailed text+audio description.
  • image (required): High-resolution reference yields sharper video.
  • seed:
    • 1–100 for exploratory, varied outputs.
    • 101–1000 for reproducible, similar sequences.
    • Default: 42.
  • video_negative_prompt (advanced): “shaking, pixelation, glitch, noise” to avoid artifacts.
  • audio_negative_prompt (advanced): “static, interference, harsh, buzzing” for clean sound.

3. Aspect Ratios & Framing

  • 16:9 for widescreen presentations and YouTube content.
  • 9:16 for vertical social-media reels (TikTok, Instagram Stories).
  • 1:1 for square posts (Instagram feed, ads).

4. Use-Case Parameter Recipes

  1. Social Media Short
    • seed: 12
    • aspect_ratio: 9:16
    • video_neg: “shaking, blur”
    • audio_neg: “static, echo”
    • Prompt: Emphasize dynamic motion and catchy sound effects.
  2. Educational Explainer
    • seed: 42
    • aspect_ratio: 16:9
    • video_neg: “pixelation, glitch”
    • audio_neg: “interference, harsh”
    • Prompt: Add clear narration tone and subtle classroom ambient audio.
  3. Product Demo Ad
    • seed: 256
    • aspect_ratio: 1:1
    • video_neg: “noise, glitch”
    • audio_neg: “buzzing, echo”
    • Prompt: Focus on close-ups, upbeat background music, voice-over.

5. Best Practices & Tips

  • Vary seeds to explore styles, then lock a seed for consistency.
  • Use negative prompts to eliminate unwanted artifacts in both video and audio.
  • Keep audio cues aligned with visual actions for tight sync.
  • For complex scenes, break prompts into shorter sentences and merge them logically.
  • Review outputs, adjust prompt specificity, and iteratively refine tags and negative lists.

With these guidelines and parameter recipes, you can leverage Ovi I2V to produce polished, engaging multimedia clips tailored to any creative or business need. Happy generating!

Other Popular Models

Discover other models you might be interested in.

Cookie settings

We use cookies to enhance your browsing experience, analyze site traffic, and personalize content. By clicking "Accept all", you consent to our use of cookies.