98
models
First and Last Frame Video
AI video generation models that give you precise control over the start and end frames of your video — ensuring your output begins and ends exactly where you need it to. First-and-last-frame video models are a major advancement over basic image-to-video, letting you specify both the opening and closing keyframes so the model interpolates a smooth, coherent video between them. This is critical for seamless video loops, cinematic transitions, scene-to-scene cuts, and any workflow where temporal consistency matters. Use these models to animate between two product shots, create looping social media videos that connect start and end flawlessly, generate scene transitions for longer-form video editing, or produce controlled motion sequences for use in post-production. Leading image-to-video models including Kling, Wan 2.6, and LTX support this capability. On Segmind, all first-and-last-frame models are available via API — just pass your first frame image, last frame image, and text prompt, and receive a smooth video in return. Chain with image generation models in Segmind Workflows to automate scene creation and keyframe-controlled video production at scale.
Seedance 2.0 Fast
Professional-grade video creation model with native audio, similar to SeeDance 2.0 but faster and cheaper.
Seedance 2.0
Cinematic AI videos with native audio and multi-shot narratives.
Wan 2.7 Reference to Video
Character-consistent multi-subject videos from reference images.
Wan 2.7 Image to Video
Animate any image into cinematic 1080P video with audio.
Pixverse V6
15-second AI videos with native audio and cinematic controls.
Veo 3.1 Lite
Affordable text-to-video with audio, powered by Google.
Kling O3 Image To Video
Images to cinematic videos with precise motion control.
HyperSwap: Video Faceswap by FaceFusion Labs
Realistic face swapping in videos from a single image.
Wan 2.2 Image to Video Flash
Convert a single image into a coherent dynamic video.
Kling 3.0 Pro Image-to-Video
Animated 1080p videos from images with dynamic motion.
Kling 3.0 Standard Image-to-Video
Controlled cinematic 1080p videos from starting images.
Kling O1 Reference Image 2 Video
Identity-preserving videos from static images with character reference.
Kling O1 Image 2 Video
Physics-driven animations from images for creative storytelling.
Kling V2 Pro Avatar
Talking avatar videos from image and audio, high quality.
Kling Avatar V2 Standard
Lifelike video avatars with precise lip synchronization.
Kling 2.6
Still images into immersive cinematic videos with synchronized audio.
Heygen Avatar IV
Single photo into a lifelike talking avatar video.
Seedance 1.5 Pro
Synchronized video and audio generation for dynamic storytelling.
Wan Scail
Professional character animations from reference images.
Wan 2.6 Image To Video
Transform images into high-quality videos with audio sync.
Wan 2.6 Text To Video
Cinematic videos with synchronized audio from text prompts.
LTX 2 Fast
Fast, high-quality text-to-video generation by Lightricks.
LTX 2 Pro
High-quality video generation with advanced motion control.
Hailuo 2.3 Fast
Professional-quality videos from text and images at speed.
Hailuo 2.3
Hyper-realistic videos from text with fluid character motion.
Seedance 1.0 Pro Fast
Cinematic videos from text and images at ultra speed.
Veo 3.1 Fast
Fast image-to-video at 1080p with native audio.
Veo 3.1
Static images into high-quality videos with synchronized audio.
InfiniteTalk
Full-body animation from images synchronized perfectly to audio.
Pixverse 5 Extend
Seamlessly extend and continue AI-generated videos.
Pixverse 5 Transition
Seamless AI-generated video transitions between scenes.
Pixverse 5 Video
Cinematic videos from text and images with photorealism.
OVI Image To Video
Synchronized video and audio generation from text and images.
Kling V1 Pro AI Avatar
Dynamic AI avatars with synchronized speech from image.
Kling V1 Standard AI Avatar
Lifelike AI avatars with precise lip-sync for presentations.
Sora 2 Pro
Cinematic-quality videos from text with temporal consistency.
Video Watermark Remover
Remove watermarks from any video instantly with AI.
Wan Animate
Animate characters and replace video subjects seamlessly.
Sora 2
Stunning dynamic videos from detailed text descriptions.
Video Frame Interpolation
FILM synthesizes smooth, high-quality intermediate frames for fluid motion in videos with significant movement.
Wan 2.5 Image to Video
Wan2.5-Preview creates stunning, high-resolution videos with flawless audio synchronization from multiple inputs.
Kling 2.5 Turbo
Kling AI 2.5 Turbo generates fluid, cinematic videos from text and images, enhancing content creation and storytelling.
Bytedance HuMo: Human-Centric Video Generation
HuMo generates high-quality, human-centric videos from text, images, and audio with unparalleled control and precision.
Higgsfield Speech 2 Video
Transform images and audio into dynamic, lip-synced videos for engaging digital content.
Video Tryon
Video Tryon is Segmind’s next-generation AI video model for instant virtual try-on, allowing users to visualize any outfit on any person in high-quality, fully-preserved motion up to 50 seconds.
Higgsfield Image 2 Video
Transform static images into dynamic, motion-rich videos with unparalleled control and creative depth.
Wan 2.2 Image to Video Fast
Transforms simple text prompts into breathtaking cinematic-quality videos in minutes.
Hailuo 02 Fast
Transform any static image into a captivating, high-quality video clip effortlessly.
Vidu Q1 Reference to Video
Vidu AI reference to video transforms text and images into dynamic, high-quality videos effortlessly.
Minimax Hailou 2
Generate breathtaking 1080P cinematic videos from text or images with ultra-realistic motion and physics.