156
models
Wan 2.1 Video Models
Wan 2.1 is Alibaba's landmark open-source video generation model series that set a new standard for accessible, production-ready AI video when it launched. Available in 480p and 720p resolution variants for image-to-video, plus a powerful text-to-video model, Wan 2.1 delivers smooth motion synthesis, accurate subject tracking, and natural scene dynamics across a wide range of content types. The models are built on large-scale training and an open architecture that became the foundation for subsequent Wan releases and community fine-tunes. Wan 2.1 is complemented by Wan Animate for targeted object and scene animation, Wan Scail for video upscaling workflows, and Wan Video Effects for creative visual transformations. The 480p variant is optimized for speed and throughput, while the 720p variant delivers higher visual fidelity for production use. Wan 2.1's open-weight design makes it particularly valuable for developers building custom video generation applications, researchers experimenting with video AI architectures, and production teams who need a reliable, well-tested video generation backbone. On Segmind, all Wan 2.1 models are available as pay-per-use APIs — no GPU management, no infrastructure overhead. Integrate professional video generation into your application with a single API call.
Seedance 2.0 Fast
Professional-grade video creation model with native audio, similar to SeeDance 2.0 but faster and cheaper.
Seedance 2.0
Cinematic AI videos with native audio and multi-shot narratives.
Wan 2.7 Video Editing
Edit existing videos precisely using natural language text instructions.
Wan 2.7 Reference to Video
Character-consistent multi-subject videos from reference images.
Wan 2.7 Image to Video
Animate any image into cinematic 1080P video with audio.
Wan 2.7 Text to Video
1080P cinematic videos with audio sync and multi-shot control.
Wan 2.7 Image Generation Pro
4K images with chain-of-thought reasoning and multilingual text.
Wan 2.7 Image Generation
2K image generation with precise multilingual text rendering.
Pixverse V6
15-second AI videos with native audio and cinematic controls.
Veo 3.1 Lite
Affordable text-to-video with audio, powered by Google.
Qwen Flash
Fastest low-cost LLM with 1M context for high-volume tasks.
Qwen Plus
Mid-tier 1M context LLM for summarization and content tasks.
Qwen 3 Coder Flash
High-volume code generation with 1M token context window.
Qwen 3 Max
1T-parameter LLM with hybrid reasoning and 262K context.
Qwen 3.5 Plus
Multimodal 1M context AI for image, video, and text.
Kling V3 Image 2 Image
Transform images into photorealistic, production-ready visuals.
Kling O3 Text-to-Video
15-second cinematic AI videos with native audio.
Wan 2.2 Image to Video Flash
Convert a single image into a coherent dynamic video.
Nano Banana 2
Fast photorealistic images — ideal for marketing and ads.
Kling 3.0 Pro Image-to-Video
Animated 1080p videos from images with dynamic motion.
Kling 3.0 Standard Image-to-Video
Controlled cinematic 1080p videos from starting images.
Kling 3.0 Pro Text-to-Video
Cinematic 1080p videos with realistic audio from text.
Kling 3.0 Standard Text-to-Video
Stunning 1080p cinematic videos from simple text prompts.
Flux-2 Klein-4b
Sub-second photorealistic image generation and editing.
Flux-2 Klein-9b
Ultra-fast photorealistic image generation on consumer GPUs.
LTX-2-19B I2V
Synchronized 4K audio-video generation from images, fast.
LTX-2-19B T2V
Synchronized video and audio from text, multiple input types.
Kling O1 Reference Image 2 Video
Identity-preserving videos from static images with character reference.
Kling O1 Video 2 Video Reference
Video style transfer using reference character images.
Kling O1 Image 2 Video
Physics-driven animations from images for creative storytelling.
Kling O1 Video 2 Video Edit
Edit any video with precise natural language commands.
Kling 2.6 Pro Motion Control
Transfer motion from videos to animate custom characters.
Kling 2.6 Standard Motion Control
Precise motion transfer from reference videos to characters.
Kling 2.6
Still images into immersive cinematic videos with synchronized audio.
GPT 5.1
Precise code review and developer workflow assistant.
GPT 5.2
Advanced reasoning with multimodal input for precise tasks.
Gemini TTS 2.5 Flash
Fast, lifelike text-to-speech with expressive emotional tones.
Gemini TTS 2.5 Pro
Human-like speech synthesis with rich expressive emotional depth.
Seedance 1.5 Pro
Synchronized video and audio generation for dynamic storytelling.
Flux 2 Max
Photorealistic images with maximum consistency and fine detail.
GPT Image 1.5 Edit
Precise image editing via natural language instructions.
GPT Image 1.5
Stunning photorealistic images with exceptional instruction-following.
Wan Scail
Professional character animations from reference images.
Wan 2.6 Image To Video
Transform images into high-quality videos with audio sync.
Wan 2.6 Text To Video
Cinematic videos with synchronized audio from text prompts.
Sync.so React 1
Edit video actors' emotions with realistic re-expression.
Flux 2 Flex
Consistent-style photorealistic images using reference inputs.
Flux 2 Pro
High-quality photorealistic images with cross-output consistency.
LTX 2 Fast
Fast, high-quality text-to-video generation by Lightricks.
LTX 2 Pro
High-quality video generation with advanced motion control.