200 models
Qwen Image 2 Models
Qwen Image 2 is Alibaba's second-generation multimodal suite for image generation and editing, covering text-to-image models and instruction-following image editing models. The collection includes Qwen Image, Qwen Image 2512, Qwen Image Fast, Qwen Image Edit, and the Qwen Image Edit Plus series, which adds specialized tools for erasing, relighting, multi-LoRA styling, product photography, face-to-portrait, group photo generation, texture manipulation, next-scene generation, adding people, and image blending.

Qwen Image 2 models are designed to interpret complex natural-language editing instructions and apply them with high fidelity, preserving image structure while making targeted modifications. The Edit Plus variants are aimed at creative and commercial workflows: apply multiple LoRA style adapters at once, make precise object edits, generate product shots in new environments, and relight scenes, all from text prompts.

On Segmind, every Qwen Image 2 model is available as a pay-per-use API endpoint. Editing models can be chained in Segmind Workflows to build automated image production pipelines, for example generate, edit, relight, and upscale in one sequence.
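The generate, edit, relight sequence described above can also be sketched in client code. This is a minimal illustration, not Segmind's documented interface: the base URL and `x-api-key` header reflect how Segmind endpoints are commonly called but should be confirmed against the official API reference, and the model slugs and the `prompt` / `image` payload fields are hypothetical placeholders.

```python
import json
import urllib.request
from typing import Callable, Optional

# Assumed base URL; confirm against the Segmind API reference.
API_BASE = "https://api.segmind.com/v1"


def build_request(slug: str, payload: dict, api_key: str) -> urllib.request.Request:
    """Build (but do not send) a JSON POST request for one model endpoint."""
    return urllib.request.Request(
        url=f"{API_BASE}/{slug}",
        data=json.dumps(payload).encode("utf-8"),
        headers={"x-api-key": api_key, "Content-Type": "application/json"},
        method="POST",
    )


def send(req: urllib.request.Request) -> bytes:
    """Send a prepared request and return the raw response body."""
    with urllib.request.urlopen(req) as resp:
        return resp.read()


def run_chain(
    steps: list,
    call_model: Callable[[str, dict], str],
    image: Optional[str] = None,
) -> Optional[str]:
    """Run (slug, prompt) steps in order, feeding each output into the next step.

    `call_model` takes a slug and a payload and returns an image reference
    (for example a URL or base64 string). The payload keys are hypothetical.
    """
    for slug, prompt in steps:
        payload = {"prompt": prompt}
        if image is not None:
            payload["image"] = image  # output of the previous step
        image = call_model(slug, payload)
    return image
```

Passing `call_model` in as a parameter keeps the chaining logic independent of the transport, so the same loop can be exercised with a stub before any paid endpoint is called. In real use, `call_model` would wrap `build_request` and `send` and decode the response in whatever format the chosen model returns.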
Wan 2.7 Reference to Video
Character-consistent multi-subject videos from reference images.
Wan 2.7 Image to Video
Animate any image into cinematic 1080p video with audio.
Wan 2.7 Image Generation Pro
4K images with chain-of-thought reasoning and multilingual text.
Wan 2.7 Image Generation
2K image generation with precise multilingual text rendering.
Qwen Flash
Fastest low-cost LLM with 1M context for high-volume tasks.
Qwen Plus
Mid-tier 1M context LLM for summarization and content tasks.
Qwen 3 VL Flash
Fast, affordable vision-language model with 262K context and OCR.
Qwen 3 VL Plus
Powerful visual QA and document analysis from images.
Qwen 3 Coder Flash
High-volume code generation with 1M token context window.
Qwen 3 Coder Plus
Generates, debugs, and refactors entire codebases efficiently.
Qwen 3 Max
1T-parameter LLM with hybrid reasoning and 262K context.
Qwen 3.5 Plus
Multimodal 1M context AI for image, video, and text.
Qwen 3.5 Flash
Fast multimodal AI processing text, images, and video affordably.
HyperSwap Image Faceswap by FaceFusion Labs
High-quality face swapping built for real production workflows.
Kling O3 Image To Video
Images to cinematic videos with precise motion control.
Kling V3 Image 2 Image
Transform images into photorealistic, production-ready visuals.
Kling V3 Text to Image
Photorealistic, print-ready images from text prompts.
Kling O3 Video To Video Reference
Swap characters and restyle videos using reference images.
HyperSwap: Video Faceswap by FaceFusion Labs
Realistic face swapping in videos from a single image.
Wan 2.2 Image to Video Flash
Convert a single image into a coherent dynamic video.
Seedream 5.0 Lite: Image-to-Image
Transform images intelligently with detailed text prompts.
Seedream 5.0 Lite: Text-to-Image
Fast, affordable instruction-following image generation.
Nano Banana 2
Fast photorealistic images — ideal for marketing and ads.
Kling 3.0 Pro Image-to-Video
Animated 1080p videos from images with dynamic motion.
Kling 3.0 Standard Image-to-Video
Controlled cinematic 1080p videos from starting images.
Segmind Faceswap v5
Ultra-fast face and head swapping in images.
Flux-2 Klein-4b
Sub-second photorealistic image generation and editing.
Flux-2 Klein-9b
Ultra-fast photorealistic image generation on consumer GPUs.
LTX-2-19B I2V
Synchronized 4K audio-video generation from images, fast.
Kling O1 Reference Image 2 Video
Identity-preserving videos from static images with character reference.
Kling O1 Video 2 Video Reference
Video style transfer using reference character images.
Kling O1 Image 2 Video
Physics-driven animations from images for creative storytelling.
Qwen Image 2512
Photorealistic image generation that closely follows text descriptions.
Kling V2 Pro Avatar
Talking avatar videos from image and audio, high quality.
Kling 2.6
Still images into immersive cinematic videos with synchronized audio.
Flux 2 Max
Photorealistic images with maximum consistency and fine detail.
GPT Image 1.5 Edit
Precise image editing via natural language instructions.
GPT Image 1.5
Stunning photorealistic images with exceptional instruction-following.
Wan Scail
Professional character animations from reference images.
Wan 2.6 Image To Video
Transform images into high-quality videos with audio sync.
Sam 3D Object
Single 2D image into detailed 3D object models.
Seedream 4.5
Photorealistic image generation with precise text understanding.
Z Image Turbo
Photorealistic images in under one second, bilingual text.
Flux 2 Flex
Consistent-style photorealistic images using reference inputs.
Flux 2 Pro
High-quality photorealistic images with cross-output consistency.
Sam3 Image
Precise object segmentation and tracking in images.
Nano Banana Pro
High-fidelity images with accurate multilingual text rendering.
Qwen Image Edit Plus Blend It
Product placement into backgrounds with precise lighting match.
Qwen Image Edit Plus Eigen Banana
Precise text-guided image transformation and creative editing.
Qwen Image Edit Plus Eraser
Remove unwanted objects while preserving realistic backgrounds.