55
models
Alibaba Models
Alibaba delivers a comprehensive multimodal AI suite — Qwen for image generation and editing, Wan for state-of-the-art video synthesis, and vision-language models for intelligent visual understanding. The Qwen Image Edit Plus series offers surgical image editing with multi-LoRA, relighting, product photography, and more. Wan 2.6 produces high-resolution video with smooth motion and keyframe control. Access every Alibaba model as a pay-per-use API on Segmind, or chain them in Segmind Workflows for automated content pipelines.
Wan 2.7 Video Editing
Edit existing videos precisely using natural language text instructions.
Wan 2.7 Reference to Video
Character-consistent multi-subject videos from reference images.
Wan 2.7 Image to Video
Animate any image into cinematic 1080P video with audio.
Wan 2.7 Text to Video
1080P cinematic videos with audio sync and multi-shot control.
Wan 2.7 Image Generation Pro
4K images with chain-of-thought reasoning and multilingual text.
Wan 2.7 Image Generation
2K image generation with precise multilingual text rendering.
Qwen Flash
Fastest low-cost LLM with 1M context for high-volume tasks.
Qwen Plus
Mid-tier 1M context LLM for summarization and content tasks.
QVQ Max
Chain-of-thought visual reasoning for math, charts, and diagrams.
Qwen 3 VL Flash
Fast, affordable vision-language model with 262K context OCR.
Qwen 3 VL Plus
Powerful visual QA and document analysis from images.
Qwen 3 Coder Flash
High-volume code generation with 1M token context window.
Qwen 3 Coder Plus
Generates, debugs, and refactors entire codebases efficiently.
QwQ Plus
Deep chain-of-thought reasoning for math, code, and logic.
Qwen 3 Max
1T-parameter LLM with hybrid reasoning and 262K context.
Qwen 3.5 Plus
Multimodal 1M context AI for image, video, and text.
Qwen 3.5 Flash
Fast multimodal AI processing text, images, and video affordably.
Wan 2.2 Image to Video Flash
Convert a single image into a coherent dynamic video.
Qwen Image 2512
Photorealistic image generation with precise text description following.
Wan 2.6 Image To Video
Transform images into high-quality videos with audio sync.
Wan 2.6 Text To Video
Cinematic videos with synchronized audio from text prompts.
Qwen Image Edit Plus Blend It
Product placement into backgrounds with precise lighting match.
Qwen Image Edit Plus Eigen Banana
Precise text-guided image transformation and creative editing.
Qwen Image Edit Plus Eraser
Remove unwanted objects while preserving realistic backgrounds.
Qwen Image Edit Plus Face To Portrait
Cropped face into full identity-preserving portrait photo.
Qwen Image Edit Plus Group Photo
Merge individual portraits into realistic group photos.
Qwen Image Edit Plus Multiple Angles
Transform image perspective with natural language prompts.
Qwen Image Edit Plus Next Scene
Create cinematic sequences with seamless visual continuity.
Qwen Image Edit Plus Product Photography
Transform white-background products into immersive lifestyle scenes.
Qwen Image Edit Plus Relight
Advanced image relighting using natural language prompts.
Qwen Image Edit Plus Remove Lighting
Remove artificial lighting effects and restore natural tones.
Qwen Image Edit Plus Texture Apply
Apply precise textures to images using natural language.
Qwen Image Edit Plus Texture Extract
Extract seamless, tileable textures from photographs.
Qwen Image Edit Plus Add People Lora
Generate realistic multi-character scenes with natural interactions.
Qwen Image Edit Plus Multi Lora
Multi-image editing with superior identity and style control.
Qwen Image Edit Plus
Multi-image editing with precise text-guided transformations.
Wan 2.5 Image to Video
Wan2.5-Preview creates stunning, high-resolution videos with flawless audio synchronization from multiple inputs.
Wan 2.5 Text to Video
Wan2.5-Preview generates synchronized multimedia content, merging text, image, video, and audio seamlessly.
Qwen Image Edit Fast
Qwen-Image-Edit enables precise bilingual image editing for seamless localization and professional content creation.
Qwen Image Fast
Qwen-Image expertly generates stunning images with complex text integration, especially for Chinese typography.
Qwen Image Edit
Transform images effortlessly through semantic context and pixel-perfect appearance changes.
Qwen Image
Qwen-Image revolutionizes image generation and editing with seamless multilingual text integration and photorealistic detail.
Qwen2.5-VL 32B Instruct
Qwen2.5-VL processes text and images seamlessly for advanced multimodal instruction and reasoning.
Hunyuan3d-2.1
Transform 2D images into photorealistic, high-fidelity 3D assets effortlessly.
Wan 2.2 Text to Video Fast
Wan2.2 transforms text and images into high-quality video clips with cinematic flair.
Wan 2.2 Image to Video Fast
Transforms simple text prompts into breathtaking cinematic-quality videos in minutes.
Hunyuan-3d 2mv
Hunyuan3D-2mv is finetuned from Hunyuan3D-2 to support multiview controlled shape generation.
Wan_2.1 Text to Video
Create visually impressive and feature varied, lifelike motion videos with Wan2.1 using text prompts.
Wan 2.1 480p image to video
Create high-quality 480p videos with excellent visual quality and a broad spectrum of motion from static images.
Wan 2.1 720p image to video
Create high-quality 720p videos with excellent visual quality and a broad spectrum of motion from static images.