38
models
Alibaba Models
Alibaba delivers a comprehensive multimodal AI suite — Qwen for image generation and editing, Wan for state-of-the-art video synthesis, and vision-language models for intelligent visual understanding. The Qwen Image Edit Plus series offers surgical image editing with multi-LoRA, relighting, product photography, and more. Wan 2.6 produces high-resolution video with smooth motion and keyframe control. Access every Alibaba model as a pay-per-use API on Segmind, or chain them in Segmind Workflows for automated content pipelines.
Wan 2.2 Image to Video Flash
Transform a single image and text prompt into a coherent, dynamic video.
Qwen Image 2512
Qwen-Image-2512 generates highly realistic images from text descriptions, excelling in human depiction and environmental detail.
Wan 2.6 Image To Video
Wan 2.6 transforms text and images into high-quality videos with precise audio sync, perfect for engaging content creation.
Wan 2.6 Text To Video
Transforms text and audio into high-quality cinematic videos with seamless storytelling and synchronization.
Qwen Image Edit Plus Blend It
Seamlessly integrates products into backgrounds with precise lighting and perspective adjustments for realistic composited images.
Qwen Image Edit Plus Eigen Banana
Eigen-Banana-Qwen-Image-Edit enables precise, text-guided transformations of images for diverse applications.
Qwen Image Edit Plus Eraser
Intelligently removes unwanted objects from images while preserving realistic backgrounds and scene integrity.
Qwen Image Edit Plus Face To Portrait
Transforms cropped facial images into stunning, identity-preserving portrait photographs.
Qwen Image Edit Plus Group Photo
Generates realistic group photos by merging multiple individual portraits while ensuring facial consistency and nostalgic aesthetics.
Qwen Image Edit Plus Multiple Angles
Transform any image perspective dynamically using natural language for professional-grade results.
Qwen Image Edit Plus Next Scene
Creates cinematic sequences with seamless visual flow, enhancing storytelling in digital media.
Qwen Image Edit Plus Product Photography
Transforms white-background images into immersive, realistic scenes for professional-quality visual storytelling.
Qwen Image Edit Plus Relight
Transform any image with advanced lighting manipulation using natural language prompts, enhancing realism and atmosphere.
Qwen Image Edit Plus Remove Lighting
Automatically restores natural lighting and removes artificial effects for stunning, professional-quality images.
Qwen Image Edit Plus Texture Apply
Seamlessly applies precise textures to images based on natural language prompts for enhanced visual quality.
Qwen Image Edit Plus Texture Extract
Effortlessly extracts and generates seamless, tileable textures from photographs for digital creatives.
Qwen Image Edit Plus Add People Lora
Effortlessly generates realistic multi-character scenes with natural interactions for diverse creative applications.
Qwen Image Edit Plus Multi Lora
Qwen Image Edit Plus Multi lora enables seamless multi-image editing with superior detail preservation for professional-grade visuals.
Qwen Image Edit Plus
Qwen Image Edit Plus revolutionizes multi-image editing with precise transformations and facial consistency.
Wan 2.5 Image to Video
Wan2.5-Preview creates stunning, high-resolution videos with flawless audio synchronization from multiple inputs.
Wan 2.5 Text to Video
Wan2.5-Preview generates synchronized multimedia content, merging text, image, video, and audio seamlessly.
Qwen Image Edit Fast
Qwen-Image-Edit enables precise bilingual image editing for seamless localization and professional content creation.
Qwen Image Fast
Qwen-Image expertly generates stunning images with complex text integration, especially for Chinese typography.
Qwen Image Edit
Transform images effortlessly through semantic context and pixel-perfect appearance changes.
Qwen Image
Qwen-Image revolutionizes image generation and editing with seamless multilingual text integration and photorealistic detail.
Qwen2.5-VL 32B Instruct
Qwen2.5-VL processes text and images seamlessly for advanced multimodal instruction and reasoning.
Hunyuan3d-2.1
Transform 2D images into photorealistic, high-fidelity 3D assets effortlessly.
Wan 2.2 Text to Video Fast
Wan2.2 transforms text and images into high-quality video clips with cinematic flair.
Wan 2.2 Image to Video Fast
Transforms simple text prompts into breathtaking cinematic-quality videos in minutes.
Hunyuan-3d 2mv
Hunyuan3D-2mv is finetuned from Hunyuan3D-2 to support multiview controlled shape generation.
Wan_2.1 Text to Video
Create visually impressive and feature varied, lifelike motion videos with Wan2.1 using text prompts.
Wan 2.1 480p image to video
Create high-quality 480p videos with excellent visual quality and a broad spectrum of motion from static images.
Wan 2.1 720p image to video
Create high-quality 720p videos with excellent visual quality and a broad spectrum of motion from static images.
Qwen2 VL 72B Instruct
Qwen2-VL-72B-Instruct is a state-of-the-art multimodal model excelling in image and video understanding, with advanced capabilities for text-based interaction.
Hunyuan3D-2
Hunyuan3D 2.0 enables the creation of high-quality 3D models with intricate details. Produce assets that are visually appealing and suitable for professional use.
QWEN2-VL-7B-Instruct
The Qwen2-VL-7B-Instruct is a cutting-edge vision-language model with 7 billion parameters, offering advanced capabilities like object recognition, image analysis and visual localization. It can also generate structured outputs and is optimized for both performance and flexibility. It can recognize objects, analyze image content, act as a visual agent, and generate structured data.
Hunyuan Video
Hunyuan AI Video is a new, state of the art, AI Video Generator that creates high-quality videos from text descriptions. With 13B parameters and state-of-the-art performance, it's the most powerful open-source video generation model available.
Easy Animate
Easy Animate is a state-of-the-art image to animation model to convert static images into dynamic animations with remarkable accuracy and fluidity.