32
models
Qwen AI Models
Qwen is Alibaba's comprehensive multimodal AI model family — spanning image generation, instruction-following image editing, and vision-language understanding in one integrated ecosystem. The collection includes Qwen Image models for high-quality text-to-image generation, the full Qwen Image Edit and Edit Plus suite for advanced instruction-based image editing (with specialized tools for adding people, generating group photos, product photography, texture manipulation, multi-LoRA styling, relighting, and more), and Qwen2-VL and Qwen2.5-VL for state-of-the-art vision-language reasoning — enabling image analysis, document understanding, visual question answering, chart interpretation, and multimodal content extraction. Qwen models are renowned for their exceptional multilingual capabilities, particularly Chinese, Japanese, Korean, and other Asian languages — making them the go-to choice for teams building applications for Asian markets. Qwen2-VL 72B delivers frontier-level multimodal reasoning comparable to leading closed models, while the 7B variant offers an efficient option for high-throughput applications. Qwen2.5-VL 32B combines strong visual understanding with the latest Qwen architecture improvements for even better performance across complex multimodal tasks. On Segmind, all Qwen models are available as pay-per-use APIs — no Alibaba Cloud account needed. Integrate text-to-image generation, image editing, or visual understanding into your application with a single endpoint call.
Qwen Flash
Fastest low-cost LLM with 1M context for high-volume tasks.
Qwen Plus
Mid-tier 1M context LLM for summarization and content tasks.
Qwen 3 VL Flash
Fast, affordable vision-language model with 262K context OCR.
Qwen 3 VL Plus
Powerful visual QA and document analysis from images.
Qwen 3 Coder Flash
High-volume code generation with 1M token context window.
Qwen 3 Coder Plus
Generates, debugs, and refactors entire codebases efficiently.
Qwen 3 Max
1T-parameter LLM with hybrid reasoning and 262K context.
Qwen 3.5 Plus
Multimodal 1M context AI for image, video, and text.
Qwen 3.5 Flash
Fast multimodal AI processing text, images, and video affordably.
Qwen Image 2512
Photorealistic image generation with precise text description following.
Qwen Image Edit Plus Blend It
Product placement into backgrounds with precise lighting match.
Qwen Image Edit Plus Eigen Banana
Precise text-guided image transformation and creative editing.
Qwen Image Edit Plus Eraser
Remove unwanted objects while preserving realistic backgrounds.
Qwen Image Edit Plus Face To Portrait
Cropped face into full identity-preserving portrait photo.
Qwen Image Edit Plus Group Photo
Merge individual portraits into realistic group photos.
Qwen Image Edit Plus Multiple Angles
Transform image perspective with natural language prompts.
Qwen Image Edit Plus Next Scene
Create cinematic sequences with seamless visual continuity.
Qwen Image Edit Plus Product Photography
Transform white-background products into immersive lifestyle scenes.
Qwen Image Edit Plus Relight
Advanced image relighting using natural language prompts.
Qwen Image Edit Plus Remove Lighting
Remove artificial lighting effects and restore natural tones.
Qwen Image Edit Plus Texture Apply
Apply precise textures to images using natural language.
Qwen Image Edit Plus Texture Extract
Extract seamless, tileable textures from photographs.
Qwen Image Edit Plus Add People Lora
Generate realistic multi-character scenes with natural interactions.
Qwen Image Edit Plus Multi Lora
Multi-image editing with superior identity and style control.
Qwen Image Edit Plus
Multi-image editing with precise text-guided transformations.
Qwen Image Edit Fast
Qwen-Image-Edit enables precise bilingual image editing for seamless localization and professional content creation.
Qwen Image Fast
Qwen-Image expertly generates stunning images with complex text integration, especially for Chinese typography.
Qwen Image Edit
Transform images effortlessly through semantic context and pixel-perfect appearance changes.
Qwen Image
Qwen-Image revolutionizes image generation and editing with seamless multilingual text integration and photorealistic detail.
Qwen2.5-VL 32B Instruct
Qwen2.5-VL processes text and images seamlessly for advanced multimodal instruction and reasoning.
Qwen2 VL 72B Instruct
Qwen2-VL-72B-Instruct is a state-of-the-art multimodal model excelling in image and video understanding, with advanced capabilities for text-based interaction.
QWEN2-VL-7B-Instruct
The Qwen2-VL-7B-Instruct is a cutting-edge vision-language model with 7 billion parameters, offering advanced capabilities like object recognition, image analysis and visual localization. It can also generate structured outputs and is optimized for both performance and flexibility. It can recognize objects, analyze image content, act as a visual agent, and generate structured data.