23
models
Qwen AI Models
Qwen is Alibaba's comprehensive multimodal AI model family — spanning image generation, instruction-following image editing, and vision-language understanding in one integrated ecosystem. The collection includes Qwen Image models for high-quality text-to-image generation, the full Qwen Image Edit and Edit Plus suite for advanced instruction-based image editing (with specialized tools for adding people, generating group photos, product photography, texture manipulation, multi-LoRA styling, relighting, and more), and Qwen2-VL and Qwen2.5-VL for state-of-the-art vision-language reasoning — enabling image analysis, document understanding, visual question answering, chart interpretation, and multimodal content extraction. Qwen models are renowned for their exceptional multilingual capabilities, particularly Chinese, Japanese, Korean, and other Asian languages — making them the go-to choice for teams building applications for Asian markets. Qwen2-VL 72B delivers frontier-level multimodal reasoning comparable to leading closed models, while the 7B variant offers an efficient option for high-throughput applications. Qwen2.5-VL 32B combines strong visual understanding with the latest Qwen architecture improvements for even better performance across complex multimodal tasks. On Segmind, all Qwen models are available as pay-per-use APIs — no Alibaba Cloud account needed. Integrate text-to-image generation, image editing, or visual understanding into your application with a single endpoint call.
Qwen Image 2512
Qwen-Image-2512 generates highly realistic images from text descriptions, excelling in human depiction and environmental detail.
Qwen Image Edit Plus Blend It
Seamlessly integrates products into backgrounds with precise lighting and perspective adjustments for realistic composited images.
Qwen Image Edit Plus Eigen Banana
Eigen-Banana-Qwen-Image-Edit enables precise, text-guided transformations of images for diverse applications.
Qwen Image Edit Plus Eraser
Intelligently removes unwanted objects from images while preserving realistic backgrounds and scene integrity.
Qwen Image Edit Plus Face To Portrait
Transforms cropped facial images into stunning, identity-preserving portrait photographs.
Qwen Image Edit Plus Group Photo
Generates realistic group photos by merging multiple individual portraits while ensuring facial consistency and nostalgic aesthetics.
Qwen Image Edit Plus Multiple Angles
Transform any image perspective dynamically using natural language for professional-grade results.
Qwen Image Edit Plus Next Scene
Creates cinematic sequences with seamless visual flow, enhancing storytelling in digital media.
Qwen Image Edit Plus Product Photography
Transforms white-background images into immersive, realistic scenes for professional-quality visual storytelling.
Qwen Image Edit Plus Relight
Transform any image with advanced lighting manipulation using natural language prompts, enhancing realism and atmosphere.
Qwen Image Edit Plus Remove Lighting
Automatically restores natural lighting and removes artificial effects for stunning, professional-quality images.
Qwen Image Edit Plus Texture Apply
Seamlessly applies precise textures to images based on natural language prompts for enhanced visual quality.
Qwen Image Edit Plus Texture Extract
Effortlessly extracts and generates seamless, tileable textures from photographs for digital creatives.
Qwen Image Edit Plus Add People Lora
Effortlessly generates realistic multi-character scenes with natural interactions for diverse creative applications.
Qwen Image Edit Plus Multi Lora
Qwen Image Edit Plus Multi lora enables seamless multi-image editing with superior detail preservation for professional-grade visuals.
Qwen Image Edit Plus
Qwen Image Edit Plus revolutionizes multi-image editing with precise transformations and facial consistency.
Qwen Image Edit Fast
Qwen-Image-Edit enables precise bilingual image editing for seamless localization and professional content creation.
Qwen Image Fast
Qwen-Image expertly generates stunning images with complex text integration, especially for Chinese typography.
Qwen Image Edit
Transform images effortlessly through semantic context and pixel-perfect appearance changes.
Qwen Image
Qwen-Image revolutionizes image generation and editing with seamless multilingual text integration and photorealistic detail.
Qwen2.5-VL 32B Instruct
Qwen2.5-VL processes text and images seamlessly for advanced multimodal instruction and reasoning.
Qwen2 VL 72B Instruct
Qwen2-VL-72B-Instruct is a state-of-the-art multimodal model excelling in image and video understanding, with advanced capabilities for text-based interaction.
QWEN2-VL-7B-Instruct
The Qwen2-VL-7B-Instruct is a cutting-edge vision-language model with 7 billion parameters, offering advanced capabilities like object recognition, image analysis and visual localization. It can also generate structured outputs and is optimized for both performance and flexibility. It can recognize objects, analyze image content, act as a visual agent, and generate structured data.