Llama 3.2 11B Vision Instruct Pricing

Instruction-tuned image reasoning model from Meta with 11B parameters. Optimized for visual recognition, image reasoning, captioning, and answering general questions about an image. The model can understand visual data, such as charts and graphs and also bridge the gap between vision and language by generating text to describe images details.

~1.69s

~$0.001

PlaygroundAPIPricing

~1.69s

~$0.001

Simple, Transparent Pricing

Pay only for what you use. No hidden fees, no commitments.

Serverless

Pay-as-you-go pricing with credits that work across all Segmind models

Input

$0.270

Output

$0.270

per million tokens

No upfront costs - Only pay for what you use

Auto-scaling - Handles traffic spikes automatically

Universal credits - Use anywhere on Segmind

Instant deployment - Start using immediately

Need more credits? Buy credits