Llama 3.2 11B Vision Instruct Pricing

Instruction-tuned image reasoning model from Meta with 11B parameters. Optimized for visual recognition, image reasoning, captioning, and answering general questions about an image. The model can interpret visual data such as charts and graphs, and it bridges vision and language by generating text that describes image details.

~1.69s
~$0.001

Simple, Transparent Pricing

Pay only for what you use. No hidden fees, no commitments.

Serverless

Pay-as-you-go pricing with credits that work across all Segmind models

Input: $0.270 per million tokens
Output: $0.270 per million tokens
No upfront costs - Only pay for what you use
Auto-scaling - Handles traffic spikes automatically
Universal credits - Use anywhere on Segmind
Instant deployment - Start using immediately
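
As a rough illustration of how the per-million-token rates translate into a per-request charge, here is a minimal Python sketch. The function name `estimate_cost_usd` and the token counts in the usage note are hypothetical and chosen only for illustration; the two rates are the ones listed above. It assumes billing is a simple linear function of input and output token counts, and it does not model how image inputs are converted into tokens, which this page does not state.

```python
from decimal import Decimal

# Rates taken from the pricing listed above (USD per one million tokens).
INPUT_USD_PER_MILLION = Decimal("0.270")
OUTPUT_USD_PER_MILLION = Decimal("0.270")
TOKENS_PER_MILLION = Decimal(1_000_000)


def estimate_cost_usd(input_tokens: int, output_tokens: int) -> Decimal:
    """Estimate the charge for one request, assuming purely linear per-token billing.

    Hypothetical helper: the token counts must come from wherever your
    application records usage; this function only does the arithmetic.
    """
    if input_tokens < 0 or output_tokens < 0:
        raise ValueError("token counts must be non-negative")
    total = (
        input_tokens * INPUT_USD_PER_MILLION
        + output_tokens * OUTPUT_USD_PER_MILLION
    )
    return total / TOKENS_PER_MILLION
```

For example, `estimate_cost_usd(3000, 700)` covers a hypothetical request with 3,000 input tokens and 700 output tokens, which works out to just under a tenth of a cent. `Decimal` is used instead of floats so that small per-request amounts add up without rounding drift.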
