Llama 3.2 11B Vision Instruct Pricing
Instruction-tuned image reasoning model from Meta with 11B parameters. Optimized for visual recognition, image reasoning, captioning, and answering general questions about an image. The model can understand visual data, such as charts and graphs and also bridge the gap between vision and language by generating text to describe images details.
~1.69s
~$0.001
Simple, Transparent Pricing
Pay only for what you use. No hidden fees, no commitments.
Serverless
Pay-as-you-go pricing with credits that work across all Segmind models
Input
$0.270
Output
$0.270
per million tokens
No upfront costs - Only pay for what you use
Auto-scaling - Handles traffic spikes automatically
Universal credits - Use anywhere on Segmind
Instant deployment - Start using immediately
Need more credits? Buy credits