LLaVA 13B
LLaVA 13B is a Vision-language model which allows both image and text as inputs.
~1.52s
~$0
Simple, Transparent Pricing
Pay only for what you use. No hidden fees, no commitments.
Serverless
Pay-as-you-go pricing with credits that work across all Segmind models
Input
$0.300
Output
$0.300
per million tokens
No upfront costs - Only pay for what you use
Auto-scaling - Handles traffic spikes automatically
Universal credits - Use anywhere on Segmind
Instant deployment - Start using immediately
Need more credits? Buy credits