LLaVA 13B

LLaVA 13B is a Vision-language model which allows both image and text as inputs.

~1.52s
~$0

Simple, Transparent Pricing

Pay only for what you use. No hidden fees, no commitments.

Serverless

Pay-as-you-go pricing with credits that work across all Segmind models

Input
$0.300
Output
$0.300
per million tokens
No upfront costs - Only pay for what you use
Auto-scaling - Handles traffic spikes automatically
Universal credits - Use anywhere on Segmind
Instant deployment - Start using immediately

Need more credits? Buy credits