LLaVA 13B

LLaVA 13B is a vision-language model that accepts both image and text as inputs.


Pricing

Serverless Pricing

Buy credits that can be used anywhere on Segmind

Input: $0.300, Output: $0.300 per million tokens
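At these rates, per-request cost is straightforward to estimate. A minimal sketch (the token counts in the example are hypothetical, and the rates are taken from the pricing above):

```python
# Estimate serverless cost for a LLaVA 13B call at the listed rates:
# $0.300 per million tokens for both input and output.
INPUT_RATE = 0.300 / 1_000_000   # USD per input token
OUTPUT_RATE = 0.300 / 1_000_000  # USD per output token

def estimate_cost(input_tokens: int, output_tokens: int) -> float:
    """Return the estimated USD cost for one request."""
    return input_tokens * INPUT_RATE + output_tokens * OUTPUT_RATE

# Example: a 1,500-token prompt (image + text) with a 500-token answer.
cost = estimate_cost(1_500, 500)
print(f"${cost:.6f}")  # $0.000600
```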


LLaVA 13B is a vision-language model (VLM) trained on instruction-following data generated by open-source LLMs. Its architecture enables seamless interaction between visual content and textual prompts. LLaVA 13B supports multi-image and multi-prompt generation: you can include multiple images in a single query, enhancing context and specificity.

Applications

  • Image Captioning: Generate descriptive captions for images, enriching content across social media, e-commerce, and more.

  • Visual Question Answering (VQA): Pose questions about images, and LLaVA 13B provides accurate answers.

  • Creative Writing: Fuel your imagination by combining visual cues with textual prompts.
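Since the model takes both images and text, a request typically bundles a text prompt with one or more encoded images. The sketch below shows one plausible way to build such a payload; the field names (`prompt`, `images`) and the base64 encoding are illustrative assumptions, not the confirmed API schema, so check the actual API reference before use:

```python
import base64
import json

def build_payload(prompt: str, image_bytes_list: list[bytes]) -> str:
    """Bundle a text prompt with base64-encoded images into a JSON body.

    Field names here are hypothetical placeholders for illustration.
    """
    return json.dumps({
        "prompt": prompt,
        "images": [
            base64.b64encode(img).decode("ascii")
            for img in image_bytes_list
        ],
    })

# Example: a multi-image query with two (fake) image byte strings.
payload = build_payload(
    "What differs between these two photos?",
    [b"\x89PNG-bytes-1", b"\x89PNG-bytes-2"],
)
```

Base64 encoding keeps binary image data safe inside a JSON body, which is a common convention for vision-model HTTP APIs.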
