Llama 3 70b Pricing

Meta developed and released the Meta Llama 3 family of large language models (LLMs), a collection of pretrained and instruction tuned generative text models in 8 and 70B sizes. The Llama 3 instruction tuned models are optimized for dialogue use cases and outperform many of the available open source chat models on common industry benchmarks.

Simple, Transparent Pricing

Pay only for what you use. No hidden fees, no commitments.

Serverless

Pay-as-you-go pricing with credits that work across all Segmind models

Input
$1.100
Output
$1.100
per million tokens
No upfront costs - Only pay for what you use
Auto-scaling - Handles traffic spikes automatically
Universal credits - Use anywhere on Segmind
Instant deployment - Start using immediately

Need more credits? Buy credits