Pricing

Usage-based.
No surprises.

Start free and pay only for what you use. Every plan runs on the same production-grade infrastructure — pick the tier that fits your scale.

Free

For developers exploring and building side projects.

$0
1M tokens included · then $0.20 / million
  • 1M tokens included per month
  • 1 active deployment
  • Shared GPU infrastructure
  • OpenAI-compatible API
  • 3 API keys with rate limits
  • Community support

No credit card required

Business

For production workloads with SLA and scale requirements.

$499
per month · 250M tokens included · then $0.10 / million
  • 250M tokens included per month
  • Unlimited deployments
  • 99.9% uptime SLA with service credits
  • A/B fine-tune traffic split
  • Request log viewer (90-day retention)
  • Team-level budget rollup
  • Dedicated solutions engineer

Model pricing

Per-model token rates

Pro and Business plans get progressively lower rates as your volume commitment grows.

Model                   Provider    Context  Available on  Free ($/M tokens)  Pro ($/M tokens)  Business ($/M tokens)
Phi-3 Mini              Microsoft   4K       Free+         $0.12              $0.09             $0.06
Mistral 7B              Mistral AI  32K      Free+         $0.18              $0.13             $0.09
Llama 3.1 8B            Meta        128K     Free+         $0.20              $0.15             $0.10
DeepSeek R1 7B Distill  DeepSeek    64K      Free+         $0.24              $0.18             $0.13
CodeLlama 13B           Meta        16K      Free+         $0.25              $0.18             $0.13
Mixtral 8×7B            Mistral AI  32K      Pro+          $0.55              $0.40             $0.30
Llama 3.1 70B           Meta        128K     Pro+          $0.85              $0.65             $0.45
Qwen 2.5 72B            Alibaba     128K     Pro+          $0.85              $0.65             $0.45

Rates are per million tokens (combined input + output). Reasoning models are priced with split input/output rates. Custom pricing for 6B+ tokens/month — contact sales.
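As a rough illustration of how the per-model rates apply, the sketch below looks up a rate from the table above and multiplies it by combined input and output tokens. The names `RATES_PER_M` and `usage_cost` are hypothetical, not part of any Cloudach SDK, and the sketch ignores included tokens, plan availability, and the split input/output pricing mentioned for reasoning models.

```python
# Per-model rates in USD per million tokens, copied from the table above.
# The dictionary layout and function name are illustrative assumptions.
RATES_PER_M = {
    "Phi-3 Mini": {"free": 0.12, "pro": 0.09, "business": 0.06},
    "Mistral 7B": {"free": 0.18, "pro": 0.13, "business": 0.09},
    "Llama 3.1 8B": {"free": 0.20, "pro": 0.15, "business": 0.10},
    "DeepSeek R1 7B Distill": {"free": 0.24, "pro": 0.18, "business": 0.13},
    "CodeLlama 13B": {"free": 0.25, "pro": 0.18, "business": 0.13},
    "Mixtral 8×7B": {"free": 0.55, "pro": 0.40, "business": 0.30},
    "Llama 3.1 70B": {"free": 0.85, "pro": 0.65, "business": 0.45},
    "Qwen 2.5 72B": {"free": 0.85, "pro": 0.65, "business": 0.45},
}


def usage_cost(model: str, tier: str, input_tokens: int, output_tokens: int) -> float:
    """Return the metered cost in USD at a single combined rate.

    Assumes input and output tokens are billed at the same per-million rate,
    as the note under the table states for non-reasoning models.
    """
    rate = RATES_PER_M[model][tier]
    total_millions = (input_tokens + output_tokens) / 1_000_000
    return rate * total_millions


if __name__ == "__main__":
    # 30M input + 10M output tokens on Llama 3.1 70B at the Pro rate.
    cost = usage_cost("Llama 3.1 70B", "pro", 30_000_000, 10_000_000)
    print(f"${cost:.2f}")
```

Under these assumptions, 40M combined tokens on Llama 3.1 70B at the Pro rate of $0.65 per million comes to about $26.00 before any included allowance is applied.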

Calculator

See what you'd pay

Drag the slider to estimate your monthly bill based on token volume.

Estimate your monthly cost

Adjust the sliders to see a real-time cost estimate for your usage.

1M tokens to 500M tokens
9M overage × $0.20/M: $1.80
Estimated monthly total: $1.80
Effective rate: $0.180 / million tokens

Estimates are approximate. Actual billing is based on metered token usage. See billing docs →
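The arithmetic behind the sample estimate can be reproduced with a short sketch. It assumes the bill is the plan's flat fee plus overage beyond the included tokens, charged at the plan's headline overage rate, and that the effective rate is the total divided by all tokens used. The `Plan` class and function names are hypothetical, and actual metered billing may differ (for example, by model).

```python
from dataclasses import dataclass


@dataclass(frozen=True)
class Plan:
    name: str
    base_fee: float           # flat monthly fee in USD
    included_tokens_m: float  # tokens included per month, in millions
    overage_rate: float       # USD per million tokens beyond the allowance


# Figures taken from the plan cards on this page.
FREE = Plan("Free", 0.0, 1, 0.20)
BUSINESS = Plan("Business", 499.0, 250, 0.10)


def estimate_monthly_cost(plan: Plan, tokens_m: float) -> float:
    """Flat fee plus overage on tokens beyond the included allowance."""
    overage_m = max(0.0, tokens_m - plan.included_tokens_m)
    return plan.base_fee + overage_m * plan.overage_rate


def effective_rate(plan: Plan, tokens_m: float) -> float:
    """Total cost divided by all tokens used, in USD per million."""
    if tokens_m <= 0:
        raise ValueError("tokens_m must be positive")
    return estimate_monthly_cost(plan, tokens_m) / tokens_m


if __name__ == "__main__":
    # 10M tokens on Free: 9M overage at $0.20/M.
    print(f"${estimate_monthly_cost(FREE, 10):.2f}")
    print(f"${effective_rate(FREE, 10):.3f} / million tokens")
```

With 10M tokens on the Free plan, this gives 9M of overage at $0.20/M, matching the $1.80 total and $0.180 effective rate shown in the sample estimate.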

Start building in minutes.

No credit card required. Deploy your first model on the Free plan and upgrade when you're ready to scale.

5,000+ developers already on Cloudach · No lock-in · Cancel anytime