Pricing

CompactifAI offers simple, transparent pay-as-you-go pricing for our API services.

Our pricing is based on tokens processed through the API. We charge separately for:

  • Input tokens: Text sent to the API
  • Output tokens: Text generated by the API

A token represents approximately 4 characters or 0.75 words in English.

Model                         Input (per 1M tokens)   Output (per 1M tokens)
cai-llama-4-scout-slim        $0.07                   $0.10
llama-4-scout                 $0.10                   $0.14
cai-llama-3-3-70b-slim        $0.16                   $0.31
llama-3-3-70b                 $0.32                   $0.64
cai-llama-3-1-8b-slim         $0.05                   $0.07
llama-3-1-8b                  $0.06                   $0.10
cai-mistral-small-3-1-slim    $0.05                   $0.08
mistral-small-3-1             $0.11                   $0.17
deepseek-r1-0528              $0.46                   $0.74

  • Pay only for what you use
  • No monthly commitments or minimum fees
  • Billing based on actual token usage

Usage: 5M input tokens + 2M output tokens with cai-llama-3-1-8b-slim
Cost calculation:
- 5M input tokens × $0.05 per 1M tokens = $0.25
- 2M output tokens × $0.07 per 1M tokens = $0.14
Total cost: $0.39
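
The same arithmetic can be sketched in a few lines of Python. This is only an illustration: the PRICES dictionary and the estimate_cost helper are hypothetical names, not part of the CompactifAI API, and the prices are copied by hand from the table above (only three models are shown). Decimal is used so that dollar amounts are not distorted by floating-point rounding.

```python
from decimal import Decimal

# USD per 1M tokens as (input, output), copied from the pricing table.
# Hypothetical lookup table for illustration; extend it with other models as needed.
PRICES = {
    "cai-llama-3-1-8b-slim": (Decimal("0.05"), Decimal("0.07")),
    "llama-3-1-8b": (Decimal("0.06"), Decimal("0.10")),
    "deepseek-r1-0528": (Decimal("0.46"), Decimal("0.74")),
}

TOKENS_PER_UNIT = Decimal(1_000_000)  # prices are quoted per 1M tokens


def estimate_cost(model: str, input_tokens: int, output_tokens: int) -> Decimal:
    """Return the estimated cost in USD for the given token counts."""
    input_price, output_price = PRICES[model]
    total = Decimal(input_tokens) * input_price + Decimal(output_tokens) * output_price
    return total / TOKENS_PER_UNIT


# The worked example: 5M input tokens + 2M output tokens.
cost = estimate_cost("cai-llama-3-1-8b-slim", 5_000_000, 2_000_000)
print(f"${cost:.2f}")
```

For the usage in the example, this reproduces the $0.39 total.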

You can track your token usage through:
  • API endpoint: /usage/completions
  • CompactifAI Dashboard: Coming soon! Our upcoming dashboard will make it easy to track your token usage and manage your account.

How do I estimate my token usage?
A token represents approximately 4 characters or 0.75 words in English. For more specific estimations, contact our support team.
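
As a rough illustration, both rules of thumb can be applied directly to a piece of text. The estimate_tokens helper below is a hypothetical sketch, not an official tokenizer: actual counts depend on the model's tokenizer and will differ, especially for code or non-English text.

```python
import math


def estimate_tokens(text: str) -> tuple[int, int]:
    """Return two rough token estimates for English text.

    The first uses ~4 characters per token, the second ~0.75 words per token.
    Both are approximations, not the count the API will bill.
    """
    by_chars = math.ceil(len(text) / 4)
    by_words = math.ceil(len(text.split()) / 0.75)
    return by_chars, by_words


by_chars, by_words = estimate_tokens("hello world how are you")
print(by_chars, by_words)
```

The two estimates will usually differ slightly; taking the larger one gives a more conservative budget.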

How often will I be billed?
Billing occurs monthly based on your actual usage.

Is there a minimum spend requirement?
No, you only pay for what you use.

For questions about pricing or to request a custom enterprise plan, please contact our sales team.