Pricing

CompactifAI Pricing

CompactifAI offers simple, transparent pay-as-you-go pricing for our API services.

Overview

Our pricing is based on tokens processed through the API. We charge separately for:

Input tokens: Text sent to the API
Output tokens: Text generated by the API

A token represents approximately 4 characters or 0.75 words in English.

Model Pricing

Model	Input (per 1M tokens)	Output (per 1M tokens)
cai-llama-4-scout-slim	$0.07	$0.10
llama-4-scout	$0.10	$0.14
cai-llama-3-3-70b-slim	$0.11	$0.21
llama-3-3-70b	$0.15	$0.31
cai-llama-3-1-8b-slim	$0.02	$0.07
llama-3-1-8b	$0.02	$0.09
cai-mistral-small-3-1-slim	$0.05	$0.08
mistral-small-3-1	$0.11	$0.17
cai-deepseek-r1-0528-slim	$0.28	$0.44
gpt-oss-20b	$0.03	$0.10
gpt-oss-120b	$0.05	$0.23
hypernova-60b	$0.04	$0.14
blackstar-10b	$0.02	$0.07

Speech-to-Text Pricing

Model	Price (per audio minute)	Billing Granularity
whisper-large-v3	$0,00034	Per-second billing with 1-minute minimum

Pay-as-you-go

Pay only for what you use
No monthly commitments or minimum fees
Billing based on actual token usage

Billing Example

Usage: 5M input tokens + 2M output tokens with cai-llama-3-1-8b-slim
Cost calculation:
- 5M input tokens × $0.05 = $0.25
- 2M output tokens × $0.07 = $0.14

Total cost: $0.39

Monitor Your Usage

Use the CompactifAI Dashboard to manage API tokens, monitor usage, view account settings, and review billing.

FAQ

How do I estimate my token usage? A token represents approximately 4 characters or 0.75 words in English. For more specific estimations, contact our support team.

How often will I be billed? Billing occurs monthly based on your actual usage.

Is there a minimum spend requirement? No, you only pay for what you use.

Contact Us

For questions about pricing or to request a custom enterprise plan, please contact our sales team.