Skip to content

Models Catalog

Welcome to our Models Catalog, where you can explore our collection of cutting-edge compressed language models. CompactifAI API delivers a new class of compressed, ultra-efficient AI models engineered for performance, sustainability, and adaptability. These dramatically reduce compute and energy costs while maintaining enterprise-grade accuracy and speed. It adapts seamlessly to diverse environments, ensuring consistent, high-performance AI. With turnkey software and an intuitive API, it offers cost effective inference enabling enterprises to innovate faster, scale cost-effectively, and unlock competitive edge.

  • Dramatic Cost Reduction: Up to 70% lower inference costs compared to uncompressed models through optimized resource utilization
  • Massive Throughput Gains: Process up to 4x more requests per second with compressed models requiring fewer computational resources
  • Low-Latency Inference: Achieve faster response times due to reduced model size and optimized memory usage
  • Minimal Quality Loss: Advanced compression techniques preserve model performance with typically <5% benchmark difference
  • Superior Concurrency: Support significantly more simultaneous users and requests with the same hardware resources
  • Resource Efficiency: Reduced memory footprint and computational requirements enable better hardware utilization

Use the following API model identifiers in your requests:

Model NameOriginal ArchitectureModel IDAvailable
DeepSeek R1 Slim by CompactifAIDeepSeek R1cai-deepseek-r1-slimNo (Coming soon)
DeepSeek R1 0528-deepseek-r1-0528Yes
Llama 4 Scout Slim by CompactifAILlama 4 Scoutcai-llama-4-scout-slimYes
Llama 4 Scout-llama-4-scoutYes
Llama 3.3 70B Slim by CompactifAILlama 3.3 70Bcai-llama-3-3-70b-slimYes
Llama 3.3 70B-llama-3-3-70bYes
Llama 3.1 8B Slim by CompactifAILlama 3.1 8Bcai-llama-3-1-8b-slimYes
Llama 3.1 8B Slim Reasoning by CompactifAILlama 3.1 8Bcai-llama-3-1-8b-slim-rYes
Llama 3.1 8B-llama-3-1-8bYes
Mistral Small 3.1 Slim by CompactifAIMistral Small 3.1cai-mistral-small-3-1-slimYes
Mistral Small 3.1-mistral-small-3-1Yes
Openai GPT OSS 20B-gpt-oss-20bYes
Openai GPT OSS 120B-gpt-oss-120bYes