Changelog

This page lists all notable changes to the CompactifAI API.

2025-07-01

deepseek-ai/DeepSeek-R1-0528 is now available through our API. It can be prompted using the deepseek-r1-0528 model id.
deepseek-r1 has been deprecated.

Initial release of the CompactifAI inference API with the following features:
- Models API endpoint for listing and retrieving available compressed models
- Chat Completions API endpoint for conversational interactions
- Completions API endpoint for text generation
OpenAI-compatible API design for easy migration and integration
The release of the following models:
- cai-llama-4-scout-slim
- cai-llama-3-3-70b-slim
- cai-mistral-small-3-1-slim
- cai-llama-3-1-8b-slim