Changelog
This page lists all notable changes to the CompactifAI API.
2025-10-06
Section titled “2025-10-06”- Speech-to-text transcription endpoint
/v1/audio/transcriptionswith Whisper Large V3 support for multilingual transcription workflows. - Feature and API documentation detailing request parameters, Python examples, and guidance for the new speech-to-text capability.
Models Updates
Section titled “Models Updates”- Removed the
deepseek-r1-0528model from the API.
2025-09-24
Section titled “2025-09-24”- Multi-modality support for chat completions, enabling image-plus-text inputs across the API.
Models Updates
Section titled “Models Updates”- Added
mistral-small-3-1model with full multi-modal understanding and refreshed usage examples.
2025-08-18
Section titled “2025-08-18”- Function tool compatibility has been activated in all models except mistral.
2025-07-01
Section titled “2025-07-01”Models Updates
Section titled “Models Updates”- Added deepseek-ai/DeepSeek-R1-0528 model, accessible via the
deepseek-r1-0528model ID. - Deprecated
deepseek-r1.
2025-06-11
Section titled “2025-06-11”- Initial release of the CompactifAI inference API with the following features:
- Models API endpoint for listing and retrieving available compressed models
- Chat Completions API endpoint for conversational interactions
- Completions API endpoint for text generation
- OpenAI-compatible API design for easy migration and integration
Models Updates
Section titled “Models Updates”- Added the following models:
cai-llama-4-scout-slimcai-llama-3-3-70b-slimcai-mistral-small-3-1-slimcai-llama-3-1-8b-slim
Security
Section titled “Security”- HTTPS encryption for all API requests
- Secure authentication using Bearer token scheme