Skip to content

Changelog

This page lists all notable changes to the CompactifAI API.

  • Speech-to-text transcription endpoint /v1/audio/transcriptions with Whisper Large V3 support for multilingual transcription workflows.
  • Feature and API documentation detailing request parameters, Python examples, and guidance for the new speech-to-text capability.
  • Removed the deepseek-r1-0528 model from the API.
  • Multi-modality support for chat completions, enabling image-plus-text inputs across the API.
  • Added mistral-small-3-1 model with full multi-modal understanding and refreshed usage examples.
  • Function tool compatibility has been activated in all models except mistral.
  • Added deepseek-ai/DeepSeek-R1-0528 model, accessible via the deepseek-r1-0528 model ID.
  • Deprecated deepseek-r1.
  • Initial release of the CompactifAI inference API with the following features:
    • Models API endpoint for listing and retrieving available compressed models
    • Chat Completions API endpoint for conversational interactions
    • Completions API endpoint for text generation
  • OpenAI-compatible API design for easy migration and integration
  • Added the following models:
    • cai-llama-4-scout-slim
    • cai-llama-3-3-70b-slim
    • cai-mistral-small-3-1-slim
    • cai-llama-3-1-8b-slim
  • HTTPS encryption for all API requests
  • Secure authentication using Bearer token scheme