Model ID
cai-whisper-large-v3-turbo-slim
Base Architecture
Whisper Large V3 Turbo
| Specification | Value |
|---|---|
| Parameters | ~0.4B |
| Input Features | Log-Mel spectrogram with 128 Mel bins, computed at a 16 kHz sampling rate with an FFT window of 400 samples (25 ms) and a hop length of 160 samples (10 ms) |
| Max Audio Duration | ~30 seconds per chunk (sliding window supported) |
| Supported File Types | flac, mp3, mp4, mpeg, mpga, m4a, ogg, wav, webm |
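
The figures in the table can be sanity-checked with a few lines of arithmetic. The following is a minimal sketch in plain Python, not the service's actual preprocessing code: the helper names (`samples_to_ms`, `frames_per_chunk`, `is_supported`, `chunk_bounds`) are hypothetical, and the 5-second overlap used for the sliding window is an assumption chosen for illustration, since the overlap is not specified here.

```python
from pathlib import Path

# Values taken from the specification table.
SAMPLE_RATE = 16_000   # Hz
N_FFT = 400            # samples per FFT window
HOP_LENGTH = 160       # samples between successive frames
N_MELS = 128           # Mel bins per frame
CHUNK_SECONDS = 30     # nominal maximum chunk duration
SUPPORTED_EXTENSIONS = {
    "flac", "mp3", "mp4", "mpeg", "mpga", "m4a", "ogg", "wav", "webm",
}


def samples_to_ms(n_samples: int, sample_rate: int = SAMPLE_RATE) -> float:
    """Convert a length in samples to milliseconds."""
    return 1000.0 * n_samples / sample_rate


def frames_per_chunk(chunk_seconds: int = CHUNK_SECONDS) -> int:
    """Number of spectrogram frames produced for one chunk of audio."""
    return chunk_seconds * SAMPLE_RATE // HOP_LENGTH


def is_supported(filename: str) -> bool:
    """Check the file extension against the supported list (case-insensitive)."""
    return Path(filename).suffix.lower().lstrip(".") in SUPPORTED_EXTENSIONS


def chunk_bounds(total_samples: int, overlap_seconds: float = 5.0):
    """Return (start, end) sample indices for overlapping 30-second windows.

    The overlap is an illustrative assumption, not a documented setting.
    """
    chunk = CHUNK_SECONDS * SAMPLE_RATE
    step = chunk - int(overlap_seconds * SAMPLE_RATE)
    if step <= 0:
        raise ValueError("overlap must be shorter than the chunk length")
    bounds = []
    start = 0
    while start < total_samples:
        end = min(start + chunk, total_samples)
        bounds.append((start, end))
        if end == total_samples:
            break
        start += step
    return bounds


if __name__ == "__main__":
    print(samples_to_ms(N_FFT), samples_to_ms(HOP_LENGTH))
    print(frames_per_chunk(), "frames x", N_MELS, "Mel bins per chunk")
    print(chunk_bounds(70 * SAMPLE_RATE))
```

With a 160-sample hop at 16 kHz, one 30-second chunk corresponds to 3000 frames of 128 Mel bins each. Under the assumed 5-second overlap, a 70-second recording would be split into three windows.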