Whisper
Free tier
OpenAI's open-source speech recognition: best-in-class accuracy across 99 languages.
Free·Technical·Powered by OpenAI·API available·Open source
Key strengths
99-language support·open-source weights·accent and noise robustness·word timestamps
Free to self-host (the hosted OpenAI API is billed per usage)
San Francisco, US
Founded 2022
Self-hostable
Whisper is an encoder-decoder Transformer; the original release was trained on 680,000 hours of multilingual audio. Model sizes by parameter count: Tiny (39M), Base (74M), Small (244M), Medium (769M), and Large-v3 (1.5B). It is available through the OpenAI API (/v1/audio/transcriptions) or can be run locally from the open-source weights. Word-level timestamps are available through, among other options, faster-whisper (a CTranslate2 reimplementation). Whisper supports transcription, translation into English, and language detection.
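As a rough illustration of the local route with word-level timestamps, here is a minimal sketch using the third-party faster-whisper package. It assumes `pip install faster-whisper` has been run; the `Word` dataclass, the helper names `transcribe_words` and `format_words`, the file name `meeting.wav`, and the choice of the `small` model on CPU with int8 quantization are all illustrative assumptions, not part of Whisper itself.

```python
from dataclasses import dataclass
from typing import List, Tuple


@dataclass
class Word:
    """One recognized word with its start and end time in seconds (hypothetical helper type)."""
    start: float
    end: float
    text: str


def transcribe_words(audio_path: str, model_size: str = "small") -> Tuple[str, List[Word]]:
    """Transcribe a local audio file and return (detected language, words).

    Assumes the faster-whisper package is installed; the import is kept
    inside the function so the formatting helper below works without it.
    """
    from faster_whisper import WhisperModel

    # Model size names follow the Whisper checkpoints: tiny, base, small, medium, large-v3.
    model = WhisperModel(model_size, device="cpu", compute_type="int8")

    # transcribe() returns a lazy generator of segments plus an info object;
    # word_timestamps=True attaches per-word timing to each segment.
    segments, info = model.transcribe(audio_path, word_timestamps=True)

    words = [
        Word(start=w.start, end=w.end, text=w.word.strip())
        for segment in segments
        for w in segment.words
    ]
    return info.language, words


def format_words(words: List[Word]) -> List[str]:
    """Render each word as '[start-end] text' with two decimal places."""
    return [f"[{w.start:.2f}-{w.end:.2f}] {w.text}" for w in words]


if __name__ == "__main__":
    # "meeting.wav" is a placeholder path; replace it with a real audio file.
    language, words = transcribe_words("meeting.wav")
    print("Detected language:", language)
    print("\n".join(format_words(words)))
```

Smaller models such as `tiny` or `base` trade accuracy for speed, so the model size is usually picked to fit the available hardware.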
