Submit tool

Whisper

Free tier

OpenAI's open-source speech recognition — best-in-class accuracy across 99 languages.

Free·Technical·Powered by OpenAI·API available·Open source

Visit

Key strengths

99 language supportopen-source weightsaccents and noise robustnessword timestamps

Completely free

San Francisco, US

Founded 2022

Self-hostable

No ratings yet

Whisper is an encoder-decoder Transformer trained on 680,000 hours of multilingual audio. Model sizes: Tiny (39M), Base (74M), Small (244M), Medium (769M), Large-v3 (1.5B). Available via the OpenAI API (/v1/audio/transcriptions) or run locally. Word-level timestamps available via faster-whisper (CTranslate2 implementation). Supports transcription, translation to English, and language detection.