Deepgram

Free tier

The most accurate and cost-effective real-time APIs for speech-to-text, text-to-speech, and voice agents

Free tier available·All audiences·Powered by Deepgram·API available

Key strengths

Real-time and batch speech-to-text with industry-leading accuracy
Unified Voice Agent API combining STT, TTS, and LLM orchestration
Available both cloud-hosted and self-hosted for enterprise compliance
Multilingual support across 10+ languages, including Flux conversational STT
Cost-effective at scale with enterprise-grade reliability

Free tier + paid plans
San Francisco, USA
Founded 2015
Self-hostable

Deepgram offers a suite of REST and WebSocket APIs covering Speech-to-Text (STT), Text-to-Speech (TTS), a unified Voice Agent API, and Audio Intelligence. The Voice Agent API eliminates the need to stitch together separate STT, LLM, and TTS components by unifying them into a single low-latency pipeline with built-in business logic and external system hooks. Models include Nova (transcription) and Flux (multilingual conversational STT supporting 10 languages). The platform is available as a managed cloud service or as a self-hosted deployment for enterprises with strict compliance or data residency requirements.
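As a rough illustration of the REST side of the Speech-to-Text API, the sketch below builds (but does not send) a transcription request for a hosted audio file using only the Python standard library. The endpoint path, the "Token" authorization scheme, the JSON body shape, and the model name "nova-3" are assumptions not stated in this listing; check them against Deepgram's current API reference before use. The helper name build_transcription_request is hypothetical.

```python
import json
import urllib.parse
import urllib.request

# Assumed endpoint; confirm against Deepgram's current API reference.
DEEPGRAM_LISTEN_URL = "https://api.deepgram.com/v1/listen"


def build_transcription_request(api_key, audio_url, model="nova-3", language="en"):
    """Build, without sending, a POST request asking for a transcript of audio_url.

    The model name and query parameters are illustrative assumptions.
    """
    query = urllib.parse.urlencode({"model": model, "language": language})
    body = json.dumps({"url": audio_url}).encode("utf-8")
    return urllib.request.Request(
        f"{DEEPGRAM_LISTEN_URL}?{query}",
        data=body,
        method="POST",
        headers={
            # Assumed auth scheme: the API key passed as a "Token" credential.
            "Authorization": f"Token {api_key}",
            "Content-Type": "application/json",
        },
    )


# Sending the request needs a real API key and network access:
#   req = build_transcription_request("YOUR_API_KEY", "https://example.com/audio.wav")
#   with urllib.request.urlopen(req) as resp:
#       result = json.load(resp)
```

Keeping request construction separate from the network call makes the URL, headers, and body easy to inspect before any audio is submitted. Real-time streaming would go over the WebSocket API instead, which this sketch does not cover.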