Deepgram
Free tierThe most accurate and cost-effective real-time APIs for speech-to-text, text-to-speech, and voice agents
Free tier available·All audiences·Powered by Deepgram·API available
Key strengths
Real-time and batch speech-to-text with industry-leading accuracyUnified Voice Agent API combining STT, TTS, and LLM orchestrationAvailable both cloud-hosted and self-hosted for enterprise complianceMultilingual support across 10+ languages including Flux conversational STTCost-effective at scale with enterprise-grade reliability
Free tier + paid plans
San Francisco, USA
Founded 2015
Self-hostable
No ratings yet
- Real-time transcription pipelines — Stream audio from WebSocket sources and receive word-level transcripts with low latency using the Nova model for live captioning or call analytics.
- Voice agent development — Use the unified Voice Agent API to build end-to-end conversational AI systems without managing separate STT, LLM, and TTS services.
- Custom model training — Work with Deepgram's enterprise team to fine-tune acoustic and language models on domain-specific vocabulary (e.g., medical, legal, financial).
- Self-hosted deployments — Deploy Deepgram's inference engine on-premises or in a private cloud to meet HIPAA, SOC 2, or data residency compliance requirements.
- Audio Intelligence enrichment — Augment transcription outputs with automated summarization, sentiment scoring, topic tagging, and intent detection via a single API call.
- Platform/partner embedding — Integrate Deepgram's Voice AI as a white-label component into SaaS platforms, telephony systems, or contact center solutions via the partner program.
