Deepgram logo

Deepgram

Free tier

The most accurate and cost-effective real-time APIs for speech-to-text, text-to-speech, and voice agents

Free tier available·All audiences·Powered by Deepgram·API available

Key strengths

Real-time and batch speech-to-text with industry-leading accuracyUnified Voice Agent API combining STT, TTS, and LLM orchestrationAvailable both cloud-hosted and self-hosted for enterprise complianceMultilingual support across 10+ languages including Flux conversational STTCost-effective at scale with enterprise-grade reliability
Free tier + paid plans
San Francisco, USA
Founded 2015
Self-hostable
No ratings yet
  • Real-time transcription pipelines — Stream audio from WebSocket sources and receive word-level transcripts with low latency using the Nova model for live captioning or call analytics.
  • Voice agent development — Use the unified Voice Agent API to build end-to-end conversational AI systems without managing separate STT, LLM, and TTS services.
  • Custom model training — Work with Deepgram's enterprise team to fine-tune acoustic and language models on domain-specific vocabulary (e.g., medical, legal, financial).
  • Self-hosted deployments — Deploy Deepgram's inference engine on-premises or in a private cloud to meet HIPAA, SOC 2, or data residency compliance requirements.
  • Audio Intelligence enrichment — Augment transcription outputs with automated summarization, sentiment scoring, topic tagging, and intent detection via a single API call.
  • Platform/partner embedding — Integrate Deepgram's Voice AI as a white-label component into SaaS platforms, telephony systems, or contact center solutions via the partner program.