Rev AI API — Technical Reference

Authentication

All requests require a Bearer token in the Authorization header:

Authorization: Bearer YOUR_ACCESS_TOKEN
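
As a minimal sketch of the header format above, the helper below builds the header as a Python dictionary. The function name auth_headers is hypothetical and is not part of any Rev AI SDK.

```python
def auth_headers(access_token: str) -> dict[str, str]:
    """Return the Authorization header in the Bearer format shown above."""
    if not access_token:
        raise ValueError("access_token must be a non-empty string")
    return {"Authorization": f"Bearer {access_token}"}
```

The returned dictionary can be passed as the headers argument of most Python HTTP clients.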

Async Speech-to-Text

Submit a job via POST /speechtotext/v1/jobs:

{
  "media_url": "https://example.com/audio.mp3",
  "metadata": "optional-job-label",
  "language": "en"
}
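
The request above could be assembled with the Python standard library roughly as follows. This is a sketch under stated assumptions: the REST host https://api.rev.ai is inferred from the streaming URL in this document, the Content-Type header is assumed to be application/json, and build_submit_request is a hypothetical helper name.

```python
import json
import urllib.request

# Assumption: the REST API is served from the same host as the streaming endpoint.
BASE_URL = "https://api.rev.ai"


def build_submit_request(access_token, media_url, metadata=None, language="en"):
    """Build (but do not send) a POST request that submits a transcription job."""
    body = {"media_url": media_url, "language": language}
    if metadata is not None:
        body["metadata"] = metadata  # optional job label
    return urllib.request.Request(
        url=f"{BASE_URL}/speechtotext/v1/jobs",
        data=json.dumps(body).encode("utf-8"),
        headers={
            "Authorization": f"Bearer {access_token}",
            "Content-Type": "application/json",  # assumed, since the body is JSON
        },
        method="POST",
    )
```

To send it, pass the returned object to urllib.request.urlopen and decode the JSON response. Building the request separately from sending it keeps the payload easy to inspect and test offline.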

Poll job status at GET /speechtotext/v1/jobs/{id}, then fetch the transcript at GET /speechtotext/v1/jobs/{id}/transcript.
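
The poll-then-fetch flow can be sketched as a loop with a timeout. The status value "in_progress" is an assumption about the job response, which this document does not specify, and wait_for_job and fetch_status are hypothetical names. The caller supplies fetch_status, a function that performs GET /speechtotext/v1/jobs/{id} and returns the decoded JSON.

```python
import time
from typing import Callable


def wait_for_job(
    fetch_status: Callable[[], dict],
    interval_s: float = 5.0,
    timeout_s: float = 600.0,
    sleep: Callable[[float], None] = time.sleep,
) -> dict:
    """Call fetch_status until the job is no longer in progress, then return it."""
    deadline = time.monotonic() + timeout_s
    while True:
        job = fetch_status()
        # Assumption: an unfinished job reports status "in_progress".
        if job.get("status") != "in_progress":
            return job
        if time.monotonic() >= deadline:
            raise TimeoutError("job did not finish before the timeout")
        sleep(interval_s)
```

Once the returned job reports success, the transcript can be requested from GET /speechtotext/v1/jobs/{id}/transcript. Injecting fetch_status and sleep keeps the loop independent of any particular HTTP client.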

Streaming Speech-to-Text

Connect via WebSocket at wss://api.rev.ai/speechtotext/v1/stream, passing access_token, content_type, and language as query parameters. Send raw audio bytes as binary messages; the server returns partial and final hypotheses as JSON messages in real time.
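
A minimal sketch of preparing the connection URL and classifying incoming messages follows. It uses only the standard library and does not open a socket. The helper names and the assumption that each hypothesis message carries a "type" field set to "partial" or "final" are illustrative and are not confirmed by this document.

```python
import json
from urllib.parse import urlencode

STREAM_URL = "wss://api.rev.ai/speechtotext/v1/stream"


def build_stream_url(access_token: str, content_type: str, language: str = "en") -> str:
    """Return the WebSocket URL with percent-encoded query parameters."""
    query = urlencode({
        "access_token": access_token,
        "content_type": content_type,
        "language": language,
    })
    return f"{STREAM_URL}?{query}"


def hypothesis_kind(raw_message: str) -> str:
    """Return the message's type, or "unknown" if the field is absent."""
    # Assumption: messages are JSON objects with a "type" field.
    return json.loads(raw_message).get("type", "unknown")
```

The resulting URL can be handed to any WebSocket client library, with audio sent as binary messages and each text message passed through hypothesis_kind.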

Key Parameters

language — BCP-47 language code (57+ supported)
speaker_channels_count — Number of audio channels, each carrying one speaker; each channel is transcribed separately
custom_vocabulary_id — Boost domain-specific terms
filter_profanity — Boolean profanity filter
remove_disfluencies — Strip filler words (uh, um)
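
The parameters above could be combined into a job submission body along these lines. This is a sketch: build_job_options is a hypothetical helper, and treating speaker_channels_count as a positive integer, custom_vocabulary_id as a string, and remove_disfluencies as a boolean are assumptions, as is the idea that all of these options may be sent together.

```python
import json


def build_job_options(
    media_url,
    language="en",
    speaker_channels_count=None,
    custom_vocabulary_id=None,
    filter_profanity=False,
    remove_disfluencies=False,
):
    """Return a dict of job options; optional fields are omitted when unset."""
    options = {
        "media_url": media_url,
        "language": language,
        "filter_profanity": bool(filter_profanity),
        "remove_disfluencies": bool(remove_disfluencies),
    }
    if speaker_channels_count is not None:
        # Assumption: the value must be a positive integer.
        if speaker_channels_count < 1:
            raise ValueError("speaker_channels_count must be at least 1")
        options["speaker_channels_count"] = int(speaker_channels_count)
    if custom_vocabulary_id is not None:
        options["custom_vocabulary_id"] = custom_vocabulary_id
    return options


payload = json.dumps(build_job_options(
    "https://example.com/audio.mp3",
    speaker_channels_count=2,
    remove_disfluencies=True,
))
```

Leaving unset options out of the body, rather than sending null values, lets the service apply its own defaults.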

SDKs

Official SDKs are available for Python, Node.js, Java, C#, and Go. Integration is estimated to take under one hour.