dograh/docs/configurations/transcriber.mdx
2026-06-26 11:57:26 +05:30

10 lines
783 B
Text

---
title: "Transcriber"
description: "Voice Agents use STT (Speech to Text), to transcribe what the user speaks. This transcribed speech as text goes into an LLM to generate the response that gets played out to the user."
---
Dograh platform supports Deepgram, OpenAI, Google, Azure Speech, AssemblyAI, Speechmatics, Cartesia, Gladia, Sarvam, Smallest AI, Hugging Face, and Dograh transcribers. You can take a look at the providers documentation of which language to select for your language requirements.
For locally deployed or self-hosted STT models, Dograh also supports Speaches, an OpenAI API-compatible server for streaming transcription.
Example: Deepgram has their language support documentation at https://developers.deepgram.com/docs/models-languages-overview#nova-3