dograh/docs/configurations/voice.mdx

13 lines
854 B
Text
Raw Normal View History

---
title: "Voice"
description: "Voice Agents use TTS (Text to Speech), which generates audio that LLMs generate during the course of a conversation. This is the audio that the end user having the conversation listens to."
---
2026-06-26 11:57:26 +05:30
Dograh platform supports ElevenLabs, OpenAI, Google, Azure Speech, Deepgram, Cartesia, Smallest AI, MiniMax, Sarvam, Rime, Inworld, Camb.ai, and Dograh TTS engines. There are some voices from the providers that we ship by default. You can refer to the providers API documentation to select a voice ID that's most relevant for your language requirement.
2026-06-26 11:57:26 +05:30
For locally deployed or self-hosted TTS models, Dograh also supports Speaches, an OpenAI API-compatible server for speech generation.
2026-06-26 11:57:26 +05:30
If you don't find your favourite voice, you can always add the voice ID manually.
![Add Voice Manually](../images/add_tts_manually.png)