diff --git a/docs/configurations/llm.mdx b/docs/configurations/llm.mdx index 2aec6717..fc490692 100644 --- a/docs/configurations/llm.mdx +++ b/docs/configurations/llm.mdx @@ -3,10 +3,12 @@ title: "LLM" description: "Voice Agents use LLM (Large Language Models), which are trained to understand the conversational context, and respond to users." --- -You can currently use OpenAI, Google, Groq, Azure and Dograh LLMs in LLM configuration. There are some models provided by default for you to choose from the drop down. +Dograh platform supports OpenAI, Google AI Studio, Google Vertex AI, Azure OpenAI, AWS Bedrock, Groq, OpenRouter, Hugging Face, MiniMax, Sarvam, and Dograh-managed LLMs. There are some models provided by default for you to choose from the drop down. + +For locally deployed or self-hosted LLMs, Dograh also supports OpenAI-compatible endpoints such as Ollama and vLLM. ![Select Models from DropDown](../images/models_dropdown.png) If you don't find a model in the drop down, you can always add a model manually. -![Select Models from DropDown](../images/add_model_manually.png) \ No newline at end of file +![Select Models from DropDown](../images/add_model_manually.png) diff --git a/docs/configurations/transcriber.mdx b/docs/configurations/transcriber.mdx index 874cf415..5b9081ea 100644 --- a/docs/configurations/transcriber.mdx +++ b/docs/configurations/transcriber.mdx @@ -3,6 +3,8 @@ title: "Transcriber" description: "Voice Agents use STT (Speech to Text), to transcribe what the user speaks. This transcribed speech as text goes into an LLM to generate the response that gets played out to the user." --- -Dograh platform ships with Deepgram, Cartesia, OpenAI and Dograh transcribers by default. You can take a look at the providers documentation of which language to select for your language requirements. +Dograh platform supports Deepgram, OpenAI, Google, Azure Speech, AssemblyAI, Speechmatics, Cartesia, Gladia, Sarvam, Smallest AI, Hugging Face, and Dograh transcribers. You can take a look at the providers documentation of which language to select for your language requirements. -Example: Deepgram has their language support documentation at https://developers.deepgram.com/docs/models-languages-overview#nova-3 \ No newline at end of file +For locally deployed or self-hosted STT models, Dograh also supports Speaches, an OpenAI API-compatible server for streaming transcription. + +Example: Deepgram has their language support documentation at https://developers.deepgram.com/docs/models-languages-overview#nova-3 diff --git a/docs/configurations/voice.mdx b/docs/configurations/voice.mdx index 011f8e78..070e81de 100644 --- a/docs/configurations/voice.mdx +++ b/docs/configurations/voice.mdx @@ -3,8 +3,10 @@ title: "Voice" description: "Voice Agents use TTS (Text to Speech), which generates audio that LLMs generate during the course of a conversation. This is the audio that the end user having the conversation listens to." --- -Dograh platform ships with Elevenlabs, Deepgram, OpenAI and Dograh TTS engines by default. There are some voices from the providers that we ship by default. You can refer to the providers API documentation to select a voice ID thats most relevant for your language requirement. +Dograh platform supports ElevenLabs, OpenAI, Google, Azure Speech, Deepgram, Cartesia, Smallest AI, MiniMax, Sarvam, Rime, Inworld, Camb.ai, and Dograh TTS engines. There are some voices from the providers that we ship by default. You can refer to the providers API documentation to select a voice ID that's most relevant for your language requirement. -If you dont find your favourite voice, you can always add the voice ID manually. +For locally deployed or self-hosted TTS models, Dograh also supports Speaches, an OpenAI API-compatible server for speech generation. -![Add Voice Manually](../images/add_tts_manually.png) \ No newline at end of file +If you don't find your favourite voice, you can always add the voice ID manually. + +![Add Voice Manually](../images/add_tts_manually.png)