feat: add voice selectors in elevenlabs (#88)

2026-06-28 08:49:42 +02:00 · 2025-12-25 15:05:53 +05:30 · 2025-12-25 15:05:53 +05:30 · 45c5b7c304
commit 45c5b7c304
parent 480e8a5f60
22 changed files with 978 additions and 166 deletions
--- a/docs/configurations/inference-providers.mdx
+++ b/docs/configurations/inference-providers.mdx
@ -1,18 +1,17 @@
 ---
-title: "Inference Provider"
-description: "Dograh ships with its own inferencing engine, which is hosted at https://services.dograh.com. The inference service provides LLM, TTS and STT services. In this document you can see how you can configure the inferencing engine to your favourite provider, like OpenAI, Gemini etc."
+title: "Model Configurations"
+description: "Voice Agents need AI Models to work, like LLM (Large Language Model), TTS (Voice) and STT (Transcriber). You can use any of your favourite providers with Dograh Platform to run your Voice Agent."
 ---

-## Configure Inference Provider
+## Configure Models
+Dograh Platform ships with its own models by default. When you sign up on https://app.dograh.com or set up the platform on your self-hosted infrastructure, you get some Dograh model credits by default.

-You can go to `https://app.dograh.com/model-configurations` if you are on hosted version of Dograh or go to `http://localhost:3010/model-configurations` if you are running Dograh locally.
+If you wish to change the models to a provider of your own choice, you can go to `https://app.dograh.com/model-configurations` if you are on the hosted version of Dograh, or to `http://localhost:3010/model-configurations` if you are running Dograh locally.

 You can see the configuration for the inference provider in the following screenshot.

 ![Model Configuration](../images/service-configuration.png)

-You can select the provider from the dropdown and configure the API key, model, etc. You can see [API Keys](api-keys) documentation for instructions on how to create Service Keys to be used in Model Configuration.
+You can select the provider from the dropdown and configure the API key, model, etc. For Dograh, see the [Service Keys](api-keys) documentation for instructions on how to create Service Keys to be used in Model Configuration.

-## Next Steps
-
-You can see how to configure the telephony provider in [Telephony Integrations](/telephony/twilio).
+The pages that follow document in depth the various AI Models that you can configure.
--- a/docs/configurations/llm.mdx
+++ b/docs/configurations/llm.mdx
@ -0,0 +1,12 @@
+---
+title: "LLM"
+description: "Voice Agents use LLMs (Large Language Models), which are trained to understand the conversational context and respond to users."
+---
+
+You can currently use OpenAI, Google, Groq, Azure and Dograh LLMs in the LLM configuration. Some models are provided by default for you to choose from the dropdown.
+
+![Select Models from DropDown](../images/models_dropdown.png)
+
+If you don't find a model in the dropdown, you can always add a model manually.
+
+![Add Model Manually](../images/add_model_manually.png)
--- a/docs/configurations/transcriber.mdx
+++ b/docs/configurations/transcriber.mdx
@ -0,0 +1,8 @@
+---
+title: "Transcriber"
+description: "Voice Agents use STT (Speech to Text) to transcribe what the user says. The transcribed text goes into an LLM to generate the response that is played back to the user."
+---
+
+Dograh platform ships with Deepgram, Cartesia, OpenAI and Dograh transcribers by default. Refer to each provider's documentation to see which language option to select for your language requirements.
+
+Example: Deepgram documents its language support at https://developers.deepgram.com/docs/models-languages-overview#nova-3
--- a/docs/configurations/voice.mdx
+++ b/docs/configurations/voice.mdx
@ -0,0 +1,10 @@
+---
+title: "Voice"
+description: "Voice Agents use TTS (Text to Speech) to turn the text that LLMs generate during the course of a conversation into audio. This is the audio that the end user having the conversation listens to."
+---
+
+Dograh platform ships with ElevenLabs, Deepgram, OpenAI and Dograh TTS engines by default. Some voices from these providers are available by default. You can refer to each provider's API documentation to select the voice ID that's most relevant for your language requirement.
+
+If you don't find your favourite voice, you can always add the voice ID manually.
+
+![Add Voice Manually](../images/add_tts_manually.png)
--- a/docs/docs.json
+++ b/docs/docs.json
@ -30,6 +30,9 @@
            "group": "Configurations",
            "pages": [
              "configurations/inference-providers",
+              "configurations/llm",
+              "configurations/voice",
+              "configurations/transcriber",
              "configurations/api-keys"
            ]
          },
--- a/docs/images/add_model_manually.png
+++ b/docs/images/add_model_manually.png
--- a/docs/images/add_tts_manually.png
+++ b/docs/images/add_tts_manually.png
--- a/docs/images/models_dropdown.png
+++ b/docs/images/models_dropdown.png