mirror of
https://github.com/dograh-hq/dograh.git
synced 2026-06-28 08:49:42 +02:00
feat: add voice selectors in elevenlabs (#88)
This commit is contained in:
parent
480e8a5f60
commit
45c5b7c304
22 changed files with 978 additions and 166 deletions
|
|
@ -1,18 +1,17 @@
|
|||
---
|
||||
title: "Inference Provider"
|
||||
description: "Dograh ships with its own inferencing engine, which is hosted at https://services.dograh.com. The inference service provides LLM, TTS and STT services. In this document you can see how you can configure the inferencing engine to your favourite provider, like OpenAI, Gemini etc."
|
||||
title: "Model Configurations"
|
||||
description: "Voice Agents need AI Models to work, like LLM (Large Language Model), TTS (Voice) and STT (Transcriber). You can use any of your faviourite providers with Dograh Platform to run your Voice Agent."
|
||||
---
|
||||
|
||||
## Configure Inference Provider
|
||||
## Configure Models
|
||||
Dograh Platform ships with its own models by default. When you sign up on https://app.dograh.com or you setup the platform on your self hosted infrastructure, you get some Dograh model credits by default.
|
||||
|
||||
You can go to `https://app.dograh.com/model-configurations` if you are on hosted version of Dograh or go to `http://localhost:3010/model-configurations` if you are running Dograh locally.
|
||||
If you wish to change the models to a provider of your own choice, ou can go to `https://app.dograh.com/model-configurations` if you are on hosted version of Dograh or go to `http://localhost:3010/model-configurations` if you are running Dograh locally.
|
||||
|
||||
You can see the configuration for the inference provider in the following screenshot.
|
||||
|
||||

|
||||
|
||||
You can select the provider from the dropdown and configure the API key, model, etc. You can see [API Keys](api-keys) documentation for instructions on how to create Service Keys to be used in Model Configuration.
|
||||
You can select the provider from the dropdown and configure the API key, model, etc. For Dograh, you can see [Service Keys](api-keys) documentation for instructions on how to create Service Keys to be used in Model Configuration.
|
||||
|
||||
## Next Steps
|
||||
|
||||
You can see how to configure the telephony provider in [Telephony Integrations](/telephony/twilio).
|
||||
Next there are some in depth documentation of various AI Models that you can configure.
|
||||
12
docs/configurations/llm.mdx
Normal file
12
docs/configurations/llm.mdx
Normal file
|
|
@ -0,0 +1,12 @@
|
|||
---
|
||||
title: "LLM"
|
||||
description: "Voice Agents use LLM (Large Language Models), which are trained to understand the conversational context, and respond to users."
|
||||
---
|
||||
|
||||
You can currently use OpenAI, Google, Groq, Azure and Dograh LLMs in LLM configuration. There are some models provided by default for you to choose from the drop down.
|
||||
|
||||

|
||||
|
||||
If you don't find a model in the drop down, you can always add a model manually.
|
||||
|
||||

|
||||
8
docs/configurations/transcriber.mdx
Normal file
8
docs/configurations/transcriber.mdx
Normal file
|
|
@ -0,0 +1,8 @@
|
|||
---
|
||||
title: "Transcriber"
|
||||
description: "Voice Agents use STT (Speech to Text), to transcribe what the user speaks. This transcribed speech as text goes into an LLM to generate the response that gets played out to the user."
|
||||
---
|
||||
|
||||
Dograh platform ships with Deepgram, Cartesia, OpenAI and Dograh transcribers by default. You can take a look at the providers documentation of which language to select for your language requirements.
|
||||
|
||||
Example: Deepgram has their language support documentation at https://developers.deepgram.com/docs/models-languages-overview#nova-3
|
||||
10
docs/configurations/voice.mdx
Normal file
10
docs/configurations/voice.mdx
Normal file
|
|
@ -0,0 +1,10 @@
|
|||
---
|
||||
title: "Voice"
|
||||
description: "Voice Agents use TTS (Text to Speech), which generates audio that LLMs generate during the course of a conversation. This is the audio that the end user having the conversation listens to."
|
||||
---
|
||||
|
||||
Dograh platform ships with Elevenlabs, Deepgram, OpenAI and Dograh TTS engines by default. There are some voices from the providers that we ship by default. You can refer to the providers API documentation to select a voice ID thats most relevant for your language requirement.
|
||||
|
||||
If you dont find your favourite voice, you can always add the voice ID manually.
|
||||
|
||||

|
||||
|
|
@ -30,6 +30,9 @@
|
|||
"group": "Configurations",
|
||||
"pages": [
|
||||
"configurations/inference-providers",
|
||||
"configurations/llm",
|
||||
"configurations/voice",
|
||||
"configurations/transcriber",
|
||||
"configurations/api-keys"
|
||||
]
|
||||
},
|
||||
|
|
|
|||
BIN
docs/images/add_model_manually.png
Normal file
BIN
docs/images/add_model_manually.png
Normal file
Binary file not shown.
|
After Width: | Height: | Size: 105 KiB |
BIN
docs/images/add_tts_manually.png
Normal file
BIN
docs/images/add_tts_manually.png
Normal file
Binary file not shown.
|
After Width: | Height: | Size: 210 KiB |
BIN
docs/images/models_dropdown.png
Normal file
BIN
docs/images/models_dropdown.png
Normal file
Binary file not shown.
|
After Width: | Height: | Size: 110 KiB |
Loading…
Add table
Add a link
Reference in a new issue