dograh

mirror of https://github.com/dograh-hq/dograh.git synced 2026-06-19 08:28:10 +02:00

Author	SHA1	Message	Date
Harshita Jain	e79cb42f31	feat: add Smallest AI TTS and STT provider integration (#444 ) * feat: add Smallest AI TTS and STT provider integration Integrates Smallest AI's Waves (TTS) and Pulse (STT) APIs as selectable providers in the Dograh platform. Dograh's pipecat fork already contains the pipecat-level service implementations; this wires them into the API configuration registry and service factory. - Added `SMALLEST = "smallest"` to `ServiceProviders` enum - Registered `SmallestAITTSConfiguration` (lightning-v3.1/v2, voices, language, speed) and `SmallestAISTTConfiguration` (pulse model, 30+ languages) Pydantic config classes with the TTS/STT registries - Added factory branches in `create_tts_service` and `create_stt_service` routing to `SmallestTTSService` and `SmallestSTTService` from pipecat * fix: update Smallest AI models to v4 naming convention - TTS: rename lightning-v3.1 → lightning_v3.1, add lightning_v3.1_pro, drop deprecated lightning-v2 - STT: keep pulse only (pulse-pro is not a streaming model) * fix: change default TTS voice from emily to sophia for lightning_v3.1 emily is not a verified lightning_v3.1 voice; sophia is the pipecat SmallestTTSService default and confirmed to work with the standard pool. * fix: replace 9 invalid lightning_v3.1 voice IDs with verified ones jasmine, james, michael, aria, lara, asel, sarah, rishi, deepika do not exist in the lightning_v3.1 voice catalog. Replaced with avery, liam, lucas, olivia, freya, devansh, maya, dhruv, maithili — all verified against the API. * fix: smallest ai config validation and tts model compatibility * chore: ruff fix * chore: updated smallest ai schema in openapi.json --------- Co-authored-by: Sabiha Khan <sabihak89@gmail.com> Co-authored-by: Sabiha Khan <87858386+chewwbaka@users.noreply.github.com>	2026-06-17 12:55:53 +05:30
Abhishek Kumar	dd3f2e7323	feat: add huggingface inferece provider endpoint	2026-06-15 22:56:01 +05:30
Abhishek	1f1149f4d5	feat: billing and credit management v2 (#429 ) * feat: use mps generated correlation ID * chore: update pipecat submodule * feat: add credit purchase URL * feat: carve out billing page and show credit ledger * feat: deprecate dograh based quota tracking * fix: remove cost calculation from dograh codebase * fix: create mps account on migrate to v2 * chore: update pipecat	2026-06-12 14:55:30 +05:30
Vishal Dhateria	7ba95c0fbe	feat: add Azure AI multi-provider support (TTS, STT, Embeddings, Realtime) (#381 ) * feat: add Azure AI multi-provider support (TTS, STT, Embeddings, Realtime) Enables Azure AI services across all model layers so users with Azure credits can consolidate billing on a single provider. - Voice (TTS): AzureSpeechTTSConfiguration via azure_speech provider - Transcriber (STT): AzureSpeechSTTConfiguration via azure_speech provider - Embedding: AzureOpenAIEmbeddingsConfiguration via azure provider - Realtime: AzureRealtimeLLMConfiguration via azure_realtime provider New files: - api/services/pipecat/realtime/azure_realtime.py - api/services/gen_ai/embedding/azure_openai_service.py - api/tests/test_azure_speech_service_factory.py The UI picks up all four providers automatically from the schema — no frontend changes required. Co-Authored-By: Claude Sonnet 4.6 <noreply@anthropic.com> * fix: add validation for URL and params --------- Co-authored-by: Vishal Dhateria <vishal@finela.ai> Co-authored-by: Claude Sonnet 4.6 <noreply@anthropic.com> Co-authored-by: Abhishek Kumar <abhishek@a6k.me>	2026-06-02 12:50:00 +05:30
developer603	8a4a2e25db	feat: allow overriding base URL of OpenAI STT and TTS (#377 ) Mirrors the LLM treatment from #368 for the OpenAI STT and OpenAI TTS providers. Users running OpenAI-compatible self-hosted services (vLLM, Speaches, llama.cpp, custom proxies) can now point Dograh at them via the OpenAI provider with `base_url`, instead of being forced onto the Speaches provider as a workaround. Changes: * `registry.py` — add `base_url` field (default `https://api.openai.com/v1`) to `OpenAISTTConfiguration` and `OpenAITTSService`, identical in shape to the existing `OpenAILLMService.base_url` from #368. * `service_factory.py` — in the OPENAI branches of `create_stt_service` and `create_tts_service`, lift `base_url` off the user config, run it through `_validate_runtime_service_url`, and forward it as a kwarg to `OpenAISTTService` / `OpenAITTSService` (both already accept it). Same pattern as the LLM branch. * `test_user_configured_service_url_security.py` — adds four runtime validation tests covering private-IP rejection and localhost rejection in SaaS mode for both STT and TTS. Existing OSS-mode permissiveness is unchanged (DEPLOYMENT_MODE=oss skips the validator, as before). No schema migration needed — Pydantic populates the default; existing configurations without `base_url` continue to talk to api.openai.com. `check_validity.py` requires no edits because the per-service validation loop already iterates `("base_url", "endpoint")` via `getattr`, and the `_check_openai_api_key` dispatcher already routes OPENAI providers through the base_url-aware code path (introduced in #368) for STT and TTS too. Tests pass locally: pytest api/tests/test_user_configured_service_url_security.py 23 passed in 4.80s (19 existing + 4 new) Co-authored-by: developer603 <developer603@users.noreply.github.com>	2026-06-02 12:06:58 +05:30
Abhay Babbar	98d2b24cba	Add Sarvam LLM, update Sarvam STT models, expose usage_info on run detail (#351 ) * Add Sarvam LLM provider, update Sarvam STT models, expose usage_info on run detail. Depends on pipecat PR dograh-hq/pipecat#43 for STT string language support. Submodule bump will follow after that merges. * test: cover Sarvam STT language mapping; link Sarvam docs --------- Co-authored-by: Sabiha Khan <sabihak89@gmail.com>	2026-06-01 10:29:31 +05:30
Abhishek	8a58b0992d	feat: allow overriding base URL of OpenAI models (#368 ) * Add OpenAI-compatible API option in model configuration Backend-only cherry-pick from `20617db37a`. * Potential fix for pull request finding Co-authored-by: Copilot Autofix powered by AI <175728472+Copilot@users.noreply.github.com> * fix: harden the base url settings in SaaS mode --------- Co-authored-by: Chris Briddock <briddockchristopher@gmail.com> Co-authored-by: Copilot Autofix powered by AI <175728472+Copilot@users.noreply.github.com>	2026-05-27 13:07:45 +05:30
Abhishek	3892b58486	feat: add ultravox realtime and fix signature issue in telephony (#345 ) * feat: add ultravox realtime and fix signature issue in telephony - Add UltraVox realtime - Fix signature issue on telephony * fix: fix regression for wss_backend_endpoint	2026-05-23 12:51:55 +05:30
Abhishek Kumar	9135c2da13	feat: add xai grok as realtime model	2026-05-22 18:04:59 +05:30
Abhishek Kumar	291264de7b	Merge branch 'main' of https://github.com/dograh-hq/dograh	2026-05-22 14:36:54 +05:30
Abhishek Kumar	ad2fa07058	feat: add google stt and tts. add folders to organize agents	2026-05-22 14:36:50 +05:30
Octopus	0e0d3136ca	feat: add MiniMax provider support (Chat + TTS) (#309 ) * feat: add MiniMax provider support (Chat + TTS) - Add MiniMax LLM provider using OpenAI-compatible API - Models: MiniMax-M2.7, MiniMax-M2.7-highspeed - Default base URL: https://api.minimax.io/v1 - Uses MINIMAX_API_KEY for authentication - Add MiniMax TTS provider using Pipecat's MiniMaxHttpTTSService - Models: speech-2.8-hd (default), speech-2.8-turbo - 6 built-in voices - Requires group_id configuration - Add unit tests for both providers * fix(minimax): validator, temperature, session cleanup, reasoning filter - check_validity.py: wire MiniMax into _validator_map and enforce group_id at save time. Without this, saving a config with a valid key was rejected. - registry.py: surface temperature on the LLM config (gt=0; MiniMax rejects 0) and base_url on the TTS config - service_factory.py: * Plumb temperature through create_llm_service * Normalize TTS base_url to include /t2a_v2 — pipecat appends only ?GroupId=... to the URL. * Use the new MiniMaxLLMService (from pipecat) to strip <think>...</think> reasoning that MiniMax-M2.7 emits inline in delta.content (otherwise it leaks straight to TTS). * Use MiniMaxOwnedSessionTTSService so the per-instance aiohttp session gets closed in cleanup() instead of leaking sockets/FDs. - minimax_tts.py: small wrapper around MiniMaxHttpTTSService that owns the session it was handed (pipecat's caller-owns-session API conflicts with the ftory's per-instance pattern). - pipecat submodule: bumps to a commit that adds MiniMaxLLMService — a thin OpenAILLMService subclass with the streaming <think> filter (mirrors NvidiaLLMService's pattern for NIM reasoning models). - Tests updated/added for all of the above. Co-Authored-By: Claude Opus 4.7 (1M context) <noreply@anthropic.com> --------- Co-authored-by: octo-patch <octo-patch@github.com> Co-authored-by: Sabiha Khan <sabihak89@gmail.com>	2026-05-22 13:09:41 +05:30
Abhishek	2381a803ad	feat: add openai realtime models (#298 ) * feat: add openai realtime models * chore: bump pipecat * fix: resample telephony audio for openai realtime * fix: sampling rate fix for openai realtime * chore: clean up dead code	2026-05-16 18:05:23 +05:30
Abhishek	7f0dac1ad5	feat: configurable ElevenLabs base URL for Data Residency (#278 ) * feat: configurable ElevenLabs base URL for Data Residency (#269) Adds a `base_url` field to `ElevenlabsTTSConfiguration` so users on an ElevenLabs Data Residency plan (EU, etc.) can point Dograh at the regional endpoint instead of the hardcoded global one. Defaults to `https://api.elevenlabs.io`, preserving existing behaviour. The service factory rewrites the HTTP scheme to WSS when constructing the WebSocket TTS service. Co-Authored-By: Claude Opus 4.7 (1M context) <noreply@anthropic.com> * fix: fix drift --------- Co-authored-by: Claude Opus 4.7 (1M context) <noreply@anthropic.com>	2026-05-12 19:01:13 +05:30
Abhishek Kumar	74dbafb055	feat: allow recordings in tool transitions	2026-04-10 16:18:01 +05:30
Abhishek Kumar	9decdb2f4b	fix: send volume in cartesia	2026-04-08 23:20:14 +05:30
Abhishek	38d1d928b7	feat: agent versioning and model configurations override (#227 ) * feat: add tests and migrations * feat: workflow versioning among published and draft * feat: add a new settings page to simplify workflow detail page * fix: fix tsclient generation	2026-04-08 19:20:31 +05:30
Abhishek Kumar	e04ce4e852	chore: add language option for Rime	2026-04-07 18:32:09 +05:30
Abhishek Kumar	e255b33813	feat: add Rime TTS	2026-04-07 14:05:47 +05:30
drascom	95d6dd44ff	fix: Speaches STT service wiring * Fix Speaches STT service wiring * chore: bump pipecat submodule --------- Co-authored-by: drascom <drascom@drascoms-MacBook-Pro.local> Co-authored-by: Abhishek Kumar <abhishek@a6k.me>	2026-04-06 14:11:58 +05:30
Abhishek Kumar	c4c4b591db	feat: add gladia stt support	2026-04-04 14:47:48 +05:30
Abhishek Kumar	56763a4527	feat: enable context summarization	2026-04-03 13:39:02 +05:30
Abhishek Kumar	501d06c00d	feat: add Assembly AI STT	2026-04-03 07:10:37 +05:30
Abhishek	87e72d5f6f	feat: add gemini live and speaches integration (#220 ) * feat: add speaches models * feat: add gemini realtime and speaches integration - Add gemini realtime support - Add speaches support for locally hosted LLMs * chore: bump pipecat * feat: add language option * fix: add skip aggregator types to tts settings * fix: make API key optional for realtime	2026-03-31 21:42:03 +05:30
Abhishek Kumar	ac0731a374	feat: add support for self hosted llm models	2026-03-24 17:50:45 +05:30
neil from camb.ai	31e075d114	feat: add CAMB AI TTS integration (#187 ) Co-authored-by: Abhishek <abhishek@a6k.me>	2026-03-24 12:54:07 +05:30
Abhishek Kumar	f8cf433ba3	feat: add speed configuration for cartesia	2026-03-23 21:51:16 +05:30
Abhishek Kumar	fe84f086ba	feat: add AWS Bedrock support	2026-03-19 15:06:59 +05:30
Abhishek	494c60d774	feat: add hybrid text + recording functionality in agents (#191 ) * feat: add recording feature in agents * chore: pin pipecat version * feat: show usage in UI * chore: update pipecat	2026-03-16 15:04:08 +05:30
Abhishek Kumar	771781096e	feat: add early voicemail detection	2026-03-07 12:41:24 +05:30
Abhishek Kumar	e111cbb36d	feat: add cartesia tts	2026-02-20 20:41:11 +05:30
Abhishek	7552b6c819	feat: add asterisk ARI websocket interface (#159 ) * chore: remove old files * feat: ari outbound dialing * feat: add websocket configuration for ARI * feat: handling inbound calls * delete ext channel from redis on stasis end * fix: add lock in workflow run update, refactor _handle_stasis_start * chore: update submodule --------- Co-authored-by: Sabiha Khan <sabihak89@gmail.com>	2026-02-17 19:32:03 +05:30
Abhishek Kumar	4c8c9516dc	chore: update pipecat submodule	2026-02-11 14:15:19 +05:30
Abhishek Kumar	7a102026fb	fix: send sample rate to STT services	2026-02-09 16:25:09 +05:30
Abhishek Kumar	4c936ae57d	feat: add openrouter support	2026-02-09 13:31:32 +05:30
Abhishek	db75d90535	feat: add dictionary support for STT boosting in voice agents (#136 ) * feat: add dictionary support for voice agents Also fixes #132 * chore: add keyterms in evals	2026-01-29 11:20:07 +05:30
Abhishek	911c5ed416	fix: changes to update pipecat version to 0.0.100 (#122 ) * feat: add stt evals * add smart turn as provider * chore: remove deprecations * chore: format files * fix: remove deprecated UserIdleProcessor * fix: remove deprecated TranscriptProcessor * chore: update pipecat submodule * feat: add evals visualisation * fix: trigger llm generation on client connected and pipeline started * chore: update pipecat * chore: update pipecat submodule * Add tests * fix: slow loading of workflow page * chore: update pipecat submodule * Show version after release * Fixes #99 * fix: provider check for websocket connection * Fixes #107 * Fix #96 * chore: fix documentation * fix: cloudonix campaign call error --------- Co-authored-by: Sabiha Khan <sabihak89@gmail.com>	2026-01-23 18:53:59 +05:30
Abhishek Kumar	c58aa557de	feat: add voices in Dograh configuration	2026-01-19 14:52:54 +05:30
Abhishek Kumar	d35eeb1b7b	fix: fix OPENAI_API_KEY bug in retrieval	2026-01-17 18:12:56 +05:30
Abhishek Kumar	1f3b3e2e3c	chore: minor fixes	2026-01-13 14:55:48 +05:30
Abhishek	edf0fa4fbc	fix: migrate from custom audio recorder to native AudioBuffer (#115 ) * fix: update to pipecat VM Detector * fix: refactor to remove audio synchronizer * feat: add speechmatics as STT	2026-01-08 18:03:26 +05:30
Abhishek Kumar	cc2d3e70d2	chore: render initial and gathered context	2025-12-31 22:02:50 +05:30
Abhishek Kumar	e83f3a36d2	fix: change type definition from enum to str for consistency	2025-12-26 16:00:02 +05:30
Abhishek Kumar	74b069354b	fix: fix configuration option	2025-12-25 19:37:00 +05:30
Abhishek	45c5b7c304	feat: add voice selectors in elevenlabs (#88 )	2025-12-25 15:05:53 +05:30
Abhishek	909c258b6a	fix: prevent pipeline freezes when sending endframe (#77 ) * fix: dont cancel task if call is already ending * Update pipecat	2025-12-10 08:22:37 +07:00
Sabiha Khan	0a8ce3f644	fix: add text filter for tts and logs for filter (#74 )	2025-12-09 16:24:24 +05:30
Abhishek	a7bf64a02b	feat: add llm models in Dograh (#64 )	2025-11-25 17:11:04 +05:30
Abhishek	ed3ceaf5ad	feat: allow www domain for embedded websites (#60 )	2025-11-21 22:11:46 +05:30
Abhishek Kumar	0345df6fbe	Optimise requirements.txt and update pipecat imports	2025-09-20 14:07:00 +05:30

1 2

51 commits