dograh

mirror of https://github.com/dograh-hq/dograh.git synced 2026-06-07 07:55:16 +02:00

Author	SHA1	Message	Date
Abhishek Kumar	418592178c	chore: update docs for pre-call data fetch	2026-06-05 14:16:56 +05:30
Vishal Dhateria	7ba95c0fbe	feat: add Azure AI multi-provider support (TTS, STT, Embeddings, Realtime) (#381 ) * feat: add Azure AI multi-provider support (TTS, STT, Embeddings, Realtime) Enables Azure AI services across all model layers so users with Azure credits can consolidate billing on a single provider. - Voice (TTS): AzureSpeechTTSConfiguration via azure_speech provider - Transcriber (STT): AzureSpeechSTTConfiguration via azure_speech provider - Embedding: AzureOpenAIEmbeddingsConfiguration via azure provider - Realtime: AzureRealtimeLLMConfiguration via azure_realtime provider New files: - api/services/pipecat/realtime/azure_realtime.py - api/services/gen_ai/embedding/azure_openai_service.py - api/tests/test_azure_speech_service_factory.py The UI picks up all four providers automatically from the schema — no frontend changes required. Co-Authored-By: Claude Sonnet 4.6 <noreply@anthropic.com> * fix: add validation for URL and params --------- Co-authored-by: Vishal Dhateria <vishal@finela.ai> Co-authored-by: Claude Sonnet 4.6 <noreply@anthropic.com> Co-authored-by: Abhishek Kumar <abhishek@a6k.me>	2026-06-02 12:50:00 +05:30
developer603	8a4a2e25db	feat: allow overriding base URL of OpenAI STT and TTS (#377 ) Mirrors the LLM treatment from #368 for the OpenAI STT and OpenAI TTS providers. Users running OpenAI-compatible self-hosted services (vLLM, Speaches, llama.cpp, custom proxies) can now point Dograh at them via the OpenAI provider with `base_url`, instead of being forced onto the Speaches provider as a workaround. Changes: * `registry.py` — add `base_url` field (default `https://api.openai.com/v1`) to `OpenAISTTConfiguration` and `OpenAITTSService`, identical in shape to the existing `OpenAILLMService.base_url` from #368. * `service_factory.py` — in the OPENAI branches of `create_stt_service` and `create_tts_service`, lift `base_url` off the user config, run it through `_validate_runtime_service_url`, and forward it as a kwarg to `OpenAISTTService` / `OpenAITTSService` (both already accept it). Same pattern as the LLM branch. * `test_user_configured_service_url_security.py` — adds four runtime validation tests covering private-IP rejection and localhost rejection in SaaS mode for both STT and TTS. Existing OSS-mode permissiveness is unchanged (DEPLOYMENT_MODE=oss skips the validator, as before). No schema migration needed — Pydantic populates the default; existing configurations without `base_url` continue to talk to api.openai.com. `check_validity.py` requires no edits because the per-service validation loop already iterates `("base_url", "endpoint")` via `getattr`, and the `_check_openai_api_key` dispatcher already routes OPENAI providers through the base_url-aware code path (introduced in #368) for STT and TTS too. Tests pass locally: pytest api/tests/test_user_configured_service_url_security.py 23 passed in 4.80s (19 existing + 4 new) Co-authored-by: developer603 <developer603@users.noreply.github.com>	2026-06-02 12:06:58 +05:30
Abhay Babbar	98d2b24cba	Add Sarvam LLM, update Sarvam STT models, expose usage_info on run detail (#351 ) * Add Sarvam LLM provider, update Sarvam STT models, expose usage_info on run detail. Depends on pipecat PR dograh-hq/pipecat#43 for STT string language support. Submodule bump will follow after that merges. * test: cover Sarvam STT language mapping; link Sarvam docs --------- Co-authored-by: Sabiha Khan <sabihak89@gmail.com>	2026-06-01 10:29:31 +05:30
Abhishek Kumar	0c0b8383bf	fix: fix rtf logs and gemini live turn taking	2026-05-31 16:05:03 +05:30
Abhishek Kumar	c586d02d5d	feat: abort immediately on max call duration exceed	2026-05-31 13:21:37 +05:30
Abhishek	5ef3be92b5	chore: update pipecat to 1.3.0 (#379 ) * chore: rename PipelineTask to PipelineWorker * fix: fix tests * chore: update pipecat submodule * fix: fix anyio same task cancellation scope	2026-05-29 16:19:42 +05:30
Abhishek	8a58b0992d	feat: allow overriding base URL of OpenAI models (#368 ) * Add OpenAI-compatible API option in model configuration Backend-only cherry-pick from `20617db37a`. * Potential fix for pull request finding Co-authored-by: Copilot Autofix powered by AI <175728472+Copilot@users.noreply.github.com> * fix: harden the base url settings in SaaS mode --------- Co-authored-by: Chris Briddock <briddockchristopher@gmail.com> Co-authored-by: Copilot Autofix powered by AI <175728472+Copilot@users.noreply.github.com>	2026-05-27 13:07:45 +05:30
Abhishek	3892b58486	feat: add ultravox realtime and fix signature issue in telephony (#345 ) * feat: add ultravox realtime and fix signature issue in telephony - Add UltraVox realtime - Fix signature issue on telephony * fix: fix regression for wss_backend_endpoint	2026-05-23 12:51:55 +05:30
Abhishek Kumar	9135c2da13	feat: add xai grok as realtime model	2026-05-22 18:04:59 +05:30
Abhishek Kumar	291264de7b	Merge branch 'main' of https://github.com/dograh-hq/dograh	2026-05-22 14:36:54 +05:30
Abhishek Kumar	ad2fa07058	feat: add google stt and tts. add folders to organize agents	2026-05-22 14:36:50 +05:30
Octopus	0e0d3136ca	feat: add MiniMax provider support (Chat + TTS) (#309 ) * feat: add MiniMax provider support (Chat + TTS) - Add MiniMax LLM provider using OpenAI-compatible API - Models: MiniMax-M2.7, MiniMax-M2.7-highspeed - Default base URL: https://api.minimax.io/v1 - Uses MINIMAX_API_KEY for authentication - Add MiniMax TTS provider using Pipecat's MiniMaxHttpTTSService - Models: speech-2.8-hd (default), speech-2.8-turbo - 6 built-in voices - Requires group_id configuration - Add unit tests for both providers * fix(minimax): validator, temperature, session cleanup, reasoning filter - check_validity.py: wire MiniMax into _validator_map and enforce group_id at save time. Without this, saving a config with a valid key was rejected. - registry.py: surface temperature on the LLM config (gt=0; MiniMax rejects 0) and base_url on the TTS config - service_factory.py: * Plumb temperature through create_llm_service * Normalize TTS base_url to include /t2a_v2 — pipecat appends only ?GroupId=... to the URL. * Use the new MiniMaxLLMService (from pipecat) to strip <think>...</think> reasoning that MiniMax-M2.7 emits inline in delta.content (otherwise it leaks straight to TTS). * Use MiniMaxOwnedSessionTTSService so the per-instance aiohttp session gets closed in cleanup() instead of leaking sockets/FDs. - minimax_tts.py: small wrapper around MiniMaxHttpTTSService that owns the session it was handed (pipecat's caller-owns-session API conflicts with the ftory's per-instance pattern). - pipecat submodule: bumps to a commit that adds MiniMaxLLMService — a thin OpenAILLMService subclass with the streaming <think> filter (mirrors NvidiaLLMService's pattern for NIM reasoning models). - Tests updated/added for all of the above. Co-Authored-By: Claude Opus 4.7 (1M context) <noreply@anthropic.com> --------- Co-authored-by: octo-patch <octo-patch@github.com> Co-authored-by: Sabiha Khan <sabihak89@gmail.com>	2026-05-22 13:09:41 +05:30
Abhishek	d97d1d72cd	feat: add chat based testing for voice agent (#308 ) * feat: add backend foundations * feat: add text chat UI * chore: simplify the reload behaviour * fix: fix upgrade banner to be triggered after package upload * feat: simplify TesterPanel design * chore: fix formatting and generate client * chore: fix tracing for text chat mode * fix: fix revert and edit CTA * refactor: refactor TesterPanel into smaller components * feat: enable runtime transition of nodes * fix: fix review comments	2026-05-21 15:20:02 +05:30
Mohamed-Mamdouh	5f28c1b2a9	feat: add Tuner Integration to Dograh (#311 ) * Add tuner integration * bump pipecat version * chore: update pipecat submodule to match upstream and use tuner-pipecat-sdk 0.2.0 Update pipecat submodule from 0.0.109.dev23 to 13e98d0d9 (the exact commit upstream dograh-hq/dograh uses after v1.30.1). This installs pipecat-ai as 1.1.0.post277 via setuptools_scm, satisfying tuner-pipecat-sdk 0.2.0's pipecat-ai>=1.0.0 requirement. Co-Authored-By: Claude Sonnet 4.6 <noreply@anthropic.com> * wire tuner * feat: refactor integrations into self contained packages * chore: simplify ensure_public_access_token * fix: remove NodeSpec and make DTOs the source of truth * feat: send relevant signal to mcp using to_mcp_dict * fix: fix tests * cleanup: remove nango integrations * feat: add agents.md for integrations --------- Co-authored-by: Claude Sonnet 4.6 <noreply@anthropic.com> Co-authored-by: Abhishek Kumar <abhishek@a6k.me>	2026-05-20 14:37:33 +05:30
Abhishek	2381a803ad	feat: add openai realtime models (#298 ) * feat: add openai realtime models * chore: bump pipecat * fix: resample telephony audio for openai realtime * fix: sampling rate fix for openai realtime * chore: clean up dead code	2026-05-16 18:05:23 +05:30
Abhishek	45b00cd5d0	chore: remove looptalk (#299 ) * chore: remove looptalk Remove looptalk in the current version. We will be rethinking looptalk in a fresh way. * chore: formatting fix	2026-05-16 17:45:12 +05:30
Abhishek	7f0dac1ad5	feat: configurable ElevenLabs base URL for Data Residency (#278 ) * feat: configurable ElevenLabs base URL for Data Residency (#269) Adds a `base_url` field to `ElevenlabsTTSConfiguration` so users on an ElevenLabs Data Residency plan (EU, etc.) can point Dograh at the regional endpoint instead of the hardcoded global one. Defaults to `https://api.elevenlabs.io`, preserving existing behaviour. The service factory rewrites the HTTP scheme to WSS when constructing the WebSocket TTS service. Co-Authored-By: Claude Opus 4.7 (1M context) <noreply@anthropic.com> * fix: fix drift --------- Co-authored-by: Claude Opus 4.7 (1M context) <noreply@anthropic.com>	2026-05-12 19:01:13 +05:30
Abhishek	e2fe1f3cd4	feat: enable FORCE_TURN_RELAY to diagnose turn connectivity for local deployment setups (#272 ) * filter out local sdp candidates on non local environment * feat: add FORCE_TURN_RELAY variable * add FORCE_TURN_RELAY option in docker-compose * fix: fix github workflow	2026-05-11 17:13:01 +05:30
Abhishek Kumar	5a358d4d29	feat: add workflow graph constraints fixtures	2026-05-08 16:02:51 +05:30
Abhishek Kumar	6d93be3ef6	fix: number pool initialization in multi telephony setup If there are multiple telephony configurations, the form number should be initialized from the campaigns given telephonic configuration rather than the organization default telephonic configuration.	2026-05-08 14:48:53 +05:30
Abhishek Kumar	025bc14392	feat: add voicemail detection in realtime branch	2026-05-06 17:50:02 +05:30
Abhishek	d4b6afb020	feat: add logs in campaigns for failure or pausing (#265 ) * feat: add logs in campaigns on failure * chore: bump pipecat * chore: update format.sh * chore: fix github workflow * fix: fix formatting errors	2026-05-05 19:23:50 +05:30
Abhishek Kumar	abfb678b4d	chore: bump pipecat	2026-05-05 15:59:12 +05:30
Abhishek	0e12c41fc7	chore: bump pipecat version and fix tests (#263 ) * chore: bump pipecat version and fix tests * chore: add github workflow to run tests * fix: install reqirements.dev.txt in test script * fix: fix api-test action * feat: add integration test * test: add integration tests * test: add test for function call mute strategy	2026-05-04 21:35:37 +05:30
Abhishek Kumar	91a62178c1	chore: add telephony configuration docs	2026-05-02 17:37:48 +05:30
Abhishek	e16f6438bd	feat: refactor telephony to support multiple telephony configurations (#251 ) Co-authored-by: Sabiha Khan <sabihak89@gmail.com>	2026-04-29 11:39:57 +05:30
dilipevents2007-cpu	2218ba8ad9	feat: add Plivo telephony provider support (#245 ) * Add Plivo telephony provider support * add Plivo telephony UI, fix audio config, and improve inbound call handling --------- Co-authored-by: Dilip Tiwari <digitalapache20@gmail.com> Co-authored-by: Sabiha Khan <sabihak89@gmail.com> Co-authored-by: Abhishek <abhishek@a6k.me>	2026-04-25 20:41:46 +05:30
Abhishek	00a1a22b74	feat: refactor node spec and add mcp tools (#244 ) * refactor: carve out extraction panel * refactor: create spec versions for node types * refactor: create a GenericNode and remove custom nodes * feat: add python and typescript sdk * add dograh sdk * fix: fetch draft workflow definition over published one * fix: fix routes of SDKs to use code gen * chore: remove doclink dependency to reduce image size * chore: format files * chore: bump pipecat * feat: let mcp fetch archived workflows on demand * chore: fix tests * feat: add sdk documentation * chore: change banner and add badge	2026-04-21 07:56:16 +05:30
Abhishek Kumar	e31b38122e	fix: fix interruption handling for Gemini Live 1. Fixes #236 2. Fix run_inference for variable extraction for Gemini Live	2026-04-15 19:29:07 +05:30
Abhishek	7c245051d2	feat: add recording audio option in tool and node transitions (#232 ) * feat: allow uploading recording as part of node transition * feat: allow recordings in tool transitions * chore: fix tests	2026-04-10 17:53:42 +05:30
Sabiha Khan	3f19a16e7f	feat: add posthog events (#231 ) * feat: add posthog events * fix: workflow_duplicated event * chore: add events to enum	2026-04-10 17:52:21 +05:30
Abhishek Kumar	87c8c5e2c8	feat: add full document mode in knowledge base	2026-04-09 13:49:20 +05:30
Abhishek Kumar	9decdb2f4b	fix: send volume in cartesia	2026-04-08 23:20:14 +05:30
Abhishek Kumar	1f5229e2df	chore: update prompt for pre-recorded audio generation	2026-04-08 22:23:14 +05:30
Abhishek	38d1d928b7	feat: agent versioning and model configurations override (#227 ) * feat: add tests and migrations * feat: workflow versioning among published and draft * feat: add a new settings page to simplify workflow detail page * fix: fix tsclient generation	2026-04-08 19:20:31 +05:30
Abhishek Kumar	e04ce4e852	chore: add language option for Rime	2026-04-07 18:32:09 +05:30
Abhishek Kumar	e255b33813	feat: add Rime TTS	2026-04-07 14:05:47 +05:30
drascom	95d6dd44ff	fix: Speaches STT service wiring * Fix Speaches STT service wiring * chore: bump pipecat submodule --------- Co-authored-by: drascom <drascom@drascoms-MacBook-Pro.local> Co-authored-by: Abhishek Kumar <abhishek@a6k.me>	2026-04-06 14:11:58 +05:30
Abhishek	ec2f322486	feat: add pre call fetch configuration (#222 ) * feat: add pre call fetch configuration * docs: add NEW tags for pages about new features --------- Co-authored-by: Sabiha Khan <sabihak89@gmail.com>	2026-04-06 12:30:37 +05:30
Abhishek Kumar	c4c4b591db	feat: add gladia stt support	2026-04-04 14:47:48 +05:30
Abhishek Kumar	03df5595c3	feat: add worker sync events Add a worker sync event so that runtime updates on one worker can propagate across other workers using pubsub for multi worker deployments	2026-04-04 14:26:47 +05:30
Abhishek Kumar	56763a4527	feat: enable context summarization	2026-04-03 13:39:02 +05:30
Abhishek Kumar	501d06c00d	feat: add Assembly AI STT	2026-04-03 07:10:37 +05:30
Abhishek	87e72d5f6f	feat: add gemini live and speaches integration (#220 ) * feat: add speaches models * feat: add gemini realtime and speaches integration - Add gemini realtime support - Add speaches support for locally hosted LLMs * chore: bump pipecat * feat: add language option * fix: add skip aggregator types to tts settings * fix: make API key optional for realtime	2026-03-31 21:42:03 +05:30
Abhishek Kumar	2d91336aec	fix: cleanup rtf on pipeline finish	2026-03-25 22:44:38 +05:30
Sabiha Khan	5b820cb0ba	feat: integrate Telnyx telephony for outbound and inbound calling (#206 ) * feat: integrate Telnyx telephony for outbound and inbound calling * chore: remove redundant code --------- Co-authored-by: Abhishek <abhishek@a6k.me>	2026-03-25 18:01:41 +05:30
Abhishek Kumar	2fa4191d9b	feat: allow recording audio in workflow builder	2026-03-25 15:01:39 +05:30
Abhishek Kumar	ac0731a374	feat: add support for self hosted llm models	2026-03-24 17:50:45 +05:30
neil from camb.ai	31e075d114	feat: add CAMB AI TTS integration (#187 ) Co-authored-by: Abhishek <abhishek@a6k.me>	2026-03-24 12:54:07 +05:30

1 2 3

119 commits