dograh

mirror of https://github.com/dograh-hq/dograh.git synced 2026-07-22 11:51:04 +02:00

Author	SHA1	Message	Date
Abhishek Kumar	7a3b1c4a4b	fix: add validation for URL and params	2026-06-02 12:45:13 +05:30
Abhishek Kumar	858c474139	Merge remote-tracking branch 'origin/main' into pr-381	2026-06-02 12:11:57 +05:30
developer603	8a4a2e25db	feat: allow overriding base URL of OpenAI STT and TTS (#377 ) Mirrors the LLM treatment from #368 for the OpenAI STT and OpenAI TTS providers. Users running OpenAI-compatible self-hosted services (vLLM, Speaches, llama.cpp, custom proxies) can now point Dograh at them via the OpenAI provider with `base_url`, instead of being forced onto the Speaches provider as a workaround. Changes: * `registry.py` — add `base_url` field (default `https://api.openai.com/v1`) to `OpenAISTTConfiguration` and `OpenAITTSService`, identical in shape to the existing `OpenAILLMService.base_url` from #368. * `service_factory.py` — in the OPENAI branches of `create_stt_service` and `create_tts_service`, lift `base_url` off the user config, run it through `_validate_runtime_service_url`, and forward it as a kwarg to `OpenAISTTService` / `OpenAITTSService` (both already accept it). Same pattern as the LLM branch. * `test_user_configured_service_url_security.py` — adds four runtime validation tests covering private-IP rejection and localhost rejection in SaaS mode for both STT and TTS. Existing OSS-mode permissiveness is unchanged (DEPLOYMENT_MODE=oss skips the validator, as before). No schema migration needed — Pydantic populates the default; existing configurations without `base_url` continue to talk to api.openai.com. `check_validity.py` requires no edits because the per-service validation loop already iterates `("base_url", "endpoint")` via `getattr`, and the `_check_openai_api_key` dispatcher already routes OPENAI providers through the base_url-aware code path (introduced in #368) for STT and TTS too. Tests pass locally: pytest api/tests/test_user_configured_service_url_security.py 23 passed in 4.80s (19 existing + 4 new) Co-authored-by: developer603 <developer603@users.noreply.github.com>	2026-06-02 12:06:58 +05:30
Matt Van Horn	dd85c4a1b4	fix: support object and array parameters in custom HTTP tools (#373 ) * fix: support object and array parameters in custom HTTP tools * feat(ui): expose object and array types in the custom tool parameter editor * fix: error handling and schema generation --------- Co-authored-by: Matt Van Horn <455140+mvanhorn@users.noreply.github.com> Co-authored-by: Abhishek Kumar <abhishek@a6k.me>	2026-06-02 11:35:38 +05:30
Abhay Babbar	98d2b24cba	Add Sarvam LLM, update Sarvam STT models, expose usage_info on run detail (#351 ) * Add Sarvam LLM provider, update Sarvam STT models, expose usage_info on run detail. Depends on pipecat PR dograh-hq/pipecat#43 for STT string language support. Submodule bump will follow after that merges. * test: cover Sarvam STT language mapping; link Sarvam docs --------- Co-authored-by: Sabiha Khan <sabihak89@gmail.com>	2026-06-01 10:29:31 +05:30
Abhishek Kumar	fcb7004c7a	feat: create tools using MCP	2026-05-31 16:50:44 +05:30
Abhishek	5c29b6ed94	feat: add mcp guides for various topic and stages for bot building (#380 )	2026-05-31 16:07:32 +05:30
Abhishek Kumar	0c0b8383bf	fix: fix rtf logs and gemini live turn taking	2026-05-31 16:05:03 +05:30
Abhishek Kumar	c586d02d5d	feat: abort immediately on max call duration exceed	2026-05-31 13:21:37 +05:30
Abhishek Kumar	78ba62e185	feat: banner if API is not reachable	2026-05-31 13:05:22 +05:30
Abhishek	8f10bcade3	fix: store channel id in gathered context for ARI outbound	2026-05-29 17:07:58 +00:00
Vishal Dhateria	dbbf362315	feat: add Azure AI multi-provider support (TTS, STT, Embeddings, Realtime) Enables Azure AI services across all model layers so users with Azure credits can consolidate billing on a single provider. - Voice (TTS): AzureSpeechTTSConfiguration via azure_speech provider - Transcriber (STT): AzureSpeechSTTConfiguration via azure_speech provider - Embedding: AzureOpenAIEmbeddingsConfiguration via azure provider - Realtime: AzureRealtimeLLMConfiguration via azure_realtime provider New files: - api/services/pipecat/realtime/azure_realtime.py - api/services/gen_ai/embedding/azure_openai_service.py - api/tests/test_azure_speech_service_factory.py The UI picks up all four providers automatically from the schema — no frontend changes required. Co-Authored-By: Claude Sonnet 4.6 <noreply@anthropic.com>	2026-05-29 20:48:42 +05:30
Abhishek	5ef3be92b5	chore: update pipecat to 1.3.0 (#379 ) * chore: rename PipelineTask to PipelineWorker * fix: fix tests * chore: update pipecat submodule * fix: fix anyio same task cancellation scope	2026-05-29 16:19:42 +05:30
Abhishek Kumar	e695436fb3	fix: fix inbound for Cloudonix with softphone	2026-05-28 19:49:41 +05:30
Abhishek	b891091e0e	fix: fix service key validation in OSS (#371 ) Fixes #303	2026-05-28 08:09:35 +05:30
Abhishek Kumar	6d78537297	chore: remove unused smart_turn service Fixes #323, #324, #325.	2026-05-27 09:49:36 +00:00
nuthalapativarun	5b61ad645f	feat: stamp API key into model override at save time to survive global provider change (#362 ) * fix: stamp API key into model override at save time to survive global provider change When a workflow overrides the TTS/LLM/STT provider to match the current global config, the override dict only stores model/voice fields, not the API key. If the global config later switches to a different provider, the override can no longer inherit the API key and calls fail. Fix: enrich_overrides_with_api_keys() copies the global provider's API key (and other secret fields) into the override dict at workflow-save time, making the override self-contained regardless of future global config changes. * feat: add test coverage and masking logic --------- Co-authored-by: Abhishek Kumar <abhishek@a6k.me>	2026-05-27 14:01:14 +05:30
Abhishek	8a58b0992d	feat: allow overriding base URL of OpenAI models (#368 ) * Add OpenAI-compatible API option in model configuration Backend-only cherry-pick from `20617db37a`. * Potential fix for pull request finding Co-authored-by: Copilot Autofix powered by AI <175728472+Copilot@users.noreply.github.com> * fix: harden the base url settings in SaaS mode --------- Co-authored-by: Chris Briddock <briddockchristopher@gmail.com> Co-authored-by: Copilot Autofix powered by AI <175728472+Copilot@users.noreply.github.com>	2026-05-27 13:07:45 +05:30
Abhishek Kumar	7810923bca	chore: return formatted transcript url - Return formatted transcript and recording URL - Harden campaign dispatcher logic	2026-05-26 13:24:12 +05:30
Abhishek Kumar	285de92528	fix: fix vobiz webhook signature validation	2026-05-25 18:30:06 +05:30
Abhishek	3892b58486	feat: add ultravox realtime and fix signature issue in telephony (#345 ) * feat: add ultravox realtime and fix signature issue in telephony - Add UltraVox realtime - Fix signature issue on telephony * fix: fix regression for wss_backend_endpoint	2026-05-23 12:51:55 +05:30
Abhishek Kumar	9135c2da13	feat: add xai grok as realtime model	2026-05-22 18:04:59 +05:30
Abhishek Kumar	291264de7b	Merge branch 'main' of https://github.com/dograh-hq/dograh	2026-05-22 14:36:54 +05:30
Abhishek Kumar	ad2fa07058	feat: add google stt and tts. add folders to organize agents	2026-05-22 14:36:50 +05:30
Octopus	0e0d3136ca	feat: add MiniMax provider support (Chat + TTS) (#309 ) * feat: add MiniMax provider support (Chat + TTS) - Add MiniMax LLM provider using OpenAI-compatible API - Models: MiniMax-M2.7, MiniMax-M2.7-highspeed - Default base URL: https://api.minimax.io/v1 - Uses MINIMAX_API_KEY for authentication - Add MiniMax TTS provider using Pipecat's MiniMaxHttpTTSService - Models: speech-2.8-hd (default), speech-2.8-turbo - 6 built-in voices - Requires group_id configuration - Add unit tests for both providers * fix(minimax): validator, temperature, session cleanup, reasoning filter - check_validity.py: wire MiniMax into _validator_map and enforce group_id at save time. Without this, saving a config with a valid key was rejected. - registry.py: surface temperature on the LLM config (gt=0; MiniMax rejects 0) and base_url on the TTS config - service_factory.py: * Plumb temperature through create_llm_service * Normalize TTS base_url to include /t2a_v2 — pipecat appends only ?GroupId=... to the URL. * Use the new MiniMaxLLMService (from pipecat) to strip <think>...</think> reasoning that MiniMax-M2.7 emits inline in delta.content (otherwise it leaks straight to TTS). * Use MiniMaxOwnedSessionTTSService so the per-instance aiohttp session gets closed in cleanup() instead of leaking sockets/FDs. - minimax_tts.py: small wrapper around MiniMaxHttpTTSService that owns the session it was handed (pipecat's caller-owns-session API conflicts with the factory's per-instance pattern). - pipecat submodule: bumps to a commit that adds MiniMaxLLMService — a thin OpenAILLMService subclass with the streaming <think> filter (mirrors NvidiaLLMService's pattern for NIM reasoning models). - Tests updated/added for all of the above. Co-Authored-By: Claude Opus 4.7 (1M context) <noreply@anthropic.com> --------- Co-authored-by: octo-patch <octo-patch@github.com> Co-authored-by: Sabiha Khan <sabihak89@gmail.com>	2026-05-22 13:09:41 +05:30
Abhishek	d97d1d72cd	feat: add chat based testing for voice agent (#308 ) * feat: add backend foundations * feat: add text chat UI * chore: simplify the reload behaviour * fix: fix upgrade banner to be triggered after package upload * feat: simplify TesterPanel design * chore: fix formatting and generate client * chore: fix tracing for text chat mode * fix: fix revert and edit CTA * refactor: refactor TesterPanel into smaller components * feat: enable runtime transition of nodes * fix: fix review comments	2026-05-21 15:20:02 +05:30
Mohamed-Mamdouh	67479e98fd	fix timestamps in tuner accumulator (#335 ) * fix timestamps in tuner accumulator * chore: refactor strip_thought_ids_from_messages --------- Co-authored-by: Abhishek Kumar <abhishek@a6k.me>	2026-05-21 07:43:50 +05:30
Leoy	5762095edf	feat(mcp): add search_docs tool over docs corpus (closes #295 ) (#316 ) * feat(mcp): add search_docs tool over Mintlify docs corpus Closes #295. The docs at https://docs.dograh.com promise "Search the Dograh docs for how to configure a TURN server" as an MCP example prompt, but no search_docs tool exists in the MCP server — agents can list workspace resources but cannot search the documentation. This adds a dependency-free, in-process keyword search over the `docs/` tree shipped into the API image (`COPY ./docs ./docs`): - New `api/mcp_server/tools/docs_search.py` — async `search_docs(query, limit=10)` with weighted scoring (path > title > body), a 25-result hard cap, snippet extraction around the first term hit, and graceful empty-list degradation when docs aren't on disk. `DOGRAH_DOCS_PATH` env var overrides location discovery for non-Docker layouts. - Registered in `api/mcp_server/server.py` alongside the other tools, keeping the existing list-alphabetical convention. - `api/tests/test_mcp_docs_search.py` — 18 unit tests covering the pure helpers (tokenizer, frontmatter stripping, title extraction, scoring weights, URL building) and end-to-end ranking, limit clamping, empty-corpus degradation, and input-validation errors. Mocks `authenticate_mcp_request` to avoid the DB dependency, mirroring `test_mcp_save_workflow.py`. Implementation notes: - The docs corpus is ~100 files / ~140k LoC, so a per-call scan runs well under 50 ms; avoiding a vector index / embedding backend keeps the tool zero-dependency and works for fully offline self-hosted deployments. - Authentication is required for consistency with the other MCP tools (and to route through the existing rate-limit middleware), even though docs are not org-scoped data. - Title/path matches deliberately outweigh body matches so a page whose subject IS the query term outranks one that merely mentions it incidentally. 
* feat: improve docs search --------- Co-authored-by: Abhishek Kumar <abhishek@a6k.me>	2026-05-20 18:20:35 +05:30
Abhishek Kumar	d93d7aff4d	feat: add Review AGENTS.md Skill	2026-05-20 16:20:07 +05:30
Abhishek Kumar	ee216c0e40	chore: refactor AGENTS.md	2026-05-20 15:56:52 +05:30
Mohamed-Mamdouh	5f28c1b2a9	feat: add Tuner Integration to Dograh (#311 ) * Add tuner integration * bump pipecat version * chore: update pipecat submodule to match upstream and use tuner-pipecat-sdk 0.2.0 Update pipecat submodule from 0.0.109.dev23 to 13e98d0d9 (the exact commit upstream dograh-hq/dograh uses after v1.30.1). This installs pipecat-ai as 1.1.0.post277 via setuptools_scm, satisfying tuner-pipecat-sdk 0.2.0's pipecat-ai>=1.0.0 requirement. Co-Authored-By: Claude Sonnet 4.6 <noreply@anthropic.com> * wire tuner * feat: refactor integrations into self contained packages * chore: simplify ensure_public_access_token * fix: remove NodeSpec and make DTOs the source of truth * feat: send relevant signal to mcp using to_mcp_dict * fix: fix tests * cleanup: remove nango integrations * feat: add agents.md for integrations --------- Co-authored-by: Claude Sonnet 4.6 <noreply@anthropic.com> Co-authored-by: Abhishek Kumar <abhishek@a6k.me>	2026-05-20 14:37:33 +05:30
palinko91	afa78fe859	fix(stt): align Speechmatics language registry with official transcription codes (#317 )	2026-05-19 19:00:38 +05:30
Abhishek	151bf77e40	feat: add agent skills to review PR (#320 )	2026-05-19 17:02:26 +05:30
Sabiha Khan	8778bb453e	chore: mandate telnyx signature verification (#319 )	2026-05-19 16:50:27 +05:30
Paulo Busato Favarato	75839f9de5	feat(mcp): generic MCP tool source with per-node function filtering (#301 ) * feat(mcp): generic MCP tool source with per-node function filtering Adds a Model Context Protocol tool category: connect a customer MCP server and expose its tools to the agent, with optional per-node allow-listing of individual MCP functions. - ToolCategory.MCP enum + alembic migration - MCP definition validator and collision-safe function-name namespacing - McpToolSession wrapper: graceful-degrade, per-call open/close lifecycle - CustomToolManager MCP branch (schemas + proxy handlers) - Per-node mcp_tool_filters threaded through DTO/graph/engine - Best-effort discovered_tools catalog cache + POST /tools/{uuid}/mcp/refresh - UI: MCP create/edit config, tabbed ToolSelector with per-node toggles * feat: refactor for code standardisation and documentation --------- Co-authored-by: Abhishek Kumar <abhishek@a6k.me>	2026-05-19 16:10:00 +05:30
Abhishek Kumar	fc04f31639	fix: force FORCE_TURN_RELAY for local IPs in setup	2026-05-16 18:37:38 +05:30
Abhishek	2381a803ad	feat: add openai realtime models (#298 ) * feat: add openai realtime models * chore: bump pipecat * fix: resample telephony audio for openai realtime * fix: sampling rate fix for openai realtime * chore: clean up dead code	2026-05-16 18:05:23 +05:30
Abhishek	45b00cd5d0	chore: remove looptalk (#299 ) * chore: remove looptalk Remove looptalk in the current version. We will be rethinking looptalk in a fresh way. * chore: formatting fix	2026-05-16 17:45:12 +05:30
Sabiha Khan	0523dcb079	fix: provider resolution in telephony cost calculation post workflow integration calls	2026-05-16 13:19:26 +05:30
Nir Simionovich	7df73beea3	Resolve an issue with direct socket connections using the wrong event… (#289 ) * Resolve an issue with direct socket connections using the wrong event data. * Resolve the formatting issues in the provider file * chore: fix import ordering with ruff Co-Authored-By: Claude Opus 4.7 (1M context) <noreply@anthropic.com> --------- Co-authored-by: Nir Simionovich <nirs@cloudonix.io> Co-authored-by: Abhishek Kumar <abhishek@a6k.me> Co-authored-by: Claude Opus 4.7 (1M context) <noreply@anthropic.com>	2026-05-14 18:20:14 +05:30
Abhishek Kumar	b728cc4922	chore: fix ARI documentation	2026-05-13 21:53:18 +05:30
Sabiha Khan	ebeffdbc40	fix(ari): pre-register ext channel id and defer bridge to its StasisS… (#284 ) * fix(ari): pre-register ext channel id and defer bridge to its StasisStart Two race conditions in the inbound ARI flow could leave a call silent: 1. Bridging both channels immediately after creating the ext media leg raced against the ext channel entering the Stasis application; slow chan_websocket handshakes produced "Channel not in Stasis application" 422 errors on addChannel. 2. Asterisk could fire StasisStart for the ext channel before the externalMedia POST response returned, so _is_ext_channel returned False and the event was dropped as an unknown outbound call. Fixes: - Generate the ext channel id as dograh-ext-<uuid> client-side and pass it to Asterisk via the channelId query param. Mark the ext channel, set its channel->run mapping, register the pending bridge entry, and persist gathered_context.ext_channel_id all before the POST. - Defer the bridge to a new _complete_bridge_after_ext_ready handler triggered by the ext channel's own StasisStart. Both channels are guaranteed in Stasis by then, so addChannel cannot 422. - On POST failure or channelId mismatch, roll back the pending entry and ERROR loudly. * fix: replace in-memory dict with redis storage	2026-05-13 18:33:34 +05:30
Sabiha Khan	b670004725	feat: verify telnyx webhook signature optionally (#279 )	2026-05-12 19:47:28 +05:30
Abhishek	7f0dac1ad5	feat: configurable ElevenLabs base URL for Data Residency (#278 ) * feat: configurable ElevenLabs base URL for Data Residency (#269) Adds a `base_url` field to `ElevenlabsTTSConfiguration` so users on an ElevenLabs Data Residency plan (EU, etc.) can point Dograh at the regional endpoint instead of the hardcoded global one. Defaults to `https://api.elevenlabs.io`, preserving existing behaviour. The service factory rewrites the HTTP scheme to WSS when constructing the WebSocket TTS service. Co-Authored-By: Claude Opus 4.7 (1M context) <noreply@anthropic.com> * fix: fix drift --------- Co-authored-by: Claude Opus 4.7 (1M context) <noreply@anthropic.com>	2026-05-12 19:01:13 +05:30
Abhishek Sharma	137b5e9f89	Verify Telnyx webhook signatures (#271 ) * Verify Telnyx webhook signatures * feat: harden telnyx webhook signature verification --------- Co-authored-by: a692570 <a692570@users.noreply.github.com> Co-authored-by: Sabiha Khan <sabihak89@gmail.com>	2026-05-12 18:37:31 +05:30
Sabiha Khan	a1902829db	fix: prior pre-pr drift check failures (#276 ) * fix: prior pre-pr drift check failures * docs: update api reference openapi json	2026-05-12 14:17:40 +05:30
Sabiha Khan	4a6752e62b	feat(telephony/telnyx): add call transfer via conference bridge (#274 ) Conference-based transfer for Telnyx with a two-step flow: - transfer_call dials the destination with a per-transfer webhook URL. - On call.answered, the webhook seeds a conference with the destination's live call_control_id and publishes DESTINATION_ANSWERED. - TelnyxConferenceStrategy joins the caller into the conference on pipeline teardown (EndTaskReason.TRANSFER_CALL). - On post-answer destination hangup, the webhook hangs up the caller — Telnyx's create_conference doesn't accept end_conference_on_exit on the seed leg, so we tear down the bridge ourselves. TransferContext gains optional workflow_run_id (for webhook→provider resolution in multi-config orgs) and conference_id (set on answer, read by the strategy). Also fixes the transfer tool's provider lookup to go through get_telephony_provider_for_run instead of the deprecated org-default shim, which was returning the wrong provider in multi-config orgs.	2026-05-12 13:44:39 +05:30
Abhishek Kumar	4afe426f12	chore: fix tests	2026-05-11 17:21:02 +05:30
Abhishek	e2fe1f3cd4	feat: enable FORCE_TURN_RELAY to diagnose turn connectivity for local deployment setups (#272 ) * filter out local sdp candidates on non local environment * feat: add FORCE_TURN_RELAY variable * add FORCE_TURN_RELAY option in docker-compose * fix: fix github workflow	2026-05-11 17:13:01 +05:30
Sabiha Khan	01c201bf09	feat: add telnyx webhook api key in telephony config (#270 )	2026-05-09 18:03:42 +05:30

237 commits