dograh

mirror of https://github.com/dograh-hq/dograh.git synced 2026-06-07 07:55:16 +02:00

Author	SHA1	Message	Date
developer603	8a4a2e25db	feat: allow overriding base URL of OpenAI STT and TTS (#377 ) Mirrors the LLM treatment from #368 for the OpenAI STT and OpenAI TTS providers. Users running OpenAI-compatible self-hosted services (vLLM, Speaches, llama.cpp, custom proxies) can now point Dograh at them via the OpenAI provider with `base_url`, instead of being forced onto the Speaches provider as a workaround. Changes: * `registry.py` — add `base_url` field (default `https://api.openai.com/v1`) to `OpenAISTTConfiguration` and `OpenAITTSService`, identical in shape to the existing `OpenAILLMService.base_url` from #368. * `service_factory.py` — in the OPENAI branches of `create_stt_service` and `create_tts_service`, lift `base_url` off the user config, run it through `_validate_runtime_service_url`, and forward it as a kwarg to `OpenAISTTService` / `OpenAITTSService` (both already accept it). Same pattern as the LLM branch. * `test_user_configured_service_url_security.py` — adds four runtime validation tests covering private-IP rejection and localhost rejection in SaaS mode for both STT and TTS. Existing OSS-mode permissiveness is unchanged (DEPLOYMENT_MODE=oss skips the validator, as before). No schema migration needed — Pydantic populates the default; existing configurations without `base_url` continue to talk to api.openai.com. `check_validity.py` requires no edits because the per-service validation loop already iterates `("base_url", "endpoint")` via `getattr`, and the `_check_openai_api_key` dispatcher already routes OPENAI providers through the base_url-aware code path (introduced in #368) for STT and TTS too. Tests pass locally: pytest api/tests/test_user_configured_service_url_security.py 23 passed in 4.80s (19 existing + 4 new) Co-authored-by: developer603 <developer603@users.noreply.github.com>	2026-06-02 12:06:58 +05:30
Matt Van Horn	dd85c4a1b4	fix: support object and array parameters in custom HTTP tools (#373 ) * fix: support object and array parameters in custom HTTP tools * feat(ui): expose object and array types in the custom tool parameter editor * fix: error handling and schema generation --------- Co-authored-by: Matt Van Horn <455140+mvanhorn@users.noreply.github.com> Co-authored-by: Abhishek Kumar <abhishek@a6k.me>	2026-06-02 11:35:38 +05:30
Abhay Babbar	98d2b24cba	Add Sarvam LLM, update Sarvam STT models, expose usage_info on run detail (#351 ) * Add Sarvam LLM provider, update Sarvam STT models, expose usage_info on run detail. Depends on pipecat PR dograh-hq/pipecat#43 for STT string language support. Submodule bump will follow after that merges. * test: cover Sarvam STT language mapping; link Sarvam docs --------- Co-authored-by: Sabiha Khan <sabihak89@gmail.com>	2026-06-01 10:29:31 +05:30
Abhishek Kumar	fcb7004c7a	feat: create tools using MCP	2026-05-31 16:50:44 +05:30
Abhishek	5c29b6ed94	feat: add mcp guides for various topic and stages for bot building (#380 )	2026-05-31 16:07:32 +05:30
Abhishek Kumar	0c0b8383bf	fix: fix rtf logs and gemini live turn taking	2026-05-31 16:05:03 +05:30
Abhishek Kumar	c586d02d5d	feat: abort immediately on max call duration exceed	2026-05-31 13:21:37 +05:30
Abhishek Kumar	78ba62e185	feat: banner if API is not reachable	2026-05-31 13:05:22 +05:30
Abhishek	8f10bcade3	fix: store channel id in gathered context for ARI outbound	2026-05-29 17:07:58 +00:00
Abhishek	5ef3be92b5	chore: update pipecat to 1.3.0 (#379 ) * chore: rename PipelineTask to PipelineWorker * fix: fix tests * chore: update pipecat submodule * fix: fix anyio same task cancellation scope	2026-05-29 16:19:42 +05:30
Abhishek Kumar	e695436fb3	fix: fix inbound for Cloudonix with softphone	2026-05-28 19:49:41 +05:30
Abhishek	b891091e0e	fix: fix service key validation in OSS (#371 ) Fixes #303	2026-05-28 08:09:35 +05:30
Abhishek Kumar	6d78537297	chore: remove unused smart_turn service Fixes #323, #324, #325.	2026-05-27 09:49:36 +00:00
nuthalapativarun	5b61ad645f	feat: stamp API key into model override at save time to survive global provider change (#362 ) * fix: stamp API key into model override at save time to survive global provider change When a workflow overrides the TTS/LLM/STT provider to match the current global config, the override dict only stores model/voice fields, not the API key. If the global config later switches to a different provider, the override can no longer inherit the API key and calls fail. Fix: enrich_overrides_with_api_keys() copies the global provider's API key (and other secret fields) into the override dict at workflow-save time, making the override self-contained regardless of future global config changes. * feat: add test coverage and masking logic --------- Co-authored-by: Abhishek Kumar <abhishek@a6k.me>	2026-05-27 14:01:14 +05:30
Abhishek	8a58b0992d	feat: allow overriding base URL of OpenAI models (#368 ) * Add OpenAI-compatible API option in model configuration Backend-only cherry-pick from `20617db37a`. * Potential fix for pull request finding Co-authored-by: Copilot Autofix powered by AI <175728472+Copilot@users.noreply.github.com> * fix: harden the base url settings in SaaS mode --------- Co-authored-by: Chris Briddock <briddockchristopher@gmail.com> Co-authored-by: Copilot Autofix powered by AI <175728472+Copilot@users.noreply.github.com>	2026-05-27 13:07:45 +05:30
Abhishek Kumar	7810923bca	chore: return formatted transcript url - Return formatted transcript and recording URL - Harden campaign dispatcher logic	2026-05-26 13:24:12 +05:30
Abhishek Kumar	285de92528	fix: fix vobiz webhook signature validation	2026-05-25 18:30:06 +05:30
Abhishek	3892b58486	feat: add ultravox realtime and fix signature issue in telephony (#345 ) * feat: add ultravox realtime and fix signature issue in telephony - Add UltraVox realtime - Fix signature issue on telephony * fix: fix regression for wss_backend_endpoint	2026-05-23 12:51:55 +05:30
Abhishek Kumar	9135c2da13	feat: add xai grok as realtime model	2026-05-22 18:04:59 +05:30
Abhishek Kumar	291264de7b	Merge branch 'main' of https://github.com/dograh-hq/dograh	2026-05-22 14:36:54 +05:30
Abhishek Kumar	ad2fa07058	feat: add google stt and tts. add folders to organize agents	2026-05-22 14:36:50 +05:30
Octopus	0e0d3136ca	feat: add MiniMax provider support (Chat + TTS) (#309 ) * feat: add MiniMax provider support (Chat + TTS) - Add MiniMax LLM provider using OpenAI-compatible API - Models: MiniMax-M2.7, MiniMax-M2.7-highspeed - Default base URL: https://api.minimax.io/v1 - Uses MINIMAX_API_KEY for authentication - Add MiniMax TTS provider using Pipecat's MiniMaxHttpTTSService - Models: speech-2.8-hd (default), speech-2.8-turbo - 6 built-in voices - Requires group_id configuration - Add unit tests for both providers * fix(minimax): validator, temperature, session cleanup, reasoning filter - check_validity.py: wire MiniMax into _validator_map and enforce group_id at save time. Without this, saving a config with a valid key was rejected. - registry.py: surface temperature on the LLM config (gt=0; MiniMax rejects 0) and base_url on the TTS config - service_factory.py: * Plumb temperature through create_llm_service * Normalize TTS base_url to include /t2a_v2 — pipecat appends only ?GroupId=... to the URL. * Use the new MiniMaxLLMService (from pipecat) to strip <think>...</think> reasoning that MiniMax-M2.7 emits inline in delta.content (otherwise it leaks straight to TTS). * Use MiniMaxOwnedSessionTTSService so the per-instance aiohttp session gets closed in cleanup() instead of leaking sockets/FDs. - minimax_tts.py: small wrapper around MiniMaxHttpTTSService that owns the session it was handed (pipecat's caller-owns-session API conflicts with the ftory's per-instance pattern). - pipecat submodule: bumps to a commit that adds MiniMaxLLMService — a thin OpenAILLMService subclass with the streaming <think> filter (mirrors NvidiaLLMService's pattern for NIM reasoning models). - Tests updated/added for all of the above. Co-Authored-By: Claude Opus 4.7 (1M context) <noreply@anthropic.com> --------- Co-authored-by: octo-patch <octo-patch@github.com> Co-authored-by: Sabiha Khan <sabihak89@gmail.com>	2026-05-22 13:09:41 +05:30
Abhishek	d97d1d72cd	feat: add chat based testing for voice agent (#308 ) * feat: add backend foundations * feat: add text chat UI * chore: simplify the reload behaviour * fix: fix upgrade banner to be triggered after package upload * feat: simplify TesterPanel design * chore: fix formatting and generate client * chore: fix tracing for text chat mode * fix: fix revert and edit CTA * refactor: refactor TesterPanel into smaller components * feat: enable runtime transition of nodes * fix: fix review comments	2026-05-21 15:20:02 +05:30
Mohamed-Mamdouh	67479e98fd	fix timestamps in tuner accumelator (#335 ) * fix timestamps in tuner accumelator * chore: refactor strip_thought_ids_from_messages --------- Co-authored-by: Abhishek Kumar <abhishek@a6k.me>	2026-05-21 07:43:50 +05:30
Leoy	5762095edf	feat(mcp): add search_docs tool over docs corpus (closes #295 ) (#316 ) * feat(mcp): add search_docs tool over Mintlify docs corpus Closes #295. The docs at https://docs.dograh.com promise "Search the Dograh docs for how to configure a TURN server" as an MCP example prompt, but no search_docs tool exists in the MCP server — agents can list workspace resources but cannot search the documentation. This adds a dependency-free, in-process keyword search over the `docs/` tree shipped into the API image (`COPY ./docs ./docs`): - New `api/mcp_server/tools/docs_search.py` — async `search_docs(query, limit=10)` with weighted scoring (path > title > body), a 25-result hard cap, snippet extraction around the first term hit, and graceful empty-list degradation when docs aren't on disk. `DOGRAH_DOCS_PATH` env var overrides location discovery for non-Docker layouts. - Registered in `api/mcp_server/server.py` alongside the other tools, keeping the existing list-alphabetical convention. - `api/tests/test_mcp_docs_search.py` — 18 unit tests covering the pure helpers (tokenizer, frontmatter stripping, title extraction, scoring weights, URL building) and end-to-end ranking, limit clamping, empty-corpus degradation, and input-validation errors. Mocks `authenticate_mcp_request` to avoid the DB dependency, mirroring `test_mcp_save_workflow.py`. Implementation notes: - The docs corpus is ~100 files / ~140k LoC, so a per-call scan runs well under 50 ms; avoiding a vector index / embedding backend keeps the tool zero-dependency and works for fully offline self-hosted deployments. - Authentication is required for consistency with the other MCP tools (and to route through the existing rate-limit middleware), even though docs are not org-scoped data. - Title/path matches deliberately outweigh body matches so a page whose subject IS the query term outranks one that merely mentions it incidentally. * feat: improve docs search --------- Co-authored-by: Abhishek Kumar <abhishek@a6k.me>	2026-05-20 18:20:35 +05:30
Abhishek Kumar	d93d7aff4d	feat: add Review AGENTS.md Skill	2026-05-20 16:20:07 +05:30
Abhishek Kumar	ee216c0e40	chore: refactor AGENTS.md	2026-05-20 15:56:52 +05:30
Mohamed-Mamdouh	5f28c1b2a9	feat: add Tuner Integration to Dograh (#311 ) * Add tuner integration * bump pipecat version * chore: update pipecat submodule to match upstream and use tuner-pipecat-sdk 0.2.0 Update pipecat submodule from 0.0.109.dev23 to 13e98d0d9 (the exact commit upstream dograh-hq/dograh uses after v1.30.1). This installs pipecat-ai as 1.1.0.post277 via setuptools_scm, satisfying tuner-pipecat-sdk 0.2.0's pipecat-ai>=1.0.0 requirement. Co-Authored-By: Claude Sonnet 4.6 <noreply@anthropic.com> * wire tuner * feat: refactor integrations into self contained packages * chore: simplify ensure_public_access_token * fix: remove NodeSpec and make DTOs the source of truth * feat: send relevant signal to mcp using to_mcp_dict * fix: fix tests * cleanup: remove nango integrations * feat: add agents.md for integrations --------- Co-authored-by: Claude Sonnet 4.6 <noreply@anthropic.com> Co-authored-by: Abhishek Kumar <abhishek@a6k.me>	2026-05-20 14:37:33 +05:30
palinko91	afa78fe859	fix(stt): align Speechmatics language registry with official transcription codes (#317 )	2026-05-19 19:00:38 +05:30
Abhishek	151bf77e40	feat: add agent skills to review PR (#320 )	2026-05-19 17:02:26 +05:30
Sabiha Khan	8778bb453e	chore: mandate telnyx signature verification (#319 )	2026-05-19 16:50:27 +05:30
Paulo Busato Favarato	75839f9de5	feat(mcp): generic MCP tool source with per-node function filtering (#301 ) * feat(mcp): generic MCP tool source with per-node function filtering Adds a Model Context Protocol tool category: connect a customer MCP server and expose its tools to the agent, with optional per-node allow-listing of individual MCP functions. - ToolCategory.MCP enum + alembic migration - MCP definition validator and collision-safe function-name namespacing - McpToolSession wrapper: graceful-degrade, per-call open/close lifecycle - CustomToolManager MCP branch (schemas + proxy handlers) - Per-node mcp_tool_filters threaded through DTO/graph/engine - Best-effort discovered_tools catalog cache + POST /tools/{uuid}/mcp/refresh - UI: MCP create/edit config, tabbed ToolSelector with per-node toggles * feat: refactor for code standardisation and documentation --------- Co-authored-by: Abhishek Kumar <abhishek@a6k.me>	2026-05-19 16:10:00 +05:30
Abhishek Kumar	fc04f31639	fix: force FORCE_TURN_RELAY for local IPs in setup	2026-05-16 18:37:38 +05:30
Abhishek	2381a803ad	feat: add openai realtime models (#298 ) * feat: add openai realtime models * chore: bump pipecat * fix: resample telephony audio for openai realtime * fix: sampling rate fix for openai realtime * chore: clean up dead code	2026-05-16 18:05:23 +05:30
Abhishek	45b00cd5d0	chore: remove looptalk (#299 ) * chore: remove looptalk Remove looptalk in the current version. We will be rethinking looptalk in a fresh way. * chore: formatting fix	2026-05-16 17:45:12 +05:30
Sabiha Khan	0523dcb079	fix: provider resolution in telephony cost calculation post workflow integration calls	2026-05-16 13:19:26 +05:30
Nir Simionovich	7df73beea3	Resolve an issue with direct socket connections using the wrong event… (#289 ) * Resolve an issue with direct socket connections using the wrong event data. * Resolve the formatting issus in the provider file * chore: fix import ordering with ruff Co-Authored-By: Claude Opus 4.7 (1M context) <noreply@anthropic.com> --------- Co-authored-by: Nir Simionovich <nirs@cloudonix.io> Co-authored-by: Abhishek Kumar <abhishek@a6k.me> Co-authored-by: Claude Opus 4.7 (1M context) <noreply@anthropic.com>	2026-05-14 18:20:14 +05:30
Abhishek Kumar	b728cc4922	chore: fix ARI documentation	2026-05-13 21:53:18 +05:30
Sabiha Khan	ebeffdbc40	fix(ari): pre-register ext channel id and defer bridge to its StasisS… (#284 ) * fix(ari): pre-register ext channel id and defer bridge to its StasisStart Two race conditions in the inbound ARI flow could leave a call silent: 1. Bridging both channels immediately after creating the ext media leg raced against the ext channel entering the Stasis application; slow chan_websocket handshakes produced "Channel not in Stasis application" 422 errors on addChannel. 2. Asterisk could fire StasisStart for the ext channel before the externalMedia POST response returned, so _is_ext_channel returned False and the event was dropped as an unknown outbound call. Fixes: - Generate the ext channel id as dograh-ext-<uuid> client-side and pass it to Asterisk via the channelId query param. Mark the ext channel, set its channel->run mapping, register the pending bridge entry, and persist gathered_context.ext_channel_id all before the POST. - Defer the bridge to a new _complete_bridge_after_ext_ready handler triggered by the ext channel's own StasisStart. Both channels are guaranteed in Stasis by then, so addChannel cannot 422. - On POST failure or channelId mismatch, roll back the pending entry and ERROR loudly. * fix: replace in-memory dict with redis storage	2026-05-13 18:33:34 +05:30
Sabiha Khan	b670004725	feat: verify telnyx webhook signature optionally (#279 )	2026-05-12 19:47:28 +05:30
Abhishek	7f0dac1ad5	feat: configurable ElevenLabs base URL for Data Residency (#278 ) * feat: configurable ElevenLabs base URL for Data Residency (#269) Adds a `base_url` field to `ElevenlabsTTSConfiguration` so users on an ElevenLabs Data Residency plan (EU, etc.) can point Dograh at the regional endpoint instead of the hardcoded global one. Defaults to `https://api.elevenlabs.io`, preserving existing behaviour. The service factory rewrites the HTTP scheme to WSS when constructing the WebSocket TTS service. Co-Authored-By: Claude Opus 4.7 (1M context) <noreply@anthropic.com> * fix: fix drift --------- Co-authored-by: Claude Opus 4.7 (1M context) <noreply@anthropic.com>	2026-05-12 19:01:13 +05:30
Abhishek Sharma	137b5e9f89	Verify Telnyx webhook signatures (#271 ) * Verify Telnyx webhook signatures * feat: harden telnyx webhook signature verification --------- Co-authored-by: a692570 <a692570@users.noreply.github.com> Co-authored-by: Sabiha Khan <sabihak89@gmail.com>	2026-05-12 18:37:31 +05:30
Sabiha Khan	a1902829db	fix: prior pre-pr drift check failures (#276 ) * fix: prior pre-pr drift check failures * docs: update api reference openapi json	2026-05-12 14:17:40 +05:30
Sabiha Khan	4a6752e62b	feat(telephony/telnyx): add call transfer via conference bridge (#274 ) Conference-based transfer for Telnyx with a two-step flow: - transfer_call dials the destination with a per-transfer webhook URL. - On call.answered, the webhook seeds a conference with the destination's live call_control_id and publishes DESTINATION_ANSWERED. - TelnyxConferenceStrategy joins the caller into the conference on pipeline teardown (EndTaskReason.TRANSFER_CALL). - On post-answer destination hangup, the webhook hangs up the caller — Telnyx's create_conference doesn't accept end_conference_on_exit on the seed leg, so we tear down the bridge ourselves. TransferContext gains optional workflow_run_id (for webhook→provider resolution in multi-config orgs) and conference_id (set on answer, rd by the strategy). Also fixes the transfer tool's provider lookup to go through get_telephony_provider_for_run instead of the deprecated org-default shim, which was returning the wrong provider in multi-config orgs.	2026-05-12 13:44:39 +05:30
Abhishek Kumar	4afe426f12	chore: fix tests	2026-05-11 17:21:02 +05:30
Abhishek	e2fe1f3cd4	feat: enable FORCE_TURN_RELAY to diagnose turn connectivity for local deployment setups (#272 ) * filter out local sdp candidates on non local environment * feat: add FORCE_TURN_RELAY variable * add FORCE_TURN_RELAY option in docker-compose * fix: fix github workflow	2026-05-11 17:13:01 +05:30
Sabiha Khan	01c201bf09	feat: add telnyx webhook api key in telephony config (#270 )	2026-05-09 18:03:42 +05:30
Abhishek Kumar	45a81c88e0	chore: add more fixtures	2026-05-08 16:28:09 +05:30
Abhishek Kumar	5a358d4d29	feat: add workflow graph constraints fixtures	2026-05-08 16:02:51 +05:30
Abhishek Kumar	6d93be3ef6	fix: number pool initialization in multi telephony setup If there are multiple telephony configurations, the form number should be initialized from the campaigns given telephonic configuration rather than the organization default telephonic configuration.	2026-05-08 14:48:53 +05:30

1 2 3 4 5

234 commits