Commit graph

144 commits

Author SHA1 Message Date
Adil Hafeez
0d190a6e5c
update code to use new json based system prompt for routing (#493) 2025-05-30 17:40:46 -07:00
Adil Hafeez
176f039bbc
fix model warning and use openwebui for preference based router demo 2025-05-30 12:29:56 -07:00
Adil Hafeez
fffa837a06
separate out currency exchange and preference based routing (#491) 2025-05-30 02:14:37 -07:00
Adil Hafeez
9c4733590f
add support for openwebui (#487) 2025-05-28 19:08:00 -07:00
Adil Hafeez
79cbcb5fe1
add claude-4 in llm_routing demo (#486) 2025-05-23 10:21:21 -07:00
Adil Hafeez
218e9c540d
Add support for json based content types in Message (#480) 2025-05-23 00:51:53 -07:00
Adil Hafeez
f5e77bbe65
add support for claude and add first class support for groq and deepseek (#479) 2025-05-22 22:55:46 -07:00
Adil Hafeez
27c0f2fdce
Introduce brightstaff a new terminal service for llm routing (#477) 2025-05-19 09:59:22 -07:00
Adil Hafeez
2e346143dd
use separate host port for chat ui and for app_server (#473)
We were using same port for both chatui and app_server which was causing conflict. This code change updates host port for app_server to 18083 and updates arch_config
2025-04-23 14:05:48 -07:00
Shuguang Chen
7d4b261a68
Integrate Arch-Function-Chat (#449) 2025-04-15 14:39:12 -07:00
Salman Paracha
f31aa59fac
fixed issue with groq LLMs that require the openai in the /v1/chat/co… (#460)
* fixed issue with groq LLMs that require the openai in the /v1/chat/completions path. My first change

* updated the GH actions with keys for Groq

* adding missing groq API keys

* add llama-3.2-3b-preview to the model based on addin groq to the demo

---------

Co-authored-by: Salman Paracha <salmanparacha@MacBook-Pro-261.local>
2025-04-13 14:00:16 -07:00
Mat Sylvia
e7b0de2a72
Tweak readme docs for minor nits (#461)
Co-authored-by: darkdatter <msylvia@tradestax.io>
2025-04-12 23:52:20 -07:00
Adil Hafeez
eb48f3d5bb
use passed in model name in chat completion request (#445) 2025-03-21 15:56:17 -07:00
Adil Hafeez
84cd1df7bf
add preliminary support for llm agents (#432) 2025-03-19 15:21:34 -07:00
Adil Hafeez
d2cb1427fb
add hurl tests for currency exchange demo (#435) 2025-03-17 14:21:41 -07:00
Adil Hafeez
5526314b3c
fix ollama demo - add egress endpoint for llm routing (#433) 2025-03-17 11:50:14 -07:00
Salman Paracha
ffb8566c36
updating the network agent to work agin (#425)
Co-authored-by: Salman Paracha <salmanparacha@MacBook-Pro-261.local>
2025-03-08 14:29:11 -08:00
Adil Hafeez
e8dc7f18d3
start using base_url in place of endpoint (#430) 2025-03-05 17:20:04 -08:00
Adil Hafeez
ed3845040e
add demo for deepseek (#426) 2025-03-05 14:08:06 -08:00
Salman Paracha
2e2680c98c
Salanap/java debug ai (#416)
* stashing changes on my local branch

* updated the java demo with debug points and jaeger tracing

---------

Co-authored-by: Salman Paracha <salmanparacha@MacBook-Pro-261.local>
2025-03-04 12:49:34 -08:00
Shuguang Chen
e77fc47225
Handle intent matching better in arch gateway (#391) 2025-03-04 12:49:13 -08:00
Adil Hafeez
e40b13be05
Update arch_config and add tests for arch config file (#407) 2025-02-14 19:28:10 -08:00
Adil Hafeez
2f6c4348fd
update jaeger (#411) 2025-02-14 14:55:41 -08:00
Salman Paracha
28ee14a75c
updated the spotify bearer authorization README and fixed main README… (#402)
* updated the spotify bearer authorization README and fixed main README links

* minor fixes to SPOTIFY README

---------

Co-authored-by: Salman Paracha <salmanparacha@MacBook-Pro-261.local>
2025-02-10 17:56:28 -08:00
Salman Paracha
6ad70d9737
Fixing relative link to the shared chatbotui in the spotify demo, and add references to it in the README (#400) 2025-02-10 11:23:04 -08:00
Adil Hafeez
8de6eacfbd
spotify demo with optimized context window code change (#397) 2025-02-07 19:14:15 -08:00
Salman Paracha
b3c95a6698
refactor demos (#398) 2025-02-07 18:45:42 -08:00
Adil Hafeez
2bd61d628c
add ability to specify custom http headers in api endpoint (#386) 2025-02-06 11:48:09 -08:00
Adil Hafeez
962727f244
Infer port from protocol if port is not specified and add ability to override hostname in clusters def (#389) 2025-02-03 14:51:59 -08:00
Adil Hafeez
38f7691163
add support for custom llm with ssl support (#380)
* add support for custom llm with ssl support

Add support for using custom llm that are served through https protocol.

* add instructions on how to add custom inference endpoint

* fix formatting

* add more details

* Apply suggestions from code review

Co-authored-by: Salman Paracha <salman.paracha@gmail.com>

* Apply suggestions from code review

* fix precommit

---------

Co-authored-by: Salman Paracha <salman.paracha@gmail.com>
2025-01-24 17:14:24 -08:00
Adil Hafeez
2c67fa3bc0
Fix llm_routing provider element (#382)
* Fix llm_routing provider element

We replaced provider with provider_interface to make it more clear to developers about provider api/backend being used. During that upgrade we removed support for mistral in provider to encourage developers to start using provider_interface. But this demo was not updated to use provider_interface as it was using mistral. This code change fixes it by replacing provider with provider_interface.

Signed-off-by: Adil Hafeez <adil.hafeez@gmail.com>

* fix the path

* move

* add more details

* fix

* Apply suggestions from code review

* fix

* fix

---------

Signed-off-by: Adil Hafeez <adil.hafeez@gmail.com>
2025-01-24 16:34:11 -08:00
Guy Stone
84af476c75
Add run_demo.sh for llm_routing demo (#381)
* Create run_demo.sh for llm_routing demo

* Update run_demo.sh
2025-01-24 14:41:51 -08:00
Adil Hafeez
6740a09952
add docker-compose file for honeycomb tracing (#377) 2025-01-22 14:02:59 -08:00
Aayush
fcd8cfb9fc
add in honeycomb support for weather-forecast demo (#345) 2025-01-21 17:15:27 -08:00
Adil Hafeez
07ef3149b8
add support for using custom upstream llm (#365) 2025-01-17 18:25:55 -08:00
Adil Hafeez
8407edae99
only test currency exchange in demo test (#348) 2024-12-21 11:33:08 -08:00
Shuguang Chen
ba7279becb
Use intent model from archfc to pick prompt gateway (#328) 2024-12-20 13:25:01 -08:00
Adil Hafeez
93d3d349a2
fix code bug (#340) 2024-12-06 17:20:59 -08:00
Adil Hafeez
a54db1a098
update getting started guide and add llm gateway and prompt gateway samples (#330) 2024-12-06 14:37:33 -08:00
Aayush
9d8fe02729
fix the README for the weather_forecasting demo (#336)
* README fix

* add missing colon
2024-12-06 14:02:41 -08:00
Aayush
885acc899f
322 add support for pydantic logfire for llm agent tracing (#329)
* set up otel-collector and implement sending to logfire

* moved rest of the files for the demo into the folder

* update docker-compose.yaml and run_demo.sh to properly check for LOGFIRE_API_KEY

* refactor weather_forecast demo to only be one demo

* add a default docker-compose for e2e tests

* update based on requested changes

* fix replace comma with colon in readme

* remove weather_forecast_service folder, and make logfire demo fail instantly if no key is set

* remove the unused weather forecast service folder

* Changed stop_demo to only stop one file at a time

* update readme with new demo stopping setup

* Revert changes to end behavior

* fix silly formatting mistake
2024-12-06 13:44:22 -08:00
Peter Jausovec
f5cdafb7c8
update alertmanager version to v2, remove the merge artifacts (#309)
Signed-off-by: Peter Jausovec <peter.jausovec@solo.io>
2024-11-27 11:41:31 -08:00
Adil Hafeez
0ff3d43008
remove dependency on docker-compose when starting up archgw (#305) 2024-11-26 13:13:02 -08:00
Adil Hafeez
726f1a3185
add schema change to use enum in arch_config (#304) 2024-11-25 17:51:25 -08:00
Adil Hafeez
9c6fcdb771
use fix prompt guards (#303) 2024-11-25 17:16:35 -08:00
Adil Hafeez
d3c17c7abd
move custom tracer to llm filter (#267) 2024-11-15 10:44:01 -08:00
Aayush
1d229cba8f
Add in tpot (#269)
* add in tpot and tokens per second

* add in debug logs for new stats and update integration tests

* update shared dashboard to include new stats
2024-11-14 15:03:08 -08:00
Adil Hafeez
31749bfc74
move grafana and prometheus to shared (#265) 2024-11-12 15:23:30 -08:00
Aayush
5993e36f22
Update arch stats (#250) 2024-11-12 15:03:26 -08:00
Adil Hafeez
30647fd508
Add service to stream custom otel traces to otel-collector (#262) 2024-11-12 11:09:40 -08:00