Commit graph

392 commits

Author SHA1 Message Date
Adil Hafeez
79cbcb5fe1
add claude-4 in llm_routing demo (#486) 2025-05-23 10:21:21 -07:00
Adil Hafeez
dc271f1f76
release 0.3.0 (#483) 2025-05-23 09:52:23 -07:00
Adil Hafeez
99dd900a34
fix panic in brightstaff (#485)
make router section optional in arch_config
2025-05-23 09:37:25 -07:00
Adil Hafeez
21faae605f
correctly map envoy stats to host (#484)
host port 19901 -> envoy container port 9901
2025-05-23 09:37:15 -07:00
Adil Hafeez
a0d10153f9
update archgw logs file to stream access logs from container (#482) 2025-05-23 09:15:44 -07:00
Adil Hafeez
d050dfb85a
When router usage is defined ensure that router model is defined too (#481) 2025-05-23 08:46:12 -07:00
Adil Hafeez
218e9c540d
Add support for json based content types in Message (#480) 2025-05-23 00:51:53 -07:00
Adil Hafeez
f5e77bbe65
add support for claude and add first class support for groq and deepseek (#479) 2025-05-22 22:55:46 -07:00
Adil Hafeez
27c0f2fdce
Introduce brightstaff a new terminal service for llm routing (#477) 2025-05-19 09:59:22 -07:00
Adil Hafeez
1f95fac4af
update arch_config sample on readme to match with new format (#475) 2025-04-29 12:36:46 -07:00
Salman Paracha
9659b2baf6
updating README based on reddit feedback (#474)
* updating README based on reddit feedback

* minor edits

---------

Co-authored-by: Salman Paracha <salmanparacha@MacBook-Pro-329.local>
2025-04-27 23:09:29 -07:00
Adil Hafeez
2e346143dd
use separate host port for chat ui and for app_server (#473)
We were using same port for both chatui and app_server which was causing conflict. This code change updates host port for app_server to 18083 and updates arch_config
2025-04-23 14:05:48 -07:00
Adil Hafeez
9c803f4d69
release 0.2.8 (#472) 2025-04-21 17:02:36 -07:00
Adil Hafeez
5fe2444341
use archfc v1.1 on archfc.katanemo.dev (#471) 2025-04-21 16:27:17 -07:00
Adil Hafeez
00fb1be8a0
release 0.2.7 (#469) 2025-04-16 13:55:24 -07:00
Adil Hafeez
6d6c03a7e8
fix docker hub release tag source image name (#468) 2025-04-16 13:08:43 -07:00
Adil Hafeez
3eb438550a
fix source name for docker images (#467) 2025-04-16 12:24:17 -07:00
Adil Hafeez
e17d5fb2eb
test docker rel (#466) 2025-04-16 12:18:03 -07:00
Adil Hafeez
3cda4d6b69
fix docker hub tag (#465) 2025-04-16 11:46:12 -07:00
Adil Hafeez
ceca553399
fix release image (#464) 2025-04-16 11:34:45 -07:00
Adil Hafeez
c7c0553427
release 0.2.6 (#463) 2025-04-15 14:50:09 -07:00
Shuguang Chen
7d4b261a68
Integrate Arch-Function-Chat (#449) 2025-04-15 14:39:12 -07:00
Salman Paracha
f31aa59fac
fixed issue with groq LLMs that require the openai in the /v1/chat/co… (#460)
* fixed issue with groq LLMs that require the openai in the /v1/chat/completions path. My first change

* updated the GH actions with keys for Groq

* adding missing groq API keys

* add llama-3.2-3b-preview to the model based on addin groq to the demo

---------

Co-authored-by: Salman Paracha <salmanparacha@MacBook-Pro-261.local>
2025-04-13 14:00:16 -07:00
Mat Sylvia
e7b0de2a72
Tweak readme docs for minor nits (#461)
Co-authored-by: darkdatter <msylvia@tradestax.io>
2025-04-12 23:52:20 -07:00
Adil Hafeez
4d2d8bd7a1
release 0.2.5 (#457) 2025-04-06 01:24:01 -07:00
Joseph D Alchemist
8ba1f71430
remove typo (#456) 2025-04-03 11:34:57 -07:00
Ikko Eltociear Ashimine
49e8216061
docs: update llm_provider.rst (#448)
minor fix
2025-03-28 14:35:55 -07:00
Adil Hafeez
de221525de
Use better logs (#452) 2025-03-27 10:40:20 -07:00
Adil Hafeez
76ec5cda68
fix ollama demo (#450) 2025-03-26 11:01:32 -07:00
Adil Hafeez
9f59943041
update code to use 0.2.4 release (#446)
* update code to use 0.2.4 release

* update lock file
2025-03-21 16:08:59 -07:00
Adil Hafeez
eb48f3d5bb
use passed in model name in chat completion request (#445) 2025-03-21 15:56:17 -07:00
Salman Paracha
bd8004d1ae
updated docs to reflect agent routing and hand off (#443)
* updated docs to reflect agent routing and hand off

* updated prompt targets based on review

---------

Co-authored-by: Salman Paracha <salmanparacha@MacBook-Pro-261.local>
2025-03-20 13:57:33 -07:00
Adil Hafeez
84cd1df7bf
add preliminary support for llm agents (#432) 2025-03-19 15:21:34 -07:00
Salman Paracha
8d66fefded
Update README.md (#439)
* Update README.md

* Update README.md

* Update README.md

* Update README.md

* Update README.md

* Update README.md

* Update README.md

* Update README.md

* Update README.md

* Update README.md

* Update README.md

* Update README.md

* Update README.md

* Update README.md
2025-03-19 14:19:49 -07:00
Salman Paracha
85c8299a2b
Update README.md (#438)
fixed typo
2025-03-19 13:45:35 -07:00
Salman Paracha
852ecf73f2 fixing typos 2025-03-18 22:37:24 -07:00
Salman Paracha
c43513caff
updating README to better describe the problems we are solving (#437)
* updating README to better describe the problems we are solving
* fixing formatting issues

---------

Co-authored-by: Salman Paracha <salmanparacha@MacBook-Pro-261.local>
2025-03-18 22:34:42 -07:00
Adil Hafeez
d2cb1427fb
add hurl tests for currency exchange demo (#435) 2025-03-17 14:21:41 -07:00
Salman Paracha
6072d6ef30
updating the docs to improve usage guide for prompt_targets and function calling (#434)
Co-authored-by: Salman Paracha <salmanparacha@MacBook-Pro-261.local>
2025-03-17 14:07:06 -07:00
Adil Hafeez
5526314b3c
fix ollama demo - add egress endpoint for llm routing (#433) 2025-03-17 11:50:14 -07:00
Salman Paracha
ffb8566c36
updating the network agent to work agin (#425)
Co-authored-by: Salman Paracha <salmanparacha@MacBook-Pro-261.local>
2025-03-08 14:29:11 -08:00
Adil Hafeez
e8dc7f18d3
start using base_url in place of endpoint (#430) 2025-03-05 17:20:04 -08:00
Adil Hafeez
ed3845040e
add demo for deepseek (#426) 2025-03-05 14:08:06 -08:00
Pat Thibodeau
2d357ddd5e
Update discord server invite url (#428) 2025-03-05 13:21:35 -08:00
Adil Hafeez
d8b833fe69
release 0.2.3 (#423) 2025-03-04 14:30:44 -08:00
Adil Hafeez
09c9d3706e
publish docker release images (#422)
* add release docker publish

* more testing

* fix more
2025-03-04 14:16:03 -08:00
Salman Paracha
2e2680c98c
Salanap/java debug ai (#416)
* stashing changes on my local branch

* updated the java demo with debug points and jaeger tracing

---------

Co-authored-by: Salman Paracha <salmanparacha@MacBook-Pro-261.local>
2025-03-04 12:49:34 -08:00
Shuguang Chen
e77fc47225
Handle intent matching better in arch gateway (#391) 2025-03-04 12:49:13 -08:00
Adil Hafeez
10cad4d0b7
add health check endpoint for llm gateway (#420)
* add health check endpoint for llm gateway

* fix rust tests
2025-03-03 13:11:57 -08:00
Sid Golestane
a402fee13b
fix: add --type=container to docker inspect to prevent Podman conflicts (#418)
* fix: add --type=container to docker inspect to prevent Podman conflicts

Adding `--type=container` ensures `docker inspect` targets containers
specifically, preventing conflicts with images in Podman.

* Format Python code using pre-commit hook

---------

Co-authored-by: Sid Golestaneh <sid@golestaneh.com>
2025-02-28 17:03:21 -08:00