Adil Hafeez
2f58645ca8
ignore some tests in prompt gateway
2025-06-09 16:41:23 -07:00
Adil Hafeez
3f9eae4aa3
fix llm gateway rust tests
2025-06-09 16:32:42 -07:00
Adil Hafeez
641d41de1c
add tests
2025-06-06 00:37:12 -07:00
Adil Hafeez
04f6c20e38
remove extra log
2025-06-05 23:41:27 -07:00
Adil Hafeez
b1233da268
add arch
2025-06-05 20:17:07 -07:00
Adil Hafeez
8e4960df4a
fix more
2025-06-05 20:03:27 -07:00
Adil Hafeez
db19395eb7
more changes
2025-06-05 16:58:45 -07:00
Adil Hafeez
25f1b72e7c
more changes
2025-06-05 16:14:40 -07:00
Adil Hafeez
22fde1f333
fmt/clippy
2025-06-05 12:06:50 -07:00
Adil Hafeez
29503ba95e
revert supervisord
2025-06-04 16:49:46 -07:00
Adil Hafeez
6a6afb0888
move models to hermesllm
2025-06-04 16:36:47 -07:00
Adil Hafeez
0c7aa132ee
use hermesllm in arch gateway for both stream and non stream messages
2025-06-04 16:19:45 -07:00
Adil Hafeez
670907145a
more changes
2025-06-04 10:58:09 -07:00
Adil Hafeez
b0c1e97dc5
use req/resp from hermesllm in llm gateway
2025-06-03 15:57:30 -07:00
Adil Hafeez
f10e0fcece
add more changes
2025-06-03 15:00:57 -07:00
Adil Hafeez
2d4d0b01ee
add more changes
2025-06-03 09:58:06 -07:00
Adil Hafeez
9befd6364c
more changes
2025-06-03 00:17:22 -07:00
Adil Hafeez
59dbbd6743
add openai protocol
2025-06-02 23:57:03 -07:00
Adil Hafeez
21ca21dc3c
Merge branch 'main' into adil/hermesllm
2025-06-02 09:29:38 -07:00
Adil Hafeez
0d190a6e5c
update code to use new json based system prompt for routing ( #493 )
2025-05-30 17:40:46 -07:00
Adil Hafeez
8d12a9a6e0
add arch provider ( #494 )
2025-05-30 17:12:52 -07:00
Adil Hafeez
176f039bbc
fix model warning and use openwebui for preference based router demo
2025-05-30 12:29:56 -07:00
Adil Hafeez
fffa837a06
separate out currency exchange and preference based routing ( #491 )
2025-05-30 02:14:37 -07:00
Adil Hafeez
cd7b8ef649
add hermes llm
2025-05-29 13:33:41 -07:00
Adil Hafeez
470cdf9843
use provider_name as model_id /v1/models api ( #490 )
2025-05-29 11:23:18 -07:00
Adil Hafeez
9c4733590f
add support for openwebui ( #487 )
2025-05-28 19:08:00 -07:00
Adil Hafeez
d29eba4102
trim conversation if it exceed max limit of what router model can handle ( #488 )
2025-05-27 20:28:22 -07:00
Adil Hafeez
99dd900a34
fix panic in brightstaff ( #485 )
...
make router section optional in arch_config
2025-05-23 09:37:25 -07:00
Adil Hafeez
d050dfb85a
When router usage is defined ensure that router model is defined too ( #481 )
2025-05-23 08:46:12 -07:00
Adil Hafeez
218e9c540d
Add support for json based content types in Message ( #480 )
2025-05-23 00:51:53 -07:00
Adil Hafeez
f5e77bbe65
add support for claude and add first class support for groq and deepseek ( #479 )
2025-05-22 22:55:46 -07:00
Adil Hafeez
27c0f2fdce
Introduce brightstaff a new terminal service for llm routing ( #477 )
2025-05-19 09:59:22 -07:00
Shuguang Chen
7d4b261a68
Integrate Arch-Function-Chat ( #449 )
2025-04-15 14:39:12 -07:00
Salman Paracha
f31aa59fac
fixed issue with groq LLMs that require the openai in the /v1/chat/co… ( #460 )
...
* fixed issue with groq LLMs that require the openai in the /v1/chat/completions path. My first change
* updated the GH actions with keys for Groq
* adding missing groq API keys
* add llama-3.2-3b-preview to the model based on addin groq to the demo
---------
Co-authored-by: Salman Paracha <salmanparacha@MacBook-Pro-261.local>
2025-04-13 14:00:16 -07:00
Adil Hafeez
de221525de
Use better logs ( #452 )
2025-03-27 10:40:20 -07:00
Adil Hafeez
76ec5cda68
fix ollama demo ( #450 )
2025-03-26 11:01:32 -07:00
Adil Hafeez
eb48f3d5bb
use passed in model name in chat completion request ( #445 )
2025-03-21 15:56:17 -07:00
Adil Hafeez
84cd1df7bf
add preliminary support for llm agents ( #432 )
2025-03-19 15:21:34 -07:00
Adil Hafeez
ed3845040e
add demo for deepseek ( #426 )
2025-03-05 14:08:06 -08:00
Shuguang Chen
e77fc47225
Handle intent matching better in arch gateway ( #391 )
2025-03-04 12:49:13 -08:00
Adil Hafeez
10cad4d0b7
add health check endpoint for llm gateway ( #420 )
...
* add health check endpoint for llm gateway
* fix rust tests
2025-03-03 13:11:57 -08:00
Adil Hafeez
e40b13be05
Update arch_config and add tests for arch config file ( #407 )
2025-02-14 19:28:10 -08:00
Adil Hafeez
8de6eacfbd
spotify demo with optimized context window code change ( #397 )
2025-02-07 19:14:15 -08:00
Adil Hafeez
2bd61d628c
add ability to specify custom http headers in api endpoint ( #386 )
2025-02-06 11:48:09 -08:00
Adil Hafeez
e82f8f216f
Encode parameter values in http path and ... ( #395 )
...
* Encode parameter values in http path and ...
- don't send param values in request body in http get request
- send param values in http post request
* rust tests
* refactor code
* add tests
2025-02-06 11:00:47 -08:00
Adil Hafeez
a62f906432
remove unused cargo.lock files ( #396 )
2025-02-05 20:25:41 -08:00
Adil Hafeez
39266b5084
log improvements and some code refactor ( #379 )
2025-01-31 10:37:53 -08:00
Adil Hafeez
6887d52750
When using ollama token count was not coming in ( #375 )
...
When using ollama token count was not coming in resulting in token count and other metrics to show up as zero. This was not causing tracing to break.
2025-01-21 18:01:56 -08:00
Adil Hafeez
07ef3149b8
add support for using custom upstream llm ( #365 )
2025-01-17 18:25:55 -08:00
Adil Hafeez
3fc21de60c
Send per prompt target system prompt ( #368 )
...
* update prompt target name after arch_fc has identified tool
* add test for currency exchange
2025-01-16 15:11:37 -08:00