Adil Hafeez
1fdde8181a
release 0.3.10 ( #555 )
2025-08-13 14:50:10 -07:00
Adil Hafeez
5908ca6a16
publish to ghrc ( #553 )
2025-08-13 10:35:26 -07:00
Adil Hafeez
ad4cea227f
release 0.3.9 ( #552 )
2025-08-12 13:43:43 -07:00
Adil Hafeez
2639323dab
release 0.3.8 ( #550 )
2025-08-11 14:12:17 -07:00
Adil Hafeez
202409cc9a
update torch==2.6.0 ( #526 )
2025-08-11 13:23:40 -07:00
Adil Hafeez
ac3fb4cb5b
release 0.3.7 ( #542 )
2025-07-25 19:08:24 -07:00
Adil Hafeez
92a425facd
release 0.3.6 ( #536 )
2025-07-22 12:48:20 -07:00
Adil Hafeez
79a62fffe8
release 0.3.5 ( #534 )
2025-07-21 10:09:22 -07:00
Adil Hafeez
14f19f02a7
release 0.3.4 ( #525 )
...
* release 0.3.4
* update lock file
2025-07-11 17:24:21 -07:00
Adil Hafeez
a7fddf30f9
better model names ( #517 )
2025-07-11 16:42:16 -07:00
Adil Hafeez
5fb7ce576c
release 0.3.3 ( #519 )
2025-07-08 00:59:33 -07:00
Adil Hafeez
7baec20772
release 0.3.2 ( #507 )
2025-06-13 17:02:20 -07:00
Adil Hafeez
2e47d41a8c
Add ARCH_API_KEY in preference based routing demo ( #498 )
2025-05-31 01:52:25 -07:00
Adil Hafeez
ed28bbaf04
release 0.3.1 ( #495 )
2025-05-30 17:47:59 -07:00
Adil Hafeez
fffa837a06
separate out currency exchange and preference based routing ( #491 )
2025-05-30 02:14:37 -07:00
Adil Hafeez
dc271f1f76
release 0.3.0 ( #483 )
2025-05-23 09:52:23 -07:00
Adil Hafeez
27c0f2fdce
Introduce brightstaff a new terminal service for llm routing ( #477 )
2025-05-19 09:59:22 -07:00
Adil Hafeez
9c803f4d69
release 0.2.8 ( #472 )
2025-04-21 17:02:36 -07:00
Adil Hafeez
00fb1be8a0
release 0.2.7 ( #469 )
2025-04-16 13:55:24 -07:00
Adil Hafeez
6d6c03a7e8
fix docker hub release tag source image name ( #468 )
2025-04-16 13:08:43 -07:00
Adil Hafeez
3eb438550a
fix source name for docker images ( #467 )
2025-04-16 12:24:17 -07:00
Adil Hafeez
e17d5fb2eb
test docker rel ( #466 )
2025-04-16 12:18:03 -07:00
Adil Hafeez
3cda4d6b69
fix docker hub tag ( #465 )
2025-04-16 11:46:12 -07:00
Adil Hafeez
ceca553399
fix release image ( #464 )
2025-04-16 11:34:45 -07:00
Salman Paracha
f31aa59fac
fixed issue with groq LLMs that require the openai in the /v1/chat/co… ( #460 )
...
* fixed issue with groq LLMs that require the openai in the /v1/chat/completions path. My first change
* updated the GH actions with keys for Groq
* adding missing groq API keys
* add llama-3.2-3b-preview to the model based on addin groq to the demo
---------
Co-authored-by: Salman Paracha <salmanparacha@MacBook-Pro-261.local>
2025-04-13 14:00:16 -07:00
Adil Hafeez
d2cb1427fb
add hurl tests for currency exchange demo ( #435 )
2025-03-17 14:21:41 -07:00
Adil Hafeez
09c9d3706e
publish docker release images ( #422 )
...
* add release docker publish
* more testing
* fix more
2025-03-04 14:16:03 -08:00
Adil Hafeez
e40b13be05
Update arch_config and add tests for arch config file ( #407 )
2025-02-14 19:28:10 -08:00
Salman Paracha
b3c95a6698
refactor demos ( #398 )
2025-02-07 18:45:42 -08:00
Adil Hafeez
fa089ef32d
Publish both arm64 and amd64 in docker on main ( #392 )
2025-02-04 15:04:09 -08:00
Adil Hafeez
aa11113cea
pin poetry to 1.8.5 ( #358 )
2025-01-06 14:23:46 -08:00
Shuguang Chen
ba7279becb
Use intent model from archfc to pick prompt gateway ( #328 )
2024-12-20 13:25:01 -08:00
Adil Hafeez
33ab24292c
publish docker image on release only ( #284 )
2024-11-18 18:18:46 -08:00
Adil Hafeez
3d3d015aea
publish docker image ( #283 )
2024-11-18 17:55:47 -08:00
Adil Hafeez
d1dd8710a4
release 0.1.2 ( #266 )
2024-11-12 23:56:33 -08:00
Adil Hafeez
d87105882b
update rust toolchain to 1.82 ( #255 )
...
* update rust to 1.82 pin it, also update envoy to 1.32 and python to 3.13
* use python:3.12
2024-11-12 10:35:14 -08:00
Adil Hafeez
e462e393b1
Use large github action machine to run e2e tests ( #230 )
2024-10-30 17:54:51 -07:00
José Ulises Niño Rivera
662a840ac5
Add support for streaming and fixes few issues (see description) ( #202 )
2024-10-28 17:05:06 -07:00
Salman Paracha
708fa15a9b
HR agent demo ( #206 )
...
* commiting my hr_agent branch
* updating the HR agent config
* pushing to remote
* fix hr agent
* committing to merge with main
* updating to merge from main
* updating the demo and model-server-tests to pull from poetry
* updating the poetry.lock files
* updating based on feedback
* updated sysmte prompt for hr_agent
---------
Co-authored-by: Salman Paracha <salmanparacha@MacBook-Pro-261.local>
Co-authored-by: Adil Hafeez <adil@katanemo.com>
2024-10-23 14:32:40 -07:00
Adil Hafeez
c6ba28dfcc
Code refactor and some improvements - see description ( #194 )
2024-10-18 12:53:44 -07:00
CTran
8e54ac20d8
Refactor model server hardware config + add unit tests to load/request to the server ( #189 )
...
* remove mode/hardware
* add test and pre commit hook
* add pytest dependieces
* fix format
* fix lint
* fix precommit
* fix pre commit
* fix pre commit
* fix precommit
* fix precommit
* fix precommit
* fix precommit
* fix precommit
* fix precommit
* fix precommit
* fix precommit
* fix precommit
* fix precommit
2024-10-16 16:58:10 -07:00
Adil Hafeez
3bd2ffe9fb
split wasm filter ( #186 )
...
* split wasm filter
* fix int and unit tests
* rename public_types => common and move common code there
* rename
* fix int test
2024-10-16 14:20:26 -07:00
Adil Hafeez
7d5f760884
Improve cli ( #179 )
2024-10-10 17:44:41 -07:00
Salman Paracha
976b2eaae0
fixing docs so that GH pages picks up the right CNAME for DNS ( #135 )
...
* fixing docs so that GH pages picks up the right CNAME for DNS
* updating workflow to pick CNAME
* making sure to correctly set permissions on the build/html directory
* fixing GH actions issues with CNAME
* updated docker build command to run as local user
* fixed the CNAME issue and udpated GH actions
---------
Co-authored-by: Salman Paracha <salmanparacha@MacBook-Pro-261.local>
2024-10-07 11:03:02 -07:00
Salman Paracha
2a7b95582c
Build arch docs ( #128 )
...
* Build arch docs
The following action would build the docs from main
* triggering a push
* updating the command to run on pull request
* fixed bug
* updated documentation workflow to create a gh-pages branch and update it with the latest build
* fixing the deployment option to mick the right folder
* tested that the pages get build via GH actions, and get published appropriately
---------
Co-authored-by: Salman Paracha <salmanparacha@MacBook-Pro-261.local>
2024-10-06 11:39:22 -07:00
Adil Hafeez
4182879717
add precommit check ( #97 )
...
* add precommit check
* remove check
* Revert "remove check"
This reverts commit 9987b62b9b .
* fix checks
* fix whitespace errors
2024-09-30 14:54:01 -07:00
Adil Hafeez
ea86f73605
rename envoyfilter => arch ( #91 )
...
* rename envoyfilter => arch
* fix more files
* more fixes
* more renames
2024-09-27 16:41:39 -07:00
José Ulises Niño Rivera
370f3bb2c5
Fix bug in PromptGuard configuration ( #80 )
...
Signed-off-by: José Ulises Niño Rivera <junr03@users.noreply.github.com>
2024-09-25 23:36:55 -05:00
Adil Hafeez
87900beddd
Remove OMF ( #78 )
...
* Remove OMF
* remove submodule from github workflow
2024-09-24 15:18:20 -07:00
José Ulises Niño Rivera
a51a467cad
Add initial integration style tests ( #20 )
...
Signed-off-by: José Ulises Niño Rivera <junr03@users.noreply.github.com>
2024-07-25 14:41:36 -07:00