Commit graph

58 commits

Author SHA1 Message Date
Adil Hafeez
1fdde8181a
release 0.3.10 (#555) 2025-08-13 14:50:10 -07:00
Adil Hafeez
5908ca6a16
publish to ghrc (#553) 2025-08-13 10:35:26 -07:00
Adil Hafeez
ad4cea227f
release 0.3.9 (#552) 2025-08-12 13:43:43 -07:00
Adil Hafeez
2639323dab
release 0.3.8 (#550) 2025-08-11 14:12:17 -07:00
Adil Hafeez
202409cc9a
update torch==2.6.0 (#526) 2025-08-11 13:23:40 -07:00
Adil Hafeez
ac3fb4cb5b
release 0.3.7 (#542) 2025-07-25 19:08:24 -07:00
Adil Hafeez
92a425facd
release 0.3.6 (#536) 2025-07-22 12:48:20 -07:00
Adil Hafeez
79a62fffe8
release 0.3.5 (#534) 2025-07-21 10:09:22 -07:00
Adil Hafeez
14f19f02a7
release 0.3.4 (#525)
* release 0.3.4

* update lock file
2025-07-11 17:24:21 -07:00
Adil Hafeez
a7fddf30f9
better model names (#517) 2025-07-11 16:42:16 -07:00
Adil Hafeez
5fb7ce576c
release 0.3.3 (#519) 2025-07-08 00:59:33 -07:00
Adil Hafeez
7baec20772
release 0.3.2 (#507) 2025-06-13 17:02:20 -07:00
Adil Hafeez
2e47d41a8c
Add ARCH_API_KEY in preference based routing demo (#498) 2025-05-31 01:52:25 -07:00
Adil Hafeez
ed28bbaf04
release 0.3.1 (#495) 2025-05-30 17:47:59 -07:00
Adil Hafeez
fffa837a06
separate out currency exchange and preference based routing (#491) 2025-05-30 02:14:37 -07:00
Adil Hafeez
dc271f1f76
release 0.3.0 (#483) 2025-05-23 09:52:23 -07:00
Adil Hafeez
27c0f2fdce
Introduce brightstaff a new terminal service for llm routing (#477) 2025-05-19 09:59:22 -07:00
Adil Hafeez
9c803f4d69
release 0.2.8 (#472) 2025-04-21 17:02:36 -07:00
Adil Hafeez
00fb1be8a0
release 0.2.7 (#469) 2025-04-16 13:55:24 -07:00
Adil Hafeez
6d6c03a7e8
fix docker hub release tag source image name (#468) 2025-04-16 13:08:43 -07:00
Adil Hafeez
3eb438550a
fix source name for docker images (#467) 2025-04-16 12:24:17 -07:00
Adil Hafeez
e17d5fb2eb
test docker rel (#466) 2025-04-16 12:18:03 -07:00
Adil Hafeez
3cda4d6b69
fix docker hub tag (#465) 2025-04-16 11:46:12 -07:00
Adil Hafeez
ceca553399
fix release image (#464) 2025-04-16 11:34:45 -07:00
Salman Paracha
f31aa59fac
fixed issue with groq LLMs that require the openai in the /v1/chat/co… (#460)
* fixed issue with groq LLMs that require the openai in the /v1/chat/completions path. My first change

* updated the GH actions with keys for Groq

* adding missing groq API keys

* add llama-3.2-3b-preview to the model based on addin groq to the demo

---------

Co-authored-by: Salman Paracha <salmanparacha@MacBook-Pro-261.local>
2025-04-13 14:00:16 -07:00
Adil Hafeez
d2cb1427fb
add hurl tests for currency exchange demo (#435) 2025-03-17 14:21:41 -07:00
Adil Hafeez
09c9d3706e
publish docker release images (#422)
* add release docker publish

* more testing

* fix more
2025-03-04 14:16:03 -08:00
Adil Hafeez
e40b13be05
Update arch_config and add tests for arch config file (#407) 2025-02-14 19:28:10 -08:00
Salman Paracha
b3c95a6698
refactor demos (#398) 2025-02-07 18:45:42 -08:00
Adil Hafeez
fa089ef32d
Publish both arm64 and amd64 in docker on main (#392) 2025-02-04 15:04:09 -08:00
Adil Hafeez
aa11113cea
pin poetry to 1.8.5 (#358) 2025-01-06 14:23:46 -08:00
Shuguang Chen
ba7279becb
Use intent model from archfc to pick prompt gateway (#328) 2024-12-20 13:25:01 -08:00
Adil Hafeez
33ab24292c
publish docker image on release only (#284) 2024-11-18 18:18:46 -08:00
Adil Hafeez
3d3d015aea
publish docker image (#283) 2024-11-18 17:55:47 -08:00
Adil Hafeez
d1dd8710a4
release 0.1.2 (#266) 2024-11-12 23:56:33 -08:00
Adil Hafeez
d87105882b
update rust toolchain to 1.82 (#255)
* update rust to 1.82 pin it, also update envoy to 1.32 and python to 3.13

* use python:3.12
2024-11-12 10:35:14 -08:00
Adil Hafeez
e462e393b1
Use large github action machine to run e2e tests (#230) 2024-10-30 17:54:51 -07:00
José Ulises Niño Rivera
662a840ac5
Add support for streaming and fixes few issues (see description) (#202) 2024-10-28 17:05:06 -07:00
Salman Paracha
708fa15a9b
HR agent demo (#206)
* commiting my hr_agent branch

* updating the HR agent config

* pushing to remote

* fix hr agent

* committing to merge with main

* updating to merge from main

* updating the demo and model-server-tests to pull from poetry

* updating the poetry.lock files

* updating based on feedback

* updated sysmte prompt for hr_agent

---------

Co-authored-by: Salman Paracha <salmanparacha@MacBook-Pro-261.local>
Co-authored-by: Adil Hafeez <adil@katanemo.com>
2024-10-23 14:32:40 -07:00
Adil Hafeez
c6ba28dfcc
Code refactor and some improvements - see description (#194) 2024-10-18 12:53:44 -07:00
CTran
8e54ac20d8
Refactor model server hardware config + add unit tests to load/request to the server (#189)
* remove mode/hardware

* add test and pre commit hook

* add pytest dependieces

* fix format

* fix lint

* fix precommit

* fix pre commit

* fix pre commit

* fix precommit

* fix precommit

* fix precommit

* fix precommit

* fix precommit

* fix precommit

* fix precommit

* fix precommit

* fix precommit

* fix precommit
2024-10-16 16:58:10 -07:00
Adil Hafeez
3bd2ffe9fb
split wasm filter (#186)
* split wasm filter

* fix int and unit tests

* rename public_types => common and move common code there

* rename

* fix int test
2024-10-16 14:20:26 -07:00
Adil Hafeez
7d5f760884
Improve cli (#179) 2024-10-10 17:44:41 -07:00
Salman Paracha
976b2eaae0
fixing docs so that GH pages picks up the right CNAME for DNS (#135)
* fixing docs so that GH pages picks up the right CNAME for DNS

* updating workflow to pick CNAME

* making sure to correctly set permissions on the build/html directory

* fixing GH actions issues with CNAME

* updated docker build command to run as local user

* fixed the CNAME issue and udpated GH actions

---------

Co-authored-by: Salman Paracha <salmanparacha@MacBook-Pro-261.local>
2024-10-07 11:03:02 -07:00
Salman Paracha
2a7b95582c
Build arch docs (#128)
* Build arch docs

The following action would build the docs from main

* triggering a push

* updating the command to run on pull request

* fixed bug

* updated documentation workflow to create a gh-pages branch and update it with the latest build

* fixing the deployment option to mick the right folder

* tested that the pages get build via GH actions, and get published appropriately

---------

Co-authored-by: Salman Paracha <salmanparacha@MacBook-Pro-261.local>
2024-10-06 11:39:22 -07:00
Adil Hafeez
4182879717
add precommit check (#97)
* add precommit check

* remove check

* Revert "remove check"

This reverts commit 9987b62b9b.

* fix checks

* fix whitespace errors
2024-09-30 14:54:01 -07:00
Adil Hafeez
ea86f73605
rename envoyfilter => arch (#91)
* rename envoyfilter => arch

* fix more files

* more fixes

* more renames
2024-09-27 16:41:39 -07:00
José Ulises Niño Rivera
370f3bb2c5
Fix bug in PromptGuard configuration (#80)
Signed-off-by: José Ulises Niño Rivera <junr03@users.noreply.github.com>
2024-09-25 23:36:55 -05:00
Adil Hafeez
87900beddd
Remove OMF (#78)
* Remove OMF

* remove submodule from github workflow
2024-09-24 15:18:20 -07:00
José Ulises Niño Rivera
a51a467cad
Add initial integration style tests (#20)
Signed-off-by: José Ulises Niño Rivera <junr03@users.noreply.github.com>
2024-07-25 14:41:36 -07:00