Commit graph

44 commits

Author SHA1 Message Date
Adil Hafeez
fffa837a06
separate out currency exchange and preference based routing (#491) 2025-05-30 02:14:37 -07:00
Adil Hafeez
dc271f1f76
release 0.3.0 (#483) 2025-05-23 09:52:23 -07:00
Adil Hafeez
27c0f2fdce
Introduce brightstaff a new terminal service for llm routing (#477) 2025-05-19 09:59:22 -07:00
Adil Hafeez
9c803f4d69
release 0.2.8 (#472) 2025-04-21 17:02:36 -07:00
Adil Hafeez
00fb1be8a0
release 0.2.7 (#469) 2025-04-16 13:55:24 -07:00
Adil Hafeez
6d6c03a7e8
fix docker hub release tag source image name (#468) 2025-04-16 13:08:43 -07:00
Adil Hafeez
3eb438550a
fix source name for docker images (#467) 2025-04-16 12:24:17 -07:00
Adil Hafeez
e17d5fb2eb
test docker rel (#466) 2025-04-16 12:18:03 -07:00
Adil Hafeez
3cda4d6b69
fix docker hub tag (#465) 2025-04-16 11:46:12 -07:00
Adil Hafeez
ceca553399
fix release image (#464) 2025-04-16 11:34:45 -07:00
Salman Paracha
f31aa59fac
fixed issue with groq LLMs that require the openai in the /v1/chat/co… (#460)
* fixed issue with groq LLMs that require the openai in the /v1/chat/completions path. My first change

* updated the GH actions with keys for Groq

* adding missing groq API keys

* add llama-3.2-3b-preview to the model based on adding groq to the demo

---------

Co-authored-by: Salman Paracha <salmanparacha@MacBook-Pro-261.local>
2025-04-13 14:00:16 -07:00
Adil Hafeez
d2cb1427fb
add hurl tests for currency exchange demo (#435) 2025-03-17 14:21:41 -07:00
Adil Hafeez
09c9d3706e
publish docker release images (#422)
* add release docker publish

* more testing

* fix more
2025-03-04 14:16:03 -08:00
Adil Hafeez
e40b13be05
Update arch_config and add tests for arch config file (#407) 2025-02-14 19:28:10 -08:00
Salman Paracha
b3c95a6698
refactor demos (#398) 2025-02-07 18:45:42 -08:00
Adil Hafeez
fa089ef32d
Publish both arm64 and amd64 in docker on main (#392) 2025-02-04 15:04:09 -08:00
Adil Hafeez
aa11113cea
pin poetry to 1.8.5 (#358) 2025-01-06 14:23:46 -08:00
Shuguang Chen
ba7279becb
Use intent model from archfc to pick prompt gateway (#328) 2024-12-20 13:25:01 -08:00
Adil Hafeez
33ab24292c
publish docker image on release only (#284) 2024-11-18 18:18:46 -08:00
Adil Hafeez
3d3d015aea
publish docker image (#283) 2024-11-18 17:55:47 -08:00
Adil Hafeez
d1dd8710a4
release 0.1.2 (#266) 2024-11-12 23:56:33 -08:00
Adil Hafeez
d87105882b
update rust toolchain to 1.82 (#255)
* update rust to 1.82 pin it, also update envoy to 1.32 and python to 3.13

* use python:3.12
2024-11-12 10:35:14 -08:00
Adil Hafeez
e462e393b1
Use large github action machine to run e2e tests (#230) 2024-10-30 17:54:51 -07:00
José Ulises Niño Rivera
662a840ac5
Add support for streaming and fixes few issues (see description) (#202) 2024-10-28 17:05:06 -07:00
Salman Paracha
708fa15a9b
HR agent demo (#206)
* committing my hr_agent branch

* updating the HR agent config

* pushing to remote

* fix hr agent

* committing to merge with main

* updating to merge from main

* updating the demo and model-server-tests to pull from poetry

* updating the poetry.lock files

* updating based on feedback

* updated system prompt for hr_agent

---------

Co-authored-by: Salman Paracha <salmanparacha@MacBook-Pro-261.local>
Co-authored-by: Adil Hafeez <adil@katanemo.com>
2024-10-23 14:32:40 -07:00
Adil Hafeez
c6ba28dfcc
Code refactor and some improvements - see description (#194) 2024-10-18 12:53:44 -07:00
CTran
8e54ac20d8
Refactor model server hardware config + add unit tests to load/request to the server (#189)
* remove mode/hardware

* add test and pre commit hook

* add pytest dependencies

* fix format

* fix lint

* fix precommit

* fix pre commit

* fix pre commit

* fix precommit

* fix precommit

* fix precommit

* fix precommit

* fix precommit

* fix precommit

* fix precommit

* fix precommit

* fix precommit

* fix precommit
2024-10-16 16:58:10 -07:00
Adil Hafeez
3bd2ffe9fb
split wasm filter (#186)
* split wasm filter

* fix int and unit tests

* rename public_types => common and move common code there

* rename

* fix int test
2024-10-16 14:20:26 -07:00
Adil Hafeez
7d5f760884
Improve cli (#179) 2024-10-10 17:44:41 -07:00
Salman Paracha
976b2eaae0
fixing docs so that GH pages picks up the right CNAME for DNS (#135)
* fixing docs so that GH pages picks up the right CNAME for DNS

* updating workflow to pick CNAME

* making sure to correctly set permissions on the build/html directory

* fixing GH actions issues with CNAME

* updated docker build command to run as local user

* fixed the CNAME issue and updated GH actions

---------

Co-authored-by: Salman Paracha <salmanparacha@MacBook-Pro-261.local>
2024-10-07 11:03:02 -07:00
Salman Paracha
2a7b95582c
Build arch docs (#128)
* Build arch docs

The following action would build the docs from main

* triggering a push

* updating the command to run on pull request

* fixed bug

* updated documentation workflow to create a gh-pages branch and update it with the latest build

* fixing the deployment option to pick the right folder

* tested that the pages get build via GH actions, and get published appropriately

---------

Co-authored-by: Salman Paracha <salmanparacha@MacBook-Pro-261.local>
2024-10-06 11:39:22 -07:00
Adil Hafeez
4182879717
add precommit check (#97)
* add precommit check

* remove check

* Revert "remove check"

This reverts commit 9987b62b9b.

* fix checks

* fix whitespace errors
2024-09-30 14:54:01 -07:00
Adil Hafeez
ea86f73605
rename envoyfilter => arch (#91)
* rename envoyfilter => arch

* fix more files

* more fixes

* more renames
2024-09-27 16:41:39 -07:00
José Ulises Niño Rivera
370f3bb2c5
Fix bug in PromptGuard configuration (#80)
Signed-off-by: José Ulises Niño Rivera <junr03@users.noreply.github.com>
2024-09-25 23:36:55 -05:00
Adil Hafeez
87900beddd
Remove OMF (#78)
* Remove OMF

* remove submodule from github workflow
2024-09-24 15:18:20 -07:00
José Ulises Niño Rivera
a51a467cad
Add initial integration style tests (#20)
Signed-off-by: José Ulises Niño Rivera <junr03@users.noreply.github.com>
2024-07-25 14:41:36 -07:00
Adil Hafeez
cad38295bf
Use open-message-format to serialize and deserialize embeddings api (#18)
* Use open-message-format to serialize and deserialize embeddings api
2024-07-23 11:56:49 -07:00
Aayush
31c4ac267a
Add testing to GitHub workflow (#15)
* update workflow and pre-commit to test only unit tests

* Update .github/workflows/checks.yml

Co-authored-by: Adil Hafeez <adil@katanemo.com>

* Update .pre-commit-config.yaml

Co-authored-by: Adil Hafeez <adil@katanemo.com>

* update formatting of comments

* Update .pre-commit-config.yaml

* Update checks.yml

---------

Co-authored-by: Adil Hafeez <adil@katanemo.com>
2024-07-18 14:20:33 -07:00
Aayush
f5d6cfbcc0
ensure that linting check fails on warning as well as removing cargo check since it is included in clippy. Updated name back to static analysis due to removal of check. 2024-07-15 14:43:13 -07:00
Aayush
a5661d9f52
update name for checks from Static Analysis to Checks on PR 2024-07-15 11:49:04 -07:00
Aayush
8993853efb
remove check on push, just have check on pr 2024-07-15 11:42:06 -07:00
Aayush
b59588c68e
open the directory before running check 2024-07-15 11:36:16 -07:00
Aayush
5dc8acd8f1
open directory after checking out code 2024-07-15 11:31:56 -07:00
Aayush
b7273eab55
move folder to correct spot and open correct directory in workflow 2024-07-15 11:29:56 -07:00