307 commits

Author SHA1 Message Date
Adil Hafeez
3bd2ffe9fb
split wasm filter (#186)
* split wasm filter

* fix int and unit tests

* rename public_types => common and move common code there

* rename

* fix int test
2024-10-16 14:20:26 -07:00
Co Tran
b1746b38b4
concatenate history of user messages for hallucination (#177)
* concatenate history of user messages for hallucination

* add history of messages

* fix gpt to not arch

* add model prefix

* fix

* correct init of user_messages

* fmt

* fix test
2024-10-15 11:43:05 -07:00
Salman Paracha
35c5e303b7
Salmanap/fix discord invite (#185) 2024-10-13 21:01:36 -07:00
Salman Paracha
bfb549b075
ensuring that logo appears in dark mode (#184)
Co-authored-by: Salman Paracha <salmanparacha@MacBook-Pro-261.local>
2024-10-13 20:46:12 -07:00
Salman Paracha
5e22c24f77
minor changes to the README to highlight what Arch is designed to do in a single line (#182)
Co-authored-by: Salman Paracha <salmanparacha@MacBook-Pro-261.local>
2024-10-12 18:02:03 -07:00
Adil Hafeez
f72588c6da
fix lint 2024-10-10 23:34:21 -07:00
Shuguang Chen
11fba23f1f
Update doc (#178)
* Update doc

* Update links
2024-10-10 22:30:54 -07:00
Adil Hafeez
7b51cce2f7
fix demo broken links 2024-10-10 17:52:33 -07:00
Adil Hafeez
50bea9135c
update broken link 2024-10-10 17:50:57 -07:00
Adil Hafeez
7d5f760884
Improve cli (#179) 2024-10-10 17:44:41 -07:00
Aayush
ceca0dba28
fix prometheus target and update dashboard. (#165)
* fix prometheus target and update dashboard.

* Update envoy_overview.json with whitespace at the end
2024-10-10 14:57:20 -07:00
Co Tran
2c45de26e6
fix for linux (#175)
* fix for linux

* fix pre commit

* fix

* fix extra white space

* fix commit
2024-10-10 14:56:23 -07:00
Salman Paracha
639839fbb1
Create LICENSE 2024-10-10 06:30:23 -07:00
Adil Hafeez
7b05f304a1
Set python version 3.10 2024-10-09 23:23:42 -07:00
Adil Hafeez
d3ccddb72c
update llm router port value 2024-10-09 21:39:55 -07:00
Adil Hafeez
c0f0c22fb4
update access logs docs (#170) 2024-10-09 21:37:21 -07:00
Adil Hafeez
2b501d10bd
update lock file 2024-10-09 21:01:12 -07:00
Salman Paracha
95a0f1be5b
updated archgw cli to pull from archgw_modelserver from pypi (#169)
* updated archgw cli to pull from archgw_modelserver from pypi

* fix image name

* update rev

---------

Co-authored-by: Salman Paracha <salmanparacha@MacBook-Pro-261.local>
Co-authored-by: Adil Hafeez <adil@katanemo.com>
2024-10-09 21:00:26 -07:00
Adil Hafeez
6b70768170
make ratelimit section optional (#168) 2024-10-09 19:53:00 -07:00
Co Tran
f9e3a052fc
change nli model (#167)
* change nli model

* Fix bug in hallucination

---------

Co-authored-by: Shuguang Chen <54548843+nehcgs@users.noreply.github.com>
2024-10-09 19:10:08 -07:00
Shuguang Chen
3b7c58698f
Update model_server (#164)
* Update model server

* Delete model_server/.vscode/settings.json

* Update loader.py

* Fix errors

* Update log mode
2024-10-09 18:04:52 -07:00
Salman Paracha
b8d2756ff7
updated README.md 2024-10-09 17:46:55 -07:00
Adil Hafeez
71cdf69f77
dont send default target to archfc (#166) 2024-10-09 17:43:02 -07:00
Salman Paracha
dc3a9813c3
fixed function calling arch config yaml 2024-10-09 17:10:54 -07:00
Adil Hafeez
3e9327cf36
fix bug in jinja template for tracing 2024-10-09 16:44:50 -07:00
Salman Paracha
0ed88def8f
updated settuptools packages 2024-10-09 16:31:35 -07:00
Adil Hafeez
6991fbb7a7
rename 2024-10-09 16:24:40 -07:00
Adil Hafeez
c254dfb16a
update cli and update docs (#161)
* add services to cli

* more changes
2024-10-09 16:22:27 -07:00
Salman Paracha
1acf43ff7a
fixed cli to use poetry as well. this way we make it easy to have the… (#160) 2024-10-09 15:53:12 -07:00
Adil Hafeez
e81ca8d5cf
llm listener split (#155) 2024-10-09 15:47:32 -07:00
Co Tran
8b5db45507
Fix gpu dependency and only leverage onnx when GPU is available (#157)
* replacing appending instead of write

* fix eetq dependency

* gpu guard required eetq

* fix bug when gpu is available

* fix for gpu device

* reverse

* fix

* replace gpu -> cuda
2024-10-09 11:42:05 -07:00
Co Tran
5c4a6bc8ff
lint + formating with black (#158)
* lint + formating with black

* add black as pre commit
2024-10-09 11:25:07 -07:00
Salman Paracha
498e7f9724
minor fixes to README (#156)
Co-authored-by: Salman Paracha <salmanparacha@MacBook-Pro-261.local>
2024-10-09 10:12:36 -07:00
Salman Paracha
42d4a28e13
updated all demo READMes and minor doc changes (#154)
* updated all demo READMes and minor doc changes

* minor typo fixes

* updated main Readme

* fixed README and docs

* fixed README and docs

---------

Co-authored-by: Salman Paracha <salmanparacha@MacBook-Pro-261.local>
2024-10-08 23:58:55 -07:00
Salman Paracha
b63a01fe82
Salmanap/fix network agent demo (#153)
* staging my changes to re-based from main

* adding debug statements to rust

* merged with main

* ready to push network agent

* removed the incomplete sql example

---------

Co-authored-by: Salman Paracha <salmanparacha@MacBook-Pro-261.local>
2024-10-08 22:19:20 -07:00
Adil Hafeez
6acfea7787
bug fix - send all parameter irrespective of type
earlier we were only sending parameter if the type is string
2024-10-08 20:28:32 -07:00
Adil Hafeez
47c9c0aafc
fix lock file (#151) 2024-10-08 18:16:00 -07:00
Adil Hafeez
3b26b16fc8
add days and units to api server (#150)
* add days and units to api server

* add more stuff

* fix more
2024-10-08 18:14:06 -07:00
Adil Hafeez
e08d406be5
Update README.md 2024-10-08 17:20:51 -07:00
Adil Hafeez
ede125a4f3
ensure that tracing is optional in arch_config (#149) 2024-10-08 17:15:40 -07:00
Co Tran
e62c6e75ea
fix dependcy + logg info (#148) 2024-10-08 16:42:40 -07:00
Adil Hafeez
285aa1419b
Split listener (#141) 2024-10-08 16:24:08 -07:00
Co Tran
22bc3d2798
Cotran/prompt guard doc (#147)
* repalce prompt injection with jailbreak and removing toxc

* repalce prompt injection with jailbreak and removing toxc
2024-10-08 15:58:50 -07:00
Adil Hafeez
fab71abdac
update readme for filter 2024-10-08 15:32:49 -07:00
Co Tran
80d2229053
Cotran/onnx conversion (#145)
* onnx replacement

* onnx conversion for nli and embedding model

* fix naming

* fix naming

* fix naming

* pin version
2024-10-08 14:37:48 -07:00
Shuguang Chen
b30ad791f7
Fix errors and improve Doc (#143)
* Fix link issues and add icons

* Improve Doc

* fix test

* making minor modifications to shuguangs' doc changes

---------

Co-authored-by: Salman Paracha <salmanparacha@MacBook-Pro-261.local>
Co-authored-by: Adil Hafeez <adil@katanemo.com>
2024-10-08 13:18:34 -07:00
Salman Paracha
3ed50e61d2
ensure that we can call the new api.fc.archgw.com url, logging fixes … (#142)
* ensure that we can call the new api.fc.archgw.com url, logging fixes and minor cli bug fixes

* fixed a bug where model_server printed on terminal after start script stopped running

* updating the logo and fixing the website styles

* updated the branch with feedback from Co and Adil

---------

Co-authored-by: Salman Paracha <salmanparacha@MacBook-Pro-261.local>
2024-10-08 12:40:24 -07:00
Salman Paracha
82fc91495e
Salmanap/fix demos (#140)
* Comitting to bring in main

* insurance agent updated

* updated the insurance agent

---------

Co-authored-by: Salman Paracha <salmanparacha@MacBook-Pro-261.local>
2024-10-07 20:27:07 -07:00
Co Tran
b1fa127704
Hallucination integration with rust (#122) 2024-10-07 18:38:55 -07:00
Adil Hafeez
43dc2a0a73
system prompt (keep system prompt at the top) (#139)
* add system prompt

* fix
2024-10-07 17:50:07 -07:00