Shuguang Chen
11fba23f1f
Update doc ( #178 )
...
* Update doc
* Update links
2024-10-10 22:30:54 -07:00
Adil Hafeez
7b51cce2f7
fix demo broken links
2024-10-10 17:52:33 -07:00
Adil Hafeez
50bea9135c
update broken link
2024-10-10 17:50:57 -07:00
Adil Hafeez
7d5f760884
Improve cli ( #179 )
2024-10-10 17:44:41 -07:00
Aayush
ceca0dba28
fix prometheus target and update dashboard. ( #165 )
...
* fix prometheus target and update dashboard.
* Update envoy_overview.json with whitespace at the end
2024-10-10 14:57:20 -07:00
Co Tran
2c45de26e6
fix for linux ( #175 )
...
* fix for linux
* fix pre commit
* fix
* fix extra white space
* fix commit
2024-10-10 14:56:23 -07:00
Salman Paracha
639839fbb1
Create LICENSE
2024-10-10 06:30:23 -07:00
Adil Hafeez
7b05f304a1
Set python version 3.10
2024-10-09 23:23:42 -07:00
Adil Hafeez
d3ccddb72c
update llm router port value
2024-10-09 21:39:55 -07:00
Adil Hafeez
c0f0c22fb4
update access logs docs ( #170 )
2024-10-09 21:37:21 -07:00
Adil Hafeez
2b501d10bd
update lock file
2024-10-09 21:01:12 -07:00
Salman Paracha
95a0f1be5b
updated archgw cli to pull from archgw_modelserver from pypi ( #169 )
...
* updated archgw cli to pull from archgw_modelserver from pypi
* fix image name
* update rev
---------
Co-authored-by: Salman Paracha <salmanparacha@MacBook-Pro-261.local>
Co-authored-by: Adil Hafeez <adil@katanemo.com>
2024-10-09 21:00:26 -07:00
Adil Hafeez
6b70768170
make ratelimit section optional ( #168 )
2024-10-09 19:53:00 -07:00
Co Tran
f9e3a052fc
change nli model ( #167 )
...
* change nli model
* Fix bug in hallucination
---------
Co-authored-by: Shuguang Chen <54548843+nehcgs@users.noreply.github.com>
2024-10-09 19:10:08 -07:00
Shuguang Chen
3b7c58698f
Update model_server ( #164 )
...
* Update model server
* Delete model_server/.vscode/settings.json
* Update loader.py
* Fix errors
* Update log mode
2024-10-09 18:04:52 -07:00
Salman Paracha
b8d2756ff7
updated README.md
2024-10-09 17:46:55 -07:00
Adil Hafeez
71cdf69f77
dont send default target to archfc ( #166 )
2024-10-09 17:43:02 -07:00
Salman Paracha
dc3a9813c3
fixed function calling arch config yaml
2024-10-09 17:10:54 -07:00
Adil Hafeez
3e9327cf36
fix bug in jinja template for tracing
2024-10-09 16:44:50 -07:00
Salman Paracha
0ed88def8f
updated settuptools packages
2024-10-09 16:31:35 -07:00
Adil Hafeez
6991fbb7a7
rename
2024-10-09 16:24:40 -07:00
Adil Hafeez
c254dfb16a
update cli and update docs ( #161 )
...
* add services to cli
* more changes
2024-10-09 16:22:27 -07:00
Salman Paracha
1acf43ff7a
fixed cli to use poetry as well. this way we make it easy to have the… ( #160 )
2024-10-09 15:53:12 -07:00
Adil Hafeez
e81ca8d5cf
llm listener split ( #155 )
2024-10-09 15:47:32 -07:00
Co Tran
8b5db45507
Fix gpu dependency and only leverage onnx when GPU is available ( #157 )
...
* replacing appending instead of write
* fix eetq dependency
* gpu guard required eetq
* fix bug when gpu is available
* fix for gpu device
* reverse
* fix
* replace gpu -> cuda
2024-10-09 11:42:05 -07:00
Co Tran
5c4a6bc8ff
lint + formating with black ( #158 )
...
* lint + formating with black
* add black as pre commit
2024-10-09 11:25:07 -07:00
Salman Paracha
498e7f9724
minor fixes to README ( #156 )
...
Co-authored-by: Salman Paracha <salmanparacha@MacBook-Pro-261.local>
2024-10-09 10:12:36 -07:00
Salman Paracha
42d4a28e13
updated all demo READMes and minor doc changes ( #154 )
...
* updated all demo READMes and minor doc changes
* minor typo fixes
* updated main Readme
* fixed README and docs
* fixed README and docs
---------
Co-authored-by: Salman Paracha <salmanparacha@MacBook-Pro-261.local>
2024-10-08 23:58:55 -07:00
Salman Paracha
b63a01fe82
Salmanap/fix network agent demo ( #153 )
...
* staging my changes to re-based from main
* adding debug statements to rust
* merged with main
* ready to push network agent
* removed the incomplete sql example
---------
Co-authored-by: Salman Paracha <salmanparacha@MacBook-Pro-261.local>
2024-10-08 22:19:20 -07:00
Adil Hafeez
6acfea7787
bug fix - send all parameter irrespective of type
...
earlier we were only sending parameter if the type is string
2024-10-08 20:28:32 -07:00
Adil Hafeez
47c9c0aafc
fix lock file ( #151 )
2024-10-08 18:16:00 -07:00
Adil Hafeez
3b26b16fc8
add days and units to api server ( #150 )
...
* add days and units to api server
* add more stuff
* fix more
2024-10-08 18:14:06 -07:00
Adil Hafeez
e08d406be5
Update README.md
2024-10-08 17:20:51 -07:00
Adil Hafeez
ede125a4f3
ensure that tracing is optional in arch_config ( #149 )
2024-10-08 17:15:40 -07:00
Co Tran
e62c6e75ea
fix dependcy + logg info ( #148 )
2024-10-08 16:42:40 -07:00
Adil Hafeez
285aa1419b
Split listener ( #141 )
2024-10-08 16:24:08 -07:00
Co Tran
22bc3d2798
Cotran/prompt guard doc ( #147 )
...
* repalce prompt injection with jailbreak and removing toxc
* repalce prompt injection with jailbreak and removing toxc
2024-10-08 15:58:50 -07:00
Adil Hafeez
fab71abdac
update readme for filter
2024-10-08 15:32:49 -07:00
Co Tran
80d2229053
Cotran/onnx conversion ( #145 )
...
* onnx replacement
* onnx conversion for nli and embedding model
* fix naming
* fix naming
* fix naming
* pin version
2024-10-08 14:37:48 -07:00
Shuguang Chen
b30ad791f7
Fix errors and improve Doc ( #143 )
...
* Fix link issues and add icons
* Improve Doc
* fix test
* making minor modifications to shuguangs' doc changes
---------
Co-authored-by: Salman Paracha <salmanparacha@MacBook-Pro-261.local>
Co-authored-by: Adil Hafeez <adil@katanemo.com>
2024-10-08 13:18:34 -07:00
Salman Paracha
3ed50e61d2
ensure that we can call the new api.fc.archgw.com url, logging fixes … ( #142 )
...
* ensure that we can call the new api.fc.archgw.com url, logging fixes and minor cli bug fixes
* fixed a bug where model_server printed on terminal after start script stopped running
* updating the logo and fixing the website styles
* updated the branch with feedback from Co and Adil
---------
Co-authored-by: Salman Paracha <salmanparacha@MacBook-Pro-261.local>
2024-10-08 12:40:24 -07:00
Salman Paracha
82fc91495e
Salmanap/fix demos ( #140 )
...
* Comitting to bring in main
* insurance agent updated
* updated the insurance agent
---------
Co-authored-by: Salman Paracha <salmanparacha@MacBook-Pro-261.local>
2024-10-07 20:27:07 -07:00
Co Tran
b1fa127704
Hallucination integration with rust ( #122 )
2024-10-07 18:38:55 -07:00
Adil Hafeez
43dc2a0a73
system prompt (keep system prompt at the top) ( #139 )
...
* add system prompt
* fix
2024-10-07 17:50:07 -07:00
Adil Hafeez
422efd3887
add system prompt ( #138 )
2024-10-07 17:25:37 -07:00
José Ulises Niño Rivera
c1cfbcd44d
Implement Client trait for StreamContext ( #134 )
...
Signed-off-by: José Ulises Niño Rivera <junr03@users.noreply.github.com>
2024-10-07 16:50:15 -07:00
Shuguang Chen
5bfccd3959
Update arch_config.yaml ( #137 )
2024-10-07 16:01:12 -07:00
Co Tran
93abe553e3
formating and mointoring change ( #136 )
2024-10-07 15:21:05 -07:00
Salman Paracha
976b2eaae0
fixing docs so that GH pages picks up the right CNAME for DNS ( #135 )
...
* fixing docs so that GH pages picks up the right CNAME for DNS
* updating workflow to pick CNAME
* making sure to correctly set permissions on the build/html directory
* fixing GH actions issues with CNAME
* updated docker build command to run as local user
* fixed the CNAME issue and udpated GH actions
---------
Co-authored-by: Salman Paracha <salmanparacha@MacBook-Pro-261.local>
2024-10-07 11:03:02 -07:00
Adil Hafeez
96686dc606
Serialize tool calls for Arch FC ( #131 )
...
* Serialize tool calls
* fix int tests
2024-10-07 00:03:25 -07:00