Shuguang Chen
5f3aff4922
Update chatbot UI and update hallucination check ( #218 )
...
* update chatbot UI
* Update docker-compose for demos
* Fix bugs
* fix for emtadata (#219 )
* fix for emtadata
* fix
* revert
* merge main
---------
Co-authored-by: CTran <cotran2@utexas.edu>
2024-10-24 14:11:53 -07:00
Azib Farooq
05f0491f76
updated key name ( #211 )
2024-10-23 21:02:24 -07:00
Ikko Eltociear Ashimine
87ce0b1be0
docs: update README.md ( #220 )
...
vist -> visit
2024-10-23 20:37:26 -07:00
Salman Paracha
7a5852b401
fixing discord link and moving contributing guide to root ( #215 )
...
Co-authored-by: Salman Paracha <salmanparacha@MacBook-Pro-261.local>
2024-10-23 15:45:49 -07:00
Salman Paracha
708fa15a9b
HR agent demo ( #206 )
...
* commiting my hr_agent branch
* updating the HR agent config
* pushing to remote
* fix hr agent
* committing to merge with main
* updating to merge from main
* updating the demo and model-server-tests to pull from poetry
* updating the poetry.lock files
* updating based on feedback
* updated sysmte prompt for hr_agent
---------
Co-authored-by: Salman Paracha <salmanparacha@MacBook-Pro-261.local>
Co-authored-by: Adil Hafeez <adil@katanemo.com>
2024-10-23 14:32:40 -07:00
CTran
8495f89fda
Cotran/hallucination ( #208 )
2024-10-22 12:52:01 -07:00
Adil Hafeez
ea76d85b43
Improve logging ( #209 )
...
* improve logging
* fix int tests
* better
* fix more logs
* fix more
* fix int
2024-10-22 12:07:40 -07:00
Adil Hafeez
2f374df034
refactor prompt gateway ( #204 )
2024-10-21 15:04:15 -07:00
Adil Hafeez
dced8a5708
Add separate util for hallucination and add tests for it ( #203 )
2024-10-18 19:34:17 -07:00
Adil Hafeez
faf64960df
update observability and dashboards ( #198 )
2024-10-18 15:07:49 -07:00
Adil Hafeez
f189d5703b
update .dockerignore file after filter move
2024-10-18 14:44:39 -07:00
Adil Hafeez
dd1c7be706
Pass tool call and app function response back in metadata ( #193 )
2024-10-18 13:25:39 -07:00
José Ulises Niño Rivera
62a000036e
Update arch Dockerfile ( #200 )
2024-10-18 16:15:19 -04:00
Adil Hafeez
1719b7d5f8
Send back developer error correctly ( #195 )
2024-10-18 13:14:18 -07:00
Adil Hafeez
28421353fd
Update vscode workspce ( #199 )
...
- add recommended extensions
- set python interpreter path for all python projects to be venv/bin/python
- update project structure in workspace
- rename project file from gatewa -> archgw
2024-10-18 12:57:58 -07:00
Adil Hafeez
c6ba28dfcc
Code refactor and some improvements - see description ( #194 )
2024-10-18 12:53:44 -07:00
José Ulises Niño Rivera
aa30353c85
Add cargo workspace to allow rust-analyzer to work correctly ( #197 )
...
Signed-off-by: José Ulises Niño Rivera <junr03@users.noreply.github.com>
2024-10-18 15:44:52 -04:00
Salman Paracha
6fb63510b3
fix cli models and logs ( #196 )
...
* removing unnecessar setup.py files
* updated the cli for debug and access logs
* ran the pre-commit locally to fix pull request
* fixed bug where if archgw_process is None we didn't handle it gracefully
* Apply suggestions from code review
Co-authored-by: Adil Hafeez <adil@katanemo.com>
* fixed changes based on PR
* fixed version not found message
* fixed message based on PR feedback
* adding poetry lock
* fixed pre-commit
---------
Co-authored-by: Salman Paracha <salmanparacha@MacBook-Pro-261.local>
Co-authored-by: Adil Hafeez <adil@katanemo.com>
2024-10-18 12:09:45 -07:00
Adil Hafeez
6cd05572c4
update lock file ( #192 )
...
```
Installing dependencies from lock file
pyproject.toml changed significantly since poetry.lock was last generated. Run `poetry lock [--no-update]` to fix the lock file.
Error installing model server dependencies: Command '['poetry', 'install', '--no-cache']' returned non-zero exit status 1.
```
2024-10-17 10:42:15 -07:00
Adil Hafeez
21e7fe2cef
Split arch wasm filter code into prompt and llm gateway filters ( #190 )
2024-10-17 10:16:40 -07:00
CTran
8e54ac20d8
Refactor model server hardware config + add unit tests to load/request to the server ( #189 )
...
* remove mode/hardware
* add test and pre commit hook
* add pytest dependieces
* fix format
* fix lint
* fix precommit
* fix pre commit
* fix pre commit
* fix precommit
* fix precommit
* fix precommit
* fix precommit
* fix precommit
* fix precommit
* fix precommit
* fix precommit
* fix precommit
* fix precommit
2024-10-16 16:58:10 -07:00
Adil Hafeez
3bd2ffe9fb
split wasm filter ( #186 )
...
* split wasm filter
* fix int and unit tests
* rename public_types => common and move common code there
* rename
* fix int test
2024-10-16 14:20:26 -07:00
Co Tran
b1746b38b4
concatenate history of user messages for hallucination ( #177 )
...
* concatenate history of user messages for hallucination
* add history of messages
* fix gpt to not arch
* add model prefix
* fix
* correct init of user_messages
* fmt
* fix test
2024-10-15 11:43:05 -07:00
Salman Paracha
35c5e303b7
Salmanap/fix discord invite ( #185 )
2024-10-13 21:01:36 -07:00
Salman Paracha
bfb549b075
ensuring that logo appears in dark mode ( #184 )
...
Co-authored-by: Salman Paracha <salmanparacha@MacBook-Pro-261.local>
2024-10-13 20:46:12 -07:00
Salman Paracha
5e22c24f77
minor changes to the README to highlight what Arch is designed to do in a single line ( #182 )
...
Co-authored-by: Salman Paracha <salmanparacha@MacBook-Pro-261.local>
2024-10-12 18:02:03 -07:00
Adil Hafeez
f72588c6da
fix lint
2024-10-10 23:34:21 -07:00
Shuguang Chen
11fba23f1f
Update doc ( #178 )
...
* Update doc
* Update links
2024-10-10 22:30:54 -07:00
Adil Hafeez
7b51cce2f7
fix demo broken links
2024-10-10 17:52:33 -07:00
Adil Hafeez
50bea9135c
update broken link
2024-10-10 17:50:57 -07:00
Adil Hafeez
7d5f760884
Improve cli ( #179 )
2024-10-10 17:44:41 -07:00
Aayush
ceca0dba28
fix prometheus target and update dashboard. ( #165 )
...
* fix prometheus target and update dashboard.
* Update envoy_overview.json with whitespace at the end
2024-10-10 14:57:20 -07:00
Co Tran
2c45de26e6
fix for linux ( #175 )
...
* fix for linux
* fix pre commit
* fix
* fix extra white space
* fix commit
2024-10-10 14:56:23 -07:00
Salman Paracha
639839fbb1
Create LICENSE
2024-10-10 06:30:23 -07:00
Adil Hafeez
7b05f304a1
Set python version 3.10
2024-10-09 23:23:42 -07:00
Adil Hafeez
d3ccddb72c
update llm router port value
2024-10-09 21:39:55 -07:00
Adil Hafeez
c0f0c22fb4
update access logs docs ( #170 )
2024-10-09 21:37:21 -07:00
Adil Hafeez
2b501d10bd
update lock file
2024-10-09 21:01:12 -07:00
Salman Paracha
95a0f1be5b
updated archgw cli to pull from archgw_modelserver from pypi ( #169 )
...
* updated archgw cli to pull from archgw_modelserver from pypi
* fix image name
* update rev
---------
Co-authored-by: Salman Paracha <salmanparacha@MacBook-Pro-261.local>
Co-authored-by: Adil Hafeez <adil@katanemo.com>
2024-10-09 21:00:26 -07:00
Adil Hafeez
6b70768170
make ratelimit section optional ( #168 )
2024-10-09 19:53:00 -07:00
Co Tran
f9e3a052fc
change nli model ( #167 )
...
* change nli model
* Fix bug in hallucination
---------
Co-authored-by: Shuguang Chen <54548843+nehcgs@users.noreply.github.com>
2024-10-09 19:10:08 -07:00
Shuguang Chen
3b7c58698f
Update model_server ( #164 )
...
* Update model server
* Delete model_server/.vscode/settings.json
* Update loader.py
* Fix errors
* Update log mode
2024-10-09 18:04:52 -07:00
Salman Paracha
b8d2756ff7
updated README.md
2024-10-09 17:46:55 -07:00
Adil Hafeez
71cdf69f77
dont send default target to archfc ( #166 )
2024-10-09 17:43:02 -07:00
Salman Paracha
dc3a9813c3
fixed function calling arch config yaml
2024-10-09 17:10:54 -07:00
Adil Hafeez
3e9327cf36
fix bug in jinja template for tracing
2024-10-09 16:44:50 -07:00
Salman Paracha
0ed88def8f
updated settuptools packages
2024-10-09 16:31:35 -07:00
Adil Hafeez
6991fbb7a7
rename
2024-10-09 16:24:40 -07:00
Adil Hafeez
c254dfb16a
update cli and update docs ( #161 )
...
* add services to cli
* more changes
2024-10-09 16:22:27 -07:00
Salman Paracha
1acf43ff7a
fixed cli to use poetry as well. this way we make it easy to have the… ( #160 )
2024-10-09 15:53:12 -07:00