cotran
286eb58c87
fix black
2024-10-23 14:39:17 -07:00
Salman Paracha
83c579ea82
updating the poetry.lock files
2024-10-22 19:44:30 -07:00
Salman Paracha
51259b393a
updating the demo and model-server-tests to pull from poetry
2024-10-22 18:34:25 -07:00
Salman Paracha
6a72cb45b7
merging from main
2024-10-22 17:03:10 -07:00
Salman Paracha
415453ed53
updating to merge from main
2024-10-22 17:02:30 -07:00
Salman Paracha
aa21f41d0b
committing to merge with main
2024-10-22 17:01:23 -07:00
CTran
8495f89fda
Cotran/hallucination ( #208 )
2024-10-22 12:52:01 -07:00
Adil Hafeez
ea76d85b43
Improve logging ( #209 )
...
* improve logging
* fix int tests
* better
* fix more logs
* fix more
* fix int
2024-10-22 12:07:40 -07:00
Adil Hafeez
2f374df034
refactor prompt gateway ( #204 )
2024-10-21 15:04:15 -07:00
Salman Paracha
d799fc98fd
merged from main
2024-10-19 13:57:48 -07:00
Adil Hafeez
dced8a5708
Add separate util for hallucination and add tests for it ( #203 )
2024-10-18 19:34:17 -07:00
Adil Hafeez
baa84eb035
fix hr agent
2024-10-18 19:19:44 -07:00
Adil Hafeez
b037c603cd
Merge branch 'main' into salmanap/hr_agent
2024-10-18 15:08:22 -07:00
Adil Hafeez
faf64960df
update observability and dashboards ( #198 )
2024-10-18 15:07:49 -07:00
Salman Paracha
8a233df119
pushing to remote
2024-10-18 15:07:28 -07:00
Adil Hafeez
f189d5703b
update .dockerignore file after filter move
2024-10-18 14:44:39 -07:00
Salman Paracha
5235f27d70
merging from main, again
2024-10-18 14:41:55 -07:00
Salman Paracha
dfaa93e2fa
updating the HR agent config
2024-10-18 14:41:04 -07:00
Salman Paracha
93c88d60c5
merging from main
2024-10-18 14:35:41 -07:00
Adil Hafeez
dd1c7be706
Pass tool call and app function response back in metadata ( #193 )
2024-10-18 13:25:39 -07:00
José Ulises Niño Rivera
62a000036e
Update arch Dockerfile ( #200 )
2024-10-18 16:15:19 -04:00
Adil Hafeez
1719b7d5f8
Send back developer error correctly ( #195 )
2024-10-18 13:14:18 -07:00
Adil Hafeez
28421353fd
Update vscode workspce ( #199 )
...
- add recommended extensions
- set python interpreter path for all python projects to be venv/bin/python
- update project structure in workspace
- rename project file from gatewa -> archgw
2024-10-18 12:57:58 -07:00
Adil Hafeez
c6ba28dfcc
Code refactor and some improvements - see description ( #194 )
2024-10-18 12:53:44 -07:00
José Ulises Niño Rivera
aa30353c85
Add cargo workspace to allow rust-analyzer to work correctly ( #197 )
...
Signed-off-by: José Ulises Niño Rivera <junr03@users.noreply.github.com>
2024-10-18 15:44:52 -04:00
Salman Paracha
6fb63510b3
fix cli models and logs ( #196 )
...
* removing unnecessar setup.py files
* updated the cli for debug and access logs
* ran the pre-commit locally to fix pull request
* fixed bug where if archgw_process is None we didn't handle it gracefully
* Apply suggestions from code review
Co-authored-by: Adil Hafeez <adil@katanemo.com>
* fixed changes based on PR
* fixed version not found message
* fixed message based on PR feedback
* adding poetry lock
* fixed pre-commit
---------
Co-authored-by: Salman Paracha <salmanparacha@MacBook-Pro-261.local>
Co-authored-by: Adil Hafeez <adil@katanemo.com>
2024-10-18 12:09:45 -07:00
Adil Hafeez
6cd05572c4
update lock file ( #192 )
...
```
Installing dependencies from lock file
pyproject.toml changed significantly since poetry.lock was last generated. Run `poetry lock [--no-update]` to fix the lock file.
Error installing model server dependencies: Command '['poetry', 'install', '--no-cache']' returned non-zero exit status 1.
```
2024-10-17 10:42:15 -07:00
Adil Hafeez
21e7fe2cef
Split arch wasm filter code into prompt and llm gateway filters ( #190 )
2024-10-17 10:16:40 -07:00
CTran
8e54ac20d8
Refactor model server hardware config + add unit tests to load/request to the server ( #189 )
...
* remove mode/hardware
* add test and pre commit hook
* add pytest dependieces
* fix format
* fix lint
* fix precommit
* fix pre commit
* fix pre commit
* fix precommit
* fix precommit
* fix precommit
* fix precommit
* fix precommit
* fix precommit
* fix precommit
* fix precommit
* fix precommit
* fix precommit
2024-10-16 16:58:10 -07:00
Salman Paracha
0e5130ec95
updated the hr_agent
2024-10-16 15:32:58 -07:00
Adil Hafeez
3bd2ffe9fb
split wasm filter ( #186 )
...
* split wasm filter
* fix int and unit tests
* rename public_types => common and move common code there
* rename
* fix int test
2024-10-16 14:20:26 -07:00
Co Tran
b1746b38b4
concatenate history of user messages for hallucination ( #177 )
...
* concatenate history of user messages for hallucination
* add history of messages
* fix gpt to not arch
* add model prefix
* fix
* correct init of user_messages
* fmt
* fix test
2024-10-15 11:43:05 -07:00
Salman Paracha
35c5e303b7
Salmanap/fix discord invite ( #185 )
2024-10-13 21:01:36 -07:00
Salman Paracha
bfb549b075
ensuring that logo appears in dark mode ( #184 )
...
Co-authored-by: Salman Paracha <salmanparacha@MacBook-Pro-261.local>
2024-10-13 20:46:12 -07:00
Salman Paracha
5e22c24f77
minor changes to the README to highlight what Arch is designed to do in a single line ( #182 )
...
Co-authored-by: Salman Paracha <salmanparacha@MacBook-Pro-261.local>
2024-10-12 18:02:03 -07:00
Salman Paracha
341344a583
commiting my hr_agent branch
2024-10-12 17:58:35 -07:00
Adil Hafeez
f72588c6da
fix lint
2024-10-10 23:34:21 -07:00
Shuguang Chen
11fba23f1f
Update doc ( #178 )
...
* Update doc
* Update links
2024-10-10 22:30:54 -07:00
Adil Hafeez
7b51cce2f7
fix demo broken links
2024-10-10 17:52:33 -07:00
Adil Hafeez
50bea9135c
update broken link
2024-10-10 17:50:57 -07:00
Adil Hafeez
7d5f760884
Improve cli ( #179 )
2024-10-10 17:44:41 -07:00
Aayush
ceca0dba28
fix prometheus target and update dashboard. ( #165 )
...
* fix prometheus target and update dashboard.
* Update envoy_overview.json with whitespace at the end
2024-10-10 14:57:20 -07:00
Co Tran
2c45de26e6
fix for linux ( #175 )
...
* fix for linux
* fix pre commit
* fix
* fix extra white space
* fix commit
2024-10-10 14:56:23 -07:00
Salman Paracha
639839fbb1
Create LICENSE
2024-10-10 06:30:23 -07:00
Adil Hafeez
7b05f304a1
Set python version 3.10
2024-10-09 23:23:42 -07:00
Adil Hafeez
d3ccddb72c
update llm router port value
2024-10-09 21:39:55 -07:00
Adil Hafeez
c0f0c22fb4
update access logs docs ( #170 )
2024-10-09 21:37:21 -07:00
Adil Hafeez
2b501d10bd
update lock file
2024-10-09 21:01:12 -07:00
Salman Paracha
95a0f1be5b
updated archgw cli to pull from archgw_modelserver from pypi ( #169 )
...
* updated archgw cli to pull from archgw_modelserver from pypi
* fix image name
* update rev
---------
Co-authored-by: Salman Paracha <salmanparacha@MacBook-Pro-261.local>
Co-authored-by: Adil Hafeez <adil@katanemo.com>
2024-10-09 21:00:26 -07:00
Adil Hafeez
6b70768170
make ratelimit section optional ( #168 )
2024-10-09 19:53:00 -07:00