Adil Hafeez
202409cc9a
update torch==2.6.0 ( #526 )
2025-08-11 13:23:40 -07:00
Adil Hafeez
ac3fb4cb5b
release 0.3.7 ( #542 )
2025-07-25 19:08:24 -07:00
Adil Hafeez
92a425facd
release 0.3.6 ( #536 )
2025-07-22 12:48:20 -07:00
Adil Hafeez
79a62fffe8
release 0.3.5 ( #534 )
2025-07-21 10:09:22 -07:00
Adil Hafeez
14f19f02a7
release 0.3.4 ( #525 )
...
* release 0.3.4
* update lock file
2025-07-11 17:24:21 -07:00
Adil Hafeez
5fb7ce576c
release 0.3.3 ( #519 )
2025-07-08 00:59:33 -07:00
Adil Hafeez
7baec20772
release 0.3.2 ( #507 )
2025-06-13 17:02:20 -07:00
Adil Hafeez
ed28bbaf04
release 0.3.1 ( #495 )
2025-05-30 17:47:59 -07:00
Adil Hafeez
dc271f1f76
release 0.3.0 ( #483 )
2025-05-23 09:52:23 -07:00
Adil Hafeez
9c803f4d69
release 0.2.8 ( #472 )
2025-04-21 17:02:36 -07:00
Adil Hafeez
00fb1be8a0
release 0.2.7 ( #469 )
2025-04-16 13:55:24 -07:00
Adil Hafeez
c7c0553427
release 0.2.6 ( #463 )
2025-04-15 14:50:09 -07:00
Adil Hafeez
4d2d8bd7a1
release 0.2.5 ( #457 )
2025-04-06 01:24:01 -07:00
Adil Hafeez
9f59943041
update code to use 0.2.4 release ( #446 )
...
* update code to use 0.2.4 release
* update lock file
2025-03-21 16:08:59 -07:00
Adil Hafeez
d8b833fe69
release 0.2.3 ( #423 )
2025-03-04 14:30:44 -08:00
Adil Hafeez
1bbc5d2233
release 0.2.2 ( #413 )
2025-02-14 20:02:59 -08:00
Adil Hafeez
0ea237fbac
release 0.2.1 ( #399 )
2025-02-07 19:21:20 -08:00
Adil Hafeez
7830f4b431
release 0.2.0 ( #384 )
...
* release 0.2.0
* update versions
2025-01-24 17:31:48 -08:00
Adil Hafeez
452084423c
add PR to release 0.1.9 ( #371 )
2025-01-17 18:47:26 -08:00
Shuguang Chen
88a02dc478
Some fixes on model server ( #362 )
...
* Some fixes on model server
* Remove prompt_prefilling message
* Fix logging
* Fix poetry issues
* Improve logging and update the support for text truncation
* Fix tests
* Fix tests
* Fix tests
* Fix modelserver tests
* Update modelserver tests
2025-01-10 16:45:36 -08:00
Shuguang Chen
ba7279becb
Use intent model from archfc to pick prompt gateway ( #328 )
2024-12-20 13:25:01 -08:00
Adil Hafeez
af0e7d178b
update cli to 0.1.6 ( #338 )
2024-12-06 15:48:07 -08:00
Adil Hafeez
a54db1a098
update getting started guide and add llm gateway and prompt gateway samples ( #330 )
2024-12-06 14:37:33 -08:00
Adil Hafeez
704b928d61
release 0.1.5 ( #307 )
2024-11-26 13:28:52 -08:00
Adil Hafeez
9c6fcdb771
use fix prompt guards ( #303 )
2024-11-25 17:16:35 -08:00
Adil Hafeez
9cee04ed31
release 0.1.3 ( #280 )
...
* release 0.1.3
* udpate ver
2024-11-17 17:12:01 -08:00
Adil Hafeez
d1dd8710a4
release 0.1.2 ( #266 )
2024-11-12 23:56:33 -08:00
Adil Hafeez
a72bb804eb
add support for jaeger tracing ( #229 )
2024-11-07 22:11:00 -06:00
Adil Hafeez
8c6ad87c1c
release 0.1.0 ( #239 )
...
* set version to 0.1.0
* update readme
2024-10-30 18:56:49 -07:00
Adil Hafeez
e462e393b1
Use large github action machine to run e2e tests ( #230 )
2024-10-30 17:54:51 -07:00
Salman Paracha
708fa15a9b
HR agent demo ( #206 )
...
* commiting my hr_agent branch
* updating the HR agent config
* pushing to remote
* fix hr agent
* committing to merge with main
* updating to merge from main
* updating the demo and model-server-tests to pull from poetry
* updating the poetry.lock files
* updating based on feedback
* updated sysmte prompt for hr_agent
---------
Co-authored-by: Salman Paracha <salmanparacha@MacBook-Pro-261.local>
Co-authored-by: Adil Hafeez <adil@katanemo.com>
2024-10-23 14:32:40 -07:00
CTran
8e54ac20d8
Refactor model server hardware config + add unit tests to load/request to the server ( #189 )
...
* remove mode/hardware
* add test and pre commit hook
* add pytest dependieces
* fix format
* fix lint
* fix precommit
* fix pre commit
* fix pre commit
* fix precommit
* fix precommit
* fix precommit
* fix precommit
* fix precommit
* fix precommit
* fix precommit
* fix precommit
* fix precommit
* fix precommit
2024-10-16 16:58:10 -07:00
Adil Hafeez
7d5f760884
Improve cli ( #179 )
2024-10-10 17:44:41 -07:00
Salman Paracha
95a0f1be5b
updated archgw cli to pull from archgw_modelserver from pypi ( #169 )
...
* updated archgw cli to pull from archgw_modelserver from pypi
* fix image name
* update rev
---------
Co-authored-by: Salman Paracha <salmanparacha@MacBook-Pro-261.local>
Co-authored-by: Adil Hafeez <adil@katanemo.com>
2024-10-09 21:00:26 -07:00
Shuguang Chen
3b7c58698f
Update model_server ( #164 )
...
* Update model server
* Delete model_server/.vscode/settings.json
* Update loader.py
* Fix errors
* Update log mode
2024-10-09 18:04:52 -07:00
Salman Paracha
1acf43ff7a
fixed cli to use poetry as well. this way we make it easy to have the… ( #160 )
2024-10-09 15:53:12 -07:00
Co Tran
e62c6e75ea
fix dependcy + logg info ( #148 )
2024-10-08 16:42:40 -07:00
Salman Paracha
b60ceb9168
model server build ( #127 )
...
* first commit to have model_server not be dependent on Docker
* making changes to fix the docker-compose file for archgw to set DNS_V4 and minor fixes with the build
* additional fixes for model server to be separated out in the build
* additional fixes for model server to be separated out in the build
* fix to get model_server to be built as a separate python process. TODO: fix the embeddings logs after cli completes
* fixing init to pull tempfile using the tempfile python package
---------
Co-authored-by: Salman Paracha <salmanparacha@MacBook-Pro-261.local>
2024-10-06 18:21:43 -07:00