Adil Hafeez
126b029345
release 0.3.18 ( #611 )
2025-10-31 12:24:49 -07:00
Branch Vincent
0a7e932837
support python 3.14 ( #605 )
...
* add python 3.14 to ci
* allow torch 2.9 for python 3.14
2025-10-30 09:17:31 -07:00
Adil Hafeez
f26bb05d35
release 0.3.17 ( #604 )
2025-10-24 17:52:15 -07:00
Branch Vincent
662546481a
move pytest to dev deps and migrate to poetry 2 ( #602 )
...
* move pytest to dev deps
* migrate to poetry 2 and standard metadata
2025-10-24 15:58:54 -07:00
Adil Hafeez
6d70545459
release 0.3.16 ( #596 )
2025-10-22 14:43:33 -07:00
Adil Hafeez
cd563c2706
release 0.3.15 ( #579 )
2025-09-30 13:44:11 -07:00
Adil Hafeez
7df1b8cdb0
release 0.3.14 ( #577 )
2025-09-29 23:11:43 -07:00
Adil Hafeez
7ce8d44d8e
release 0.3.13 ( #572 )
2025-09-19 11:26:49 -07:00
Adil Hafeez
118f60eea7
release 0.3.12 ( #567 )
2025-09-16 11:56:05 -07:00
Adil Hafeez
1e8c81d8f6
release 0.3.11 ( #565 )
2025-09-11 18:44:18 -07:00
Adil Hafeez
1fdde8181a
release 0.3.10 ( #555 )
2025-08-13 14:50:10 -07:00
Adil Hafeez
ad4cea227f
release 0.3.9 ( #552 )
2025-08-12 13:43:43 -07:00
Adil Hafeez
2639323dab
release 0.3.8 ( #550 )
2025-08-11 14:12:17 -07:00
Adil Hafeez
202409cc9a
update torch==2.6.0 ( #526 )
2025-08-11 13:23:40 -07:00
Adil Hafeez
ac3fb4cb5b
release 0.3.7 ( #542 )
2025-07-25 19:08:24 -07:00
Adil Hafeez
92a425facd
release 0.3.6 ( #536 )
2025-07-22 12:48:20 -07:00
Adil Hafeez
79a62fffe8
release 0.3.5 ( #534 )
2025-07-21 10:09:22 -07:00
Adil Hafeez
14f19f02a7
release 0.3.4 ( #525 )
...
* release 0.3.4
* update lock file
2025-07-11 17:24:21 -07:00
Adil Hafeez
5fb7ce576c
release 0.3.3 ( #519 )
2025-07-08 00:59:33 -07:00
Adil Hafeez
7baec20772
release 0.3.2 ( #507 )
2025-06-13 17:02:20 -07:00
Adil Hafeez
ed28bbaf04
release 0.3.1 ( #495 )
2025-05-30 17:47:59 -07:00
Adil Hafeez
dc271f1f76
release 0.3.0 ( #483 )
2025-05-23 09:52:23 -07:00
Adil Hafeez
9c803f4d69
release 0.2.8 ( #472 )
2025-04-21 17:02:36 -07:00
Adil Hafeez
00fb1be8a0
release 0.2.7 ( #469 )
2025-04-16 13:55:24 -07:00
Adil Hafeez
c7c0553427
release 0.2.6 ( #463 )
2025-04-15 14:50:09 -07:00
Adil Hafeez
4d2d8bd7a1
release 0.2.5 ( #457 )
2025-04-06 01:24:01 -07:00
Adil Hafeez
9f59943041
update code to use 0.2.4 release ( #446 )
...
* update code to use 0.2.4 release
* update lock file
2025-03-21 16:08:59 -07:00
Adil Hafeez
d8b833fe69
release 0.2.3 ( #423 )
2025-03-04 14:30:44 -08:00
Adil Hafeez
1bbc5d2233
release 0.2.2 ( #413 )
2025-02-14 20:02:59 -08:00
Adil Hafeez
0ea237fbac
release 0.2.1 ( #399 )
2025-02-07 19:21:20 -08:00
Adil Hafeez
7830f4b431
release 0.2.0 ( #384 )
...
* release 0.2.0
* update versions
2025-01-24 17:31:48 -08:00
Adil Hafeez
452084423c
add PR to release 0.1.9 ( #371 )
2025-01-17 18:47:26 -08:00
Shuguang Chen
88a02dc478
Some fixes on model server ( #362 )
...
* Some fixes on model server
* Remove prompt_prefilling message
* Fix logging
* Fix poetry issues
* Improve logging and update the support for text truncation
* Fix tests
* Fix tests
* Fix tests
* Fix modelserver tests
* Update modelserver tests
2025-01-10 16:45:36 -08:00
Shuguang Chen
ba7279becb
Use intent model from archfc to pick prompt gateway ( #328 )
2024-12-20 13:25:01 -08:00
Adil Hafeez
af0e7d178b
update cli to 0.1.6 ( #338 )
2024-12-06 15:48:07 -08:00
Adil Hafeez
a54db1a098
update getting started guide and add llm gateway and prompt gateway samples ( #330 )
2024-12-06 14:37:33 -08:00
Adil Hafeez
704b928d61
release 0.1.5 ( #307 )
2024-11-26 13:28:52 -08:00
Adil Hafeez
9c6fcdb771
use fix prompt guards ( #303 )
2024-11-25 17:16:35 -08:00
Adil Hafeez
9cee04ed31
release 0.1.3 ( #280 )
...
* release 0.1.3
* update ver
2024-11-17 17:12:01 -08:00
Adil Hafeez
d1dd8710a4
release 0.1.2 ( #266 )
2024-11-12 23:56:33 -08:00
Adil Hafeez
a72bb804eb
add support for jaeger tracing ( #229 )
2024-11-07 22:11:00 -06:00
Adil Hafeez
8c6ad87c1c
release 0.1.0 ( #239 )
...
* set version to 0.1.0
* update readme
2024-10-30 18:56:49 -07:00
Adil Hafeez
e462e393b1
Use large github action machine to run e2e tests ( #230 )
2024-10-30 17:54:51 -07:00
Salman Paracha
708fa15a9b
HR agent demo ( #206 )
...
* committing my hr_agent branch
* updating the HR agent config
* pushing to remote
* fix hr agent
* committing to merge with main
* updating to merge from main
* updating the demo and model-server-tests to pull from poetry
* updating the poetry.lock files
* updating based on feedback
* updated system prompt for hr_agent
---------
Co-authored-by: Salman Paracha <salmanparacha@MacBook-Pro-261.local>
Co-authored-by: Adil Hafeez <adil@katanemo.com>
2024-10-23 14:32:40 -07:00
CTran
8e54ac20d8
Refactor model server hardware config + add unit tests to load/request to the server ( #189 )
...
* remove mode/hardware
* add test and pre commit hook
* add pytest dependencies
* fix format
* fix lint
* fix precommit
* fix pre commit
* fix pre commit
* fix precommit
* fix precommit
* fix precommit
* fix precommit
* fix precommit
* fix precommit
* fix precommit
* fix precommit
* fix precommit
* fix precommit
2024-10-16 16:58:10 -07:00
Adil Hafeez
7d5f760884
Improve cli ( #179 )
2024-10-10 17:44:41 -07:00
Salman Paracha
95a0f1be5b
updated archgw cli to pull from archgw_modelserver from pypi ( #169 )
...
* updated archgw cli to pull from archgw_modelserver from pypi
* fix image name
* update rev
---------
Co-authored-by: Salman Paracha <salmanparacha@MacBook-Pro-261.local>
Co-authored-by: Adil Hafeez <adil@katanemo.com>
2024-10-09 21:00:26 -07:00
Shuguang Chen
3b7c58698f
Update model_server ( #164 )
...
* Update model server
* Delete model_server/.vscode/settings.json
* Update loader.py
* Fix errors
* Update log mode
2024-10-09 18:04:52 -07:00
Salman Paracha
1acf43ff7a
fixed cli to use poetry as well. this way we make it easy to have the… ( #160 )
2024-10-09 15:53:12 -07:00
Co Tran
e62c6e75ea
fix dependency + log info ( #148 )
2024-10-08 16:42:40 -07:00