Adil Hafeez
|
d76ca01980
|
fix int tests
|
2024-11-12 11:31:58 -08:00 |
|
Adil Hafeez
|
5421953ea9
|
Merge branch 'main' into collect-stats-in-stream-context
|
2024-11-12 11:01:16 -08:00 |
|
Adil Hafeez
|
d87105882b
|
update rust toolchain to 1.82 (#255)
* update rust to 1.82 pin it, also update envoy to 1.32 and python to 3.13
* use python:3.12
|
2024-11-12 10:35:14 -08:00 |
|
aayushwhiz
|
6fc32b0152
|
update weather_forecast demo to spin up grafana and prometheus when using monitoring profile
has full dashboard with total requests, time per output token, time to
first token, total latency, output sequence length, and input sequence
length.
|
2024-11-11 17:00:48 -08:00 |
|
aayushwhiz
|
f4e9624c03
|
update integration tests to expect new stats and new request for time
|
2024-11-08 18:09:37 -08:00 |
|
José Ulises Niño Rivera
|
662a840ac5
|
Add support for streaming and fixes few issues (see description) (#202)
|
2024-10-28 17:05:06 -07:00 |
|
José Ulises Niño Rivera
|
aa30353c85
|
Add cargo workspace to allow rust-analyzer to work correctly (#197)
Signed-off-by: José Ulises Niño Rivera <junr03@users.noreply.github.com>
|
2024-10-18 15:44:52 -04:00 |
|
Adil Hafeez
|
21e7fe2cef
|
Split arch wasm filter code into prompt and llm gateway filters (#190)
|
2024-10-17 10:16:40 -07:00 |
|
Adil Hafeez
|
3bd2ffe9fb
|
split wasm filter (#186)
* split wasm filter
* fix int and unit tests
* rename public_types => common and move common code there
* rename
* fix int test
|
2024-10-16 14:20:26 -07:00 |
|