Commit graph

9 commits

Author SHA1 Message Date
Adil Hafeez
d76ca01980 fix int tests 2024-11-12 11:31:58 -08:00
Adil Hafeez
5421953ea9 Merge branch 'main' into collect-stats-in-stream-context 2024-11-12 11:01:16 -08:00
Adil Hafeez
d87105882b update rust toolchain to 1.82 (#255)
* update rust to 1.82 pin it, also update envoy to 1.32 and python to 3.13

* use python:3.12
2024-11-12 10:35:14 -08:00
aayushwhiz
6fc32b0152 update weather_forecast demo to spin up grafana and prometheus when using monitoring profile
has full dashboard with total requests, time per output token, time to
first token, total latency, output sequence length, and input sequence
length.
2024-11-11 17:00:48 -08:00
aayushwhiz
f4e9624c03 update integration tests to expect new stats and new request for time 2024-11-08 18:09:37 -08:00
José Ulises Niño Rivera
662a840ac5 Add support for streaming and fixes few issues (see description) (#202) 2024-10-28 17:05:06 -07:00
José Ulises Niño Rivera
aa30353c85 Add cargo workspace to allow rust-analyzer to work correctly (#197)
Signed-off-by: José Ulises Niño Rivera <junr03@users.noreply.github.com>
2024-10-18 15:44:52 -04:00
Adil Hafeez
21e7fe2cef Split arch wasm filter code into prompt and llm gateway filters (#190) 2024-10-17 10:16:40 -07:00
Adil Hafeez
3bd2ffe9fb split wasm filter (#186)
* split wasm filter

* fix int and unit tests

* rename public_types => common and move common code there

* rename

* fix int test
2024-10-16 14:20:26 -07:00