Adil Hafeez
|
ca41f92d67
|
When using ollama token count was not coming in
When using ollama, the token count was not coming in, which caused the token count and other metrics to show up as zero. This did not cause tracing to break.
|
2025-01-21 17:13:23 -08:00 |
|
Adil Hafeez
|
07ef3149b8
|
add support for using custom upstream llm (#365)
|
2025-01-17 18:25:55 -08:00 |
|
José Ulises Niño Rivera
|
cd1b561192
|
Break apart metrics into their own module (#335)
Signed-off-by: José Ulises Niño Rivera <junr03@users.noreply.github.com>
|
2024-12-09 10:46:46 -08:00 |
|
José Ulises Niño Rivera
|
d002b2042a
|
Break apart common_types mod (#334)
Signed-off-by: José Ulises Niño Rivera <junr03@users.noreply.github.com>
|
2024-12-06 17:25:42 -08:00 |
|
Adil Hafeez
|
36489b4adc
|
use envoy to publish traces (#270)
|
2024-11-18 17:55:39 -08:00 |
|
Adil Hafeez
|
097513ee60
|
fix start time of llm filter (#278)
* fix start time of llm filter
* fix int tests
|
2024-11-17 17:01:19 -08:00 |
|
Adil Hafeez
|
d3c17c7abd
|
move custom tracer to llm filter (#267)
|
2024-11-15 10:44:01 -08:00 |
|
Aayush
|
1d229cba8f
|
Add in tpot (#269)
* add in tpot and tokens per second
* add in debug logs for new stats and update integration tests
* update shared dashboard to include new stats
|
2024-11-14 15:03:08 -08:00 |
|
Aayush
|
5993e36f22
|
Update arch stats (#250)
|
2024-11-12 15:03:26 -08:00 |
|
Adil Hafeez
|
d87105882b
|
update rust toolchain to 1.82 (#255)
* update rust to 1.82 and pin it; also update envoy to 1.32 and python to 3.13
* use python:3.12
|
2024-11-12 10:35:14 -08:00 |
|
Adil Hafeez
|
9081eb0f7f
|
obfuscate auth header (#254)
|
2024-11-08 15:17:39 -06:00 |
|
Adil Hafeez
|
a72bb804eb
|
add support for jaeger tracing (#229)
|
2024-11-07 22:11:00 -06:00 |
|
José Ulises Niño Rivera
|
662a840ac5
|
Add support for streaming and fixes few issues (see description) (#202)
|
2024-10-28 17:05:06 -07:00 |
|
Adil Hafeez
|
1719b7d5f8
|
Send back developer error correctly (#195)
|
2024-10-18 13:14:18 -07:00 |
|
Adil Hafeez
|
c6ba28dfcc
|
Code refactor and some improvements - see description (#194)
|
2024-10-18 12:53:44 -07:00 |
|
José Ulises Niño Rivera
|
aa30353c85
|
Add cargo workspace to allow rust-analyzer to work correctly (#197)
Signed-off-by: José Ulises Niño Rivera <junr03@users.noreply.github.com>
|
2024-10-18 15:44:52 -04:00 |
|
Adil Hafeez
|
21e7fe2cef
|
Split arch wasm filter code into prompt and llm gateway filters (#190)
|
2024-10-17 10:16:40 -07:00 |
|
Adil Hafeez
|
3bd2ffe9fb
|
split wasm filter (#186)
* split wasm filter
* fix int and unit tests
* rename public_types => common and move common code there
* rename
* fix int test
|
2024-10-16 14:20:26 -07:00 |
|