Adil Hafeez
|
ed3845040e
|
add demo for deepseek (#426)
|
2025-03-05 14:08:06 -08:00 |
|
Adil Hafeez
|
10cad4d0b7
|
add health check endpoint for llm gateway (#420)
* add health check endpoint for llm gateway
* fix rust tests
|
2025-03-03 13:11:57 -08:00 |
|
Adil Hafeez
|
e40b13be05
|
Update arch_config and add tests for arch config file (#407)
|
2025-02-14 19:28:10 -08:00 |
|
Adil Hafeez
|
39266b5084
|
log improvements and some code refactor (#379)
|
2025-01-31 10:37:53 -08:00 |
|
Adil Hafeez
|
6887d52750
|
When using ollama token count was not coming in (#375)
When using ollama token count was not coming in resulting in token count and other metrics to show up as zero. This was not causing tracing to break.
|
2025-01-21 18:01:56 -08:00 |
|
Adil Hafeez
|
07ef3149b8
|
add support for using custom upstream llm (#365)
|
2025-01-17 18:25:55 -08:00 |
|
José Ulises Niño Rivera
|
cd1b561192
|
Break apart metrics into their own module (#335)
Signed-off-by: José Ulises Niño Rivera <junr03@users.noreply.github.com>
|
2024-12-09 10:46:46 -08:00 |
|
José Ulises Niño Rivera
|
d002b2042a
|
Break apart common_types mod (#334)
Signed-off-by: José Ulises Niño Rivera <junr03@users.noreply.github.com>
|
2024-12-06 17:25:42 -08:00 |
|
Adil Hafeez
|
36489b4adc
|
use envoy to publish traces (#270)
|
2024-11-18 17:55:39 -08:00 |
|
Adil Hafeez
|
097513ee60
|
fix start time of llm filter (#278)
* fix start time of llm filter
* fix int tests
|
2024-11-17 17:01:19 -08:00 |
|
Adil Hafeez
|
d3c17c7abd
|
move custom tracer to llm filter (#267)
|
2024-11-15 10:44:01 -08:00 |
|
Aayush
|
1d229cba8f
|
Add in tpot (#269)
* add in tpot and tokens per second
* add in debug logs for new stats and update integration tests
* update shared dashboard to include new stats
|
2024-11-14 15:03:08 -08:00 |
|
Aayush
|
5993e36f22
|
Update arch stats (#250)
|
2024-11-12 15:03:26 -08:00 |
|
Adil Hafeez
|
9081eb0f7f
|
obfuscate auth header (#254)
|
2024-11-08 15:17:39 -06:00 |
|
Adil Hafeez
|
a72bb804eb
|
add support for jaeger tracing (#229)
|
2024-11-07 22:11:00 -06:00 |
|
José Ulises Niño Rivera
|
662a840ac5
|
Add support for streaming and fixes few issues (see description) (#202)
|
2024-10-28 17:05:06 -07:00 |
|
Adil Hafeez
|
1719b7d5f8
|
Send back developer error correctly (#195)
|
2024-10-18 13:14:18 -07:00 |
|
Adil Hafeez
|
c6ba28dfcc
|
Code refactor and some improvements - see description (#194)
|
2024-10-18 12:53:44 -07:00 |
|
Adil Hafeez
|
21e7fe2cef
|
Split arch wasm filter code into prompt and llm gateway filters (#190)
|
2024-10-17 10:16:40 -07:00 |
|
Adil Hafeez
|
3bd2ffe9fb
|
split wasm filter (#186)
* split wasm filter
* fix int and unit tests
* rename public_types => common and move common code there
* rename
* fix int test
|
2024-10-16 14:20:26 -07:00 |
|