Adil Hafeez
|
27c0f2fdce
|
Introduce brightstaff a new terminal service for llm routing (#477)
|
2025-05-19 09:59:22 -07:00 |
|
Adil Hafeez
|
de221525de
|
Use better logs (#452)
|
2025-03-27 10:40:20 -07:00 |
|
Adil Hafeez
|
eb48f3d5bb
|
use passed in model name in chat completion request (#445)
|
2025-03-21 15:56:17 -07:00 |
|
Adil Hafeez
|
84cd1df7bf
|
add preliminary support for llm agents (#432)
|
2025-03-19 15:21:34 -07:00 |
|
Adil Hafeez
|
ed3845040e
|
add demo for deepseek (#426)
|
2025-03-05 14:08:06 -08:00 |
|
Adil Hafeez
|
10cad4d0b7
|
add health check endpoint for llm gateway (#420)
* add health check endpoint for llm gateway
* fix rust tests
|
2025-03-03 13:11:57 -08:00 |
|
Adil Hafeez
|
39266b5084
|
log improvements and some code refactor (#379)
|
2025-01-31 10:37:53 -08:00 |
|
Adil Hafeez
|
07ef3149b8
|
add support for using custom upstream llm (#365)
|
2025-01-17 18:25:55 -08:00 |
|
Adil Hafeez
|
36489b4adc
|
use envoy to publish traces (#270)
|
2024-11-18 17:55:39 -08:00 |
|
Adil Hafeez
|
097513ee60
|
fix start time of llm filter (#278)
* fix start time of llm filter
* fix int tests
|
2024-11-17 17:01:19 -08:00 |
|
Adil Hafeez
|
d3c17c7abd
|
move custom tracer to llm filter (#267)
|
2024-11-15 10:44:01 -08:00 |
|
Aayush
|
1d229cba8f
|
Add in tpot (#269)
* add in tpot and tokens per second
* add in debug logs for new stats and update integration tests
* update shared dashboard to include new stats
|
2024-11-14 15:03:08 -08:00 |
|
Aayush
|
5993e36f22
|
Update arch stats (#250)
|
2024-11-12 15:03:26 -08:00 |
|
Adil Hafeez
|
d87105882b
|
update rust toolchain to 1.82 (#255)
* update rust to 1.82 pin it, also update envoy to 1.32 and python to 3.13
* use python:3.12
|
2024-11-12 10:35:14 -08:00 |
|
José Ulises Niño Rivera
|
662a840ac5
|
Add support for streaming and fixes few issues (see description) (#202)
|
2024-10-28 17:05:06 -07:00 |
|
José Ulises Niño Rivera
|
aa30353c85
|
Add cargo workspace to allow rust-analyzer to work correctly (#197)
Signed-off-by: José Ulises Niño Rivera <junr03@users.noreply.github.com>
|
2024-10-18 15:44:52 -04:00 |
|
Adil Hafeez
|
21e7fe2cef
|
Split arch wasm filter code into prompt and llm gateway filters (#190)
|
2024-10-17 10:16:40 -07:00 |
|
Adil Hafeez
|
3bd2ffe9fb
|
split wasm filter (#186)
* split wasm filter
* fix int and unit tests
* rename public_types => common and move common code there
* rename
* fix int test
|
2024-10-16 14:20:26 -07:00 |
|