Aayush
|
1d229cba8f
|
Add in tpot (#269)
* add in tpot and tokens per second
* add in debug logs for new stats and update integration tests
* update shared dashboard to include new stats
|
2024-11-14 15:03:08 -08:00 |
|
Aayush
|
5993e36f22
|
Update arch stats (#250)
|
2024-11-12 15:03:26 -08:00 |
|
Adil Hafeez
|
30647fd508
|
Add service to stream custom otel traces to otel-collector (#262)
|
2024-11-12 11:09:40 -08:00 |
|
Adil Hafeez
|
d87105882b
|
update rust toolchain to 1.82 (#255)
* update rust to 1.82 pin it, also update envoy to 1.32 and python to 3.13
* use python:3.12
|
2024-11-12 10:35:14 -08:00 |
|
Adil Hafeez
|
9081eb0f7f
|
obfuscate auth header (#254)
|
2024-11-08 15:17:39 -06:00 |
|
Adil Hafeez
|
a72bb804eb
|
add support for jaeger tracing (#229)
|
2024-11-07 22:11:00 -06:00 |
|
Ikko Eltociear Ashimine
|
f48489f7c0
|
chore: update stream_context.rs (#248)
initalize -> initialize
|
2024-11-05 10:18:33 -08:00 |
|
Adil Hafeez
|
9a6ae2efee
|
retry embeddings fetch (#245)
|
2024-11-05 10:04:36 -08:00 |
|
Adil Hafeez
|
e462e393b1
|
Use large github action machine to run e2e tests (#230)
|
2024-10-30 17:54:51 -07:00 |
|
Salman Paracha
|
bb882fb59b
|
Updated hr_agent to be full stack: gradio + fastAPI (#235)
* commiting to remove
* fix
* updating hr_agent
---------
Co-authored-by: Salman Paracha <salmanparacha@MacBook-Pro-261.local>
Co-authored-by: Adil Hafeez <adil@katanemo.com>
|
2024-10-30 15:05:34 -07:00 |
|
Adil Hafeez
|
60299244b9
|
Improve Gradio UI and fix arch_state bug (#227)
|
2024-10-29 11:27:13 -07:00 |
|
José Ulises Niño Rivera
|
662a840ac5
|
Add support for streaming and fixes few issues (see description) (#202)
|
2024-10-28 17:05:06 -07:00 |
|
Shuguang Chen
|
5f3aff4922
|
Update chatbot UI and update hallucination check (#218)
* update chatbot UI
* Update docker-compose for demos
* Fix bugs
* fix for emtadata (#219)
* fix for emtadata
* fix
* revert
* merge main
---------
Co-authored-by: CTran <cotran2@utexas.edu>
|
2024-10-24 14:11:53 -07:00 |
|
Azib Farooq
|
05f0491f76
|
updated key name (#211)
|
2024-10-23 21:02:24 -07:00 |
|
CTran
|
8495f89fda
|
Cotran/hallucination (#208)
|
2024-10-22 12:52:01 -07:00 |
|
Adil Hafeez
|
ea76d85b43
|
Improve logging (#209)
* improve logging
* fix int tests
* better
* fix more logs
* fix more
* fix int
|
2024-10-22 12:07:40 -07:00 |
|
Adil Hafeez
|
2f374df034
|
refactor prompt gateway (#204)
|
2024-10-21 15:04:15 -07:00 |
|
Adil Hafeez
|
dced8a5708
|
Add separate util for hallucination and add tests for it (#203)
|
2024-10-18 19:34:17 -07:00 |
|
Adil Hafeez
|
faf64960df
|
update observability and dashboards (#198)
|
2024-10-18 15:07:49 -07:00 |
|
Adil Hafeez
|
dd1c7be706
|
Pass tool call and app function response back in metadata (#193)
|
2024-10-18 13:25:39 -07:00 |
|
Adil Hafeez
|
1719b7d5f8
|
Send back developer error correctly (#195)
|
2024-10-18 13:14:18 -07:00 |
|
Adil Hafeez
|
c6ba28dfcc
|
Code refactor and some improvements - see description (#194)
|
2024-10-18 12:53:44 -07:00 |
|
José Ulises Niño Rivera
|
aa30353c85
|
Add cargo workspace to allow rust-analyzer to work correctly (#197)
Signed-off-by: José Ulises Niño Rivera <junr03@users.noreply.github.com>
|
2024-10-18 15:44:52 -04:00 |
|
Adil Hafeez
|
21e7fe2cef
|
Split arch wasm filter code into prompt and llm gateway filters (#190)
|
2024-10-17 10:16:40 -07:00 |
|
Adil Hafeez
|
3bd2ffe9fb
|
split wasm filter (#186)
* split wasm filter
* fix int and unit tests
* rename public_types => common and move common code there
* rename
* fix int test
|
2024-10-16 14:20:26 -07:00 |
|