Adil Hafeez
69b43a7abf
add more files
2025-01-29 16:11:44 -08:00
Adil Hafeez
5d4cd624c1
more changes
2025-01-23 17:36:56 -08:00
Adil Hafeez
5ed7e0b185
Merge branch 'adil/log_improvements' into adil/add_acm_demo
2025-01-23 14:23:06 -08:00
Adil Hafeez
4ab7665c30
log improvements
2025-01-23 14:22:16 -08:00
Adil Hafeez
613192f71c
Merge branch 'main' into adil/add_acm_demo
2025-01-22 14:10:18 -08:00
Adil Hafeez
6887d52750
When using ollama token count was not coming in ( #375 )
...
When using ollama token count was not coming in resulting in token count and other metrics to show up as zero. This was not causing tracing to break.
2025-01-21 18:01:56 -08:00
Adil Hafeez
07ef3149b8
add support for using custom upstream llm ( #365 )
2025-01-17 18:25:55 -08:00
Adil Hafeez
5c066c9825
add more changes
2025-01-17 18:24:59 -08:00
Adil Hafeez
c532a5f4c7
Merge branch 'adil/fix_prompt_target_name' into adil/add_acm_demo
2025-01-17 17:50:40 -08:00
Adil Hafeez
1b3c1b8ba5
fix tests
2025-01-17 16:52:38 -08:00
Adil Hafeez
36c9c0d414
fix tests
2025-01-17 16:41:55 -08:00
Adil Hafeez
aca1631b49
add more changes
2025-01-17 16:33:17 -08:00
Adil Hafeez
a7b9458e5a
fix rust tests
2025-01-17 11:00:36 -08:00
Adil Hafeez
46cca42040
fix more
2025-01-16 18:31:40 -08:00
Adil Hafeez
00e4ba55a8
ensure that only openai is used for provider
2025-01-16 17:13:45 -08:00
Adil Hafeez
c7f8c2cef9
add demo for ollama
2025-01-16 16:34:17 -08:00
Adil Hafeez
3fc21de60c
Send per prompt target system prompt ( #368 )
...
* update prompt target name after arch_fc has identified tool
* add test for currency exchange
2025-01-16 15:11:37 -08:00
Adil Hafeez
5017e7931e
fix tracing
2025-01-16 14:37:09 -08:00
Adil Hafeez
1a10b82724
pending changes
2025-01-14 16:32:52 -08:00
Adil Hafeez
9570b167db
fix tracing
2025-01-14 11:56:10 -08:00
Adil Hafeez
516d9a7c4a
update prompt target name after arch_fc has identified tool
2025-01-10 15:28:33 -08:00
Adil Hafeez
42ab061971
pending
2025-01-10 12:52:29 -08:00
Adil Hafeez
e55127d325
remove extra http_method
2025-01-08 16:58:43 -08:00
Adil Hafeez
68097fde07
Merge branch 'main' into adil/add_acm_demo
2025-01-08 16:55:07 -08:00
Shuguang Chen
ba7279becb
Use intent model from archfc to pick prompt gateway ( #328 )
2024-12-20 13:25:01 -08:00
José Ulises Niño Rivera
cd1b561192
Break apart metrics into their own module ( #335 )
...
Signed-off-by: José Ulises Niño Rivera <junr03@users.noreply.github.com>
2024-12-09 10:46:46 -08:00
José Ulises Niño Rivera
d002b2042a
Break apart common_types mod ( #334 )
...
Signed-off-by: José Ulises Niño Rivera <junr03@users.noreply.github.com>
2024-12-06 17:25:42 -08:00
Adil Hafeez
285a66fdb6
Merge branch 'main' into adil/add_acm_demo
2024-12-06 16:17:26 -08:00
Adil Hafeez
a54db1a098
update getting started guide and add llm gateway and prompt gateway samples ( #330 )
2024-12-06 14:37:33 -08:00
Adil Hafeez
5e182b6c09
pending
2024-12-05 10:11:55 -08:00
Adil Hafeez
af02807004
remove debug statement
2024-12-03 19:35:37 -08:00
Adil Hafeez
4343387adc
Add demo for acm
2024-12-03 19:22:31 -08:00
José Ulises Niño Rivera
be8c3c9ea3
Remove blanket unused imports from the common crate ( #292 )
...
* Remove blanket unused imports from the common crate
Signed-off-by: José Ulises Niño Rivera <junr03@users.noreply.github.com>
* updatE
Signed-off-by: José Ulises Niño Rivera <junr03@users.noreply.github.com>
---------
Signed-off-by: José Ulises Niño Rivera <junr03@users.noreply.github.com>
2024-11-25 17:19:06 -08:00
Adil Hafeez
9c6fcdb771
use fix prompt guards ( #303 )
2024-11-25 17:16:35 -08:00
Adil Hafeez
36489b4adc
use envoy to publish traces ( #270 )
2024-11-18 17:55:39 -08:00
Adil Hafeez
097513ee60
fix start time of llm filter ( #278 )
...
* fix start time of llm filter
* fix int tests
2024-11-17 17:01:19 -08:00
Adil Hafeez
d3c17c7abd
move custom tracer to llm filter ( #267 )
2024-11-15 10:44:01 -08:00
Aayush
1d229cba8f
Add in tpot ( #269 )
...
* add in tpot and tokens per second
* add in debug logs for new stats and update integration tests
* update shared dashboard to include new stats
2024-11-14 15:03:08 -08:00
Aayush
5993e36f22
Update arch stats ( #250 )
2024-11-12 15:03:26 -08:00
Adil Hafeez
30647fd508
Add service to stream custom otel traces to otel-collector ( #262 )
2024-11-12 11:09:40 -08:00
Adil Hafeez
d87105882b
update rust toolchain to 1.82 ( #255 )
...
* update rust to 1.82 pin it, also update envoy to 1.32 and python to 3.13
* use python:3.12
2024-11-12 10:35:14 -08:00
Adil Hafeez
9081eb0f7f
obfuscate auth header ( #254 )
2024-11-08 15:17:39 -06:00
Adil Hafeez
a72bb804eb
add support for jaeger tracing ( #229 )
2024-11-07 22:11:00 -06:00
Ikko Eltociear Ashimine
f48489f7c0
chore: update stream_context.rs ( #248 )
...
initalize -> initialize
2024-11-05 10:18:33 -08:00
Adil Hafeez
9a6ae2efee
retry embeddings fetch ( #245 )
2024-11-05 10:04:36 -08:00
Adil Hafeez
e462e393b1
Use large github action machine to run e2e tests ( #230 )
2024-10-30 17:54:51 -07:00
Salman Paracha
bb882fb59b
Updated hr_agent to be full stack: gradio + fastAPI ( #235 )
...
* commiting to remove
* fix
* updating hr_agent
---------
Co-authored-by: Salman Paracha <salmanparacha@MacBook-Pro-261.local>
Co-authored-by: Adil Hafeez <adil@katanemo.com>
2024-10-30 15:05:34 -07:00
Adil Hafeez
60299244b9
Improve Gradio UI and fix arch_state bug ( #227 )
2024-10-29 11:27:13 -07:00
José Ulises Niño Rivera
662a840ac5
Add support for streaming and fixes few issues (see description) ( #202 )
2024-10-28 17:05:06 -07:00
Shuguang Chen
5f3aff4922
Update chatbot UI and update hallucination check ( #218 )
...
* update chatbot UI
* Update docker-compose for demos
* Fix bugs
* fix for emtadata (#219 )
* fix for emtadata
* fix
* revert
* merge main
---------
Co-authored-by: CTran <cotran2@utexas.edu>
2024-10-24 14:11:53 -07:00