Commit graph

619 commits

Author SHA1 Message Date
Adil Hafeez
3d3d015aea
publish docker image (#283) 2024-11-18 17:55:47 -08:00
Adil Hafeez
36489b4adc
use envoy to publish traces (#270) 2024-11-18 17:55:39 -08:00
Adil Hafeez
9cee04ed31
release 0.1.3 (#280)
* release 0.1.3

* udpate ver
2024-11-17 17:12:01 -08:00
Adil Hafeez
097513ee60
fix start time of llm filter (#278)
* fix start time of llm filter

* fix int tests
2024-11-17 17:01:19 -08:00
Salman Paracha
df0cd50cbd
updating website to track analytics (#273)
Co-authored-by: Salman Paracha <salmanparacha@MacBook-Pro-261.local>
2024-11-16 21:37:04 -08:00
Salman Paracha
8e9608995c
updated website with PH badge (#272) 2024-11-16 13:02:23 -08:00
Salman Paracha
a0d87d86c9
updating docs to reflect changes in 0.1.2 like tracing via signoz and… (#271) 2024-11-15 16:55:27 -08:00
Adil Hafeez
d3c17c7abd
move custom tracer to llm filter (#267) 2024-11-15 10:44:01 -08:00
Aayush
1d229cba8f
Add in tpot (#269)
* add in tpot and tokens per second

* add in debug logs for new stats and update integration tests

* update shared dashboard to include new stats
2024-11-14 15:03:08 -08:00
Salman Paracha
9eeb790c7f
updating README with PH launch results (#268) 2024-11-14 01:11:26 -08:00
Adil Hafeez
d1dd8710a4
release 0.1.2 (#266) 2024-11-12 23:56:33 -08:00
Adil Hafeez
31749bfc74
move grafana and prometheus to shared (#265) 2024-11-12 15:23:30 -08:00
Aayush
5993e36f22
Update arch stats (#250) 2024-11-12 15:03:26 -08:00
Adil Hafeez
30647fd508
Add service to stream custom otel traces to otel-collector (#262) 2024-11-12 11:09:40 -08:00
Adil Hafeez
d87105882b
update rust toolchain to 1.82 (#255)
* update rust to 1.82 pin it, also update envoy to 1.32 and python to 3.13

* use python:3.12
2024-11-12 10:35:14 -08:00
Salman Paracha
4b2b371876
removing depdency on mistral keys (#256) 2024-11-08 16:09:04 -08:00
Adil Hafeez
9081eb0f7f
obfuscate auth header (#254) 2024-11-08 15:17:39 -06:00
Adil Hafeez
88d0f99866
add requirements to readme (#249) 2024-11-08 10:43:18 -08:00
Adil Hafeez
6b62662e01
update docs with weather_forecast path (#253) 2024-11-08 10:00:15 -08:00
Adil Hafeez
a72bb804eb
add support for jaeger tracing (#229) 2024-11-07 22:11:00 -06:00
CTran
fb67788be0
add prefill and test (#236)
* add prefill and test

* fix stream

* fix

* feedback

* address comments

* update

* add e2e test

* fix e2e test

* update fix

* fix

* address cmt

* address cmt
2024-11-07 11:59:29 -08:00
Ikko Eltociear Ashimine
f48489f7c0
chore: update stream_context.rs (#248)
initalize -> initialize
2024-11-05 10:18:33 -08:00
Adil Hafeez
9a6ae2efee
retry embeddings fetch (#245) 2024-11-05 10:04:36 -08:00
Adil Hafeez
9a5c5cc3a3
add http files for llm and prompt gateway for local testing (#244) 2024-11-04 15:53:15 -08:00
Salman Paracha
e4d5293af4
updating README logo (#242)
Co-authored-by: Salman Paracha <salmanparacha@MacBook-Pro-261.local>
2024-10-31 21:58:08 -07:00
Salman Paracha
21f4e7a5e4
fixing ports in arch_config for demos (#241)
Co-authored-by: Salman Paracha <salmanparacha@MacBook-Pro-261.local>
2024-10-31 14:37:04 -07:00
Salman Paracha
3bff4a597b
fix ports and update README for paths to agent/chat (#240)
* fix ports and update README for paths to agent/chat

* minor fix

---------

Co-authored-by: Salman Paracha <salmanparacha@MacBook-Pro-261.local>
2024-10-31 09:25:24 -07:00
Adil Hafeez
5196fb28a9
update build status badges 2024-10-30 23:27:01 -07:00
Adil Hafeez
9a30afa7e1
update status badges 2024-10-30 23:24:36 -07:00
Adil Hafeez
ad70e540b6
add status badges 2024-10-30 23:24:00 -07:00
Adil Hafeez
8c6ad87c1c
release 0.1.0 (#239)
* set version to 0.1.0

* update readme
2024-10-30 18:56:49 -07:00
Salman Paracha
dab7a44053
several fixes to demos (#238)
Co-authored-by: Salman Paracha <salmanparacha@MacBook-Pro-261.local>
2024-10-30 18:38:18 -07:00
Adil Hafeez
e462e393b1
Use large github action machine to run e2e tests (#230) 2024-10-30 17:54:51 -07:00
Salman Paracha
bb882fb59b
Updated hr_agent to be full stack: gradio + fastAPI (#235)
* commiting to remove

* fix

* updating hr_agent

---------

Co-authored-by: Salman Paracha <salmanparacha@MacBook-Pro-261.local>
Co-authored-by: Adil Hafeez <adil@katanemo.com>
2024-10-30 15:05:34 -07:00
Salman Paracha
bb9a774a72
moving chatbot-ui in demos and out of root project structure (#228)
Co-authored-by: Salman Paracha <salmanparacha@MacBook-Pro-261.local>
2024-10-29 12:05:29 -07:00
Adil Hafeez
60299244b9
Improve Gradio UI and fix arch_state bug (#227) 2024-10-29 11:27:13 -07:00
José Ulises Niño Rivera
662a840ac5
Add support for streaming and fixes few issues (see description) (#202) 2024-10-28 17:05:06 -07:00
Salman Paracha
29ff8da60f
fixed typos in intro to arch docs (#225) 2024-10-26 10:41:01 -07:00
CTran
25dddcbfd9
fix model server stop process (#217)
* fix model server stop process

* replace

* replace

* add test

* add multiple pids test

* add check install for linux

* reformat
2024-10-24 19:21:47 -07:00
Salman Paracha
ff6e9bd9bd
add README for hr_agent (#224)
* add README for hr_agent

* fixed sample prompt for hr_agent in README

* added screenshot and updated README.md

---------

Co-authored-by: Salman Paracha <salmanparacha@MacBook-Pro-261.local>
2024-10-24 18:21:52 -07:00
Salman Paracha
f88740582f
fixed typos in arch_config.yaml file based on issue #221 (#223)
Co-authored-by: Salman Paracha <salmanparacha@MacBook-Pro-261.local>
2024-10-24 14:57:29 -07:00
Shuguang Chen
5f3aff4922
Update chatbot UI and update hallucination check (#218)
* update chatbot UI

* Update docker-compose for demos

* Fix bugs

* fix for emtadata (#219)

* fix for emtadata

* fix

* revert

* merge main

---------

Co-authored-by: CTran <cotran2@utexas.edu>
2024-10-24 14:11:53 -07:00
Azib Farooq
05f0491f76
updated key name (#211) 2024-10-23 21:02:24 -07:00
Ikko Eltociear Ashimine
87ce0b1be0
docs: update README.md (#220)
vist -> visit
2024-10-23 20:37:26 -07:00
Salman Paracha
7a5852b401
fixing discord link and moving contributing guide to root (#215)
Co-authored-by: Salman Paracha <salmanparacha@MacBook-Pro-261.local>
2024-10-23 15:45:49 -07:00
Salman Paracha
708fa15a9b
HR agent demo (#206)
* commiting my hr_agent branch

* updating the HR agent config

* pushing to remote

* fix hr agent

* committing to merge with main

* updating to merge from main

* updating the demo and model-server-tests to pull from poetry

* updating the poetry.lock files

* updating based on feedback

* updated sysmte prompt for hr_agent

---------

Co-authored-by: Salman Paracha <salmanparacha@MacBook-Pro-261.local>
Co-authored-by: Adil Hafeez <adil@katanemo.com>
2024-10-23 14:32:40 -07:00
CTran
8495f89fda
Cotran/hallucination (#208) 2024-10-22 12:52:01 -07:00
Adil Hafeez
ea76d85b43
Improve logging (#209)
* improve logging

* fix int tests

* better

* fix more logs

* fix more

* fix int
2024-10-22 12:07:40 -07:00
Adil Hafeez
2f374df034
refactor prompt gateway (#204) 2024-10-21 15:04:15 -07:00
Adil Hafeez
dced8a5708
Add separate util for hallucination and add tests for it (#203) 2024-10-18 19:34:17 -07:00