Adil Hafeez
adec02e742
add note about hosted arch-fc ( #308 )
2024-11-26 14:19:10 -08:00
Adil Hafeez
704b928d61
release 0.1.5 ( #307 )
2024-11-26 13:28:52 -08:00
Adil Hafeez
0ff3d43008
remove dependency on docker-compose when starting up archgw ( #305 )
2024-11-26 13:13:02 -08:00
Adil Hafeez
726f1a3185
add schema change to use enum in arch_config ( #304 )
2024-11-25 17:51:25 -08:00
José Ulises Niño Rivera
be8c3c9ea3
Remove blanket unused imports from the common crate ( #292 )
...
* Remove blanket unused imports from the common crate
Signed-off-by: José Ulises Niño Rivera <junr03@users.noreply.github.com>
* updatE
Signed-off-by: José Ulises Niño Rivera <junr03@users.noreply.github.com>
---------
Signed-off-by: José Ulises Niño Rivera <junr03@users.noreply.github.com>
2024-11-25 17:19:06 -08:00
Adil Hafeez
9c6fcdb771
use fix prompt guards ( #303 )
2024-11-25 17:16:35 -08:00
Adil Hafeez
6f4a57b56d
update readme with python version ( #302 )
2024-11-25 16:01:40 -08:00
Salman Paracha
970db68575
updating readme and docs with note about Arch-Function ( #285 )
...
* updating readme and docs with note about Arch-Function
* minor fixes to README
* a few more minor updates to the README
---------
Co-authored-by: Salman Paracha <salmanparacha@MacBook-Pro-261.local>
2024-11-19 08:43:56 -08:00
Adil Hafeez
33ab24292c
publish docker image on release only ( #284 )
2024-11-18 18:18:46 -08:00
Adil Hafeez
3d3d015aea
publish docker image ( #283 )
2024-11-18 17:55:47 -08:00
Adil Hafeez
36489b4adc
use envoy to publish traces ( #270 )
2024-11-18 17:55:39 -08:00
Adil Hafeez
9cee04ed31
release 0.1.3 ( #280 )
...
* release 0.1.3
* udpate ver
2024-11-17 17:12:01 -08:00
Adil Hafeez
097513ee60
fix start time of llm filter ( #278 )
...
* fix start time of llm filter
* fix int tests
2024-11-17 17:01:19 -08:00
Salman Paracha
df0cd50cbd
updating website to track analytics ( #273 )
...
Co-authored-by: Salman Paracha <salmanparacha@MacBook-Pro-261.local>
2024-11-16 21:37:04 -08:00
Salman Paracha
8e9608995c
updated website with PH badge ( #272 )
2024-11-16 13:02:23 -08:00
Salman Paracha
a0d87d86c9
updating docs to reflect changes in 0.1.2 like tracing via signoz and… ( #271 )
2024-11-15 16:55:27 -08:00
Adil Hafeez
d3c17c7abd
move custom tracer to llm filter ( #267 )
2024-11-15 10:44:01 -08:00
Aayush
1d229cba8f
Add in tpot ( #269 )
...
* add in tpot and tokens per second
* add in debug logs for new stats and update integration tests
* update shared dashboard to include new stats
2024-11-14 15:03:08 -08:00
Salman Paracha
9eeb790c7f
updating README with PH launch results ( #268 )
2024-11-14 01:11:26 -08:00
Adil Hafeez
d1dd8710a4
release 0.1.2 ( #266 )
2024-11-12 23:56:33 -08:00
Adil Hafeez
31749bfc74
move grafana and prometheus to shared ( #265 )
2024-11-12 15:23:30 -08:00
Aayush
5993e36f22
Update arch stats ( #250 )
2024-11-12 15:03:26 -08:00
Adil Hafeez
30647fd508
Add service to stream custom otel traces to otel-collector ( #262 )
2024-11-12 11:09:40 -08:00
Adil Hafeez
d87105882b
update rust toolchain to 1.82 ( #255 )
...
* update rust to 1.82 pin it, also update envoy to 1.32 and python to 3.13
* use python:3.12
2024-11-12 10:35:14 -08:00
Salman Paracha
4b2b371876
removing depdency on mistral keys ( #256 )
2024-11-08 16:09:04 -08:00
Adil Hafeez
9081eb0f7f
obfuscate auth header ( #254 )
2024-11-08 15:17:39 -06:00
Adil Hafeez
88d0f99866
add requirements to readme ( #249 )
2024-11-08 10:43:18 -08:00
Adil Hafeez
6b62662e01
update docs with weather_forecast path ( #253 )
2024-11-08 10:00:15 -08:00
Adil Hafeez
a72bb804eb
add support for jaeger tracing ( #229 )
2024-11-07 22:11:00 -06:00
CTran
fb67788be0
add prefill and test ( #236 )
...
* add prefill and test
* fix stream
* fix
* feedback
* address comments
* update
* add e2e test
* fix e2e test
* update fix
* fix
* address cmt
* address cmt
2024-11-07 11:59:29 -08:00
Ikko Eltociear Ashimine
f48489f7c0
chore: update stream_context.rs ( #248 )
...
initalize -> initialize
2024-11-05 10:18:33 -08:00
Adil Hafeez
9a6ae2efee
retry embeddings fetch ( #245 )
2024-11-05 10:04:36 -08:00
Adil Hafeez
9a5c5cc3a3
add http files for llm and prompt gateway for local testing ( #244 )
2024-11-04 15:53:15 -08:00
Salman Paracha
e4d5293af4
updating README logo ( #242 )
...
Co-authored-by: Salman Paracha <salmanparacha@MacBook-Pro-261.local>
2024-10-31 21:58:08 -07:00
Salman Paracha
21f4e7a5e4
fixing ports in arch_config for demos ( #241 )
...
Co-authored-by: Salman Paracha <salmanparacha@MacBook-Pro-261.local>
2024-10-31 14:37:04 -07:00
Salman Paracha
3bff4a597b
fix ports and update README for paths to agent/chat ( #240 )
...
* fix ports and update README for paths to agent/chat
* minor fix
---------
Co-authored-by: Salman Paracha <salmanparacha@MacBook-Pro-261.local>
2024-10-31 09:25:24 -07:00
Adil Hafeez
5196fb28a9
update build status badges
2024-10-30 23:27:01 -07:00
Adil Hafeez
9a30afa7e1
update status badges
2024-10-30 23:24:36 -07:00
Adil Hafeez
ad70e540b6
add status badges
2024-10-30 23:24:00 -07:00
Adil Hafeez
8c6ad87c1c
release 0.1.0 ( #239 )
...
* set version to 0.1.0
* update readme
2024-10-30 18:56:49 -07:00
Salman Paracha
dab7a44053
several fixes to demos ( #238 )
...
Co-authored-by: Salman Paracha <salmanparacha@MacBook-Pro-261.local>
2024-10-30 18:38:18 -07:00
Adil Hafeez
e462e393b1
Use large github action machine to run e2e tests ( #230 )
2024-10-30 17:54:51 -07:00
Salman Paracha
bb882fb59b
Updated hr_agent to be full stack: gradio + fastAPI ( #235 )
...
* commiting to remove
* fix
* updating hr_agent
---------
Co-authored-by: Salman Paracha <salmanparacha@MacBook-Pro-261.local>
Co-authored-by: Adil Hafeez <adil@katanemo.com>
2024-10-30 15:05:34 -07:00
Salman Paracha
bb9a774a72
moving chatbot-ui in demos and out of root project structure ( #228 )
...
Co-authored-by: Salman Paracha <salmanparacha@MacBook-Pro-261.local>
2024-10-29 12:05:29 -07:00
Adil Hafeez
60299244b9
Improve Gradio UI and fix arch_state bug ( #227 )
2024-10-29 11:27:13 -07:00
José Ulises Niño Rivera
662a840ac5
Add support for streaming and fixes few issues (see description) ( #202 )
2024-10-28 17:05:06 -07:00
Salman Paracha
29ff8da60f
fixed typos in intro to arch docs ( #225 )
2024-10-26 10:41:01 -07:00
CTran
25dddcbfd9
fix model server stop process ( #217 )
...
* fix model server stop process
* replace
* replace
* add test
* add multiple pids test
* add check install for linux
* reformat
2024-10-24 19:21:47 -07:00
Salman Paracha
ff6e9bd9bd
add README for hr_agent ( #224 )
...
* add README for hr_agent
* fixed sample prompt for hr_agent in README
* added screenshot and updated README.md
---------
Co-authored-by: Salman Paracha <salmanparacha@MacBook-Pro-261.local>
2024-10-24 18:21:52 -07:00
Salman Paracha
f88740582f
fixed typos in arch_config.yaml file based on issue #221 ( #223 )
...
Co-authored-by: Salman Paracha <salmanparacha@MacBook-Pro-261.local>
2024-10-24 14:57:29 -07:00