Salman Paracha
8d1f132b75
Update README.md
2025-01-20 15:02:43 -08:00
Salman Paracha
966901d2a5
Update README.md
2025-01-20 14:46:14 -08:00
Salman Paracha
0fe0e775ee
Update README.md
2025-01-20 14:45:51 -08:00
Salman Paracha
e2ec2f6bb8
Salmanap/fix readme 019a ( #373 )
...
* updated README based on feedback on reddit
* fixed typo
* updating README with minor fixes
* more fixes to README
* updated README
* updated README
* updated README
---------
Co-authored-by: Salman Paracha <salmanparacha@MacBook-Pro-261.local>
2025-01-20 14:44:40 -08:00
Salman Paracha
c8b5137d37
updated README based on feedback on reddit ( #372 )
...
* updated README based on feedback on reddit
* fixed typo
---------
Co-authored-by: Salman Paracha <salmanparacha@MacBook-Pro-261.local>
2025-01-20 13:56:09 -08:00
Adil Hafeez
452084423c
add PR to release 0.1.9 ( #371 )
2025-01-17 18:47:26 -08:00
Adil Hafeez
07ef3149b8
add support for using custom upstream llm ( #365 )
2025-01-17 18:25:55 -08:00
Adil Hafeez
3fc21de60c
Send per prompt target system prompt ( #368 )
...
* update prompt target name after arch_fc has identified tool
* add test for currency exchange
2025-01-16 15:11:37 -08:00
Adil Hafeez
a24d62af1a
add github pull request in vscode ( #367 )
2025-01-14 10:20:27 -08:00
Shuguang Chen
88a02dc478
Some fixes on model server ( #362 )
...
* Some fixes on model server
* Remove prompt_prefilling message
* Fix logging
* Fix poetry issues
* Improve logging and update the support for text truncation
* Fix tests
* Fix tests
* Fix tests
* Fix modelserver tests
* Update modelserver tests
2025-01-10 16:45:36 -08:00
Salman Paracha
ebda682b30
updated docs for 0.1.8 support ( #366 )
...
* updated docs for 0.1.8 support
* updated REAMDE on root
* updated version reference to 0.1.8 in other parts of the repo
---------
Co-authored-by: Salman Paracha <salmanparacha@MacBook-Pro-261.local>
2025-01-10 16:38:48 -08:00
Adil Hafeez
dae6239b81
use per user docker socket if system docker socket doesn't exist ( #361 )
...
* use per user docker socket if system docker socket doesn't exist
* add retry
2025-01-08 14:55:42 -08:00
Adil Hafeez
aa11113cea
pin poetry to 1.8.5 ( #358 )
2025-01-06 14:23:46 -08:00
Adil Hafeez
8407edae99
only test currency exchange in demo test ( #348 )
2024-12-21 11:33:08 -08:00
Shuguang Chen
ba7279becb
Use intent model from archfc to pick prompt gateway ( #328 )
2024-12-20 13:25:01 -08:00
Aayush
67b8fd635e
add more granular bucket sizes for ttft ( #343 )
...
* add more granular bucket sizes for ttft
2024-12-12 14:38:36 -08:00
José Ulises Niño Rivera
cd1b561192
Break apart metrics into their own module ( #335 )
...
Signed-off-by: José Ulises Niño Rivera <junr03@users.noreply.github.com>
2024-12-09 10:46:46 -08:00
José Ulises Niño Rivera
d002b2042a
Break apart common_types mod ( #334 )
...
Signed-off-by: José Ulises Niño Rivera <junr03@users.noreply.github.com>
2024-12-06 17:25:42 -08:00
Adil Hafeez
93d3d349a2
fix code bug ( #340 )
2024-12-06 17:20:59 -08:00
Adil Hafeez
af0e7d178b
update cli to 0.1.6 ( #338 )
2024-12-06 15:48:07 -08:00
Adil Hafeez
c186c3dfc0
update quick start rst to be in sync with readme.md ( #337 )
2024-12-06 15:15:26 -08:00
Adil Hafeez
a54db1a098
update getting started guide and add llm gateway and prompt gateway samples ( #330 )
2024-12-06 14:37:33 -08:00
Aayush
9d8fe02729
fix the README for the weather_forecasting demo ( #336 )
...
* README fix
* add missing colon
2024-12-06 14:02:41 -08:00
Ikko Eltociear Ashimine
4e919613f1
docs: update README.md ( #332 )
...
minor fix
2024-12-06 13:44:33 -08:00
Aayush
885acc899f
322 add support for pydantic logfire for llm agent tracing ( #329 )
...
* set up otel-collector and implement sending to logfire
* moved rest of the files for the demo into the folder
* update docker-compose.yaml and run_demo.sh to properly check for LOGFIRE_API_KEY
* refactor weather_forecast demo to only be one demo
* add a default docker-compose for e2e tests
* update based on requested changes
* fix replace comma with colon in readme
* remove weather_forecast_service folder, and make logfire demo fail instantly if no key is set
* remove the unused weather forecast service folder
* Changed stop_demo to only stop one file at a time
* update readme with new demo stopping setup
* Revert changes to end behavior
* fix silly formatting mistake
2024-12-06 13:44:22 -08:00
Salman Paracha
a0c159c9ba
updating doc versions, images and cleaning up section for prompt-guard ( #320 )
...
* updating doc versions, images and cleaning up section for prompt-guard
* updating based on feedback
---------
Co-authored-by: Salman Paracha <salmanparacha@MacBook-Pro-261.local>
2024-12-01 23:02:08 -08:00
CTran
cadd3cdaf9
hallucination with log probs ( #281 )
...
* first init
* fix
* fix test
* new implemenetation
* fix bug
* fix bug
* fix bug
* address issue
* address issues
* address comments
* fix test
* fix
* move constatns
* remove consts
2024-11-27 15:17:02 -08:00
Peter Jausovec
f5cdafb7c8
update alertmanager version to v2, remove the merge artifacts ( #309 )
...
Signed-off-by: Peter Jausovec <peter.jausovec@solo.io>
2024-11-27 11:41:31 -08:00
Adil Hafeez
ec5326250e
correctly map stats port to host ( #311 )
2024-11-27 11:28:41 -08:00
Adil Hafeez
adec02e742
add note about hosted arch-fc ( #308 )
2024-11-26 14:19:10 -08:00
Adil Hafeez
704b928d61
release 0.1.5 ( #307 )
2024-11-26 13:28:52 -08:00
Adil Hafeez
0ff3d43008
remove dependency on docker-compose when starting up archgw ( #305 )
2024-11-26 13:13:02 -08:00
Adil Hafeez
726f1a3185
add schema change to use enum in arch_config ( #304 )
2024-11-25 17:51:25 -08:00
José Ulises Niño Rivera
be8c3c9ea3
Remove blanket unused imports from the common crate ( #292 )
...
* Remove blanket unused imports from the common crate
Signed-off-by: José Ulises Niño Rivera <junr03@users.noreply.github.com>
* updatE
Signed-off-by: José Ulises Niño Rivera <junr03@users.noreply.github.com>
---------
Signed-off-by: José Ulises Niño Rivera <junr03@users.noreply.github.com>
2024-11-25 17:19:06 -08:00
Adil Hafeez
9c6fcdb771
use fix prompt guards ( #303 )
2024-11-25 17:16:35 -08:00
Adil Hafeez
6f4a57b56d
update readme with python version ( #302 )
2024-11-25 16:01:40 -08:00
Salman Paracha
970db68575
updating readme and docs with note about Arch-Function ( #285 )
...
* updating readme and docs with note about Arch-Function
* minor fixes to README
* a few more minor updates to the README
---------
Co-authored-by: Salman Paracha <salmanparacha@MacBook-Pro-261.local>
2024-11-19 08:43:56 -08:00
Adil Hafeez
33ab24292c
publish docker image on release only ( #284 )
2024-11-18 18:18:46 -08:00
Adil Hafeez
3d3d015aea
publish docker image ( #283 )
2024-11-18 17:55:47 -08:00
Adil Hafeez
36489b4adc
use envoy to publish traces ( #270 )
2024-11-18 17:55:39 -08:00
Adil Hafeez
9cee04ed31
release 0.1.3 ( #280 )
...
* release 0.1.3
* udpate ver
2024-11-17 17:12:01 -08:00
Adil Hafeez
097513ee60
fix start time of llm filter ( #278 )
...
* fix start time of llm filter
* fix int tests
2024-11-17 17:01:19 -08:00
Salman Paracha
df0cd50cbd
updating website to track analytics ( #273 )
...
Co-authored-by: Salman Paracha <salmanparacha@MacBook-Pro-261.local>
2024-11-16 21:37:04 -08:00
Salman Paracha
8e9608995c
updated website with PH badge ( #272 )
2024-11-16 13:02:23 -08:00
Salman Paracha
a0d87d86c9
updating docs to reflect changes in 0.1.2 like tracing via signoz and… ( #271 )
2024-11-15 16:55:27 -08:00
Adil Hafeez
d3c17c7abd
move custom tracer to llm filter ( #267 )
2024-11-15 10:44:01 -08:00
Aayush
1d229cba8f
Add in tpot ( #269 )
...
* add in tpot and tokens per second
* add in debug logs for new stats and update integration tests
* update shared dashboard to include new stats
2024-11-14 15:03:08 -08:00
Salman Paracha
9eeb790c7f
updating README with PH launch results ( #268 )
2024-11-14 01:11:26 -08:00
Adil Hafeez
d1dd8710a4
release 0.1.2 ( #266 )
2024-11-12 23:56:33 -08:00
Adil Hafeez
31749bfc74
move grafana and prometheus to shared ( #265 )
2024-11-12 15:23:30 -08:00