Adil Hafeez
253017e93d
Merge branch 'main' into adil/add_acm_demo
2025-01-17 18:26:27 -08:00
Adil Hafeez
07ef3149b8
add support for using custom upstream llm ( #365 )
2025-01-17 18:25:55 -08:00
Adil Hafeez
5c066c9825
add more changes
2025-01-17 18:24:59 -08:00
Adil Hafeez
c532a5f4c7
Merge branch 'adil/fix_prompt_target_name' into adil/add_acm_demo
2025-01-17 17:50:40 -08:00
Adil Hafeez
1b3c1b8ba5
fix tests
2025-01-17 16:52:38 -08:00
Adil Hafeez
36c9c0d414
fix tests
2025-01-17 16:41:55 -08:00
Adil Hafeez
c235eaf762
fix test
2025-01-17 16:39:13 -08:00
Adil Hafeez
aca1631b49
add more changes
2025-01-17 16:33:17 -08:00
Adil Hafeez
a7b9458e5a
fix rust tests
2025-01-17 11:00:36 -08:00
Adil Hafeez
46cca42040
fix more
2025-01-16 18:31:40 -08:00
Adil Hafeez
aa649d5d80
add schema validator for provider
2025-01-16 17:18:26 -08:00
Adil Hafeez
00e4ba55a8
ensure that only openai is used for provider
2025-01-16 17:13:45 -08:00
Adil Hafeez
c7f8c2cef9
add demo for ollama
2025-01-16 16:34:17 -08:00
Adil Hafeez
2928b7630f
Merge branch 'main' into adil/fix_prompt_target_name
2025-01-16 15:14:57 -08:00
Adil Hafeez
3fc21de60c
Send per prompt target system prompt ( #368 )
* update prompt target name after arch_fc has identified tool
* add test for currency exchange
2025-01-16 15:11:37 -08:00
Adil Hafeez
5017e7931e
fix tracing
2025-01-16 14:37:09 -08:00
Adil Hafeez
2413f56980
pending changes
2025-01-16 14:33:59 -08:00
Adil Hafeez
1a10b82724
pending changes
2025-01-14 16:32:52 -08:00
Adil Hafeez
9570b167db
fix tracing
2025-01-14 11:56:10 -08:00
Adil Hafeez
35065e2e41
Merge branch 'main' into adil/fix_prompt_target_name
2025-01-14 10:20:49 -08:00
Adil Hafeez
a24d62af1a
add github pull request in vscode ( #367 )
2025-01-14 10:20:27 -08:00
Adil Hafeez
b8474f42c9
Merge branch 'main' into adil/fix_prompt_target_name
2025-01-13 15:53:40 -08:00
Shuguang Chen
88a02dc478
Some fixes on model server ( #362 )
* Some fixes on model server
* Remove prompt_prefilling message
* Fix logging
* Fix poetry issues
* Improve logging and update the support for text truncation
* Fix tests
* Fix tests
* Fix tests
* Fix modelserver tests
* Update modelserver tests
2025-01-10 16:45:36 -08:00
Salman Paracha
ebda682b30
updated docs for 0.1.8 support ( #366 )
* updated docs for 0.1.8 support
* updated README on root
* updated version reference to 0.1.8 in other parts of the repo
---------
Co-authored-by: Salman Paracha <salmanparacha@MacBook-Pro-261.local>
2025-01-10 16:38:48 -08:00
Adil Hafeez
516d9a7c4a
update prompt target name after arch_fc has identified tool
2025-01-10 15:28:33 -08:00
Adil Hafeez
42ab061971
pending
2025-01-10 12:52:29 -08:00
Adil Hafeez
e55127d325
remove extra http_method
2025-01-08 16:58:43 -08:00
Adil Hafeez
68097fde07
Merge branch 'main' into adil/add_acm_demo
2025-01-08 16:55:07 -08:00
Adil Hafeez
dae6239b81
use per user docker socket if system docker socket doesn't exist ( #361 )
* use per user docker socket if system docker socket doesn't exist
* add retry
2025-01-08 14:55:42 -08:00
Adil Hafeez
aa11113cea
pin poetry to 1.8.5 ( #358 )
2025-01-06 14:23:46 -08:00
Adil Hafeez
8407edae99
only test currency exchange in demo test ( #348 )
2024-12-21 11:33:08 -08:00
Shuguang Chen
ba7279becb
Use intent model from archfc to pick prompt gateway ( #328 )
2024-12-20 13:25:01 -08:00
Aayush
67b8fd635e
add more granular bucket sizes for ttft ( #343 )
* add more granular bucket sizes for ttft
2024-12-12 14:38:36 -08:00
José Ulises Niño Rivera
cd1b561192
Break apart metrics into their own module ( #335 )
Signed-off-by: José Ulises Niño Rivera <junr03@users.noreply.github.com>
2024-12-09 10:46:46 -08:00
José Ulises Niño Rivera
d002b2042a
Break apart common_types mod ( #334 )
Signed-off-by: José Ulises Niño Rivera <junr03@users.noreply.github.com>
2024-12-06 17:25:42 -08:00
Adil Hafeez
93d3d349a2
fix code bug ( #340 )
2024-12-06 17:20:59 -08:00
Adil Hafeez
285a66fdb6
Merge branch 'main' into adil/add_acm_demo
2024-12-06 16:17:26 -08:00
Adil Hafeez
af0e7d178b
update cli to 0.1.6 ( #338 )
2024-12-06 15:48:07 -08:00
Adil Hafeez
c186c3dfc0
update quick start rst to be in sync with readme.md ( #337 )
2024-12-06 15:15:26 -08:00
Adil Hafeez
a54db1a098
update getting started guide and add llm gateway and prompt gateway samples ( #330 )
2024-12-06 14:37:33 -08:00
Aayush
9d8fe02729
fix the README for the weather_forecasting demo ( #336 )
* README fix
* add missing colon
2024-12-06 14:02:41 -08:00
Ikko Eltociear Ashimine
4e919613f1
docs: update README.md ( #332 )
minor fix
2024-12-06 13:44:33 -08:00
Aayush
885acc899f
322 add support for pydantic logfire for llm agent tracing ( #329 )
* set up otel-collector and implement sending to logfire
* moved rest of the files for the demo into the folder
* update docker-compose.yaml and run_demo.sh to properly check for LOGFIRE_API_KEY
* refactor weather_forecast demo to only be one demo
* add a default docker-compose for e2e tests
* update based on requested changes
* fix replace comma with colon in readme
* remove weather_forecast_service folder, and make logfire demo fail instantly if no key is set
* remove the unused weather forecast service folder
* Changed stop_demo to only stop one file at a time
* update readme with new demo stopping setup
* Revert changes to end behavior
* fix silly formatting mistake
2024-12-06 13:44:22 -08:00
Adil Hafeez
5e182b6c09
pending
2024-12-05 10:11:55 -08:00
Adil Hafeez
af02807004
remove debug statement
2024-12-03 19:35:37 -08:00
Adil Hafeez
4343387adc
Add demo for acm
2024-12-03 19:22:31 -08:00
Salman Paracha
a0c159c9ba
updating doc versions, images and cleaning up section for prompt-guard ( #320 )
* updating doc versions, images and cleaning up section for prompt-guard
* updating based on feedback
---------
Co-authored-by: Salman Paracha <salmanparacha@MacBook-Pro-261.local>
2024-12-01 23:02:08 -08:00
CTran
cadd3cdaf9
hallucination with log probs ( #281 )
* first init
* fix
* fix test
* new implementation
* fix bug
* fix bug
* fix bug
* address issue
* address issues
* address comments
* fix test
* fix
* move constants
* remove consts
2024-11-27 15:17:02 -08:00
Peter Jausovec
f5cdafb7c8
update alertmanager version to v2, remove the merge artifacts ( #309 )
Signed-off-by: Peter Jausovec <peter.jausovec@solo.io>
2024-11-27 11:41:31 -08:00
Adil Hafeez
ec5326250e
correctly map stats port to host ( #311 )
2024-11-27 11:28:41 -08:00