Commit graph

628 commits

Author SHA1 Message Date
Adil Hafeez
e82f8f216f
Encode parameter values in http path and ... (#395)
* Encode parameter values in http path and ...

- don't send param values in request body in http get request
- send param values in http post request

* rust tests

* refactor code

* add tests
2025-02-06 11:00:47 -08:00
Adil Hafeez
a62f906432
remove unused cargo.lock files (#396) 2025-02-05 20:25:41 -08:00
Adil Hafeez
afa2142d93
run rust tests for all crates upon commit (#393) 2025-02-05 18:57:01 -08:00
Adil Hafeez
fa089ef32d
Publish both arm64 and amd64 in docker on main (#392) 2025-02-04 15:04:09 -08:00
Adil Hafeez
962727f244
Infer port from protocol if port is not specified and add ability to override hostname in clusters def (#389) 2025-02-03 14:51:59 -08:00
obinnascale3
25692bbbfc
Add Langtrace as a supported observability tool (#376)
* add langtrace as a tracing tool

* add setup step for Arch installation

---------

Co-authored-by: Obinna Okafor <obinna.okafor01@gmail.com>
2025-01-31 11:16:30 -08:00
Adil Hafeez
39266b5084
log improvements and some code refactor (#379) 2025-01-31 10:37:53 -08:00
Salman Paracha
e79d16ec81
Updated README layout (#387)
* Updated README layout

* minor fixes

* minor fixes

* minor fixes

* minor fixes

* minor fixes

* minor clean up

---------

Co-authored-by: Salman Paracha <salmanparacha@MacBook-Pro-261.local>
2025-01-30 17:36:11 -08:00
Adil Hafeez
a7feb6bffb
fix llm_provider format (#385) 2025-01-24 20:35:56 -08:00
Adil Hafeez
7830f4b431
release 0.2.0 (#384)
* release 0.2.0

* update versions
2025-01-24 17:31:48 -08:00
Adil Hafeez
38f7691163
add support for custom llm with ssl support (#380)
* add support for custom llm with ssl support

Add support for using custom llm that are served through https protocol.

* add instructions on how to add custom inference endpoint

* fix formatting

* add more details

* Apply suggestions from code review

Co-authored-by: Salman Paracha <salman.paracha@gmail.com>

* Apply suggestions from code review

* fix precommit

---------

Co-authored-by: Salman Paracha <salman.paracha@gmail.com>
2025-01-24 17:14:24 -08:00
Adil Hafeez
2c67fa3bc0
Fix llm_routing provider element (#382)
* Fix llm_routing provider element

We replaced provider with provider_interface to make it more clear to developers about provider api/backend being used. During that upgrade we removed support for mistral in provider to encourage developers to start using provider_interface. But this demo was not updated to use provider_interface as it was using mistral. This code change fixes it by replacing provider with provider_interface.

Signed-off-by: Adil Hafeez <adil.hafeez@gmail.com>

* fix the path

* move

* add more details

* fix

* Apply suggestions from code review

* fix

* fix

---------

Signed-off-by: Adil Hafeez <adil.hafeez@gmail.com>
2025-01-24 16:34:11 -08:00
Guy Stone
84af476c75
Add run_demo.sh for llm_routing demo (#381)
* Create run_demo.sh for llm_routing demo

* Update run_demo.sh
2025-01-24 14:41:51 -08:00
Salman Paracha
1e75c1c5e0 updating README 2025-01-23 11:31:10 -08:00
Salman Paracha
1d9bfcb7db
Update README.md 2025-01-23 11:26:21 -08:00
Salman Paracha
f9bed5d75c
Update README.md 2025-01-23 11:25:46 -08:00
Adil Hafeez
6740a09952
add docker-compose file for honeycomb tracing (#377) 2025-01-22 14:02:59 -08:00
Adil Hafeez
6887d52750
When using ollama token count was not coming in (#375)
When using ollama token count was not coming in resulting in token count and other metrics to show up as zero. This was not causing tracing to break.
2025-01-21 18:01:56 -08:00
Aayush
fcd8cfb9fc
add in honeycomb support for weather-forecast demo (#345) 2025-01-21 17:15:27 -08:00
Salman Paracha
bea0dd4a83
Update README.md 2025-01-21 10:56:17 -08:00
Salman Paracha
4bbf6c382e
Update README.md 2025-01-20 15:03:19 -08:00
Salman Paracha
8d1f132b75
Update README.md 2025-01-20 15:02:43 -08:00
Salman Paracha
966901d2a5
Update README.md 2025-01-20 14:46:14 -08:00
Salman Paracha
0fe0e775ee
Update README.md 2025-01-20 14:45:51 -08:00
Salman Paracha
e2ec2f6bb8
Salmanap/fix readme 019a (#373)
* updated README based on feedback on reddit

* fixed typo

* updating README with minor fixes

* more fixes to README

* updated README

* updated README

* updated README

---------

Co-authored-by: Salman Paracha <salmanparacha@MacBook-Pro-261.local>
2025-01-20 14:44:40 -08:00
Salman Paracha
c8b5137d37
updated README based on feedback on reddit (#372)
* updated README based on feedback on reddit

* fixed typo

---------

Co-authored-by: Salman Paracha <salmanparacha@MacBook-Pro-261.local>
2025-01-20 13:56:09 -08:00
Adil Hafeez
452084423c
add PR to release 0.1.9 (#371) 2025-01-17 18:47:26 -08:00
Adil Hafeez
07ef3149b8
add support for using custom upstream llm (#365) 2025-01-17 18:25:55 -08:00
Adil Hafeez
3fc21de60c
Send per prompt target system prompt (#368)
* update prompt target name after arch_fc has identified tool

* add test for currency exchange
2025-01-16 15:11:37 -08:00
Adil Hafeez
a24d62af1a
add github pull request in vscode (#367) 2025-01-14 10:20:27 -08:00
Shuguang Chen
88a02dc478
Some fixes on model server (#362)
* Some fixes on model server

* Remove prompt_prefilling message

* Fix logging

* Fix poetry issues

* Improve logging and update the support for text truncation

* Fix tests

* Fix tests

* Fix tests

* Fix modelserver tests

* Update modelserver tests
2025-01-10 16:45:36 -08:00
Salman Paracha
ebda682b30
updated docs for 0.1.8 support (#366)
* updated docs for 0.1.8 support

* updated REAMDE on root

* updated version reference to 0.1.8 in other parts of the repo

---------

Co-authored-by: Salman Paracha <salmanparacha@MacBook-Pro-261.local>
2025-01-10 16:38:48 -08:00
Adil Hafeez
dae6239b81
use per user docker socket if system docker socket doesn't exist (#361)
* use per user docker socket if system docker socket doesn't exist

* add retry
2025-01-08 14:55:42 -08:00
Adil Hafeez
aa11113cea
pin poetry to 1.8.5 (#358) 2025-01-06 14:23:46 -08:00
Adil Hafeez
8407edae99
only test currency exchange in demo test (#348) 2024-12-21 11:33:08 -08:00
Shuguang Chen
ba7279becb
Use intent model from archfc to pick prompt gateway (#328) 2024-12-20 13:25:01 -08:00
Aayush
67b8fd635e
add more granular bucket sizes for ttft (#343)
* add more granular bucket sizes for ttft
2024-12-12 14:38:36 -08:00
José Ulises Niño Rivera
cd1b561192
Break apart metrics into their own module (#335)
Signed-off-by: José Ulises Niño Rivera <junr03@users.noreply.github.com>
2024-12-09 10:46:46 -08:00
José Ulises Niño Rivera
d002b2042a
Break apart common_types mod (#334)
Signed-off-by: José Ulises Niño Rivera <junr03@users.noreply.github.com>
2024-12-06 17:25:42 -08:00
Adil Hafeez
93d3d349a2
fix code bug (#340) 2024-12-06 17:20:59 -08:00
Adil Hafeez
af0e7d178b
update cli to 0.1.6 (#338) 2024-12-06 15:48:07 -08:00
Adil Hafeez
c186c3dfc0
update quick start rst to be in sync with readme.md (#337) 2024-12-06 15:15:26 -08:00
Adil Hafeez
a54db1a098
update getting started guide and add llm gateway and prompt gateway samples (#330) 2024-12-06 14:37:33 -08:00
Aayush
9d8fe02729
fix the README for the weather_forecasting demo (#336)
* README fix

* add missing colon
2024-12-06 14:02:41 -08:00
Ikko Eltociear Ashimine
4e919613f1
docs: update README.md (#332)
minor fix
2024-12-06 13:44:33 -08:00
Aayush
885acc899f
322 add support for pydantic logfire for llm agent tracing (#329)
* set up otel-collector and implement sending to logfire

* moved rest of the files for the demo into the folder

* update docker-compose.yaml and run_demo.sh to properly check for LOGFIRE_API_KEY

* refactor weather_forecast demo to only be one demo

* add a default docker-compose for e2e tests

* update based on requested changes

* fix replace comma with colon in readme

* remove weather_forecast_service folder, and make logfire demo fail instantly if no key is set

* remove the unused weather forecast service folder

* Changed stop_demo to only stop one file at a time

* update readme with new demo stopping setup

* Revert changes to end behavior

* fix silly formatting mistake
2024-12-06 13:44:22 -08:00
Salman Paracha
a0c159c9ba
updating doc versions, images and cleaning up section for prompt-guard (#320)
* updating doc versions, images and cleaning up section for prompt-guard

* updating based on feedback

---------

Co-authored-by: Salman Paracha <salmanparacha@MacBook-Pro-261.local>
2024-12-01 23:02:08 -08:00
CTran
cadd3cdaf9
hallucination with log probs (#281)
* first init

* fix

* fix test

* new implemenetation

* fix bug

* fix bug

* fix bug

* address issue

* address issues

* address comments

* fix test

* fix

* move constatns

* remove consts
2024-11-27 15:17:02 -08:00
Peter Jausovec
f5cdafb7c8
update alertmanager version to v2, remove the merge artifacts (#309)
Signed-off-by: Peter Jausovec <peter.jausovec@solo.io>
2024-11-27 11:41:31 -08:00
Adil Hafeez
ec5326250e
correctly map stats port to host (#311) 2024-11-27 11:28:41 -08:00