Fix llm_routing provider element

We replaced provider with provider_interface to make it clearer to developers which provider API/backend is being used. During that upgrade we removed support for mistral in provider to encourage developers to start using provider_interface. This demo was not updated at the time, so its mistral entry was still declared through provider. This change fixes it by replacing provider with provider_interface.
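
For illustration, the mistral entry in the llm_routing demo config changes roughly as sketched below. The values are taken from the diff in this commit; the indentation is an assumption, since the diff view shows the YAML flattened, and the old form is shown only as a comment.

```yaml
llm_providers:
  # Before (old key, shown commented out):
  # - name: ministral-3b
  #   access_key: $MISTRAL_API_KEY
  #   provider: mistral
  #   model: ministral-3b-latest

  # After (key renamed to provider_interface; indentation assumed):
  - name: ministral-3b
    access_key: $MISTRAL_API_KEY
    provider_interface: mistral
    model: ministral-3b-latest
```

The same rename is applied to the openai entries in the other demo configs touched by this commit.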

Signed-off-by: Adil Hafeez <adil.hafeez@gmail.com>
Adil Hafeez 2025-01-24 14:59:11 -08:00
parent 84af476c75
commit 16a86c0f3b
13 changed files with 28 additions and 20 deletions

@ -7,9 +7,9 @@ listener:
connect_timeout: 0.005s
llm_providers:
- name: gpt-4o
- name: openai-gpt-4o
access_key: $OPENAI_API_KEY
provider: openai
provider_interface: openai
model: gpt-4o
system_prompt: |

@ -7,7 +7,7 @@ listener:
# Centralized way to manage LLMs, manage keys, retry logic, failover and limits in a central way
llm_providers:
- name: OpenAI
provider: openai
provider_interface: openai
access_key: $OPENAI_API_KEY
model: gpt-4o-mini
default: true

@ -9,7 +9,7 @@ system_prompt: |
llm_providers:
- name: OpenAI
provider: openai
provider_interface: openai
access_key: $OPENAI_API_KEY
model: gpt-4o
default: true

@ -10,10 +10,18 @@ This demo shows how you can arch gateway to manage keys and route to appropricat
1. Navigate to http://localhost:18080/
# Observability
Arch gateway publishes stats endpoint at http://localhost:19901/stats. In this demo we are using prometheus to pull stats from arch and we are using grafana to visalize the stats in dashboard. To see grafana dashboard follow instructions below,
Arch gateway publishes stats endpoint at http://localhost:19901/stats. In this demo we are using prometheus to pull stats from arch and we are using grafana to visualize the stats in dashboard. To see grafana dashboard follow instructions below,
1. Navigate to http://localhost:3000/ to open grafana UI (use admin/grafana as credentials)
1. From grafana left nav click on dashboards and select "Intelligent Gateway Overview" to view arch gateway stats
# Selecting different LLM
You can pick different LLM based on header `x-arch-llm-provider-hint` to override default LLM.
You can pick different LLM based on header `x-arch-llm-provider-hint` to override default LLM. Or you can use chatbot UI to override LLM.
Here is a preview of chat UI,
![LLM Routing Demo](docs/source/_static/img/llm_routing_demo.png)
And you can also see tracing using jaeger UI that we start by heading over to http://localhost:16686/
![Jaeger Tracing](docs/source/_static/img/jaeger_tracing_llm_routing.png)

@ -9,23 +9,23 @@ listener:
llm_providers:
- name: gpt-4o-mini
access_key: $OPENAI_API_KEY
provider: openai
provider_interface: openai
model: gpt-4o-mini
default: true
- name: gpt-3.5-turbo-0125
access_key: $OPENAI_API_KEY
provider: openai
provider_interface: openai
model: gpt-3.5-turbo-0125
- name: gpt-4o
access_key: $OPENAI_API_KEY
provider: openai
provider_interface: openai
model: gpt-4o
- name: ministral-3b
access_key: $MISTRAL_API_KEY
provider: mistral
provider_interface: mistral
model: ministral-3b-latest
tracing:

@ -14,7 +14,7 @@ endpoints:
llm_providers:
- name: gpt-4o-mini
access_key: $OPENAI_API_KEY
provider: openai
provider_interface: openai
model: gpt-4o-mini
default: true

@ -7,7 +7,7 @@ listener:
# Centralized way to manage LLMs, manage keys, retry logic, failover and limits in a central way
llm_providers:
- name: OpenAI
provider: openai
provider_interface: openai
access_key: $OPENAI_API_KEY
model: gpt-3.5-turbo
default: true

@ -18,18 +18,18 @@ overrides:
llm_providers:
- name: gpt-4o-mini
access_key: $OPENAI_API_KEY
provider: openai
provider_interface: openai
model: gpt-4o-mini
default: true
- name: gpt-3.5-turbo-0125
access_key: $OPENAI_API_KEY
provider: openai
provider_interface: openai
model: gpt-3.5-turbo-0125
- name: gpt-4o
access_key: $OPENAI_API_KEY
provider: openai
provider_interface: openai
model: gpt-4o
system_prompt: |

Binary file not shown (size: 273 KiB).

Binary file not shown (size: 284 KiB).

@ -9,7 +9,7 @@ listener:
# Centralized way to manage LLMs, manage keys, retry logic, failover and limits in a central way
llm_providers:
- name: OpenAI
provider: openai
provider_interface: openai
access_key: $OPENAI_API_KEY
model: gpt-4o
default: true

@ -9,7 +9,7 @@ listener:
# Centralized way to manage LLMs, manage keys, retry logic, failover and limits in a central way
llm_providers:
- name: OpenAI
provider: openai
provider_interface: openai
access_key: $OPENAI_API_KEY
model: gpt-4o
default: true

@ -14,18 +14,18 @@ endpoints:
llm_providers:
- name: gpt-4o-mini
access_key: $OPENAI_API_KEY
provider: openai
provider_interface: openai
model: gpt-4o-mini
default: true
- name: gpt-3.5-turbo-0125
access_key: $OPENAI_API_KEY
provider: openai
provider_interface: openai
model: gpt-3.5-turbo-0125
- name: gpt-4o
access_key: $OPENAI_API_KEY
provider: openai
provider_interface: openai
model: gpt-4o
system_prompt: |