mirror of
https://github.com/katanemo/plano.git
synced 2026-05-15 11:02:39 +02:00
updating the messaging to call ourselves the edge and AI gateway for … (#527)
* updating the messaging to call ourselves the edge and AI gateway for agents * updating README to tidy up some language * updating README to tidy up some language * updating README to tidy up some language --------- Co-authored-by: Salman Paracha <salmanparacha@MacBook-Pro-329.local>
This commit is contained in:
parent
14f19f02a7
commit
5e65572573
6 changed files with 25 additions and 33 deletions
Binary file not shown.
|
Before Width: | Height: | Size: 359 KiB After Width: | Height: | Size: 218 KiB |
|
|
@ -35,28 +35,23 @@ make outbound LLM calls.
|
|||
Adding custom LLM Provider
|
||||
--------------------------
|
||||
|
||||
We support any OpenAI compliant LLM for example mistral, openai, ollama etc. We offer first class support for openai and ollama. You can easily configure an LLM that communicates over the OpenAI API interface, by following the below guide.
|
||||
We support any OpenAI compliant LLM for example mistral, openai, ollama etc. We also offer first class support for OpenAI, Anthropic, DeepSeek, Mistral, Groq, and Ollama based models.
|
||||
You can easily configure an LLM that communicates over the OpenAI API interface, by following the below guide.
|
||||
|
||||
For example following code block shows you how to add an ollama-supported LLM in the `arch_config.yaml` file.
|
||||
|
||||
.. code-block:: yaml
|
||||
|
||||
- name: local-llama
|
||||
llm_providers:
|
||||
- model: some_custom_llm_provider/llama3.2
|
||||
provider_interface: openai
|
||||
model: llama3.2
|
||||
endpoint: host.docker.internal:11434
|
||||
base_url: http://host.docker.internal:11434
|
||||
|
||||
|
||||
For example following code block shows you how to add mistral llm provider in the `arch_config.yaml` file.
|
||||
And in the following code block shows you how to add mistral llm provider in the `arch_config.yaml` file.
|
||||
|
||||
.. code-block:: yaml
|
||||
|
||||
- name: mistral-ai
|
||||
provider_interface: openai
|
||||
model: ministral-3b-latest
|
||||
endpoint: api.mistral.ai:443
|
||||
protocol: https
|
||||
|
||||
llm_providers:
|
||||
- name: mistral/ministral-3b-latest
|
||||
access_key: $MISTRAL_API_KEY
|
||||
|
||||
Example: Using the OpenAI Python SDK
|
||||
------------------------------------
|
||||
|
|
@ -65,15 +60,15 @@ Example: Using the OpenAI Python SDK
|
|||
|
||||
from openai import OpenAI
|
||||
|
||||
# Initialize the Arch client
|
||||
client = OpenAI(base_url="http://127.0.0.12000/")
|
||||
# Initialize the Arch client
|
||||
client = OpenAI(base_url="http://127.0.0.1:2000/")
|
||||
|
||||
# Define your LLM provider and prompt
|
||||
llm_provider = "openai"
|
||||
prompt = "What is the capital of France?"
|
||||
# Define your model and messages
|
||||
model = "llama3.2"
|
||||
messages = [{"role": "user", "content": "What is the capital of France?"}]
|
||||
|
||||
# Send the prompt to the LLM through Arch
|
||||
response = client.completions.create(llm_provider=llm_provider, prompt=prompt)
|
||||
# Send the messages to the LLM through Arch
|
||||
response = client.chat.completions.create(model=model, messages=messages)
|
||||
|
||||
# Print the response
|
||||
print("LLM Response:", response)
|
||||
# Print the response
|
||||
print("LLM Response:", response.choices[0].message.content)
|
||||
|
|
|
|||
|
|
@ -20,7 +20,7 @@ Arch is designed to solve these problems by providing a unified, out-of-process
|
|||
High-level network flow of where Arch Gateway sits in your agentic stack. Designed for both ingress and egress prompt traffic.
|
||||
|
||||
|
||||
Arch is an AI-native proxy server and the universal data plane for AI built by the contributors of Envoy Proxy with the belief that:
|
||||
`Arch <https://github.com/katanemo/arch>`_ is a modular edge and AI gateway for AI-native apps - built by the contributors of Envoy Proxy with the belief that:
|
||||
|
||||
*Prompts are nuanced and opaque user requests, which require the same capabilities as traditional HTTP requests
|
||||
including secure handling, intelligent routing, robust observability, and integration with backend (API)
|
||||
|
|
|
|||
|
|
@ -3,7 +3,7 @@
|
|||
|
||||
Overview
|
||||
============
|
||||
`Arch <https://github.com/katanemo/arch>`_ is an AI-native proxy server and the universal data plane for AI - one that is natively designed to handle and process AI prompts, not just network traffic.
|
||||
`Arch <https://github.com/katanemo/arch>`_ is a modular edge and AI gateway for AI-native apps - one that is natively designed to handle and process prompts, not just network traffic.
|
||||
|
||||
Built by contributors to the widely adopted `Envoy Proxy <https://www.envoyproxy.io/>`_, Arch helps you move faster by handling the pesky *low-level* work in AI agent development—fast input clarification, intelligent agent routing, seamless prompt-to-tool integration, and unified LLM access and observability—all without locking you into a framework.
|
||||
|
||||
|
|
|
|||
|
|
@ -14,7 +14,7 @@ Welcome to Arch!
|
|||
|
||||
<a href="https://www.producthunt.com/posts/arch-3?embed=true&utm_source=badge-top-post-badge&utm_medium=badge&utm_souce=badge-arch-3" target="_blank"><img src="https://api.producthunt.com/widgets/embed-image/v1/top-post-badge.svg?post_id=565761&theme=dark&period=daily&t=1742433071161" alt="Arch - Build fast, hyper-personalized agents with intelligent infra | Product Hunt" style="width: 250px; height: 54px;" width="250" height="54" /></a>
|
||||
|
||||
`Arch <https://github.com/katanemo/arch>`_ is an AI-native proxy server and the universal data plane for AI - one that is natively designed to handle and process AI prompts, not just network traffic.
|
||||
`Arch <https://github.com/katanemo/arch>`_ is a modular edge and AI gateway for AI-native apps - one that is natively designed to handle and process prompts, not just network traffic.
|
||||
|
||||
Built by contributors to the widely adopted `Envoy Proxy <https://www.envoyproxy.io/>`_, Arch helps you move faster by handling the pesky *low-level* work in AI agent development—fast input clarification, intelligent agent routing, seamless prompt-to-tool integration, and unified LLM access and observability—all without locking you into a framework.
|
||||
|
||||
|
|
|
|||
Loading…
Add table
Add a link
Reference in a new issue