Plano is an AI-native proxy and data plane for agentic apps — with built-in orchestration, safety, observability, and smart LLM routing so you stay focused on your agents core logic. https://planoai.dev
Find a file
Salman Paracha 80c554ce1a
Docs branch - v1 of our tech docs (#69)
* added the first set of docs for our technical docs

* more docuemtnation changes

* added support for prompt processing and updated life of a request

* updated docs to including getting help sections and updated life of a request

* committing local changes for getting started guide, sample applications, and full reference spec for prompt-config

* updated configuration reference, added sample app skeleton, updated favico

* fixed the configuration refernce file, and made minor changes to the intent detection. commit v1 for now

---------

Co-authored-by: Salman Paracha <salmanparacha@MacBook-Pro-261.local>
Co-authored-by: Adil Hafeez <adil@katanemo.com>
2024-09-20 17:08:42 -07:00
.github/workflows Add initial integration style tests (#20) 2024-07-25 14:41:36 -07:00
chatbot_ui move demo functions out of model_server (#67) 2024-09-20 14:38:10 -07:00
config_generator Add ability to define clusters in config (#61) 2024-09-18 20:03:26 -07:00
demos move demo functions out of model_server (#67) 2024-09-20 14:38:10 -07:00
docs Docs branch - v1 of our tech docs (#69) 2024-09-20 17:08:42 -07:00
envoyfilter comment required param check 2024-09-20 15:49:49 -07:00
function_resolver use openai standard response in arch-fc and in gradio client (#62) 2024-09-19 12:19:14 -07:00
model_server move demo functions out of model_server (#67) 2024-09-20 14:38:10 -07:00
open-message-format@1e838f3f40 update open-message-format (#30) 2024-07-31 15:56:13 -07:00
public_types Include param default in parameters (#68) 2024-09-20 09:02:24 -07:00
.gitignore Docs branch - v1 of our tech docs (#69) 2024-09-20 17:08:42 -07:00
.gitmodules Add function calling support using bolt-fc-1b (#35) 2024-09-10 14:24:46 -07:00
.pre-commit-config.yaml Add ability to define clusters in config (#61) 2024-09-18 20:03:26 -07:00
gateway.code-workspace move demo functions out of model_server (#67) 2024-09-20 14:38:10 -07:00
README.md Update README.md 2024-09-10 14:27:14 -07:00

A open source project for developers to build and secure faster, more personalized generative AI apps. Katanemo is a high performance gateway designed with state of the art (SOTA) fast LLMs to process, route and evaluate prompts.

Demos

Complete

In progress

  • Network Co-pilot

Not Started

  • Show routing between different prompt targets (keyword search vs. top-k semantic search).
  • Show routing between different prompt-resolver vs RAG-based resolver targets.
  • Text Summarization Based on Lightweight vs. Thoughtful Dialogue using OpenAI
  • Show conversational and system observability metrics. This includes topic/intent detection
  • Show how we can help developers implement safeguards customized to their application requirements and responsible AI policies.

Dev setup

Pre-commit

Use instructions at pre-commit.com to set it up for your machine. Once installed make sure github hooks are setup, so that when you upstream your change pre-commit hooks can run and validate your change. Follow command below to setup github hooks,

$ brew install pre-commit
$ pre-commit install
pre-commit installed at .git/hooks/pre-commit