mirror of https://github.com/katanemo/plano.git synced 2026-05-05 13:53:03 +02:00

Adil Hafeez e5f3039924 make tiktoken token counting optional via enable_token_counting override By default, use cheap len/4 estimate for input token counting (metrics and ratelimit). When enable_token_counting is set to true in overrides, use tiktoken BPE for exact counts. This eliminates ~80ms of per-request latency from tiktoken in the WASM filter while keeping metrics and ratelimit functional. Made-with: Cursor		2026-03-22 21:53:47 -07:00
..
docker-compose.dev.yaml	Overhaul demos directory: cleanup, restructure, and standardize configs (#760 )	2026-02-17 03:09:28 -08:00
env.list	restructure cli (#656 )	2025-12-25 14:55:29 -08:00
envoy.template.yaml	Unified overrides for custom router and orchestrator models (#820 )	2026-03-15 09:36:11 -07:00
plano_config_schema.yaml	make tiktoken token counting optional via enable_token_counting override	2026-03-22 21:53:47 -07:00
README.md	Adding support for wildcard models in the model_providers config (#696 )	2026-01-28 17:47:33 -08:00
requirements.txt	restructure cli (#656 )	2025-12-25 14:55:29 -08:00
supervisord.conf	separate config generation from process startup in supervisord (#838 )	2026-03-19 22:37:56 -07:00
test_passthrough.yaml	Run plano natively by default (#744 )	2026-03-05 07:35:25 -08:00
validate_plano_config.sh	Run plano natively by default (#744 )	2026-03-05 07:35:25 -08:00

Envoy filter code for gateway

Add toolchain

$ rustup target add wasm32-wasip1

$ cargo build --target wasm32-wasip1 --release

$ cargo test

Build docker image for Plano. Note this needs to be built once.
```
$ sh build_filter_image.sh
```

Build filter binary,

$ cargo build --target wasm32-wasip1 --release

Start envoy with config.yaml and test,

$ docker compose -f docker-compose.dev.yaml up plano

dev version of docker-compose file uses following files that are mounted inside the container. That means no docker rebuild is needed if any of these files change. Just restart the container and chagne will be picked up,
- envoy.template.yaml
- intelligent_prompt_gateway.wasm