plano

mirror of https://github.com/katanemo/plano.git synced 2026-05-27 14:17:15 +02:00

Adil Hafeez e5f3039924 make tiktoken token counting optional via enable_token_counting override By default, use cheap len/4 estimate for input token counting (metrics and ratelimit). When enable_token_counting is set to true in overrides, use tiktoken BPE for exact counts. This eliminates ~80ms of per-request latency from tiktoken in the WASM filter while keeping metrics and ratelimit functional. Made-with: Cursor		2026-03-22 21:53:47 -07:00
..
api	removing model_server python module to brightstaff (function calling) (#615 )	2025-11-22 12:55:00 -08:00
traces	use standard tracing and logging in brightstaff (#721 )	2026-02-09 13:33:27 -08:00
configuration.rs	make tiktoken token counting optional via enable_token_counting override	2026-03-22 21:53:47 -07:00
consts.rs	Unified overrides for custom router and orchestrator models (#820 )	2026-03-15 09:36:11 -07:00
errors.rs	[ISSUE 706]: Standardize returned errors from Plano (#772 )	2026-02-24 14:34:33 -08:00
http.rs	Integrate Arch-Function-Chat (#449 )	2025-04-15 14:39:12 -07:00
lib.rs	cargo clippy (#660 )	2025-12-25 21:08:37 -08:00
llm_providers.rs	Adding support for wildcard models in the model_providers config (#696 )	2026-01-28 17:47:33 -08:00
path.rs	upgrade rust to 1.93.0 and fix pre-commit (#720 )	2026-02-02 11:03:12 -08:00
pii.rs	obfuscate auth header (#254 )	2024-11-08 15:17:39 -06:00
ratelimit.rs	Integrate Arch-Function-Chat (#449 )	2025-04-15 14:39:12 -07:00
routing.rs	Adding support for wildcard models in the model_providers config (#696 )	2026-01-28 17:47:33 -08:00
stats.rs	Break apart metrics into their own module (#335 )	2024-12-09 10:46:46 -08:00
tokenizer.rs	add support for v1/messages and transformations (#558 )	2025-09-10 07:40:30 -07:00
tracing.rs	cargo clippy (#660 )	2025-12-25 21:08:37 -08:00
utils.rs	Introduce brightstaff a new terminal service for llm routing (#477 )	2025-05-19 09:59:22 -07:00