fix memory regression: jemalloc, debug endpoints, state eviction, stress tests

Switch brightstaff to jemalloc to fix glibc malloc fragmentation causing
OOM in prod routing service deployments. Add /debug/memstats and
/debug/state_size endpoints for runtime observability. Add TTL eviction
and max_entries cap to MemoryConversationalStorage. Cap tracing span
attributes/events. Include routing stress tests proving zero per-request
leak, and a Python load test for E2E validation.
This commit is contained in:
Adil Hafeez 2026-04-14 15:03:49 -07:00
parent 980faef6be
commit ec5d3660cd
13 changed files with 1550 additions and 1050 deletions

1707
crates/Cargo.lock generated

File diff suppressed because it is too large Load diff