mirror of
https://github.com/flakestorm/flakestorm.git
synced 2026-04-25 00:36:54 +02:00
Update version to 2.0.0 and enhance chaos engineering features in Flakestorm. Added support for environment chaos, behavioral contracts, and replay regression. Expanded documentation and improved scoring mechanisms. Updated .gitignore to include new documentation files.
This commit is contained in:
parent
59cca61f3c
commit
9c3450a75d
63 changed files with 4147 additions and 134 deletions
|
|
@ -15,7 +15,7 @@ This generates an `flakestorm.yaml` with sensible defaults. Customize it for you
|
|||
## Configuration Structure
|
||||
|
||||
```yaml
|
||||
version: "1.0"
|
||||
version: "1.0" # or "2.0" for chaos, contract, replay, scoring
|
||||
|
||||
agent:
|
||||
# Agent connection settings
|
||||
|
|
@ -39,6 +39,21 @@ advanced:
|
|||
# Advanced options
|
||||
```
|
||||
|
||||
### V2: Chaos, Contracts, Replay, and Scoring
|
||||
|
||||
With `version: "2.0"` you can add the three **chaos engineering pillars** and a unified score:
|
||||
|
||||
| Block | Purpose | Documentation |
|
||||
|-------|---------|---------------|
|
||||
| `chaos` | **Environment chaos** — Inject faults into tools, LLMs, and context (timeouts, errors, rate limits, context attacks). | [Environment Chaos](ENVIRONMENT_CHAOS.md) |
|
||||
| `contract` + `chaos_matrix` | **Behavioral contracts** — Named invariants verified across a matrix of chaos scenarios; produces a resilience score. | [Behavioral Contracts](BEHAVIORAL_CONTRACTS.md) |
|
||||
| `replays.sessions` | **Replay regression** — Import production failure sessions and replay them as deterministic tests. | [Replay Regression](REPLAY_REGRESSION.md) |
|
||||
| `scoring` | **Unified score** — Weights for mutation_robustness, chaos_resilience, contract_compliance, replay_regression (used by `flakestorm ci`). | See [README](../README.md) “Scores at a glance” |
|
||||
|
||||
**Context attacks** (chaos on tool/context, not the user prompt) are configured under `chaos.context_attacks`. See [Context Attacks](CONTEXT_ATTACKS.md).
|
||||
|
||||
All v1.0 options remain valid; v2.0 blocks are optional and additive.
|
||||
|
||||
---
|
||||
|
||||
## Agent Configuration
|
||||
|
|
@ -926,6 +941,22 @@ advanced:
|
|||
|
||||
---
|
||||
|
||||
## Scoring (V2)
|
||||
|
||||
When using `version: "2.0"` and running `flakestorm ci`, the **overall** score is a weighted combination of up to four components. Configure the weights so they sum to 1.0:
|
||||
|
||||
```yaml
|
||||
scoring:
|
||||
mutation: 0.25 # Weight for mutation robustness score
|
||||
chaos: 0.25 # Weight for chaos-only resilience score
|
||||
contract: 0.25 # Weight for contract compliance (resilience matrix)
|
||||
replay: 0.25 # Weight for replay regression (passed/total sessions)
|
||||
```
|
||||
|
||||
Only components that actually run are included; the overall score is the weighted average of the components that ran. See [README](../README.md) “Scores at a glance” and the pillar docs: [Environment Chaos](ENVIRONMENT_CHAOS.md), [Behavioral Contracts](BEHAVIORAL_CONTRACTS.md), [Replay Regression](REPLAY_REGRESSION.md).
|
||||
|
||||
---
|
||||
|
||||
## Environment Variables
|
||||
|
||||
Use `${VAR_NAME}` syntax to inject environment variables:
|
||||
|
|
|
|||
Loading…
Add table
Add a link
Reference in a new issue