4.3 KiB
Release Notes
Version 0.9.1 - 24 Mutation Types Update
🎯 Major Update: Comprehensive Mutation Coverage
Flakestorm now supports 24 mutation types for comprehensive robustness testing, expanding from the original 8 core types to cover advanced prompt-level attacks and system/network-level vulnerabilities.
✨ What's New
Expanded Mutation Types (24 Total)
Core Prompt-Level Attacks (8 types):
- Paraphrase - Semantic rewrites preserving intent
- Noise - Typos and spelling errors
- Tone Shift - Aggressive/impatient phrasing
- Prompt Injection - Basic adversarial attacks
- Encoding Attacks - Base64, Unicode, URL encoding
- Context Manipulation - Adding/removing/reordering context
- Length Extremes - Empty, minimal, or very long inputs
- Custom - User-defined mutation templates
Advanced Prompt-Level Attacks (7 new types):
- Multi-Turn Attack - Fake conversation history with contradictory turns
- Advanced Jailbreak - Sophisticated injection techniques (DAN, role-playing, hypothetical scenarios)
- Semantic Similarity Attack - Adversarial examples that look similar but have different meanings
- Format Poisoning - Structured data injection (JSON, XML, markdown, YAML)
- Language Mixing - Multilingual inputs, code-switching, mixed scripts
- Token Manipulation - Tokenizer edge cases, special tokens, boundary attacks
- Temporal Attack - Impossible dates, outdated references, temporal confusion
System/Network-Level Attacks (9 new types):
- HTTP Header Injection - Header manipulation and injection attacks
- Payload Size Attack - Extremely large payloads, memory exhaustion
- Content-Type Confusion - MIME type manipulation and format confusion
- Query Parameter Poisoning - Parameter pollution and query-based injection
- Request Method Attack - HTTP method confusion and manipulation
- Protocol-Level Attack - Request smuggling, chunked encoding, protocol confusion
- Resource Exhaustion - CPU/memory exhaustion, infinite loops, DoS patterns
- Concurrent Request Pattern - Race conditions, concurrent state manipulation
- Timeout Manipulation - Slow processing, timeout-inducing patterns
🔧 Improvements
- Comprehensive Testing Coverage: All 24 mutation types are fully implemented with templates and default weights
- Updated Documentation: README and Usage Guide now reflect all 24 mutation types
- Enhanced Test Suite: Test coverage expanded to validate all 24 mutation types
- Production Status: Updated development status to Production/Stable
📚 Documentation Updates
- README.md updated to reflect 24 mutation types with clear categorization
- Usage Guide includes detailed explanations of all mutation types
- Test suite (
tests/test_mutations.py) now validates all 24 types
🐛 Bug Fixes
- Fixed mutation type count inconsistencies in documentation
- Updated test assertions to cover all mutation types
📦 Technical Details
- All 24 mutation types have:
- Complete template definitions in
src/flakestorm/mutations/templates.py - Default weights configured in
src/flakestorm/mutations/types.py - Display names and descriptions
- Full test coverage
- Complete template definitions in
🚀 Migration Guide
No breaking changes. Existing configurations continue to work. The default mutation types remain the original 8 core types. To use the new advanced types, add them to your flakestorm.yaml:
mutations:
types:
- paraphrase
- noise
- tone_shift
- prompt_injection
- encoding_attacks
- context_manipulation
- length_extremes
- custom
# Add new types as needed:
- multi_turn_attack
- advanced_jailbreak
- semantic_similarity_attack
# ... and more
📊 Impact
This update significantly expands Flakestorm's ability to test agent robustness across:
- Security vulnerabilities (advanced jailbreaks, protocol attacks)
- Input parsing edge cases (format poisoning, token manipulation)
- System-level attacks (resource exhaustion, timeout manipulation)
- Internationalization (language mixing, character set handling)
🙏 Acknowledgments
Thank you to all contributors and users who have helped shape Flakestorm into a comprehensive chaos engineering tool for AI agents.
Full Changelog: See GitHub Releases for detailed commit history.