mirror of
https://github.com/elicpeter/nyx.git
synced 2026-06-12 19:55:14 +02:00
# Taint analysis

Nyx tracks untrusted data from **sources** (where it enters the program) through assignments and function calls to **sinks** (where it's used dangerously). If the flow reaches a sink without passing a matching **sanitizer**, a finding fires.

The engine is a monotone forward dataflow analysis over a finite lattice, so its fixpoint iteration is guaranteed to terminate. It's flow-sensitive inside a function, and interprocedural across files via persisted per-function summaries.

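
To illustrate why a monotone analysis over a finite lattice terminates, here is a minimal sketch of a forward worklist fixpoint. The `State` bitsets, the `Stmt` and `Block` types, and `analyze` are hypothetical names chosen for this example; they are not Nyx's internal types, and the real engine is considerably richer.

```rust
use std::collections::VecDeque;

/// Taint state: one capability bitset per variable (hypothetical model).
type State = Vec<u8>;

#[derive(Clone, Copy)]
enum Stmt {
    /// `dst` becomes tainted with `caps`.
    Source { dst: usize, caps: u8 },
    /// `dst = src`: taint is copied.
    Assign { dst: usize, src: usize },
    /// Strip the `cap` bits from `var`.
    Sanitize { var: usize, cap: u8 },
}

struct Block {
    stmts: Vec<Stmt>,
    succs: Vec<usize>,
}

/// Monotone transfer function: a larger input never yields a smaller output.
fn transfer(block: &Block, input: &State) -> State {
    let mut out = input.clone();
    for stmt in &block.stmts {
        match *stmt {
            Stmt::Source { dst, caps } => out[dst] = caps,
            Stmt::Assign { dst, src } => out[dst] = out[src],
            Stmt::Sanitize { var, cap } => out[var] &= !cap,
        }
    }
    out
}

/// Forward worklist iteration; returns the entry state of every block.
fn analyze(blocks: &[Block], num_vars: usize) -> Vec<State> {
    let mut entry = vec![vec![0u8; num_vars]; blocks.len()];
    let mut work: VecDeque<usize> = (0..blocks.len()).collect();
    while let Some(b) = work.pop_front() {
        let out = transfer(&blocks[b], &entry[b]);
        for &s in &blocks[b].succs {
            let mut changed = false;
            for v in 0..num_vars {
                let joined = entry[s][v] | out[v]; // join = bitwise union
                if joined != entry[s][v] {
                    entry[s][v] = joined;
                    changed = true;
                }
            }
            // Entry states only ever grow and have finitely many bits,
            // so blocks can be re-queued only a bounded number of times.
            if changed && !work.contains(&s) {
                work.push_back(s);
            }
        }
    }
    entry
}

fn main() {
    // Block 0: x = source; block 1 (self-loop): y = x; block 2: sanitize y.
    let blocks = vec![
        Block { stmts: vec![Stmt::Source { dst: 0, caps: 0b11 }], succs: vec![1] },
        Block { stmts: vec![Stmt::Assign { dst: 1, src: 0 }], succs: vec![1, 2] },
        Block { stmts: vec![Stmt::Sanitize { var: 1, cap: 0b01 }], succs: vec![] },
    ];
    let entry = analyze(&blocks, 2);
    println!("entry state of block 2: {:?}", entry[2]);
}
```

The loop on block 1 is re-queued once, after which nothing changes and the worklist drains. That is the termination argument in miniature: each re-queue requires at least one bit to flip from 0 to 1, and there are only finitely many bits.
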
## Rule ID

```
taint-unsanitised-flow (source <line>:<col>)
```

One rule ID, parameterized by the source location. Suppressions can target either the base ID or the full string.

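
The two suppression targets can be pictured with a small matcher. This is only a sketch: `base_rule_id` and `suppression_matches` are hypothetical helpers, and how Nyx actually parses and matches suppressions is not described here.

```rust
/// Returns the base rule ID, i.e. everything before the " (source ...)" suffix.
fn base_rule_id(full: &str) -> &str {
    match full.find(" (source ") {
        Some(idx) => &full[..idx],
        None => full,
    }
}

/// A suppression matches if it names the full ID string or just the base ID.
fn suppression_matches(suppression: &str, finding_id: &str) -> bool {
    suppression == finding_id || suppression == base_rule_id(finding_id)
}

fn main() {
    let finding = "taint-unsanitised-flow (source 5:15)";
    // Base ID: suppresses every taint finding regardless of source location.
    println!("{}", suppression_matches("taint-unsanitised-flow", finding));
    // Full string: suppresses only the finding with this exact source.
    println!("{}", suppression_matches("taint-unsanitised-flow (source 5:15)", finding));
    // A different source location does not match.
    println!("{}", suppression_matches("taint-unsanitised-flow (source 9:1)", finding));
}
```

Targeting the full string is the narrower choice: it keeps other flows from different sources visible.
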
## What it detects

- User input flowing to shell execution: `req.body.cmd` → `child_process.exec`
- User input flowing to code evaluation: `req.query.code` → `eval`
- User input flowing to SQL: `request.args.get('id')` → `cursor.execute(f"... {id}")`
- Environment variables flowing to shell: `env::var("CMD")` → `Command::new("sh").arg("-c")`
- Request parameters flowing to HTML: `req.query.name` → `innerHTML`
- File contents flowing to privileged sinks: `fs::read_to_string` → `db.execute`
- Any other source-to-sink flow where the sink's required capability is not stripped along the way

## What it can't detect

- **Library calls without summaries.** If a callee has no summary (no source, binary-only dependency), Nyx treats it as neither propagating nor sanitizing. This is conservative for sanitization but lossy for propagation.
- **Deep pointer aliasing.** `let y = &x; sink(*y)` works through one level, but arbitrary chains of pointer arithmetic and aliased writes (`*p`, `p->field` in C/C++) are not tracked end-to-end. Function pointers and indirect calls resolve to no callee.
- **Implicit flows.** Taint follows explicit data, not branching signal. `if (secret) x = 1 else x = 0` does not taint `x`.
- **Globals and statics across functions.** Not tracked across function boundaries.

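
As a concrete picture of the implicit-flow limitation, the sketch below moves one bit of information from an environment variable into a command argument purely through control flow. The function name `leak_via_branch` and the `MODE` variable are made up for illustration; based on the description above, no taint finding would be expected for this code.

```rust
use std::env;
use std::process::Command;

/// Copies one bit of information out of `secret` through control flow only.
/// No assignment carries the data of `secret` into the returned value.
fn leak_via_branch(secret: &str) -> &'static str {
    if secret == "debug" {
        "-v"
    } else {
        "-q"
    }
}

fn main() {
    let secret = env::var("MODE").unwrap_or_default(); // source
    let flag = leak_via_branch(&secret); // control-dependent on `secret` only
    // `flag` is always one of two literals, so explicit data flow from the
    // source never reaches the argument passed here.
    let _ = Command::new("echo").arg(flag).output();
}
```
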
## Common false positives

| Scenario | Why | Mitigation |
|---|---|---|
| Custom sanitizer not recognised | Only built-in + configured sanitizers match | Add a custom sanitizer rule in config |
| Container holds mixed-typed items the engine cannot tell apart | A `vector<int>` of port numbers and a `vector<string>` of user input share the same store/load model | Sanitize the values on the way in (numeric parse / explicit validator) so the values themselves carry no cap, not just the container |
| Dead branches | Path-insensitive within a function | Constraint solving catches trivially infeasible combos; path-validated findings are scored lower |
| Library wrapper re-introduces taint | Wrapper opaque, or summary marks it as propagating | Summarize the wrapper explicitly or add it as a sanitizer |

## Common false negatives

| Scenario | Why |
|---|---|
| Third-party library on the path | No summary available, callee treated opaquely |
| Globals / statics across function boundaries | Not tracked |
| Some closure captures | Closure analysis is limited. JS/TS/Ruby/Go anonymous functions passed as callbacks *are* analyzed as separate scopes |
| Very deep cross-file chains | Summary approximation loses precision at depth |

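
The first row follows from how a missing summary is handled. Here is a rough sketch, assuming a summary records only whether taint propagates to the return value and which capability bits are cleared. `Summary` and `apply_call` are hypothetical; Nyx's persisted summary format is not shown in this document.

```rust
use std::collections::HashMap;

/// Hypothetical per-function summary: how a call affects taint.
#[derive(Clone, Copy)]
struct Summary {
    /// Whether argument taint flows to the return value.
    propagates: bool,
    /// Capability bits the callee strips from the value.
    clears: u8,
}

/// Returns the taint of the call result, given the taint of its argument.
fn apply_call(summaries: &HashMap<&str, Summary>, callee: &str, arg_taint: u8) -> u8 {
    match summaries.get(callee) {
        Some(s) if s.propagates => arg_taint & !s.clears,
        Some(_) => 0,
        // No summary: neither propagating nor sanitizing. The result is
        // untainted, so a real flow through this callee is missed.
        None => 0,
    }
}

fn main() {
    let mut summaries = HashMap::new();
    summaries.insert("trim", Summary { propagates: true, clears: 0 });
    summaries.insert("escape_html", Summary { propagates: true, clears: 0b01 });

    let tainted = 0b11;
    println!("trim:        {:#04b}", apply_call(&summaries, "trim", tainted));
    println!("escape_html: {:#04b}", apply_call(&summaries, "escape_html", tainted));
    println!("third_party: {:#04b}", apply_call(&summaries, "third_party", tainted));
}
```

The `None` arm is where the false negative comes from: the argument keeps its own taint, but nothing flows out of the call.
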
## Confidence signals

Higher confidence:
- Source + Sink both present in evidence with specific call locations.
- `source_kind: user_input` (direct attacker control).
- `path_validated: false`.
- No dominating guard on the path.
- Symex produced a witness string (rendered sink value visible in JSON/SARIF `evidence.symbolic.witness`).

Lower confidence:
- Path-validated taint (`path_validated: true`).
- Source is a database read or internal file (pre-validated at insertion is common).
- Engine note `ForwardBailed` / `PathWidened`. Use `--require-converged` to drop these in strict gates.

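
When triaging exported findings by hand, the signals above can be folded into a rough ordering. The sketch below is one way to do that. The `Finding` struct, the `triage_score` function, and the weights are assumptions made for this example; they are not Nyx's confidence model or its output schema, and only the field and note names quoted above come from this page.

```rust
/// Hypothetical, flattened view of the signals listed above.
struct Finding {
    source_kind: &'static str,
    path_validated: bool,
    has_witness: bool,
    engine_notes: Vec<&'static str>,
}

/// Illustrative score: higher means "look at this first".
/// The weights are arbitrary and exist only to order findings.
fn triage_score(f: &Finding) -> i32 {
    let mut score = 0;
    if f.source_kind == "user_input" {
        score += 2; // direct attacker control
    }
    if f.has_witness {
        score += 1; // symbolic execution produced a concrete sink value
    }
    if f.path_validated {
        score -= 1; // a validation check sits on the path
    }
    let unconverged = f
        .engine_notes
        .iter()
        .any(|n| *n == "ForwardBailed" || *n == "PathWidened");
    if unconverged {
        score -= 1; // the engine gave up precision on this path
    }
    score
}

fn main() {
    let strong = Finding {
        source_kind: "user_input",
        path_validated: false,
        has_witness: true,
        engine_notes: vec![],
    };
    let weak = Finding {
        source_kind: "placeholder_kind", // placeholder, not a real Nyx value
        path_validated: true,
        has_witness: false,
        engine_notes: vec!["PathWidened"],
    };
    println!("strong = {}, weak = {}", triage_score(&strong), triage_score(&weak));
}
```
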
## Tuning

### Custom sanitizer

```toml
# nyx.local
[[analysis.languages.javascript.rules]]
matchers = ["escapeHtml", "sanitizeInput"]
kind = "sanitizer"
cap = "html_escape"
```

Or: `nyx config add-rule --lang javascript --matcher escapeHtml --kind sanitizer --cap html_escape`.

### Filter by severity or confidence

```bash
nyx scan . --severity HIGH
nyx scan . --min-confidence medium
```

### Skip dataflow entirely

```bash
nyx scan . --mode ast
```

AST-only mode gives you structural pattern matches without taint.

In the browser UI, taint findings render as a numbered flow walk so you can see each hop the engine took:

<p align="center"><img src="../../assets/screenshots/docs/serve-finding-detail.png" alt="Nyx finding detail: HIGH taint-unsanitised-flow with numbered source → call → sink steps and How to fix guidance" width="900"/></p>

## Example

Rust:

```rust
use std::env;
use std::process::Command;

fn main() {
    let cmd = env::var("USER_CMD").unwrap();  // source
    Command::new("sh").arg("-c").arg(&cmd).output();  // sink
}
```

Finding:

```
[HIGH] taint-unsanitised-flow (source 5:15) src/main.rs:6:5
Unsanitised user input flows from env::var → Command::new
Source: env::var (5:15)
Sink: Command::new
```

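Safe rewrite: drop the shell and pass the value as argv directly (`Command::new(&cmd).output()`), so that no shell metacharacters are interpreted, or validate the value against an allowlist before passing it to the shell. The direct form still runs whatever program `USER_CMD` names, so prefer the allowlist when the environment is not trusted.
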
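
The allowlist option could look like the sketch below. The `allowlisted` function and the two permitted commands are invented for illustration. Whether Nyx recognizes this pattern as sanitizing on its own is not stated here, so a custom sanitizer rule for the validating function may still be needed.

```rust
use std::env;
use std::process::Command;

/// Maps an untrusted value onto a fixed set of known-good commands.
/// Returning a `&'static str` means the shell only ever sees our literals.
fn allowlisted(cmd: &str) -> Option<&'static str> {
    match cmd {
        "status" => Some("git status --short"),
        "version" => Some("git --version"),
        _ => None,
    }
}

fn main() {
    let requested = env::var("USER_CMD").unwrap_or_default(); // source
    match allowlisted(&requested) {
        Some(safe) => {
            // Only a compile-time literal reaches the shell.
            let _ = Command::new("sh").arg("-c").arg(safe).output();
        }
        None => eprintln!("refusing to run unrecognized command"),
    }
}
```

Mapping to literals, rather than checking the input and then reusing it, means the string that reaches `sh -c` never contains attacker-supplied bytes.
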
## Capabilities

Sources, sanitizers, and sinks are linked by named capabilities. A sanitizer only clears taint for the cap it declares. A sink only fires when the remaining taint still carries its required cap.

| Capability | Typical source | Typical sanitizer | Typical sink |
|---|---|---|---|
| `env_var` | `env::var`, `getenv`, `process.env` | | |
| `html_escape` | | `html.escape`, `DOMPurify.sanitize` | `innerHTML`, `document.write` |
| `shell_escape` | | `shlex.quote`, `shell_escape::escape` | `system`, `Command::new`, `eval` |
| `url_encode` | | `encodeURIComponent` | `location.href`, HTTP client URL arg |
| `json_parse` | | `JSON.parse` | |
| `file_io` | | `os.path.realpath`, `filepath.Clean` | `open`, `fs::read_to_string`, `send_file` |
| `fmt_string` | | | `printf(var)` |
| `sql_query` | | parameterized query binders | `cursor.execute`, `db.query` with concatenation |
| `deserialize` | | | `pickle.loads`, `yaml.load`, `Marshal.load` |
| `ssrf` | | URL-prefix locks | `requests.get`, `fetch`, `HttpClient.send` |
| `code_exec` | | | `eval`, `exec`, `Function` |
| `crypto` | | | weak-algorithm constructors |
| `unauthorized_id` | request-bound scoped IDs (Rust auth analysis) | ownership check | row-level write |
| `all` | Sources typically use `all` so they match any sink | | |

Sources typically use `cap = "all"` so they match every sink. Sinks declare the specific cap they need. Sanitizers only clear the cap they name.

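
One way to picture the capability rules is as a bitset, where a source sets every bit, a sanitizer clears one, and a sink tests one. The sketch below assumes that model. The constants and the `sanitize` / `sink_fires` helpers are hypothetical and only borrow the capability names from the table.

```rust
/// Hypothetical capability bits; names follow the table above.
const HTML_ESCAPE: u16 = 1 << 0;
const SHELL_ESCAPE: u16 = 1 << 1;
const SQL_QUERY: u16 = 1 << 2;
/// A source declared with `cap = "all"` carries every bit.
const ALL: u16 = u16::MAX;

/// A sanitizer clears only the capability it declares.
fn sanitize(taint: u16, cap: u16) -> u16 {
    taint & !cap
}

/// A sink fires when the remaining taint still carries its required cap.
fn sink_fires(taint: u16, required: u16) -> bool {
    (taint & required) != 0
}

fn main() {
    let tainted = ALL; // value read from a source
    let escaped = sanitize(tainted, HTML_ESCAPE); // e.g. after an HTML escaper

    println!("innerHTML sink fires: {}", sink_fires(escaped, HTML_ESCAPE));
    println!("shell sink fires:     {}", sink_fires(escaped, SHELL_ESCAPE));
    println!("SQL sink fires:       {}", sink_fires(escaped, SQL_QUERY));
}
```

The HTML-escaped value no longer triggers the `innerHTML` sink but still triggers the shell and SQL sinks, which is why a sanitizer rule has to name the cap that matches the sink it is meant to protect.
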