feat: merge ingest and scan

* docs: add CLI component reuse guidance * docs: add unified ingest ux design * Refine unified ingest UX design after adversarial review iteration 1 * Refine unified ingest UX design after adversarial review iteration 2 * Refine unified ingest UX design after adversarial review iteration 3 * feat(cli): route public connection ingest command * feat(cli): hide standalone scan from public help * feat(cli): plan public ingest depth and query history * feat(cli): execute public database ingest facets * feat(ingest): read connection query history config * fix(cli): use public ingest wording * fix(config): stop generating ingest adapter allow lists * docs: document public ingest command * test: align ingest surface expectations * docs: add unified ingest public CLI surface plan * feat(cli): preflight deep public ingest readiness * feat(setup): store query history in connection context * feat(setup): store database context depth * feat(setup): verify context readiness by database depth * fix(setup): keep context build foreground only * fix(config): reject reserved ingest connection ids * test: close unified ingest v1 expectations * docs: add unified ingest v1 closure plan * fix(ingest): bypass adapter allow-list for public source ingest * fix(ingest): honor query history window intent * fix(ingest): hide scan internals from public database ingest * feat(ingest): use foreground view for interactive public ingest * fix(setup): use schema context and query history wording * test(cli): verify unified ingest public output * docs: add unified ingest v1 public output closure plan * fix(setup): forward query history flags * fix(setup): prompt for postgres query history * fix(status): report query history readiness * fix(ingest): remove legacy public guidance * fix(ingest): polish foreground retry copy * docs(examples): use unified query history wording * chore(ingest): finish public query history cleanup * docs: add unified ingest v1 query history status cleanup plan * test(docs): cover unified ingest public docs * docs: align ingest CLI reference with unified UX * docs: update context build guides for unified ingest * docs: update setup and primary source ingest wording * docs: stop advertising adapter-backed example ingest * docs: close unified ingest public docs gaps * docs: add unified ingest v1 docs site closure plan * fix: render unified ingest foreground warnings * fix: explain query history schema order * fix: add public ingest retry guidance * fix: align setup next steps with unified ingest * fix: remove scan wording from demo progress * test: verify unified ingest ux closure * docs: add unified ingest v1 foreground and retry closure plan * fix(cli): preserve query-history pull config in public ingest * fix(cli): omit hidden commands from docs command tree * test(cli): close unified ingest final public surface checks * docs: add unified ingest v1 final public surface closure plan * fix(cli): use public source labels in ingest reports * fix(cli): suppress low-level public ingest output * test(cli): verify unified ingest public plain output * docs: add unified ingest v1 public plain output closure plan * fix(cli): add public ingest copy sanitizers * fix(cli): sanitize public ingest progress copy * fix(cli): rename setup schema scope prompt * docs(plan): add progress copy closure; test: align setup back-nav fixture Adds the iter9 plan and updates the setup back-navigation test fixture to pass disableQueryHistory plus listSchemas/listTables stubs that the unified ingest setup step now requires. * docs(plan): add final ux labels plan with narrowed label scans * fix(cli): aggregate unsupported query-history warnings * fix(cli): align setup database labels * test(cli): fix setup database test type-check * fix(cli): remove primary-source wording from setup output * test(cli): verify unified ingest setup closure * docs(plan): add unified ingest v1 verification copy closure plan * fix(cli): remove top-level scan command * fix(cli): remove legacy ingest and wiki commands * Merge scan into ingest flow * feat(cli): split ingest progress into per-phase rows, rename work units to tasks Each database target in the unified ingest dashboard now renders one row per real subprocess (Schema, then Query history when enabled) instead of a single combined bar. Each phase has its own monotonic 0-100% bar so the progress never snaps back to zero when historic-sql starts after scan completes. Completed phases keep their final bar, summary, and elapsed time visible as an inline audit trail; queued and skipped phases are shown explicitly. Also rename user-facing "work units" / "Failed work units" to "tasks" / "Failed tasks" in ingest output and parseIngestSummary. The parser still accepts the legacy "Work units:" wording in captured output for backward compat. Internal memory-flow event names and type fields are left alone. * Fix test harness failures * Fix CI smoke checks --------- Co-authored-by: Andrey Avtomonov <7889985+andreybavt@users.noreply.github.com>
2026-07-01 08:59:39 +02:00 · 2026-05-14 01:43:06 +02:00 · 2026-05-14 01:43:06 +02:00 · b00c1a11a9
commit b00c1a11a9
parent 1a472cf3ed
118 changed files with 16890 additions and 2992 deletions
--- a/docs/superpowers/specs/2026-05-13-unified-ingest-ux-design.md
+++ b/docs/superpowers/specs/2026-05-13-unified-ingest-ux-design.md
@ -0,0 +1,593 @@
+# Unified Ingest UX Design
+
+**Date:** 2026-05-13
+**Author:** Andrey Avtomonov
+**Status:** Design — pending implementation plan
+
+## Background
+
+KTX currently exposes multiple user-facing ideas for one product action:
+building context from configured connections. Database connections use
+`ktx scan <connectionId>`, source connections use
+`ktx ingest run --connection-id <id> --adapter <adapter>`, and setup uses a
+context-build wrapper that plans database scans before source ingestion.
+
+The implementation already points toward one concept. `ktx scan` runs a
+stage-only ingest with the `live-database` adapter, then writes scan-specific
+reports, schema manifests, and enrichment artifacts. `ktx setup` already
+builds context from all configured connections by routing database connections
+to scan internals and source connections to source-ingest internals.
+
+The user-facing model must become simpler:
+
+- Setup configures KTX.
+- Ingest builds or refreshes context.
+- Status explains readiness.
+
+`scan`, `live-database`, and adapter selection are implementation details.
+
+## Goals
+
+The redesign makes `ktx ingest` the single public context-building command and
+keeps the foreground experience rich, clear, and robust.
+
+- Remove `ktx scan` as a normal external verb.
+- Remove `live-database` from user-facing CLI help, output, docs, and
+  `ktx.yaml`.
+- Treat database schema ingest as mandatory baseline behavior for database
+  connections.
+- Keep slow AI-heavy database behavior explicit with `--deep`; keep fast,
+  deterministic behavior explicit with `--fast`.
+- Fold query-history ingestion into database connection ingest as an optional
+  facet.
+- Keep `ktx setup` guided. It stores defaults in `ktx.yaml` and uses the same
+  foreground context-build engine as `ktx ingest`.
+- Remove detach, attach, watch, resume, stop, and background context-build
+  flows.
+- Preserve a polished foreground progress view for TTY users and scriptable
+  output for non-TTY and JSON users.
+
+## Non-goals
+
+This spec does not redesign the semantic-layer YAML format, the ingest bundle
+agent loop, or warehouse verification tools.
+
+- Do not remove the internal scan implementation if it remains the cleanest
+  module boundary.
+- Do not remove internal adapter/source keys in one large rename. User-facing
+  terminology changes first; internal cleanup can follow where it reduces
+  complexity.
+- Do not make query-history ingestion mandatory.
+- Do not make AI enrichment mandatory for database connections.
+- Do not add `--fast` or `--deep` to top-level `ktx setup`.
+- Do not preserve compatibility shims for old public `scan` or
+  `ingest run --adapter live-database` usage unless an implementation plan
+  explicitly chooses a short deprecation window.
+
+## Public command model
+
+`ktx ingest` becomes the direct command for building context from one
+connection or all configured connections.
+
+```bash
+ktx ingest warehouse
+ktx ingest warehouse --fast
+ktx ingest warehouse --deep
+ktx ingest warehouse --deep --query-history
+ktx ingest warehouse --no-query-history
+ktx ingest notion
+ktx ingest --all
+ktx ingest --all --deep
+```
+
+The command dispatches by connection driver:
+
+- Database drivers run database ingest.
+- Source drivers run source ingest.
+- `--all` runs database ingest targets first, then source ingest targets.
+
+The old `ktx ingest run --connection-id <id> --adapter <adapter>` command is
+removed from the public interface. Normal users configure and ingest
+connections, not adapters.
+
+`ktx scan` is no longer a documented public command. Database schema scanning
+continues as an internal phase of database ingest.
+
+Stored report inspection is separate from live context-build control. The
+public `ktx ingest` namespace has no subcommands, so `run`, `status`, `watch`,
+and `replay` are ordinary connection IDs:
+
+```bash
+ktx ingest run
+ktx ingest status
+ktx ingest watch
+ktx ingest replay
+```
+
+No setup or config validation rejects those names. Old adapter-backed command
+shapes such as `ktx ingest run --connection-id warehouse --adapter
+live-database` fail through normal option parsing because `--connection-id` and
+`--adapter` are not public `ktx ingest` options.
+
+## Database ingest depth
+
+Database ingest always includes a schema baseline. The depth controls how much
+extra work KTX may perform.
+
+Depth is the public abstraction over the current scan engine:
+
+- `fast` maps to `KtxScanMode: structural` with `detectRelationships: false`.
+- `deep` maps to `KtxScanMode: enriched` and requests relationship detection.
+- The internal `relationships` scan mode remains an advanced implementation
+  detail. It is not a separate public depth in this v1.
+
+Deep mode includes relationship discovery when the project's
+`scan.relationships.enabled` setting is true. Relationship validation thresholds
+and budgets remain governed by the existing internal `scan.relationships`
+configuration; users do not get a separate public relationship flag in this
+surface. If `scan.relationships.enabled` is false, `--deep` still runs enriched
+database ingest but relationship discovery remains disabled.
+
+### Fast
+
+`--fast` means KTX builds deterministic schema context quickly.
+
+- No LLM calls.
+- No embeddings.
+- No AI-generated descriptions.
+- No expensive relationship discovery that depends on sampling, read-only SQL,
+  or model calls.
+- Introspect tables, columns, native types, comments, declared primary keys,
+  and declared foreign keys when the connector can read them.
+- Write or update database schema context that agents can use as grounding.
+- Do not run query-history synthesis, because the current query-history path
+  uses ingest work units and model-backed synthesis.
+
+This is the safe default for new database connections, CI, smoke tests, and
+large unknown warehouses.
+
+### Deep
+
+`--deep` means KTX builds richer database context through the enriched scan path
+and uses slower capabilities.
+
+- Requires LLM, embedding, and scan-enrichment readiness before work starts.
+- Generates table and column descriptions.
+- Generates embeddings.
+- May sample or query data through read-only connector capabilities.
+- Discovers and validates relationships when relationship discovery is enabled.
+- May process query history into usage patterns when query history is enabled.
+
+Deep mode is the best agent-readiness mode, but it can take longer and can
+require model, embedding, and database permissions.
+
+KTX must not silently downgrade an explicit or stored `deep` request to `fast`.
+For a single database target, if the project is missing the model, embedding, or
+scan-enrichment configuration required for deep ingest, KTX errors before
+starting the run and tells the user to run `ktx setup` or rerun with `--fast`.
+For `--all`, deep-readiness failures follow the per-target rule in
+**Error handling and warnings**.
+
+### Flag rules
+
+`--fast` and `--deep` are mutually exclusive. Passing both is an error.
+
+When neither flag is passed, `ktx ingest` uses the stored connection default.
+If no default exists, database connections use `fast`.
+
+If a depth flag is passed for a non-database source, KTX prints a warning and
+continues:
+
+```text
+--deep affects database ingest only; ignoring it for notion.
+```
+
+For `--all`, KTX aggregates warnings instead of repeating noisy lines:
+
+```text
+--deep ignored for 2 non-database sources.
+```
+
+## Query history
+
+Historic SQL becomes the database connection's query-history facet. The term
+`historic-sql` remains an internal source key unless a later cleanup renames
+it.
+
+Query history is optional because it can require extra grants and can expose
+sensitive SQL text. Setup asks about it only for database drivers that support
+it.
+
+```bash
+ktx ingest warehouse --query-history
+ktx ingest warehouse --no-query-history
+ktx ingest warehouse --query-history-window-days 30
+```
+
+Query-history flags apply only to database connections that support the feature.
+In v1, supported query-history drivers are `postgres` or `postgresql`,
+`bigquery`, and `snowflake`. They map to the existing historic-SQL dialects
+`postgres`, `bigquery`, and `snowflake`. `sqlite`, `mysql`, `clickhouse`, and
+`sqlserver` are database ingest targets but do not support query history in v1.
+
+Non-applicable query-history flags produce warnings and continue when the target
+can otherwise be ingested. For a single unsupported database target,
+`--query-history` or `--query-history-window-days` runs schema ingest, skips the
+query-history facet, and prints a warning. For `--all`, KTX aggregates those
+warnings and continues other eligible targets. Stored
+`connections.<id>.context.queryHistory.enabled: true` on an unsupported driver
+is a config warning and is skipped for that driver; it must not abort schema
+ingest for that target.
+
+Query history uses schema context as grounding. KTX must run the database
+schema facet before query-history processing in the same ingest run. If a user
+explicitly enables query history for a run, the output states that schema
+ingest runs first.
+
+Because query-history synthesis is model-backed in the current architecture,
+`--query-history` upgrades the effective database depth to deep for that run.
+KTX prints a warning when a user combines `--fast` with `--query-history`:
+
+```text
+--query-history requires deep ingest; running warehouse with --deep.
+```
+
+Stored `connections.<id>.context.queryHistory.enabled: true` has the same
+depth requirement. When no explicit depth flag is passed, stored query-history
+enablement upgrades the effective database depth to `deep` for that run. When a
+user explicitly passes `--fast` and does not pass `--query-history`, KTX honors
+the explicit fast request, skips stored query-history processing for that run,
+does not modify `ktx.yaml`, and prints a warning:
+
+```text
+warehouse has query history enabled in ktx.yaml, but --fast skips query-history processing.
+```
+
+`--query-history-window-days <n>` overrides
+`connections.<id>.context.queryHistory.windowDays` only for the current run. It
+must not rewrite `ktx.yaml`. The effective value flows into the same
+`historicSqlUnifiedPullConfigSchema.windowDays` field used by the current
+historic-SQL pull path.
+
+## Configuration model
+
+User-authored `ktx.yaml` becomes connection-centric. Database schema ingest is
+implied by the database connection and no longer appears as an ingest adapter.
+
+```yaml
+connections:
+  warehouse:
+    driver: postgres
+    readonly: true
+    context:
+      depth: fast
+      queryHistory:
+        enabled: false
+
+  notion:
+    driver: notion
+    context:
+      enabled: true
+```
+
+Deep database defaults and query history use the same connection-local shape:
+
+```yaml
+connections:
+  warehouse:
+    driver: postgres
+    readonly: true
+    context:
+      depth: deep
+      queryHistory:
+        enabled: true
+        windowDays: 90
+        minExecutions: 5
+        filters:
+          dropTrivialProbes: true
+          serviceAccounts:
+            mode: exclude
+            patterns:
+              - "^svc_"
+        redactionPatterns: []
+```
+
+`context.queryHistory` is the canonical user-facing shape. Runtime code maps it
+to the existing historic-SQL pull config as follows:
+
+- `dialect` is derived from the database driver (`postgres` or `postgresql`,
+  `bigquery`, or `snowflake`) and is not normally user-authored.
+- `windowDays`, `minExecutions`, and `redactionPatterns` copy through directly.
+- `filters.dropTrivialProbes` defaults to `true`.
+- `filters.serviceAccounts.patterns` and `filters.serviceAccounts.mode` map to
+  the existing service-account filter fields. The default mode is `exclude`.
+- `concurrency`, `staleArchiveAfterDays`,
+  `filters.orchestrators.mode`, and `filters.dropFailedBelow` are advanced
+  query-history fields. When present, they map directly to the same fields in
+  `historicSqlUnifiedPullConfigSchema`. When absent, KTX uses the existing
+  historic-SQL schema defaults and omitted-field behavior.
+
+Existing `connection.historicSql` blocks are legacy cutover input. Setup or the
+explicit config rewrite path must migrate them into
+`connection.context.queryHistory` while preserving all mapped query-history
+fields, including the advanced fields listed above. `ktx ingest` must not
+rewrite `ktx.yaml`; it may read legacy `historicSql` blocks for the current run
+and emit a cleanup warning. If both `context.queryHistory` and `historicSql` are
+present, `context.queryHistory` wins and KTX emits a config-cleanup warning
+instead of running both.
+
+Config migration must be idempotent. A setup or explicit rewrite pass that
+migrates a connection removes the legacy `connection.historicSql` block after
+copying preserved fields, does not regenerate normal `ingest.adapters` entries,
+and produces the same `ktx.yaml` on repeated runs. If `ktx ingest` sees a legacy
+block before cleanup, the warning may repeat because ingest is config-read-only.
+
+`ingest.adapters` is no longer normal user config. Existing `ingest.adapters`
+entries load as advanced/internal overrides during the transition, but
+public `ktx ingest <connectionId>` must not fail solely because the
+driver-to-adapter mapping chooses an adapter missing from that list. The rule
+applies to database internals (`live-database` and `historic-sql`) and to all
+source adapters selected from configured drivers, including `notion`, `dbt`,
+`metabase`, `looker`, `metricflow`, and `lookml`.
+
+The implementation can satisfy this by bypassing the adapter allow-list for
+connection-centric public ingest, or by synthesizing the adapters required by
+configured connections before dispatch. The old adapter-backed advanced command
+may continue to honor `ingest.adapters` while it exists. Normal generated
+`ktx.yaml` must not include `live-database`, `historic-sql`, or source adapter
+entries just to make public `ktx ingest <connectionId>` work.
+
+## Setup flow
+
+`ktx setup` remains a guided configuration flow. It does not expose
+`ktx setup --fast` or `ktx setup --deep`.
+
+During interactive setup, KTX asks for database context depth when a database
+connection is configured or when setup reaches the context-build step:
+
+```text
+How much database context should KTX build?
+
+Fast: schema only, no AI, quickest
+Deep: AI descriptions, embeddings, relationships, slower
+```
+
+The recommended selection depends on readiness:
+
+- Recommend Fast when model, embedding, or scan-enrichment configuration is
+  missing.
+- Recommend Deep when model, embedding, and scan-enrichment configuration are
+  ready.
+
+The recommendation is based on the final configuration produced by the current
+setup run, not on an earlier intermediate state. Setup must either ask the depth
+question after the model, embedding, and scan-enrichment setup paths complete,
+or defer or repeat the depth prompt before the foreground context build starts
+when those capabilities are configured later in the same setup run.
+
+Setup stores the chosen default in `connections.<id>.context.depth`. The
+foreground context build uses that stored default. Setup can still expose a
+non-prominent automation flag later, such as `--context-depth fast`, if
+headless setup needs it, but the main product surface is guided.
+
+Setup readiness is depth-aware:
+
+- For `fast`, a database context is ready when the latest non-dry-run
+  structural scan for the connection completed and wrote schema manifest shards.
+  Model, embedding, description-enrichment, and scan-enrichment checks are
+  skipped for fast contexts.
+- For `deep`, a database context is ready only when the enriched scan completed
+  table descriptions, column descriptions, embeddings, and schema manifest
+  shards. When relationship discovery is enabled, readiness requires the
+  relationship stage to have completed for the latest enriched scan. A
+  completed relationship stage with zero accepted, review, rejected, or skipped
+  relationships still counts as ready; readiness must not require non-empty
+  relationship artifacts or accepted relationships. If relationship discovery is
+  disabled, the relationship stage is not part of the readiness gate.
+
+The missing-input gate uses the same rule. Missing model, embedding, or
+scan-enrichment configuration must not block a user who selected `fast`. The
+same missing inputs must block `deep` before the foreground build starts, with a
+message that offers `fast` as the no-AI path.
+
+## Foreground progress UX
+
+KTX keeps a rich foreground progress view. It removes detach and background
+execution.
+
+The shared build view groups work by user-facing source type:
+
+```text
+Building KTX context  (2/4 · 1m 12s)
+───────────────────────────────────
+
+Databases
+  ✓ warehouse      42 tables · 6 changed · relationships found
+  ⠹ billing        reading schema · 18/64 tables
+
+Context sources
+  ✓ dbt            18 models · 42 metrics
+  ○ notion         queued
+
+Warnings
+  --deep ignored for notion; it only applies to database connections.
+```
+
+The view must not show `scan` or `live-database` in normal mode. It uses:
+
+- `Databases` instead of `Primary sources`.
+- `Context sources` for docs, BI, metrics, and modeling sources.
+- `reading schema` or `building schema context` instead of `scanning`.
+- `query history` or `usage patterns` instead of `historic-sql`.
+
+Non-TTY output remains append-only and scriptable. `--json` returns structured
+results. Routine artifact paths and internal adapter names appear only in
+`--debug` or JSON output.
+
+## Removing detach and watch
+
+The context build is foreground only.
+
+- `Ctrl+C` stops the current run.
+- KTX records interrupted or failed state where useful for status reporting.
+- Rerunning `ktx setup` or `ktx ingest` starts a fresh foreground build or
+  reuses existing completed artifacts when safe.
+
+Remove these user-facing concepts from context build:
+
+- detach
+- attach
+- watch
+- resume
+- stop
+- background context-build subprocesses
+- prompts that offer "Watch progress"
+- hints such as `d to detach`
+
+Existing `running` or `detached` state from older versions must be treated as
+stale or interrupted with a clear rerun instruction.
+
+`.ktx/setup/context-build.json` remains only as a foreground status cache, not a
+background control plane. New writes may use `not_started`, `running`,
+`completed`, `failed`, `interrupted`, or `stale`. `running` means the current
+foreground process is active; a later setup process that finds a leftover
+`running` record from an older process must mark it `stale` or `interrupted`
+before offering a fresh run. `detached` and `paused` are legacy-only statuses
+and must be normalized to `stale` or `interrupted` on read or on the next setup
+write.
+
+The state file must not keep user-facing `watch`, `resume`, or `stop` command
+affordances after this redesign. It may retain run ids, report ids, artifact
+paths, source progress, failure details, and a retry/build command when those
+help status reporting.
+
+## Internal naming and migration
+
+User-facing surfaces must stop saying `live-database`.
+
+This includes:
+
+- CLI help.
+- Normal command output.
+- Setup prompts.
+- Generated `ktx.yaml`.
+- README quickstart and examples.
+- Friendly errors and warnings.
+
+Internal paths and source keys can keep `live-database` during the first
+implementation if renaming them would add risk. Debug output and JSON may
+include internal names when they are necessary for troubleshooting.
+
+The implementation plan must also update stale command suggestions. For
+example, setup source recovery must no longer tell users to run
+`ktx ingest run --connection-id ... --adapter <adapter>`. It must suggest the
+new connection-centric command:
+
+```bash
+ktx ingest <connectionId>
+```
+
+## Error handling and warnings
+
+Warnings are non-fatal when KTX can still perform the requested ingest.
+
+- Ignored depth flag on a non-database source: warn and continue.
+- Ignored query-history flag on an unsupported database: warn and continue if
+  schema ingest can run.
+- Both `--fast` and `--deep`: error before any work starts.
+- Explicit or stored `deep` without required model, embedding, or
+  scan-enrichment readiness: error before any work starts for that target.
+- `--query-history` without required model, embedding, or scan-enrichment
+  readiness: error before any work starts for that target because query history
+  upgrades the run to `deep`.
+- Query-history requested without required grants: fail that query-history
+  facet and keep schema results when schema ingest succeeded.
+- Database schema ingest failure: fail that database target.
+
+`--all` isolates target failures. It runs all database targets first, then all
+source targets, even when one or more database targets fail. Source targets may
+therefore run against previously completed database context if the current
+database refresh failed. The final exit code is non-zero when any target or
+required facet fails, and the summary identifies partial failures by
+connection.
+
+For `--all`, readiness is evaluated per target after resolving each target's
+effective depth and query-history settings. A database target whose effective
+run requires deep readiness but lacks model, embedding, or scan-enrichment
+configuration fails before work starts for that target; eligible database and
+source targets still run. Command-level errors that make target planning
+impossible, such as mutually exclusive flags, an unreadable project config, or
+no eligible targets, still abort before any target work starts.
+
+Failure messages focus on the connection and user action:
+
+```text
+warehouse failed: connection refused.
+Retry: ktx ingest warehouse --deep
+```
+
+They do not mention internal adapter names unless debug output is enabled.
+
+## Acceptance criteria
+
+The implementation is complete when these conditions hold:
+
+- `ktx ingest <connectionId>` works for database and source connections.
+- `ktx ingest --all` runs database targets before source targets.
+- `ktx ingest <connectionId>` does not require `ingest.adapters` entries for
+  any adapter chosen from the configured connection driver.
+- Connection ids that collide with surviving `ktx ingest` subcommands are
+  rejected during setup or config validation.
+- `--fast` and `--deep` control database depth and are mutually exclusive.
+- `--fast` maps to structural database ingest without relationship detection.
+- `--deep` maps to enriched database ingest with relationship detection when
+  `scan.relationships.enabled` is true.
+- `--deep` and `--query-history` fail before work starts when required model,
+  embedding, or scan-enrichment configuration is missing.
+- `ktx ingest --all` continues independent targets after partial failures and
+  exits non-zero when any target or required facet fails.
+- `ktx ingest --all` treats deep-readiness failures as per-target failures
+  after target planning, rather than aborting eligible independent targets.
+- `ktx setup` stores a database context depth without exposing top-level
+  `--fast` or `--deep`.
+- `ktx setup` bases the recommended/default database context depth on the final
+  model, embedding, and scan-enrichment readiness reached by the setup run.
+- `ktx setup` treats fast database context as ready after completed structural
+  schema ingest and does not require AI descriptions or embeddings for fast.
+- Generated `ktx.yaml` does not include `live-database` for normal projects.
+- Generated `ktx.yaml` uses `connections.<id>.context.queryHistory`, not
+  `connections.<id>.historicSql`, for query-history configuration.
+- Normal CLI help and output do not mention `live-database`.
+- Normal CLI help and output do not present `scan` as a public verb.
+- Normal CLI help and output do not present `ktx ingest watch` as live context
+  build control.
+- Query history is optional, connection-local, and overridable per ingest run.
+- Query history is supported only for `postgres` or `postgresql`, `bigquery`,
+  and `snowflake` in v1; unsupported database drivers warn and skip the
+  query-history facet without blocking schema ingest.
+- Stored query-history enablement upgrades default database ingest to deep, but
+  explicit `--fast` skips stored query history for that run with a warning.
+- `--query-history-window-days` overrides the effective historic-SQL
+  `windowDays` pull config for the current run only and does not rewrite
+  `ktx.yaml`.
+- Legacy `connection.historicSql` migration is idempotent, preserves all mapped
+  query-history fields, and is performed by setup or an explicit config rewrite,
+  not by `ktx ingest`.
+- Context build has no detach, attach, watch, resume, stop, or background
+  execution path.
+- `.ktx/setup/context-build.json` is retained only as foreground status cache
+  state; legacy `detached` or `paused` records do not trigger background
+  recovery branches.
+- Existing setup context progress UX is consolidated with `ktx ingest` rather
+  than duplicated.
+- Non-TTY and JSON output remain suitable for scripts.
+
+## Open implementation questions
+
+The implementation plan must decide these lower-level details:
+
+- Whether old `ktx scan` exits with an error, is hidden, or remains as a
+  temporary undocumented debug command.
+- Whether internal artifact paths keep `raw-sources/<connection>/live-database`
+  for the first implementation.
+- Whether setup needs a headless `--context-depth fast|deep` flag for CI.