# Branches, Commits, Snapshots ## L1 — Lance per-dataset branches Lance supports branching at the dataset level: a branch is a named lineage of versions, and `fork_branch_from_state(source_branch, target_branch, source_version)` creates a copy-on-write fork. ## L2 — Graph-level branches OmniGraph builds *graph branches* on top by branching every sub-table coherently: - `branch_create(name)` / `branch_create_from(target, name)` — disallowed name `main`; fails if branch exists; ensures the schema-apply lock is idle. - `branch_list()` — returns public branches, **filters internal** `__run__…` and `__schema_apply_lock__` prefixes. - `branch_delete(name)` — refuses if there are descendants or active runs on the branch; cleans up owned per-branch fragments. - **Lazy forking**: a branch only forks a sub-table when that sub-table is first mutated on it. Pure-read branches share fragments with their source. - `sync_branch(branch)` — re-binds the in-memory handle to the latest head of the branch. ## L2 — Commit graph (`db/commit_graph.rs`) In-memory shape of a graph commit: ``` GraphCommit { graph_commit_id: ULID, manifest_branch: Option, manifest_version: u64, parent_commit_id: Option, merged_parent_commit_id: Option, // populated for merge commits actor_id: Option, // joined in-memory from _graph_commit_actors.lance, NOT a column on _graph_commits.lance created_at: i64 (microseconds since epoch), } ``` Storage is split across two Lance datasets (both with stable row IDs): - `_graph_commits.lance` — every column above *except* `actor_id`. - `_graph_commit_actors.lance` — optional separate `(graph_commit_id, actor_id)` map, created on demand. The `actor_id` field above is populated by joining this dataset in-memory at load time. Notes: - Every successful publish (load / change / merge / schema_apply / publish_run) appends one commit. - Merge commits have two parents; linear commits have one. - API: `list_commits(branch)`, `get_commit(id)`, `head_commit_id_for_branch(branch)`. ## L2 — Snapshots & time travel - `snapshot()` — current snapshot for the bound branch; cached. - `snapshot_of(target)` — snapshot at a `ReadTarget` (branch | snapshot id). - `snapshot_at_version(v: u64)` — historical snapshot from any manifest version. - `entity_at(table_key, id, version)` — single-entity time travel without building a full snapshot. - A `Snapshot` is a `(version, HashMap)` — cheap to build, snapshot-isolated cross-table reads. ## L2 — Internal system branches Filtered from `branch_list()` but visible to internals: - `__run__` — ephemeral isolation branch for a transactional run. - `__schema_apply_lock__` — serializes schema migrations.