mirror of https://github.com/ModernRelay/omnigraph.git synced 2026-06-09 01:35:18 +02:00

Lakehouse-native graph engine with git-style workflows https://omnigraph.dev


Claude 0d72cc69fb	research: restructure lance-autoresearch as multi-target workspace	2026-05-15 00:15:02 +00:00

The original lance-autoresearch was one Cargo crate optimizing one Lance kernel (PQ L2 distance). With 9+ candidate targets enumerated in the research note, a single-crate shape doesn't scale: per-target deps will collide, the agent's edits to one target's kernels.rs would conflict with another's lib path, and build/test isolation is lost.

Restructure into a Cargo workspace. Layout:

research/lance-autoresearch/
├── Cargo.toml              (workspace root)
├── README.md               (target table, contract overview, repo layout)
├── HARNESS.md              (universal loop contract every target inherits)
├── crates/
│   ├── harness-common/     (shared: SplitMix64, geomean, peak RSS,
│   │                        MAX_ABS_ERR, TOPK_DIST_TOL, TIME_BUDGET_SECS)
│   └── pq-l2/              (the landed target; was the previous single crate)
└── docs/
    ├── design.md           (rationale for workspace shape, no Target trait)
    ├── adding-a-target.md  (step-by-step workflow for new targets)
    └── targets/pq-l2.md    (per-target capsule)

Decisions documented in docs/design.md:
- Workspace, not single crate: per-target Cargo.toml so deps don't collide; per-target src tree so agent edits don't conflict; per-target build/test isolation for faster agent iteration.
- harness-common as a plumbing-only crate (PRNG, geomean, peak RSS, tolerance constants, time budget). Intentionally NO Target trait - decode kernel signatures and distance kernel signatures differ enough that a unifying trait would either bloat or require erased boxing. Each target is its own natural shape.
- Per-target program.md + shared HARNESS.md: the loop contract is universal, the priors and API spec are per-target. Two files instead of one because copy-pasting the universal loop into every program.md would drift.

pq-l2 refactor:
- src/* moved into crates/pq-l2/src/* via git mv (preserves history)
- crate renamed lance-autoresearch -> pq-l2
- SplitMix64, geomean, peak_rss_mb, MAX_ABS_ERR, TOPK_DIST_TOL, TIME_BUDGET_SECS now imported from harness-common (drops ~70 lines of duplication that would have been copy-pasted into every new target)
- program.md trimmed: setup/loop/hygiene moved to HARNESS.md; only the PQ-L2-specific API contract and SIMD priors remain
- Cargo.toml depends on harness-common via path; workspace.dependencies pins criterion uniformly across targets

The 9 candidate targets from the research note (A1 cosine/dot/hamming, A2 IVF partition select, A3 FTS BM25, A4 bitpack decode, A5 dictionary decode, A6 FSST decode, A7 take/gather, A8 predicate eval, A9 posting list intersect, A10 top-K merge) are listed in README.md's target table as "candidate"; each gets a docs/targets/<name>.md capsule when it's spun up. docs/adding-a-target.md documents the cp -r + edit-Cargo.toml + rewrite-three-files workflow.

Verified end-to-end:
- cargo build --release: clean, both crates compile
- cargo clippy --release --workspace --all-targets -- -D warnings: clean
- cargo test --release --workspace: 6/6 pass (4 harness-common + 2 pq-l2)
- cargo run --release --bin run_experiment -p pq-l2: correctness pass, geomean ~880k ns, exit 0, ~30s wall-clock
- omnigraph parent workspace unchanged (research/ excluded as before)

https://claude.ai/code/session_01Aq8kBUcjmEPobcEufnWbW5

.cargo	Raise LANCE_MEM_POOL_SIZE to 1 GB in .cargo/config.toml	2026-04-19 22:27:49 +03:00
.context	Investigate Lance MergeInsertBuilder CAS granularity (MR-766 prereq)	2026-04-28 23:30:17 +00:00
.github	branch-protection: declarative policy + apply script (#89)	2026-05-13 17:38:20 +03:00
crates	schema-lint chassis v0: code-tagged diagnostics (MR-694) (#87)	2026-05-13 17:08:18 +03:00
docker	Initial public Omnigraph repository	2026-04-10 20:49:41 +03:00
docs	docs/research: expand Cluster A with non-distance autoresearch targets	2026-05-14 23:41:40 +00:00
research/lance-autoresearch	research: restructure lance-autoresearch as multi-target workspace	2026-05-15 00:15:02 +00:00
scripts	branch-protection: declarative policy + apply script (#89)	2026-05-13 17:38:20 +03:00
.dockerignore	Initial public Omnigraph repository	2026-04-10 20:49:41 +03:00
.gitignore	chore: gitignore the mdrip/ markdown snapshot cache	2026-05-12 17:02:14 -07:00
AGENTS.md	branch-protection: declarative policy + apply script (#89)	2026-05-13 17:38:20 +03:00
Cargo.lock	release: prepare omnigraph 0.4.2	2026-05-10 14:02:28 +00:00
Cargo.toml	research: lance-autoresearch — PQ L2 kernel autoresearch harness	2026-05-14 22:38:39 +00:00
CLAUDE.md	Add AGENTS.md as canonical agent guide; symlink CLAUDE.md to it	2026-04-28 23:10:09 +02:00
CODE_OF_CONDUCT.md	Initial public Omnigraph repository	2026-04-10 20:49:41 +03:00
CONTRIBUTING.md	Merge remote-tracking branch 'origin/main' into ragnorc/explore-api	2026-04-18 20:24:39 +02:00
Dockerfile	Dockerfile: switch base from Docker Hub to ECR Public	2026-04-20 13:46:23 +03:00
LICENSE	Initial public Omnigraph repository	2026-04-10 20:49:41 +03:00
og-cheet-sheet.md	Add query lint and check commands	2026-04-13 00:37:44 +03:00
omnigraph.example.yaml	example config: use graphs / cli.graph, matching the MR-603 rename	2026-04-18 23:40:35 +03:00
openapi.json	release: prepare omnigraph 0.4.2	2026-05-10 14:02:28 +00:00
README.md	Update README.md	2026-05-12 08:17:31 -07:00
rust-toolchain.toml	Initial public Omnigraph repository	2026-04-10 20:49:41 +03:00
SECURITY.md	Initial public Omnigraph repository	2026-04-10 20:49:41 +03:00

README.md

Omnigraph

Lakehouse-native graph engine with git-style workflows.

Branch, commit, and merge typed graph data like source code. Multi-modal, self-hosted, open source.

Built on Rust, Arrow, DataFusion and Lance.

Join the Omnigraph Slack community

Use Cases

Company brains
Context graphs
Backbone for multi-agent research
Incident response graphs
Compliance & audit graphs
Enterprise knowledge systems

Capabilities

Typed schema, typed queries, and typed mutations
Schema-as-code, query validation and linting
Git-style graph workflows: branches, commits, merges, and transactional runs
Local, on-prem & cloud S3-native storage with snapshot-pinned reads
Graph traversal + text, fuzzy, BM25, vector, and RRF search in one runtime
Policy-as-code for server-side access control
Single CLI for multiple deployments
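
The RRF entry in the list above refers to reciprocal rank fusion, a way of merging several ranked result lists (for example one from BM25 and one from vector search) into a single ranking. The sketch below is the generic textbook formulation, not Omnigraph's implementation: the function name `rrf_fuse`, the string ids, and the damping constant `k` (60 is a commonly used value) are all assumptions made for illustration.

```rust
use std::collections::HashMap;

/// Hypothetical helper: fuse ranked lists of document ids with
/// reciprocal rank fusion. Each list contributes 1 / (k + rank) to a
/// document's score, where rank starts at 1 for the best hit.
pub fn rrf_fuse(rankings: &[Vec<&str>], k: f64) -> Vec<(String, f64)> {
    let mut scores: HashMap<String, f64> = HashMap::new();
    for ranking in rankings {
        for (idx, id) in ranking.iter().enumerate() {
            let rank = (idx + 1) as f64;
            *scores.entry((*id).to_string()).or_insert(0.0) += 1.0 / (k + rank);
        }
    }
    // Highest fused score first; ties are broken by id so the output is deterministic.
    let mut fused: Vec<(String, f64)> = scores.into_iter().collect();
    fused.sort_by(|a, b| b.1.total_cmp(&a.1).then_with(|| a.0.cmp(&b.0)));
    fused
}
```

Calling `rrf_fuse(&[vec!["a", "b", "c"], vec!["b", "c", "a"]], 60.0)` ranks `b` first, because it sits near the top of both lists, followed by `a` and then `c`.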

Quick Install

curl -fsSL https://raw.githubusercontent.com/ModernRelay/omnigraph/main/scripts/install.sh | bash

This installs omnigraph and omnigraph-server into ~/.local/bin from published release binaries.

Or install with Homebrew:

brew tap ModernRelay/tap
brew install ModernRelay/tap/omnigraph

For starter graphs and agent skills to bootstrap and operate Omnigraph, see ModernRelay/omnigraph-cookbooks.

One-Command Local RustFS Bootstrap

curl -fsSL https://raw.githubusercontent.com/ModernRelay/omnigraph/main/scripts/local-rustfs-bootstrap.sh | bash

That bootstrap:

starts RustFS on 127.0.0.1:9000
creates a bucket and S3-backed repo
loads the checked-in context fixture
launches omnigraph-server on 127.0.0.1:8080

Docker must be installed and running first.

The RustFS bootstrap prefers the rolling edge binaries and only falls back to source builds when release assets are unavailable.

If a previous run left objects under the same repo prefix but did not finish initializing the repo, rerun with RESET_REPO=1 or set PREFIX to a new value.

Common Commands

Every command takes the repo location as its final argument; the same argument accepts a local path, an s3://… URI, or http://host:port.

omnigraph init   --schema ./schema.pg ./repo.omni
omnigraph load   --data   ./data.jsonl ./repo.omni
omnigraph read   --query  ./queries.gq --name get_person --params '{"name":"Alice"}' ./repo.omni
omnigraph change --query  ./queries.gq --name insert_person --params '{"name":"Mina"}' ./repo.omni
omnigraph branch create --from main feature-x ./repo.omni
omnigraph branch merge  feature-x --into main ./repo.omni

See docs/cli.md for schema apply, snapshots, ingest, runs, and policy commands.
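
One way to picture how a single repo argument can cover local paths, S3, and a remote server is a dispatch on the URI scheme. The sketch below is illustrative only and is not taken from the Omnigraph source: `RepoTarget` and `classify_target` are hypothetical names, and accepting `https://` alongside `http://` is an assumption.

```rust
use std::path::PathBuf;

/// Hypothetical classification of the repo argument passed to the CLI.
#[derive(Debug, PartialEq)]
pub enum RepoTarget {
    /// A path on the local filesystem, e.g. ./repo.omni
    Local(PathBuf),
    /// An object-store location, e.g. s3://bucket/prefix
    S3 { bucket: String, prefix: String },
    /// A running server, e.g. http://host:port
    Http(String),
}

/// Decide which backend a repo argument points at by inspecting its scheme.
/// Anything without a recognized scheme is treated as a local path.
pub fn classify_target(uri: &str) -> RepoTarget {
    if let Some(rest) = uri.strip_prefix("s3://") {
        // Split "bucket/some/prefix" into bucket and prefix; the prefix may be empty.
        let (bucket, prefix) = rest.split_once('/').unwrap_or((rest, ""));
        RepoTarget::S3 {
            bucket: bucket.to_string(),
            prefix: prefix.to_string(),
        }
    } else if uri.starts_with("http://") || uri.starts_with("https://") {
        // Assumption: https is handled the same way as http.
        RepoTarget::Http(uri.to_string())
    } else {
        RepoTarget::Local(PathBuf::from(uri))
    }
}
```

Under these assumptions, `classify_target("./repo.omni")` yields a `Local` target, while `classify_target("http://127.0.0.1:8080")` yields an `Http` target for the address the local bootstrap starts omnigraph-server on.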

Docs

Build And Test

cargo build --workspace
cargo check --workspace
cargo test --workspace

Notes:

Rust stable toolchain, edition 2024
CI runs cargo test --workspace --locked
Full CI and some local test flows require protobuf-compiler
S3 integration tests expect an S3-compatible endpoint such as RustFS

Workspace Crates

crates/omnigraph-compiler: shared schema/query parser, typechecker, catalog, and IR lowering
crates/omnigraph: storage/runtime, branching, merge, change detection, and query execution
crates/omnigraph-cli: CLI for init/load/ingest/read/change/branch/snapshot/export/policy operations
crates/omnigraph-server: Axum HTTP server for remote reads, changes, ingest, export, branches, commits, and runs

Contributing

Please open an issue, spec, or design discussion before sending large code changes. Design feedback and concrete problem statements are the fastest way to collaborate on the roadmap.

Community

Join the Omnigraph Slack community to ask questions, share feedback, and follow development.