omnigraph

apunkt/omnigraph

Fork 0

mirror of https://github.com/ModernRelay/omnigraph.git synced 2026-06-09 01:35:18 +02:00

Commit graph

Author	SHA1	Message	Date
Claude	1d3cca1e76	docs/research: track that the first harness landed differs from proposed shape The note proposed surface 1 (index-build tuning) with recall@K oracle and BauplanLabs evolutionary tournament as the "smallest experiment that would produce signal." What landed at research/lance-autoresearch/ is a different shape: PQ kernel optimization with bit-exact correctness oracle and Karpathy single-agent autoresearch loop. Add a "First implementation landed" section that records the divergence and the reasoning (seconds-scale eval favors the autoresearch shape; kernel work has a more direct upstream PR path; the bit-exact oracle removes dataset-overfitting incentive). Bumps the note to revision 3. scripts/check-agents-md.sh still passes (30/30 links). https://claude.ai/code/session_01Aq8kBUcjmEPobcEufnWbW5	2026-05-14 23:32:57 +00:00
Claude	0de7fb3057	research: reframe LLM evolutionary sampling note around Lance directly User clarified the target: optimize Lance directly rather than OmniGraph's IR layer. Rewrites the note with Lance as the primary target. Key reframe: Lance is parameter-heavy (not just plan-shape-heavy). The biggest wins come from configuration tuples (IvfPq num_partitions / num_sub_vectors / quantizer choice, nprobes / refine_factor / prefilter, batch_size / io_buffer_size / thread pools, AIMD throttle, scalar-index choice per column, compaction policy). None of these need a Lance fork — Lance accepts them as config and emits the metrics. That makes parameter-search a no-fork, substrate-respecting application of the BauplanLabs JSON-Patch-on-DAG mechanic (patches over config objects instead of plan trees). The plan-patching angle (LanceTableProvider → DataFusion ExecutionPlan, HashJoinExec swap, multi-join reorder) is parked as the long-term play behind an upstream-contribution step: serializing/round-tripping ExecutionPlan as JSON is the prerequisite Bauplan added in their fork, and the right move is to contribute it upstream rather than maintain a fork. Ranks six surfaces by value/difficulty, proposes a smallest experiment on surface 1 (workload-conditioned IvfPq tuning on SIFT1M or LAION-sample with recall@10 / p95-latency fitness, bol_evol with n_steps=3, n_samples=4), and treats OmniGraph-IR work as a complementary footnote since it composes cleanly with a Lance-tuner output.	2026-05-14 21:38:12 +00:00
Claude	92a518a4b8	research: LLM evolutionary sampling — applicability to OmniGraph Note on Erol et al. (arXiv 2602.10387) — DBPlanBench's evolutionary search over DataFusion physical plans — and where the mechanic does and does not port to OmniGraph. The direct port (fork DataFusion, patch physical plans) is the wrong target since we touch DataFusion only as a MemTable in table_store::scan_pending_batches; the adapted form (JSON-Patch search over QueryIR, especially multi-hop Expand ordering / direction) fits cleanly above the substrate without violating §I substrate respect. Lists application surfaces by value/difficulty (multi-hop Expand reorder, RRF hybrid-retrieval k-tuning, filter-pushdown shape, vector index params, compaction policy) and proposes the smallest experiment that would produce signal — bol_evol on a ~30-query .gq corpus with bit-identical result validation. Calls out the Hyrum's Law / determinism discipline (search offline, freeze plans for serving) and the corpus bootstrap problem. Filed under docs/research/ as exploratory; not a committed plan.	2026-05-14 21:14:31 +00:00

Author

SHA1

Message

Date

Claude

1d3cca1e76

docs/research: track that the first harness landed differs from proposed shape

The note proposed surface 1 (index-build tuning) with recall@K oracle and
BauplanLabs evolutionary tournament as the "smallest experiment that would
produce signal." What landed at research/lance-autoresearch/ is a different
shape: PQ kernel optimization with bit-exact correctness oracle and Karpathy
single-agent autoresearch loop. Add a "First implementation landed" section
that records the divergence and the reasoning (seconds-scale eval favors the
autoresearch shape; kernel work has a more direct upstream PR path; the
bit-exact oracle removes dataset-overfitting incentive). Bumps the note to
revision 3.

scripts/check-agents-md.sh still passes (30/30 links).

https://claude.ai/code/session_01Aq8kBUcjmEPobcEufnWbW5

2026-05-14 23:32:57 +00:00

Claude

0de7fb3057

research: reframe LLM evolutionary sampling note around Lance directly

User clarified the target: optimize Lance directly rather than OmniGraph's
IR layer. Rewrites the note with Lance as the primary target.

Key reframe: Lance is parameter-heavy (not just plan-shape-heavy). The
biggest wins come from configuration tuples (IvfPq num_partitions /
num_sub_vectors / quantizer choice, nprobes / refine_factor / prefilter,
batch_size / io_buffer_size / thread pools, AIMD throttle, scalar-index
choice per column, compaction policy). None of these need a Lance fork —
Lance accepts them as config and emits the metrics. That makes
parameter-search a no-fork, substrate-respecting application of the
BauplanLabs JSON-Patch-on-DAG mechanic (patches over config objects
instead of plan trees).

The plan-patching angle (LanceTableProvider → DataFusion ExecutionPlan,
HashJoinExec swap, multi-join reorder) is parked as the long-term play
behind an upstream-contribution step: serializing/round-tripping
ExecutionPlan as JSON is the prerequisite Bauplan added in their fork,
and the right move is to contribute it upstream rather than maintain a
fork.

Ranks six surfaces by value/difficulty, proposes a smallest experiment on
surface 1 (workload-conditioned IvfPq tuning on SIFT1M or LAION-sample
with recall@10 / p95-latency fitness, bol_evol with n_steps=3,
n_samples=4), and treats OmniGraph-IR work as a complementary footnote
since it composes cleanly with a Lance-tuner output.

2026-05-14 21:38:12 +00:00

Claude

92a518a4b8

research: LLM evolutionary sampling — applicability to OmniGraph

Note on Erol et al. (arXiv 2602.10387) — DBPlanBench's evolutionary search
over DataFusion physical plans — and where the mechanic does and does not
port to OmniGraph. The direct port (fork DataFusion, patch physical plans)
is the wrong target since we touch DataFusion only as a MemTable in
table_store::scan_pending_batches; the adapted form (JSON-Patch search over
QueryIR, especially multi-hop Expand ordering / direction) fits cleanly
above the substrate without violating §I substrate respect.

Lists application surfaces by value/difficulty (multi-hop Expand reorder,
RRF hybrid-retrieval k-tuning, filter-pushdown shape, vector index params,
compaction policy) and proposes the smallest experiment that would produce
signal — bol_evol on a ~30-query .gq corpus with bit-identical result
validation. Calls out the Hyrum's Law / determinism discipline (search
offline, freeze plans for serving) and the corpus bootstrap problem.

Filed under docs/research/ as exploratory; not a committed plan.

2026-05-14 21:14:31 +00:00

3 commits