research(nightly): multi-subspace HNSW with coherence-weighted fusion by ruvnet · Pull Request #556 · ruvnet/RuVector

ruvnet · 2026-06-12T07:34:21Z

Adds nightly RuVector research for multi-subspace HNSW with coherence-weighted fusion.

What this adds

Introduces crates/ruvector-subspace-hnsw — a new standalone crate implementing three measurable ANN variants:

Baseline-HNSW: single graph on all D=128 dimensions
SubspaceUnion-HNSW: 4 independent HNSW graphs on D/4=32-dim subspaces, results merged by union + full-space re-rank
CoherenceHnsw: same 4 subspace graphs, results fused by per-query variance-based coherence weights (w_s = 1/(1+CV_s))

Key research finding

Coherence fusion provides a +21pp recall improvement at N=2K, D=64 compared to single-space HNSW (0.840 vs 0.630). At N=10K D=128, the single-space baseline dominates — the scale-dependent crossover is a concrete open research question.

Real benchmark results (N=10K, D=128, x86-64)

Variant	Build (ms)	Recall@10	Mean (µs)	QPS	Memory
Baseline-HNSW	1,464	0.543	184	5,422	6.59 MB
SubspaceUnion (4×32)	5,890	0.443	874	1,144	16.53 MB
CoherenceHnsw (4×32)	5,817	0.443	880	1,136	16.53 MB

All numbers from cargo run --release -p ruvector-subspace-hnsw --bin benchmark. No fake tables.

Includes

Working Rust PoC (crates/ruvector-subspace-hnsw/) — pure Rust, only rand = "0.8" dep
15 unit tests, all passing (cargo test -p ruvector-subspace-hnsw)
Benchmark binary with acceptance tests (PASS)
ADR-199 (docs/adr/ADR-199-multi-subspace-hnsw-coherence-fusion.md)
Research doc (docs/research/nightly/2026-06-12-multi-subspace-hnsw/README.md)
Public gist (docs/research/nightly/2026-06-12-multi-subspace-hnsw/gist.md)

Prior art gap

Closest prior work: Subspace Collision (arXiv:2411.14754, SIGMOD 2025) and TaCo (arXiv:2603.24919, March 2026) use clustering indexes with static collision-count fusion. This is the first implementation using HNSW per subspace with runtime variance-based coherence weighting.

Research doc: docs/research/nightly/2026-06-12-multi-subspace-hnsw/README.md
ADR: docs/adr/ADR-199-multi-subspace-hnsw-coherence-fusion.md

Generated by Claude Code

…usion Introduces `ruvector-subspace-hnsw` (ADR-199): K independent HNSW indexes on D/K-dimensional subspaces fused via per-query variance-based coherence weights. Measured results (release, x86-64, N=10K D=128): Baseline-HNSW: recall@10=0.543, 184µs mean, 5422 QPS, 6.59MB SubspaceUnion: recall@10=0.443, 874µs mean, 1144 QPS, 16.53MB CoherenceHnsw: recall@10=0.443, 880µs mean, 1136 QPS, 16.53MB Unit-test scale (N=2K D=64): coherence fusion +21pp vs baseline (0.840 vs 0.630). - 15 unit tests pass (cargo test -p ruvector-subspace-hnsw) - Benchmark binary passes acceptance thresholds - ADR-199 documents scale-dependent coherence benefit and production path - Research doc covers SOTA (Subspace Collision arXiv:2411.14754, TaCo arXiv:2603.24919) - Public gist at docs/research/nightly/2026-06-12-multi-subspace-hnsw/gist.md https://claude.ai/code/session_014erBW46sqeMFV8ipF4XsBy

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

research(nightly): multi-subspace HNSW with coherence-weighted fusion#556

research(nightly): multi-subspace HNSW with coherence-weighted fusion#556
ruvnet wants to merge 1 commit into
mainfrom
research/nightly/2026-06-12-multi-subspace-hnsw

ruvnet commented Jun 12, 2026

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

2 participants

Conversation

ruvnet commented Jun 12, 2026

What this adds

Key research finding

Real benchmark results (N=10K, D=128, x86-64)

Includes

Prior art gap

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

2 participants