perf: add random-input percentile benchmarks to exact arithmetic suite

## Summary

Add a benchmark group to `benches/exact.rs` that runs N random matrices per dimension and reports median/p95/p99 timings, capturing variance that fixed single-input benchmarks miss.

Originally proposed as item #3 in #80 and explicitly scoped out of that PR; tracking here for a future release.

## Current State

`benches/exact.rs` measures exact-arithmetic performance on fixed inputs only:

- Per-dimension well-conditioned matrices (`exact_d{2..5}`)
- Four adversarial groups (`exact_near_singular_3x3`, `exact_large_entries_3x3`, `exact_hilbert_4x4`, `exact_hilbert_5x5`)

Each bench measures a single input, so reported times only characterise that specific matrix. Branch misprediction, allocation patterns, and intermediate `BigInt` sizes vary with input structure in ways a single fixed input cannot capture.

## Proposed Changes

Add a new bench group (e.g. `exact_random_percentile_d{2..5}`) that:

- Generates N (e.g. 50 or 100) random matrices per dimension with a seeded RNG for reproducibility
- Runs `det_exact`, `det_sign_exact`, `solve_exact`, `solve_exact_f64` on each
- Reports median, p95, and p99 timings per bench using Criterion's custom measurement / `iter_batched` APIs

Design considerations:

- Use a fixed seed (e.g. `[0u8; 32]`) so CI and local runs agree
- Matrix generation must avoid singularity — use a diagonally-dominant construction (same shape as `make_diagonally_dominant` in `tests/proptest_exact.rs`) rather than reject-retry, which is harder to reproduce
- RHS generation should match — small integers keep construction simple and exact
- Budget bench runtime: consider limiting N per dimension, restricting to D=3..=5, or gating behind a separate recipe (`just bench-exact-random`) given the higher runtime cost

## Benefits

- Captures variance that fixed single-input benches miss (branch misprediction, allocation patterns, intermediate `BigInt` sizes)
- Detects tail-case regressions where the median is unchanged but p99 worsens
- Provides stronger empirical grounding for `docs/PERFORMANCE.md` claims

## Implementation Notes

- Criterion supports custom measurement and iteration counts for random-input benchmarks
- Will likely need a new entry in `EXACT_GROUPS` in `scripts/bench_compare.py` (and a corresponding `_group_heading` branch)
- See item #3 in #80 for the original proposal context
- Follow-up to the adversarial-input bench coverage landed in the issue-80 PR

Related: #80


Provide feedback

Saved searches

Use saved searches to filter your results more quickly

perf: add random-input percentile benchmarks to exact arithmetic suite #98

Summary

Current State

Proposed Changes

Benefits

Implementation Notes

Metadata

Assignees

Labels

Projects

Milestone

Relationships

Development

perf: add random-input percentile benchmarks to exact arithmetic suite #98

Description

Summary

Current State

Proposed Changes

Benefits

Implementation Notes

Metadata

Metadata

Assignees

Labels

Projects

Milestone

Relationships

Development

Issue actions