feat(benchmarks): add pool viewer and auto-shutdown by abrichr · Pull Request #20 · OpenAdaptAI/openadapt-ml

abrichr · 2026-02-05T21:00:10Z

Summary

Add HTML viewer for WAA pool benchmark results (view-pool command)
Implement VM auto-shutdown feature to prevent orphaned VMs
Update documentation to use correct pool-* CLI commands
Add analyze_pool_logs.py script for parsing results

Changes

Pool Results Viewer (`pool_viewer.py`)

Parses waa-pool-*.log files to extract task results
Generates interactive HTML with:
- Summary stats (total tasks, success rate, avg time)
- Per-worker breakdown
- Domain breakdown with success rates
- Filterable task list with pass/fail status

Auto-Shutdown

Default 4-hour auto-shutdown via Azure policy
--auto-shutdown-hours flag on pool-create
try/finally cleanup for test VMs to prevent orphans

Documentation

Updated README to use pool-status/pool-vnc/pool-logs (not outdated vm monitor)
Updated CLAUDE.md with auto-shutdown documentation

Test plan

pool-create --workers 2 sets auto-shutdown policy
view-pool generates HTML from benchmark_results/pool_run_20260204/
pool-cleanup -y cleans up all resources

🤖 Generated with Claude Code

Add pool_viewer.py module and CLI command for generating interactive HTML viewers from WAA parallel benchmark runs. Features: - Parse waa-pool-*.log files to extract task results - Summary stats (total tasks, success rate, avg time per task) - Per-worker breakdown showing tasks per worker - Task list with pass/fail status and step counts - Domain breakdown with per-domain success rates - Interactive filters for domain and status Usage: uv run python -m openadapt_ml.benchmarks.cli view-pool uv run python -m openadapt_ml.benchmarks.cli view-pool --run-name pool_run_20260204 uv run python -m openadapt_ml.benchmarks.cli view-pool --no-open Co-Authored-By: Claude Opus 4.5 <noreply@anthropic.com>

Add documentation for the auto-shutdown feature: - Explain auto-shutdown policy (default 4 hours) - Document --auto-shutdown-hours flag for pool-create and create - Document -y flag for pool-cleanup (skip confirmation) - Document test VM cleanup via try/finally Co-Authored-By: Claude Opus 4.5 <noreply@anthropic.com>

Update documentation to reflect the current working CLI: - Replace outdated `vm monitor` with `pool-status/pool-vnc/pool-logs` - Update single VM workflow to use `pool-create --workers 1` - Add analyze_pool_logs.py script for parsing benchmark results Co-Authored-By: Claude Opus 4.5 <noreply@anthropic.com>

abrichr and others added 3 commits February 5, 2026 12:48

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

feat(benchmarks): add pool viewer and auto-shutdown#20

feat(benchmarks): add pool viewer and auto-shutdown#20
abrichr wants to merge 3 commits intomainfrom
feat/pool-viewer-auto-shutdown

abrichr commented Feb 5, 2026

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

1 participant

Conversation

abrichr commented Feb 5, 2026

Summary

Changes

Pool Results Viewer (pool_viewer.py)

Auto-Shutdown

Documentation

Test plan

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

1 participant

Pool Results Viewer (`pool_viewer.py`)