You signed in with another tab or window. Reload to refresh your session.You signed out in another tab or window. Reload to refresh your session.You switched accounts on another tab or window. Reload to refresh your session.Dismiss alert
Define the four-tier validation architecture that enables agents to self-validate locally before pushing to remote CI, eliminating the slow feedback loops that waste compute and tokens.
The problem
Current validation has two tiers:
Pre-commit (< 5s) — formatting, secrets, lint
Remote CI (5–20 min) — full build, test, security, deploy
The gap between these means agents discover failures only after pushing — leading to 15+ minute feedback loops, wasted inference tokens, and cascade failures in stacked PR chains.
The four tiers
Tier
Time
What
Status
0 — Pre-commit
< 5s
Format, lint, secrets
✅ Implemented
1 — Local build
30–90s
Compile, unit test, synth
⚠️ Exists but not gated
2 — Local sandbox
1–5 min
Integration, container tests
❌ Gap
3 — Remote CI
5–20 min
Full matrix, deploy, E2E
✅ Implemented
Tier 2 is the critical gap. This ADR defines what it should contain and how to build it progressively.
Scope
ADR-013 documenting the tiered validation architecture decision
Tier definitions with what each catches
Progressive build-out plan for Tier 2 (LocalStack → agent smoke → ephemeral stack → full sandbox)
Enforcement model (which tier gates which event)
Agent interaction model (skill-driven validation before push)
Investment priority (P0–P4)
Escape hatches for legitimate bypasses
Acceptance criteria
docs/decisions/013-tiered-validation-pyramid.md exists with status proposed
All four tiers defined with time budget, scope, and current implementation status
Summary
Define the four-tier validation architecture that enables agents to self-validate locally before pushing to remote CI, eliminating the slow feedback loops that waste compute and tokens.
The problem
Current validation has two tiers:
The gap between these means agents discover failures only after pushing — leading to 15+ minute feedback loops, wasted inference tokens, and cascade failures in stacked PR chains.
The four tiers
Tier 2 is the critical gap. This ADR defines what it should contain and how to build it progressively.
Scope
Acceptance criteria
docs/decisions/013-tiered-validation-pyramid.mdexists with statusproposedDependencies
References
.pre-commit-config.yaml(Tier 0)mise.toml(Tier 1)