From fc390bfad5841a38c41e6473a3b4b3e776dba340 Mon Sep 17 00:00:00 2001
From: Gabor Szabo <shellsnake@icloud.com>
Date: Fri, 12 Jun 2026 23:42:27 +0200
Subject: [PATCH] docs(repo): track showcase-completion e1-e5 prps (#406)

---
 ...pletion-E1-metadata-provenance-backbone.md | 1031 ++++++++++++++
 ...ase-completion-E2-safe-replay-lifecycle.md | 1247 +++++++++++++++++
 ...howcase-completion-E3-seed-config-scope.md | 1080 ++++++++++++++
 ...completion-E4-run-config-phase-controls.md |  820 +++++++++++
 ...e-completion-E5-agent-rag-story-capture.md | 1185 ++++++++++++++++
 5 files changed, 5363 insertions(+)
 create mode 100644 PRPs/PRP-showcase-completion-E1-metadata-provenance-backbone.md
 create mode 100644 PRPs/PRP-showcase-completion-E2-safe-replay-lifecycle.md
 create mode 100644 PRPs/PRP-showcase-completion-E3-seed-config-scope.md
 create mode 100644 PRPs/PRP-showcase-completion-E4-run-config-phase-controls.md
 create mode 100644 PRPs/PRP-showcase-completion-E5-agent-rag-story-capture.md

diff --git a/PRPs/PRP-showcase-completion-E1-metadata-provenance-backbone.md b/PRPs/PRP-showcase-completion-E1-metadata-provenance-backbone.md
new file mode 100644
index 00000000..101fdf00
--- /dev/null
+++ b/PRPs/PRP-showcase-completion-E1-metadata-provenance-backbone.md
@@ -0,0 +1,1031 @@
+name: "PRP — Showcase Completion E1: Workspace Metadata & Provenance Backbone (issue #407)"
+description: |
+
+## Purpose
+
+Implement the Foundation epic of the showcase-completion initiative (umbrella #406):
+one Alembic migration extends `showcase_workspace` with lifecycle + provenance columns
+(`replayed_from_workspace_id`, `archived`, `pinned`, `notes`, `tags`,
+`config_schema_version`) and six documented JSONB story-slot columns
+(`seed_overrides`, `user_scope`, `approval_events`, `rag_events`, `job_ids`,
+`phase_summaries`); a `PATCH /demo/workspaces/{id}` lifecycle endpoint
+(rename/notes/tags/archive/pin) lands with its Pydantic schema surface; and Replay
+writes `replayed_from_workspace_id`. Every Parallel epic (#408–#412) writes into or
+reads from this surface, so it ships first. Blocks E2 #408, E3 #409, E4 #410,
+E5 #411, E6 #412.
+
+## Core Principles
+
+1. **Context is King**: every reference below was verified against the live code on 2026-06-12 (branch `dev` @ `bdf85f6`).
+2. **Validation Loops**: each level is executable as written.
+3. **Information Dense**: patterns cite exact file:line.
+4. **Progressive Success**: model+migration → schemas → service helpers → PATCH route → replay wiring → tests → docs.
+5. **Global rules**: follow CLAUDE.md / AGENTS.md; all five CI gates must pass; all changes ADDITIVE.
+
+---
+
+## Goal
+
+The `showcase_workspace` table gains the metadata + provenance backbone every other
+epic of umbrella #406 consumes:
+
+- **Lifecycle columns**: `archived` (bool), `pinned` (bool), `notes` (free text),
+  `tags` (queryable JSONB string array, GIN-indexed — exact `scenario_plan.tags`
+  pattern), `config_schema_version` (int, schema-evolution marker).
+- **Provenance column**: `replayed_from_workspace_id` — a SOFT reference (String(32),
+  indexed, deliberately **no ForeignKey**, not even self-referential) recorded when a
+  run is a Replay of a saved workspace.
+- **Six documented JSONB story slots** as dedicated nullable JSONB columns:
+  `seed_overrides`, `user_scope`, `approval_events`, `rag_events`, `job_ids`,
+  `phase_summaries`. E1 ships the columns + the documented per-slot schema; E1 writes
+  NONE of them (all stay NULL) — E3 (#409) writes `seed_overrides` + `user_scope`,
+  E5 (#411) writes `approval_events` + `rag_events`, later parallel epics write
+  `job_ids` + `phase_summaries`.
+- **`PATCH /demo/workspaces/{workspace_id}`** — partial-update lifecycle endpoint:
+  rename / notes / tags / archive / pin. Missing id → RFC 7807 404. Returns the
+  updated `WorkspaceDetailResponse`.
+- **Replay provenance**: `DemoRunRequest` gains an additive Optional
+  `replayed_from_workspace_id` field; the frontend Replay handler sends the source
+  row's `workspace_id`; `create_workspace` records it on the NEW row.
+
+A run/request without any new field behaves **byte-identically to today** (legacy WS
+start frames and HTTP bodies unchanged). One migration applies AND downgrades cleanly
+on a fresh DB.
+
+**Deliverable** (all additive):
+
+- `app/features/demo/models.py` — 12 new columns on `ShowcaseWorkspace` + tags GIN index + replayed-from index.
+- `alembic/versions/<new>_add_showcase_workspace_metadata_provenance.py` — `down_revision = "324a2fa37fcc"`; add-columns + indexes; clean downgrade.
+- `app/features/demo/schemas.py` — `DemoRunRequest.replayed_from_workspace_id`; new `WorkspaceUpdateRequest`; `WorkspaceListItem` / `WorkspaceDetailResponse` additive response fields.
+- `app/features/demo/workspace.py` — `create_workspace` records `replayed_from_workspace_id`; new `update_workspace` helper.
+- `app/features/demo/routes.py` — `PATCH /demo/workspaces/{workspace_id}`.
+- `frontend/src/types/api.ts` + `frontend/src/pages/showcase.tsx` — two-line additive Replay wiring (see "Why the (ui) sliver" below).
+- Tests: schema unit tests, model constraint/roundtrip integration tests, workspace-helper integration tests, PATCH route tests (2xx + 404 + 422), migration up/down.
+- Docs: `docs/_base/API_CONTRACTS.md` + `docs/_base/DOMAIN_MODEL.md` additive notes (the documented story-slot schema lives in DOMAIN_MODEL — umbrella #406 risk mitigation).
+
+**Success definition**: all Success Criteria below check off; the five CI gates are
+green; integration suite green; a manual Replay from the `/showcase` Saved-workspaces
+panel produces a new row whose `replayed_from_workspace_id` equals the source row's
+`workspace_id`; `PATCH /demo/workspaces/{id}` round-trips rename/notes/tags/archive/pin.
+
+## Why
+
+- Umbrella #406: today workspaces cannot be renamed/archived/annotated/searched, the
+  row lacks replay lineage, seed overrides, user scope, approval history, and RAG
+  events. E1 is the Foundation — **every** Parallel epic writes into or reads from
+  the columns added here, so the frozen column/slot contract ships first.
+- Replays are currently indistinguishable from fresh keep-runs except by
+  name/timestamp (documented gap, `docs/_base/RUNBOOKS.md` § Showcase workspace,
+  "Explicitly out of scope" — the `replayed_from` provenance column is this epic).
+- The umbrella's junk-drawer risk ("JSONB story slots become a junk drawer") is
+  mitigated here by `config_schema_version` + a documented per-slot schema in
+  `docs/_base/DOMAIN_MODEL.md`.
+
+### Why the (ui) sliver in an (api,db) epic
+
+"Replay writes `replayed_from_workspace_id`" is a frozen epic-level success
+criterion, and Replay is frontend-initiated: `handleReplayWorkspace`
+(`frontend/src/pages/showcase.tsx:174-186`) re-submits the recorded config through
+the WS start frame. Without the sender including the field, the backend has nothing
+to record. The wiring is two additive lines (one TS interface field + one start-frame
+key) — deliberately included here so the criterion is verifiable in E1; the lineage
+*rendering* (badge + chain) stays in E2 (#408).
+
+## What
+
+### User-visible behavior
+
+- `PATCH /demo/workspaces/{workspace_id}` accepts a partial body of
+  `{name?, notes?, tags?, archived?, pinned?}`; only provided fields change; explicit
+  `null` clears `name` / `notes`. Missing id → `404 application/problem+json`. A
+  malformed body (bad name pattern, unknown key, >20 tags) → `422
+  application/problem+json`. Empty body `{}` → `200` no-op returning the current row
+  (mirrors the `RunUpdate` precedent — see Decisions).
+- `POST /demo/run` and the `WS /demo/stream` start frame accept an additive Optional
+  `replayed_from_workspace_id: str | null` (`^[0-9a-f]{32}$`); supplying it without
+  `preservation="keep"` is a 422 (a lineage pointer is meaningless when no row is
+  written — same validator pattern as `workspace_name`).
+- Clicking **Replay** on the Saved-workspaces panel now records the source
+  `workspace_id` on the new row. The original row is never mutated (E4 #393
+  invariant preserved).
+- `GET /demo/workspaces` list items additively carry `archived`, `pinned`, `tags`,
+  `replayed_from_workspace_id`; the detail response additively carries those plus
+  `notes`, `config_schema_version`, and the six story slots. **List behavior is
+  otherwise unchanged in E1** — archived rows are still listed; default-filtering /
+  search / sort is E2 (#408).
+
+### Technical requirements
+
+- One Alembic migration off head `324a2fa37fcc` (verified `uv run alembic heads`,
+  2026-06-12). Forward-only: a NEW revision — never edit
+  `324a2fa37fcc_create_showcase_workspace_table.py`.
+- Every new column is nullable OR carries a `server_default` so the migration applies
+  on a table with existing rows; downgrade drops indexes then columns, cleanly.
+- **No ForeignKeys anywhere** — `replayed_from_workspace_id` is an opaque soft
+  reference, consistent with the table-wide invariant
+  (`docs/_base/DOMAIN_MODEL.md` § `showcase_workspace`: "`created_objects` carries
+  SOFT references only — no ForeignKeys by design"). Even a *self-referential* FK is
+  ruled out: ancestor workspace rows must remain independently deletable
+  (metadata-only delete, #404) without cascading to or blocking descendants. State
+  this in the model docstring.
+- `status` is NOT patchable — the pipeline finalize hook owns the
+  running/completed/failed lifecycle; `archived` is an orthogonal boolean so the
+  existing `ck_showcase_workspace_status` CHECK is untouched.
+- Vertical slice: all backend changes inside `app/features/demo/` +
+  `alembic/versions/`; no cross-slice imports (demo imports only `app.core.*`,
+  `app.shared.*`, stdlib/3rd-party).
+- RFC 7807 errors only — `NotFoundError` from `app/core/exceptions.py` (the demo
+  routes' existing pattern, `routes.py:134`), never bare `HTTPException`.
+- Pydantic v2 `ConfigDict(strict=True)` on the new request body. All new fields are
+  JSON-native (`str`/`bool`/`list[str]`) → NO `Field(strict=False)` override needed;
+  the AST policy walker (`app/core/tests/test_strict_mode_policy.py`) only fires on
+  date/datetime/time/UUID/Decimal.
+- Warn-and-continue invariant untouched: `create_workspace` /`finalize_workspace`
+  keep swallowing all DB errors. The new `update_workspace` helper is
+  request-scoped (caller-owned session, raises normally) — it backs an HTTP
+  endpoint, not the pipeline.
+
+### Success Criteria
+
+- [ ] Migration applies AND downgrades cleanly on a fresh DB (`upgrade head` →
+  `downgrade -1` → `upgrade head`); applies on a DB with pre-existing
+  `showcase_workspace` rows (server defaults backfill `archived=false`,
+  `pinned=false`, `tags=[]`, `config_schema_version=1`).
+- [ ] `DemoRunRequest()` (no args) serializes identically to today plus
+  `replayed_from_workspace_id=None`; a legacy start frame (no new keys) validates;
+  `replayed_from_workspace_id` without `preservation="keep"` → 422; a non-32-hex
+  value → 422.
+- [ ] A keep-run with `replayed_from_workspace_id="<32hex>"` produces a row whose
+  `replayed_from_workspace_id` column equals that value; the source row is unread
+  and unmodified (the value is recorded verbatim — no existence check, it is a soft
+  reference).
+- [ ] Frontend Replay sends `replayed_from_workspace_id: ws.workspace_id`;
+  `pnpm tsc -b` introduces no NEW errors (see gotcha on the pre-existing-failure
+  baseline) and `pnpm test --run` green.
+- [ ] `PATCH /demo/workspaces/{id}`: happy path updates exactly the provided fields
+  and returns the updated detail; `{}` is a 200 no-op; missing id → 404
+  problem+json; bad name pattern / unknown key / 21 tags → 422 problem+json.
+- [ ] `tags` round-trips as a JSONB string array and is GIN-indexed
+  (`ix_showcase_workspace_tags_gin`); a `.contains(["x"])` containment query works
+  (E2 will route it — E1 proves it in an integration test).
+- [ ] All six story-slot columns exist, default NULL, and round-trip a JSONB payload
+  in an integration test; E1 production code writes none of them.
+- [ ] `uv run ruff check . && uv run ruff format --check . && uv run mypy app/ &&
+  uv run pyright app/ && uv run pytest -v -m "not integration"` all green;
+  integration suite green against docker-compose Postgres;
+  `test_strict_mode_policy.py` green.
+
+## Decisions (the open questions this PRP resolves)
+
+> These are FROZEN for the parallel epics. #408–#412 PRP authors: consume, don't re-decide.
+
+1. **`tags` representation — CONFIRMED: mirror `scenario_plan.tags` exactly.**
+   A dedicated JSONB string-array column, `nullable=False`,
+   `server_default=text("'[]'::jsonb")`, with a GIN index
+   (`ix_showcase_workspace_tags_gin`). Verified in code:
+   `app/features/scenarios/models.py:74-76,97` (column + index), migration
+   `alembic/versions/bb8c4587ef1d_add_scenario_library_columns.py:26-45`
+   (add_column + GIN), and the containment query
+   `app/features/scenarios/service.py:464` (`ScenarioPlan.tags.contains(tags)`).
+   No deviation: the pattern is proven, queryable, and E2's tag filter reuses the
+   same `.contains()` shape. Tags are free-text strings (scenario precedent has no
+   per-item pattern); the PATCH boundary caps the list at 20 items
+   (`Field(max_length=20)` — same cap as `ScenarioCreateRequest.tags`,
+   `app/features/scenarios/schemas.py:203-206`).
+
+2. **Story slots — six dedicated nullable JSONB columns** (NOT keys inside one
+   `story` blob, NOT keys inside `created_objects`). Rationale: the existing
+   precedent is purpose-named JSONB columns with documented internal schemas
+   (`created_objects`, `result_summary` — `app/features/demo/models.py:77-81`);
+   each slot has a different writer epic and a different write moment
+   (create-time vs mid-run append vs finalize), and separate columns keep each
+   write isolated, independently nullable (NULL = "never written", distinct from
+   empty), individually typed in the ORM (`dict[str, Any] | None` vs
+   `list[dict] | None`), and trivially additive in responses. A single `story`
+   column would force read-modify-write of one blob across four epics and would
+   itself need a documented sub-schema anyway — more coupling, zero benefit on a
+   low-cardinality audit table. Per-slot documented schema: see the Data-models
+   blueprint below + the DOMAIN_MODEL doc task.
+
+3. **`replayed_from_workspace_id` — SOFT reference, no FK, confirmed.** String(32)
+   nullable, btree index (`ix_showcase_workspace_replayed_from`), NO ForeignKey —
+   including no self-referential FK: `docs/_base/DOMAIN_MODEL.md` pins
+   "deletion in either direction never cascades", and an FK (even `ON DELETE SET
+   NULL`) would couple delete behavior to lineage. Dangling lineage pointers after
+   an ancestor delete are expected and harmless (same semantics as every
+   `created_objects` id). Recorded verbatim from the request — no existence
+   validation (a replay of a just-deleted workspace still records the id it came
+   from; E2's liveness check surfaces dangles).
+
+4. **PATCH semantics — `exclude_unset` partial update, `extra="forbid"`, empty body
+   = no-op 200.** `model_dump(exclude_unset=True)` distinguishes absent from
+   explicit-null (runtime-verified, see Gotchas); explicit `null` clears `name` /
+   `notes`; `extra="forbid"` catches typo'd field names (the `RunUpdate` precedent,
+   `app/features/registry/schemas.py:113-123`); an empty body is a valid no-op
+   (mirrors `RunUpdate`, which has no min-fields validator). `archived`/`pinned`
+   accept only `true`/`false` and `tags` accepts only a list (not null — all
+   three back NOT NULL columns; send `[]` to clear tags). Explicit `null` on any
+   of the three is rejected at the schema boundary (422), never reaching
+   `setattr` → IntegrityError 500.
+
+5. **E1 writes no story slot.** `seed_overrides`/`user_scope` writers land in E3
+   (#409), `approval_events`/`rag_events` in E5 (#411), `job_ids`/
+   `phase_summaries` in the remaining parallel epics (E2 #408 health summary /
+   E4 #410 run-config echo — whichever lands first follows the documented schema).
+   E1 ships columns + schema docs + roundtrip tests only.
+
+6. **`config_schema_version` starts at 1.** Integer NOT NULL, `server_default
+   text("1")`, ORM `default=1`. It versions the *workspace config + story-slot
+   schema* as a whole; any epic that changes a documented slot shape bumps the
+   ORM default and documents the delta in DOMAIN_MODEL. E1 does not branch on it.
+
+### Assumptions (explicit, decided without user input)
+
+- `notes` is `sa.Text()` in the DB with a 2000-char cap enforced at the Pydantic
+  boundary only (no DB CHECK) — matches the repo's boundary-validation style
+  (`RunUpdate.error_message` caps at the schema layer, `registry/schemas.py:123`).
+- Renaming via PATCH uses the same `^[a-z0-9][a-z0-9\-_]*$` / ≤100 pattern as
+  `DemoRunRequest.workspace_name` (`demo/schemas.py:72-77`) — names stay
+  non-unique by design (E4 #393 invariant).
+- The PATCH route reuses `WorkspaceDetailResponse` as its response model (the
+  updated row, full detail) rather than introducing a new response shape.
+- Pin/archive carry NO behavioral semantics in E1 (no list reordering, no
+  default-filtering) — E2 (#408) wires the UX. E1 just persists the booleans.
+- The umbrella's "destructive-replay confirmation" is E2 (#408) — NOT here.
+  E1's replay change is provenance-recording only.
+- `replayed_from_workspace_id` requires `preservation="keep"`: a lineage pointer
+  on an ephemeral run has no row to land on. (The frontend Replay always sends
+  `preservation: 'keep'` — `showcase.tsx:179-185` — so this constraint is
+  invisible to the shipped UI.)
+
+## All Needed Context
+
+### Documentation & References
+
+```yaml
+# MUST READ — codebase patterns (all verified 2026-06-12, branch dev @ bdf85f6)
+
+- file: app/features/demo/models.py
+  why: |
+    THE file you extend. ShowcaseWorkspace at line 37; status constants 32-34;
+    JSONB precedent created_objects/result_summary at 77-81; __table_args__ with
+    named CheckConstraint + composite index at 83-89. Module docstring documents
+    the no-FK soft-reference decision — extend that docstring for
+    replayed_from_workspace_id. GOTCHA in docstring: SQLAlchemy reserves the
+    attr name `metadata`.
+
+- file: alembic/versions/324a2fa37fcc_create_showcase_workspace_table.py
+  why: |
+    CURRENT HEAD (verified `uv run alembic heads` → 324a2fa37fcc). Your
+    down_revision. Header/docstring format, typing (`revision: str`,
+    `down_revision: str | None`), op.f() index-naming convention to mirror.
+    NEVER edit this file — forward-only.
+
+- file: alembic/versions/bb8c4587ef1d_add_scenario_library_columns.py
+  why: |
+    THE add-columns migration to mirror: op.add_column with JSONB
+    server_default text("'[]'::jsonb") (lines 26-34), GIN index creation
+    (39-45), downgrade drops index-then-columns (48-52) incl. the
+    postgresql_using='gin' kwarg on drop_index.
+
+- file: app/features/scenarios/models.py
+  why: |
+    tags JSONB-array pattern (lines 74-76: Mapped[list[str]], nullable=False,
+    default=list, server_default=text("'[]'::jsonb")) + GIN index in
+    __table_args__ (line 97). This is the tags representation E1 mirrors
+    verbatim (Decision 1).
+
+- file: app/features/scenarios/service.py
+  why: |
+    Line 464: `ScenarioPlan.tags.contains(tags)` — the JSONB containment query
+    shape the tags column must support (prove it in an integration test; E2
+    routes it).
+
+- file: app/features/demo/schemas.py
+  why: |
+    DemoRunRequest at 29-85: ConfigDict(strict=True) line 40; the
+    workspace_name pattern + model_validator _workspace_name_requires_keep
+    (72-85) — copy this exact validator shape for replayed_from_workspace_id.
+    WorkspaceListItem (169-189) / WorkspaceDetailResponse (192-203) /
+    WorkspaceListResponse (205-213) — the response models you extend
+    additively. Response models are plain BaseModel + from_attributes (NOT
+    strict) — keep that split.
+
+- file: app/features/demo/workspace.py
+  why: |
+    create_workspace (46-79): the insert you extend with one kwarg
+    (replayed_from_workspace_id=req.replayed_from_workspace_id). get_workspace
+    (158-171) — reuse inside update_workspace. delete_workspace (199-221) —
+    the caller-owned-session + commit + logger.info shape update_workspace
+    mirrors. NOTE the split: create/finalize open their OWN sessions
+    (pipeline-scoped, warn-and-continue); get/list/delete take a caller-owned
+    AsyncSession (request-scoped, raise normally) — update_workspace is the
+    second kind.
+
+- file: app/features/demo/routes.py
+  why: |
+    The router you extend. delete_showcase_workspace (138-163) — the exact
+    route shape for PATCH: Depends(get_db), NotFoundError on missing (RFC 7807
+    via registered handler), docstring style. get_showcase_workspace (110-135)
+    — WorkspaceDetailResponse return shape.
+
+- file: app/features/registry/schemas.py
+  why: |
+    RunUpdate (113-123) — THE partial-update request precedent:
+    ConfigDict(extra="forbid"), all-Optional fields, no min-fields validator
+    (empty body = no-op). E1's WorkspaceUpdateRequest adds strict=True on top
+    (post-PRP-14 request-body policy; RunUpdate predates it).
+
+- file: app/features/demo/pipeline.py
+  why: |
+    DemoContext workspace fields at 258-263; the keep-branch create hook at
+    2652-2657; finalize hook at 2741-2746. E1 does NOT touch the pipeline —
+    create_workspace reads the new field straight off `req`. Read only to
+    confirm no hook change is needed.
+
+- file: app/core/exceptions.py
+  why: |
+    NotFoundError (line 72) → RFC 7807 404 via registered handler. The 422s
+    come FREE from Pydantic validation at the boundary (FastAPI → 422
+    problem+json).
+
+- file: app/features/demo/tests/test_schemas.py
+  why: |
+    Existing DemoRunRequest tests INCLUDING the mandatory JSON-dict path
+    (Model.model_validate({...}) per .claude/rules/security-patterns.md
+    § strict mode). Extend for the new field + add a WorkspaceUpdateRequest
+    block.
+
+- file: app/features/demo/tests/test_workspace.py
+  why: |
+    Integration-test patterns for create/finalize/get/list/delete — session
+    fixture, @pytest.mark.integration, row-cleanup conventions. Extend with
+    update_workspace + replayed_from cases.
+
+- file: app/features/demo/tests/test_models.py
+  why: |
+    Constraint/roundtrip integration tests for ShowcaseWorkspace — extend with
+    new-column defaults, tags containment, story-slot roundtrip.
+
+- file: app/features/demo/tests/test_routes.py
+  why: |
+    Route-test conventions: ASGITransport client from conftest, workspace
+    module monkeypatched for unit-shaped route tests, integration-marked tests
+    for DB-backed paths. The DELETE 404 test is the template for PATCH 404.
+
+- file: frontend/src/pages/showcase.tsx
+  why: |
+    handleReplayWorkspace at 174-186 — the start() call that gains ONE key:
+    `replayed_from_workspace_id: ws.workspace_id`. handleLoadWorkspace
+    (160-168) stays untouched (Load is read-only).
+
+- file: frontend/src/types/api.ts
+  why: |
+    DemoRunRequest interface at 778-788 — add
+    `replayed_from_workspace_id?: string` with an `// E1 (#407)` comment in
+    the existing style.
+
+- file: docs/_base/DOMAIN_MODEL.md
+  why: |
+    § showcase_workspace aggregate — additively document the new columns, the
+    six story-slot schemas, the config_schema_version semantics, and restate
+    that replayed_from_workspace_id is a soft reference (no FK). This is the
+    umbrella's junk-drawer risk mitigation — non-optional.
+
+- file: docs/_base/API_CONTRACTS.md
+  why: |
+    The /demo rows + "WebSocket Events (/demo/stream)" section — additive
+    notes for the PATCH endpoint, the new request field, and the response
+    additions, in the established "E1 (#407) — ..." style.
+
+# Issue / initiative context
+- url: https://github.com/w7-mgfcode/ForecastLabAI/issues/407
+  why: The epic this PRP implements (Foundation; frozen column/slot/endpoint contract).
+- url: https://github.com/w7-mgfcode/ForecastLabAI/issues/406
+  why: Umbrella — success criteria, out-of-scope list, risk table (junk-drawer mitigation = config_schema_version + documented slot schema).
+
+# Exemplar PRPs (style + validation-gate conventions)
+- file: PRPs/PRP-showcase-workspace-E1-persistence-backbone.md
+  why: Closest analog — created the table this PRP extends; task style, gates, anti-patterns.
+- file: PRPs/PRP-showcase-workspace-E4-restore-replay.md
+  why: Replay flow context — verbatim re-submission through the WS path; original row never mutated.
+```
+
+### Current Codebase tree (relevant subset)
+
+```bash
+app/features/demo/
+├── models.py          # ShowcaseWorkspace @37 (16 columns today)
+├── workspace.py       # create @46 / finalize @106 / get @158 / list @174 / delete @199 / count @224
+├── schemas.py         # DemoRunRequest @29; WorkspaceListItem @169; WorkspaceDetailResponse @192
+├── routes.py          # GET list @80; GET detail @110; DELETE @138; POST /run @51; WS @166
+├── pipeline.py        # keep-branch create hook @2652; finalize hook @2741 (NO E1 changes)
+├── service.py         # (NO E1 changes)
+└── tests/             # conftest, test_models, test_workspace, test_schemas, test_routes, test_pipeline
+alembic/
+├── env.py             # demo models import already present @19
+└── versions/          # head: 324a2fa37fcc
+frontend/src/
+├── pages/showcase.tsx # handleReplayWorkspace @174
+└── types/api.ts       # DemoRunRequest @778
+```
+
+### Desired Codebase tree (files added/modified)
+
+```bash
+app/features/demo/
+├── models.py                # MOD — +12 columns, +2 indexes, extended docstring
+├── schemas.py               # MOD — DemoRunRequest +replayed_from_workspace_id (+validator);
+│                            #       NEW WorkspaceUpdateRequest; ListItem/Detail additive fields
+├── workspace.py             # MOD — create_workspace records replayed_from; NEW update_workspace
+├── routes.py                # MOD — PATCH /demo/workspaces/{workspace_id}
+└── tests/
+    ├── test_schemas.py      # MOD — new-field + WorkspaceUpdateRequest unit tests
+    ├── test_models.py       # MOD — column defaults, tags containment, slot roundtrip (integration)
+    ├── test_workspace.py    # MOD — replayed_from recording; update_workspace semantics (integration)
+    └── test_routes.py       # MOD — PATCH 200/404/422 (+ list/detail field passthrough)
+alembic/versions/<rev>_add_showcase_workspace_metadata_provenance.py   # NEW
+frontend/src/types/api.ts    # MOD — +replayed_from_workspace_id?: string
+frontend/src/pages/showcase.tsx  # MOD — one start-frame key in handleReplayWorkspace
+docs/_base/API_CONTRACTS.md  # MOD — additive contract notes
+docs/_base/DOMAIN_MODEL.md   # MOD — columns + documented story-slot schemas
+```
+
+### Known Gotchas & Library Quirks
+
+```python
+# CRITICAL — forward-only migrations: down_revision = "324a2fa37fcc" (verified
+#   `uv run alembic heads` → 324a2fa37fcc, 2026-06-12). NEVER edit the merged
+#   create-table migration. Revision ids are hand-written 12-hex continuing the
+#   chain (or keep an `alembic revision -m ...` generated id).
+
+# CRITICAL — every new NOT NULL column needs a server_default or the migration
+#   fails on tables with existing rows: archived/pinned text("false"),
+#   config_schema_version text("1"), tags text("'[]'::jsonb"). All six story
+#   slots + notes + replayed_from_workspace_id are nullable (no default needed).
+
+# CRITICAL — strict-mode policy: WorkspaceUpdateRequest and the new
+#   DemoRunRequest field are all JSON-native (str/bool/list[str]) → NO
+#   Field(strict=False) override. The AST walker
+#   (app/core/tests/test_strict_mode_policy.py) only fires on
+#   date/datetime/time/UUID/Decimal — nothing here triggers it.
+
+# CRITICAL — do NOT add extra="forbid" to DemoRunRequest (unknown-key tolerance
+#   is the WS forward/backward-compat contract, routes.py:182). DO add it to
+#   WorkspaceUpdateRequest (HTTP-only body; typo'd PATCH fields must 422, not
+#   silently no-op — RunUpdate precedent).
+
+# CRITICAL — JSONB change detection: always ASSIGN whole values
+#   (row.tags = [...]), never mutate in place (row.tags.append(...)) — in-place
+#   mutation is invisible to SQLAlchemy without flag_modified. The existing
+#   finalize_workspace assigns; keep that style in update_workspace.
+
+# GOTCHA — SQLAlchemy reserves the declarative attr name `metadata`
+#   (demo/models.py docstring). None of the new names collide — keep it that way.
+
+# GOTCHA — `status` stays out of WorkspaceUpdateRequest; the CHECK constraint
+#   ck_showcase_workspace_status is untouched. `archived` is orthogonal.
+
+# GOTCHA — update_workspace is caller-owned-session + raises normally (it backs
+#   an HTTP route). Do NOT wrap it in the warn-and-continue pattern — that
+#   contract is for the PIPELINE-scoped create/finalize only.
+
+# GOTCHA — repo has mixed CRLF/LF line endings; run `git diff --stat` before
+#   committing — Edit/Write emit LF, so verify schema/route/model diffs are
+#   surgical, not whole-file noise.
+
+# GOTCHA — frontend type gate: `pnpm tsc --noEmit` is vacuous (solution-style
+#   tsconfig checks zero files) and `pnpm tsc -b` already fails on dev with
+#   pre-existing errors. Gate on "no NEW errors vs the dev baseline" +
+#   `pnpm lint` + `pnpm test --run`.
+
+# GOTCHA — mypy --strict AND pyright --strict gate merge: full annotations incl.
+#   `-> None` on tests and typed fixtures.
+
+# CONVENTION — branch: feat/showcase-completion-e1-metadata-provenance (off dev).
+#   Commits reference #407, e.g. `feat(db): ... (#407)` for the migration,
+#   `feat(api): ... (#407)` for slice code, `feat(ui): ... (#407)` for the
+#   replay wiring (or `feat(api,ui)` if combined). NO AI trailer (hook-enforced).
+
+# RUNTIME-VERIFICATION LOG (per prp-create step 3 — re-run on library upgrade):
+#   1. `uv run alembic heads` → 324a2fa37fcc (2026-06-12).
+#   2. Pydantic exclude_unset distinguishes absent vs explicit-null, pattern
+#      constraint skips the None arm of `str | None`, extra="forbid" 422s
+#      unknown keys, strict=True accepts list[str] and rejects a bare str:
+#      uv run python -c "
+#      from pydantic import BaseModel, ConfigDict, Field
+#      class P(BaseModel):
+#          model_config = ConfigDict(strict=True, extra='forbid')
+#          name: str | None = Field(default=None, max_length=100, pattern=r'^[a-z0-9][a-z0-9\-_]*$')
+#          notes: str | None = Field(default=None, max_length=2000)
+#          tags: list[str] | None = Field(default=None, max_length=20)
+#      p = P.model_validate({'notes': None}); assert p.model_fields_set == {'notes'}
+#      assert p.model_dump(exclude_unset=True) == {'notes': None}
+#      assert P.model_validate({'name': None}).name is None        # null clears
+#      assert P.model_validate({'tags': ['a','b']}).tags == ['a','b']
+#      "
+#      → verified on pydantic in-repo (2026-06-12).
+#   3. SQLAlchemy 2.0.46: Boolean/Integer/JSONB server_default DDL compiles as
+#      expected (`DEFAULT false NOT NULL`, `DEFAULT 1 NOT NULL`,
+#      `DEFAULT '[]'::jsonb NOT NULL`):
+#      uv run python -c "import sqlalchemy as sa; from sqlalchemy.dialects import postgresql; from sqlalchemy.schema import CreateTable; md=sa.MetaData(); t=sa.Table('x',md, sa.Column('archived',sa.Boolean(),nullable=False,server_default=sa.text('false')), sa.Column('v',sa.Integer(),nullable=False,server_default=sa.text('1')), sa.Column('tags',postgresql.JSONB(),nullable=False,server_default=sa.text(\"'[]'::jsonb\"))); print(CreateTable(t).compile(dialect=postgresql.dialect()))"
+#      → verified (2026-06-12).
+#   4. JSONB .contains() containment is already production code in this repo
+#      (scenarios/service.py:464) — no external claim to probe.
+```
+
+## Implementation Blueprint
+
+### Data models and structure
+
+```python
+# app/features/demo/models.py — ADD after result_summary (line 81), keep the
+# existing __table_args__ entries and append the two new indexes.
+
+    # ── E1 (#407) — lifecycle metadata ────────────────────────────────────
+    # Orthogonal to `status` (which the pipeline owns): archive/pin are
+    # operator curation flags, PATCH-mutable, default false.
+    archived: Mapped[bool] = mapped_column(
+        nullable=False, default=False, server_default=text("false")
+    )
+    pinned: Mapped[bool] = mapped_column(
+        nullable=False, default=False, server_default=text("false")
+    )
+    # Free-text operator annotation; length capped at the Pydantic boundary (2000).
+    notes: Mapped[str | None] = mapped_column(Text, nullable=True)
+    # Queryable JSONB string array — EXACT scenario_plan.tags pattern
+    # (app/features/scenarios/models.py:74-76); GIN-indexed below.
+    tags: Mapped[list[str]] = mapped_column(
+        JSONB, nullable=False, default=list, server_default=text("'[]'::jsonb")
+    )
+    # Version of the workspace config + story-slot schema (umbrella #406
+    # junk-drawer mitigation). Bump the ORM default when a slot shape changes.
+    config_schema_version: Mapped[int] = mapped_column(
+        Integer, nullable=False, default=1, server_default=text("1")
+    )
+
+    # ── E1 (#407) — replay provenance ─────────────────────────────────────
+    # SOFT reference to the workspace this run replayed (uuid4().hex of the
+    # source row). Deliberately NO ForeignKey — not even self-referential:
+    # ancestor rows must stay independently deletable (metadata-only delete),
+    # and dangling lineage pointers are expected, like every created_objects id.
+    replayed_from_workspace_id: Mapped[str | None] = mapped_column(
+        String(32), nullable=True
+    )
+
+    # ── E1 (#407) — documented JSONB story slots ──────────────────────────
+    # Six dedicated nullable JSONB columns (precedent: created_objects /
+    # result_summary). NULL = "slot never written" (distinct from empty).
+    # E1 writes NONE of them; documented schema per slot (authoritative copy
+    # in docs/_base/DOMAIN_MODEL.md):
+    #   seed_overrides   (E3 #409 writes) — dict: the curated seeder-override
+    #                    payload from the start frame, stored verbatim
+    #                    (model_dump(mode="json")); replay echoes it.
+    #   user_scope       (E3 #409 writes) — dict: operator-selected focus,
+    #                    {"store_id": int, "product_id": int} (additive keys
+    #                    allowed later).
+    #   approval_events  (E5 #411 writes) — list[dict], append-only:
+    #                    {"action_id": str, "tool_name": str,
+    #                     "decision": "approved"|"rejected",
+    #                     "decided_at": iso8601-str, "session_id": str}.
+    #   rag_events       (E5 #411 writes) — list[dict], append-only:
+    #                    {"event": "index"|"retrieve"|"skip", "detail": str,
+    #                     "count": int, "occurred_at": iso8601-str}.
+    #   job_ids          (later parallel epic) — list[str]: job / batch
+    #                    sub-job ids the run submitted (soft references).
+    #   phase_summaries  (later parallel epic) — list[dict], one per phase:
+    #                    {"phase_name": str, "status": "pass"|"fail"|"warn"|"skip",
+    #                     "steps": int, "duration_ms": float}.
+    seed_overrides: Mapped[dict[str, Any] | None] = mapped_column(JSONB, nullable=True)
+    user_scope: Mapped[dict[str, Any] | None] = mapped_column(JSONB, nullable=True)
+    approval_events: Mapped[list[dict[str, Any]] | None] = mapped_column(JSONB, nullable=True)
+    rag_events: Mapped[list[dict[str, Any]] | None] = mapped_column(JSONB, nullable=True)
+    job_ids: Mapped[list[str] | None] = mapped_column(JSONB, nullable=True)
+    phase_summaries: Mapped[list[dict[str, Any]] | None] = mapped_column(JSONB, nullable=True)
+
+    # __table_args__ — APPEND (keep existing CheckConstraint + composite index):
+    #   Index("ix_showcase_workspace_tags_gin", "tags", postgresql_using="gin"),
+    #   Index("ix_showcase_workspace_replayed_from", "replayed_from_workspace_id"),
+    # imports to extend: Text from sqlalchemy (others already imported).
+```
+
+```python
+# app/features/demo/schemas.py — DemoRunRequest addition (after workspace_name,
+# line 78) + validator extension.
+
+    # E1 (#407): replay provenance. The frontend Replay handler sends the
+    # SOURCE row's workspace_id; create_workspace records it verbatim on the
+    # NEW row (soft reference — no existence check). JSON-native str → no
+    # Field(strict=False) needed.
+    replayed_from_workspace_id: str | None = Field(
+        default=None,
+        pattern=r"^[0-9a-f]{32}$",   # uuid4().hex shape of workspace_id
+        description="workspace_id this run replays; requires preservation='keep'.",
+    )
+
+    @model_validator(mode="after")
+    def _replayed_from_requires_keep(self) -> DemoRunRequest:
+        """Reject a lineage pointer on a run that writes no workspace row."""
+        if self.replayed_from_workspace_id is not None and self.preservation != "keep":
+            raise ValueError("replayed_from_workspace_id requires preservation='keep'")
+        return self
+
+
+# NEW request model — place after DemoRunRequest.
+# (add `field_validator` to the pydantic import at schemas.py:14 — the file
+#  currently imports only BaseModel/ConfigDict/Field/model_validator)
+class WorkspaceUpdateRequest(BaseModel):
+    """Partial lifecycle update for PATCH /demo/workspaces/{workspace_id}.
+
+    exclude_unset semantics: only fields present in the body are applied;
+    explicit ``null`` clears ``name`` / ``notes``. Explicit ``null`` on
+    ``archived`` / ``pinned`` / ``tags`` is rejected (422) — they back NOT NULL
+    columns; send ``[]`` to clear tags. ``extra="forbid"`` so a typo'd field
+    422s instead of silently no-opping (RunUpdate precedent,
+    app/features/registry/schemas.py:113). All fields JSON-native -> the
+    model-level strict=True needs no per-field override. ``status`` is
+    deliberately absent — the pipeline owns the run lifecycle.
+    """
+
+    model_config = ConfigDict(strict=True, extra="forbid")
+
+    name: str | None = Field(
+        default=None,
+        max_length=100,
+        pattern=r"^[a-z0-9][a-z0-9\-_]*$",   # same as workspace_name
+        description="Rename the workspace; explicit null clears the label.",
+    )
+    notes: str | None = Field(
+        default=None, max_length=2000,
+        description="Free-text annotation; explicit null clears it.",
+    )
+    tags: list[str] | None = Field(
+        default=None, max_length=20,
+        description="Replace the full tag list (not a merge).",
+    )
+    archived: bool | None = Field(default=None, description="Archive flag.")
+    pinned: bool | None = Field(default=None, description="Pin flag.")
+
+    @field_validator("archived", "pinned", "tags")
+    @classmethod
+    def _reject_explicit_null(cls, v: bool | list[str] | None) -> bool | list[str]:
+        # Fires only on explicitly provided values (pydantic skips validators for
+        # defaults unless validate_default=True), so absent stays None/unset while
+        # an explicit {"archived": null} / {"tags": null} 422s instead of reaching
+        # the NOT NULL column via exclude_unset -> setattr -> IntegrityError 500.
+        # tags: send [] to clear, never null.
+        if v is None:
+            raise ValueError(
+                "archived/pinned accept only true/false and tags accepts a list "
+                "(send [] to clear) — explicit null is not allowed"
+            )
+        return v
+
+
+# Response additions (additive — keep from_attributes, NOT strict):
+# WorkspaceListItem  += archived: bool, pinned: bool, tags: list[str]
+#                       (default_factory=list), replayed_from_workspace_id: str | None
+# WorkspaceDetailResponse += notes: str | None, config_schema_version: int,
+#                       seed_overrides / user_scope: dict[str, Any] | None,
+#                       approval_events / rag_events / phase_summaries:
+#                       list[dict[str, Any]] | None, job_ids: list[str] | None
+```
+
+```python
+# app/features/demo/workspace.py — update_workspace (NEW; caller-owned session,
+# raises normally — this backs an HTTP route, NOT the pipeline).
+async def update_workspace(
+    db: AsyncSession,
+    workspace_id: str,
+    update: WorkspaceUpdateRequest,
+) -> ShowcaseWorkspace | None:
+    """Apply a partial lifecycle update; return the row or None when missing."""
+    row = await get_workspace(db, workspace_id)
+    if row is None:
+        return None
+    changes = update.model_dump(exclude_unset=True)   # absent != explicit null
+    for field, value in changes.items():
+        setattr(row, field, value)                    # whole-value ASSIGNMENT (JSONB gotcha)
+    await db.commit()
+    await db.refresh(row)
+    logger.info("demo.workspace_updated", workspace_id=workspace_id, fields=sorted(changes))
+    return row
+
+# create_workspace — ONE added kwarg in the ShowcaseWorkspace(...) constructor:
+#     replayed_from_workspace_id=req.replayed_from_workspace_id,
+```
+
+```python
+# app/features/demo/routes.py — PATCH route (mirror the DELETE shape @138).
+@router.patch(
+    "/workspaces/{workspace_id}",
+    response_model=WorkspaceDetailResponse,
+    summary="Update a saved showcase workspace's lifecycle metadata",
+    description=(
+        "Partial update: rename / notes / tags / archive / pin. Only fields "
+        "present in the body change; explicit null clears name/notes. The run "
+        "lifecycle status is not patchable."
+    ),
+)
+async def update_showcase_workspace(
+    workspace_id: str,
+    update: WorkspaceUpdateRequest,
+    db: AsyncSession = Depends(get_db),
+) -> WorkspaceDetailResponse:
+    row = await workspace.update_workspace(db, workspace_id, update)
+    if row is None:
+        raise NotFoundError(message=f"Workspace not found: {workspace_id}")
+    return WorkspaceDetailResponse.model_validate(row)
+```
+
+### List of tasks (dependency order)
+
+```yaml
+Task 1 — branch & issue hygiene:
+  RUN: git switch dev && git pull && git switch -c feat/showcase-completion-e1-metadata-provenance
+  VERIFY: gh issue view 407 --json state   # open
+  NOTE: git status shows untracked docker-compose.lan.yml on this host — leave it alone.
+
+Task 2 — MODIFY app/features/demo/models.py:
+  - ADD the 12 columns per the blueprint (lifecycle block, provenance column, six slots)
+  - ADD `Text` to the sqlalchemy import line (others already imported)
+  - APPEND the two indexes to __table_args__ (tags GIN + replayed_from btree)
+  - EXTEND the module docstring: replayed_from_workspace_id is a soft reference
+    (no FK, not even self-referential); story slots NULL until their writer epic lands
+  - PRESERVE: existing columns, constants, CheckConstraint, composite index — untouched
+
+Task 3 — CREATE alembic/versions/<rev>_add_showcase_workspace_metadata_provenance.py:
+  - down_revision = "324a2fa37fcc"
+  - MIRROR: bb8c4587ef1d_add_scenario_library_columns.py (add_column + GIN + downgrade order)
+  - upgrade(): op.add_column x12 (server_defaults: archived/pinned text("false"),
+    config_schema_version text("1"), tags text("'[]'::jsonb"); the rest nullable),
+    then op.create_index("ix_showcase_workspace_tags_gin", ..., postgresql_using="gin")
+    and op.create_index("ix_showcase_workspace_replayed_from", ...)
+  - downgrade(): drop the two indexes (GIN drop with postgresql_using="gin",
+    matching bb8c4587ef1d:50), then drop the 12 columns in reverse order
+  - VERIFY: docker compose up -d &&
+    uv run alembic upgrade head && uv run alembic downgrade -1 && uv run alembic upgrade head
+
+Task 4 — MODIFY app/features/demo/schemas.py:
+  - ADD DemoRunRequest.replayed_from_workspace_id + _replayed_from_requires_keep
+    validator (blueprint); UPDATE the docstring sentence listing JSON-native fields
+  - ADD WorkspaceUpdateRequest (blueprint) — placed after DemoRunRequest
+  - EXTEND WorkspaceListItem (+archived/pinned/tags/replayed_from_workspace_id)
+    and WorkspaceDetailResponse (+notes/config_schema_version/six slots) additively
+
+Task 5 — MODIFY app/features/demo/workspace.py:
+  - create_workspace: add replayed_from_workspace_id=req.replayed_from_workspace_id
+    to the ShowcaseWorkspace(...) constructor (one line; warn-and-continue untouched)
+  - ADD update_workspace (blueprint) + the WorkspaceUpdateRequest import
+  - UPDATE module docstring routing note (PATCH now routed too)
+
+Task 6 — MODIFY app/features/demo/routes.py:
+  - ADD the PATCH route (blueprint) between GET detail and DELETE
+  - ADD WorkspaceUpdateRequest to the schemas import block
+  - UPDATE the module docstring endpoint list
+
+Task 7 — MODIFY frontend (two additive lines):
+  - frontend/src/types/api.ts DemoRunRequest (@778): add
+    `// E1 (#407) — replay provenance: the source workspace_id a Replay re-runs.`
+    `replayed_from_workspace_id?: string`
+  - frontend/src/pages/showcase.tsx handleReplayWorkspace start() call (@179-185):
+    add `replayed_from_workspace_id: ws.workspace_id,`
+  - DO NOT touch handleLoadWorkspace (Load is read-only) or WorkspacePanel
+
+Task 8 — tests (full matrix in Validation Loop):
+  - MODIFY tests/test_schemas.py   (unit)
+  - MODIFY tests/test_models.py    (@pytest.mark.integration)
+  - MODIFY tests/test_workspace.py (@pytest.mark.integration)
+  - MODIFY tests/test_routes.py    (PATCH 200/404/422; unit-shaped via monkeypatched
+    workspace.update_workspace where the existing file does so, integration otherwise —
+    follow whichever convention the existing GET/DELETE tests use)
+
+Task 9 — docs (additive):
+  - docs/_base/API_CONTRACTS.md:
+    * NEW row: `demo | PATCH | /demo/workspaces/{workspace_id} | E1 (#407) — partial
+      lifecycle update (name/notes/tags/archived/pinned; exclude_unset, explicit null
+      clears name/notes; status NOT patchable); 404 problem+json when missing; 422 on
+      unknown keys / bad name pattern / >20 tags; empty body = 200 no-op`
+    * POST /demo/run row + WS /demo/stream section: additive Optional
+      `replayed_from_workspace_id` (`^[0-9a-f]{32}$`, requires preservation='keep');
+      Replay now sends it; recorded verbatim as a soft reference
+    * GET /demo/workspaces rows: note the additive response fields
+  - docs/_base/DOMAIN_MODEL.md § showcase_workspace:
+    * Stored metadata: add lifecycle columns + config_schema_version semantics
+    * JSONB fields: add the six story slots WITH their documented schemas (copy the
+      model-comment schemas verbatim — this is the authoritative copy)
+    * Invariants: replayed_from_workspace_id is a SOFT reference (no FK, dangles OK);
+      status not patchable; archived orthogonal to status
+    * Trim the "Out of scope" line that lists `replayed_from` as not-modeled (now shipped)
+  - docs/_base/RUNBOOKS.md § Showcase workspace: remove `replayed_from` from the
+    "Explicitly out of scope" list (one-line edit; the full runbook sweep is E7)
+
+Task 10 — gates, commit, PR:
+  - RUN the full Validation Loop (Levels 1-4)
+  - git diff --stat   # surgical diffs only (CRLF noise check)
+  - COMMITS (reference #407, no AI trailer), e.g.:
+      feat(db): extend showcase_workspace with metadata and provenance columns (#407)
+      feat(api): add workspace patch lifecycle endpoint and replay provenance (#407)
+      feat(ui): send replayed_from_workspace_id on showcase replay (#407)
+      docs(repo): document workspace story slots and patch contract (#407)
+  - PR into dev; title `feat(api,db): showcase-completion E1 — workspace metadata & provenance backbone (#407)`
+```
+
+### Integration Points
+
+```yaml
+DATABASE:
+  - migration: 12 add_column on showcase_workspace + ix_showcase_workspace_tags_gin (GIN)
+    + ix_showcase_workspace_replayed_from (btree); clean downgrade
+  - registration: alembic/env.py already imports demo models (line 19) — NO change
+
+CONFIG: none — no new settings, no env vars.
+
+ROUTES: PATCH /demo/workspaces/{workspace_id} on the existing demo router — no
+  app/main.py change (router already wired).
+
+PIPELINE: none — create_workspace reads the new field straight off req; the
+  keep-branch hook (pipeline.py:2652) and finalize hook (2741) are untouched.
+
+FRONTEND: two additive lines (Task 7). No new components; lineage badge/chain is E2.
+
+DOCS: API_CONTRACTS + DOMAIN_MODEL (+ one-line RUNBOOKS trim). Full sweep is E7.
+```
+
+## Validation Loop
+
+### Level 1: Syntax & Style
+
+```bash
+uv run ruff check . && uv run ruff format --check .
+uv run mypy app/ && uv run pyright app/
+# Expected: clean. Both type checkers are --strict and gate merge.
+```
+
+### Level 2: Unit Tests (no DB)
+
+```python
+# tests/test_schemas.py — add:
+def test_demo_run_request_replayed_from_default_none() -> None: ...
+    # DemoRunRequest() -> replayed_from_workspace_id is None; legacy frame
+    # model_validate({"seed": 7}) still validates
+
+def test_demo_run_request_replayed_from_json_path() -> None: ...
+    # MANDATORY json-dict path (security-patterns.md § strict mode):
+    # model_validate({"preservation": "keep", "replayed_from_workspace_id": "a"*32})
+
+def test_demo_run_request_replayed_from_requires_keep() -> None: ...
+    # pytest.raises(ValidationError): model_validate({"replayed_from_workspace_id": "a"*32})
+
+def test_demo_run_request_replayed_from_pattern_rejected() -> None: ...
+    # "not-hex!", "ABC..." (uppercase), 31-char and 33-char values all raise
+
+def test_workspace_update_request_partial_fields_set() -> None: ...
+    # model_validate({"notes": None}).model_dump(exclude_unset=True) == {"notes": None}
+    # model_validate({}).model_dump(exclude_unset=True) == {}
+
+def test_workspace_update_request_rejects_unknown_key() -> None: ...
+    # model_validate({"status": "archived"}) raises (extra="forbid" — status not patchable)
+
+def test_workspace_update_request_name_pattern_and_tags_cap() -> None: ...
+    # "Bad Name!" raises; 21 tags raises; ["workspace:x", "demo"] passes
+
+def test_workspace_update_request_rejects_explicit_null_flags() -> None: ...
+    # pytest.raises(ValidationError): model_validate({"archived": None})
+    # pytest.raises(ValidationError): model_validate({"pinned": None})
+    # pytest.raises(ValidationError): model_validate({"tags": None})
+    # model_validate({"tags": []}) passes (the sanctioned clear path)
+    # (NOT NULL columns — explicit null must 422, never reach setattr)
+
+# tests/test_routes.py — add (follow the file's existing GET/DELETE conventions):
+async def test_patch_workspace_happy_path(...) -> None: ...
+    # PATCH {"name": "renamed", "pinned": true, "tags": ["t1"]} -> 200; response
+    # echoes the changes and the untouched fields
+async def test_patch_workspace_missing_404_problem_json(...) -> None: ...
+    # status 404; content-type application/problem+json
+async def test_patch_workspace_unknown_field_422(...) -> None: ...
+    # body {"bogus": 1} -> 422 problem+json
+async def test_patch_workspace_explicit_null_archived_422(...) -> None: ...
+    # body {"archived": null} -> 422 problem+json (NOT NULL column guard)
+async def test_patch_workspace_empty_body_noop_200(...) -> None: ...
+async def test_run_demo_rejects_replayed_from_without_keep_422(...) -> None: ...
+```
+
+```bash
+uv run pytest app/features/demo -v -m "not integration"
+uv run pytest app/core/tests/test_strict_mode_policy.py -v   # AST walker still green
+```
+
+### Level 3: Integration (real Postgres)
+
+```python
+# tests/test_models.py — @pytest.mark.integration, extend:
+#   - insert with NO new kwargs -> archived=False, pinned=False, tags=[],
+#     config_schema_version=1, all six slots None, replayed_from None
+#     (server_default + ORM default agreement)
+#   - tags JSONB roundtrip + containment: insert tags=["workspace:x","demo"];
+#     select(...).where(ShowcaseWorkspace.tags.contains(["demo"])) finds it
+#     (scenarios/service.py:464 query shape)
+#   - story-slot roundtrip: write a dict into seed_overrides and a list[dict]
+#     into approval_events; read back identical
+#   - status CHECK still enforced (regression — constraint untouched)
+
+# tests/test_workspace.py — @pytest.mark.integration, extend:
+#   - create_workspace with req.replayed_from_workspace_id set -> column recorded
+#     verbatim; without it -> None (legacy identical)
+#   - update_workspace partial: set name+pinned only -> other fields untouched;
+#     explicit name=None clears; tags replaced whole (not merged);
+#     missing workspace_id -> returns None (route maps to 404)
+#   - update_workspace empty request -> no-op, row returned
+```
+
+```bash
+docker compose up -d
+uv run alembic upgrade head
+uv run alembic downgrade -1 && uv run alembic upgrade head   # downgrade is clean
+uv run pytest app/features/demo -v -m integration
+```
+
+### Level 4: Manual smoke (seeded local stack, uvicorn on :8123 + vite)
+
+```bash
+# 1. Keep-run, then PATCH lifecycle round-trip:
+curl -s -X POST http://localhost:8123/demo/run -H 'Content-Type: application/json' \
+  -d '{"skip_seed": true, "preservation": "keep", "workspace_name": "e1-smoke"}' \
+  | python3 -c "import sys,json; print(json.load(sys.stdin)['workspace_id'])"
+WS=<that id>
+curl -s -X PATCH http://localhost:8123/demo/workspaces/$WS \
+  -H 'Content-Type: application/json' \
+  -d '{"name": "e1-renamed", "notes": "smoke", "tags": ["smoke"], "pinned": true}' | python3 -m json.tool
+curl -s -X PATCH http://localhost:8123/demo/workspaces/deadbeef -H 'Content-Type: application/json' -d '{}' \
+  | python3 -m json.tool    # 404 problem+json
+
+# 2. Replay provenance (browser): /showcase -> Saved workspaces -> Replay on
+#    the e1-renamed row; after the run:
+docker exec forecastlab-postgres psql -U forecastlab -d forecastlab -c \
+  "SELECT workspace_id, name, replayed_from_workspace_id FROM showcase_workspace ORDER BY created_at DESC LIMIT 2;"
+# Expect: newest row's replayed_from_workspace_id == $WS; the $WS row unchanged.
+
+# 3. Frontend gates:
+cd frontend && pnpm lint && pnpm test --run
+# pnpm tsc -b — confirm no NEW errors vs the dev baseline (gate is vacuous-aware,
+# see Known Gotchas).
+```
+
+## Final validation Checklist
+
+- [ ] All five gates green: `uv run ruff check . && uv run ruff format --check . && uv run mypy app/ && uv run pyright app/ && uv run pytest -v -m "not integration"`
+- [ ] Integration suite green: `uv run pytest -v -m integration` (fresh docker-compose DB; reset first if the shared DB is polluted)
+- [ ] Migration upgrade + downgrade clean on a fresh DB AND applies on a DB with existing workspace rows
+- [ ] Legacy surfaces byte-identical: start frame without new keys, GET list/detail for old rows (new fields all default/null), `test_strict_mode_policy.py` green
+- [ ] PATCH 200 / 404 / 422 paths verified (Level 2 + Level 4)
+- [ ] Replay records `replayed_from_workspace_id`; source row untouched (Level 4 step 2)
+- [ ] `git diff --stat` shows surgical diffs (no CRLF whole-file noise)
+- [ ] docs/_base/API_CONTRACTS.md + DOMAIN_MODEL.md updated additively (slot schemas documented); RUNBOOKS out-of-scope line trimmed
+- [ ] Commits `feat(db)/feat(api)/feat(ui)/docs(repo): ... (#407)`, no AI trailer; PR into dev
+
+---
+
+## Anti-Patterns to Avoid
+
+- ❌ Don't add ANY ForeignKey — not even self-referential on `replayed_from_workspace_id`. Soft references only.
+- ❌ Don't edit `324a2fa37fcc_create_showcase_workspace_table.py` — new revision off head `324a2fa37fcc`.
+- ❌ Don't make `status` patchable or widen `ck_showcase_workspace_status` — `archived` is the orthogonal flag.
+- ❌ Don't add `extra="forbid"` to `DemoRunRequest` (WS compat) — but DO add it to `WorkspaceUpdateRequest`.
+- ❌ Don't write any story slot from E1 production code — columns + docs + roundtrip tests only.
+- ❌ Don't validate that `replayed_from_workspace_id` points at an existing row — it's a soft reference; dangles are designed.
+- ❌ Don't wrap `update_workspace` in warn-and-continue — that contract is pipeline-only; HTTP helpers raise.
+- ❌ Don't add list filtering/sorting/search or archive-hiding — that's E2 (#408).
+- ❌ Don't add a replay confirmation dialog or lineage UI — E2 (#408).
+- ❌ Don't mutate JSONB values in place — always assign whole values.
+- ❌ Don't import another feature slice from `app/features/demo/` — core/shared only.
+
+## Notes for parallel-epic PRP authors (#408–#412)
+
+- The column set, slot names, and per-slot schemas above are the frozen E1 contract.
+  `job_ids` / `phase_summaries` have a documented schema but NO assigned writer in
+  E1 — E2 (#408, health summary) and E4 (#410, config echo) should agree on which
+  populates which and follow the documented shapes.
+- Slot writes that happen DURING a pipeline run inherit the warn-and-continue
+  invariant (extend `finalize_workspace` / add sibling helpers in `workspace.py`);
+  slot writes via HTTP go through caller-owned-session helpers like
+  `update_workspace`.
+- Tag filtering on `GET /demo/workspaces` (E2) should reuse the
+  `ShowcaseWorkspace.tags.contains([...])` containment shape proven in E1's
+  integration test, mirroring `GET /scenarios?tags=` (scenarios/routes.py:180).
+- A schema change to any slot bumps `config_schema_version` (ORM default) and
+  documents the delta in DOMAIN_MODEL.
+
+## Confidence Score
+
+**9/10** for one-pass implementation success. Every element has a verified in-repo
+precedent: the add-columns+GIN migration (`bb8c4587ef1d`), the tags column
+(`scenarios/models.py:74`), the partial-update schema (`registry RunUpdate`), the
+404-on-missing route shape (the demo DELETE), and the request-field+validator pattern
+(`workspace_name`, same file). The three judgment calls (tags representation, slot
+shape, no-FK soft reference) are resolved and frozen above, and all changes are
+additive — a wrong slot-schema guess costs a documented `config_schema_version` bump,
+not a rework. The −1: the PATCH route tests must match whichever
+unit-vs-integration convention `test_routes.py` currently uses for the workspace
+GET/DELETE endpoints (read it first), and the frontend type-gate baseline is fuzzy
+on this host (`tsc -b` has pre-existing dev failures — gate on "no NEW errors").
diff --git a/PRPs/PRP-showcase-completion-E2-safe-replay-lifecycle.md b/PRPs/PRP-showcase-completion-E2-safe-replay-lifecycle.md
new file mode 100644
index 00000000..6adfe994
--- /dev/null
+++ b/PRPs/PRP-showcase-completion-E2-safe-replay-lifecycle.md
@@ -0,0 +1,1247 @@
+name: "PRP — Showcase Completion E2: Safe Replay & Workspace Lifecycle (issue #408)"
+description: |
+
+## Purpose
+
+Implement the safe-replay + workspace-lifecycle epic of the showcase-completion
+initiative (umbrella #406): an explicit confirmation step (with preview/diff)
+before every replay — destructive copy when `reset=true` — lineage rendering of
+the E1 `replayed_from_workspace_id` chain, full lifecycle management on the
+saved-workspaces panel (rename / archive / pin / notes / tags / search /
+filter / sort / multi-select delete), a two-workspace compare view, and the
+folded-in ops slice: artifact-link liveness checks with dead-link warnings on
+soft references plus a per-workspace health summary (partial-run warning
+included). Parallel epic after Foundation E1 (#407) — **execution starts only
+AFTER E1 merges**; this PRP treats E1's epic body as a frozen contract (every
+dependency on it is tagged `CONTRACT(E1)` below).
+
+## Core Principles
+
+1. **Context is King**: every reference below was verified against the live code on 2026-06-12 (branch `dev`, post-#404/#405 merge — E1 #407 NOT yet merged; see the E1-reconciliation task).
+2. **Validation Loops**: each level is executable as written.
+3. **Information Dense**: patterns cite exact file:line.
+4. **Progressive Success**: backend list-filters + health endpoint → frontend types/hooks → confirm/diff dialog → lifecycle panel rework → lineage → compare page → docs.
+5. **Global rules**: follow CLAUDE.md / AGENTS.md; all five CI gates must pass; UI work follows `.claude/rules/ui-design.md` + `.claude/rules/shadcn-ui.md`.
+
+---
+
+## Goal
+
+An operator on `/showcase` can:
+
+- (a) **Replay safely** — clicking Replay opens a confirmation dialog showing a
+  preview/diff: the recorded config (seed / scenario / reset / skip_seed /
+  name) side-by-side with the exact `DemoRunRequest` about to be sent, any
+  divergence highlighted. When the recorded config has `reset=true`, the
+  dialog carries explicit destructive copy ("Replaying this workspace WIPES
+  the database") and a destructive-styled confirm button. No replay starts
+  without confirmation.
+- (b) **See lineage** — a workspace created by a replay carries a "replay"
+  badge in the list; the loaded-workspace view renders the
+  `replayed_from_workspace_id` chain (newest → original), with dangling
+  ancestors (deleted rows) marked rather than erroring.
+- (c) **Manage the library** — per-row actions: rename, edit notes, edit tags,
+  pin/unpin, archive/unarchive (all via the E1 `PATCH /demo/workspaces/{id}`),
+  plus the existing single delete. The list gains a search box (name), a
+  show-archived toggle (archived hidden by default), a tag filter, and an
+  allow-listed sort; pinned rows always sort first.
+- (d) **Multi-select delete** — checkbox per row, "Delete selected (N)" behind
+  one confirmation dialog, implemented as N sequential single
+  `DELETE /demo/workspaces/{id}` calls. **No new bulk endpoint** (metadata-only
+  singles; vision-compatible — no "wipe everything" operation).
+- (e) **Compare two workspaces** — select exactly two rows → Compare navigates
+  to a new deep-linkable page (`/showcase/compare?a=&b=`) mirroring the
+  run-compare two-picker pattern: config diff, result-summary diff (winner /
+  WAPE delta / wall-clock), created-objects presence matrix, lineage relation.
+- (f) **See link health** — loading a workspace probes its soft references
+  (model runs, scenario plans, alias, batch, agent session, E1 `job_ids`)
+  through a new backend aggregation endpoint
+  `GET /demo/workspaces/{id}/health`; dead references render a warning marker
+  on the artifact cards and a per-workspace health summary chip shows
+  alive/dead counts plus a partial-run warning when the run never completed.
+
+**Deliverable** (all additive — no migration in E2; the schema delta is E1's):
+
+- `app/features/demo/workspace.py` — `list_workspaces` / `count_workspaces`
+  gain filter/sort parameters (`q`, `tags`, `include_archived`, `sort_by`,
+  `sort_order`; pinned-first ordering).
+- `app/features/demo/link_health.py` — NEW: in-process soft-reference probe
+  module (httpx `ASGITransport`, mirroring `pipeline._Client`).
+- `app/features/demo/schemas.py` — `WorkspaceRefHealth`,
+  `WorkspaceHealthResponse` response models (plain BaseModel, NOT strict).
+- `app/features/demo/routes.py` — query params on `GET /demo/workspaces`;
+  NEW `GET /demo/workspaces/{workspace_id}/health`.
+- `frontend/src/types/api.ts` — lifecycle fields on the workspace types
+  (verify-or-add per CONTRACT(E1)), health types, list-params type,
+  `WorkspaceUpdate` type.
+- `frontend/src/hooks/use-workspaces.ts` — params-aware `useWorkspaces`,
+  `usePatchWorkspace`, `useWorkspaceHealth`, `useWorkspaceLineage`.
+- `frontend/src/components/demo/ReplayConfirmDialog.tsx` — NEW confirm +
+  preview/diff dialog.
+- `frontend/src/components/demo/WorkspaceEditDialog.tsx` — NEW
+  rename/notes/tags editor.
+- `frontend/src/components/demo/WorkspaceLineageStrip.tsx` — NEW lineage chain.
+- `frontend/src/components/demo/WorkspacePanel.tsx` — reworked: toolbar
+  (search / show-archived / sort), row badges (pinned, archived, replay),
+  per-row actions dropdown, multi-select + delete-selected + compare-selected.
+- `frontend/src/components/demo/WorkspaceArtifactsPanel.tsx` — health-aware
+  cards (dead-link warnings) + health summary chip.
+- `frontend/src/pages/workspace-compare.tsx` — NEW two-workspace compare page;
+  route + `ROUTES.SHOWCASE_COMPARE` constant.
+- `frontend/src/pages/showcase.tsx` — replay-confirm flow, lineage strip +
+  health wiring, `replayed_from_workspace_id` on the replay start frame.
+- Tests: backend route + module unit tests, integration tests for list filters
+  and health; frontend vitest for every new/changed component + hook.
+- `docs/_base/API_CONTRACTS.md` + `docs/_base/RUNBOOKS.md` — additive updates
+  (incl. superseding the "deliberately no confirm dialog" note).
+
+**Success definition**: all Success Criteria below check off, the five backend
+CI gates and the frontend gates are green, and a manual browser dogfood on a
+seeded stack walks: save → search/sort → rename/pin/archive → replay (confirm
+dialog with diff, destructive variant on a reset workspace) → lineage chain
+visible → two-workspace compare → delete a referenced run → health shows the
+dead link.
+
+## Why
+
+- Umbrella #406 success criteria commit: "a `reset=true` replay requires an
+  explicit confirmation step before it runs" and "Workspaces can be renamed,
+  archived, pinned, annotated (notes/tags), searched, filtered, sorted, and
+  multi-select-deleted (metadata-only) from the saved-workspaces panel".
+- Today a replay of a `reset=true` workspace wipes the database with **no
+  confirmation** — documented designed behavior
+  (`docs/_base/RUNBOOKS.md` § "Showcase workspace", item 1: "there is
+  deliberately no confirm dialog") that #406 explicitly reverses.
+- E1 (#407) ships the storage + PATCH surface but no UI consumes it; E2 is the
+  delivery surface that makes lifecycle, lineage, and provenance visible.
+- `created_objects` ids are soft references by design — operator deletes leave
+  dangling deep links ("expected; the workspace row records what WAS created,
+  not what still exists", RUNBOOKS § Showcase workspace item 4). Link health
+  turns that silent staleness into a visible, per-workspace signal — the novel
+  ops slice #406 folded into this epic.
+
+## What
+
+### Decisions locked here (so implementation doesn't re-litigate)
+
+These were the open questions this PRP owns; the decisions below are final for E2.
+
+1. **Replay-policy picker (exact / safe-keep / modified): OUT OF SCOPE.**
+   Replay stays verbatim (`E4 #393` semantics). Rationale: the umbrella
+   commits only confirm + preview/diff; a "modified replay" already exists as
+   Load → edit controls → Run (the Load path repopulates every control); a
+   policy enum would add request-surface + backend validation for zero new
+   capability. The confirm dialog's footer carries a one-line hint —
+   "Want to change the config first? Use Load instead." Document the
+   deferral in the PR description.
+2. **Confirmation applies to EVERY replay, not just `reset=true`.** The
+   preview/diff panel needs a pre-flight surface and a sometimes-there dialog
+   is worse UX than an always-there one. The `reset=true` variant escalates:
+   destructive copy + destructive-styled action button. This satisfies the
+   umbrella's "explicit confirmation before any reset=true replay" as a
+   strict superset. The direct Run button (operator-configured runs) is
+   unchanged — confirmation guards replays only.
+3. **Link-health architecture: BACKEND aggregation endpoint**
+   (`GET /demo/workspaces/{id}/health`), implemented by probing the public
+   API **in-process** via `httpx.ASGITransport` — the exact mechanism
+   `pipeline._Client` already uses from inside a request context
+   (`app/features/demo/pipeline.py:141-148`; `POST /demo/run` passes
+   `request.app` into the pipeline at `routes.py:75`). Justification:
+   (a) the demo slice may NOT import registry/scenarios/jobs/agents services
+   (vertical-slice rule), and in-process HTTP through the public surface is
+   the slice's established cross-slice seam; (b) one workspace has up to ~10+
+   references (3 runs + N plans + alias + batch + session + M jobs) — a
+   frontend-probed design costs 1+N browser round-trips per workspace and
+   duplicates existence semantics per artifact type; (c) a backend endpoint
+   gives the health summary a single testable contract and a place for the
+   partial-run flag. Probes run concurrently (`asyncio.gather`), classify
+   2xx→`alive`, 404→`dead`, anything else→`unknown`, and are fetched
+   on-demand (loaded workspace only — never for every list row).
+4. **Compare view: FRONTEND-ONLY page.** A workspace compare is a plain field
+   diff over two already-served `WorkspaceDetail` payloads — no new backend
+   endpoint (contrast: `GET /registry/compare/{a}/{b}` exists because metric
+   diffing has server-side logic). New page `/showcase/compare?a=&b=`
+   mirroring `frontend/src/pages/explorer/run-compare.tsx` (two `Select`
+   pickers + `useSearchParams` deep-linking).
+5. **Multi-select delete = N sequential single DELETEs.** The existing
+   `DELETE /demo/workspaces/{id}` is called once per selected row behind one
+   confirmation dialog. NO new bulk endpoint — product-vision guardrail ("no
+   wipe-everything operations"); failures are collected and toasted, the list
+   refetches once at the end.
+6. **Search/filter/sort: SERVER-SIDE additive query params** on
+   `GET /demo/workspaces`, mirroring established precedents: name search →
+   `dimensions` `search` ILIKE pattern (`app/features/dimensions/routes.py:65`),
+   tags → `scenarios` repeated-`tags` JSONB containment
+   (`app/features/scenarios/routes.py:180`, `service.py:462-465`), sort →
+   allow-listed `sort_by`/`sort_order` with silent fallback to default
+   (`dimensions/routes.py:70-75`). `include_archived=false` is the default
+   (archived rows hidden). Pinned rows always order first
+   (`ORDER BY pinned DESC, <sort>`). Server-side keeps the panel honest as
+   rows accumulate and gives the filter a route-test contract.
+
+### Frozen contract — CONTRACT(E1) (#407 ships these; E2 consumes, never re-decides)
+
+Every assumption below MUST be reconciled against the merged E1 diff before
+implementation (Task 1). Where E1's PRP chose different names, adapt E2's code
+to E1's names — never the reverse.
+
+- `CONTRACT(E1)-1` — `showcase_workspace` columns exist post-migration:
+  `replayed_from_workspace_id` (nullable String(32), soft reference — NO FK,
+  consistent with `models.py` no-FK doctrine), `archived` (bool, default
+  false), `pinned` (bool, default false), `notes` (nullable text), `tags`
+  (JSONB string array, default `[]`), `config_schema_version` (int).
+- `CONTRACT(E1)-2` — `tags` representation is a JSONB string array with a GIN
+  index, mirroring `scenario_plan.tags`
+  (`app/features/scenarios/models.py:74,97`), so SQLAlchemy
+  `.contains([tag])` containment filtering works.
+- `CONTRACT(E1)-3` — `PATCH /demo/workspaces/{workspace_id}` exists with an
+  all-Optional partial-update body (rename/notes/tags/archive/pin — assumed
+  schema name `WorkspaceUpdateRequest`, semantics mirroring registry
+  `RunUpdate`, `app/features/registry/schemas.py:113-121`: absent field =
+  unchanged), returns the updated workspace (assumed
+  `WorkspaceDetailResponse`), 404 problem+json on a missing id.
+- `CONTRACT(E1)-4` — the GET list/detail response schemas expose the new
+  columns (`WorkspaceListItem` += `archived`, `pinned`, `tags`,
+  `replayed_from_workspace_id`; `WorkspaceDetailResponse` += `notes`,
+  `config_schema_version` and the JSONB story slots it serves). **Defensive
+  rule**: if E1 did NOT extend the GET responses, E2 adds the fields
+  additively in Task 3 (they are required reading surface for this epic).
+- `CONTRACT(E1)-5` — replay provenance mechanism: `DemoRunRequest` (and the
+  WS start frame) carries an additive Optional
+  `replayed_from_workspace_id: str | None` that `workspace.create_workspace`
+  persists onto the new row (E1's epic body: "Replay writes
+  `replayed_from_workspace_id`"). NOTE: E1's PRP itself wires the frontend
+  send (handleReplayWorkspace sends `ws.workspace_id` — an E1 success
+  criterion), so E2 PRESERVES the field through the executeReplay refactor
+  rather than adding it; if E1 instead derived it server-side, E2 adapts.
+- `CONTRACT(E1)-6` — the `job_ids` JSONB story slot is a `list[str]` of job
+  ids; the health endpoint probes each via `GET /jobs/{job_id}` when the slot
+  is non-empty (and silently skips when absent/empty — pre-E1-backfill rows).
+- `CONTRACT(E1)-7` — E1 does NOT add filtering/sorting to
+  `GET /demo/workspaces` (its scope is migration + PATCH + schemas); the list
+  query params are E2's to add. If E1's merged code already added any of
+  them, reuse instead of duplicating.
+
+### User-visible behavior
+
+- **Replay confirm/diff**: Replay button → dialog titled "Replay workspace
+  \"name\"?" with a two-column table (Recorded / Will send) over seed,
+  scenario, reset, skip_seed, workspace name, preservation (always `keep`),
+  replayed-from (the source workspace id). Rows where the two values differ
+  are highlighted (defensive — verbatim replay means they normally match).
+  `reset=true` → red warning block + destructive confirm button labeled
+  "Replay & wipe database"; otherwise a default confirm labeled "Replay".
+  Cancel never starts a run.
+- **Lineage**: list rows with `replayed_from_workspace_id != null` show an
+  outline `Badge` "replay". The loaded-workspace view renders a breadcrumb
+  strip: `this ← parent ← grandparent …` (depth-capped at 5), each ancestor
+  clickable (loads it); a deleted ancestor renders as
+  "(original deleted)" — dangling soft references are expected, never errors.
+- **Lifecycle panel**: toolbar = search `Input` (filters by name,
+  debounced/enter-applied), "Show archived" `Checkbox`, sort `Select`
+  (Newest / Oldest / Name / Status). Rows: pin icon (filled when pinned),
+  muted styling + "archived" badge on archived rows, tags rendered as small
+  chips (clicking a chip filters the list by that tag; an active tag filter
+  shows as a clearable chip in the toolbar). Per-row `DropdownMenu` (lucide
+  `MoreHorizontal`): Pin/Unpin, Archive/Unarchive, Edit details…, Delete….
+  "Edit details…" opens `WorkspaceEditDialog` (name input with the
+  `^[a-z0-9][a-z0-9\-_]*$` client validation already used by the run controls,
+  notes `Textarea`, tags comma-separated input).
+- **Multi-select**: leading `Checkbox` per row + header select-all; selection
+  shows "N selected" with **Delete selected** (AlertDialog: "Delete N
+  workspace records? Their created objects are NOT deleted.") and **Compare**
+  (enabled only when exactly 2 selected → navigates to the compare page).
+- **Compare page** (`/showcase/compare?a=&b=`): back-link to `/showcase`, two
+  workspace `Select` pickers (deep-linkable URL params), then: config table
+  (seed/scenario/reset/skip_seed/name/tags, mismatches highlighted),
+  result-summary table (winner, WAPE with the `DeltaCell` sign-only
+  indicator, wall-clock), created-objects presence matrix (per soft-reference
+  key: recorded A / recorded B), lineage note when one side is a replay of
+  the other, partial-run badge per side when `status != "completed"`.
+- **Link health**: loading a workspace fires
+  `GET /demo/workspaces/{id}/health`; the artifacts panel shows a summary
+  chip — `✓ N live · ✕ M dead` (plus "partial run" warning chip when the
+  row's status is not `completed`) — and each card whose reference probed
+  `dead` gets a lucide `AlertTriangle` + tooltip "This object no longer
+  exists — it was deleted after the run." `unknown` references render
+  without a marker (no false alarms on transient 5xx).
+
+### Technical requirements
+
+- All five backend gates green; frontend `pnpm lint && pnpm test --run` green.
+- New/changed endpoints: route tests covering 2xx + at least one error path
+  (`.claude/rules/test-requirements.md`).
+- RFC 7807 for every error path (`NotFoundError` from `app/core/exceptions.py:72`).
+- Response models stay plain `BaseModel` (+`from_attributes` where ORM-built)
+  — strict mode is request-body-only policy (`demo/schemas.py:88-95` precedent).
+- The demo slice imports NO other feature slice — link health goes through
+  in-process HTTP (`request.app` + `ASGITransport`), never a service import.
+- Frontend: TanStack Query for all IO; shadcn/ui new-york primitives only
+  (everything needed is already installed — see gotchas); lucide icons;
+  semantic tokens only (`text-destructive`, `bg-muted`, …) — no raw colors.
+- Legacy behavior byte-identical: a client that never touches the new query
+  params / endpoints sees today's responses (new list params all default to
+  today's semantics EXCEPT archived-hidden — see gotcha on `include_archived`).
+
+### Success Criteria
+
+- [ ] Replay (panel button) always opens the confirm dialog with the
+      recorded-vs-sent preview; confirming a `reset=true` workspace requires
+      the destructive-styled button; Cancel starts nothing. No code path
+      starts a replay without the dialog.
+- [ ] A confirmed replay sends the recorded config verbatim +
+      `preservation="keep"` + the recorded name + `replayed_from_workspace_id`
+      (CONTRACT(E1)-5); the new row carries the provenance id and the list
+      shows its "replay" badge; the loaded view renders the ancestor chain,
+      tolerating deleted ancestors.
+- [ ] Rename / notes / tags / pin / archive each round-trip through
+      `PATCH /demo/workspaces/{id}` and re-render without a manual refresh
+      (query invalidation on list + detail).
+- [ ] `GET /demo/workspaces` supports `q` (name ILIKE), `tags` (repeated,
+      containment), `include_archived` (default false), allow-listed
+      `sort_by`/`sort_order` (unknown → default `created_at desc`); pinned
+      rows order first; `total` respects the active filters; route tests
+      cover each param + the bad-param paths.
+- [ ] Multi-select delete removes N metadata rows via N single DELETEs behind
+      one confirmation; created objects untouched; NO new bulk endpoint exists.
+- [ ] `/showcase/compare?a=&b=` deep-links two workspaces and renders config
+      diff, result diff, created-objects matrix, lineage note, partial-run
+      badges; invalid/missing ids degrade to the picker (no crash).
+- [ ] `GET /demo/workspaces/{id}/health` returns per-reference
+      `alive`/`dead`/`unknown` + counts + `partial_run`; 404 problem+json on a
+      missing workspace; integration test proves a bogus reference probes
+      `dead` and a real one probes `alive`.
+- [ ] Loaded-workspace artifact cards show dead-link warnings + the health
+      summary chip; the partial-run warning renders for non-completed rows.
+- [ ] Legacy list calls (no new params) return archived-free, pinned-first,
+      newest-first pages; all pre-existing demo tests still pass.
+- [ ] `uv run ruff check . && uv run ruff format --check . && uv run mypy app/
+      && uv run pyright app/ && uv run pytest -v -m "not integration"` green;
+      integration suite green; `cd frontend && pnpm lint && pnpm test --run`
+      green.
+
+## Assumptions (no user available — documented, not asked)
+
+1. E1 (#407) merges before E2 execution begins (implementation-order gate from
+   the umbrella). This PRP is authored against pre-E1 `dev`; Task 1
+   reconciles every CONTRACT(E1) point against E1's actual merged shape.
+2. Exact E1 schema/endpoint names (`WorkspaceUpdateRequest`, field names as
+   listed in CONTRACT(E1)) — adapt to E1's real names on divergence.
+3. Archived-by-default-hidden is the correct list semantics (that is what
+   "archive" means for a library); the only consumer of `GET /demo/workspaces`
+   is the Showcase panel (verified — no other frontend or backend caller), so
+   the default-flip is safe.
+4. Health probing is acceptable on-demand-only (loaded workspace), not for
+   every list row — probing N rows × M references on list render would be a
+   self-inflicted thundering herd through the in-process transport.
+5. The lineage chain depth cap of 5 is sufficient (a replay-of-a-replay chain
+   deeper than 5 is a pathological case; the strip renders "…" beyond it).
+6. `sonner` `toast` (already used by `WorkspacePanel.tsx:20`) is the
+   feedback surface for mutation success/failure — no new notification system.
+7. Tag editing via a comma-separated text input is acceptable UX for a
+   single-operator tool (no tag-autocomplete component is installed; building
+   one is out of scope).
+
+## All Needed Context
+
+### Documentation & References
+
+```yaml
+# MUST READ — issues (the contract stack)
+- url: https://github.com/w7-mgfcode/ForecastLabAI/issues/408
+  why: The epic this PRP implements — scope list is exhaustive (this PRP covers all of it).
+- url: https://github.com/w7-mgfcode/ForecastLabAI/issues/406
+  why: Umbrella — success criteria rows 2 & 3 are E2's acceptance bar; out-of-scope list (no replay-policy infra beyond confirm+diff).
+- url: https://github.com/w7-mgfcode/ForecastLabAI/issues/407
+  why: Foundation epic body = the frozen CONTRACT(E1) surface (columns, JSONB slots, PATCH endpoint, replay provenance write).
+- file: PRPs/PRP-showcase-workspace-E4-restore-replay.md
+  why: Closest-analog predecessor PRP — the E4 restore/replay semantics E2 hardens; its "decisions locked" #2/#3 (no confirm dialog, no provenance) are the two designed behaviors #406/#407 now reverse.
+
+# MUST READ — backend (verified 2026-06-12, dev pre-E1)
+- file: app/features/demo/routes.py
+  why: |
+    Current surface: POST /run @51 (passes request.app into the pipeline @75 —
+    the request-context app handle the health route also needs), GET
+    /workspaces @80-107 (limit/offset only — EXTEND with filters), GET
+    /workspaces/{workspace_id} @110-135 (NotFoundError 404 pattern @133-134),
+    DELETE @138-163, WS /stream @166. Router prefix="/demo" @48. Health route
+    lands between the GET detail and DELETE.
+- file: app/features/demo/workspace.py
+  why: |
+    list_workspaces @174-196 (order created_at.desc, id.desc @192) and
+    count_workspaces @224-234 — the two functions E2 extends with q/tags/
+    include_archived/sort_by/sort_order. get_workspace @158, delete_workspace
+    @199. All take caller-owned AsyncSession. create_workspace @46 is E1's to
+    extend (replayed_from) — DO NOT touch unless E1 missed it.
+- file: app/features/demo/models.py
+  why: |
+    ShowcaseWorkspace @37; current columns @59-81; CHECK + composite index
+    @83-89. E1 adds the lifecycle/provenance columns here — E2 reads them,
+    never migrates. No-FK doctrine in the module docstring @4-11 (the health
+    feature exists BECAUSE of this doctrine).
+- file: app/features/demo/schemas.py
+  why: |
+    DemoRunRequest @29 (strict=True @40; preservation @68; workspace_name
+    pattern @72-78; requires-keep validator @80-85 — the model E1 extends with
+    replayed_from_workspace_id). Response-model non-strict precedent: StepEvent
+    docstring @88-95, WorkspaceListItem @169 (from_attributes @177),
+    WorkspaceDetailResponse @192, WorkspaceListResponse @205. Append the two
+    health models here.
+- file: app/features/demo/pipeline.py
+  why: |
+    THE in-process probe mechanism to copy into link_health.py: _Client
+    @127-204 — httpx.AsyncClient(transport=httpx.ASGITransport(app=app,
+    raise_app_exceptions=False), base_url cosmetic, timeout @98) and
+    request() status handling @188-200. link_health needs a SIMPLER client:
+    status-code classification only, no _StepError. DO NOT modify pipeline.py
+    in E2 (E1 owns the provenance write; replay flows through unchanged).
+- file: app/features/demo/tests/test_routes.py
+  why: |
+    Route-test conventions to extend: unit tests monkeypatch the workspace
+    module functions (list @236-251, pagination pass-through @253-276, 404
+    @286-298, delete @324-347); integration tests @359+ use the db_session
+    fixture and seed real rows. New filter/health tests follow these shapes.
+- file: app/features/demo/tests/conftest.py
+  why: client fixture (ASGITransport over app.main.app) + db_session fixture
+    (real Postgres, wipes showcase_workspace on teardown).
+- file: app/features/scenarios/routes.py
+  why: |
+    Repeated-tags Query param precedent @168-195 (tags: list[str] | None =
+    Query(default=None)) — copy for the workspace list. GET detail 404 style
+    @198-223.
+- file: app/features/scenarios/service.py
+  why: list_plans @436-472 — tags containment filter @462-465
+    (stmt.where(ScenarioPlan.tags.contains(tags))) applied to BOTH count and
+    rows statements; total respects filters. Mirror exactly.
+- file: app/features/scenarios/models.py
+  why: tags JSONB string-array column @70-74 + GIN index @97 — the
+    representation CONTRACT(E1)-2 assumes for workspace tags.
+- file: app/features/dimensions/routes.py
+  why: |
+    search + allow-listed sort precedent @65-105 (search Query min-2-chars,
+    sort_by Query with allow-list note "unknown values use default order",
+    sort_order asc|desc). Mirror the docstring + silent-fallback style.
+- file: app/features/registry/schemas.py
+  why: RunUpdate @113-121 — the all-Optional partial-update body shape
+    CONTRACT(E1)-3 assumes for WorkspaceUpdateRequest (extra="forbid").
+- file: app/features/registry/routes.py
+  why: |
+    PATCH precedent @235; probe targets for link health: GET /registry/runs/
+    {run_id} @200-201, GET /registry/aliases/{alias_name} @503-504.
+- file: app/features/jobs/routes.py
+  why: probe target GET /jobs/{job_id} @219-220.
+- file: app/features/batch/routes.py
+  why: probe target GET /batch/{batch_id} @55-62 (NotFoundError on miss).
+- file: app/features/agents/routes.py
+  why: probe target GET /agents/sessions/{session_id} @80-104 — 404 via plain
+    HTTPException (status code is all the probe reads; body shape irrelevant).
+- file: app/core/exceptions.py
+  why: NotFoundError @72 (RFC 7807 404). No new exception classes needed.
+
+# MUST READ — frontend (verified 2026-06-12)
+- file: frontend/src/pages/showcase.tsx
+  why: |
+    453 lines. State block @118-131 (seed/keepWorkspace/workspaceName/
+    selectedWorkspaceId + useWorkspace detail resolution @128-131); handleRun
+    @139-156; handleLoadWorkspace @160-168; handleReplayWorkspace @174-186 —
+    THE function the confirm dialog intercepts (today it calls start()
+    directly); WorkspacePanel mount @245-255; name-pattern client validation
+    @26 + @135-137 (reuse in WorkspaceEditDialog); WorkspaceArtifactsPanel
+    mount @448-450 (gets health props).
+- file: frontend/src/components/demo/WorkspacePanel.tsx
+  why: |
+    219 lines — the component this epic reworks. Props @37-48; statusClass
+    @50-59 (semantic-token status colors); DESTRUCTIVE marker @144-148
+    (text-destructive span); per-row buttons @153-183; the AlertDialog
+    delete-confirm pattern @191-216 (open-state via pendingDelete, shared
+    one dialog for all rows, data-testid on the action) — COPY this pattern
+    for ReplayConfirmDialog + the multi-delete confirm; list invalidation
+    effect @106-110.
+- file: frontend/src/components/demo/WorkspacePanel.test.tsx
+  why: vitest conventions for this component family (mock use-workspaces
+    hooks via vi.mock, fire dialog actions, assert mutation calls).
+- file: frontend/src/components/demo/WorkspaceArtifactsPanel.tsx
+  why: |
+    157 lines. ArtifactCard shape @15-20, buildCards key mapping @30-107
+    (winning_run_id/v2_run_id/scenario_plan_ids/batch_id/alias/
+    agent_session_id + grain), disabled-card opacity-50 + title tooltip
+    @128-149. Health markers extend buildCards: each card gains an optional
+    `dead: boolean` resolved from the health response keyed by reference id.
+- file: frontend/src/hooks/use-workspaces.ts
+  why: |
+    43 lines — extend in place. useWorkspaces @10-16 (queryKey ['workspaces',
+    {limit}] — params object grows), useWorkspace @19-25, useDeleteWorkspace
+    @33-42 (invalidate ['workspaces'] on success — same invalidation for
+    usePatchWorkspace). useWorkspaceHealth + useWorkspaceLineage are new
+    siblings here.
+- file: frontend/src/pages/explorer/run-compare.tsx
+  why: |
+    THE compare-page pattern (370 lines): useSearchParams a/b @87-89,
+    selectRun setParams updater @103-109, RunPicker Select @56-84, DeltaCell
+    sign-only indicator @33-54, side-by-side Card/Table layout @114+. The
+    workspace compare page mirrors all of it with useWorkspace×2 instead of
+    useCompareRuns (frontend-only diff — Decision 4).
+- file: frontend/src/lib/constants.ts
+  why: ROUTES.SHOWCASE='/showcase' @4, ROUTES.EXPLORER.RUN_COMPARE @20 — add
+    SHOWCASE_COMPARE='/showcase/compare' beside SHOWCASE.
+- file: frontend/src/App.tsx
+  why: lazy-page + Suspense route registration pattern (ShowcasePage @12,
+    @54-61; RunComparePage @21, @119-126) — register WorkspaceComparePage
+    identically.
+- file: frontend/src/lib/api.ts
+  why: api<T>(endpoint, {params, method, body}) wrapper; ApiError carries
+    status (WorkspacePanel.tsx:97 shows instanceof usage); getErrorMessage.
+- file: frontend/src/types/api.ts
+  why: workspace types block @806-831 (WorkspaceListItem @806, WorkspaceDetail
+    @819, WorkspaceListResponse @828); DemoRunRequest @778-787 — extend here.
+- file: frontend/src/hooks/use-demo-pipeline.ts
+  why: start(req) signature + the picker-desync gotcha (start() does not sync
+    the scenario picker — Replay must setScenario first; already handled in
+    handleReplayWorkspace, keep that ordering inside the confirmed path).
+
+# Project docs to update (additive)
+- file: docs/_base/API_CONTRACTS.md
+  why: GET /demo/workspaces row gains the filter params; new health-endpoint
+    row; WS section note for replayed_from (if E1 didn't already add it).
+- file: docs/_base/RUNBOOKS.md
+  why: § "Showcase workspace — preserve/restore/replay/delete semantics" item 1
+    says "there is deliberately no confirm dialog" — E2 supersedes this
+    (update the item; keep the DESTRUCTIVE-marker sentence). Items 2-4 gain
+    one-line pointers to lineage badges / metadata-only multi-delete / health.
+- file: docs/_base/DOMAIN_MODEL.md
+  why: showcase_workspace § "Out of scope" lists the replayed_from column —
+    E1's PRP owns that doc edit; E2 only verifies it happened (do not double-edit).
+```
+
+### Current Codebase tree (relevant subset, pre-E1)
+
+```bash
+app/features/demo/
+├── link_health.py     # DOES NOT EXIST — E2 creates
+├── models.py          # ShowcaseWorkspace @37 (E1 extends; E2 reads)
+├── pipeline.py        # 2771 lines; _Client @127 — UNTOUCHED in E2
+├── routes.py          # POST /run @51; GETs @80,@110; DELETE @138; WS @166
+├── schemas.py         # 214 lines; workspace response models @169-213
+├── service.py         # lock + PipelineBusyError — untouched
+├── workspace.py       # 235 lines; list @174 / count @224 — E2 extends
+└── tests/             # conftest, test_{models,pipeline,routes,schemas,workspace}.py
+frontend/src/
+├── pages/showcase.tsx                       # 453 lines
+├── pages/explorer/run-compare.tsx           # 370 lines — compare pattern
+├── components/demo/WorkspacePanel.tsx       # 219 lines — reworked in E2
+├── components/demo/WorkspaceArtifactsPanel.tsx  # 157 lines — health-aware in E2
+├── hooks/use-workspaces.ts                  # 43 lines — extended in E2
+├── types/api.ts                             # workspace block @806-831
+└── components/ui/                           # 27 primitives incl. alert-dialog,
+                                             # dialog, dropdown-menu, textarea,
+                                             # table, select, tooltip, badge
+```
+
+### Desired Codebase tree (files added/modified)
+
+```bash
+app/features/demo/
+├── link_health.py                           # NEW — probe targets + probe_workspace_links()
+├── schemas.py                               # MOD — +WorkspaceRefHealth +WorkspaceHealthResponse
+├── workspace.py                             # MOD — list/count filters + sort
+├── routes.py                                # MOD — list query params; +GET /workspaces/{id}/health
+└── tests/
+    ├── test_link_health.py                  # NEW — probe classification vs a stub ASGI app
+    ├── test_routes.py                       # MOD — filter/sort/health unit + integration tests
+    └── test_workspace.py                    # MOD — list/count filter unit coverage (db-less where possible)
+frontend/src/
+├── types/api.ts                             # MOD — lifecycle fields (verify-or-add), health types, params, update type
+├── hooks/use-workspaces.ts                  # MOD — params-aware list; +usePatchWorkspace +useWorkspaceHealth +useWorkspaceLineage
+├── hooks/use-workspaces.test.ts             # MOD — new hooks covered
+├── components/demo/ReplayConfirmDialog.tsx       # NEW (+ .test.tsx)
+├── components/demo/WorkspaceEditDialog.tsx       # NEW (+ .test.tsx)
+├── components/demo/WorkspaceLineageStrip.tsx     # NEW (+ .test.tsx)
+├── components/demo/WorkspacePanel.tsx       # MOD — toolbar/badges/dropdown/multi-select (+ test MOD)
+├── components/demo/WorkspaceArtifactsPanel.tsx   # MOD — health markers + summary chip (+ test MOD)
+├── components/demo/index.ts                 # MOD — barrel exports
+├── pages/workspace-compare.tsx              # NEW (+ workspace-compare.test.tsx)
+├── pages/showcase.tsx                       # MOD — confirm flow, lineage, health, provenance field
+├── lib/constants.ts                         # MOD — ROUTES.SHOWCASE_COMPARE
+└── App.tsx                                  # MOD — compare route registration
+docs/_base/API_CONTRACTS.md                  # MOD — list params + health endpoint
+docs/_base/RUNBOOKS.md                       # MOD — supersede "no confirm dialog"; lifecycle notes
+```
+
+### Known Gotchas & Library Quirks
+
+```python
+# CRITICAL — EXECUTION GATE: do not start until E1 (#407) is merged to dev.
+#   Task 1 reconciles every CONTRACT(E1) point against the real merged code
+#   (git log --oneline --grep "#407"; read the E1 PRP + diff). Adapt E2 to
+#   E1's names; flag (don't silently fix) any E1 contract gap in the PR body.
+
+# CRITICAL — NO migration, NO models.py edit, NO pipeline.py edit in E2.
+#   The schema delta and the provenance/PATCH plumbing are E1's. If a column
+#   you need is missing post-E1, STOP and surface it — don't ship a stealth
+#   migration under E2.
+
+# CRITICAL — no cross-slice imports from app/features/demo/. Link health MUST
+#   go through in-process HTTP (request.app + httpx.ASGITransport — precedent
+#   pipeline.py:141-148 driven from a request context via routes.py:75).
+#   Importing RegistryService/ScenarioService/etc. fails the architecture rule.
+
+# CRITICAL — health probe classification: 2xx -> "alive", 404 -> "dead",
+#   EVERYTHING else (5xx, timeout, transport error) -> "unknown". Never let a
+#   probe exception escape the endpoint (asyncio.gather(..., return_exceptions=
+#   True) or per-probe try/except) — a flaky slice must not 500 the health
+#   route. raise_app_exceptions=False is REQUIRED on the ASGITransport (an
+#   unhandled error in a probed endpoint must surface as a 500 *response*).
+
+# CRITICAL — multi-select delete is N SINGLE DELETEs (existing endpoint).
+#   Adding POST /demo/workspaces/bulk-delete or DELETE /demo/workspaces is a
+#   product-vision violation (no bulk-wipe operations) — do not create it.
+
+# CRITICAL — the `total` returned by the filtered list MUST respect the active
+#   filters (scenarios precedent: BOTH count_stmt and rows_stmt get the same
+#   .where chain, scenarios/service.py:462-465). A filter-blind total breaks
+#   the "showing X of Y" header.
+
+# GOTCHA — include_archived default false flips list semantics for archived
+#   rows. Pre-E1 rows have archived=false (E1 migration default), so legacy
+#   lists are unchanged; route tests must still pin: no-param call returns
+#   only archived=false rows, include_archived=true returns both.
+
+# GOTCHA — sort allow-list: {created_at, name, seed, status}; unknown sort_by
+#   silently falls back to created_at desc (dimensions precedent — NOT a 422).
+#   Pinned-first is unconditional: ORDER BY pinned DESC, <sort>, id DESC
+#   tiebreak. name sort: NULLS LAST (unnamed rows sink) — use
+#   sqlalchemy .nulls_last() on the asc/desc expression.
+
+# GOTCHA — tags Query param: list[str] | None = Query(default=None) gives
+#   repeated-param parsing (?tags=a&tags=b). JSONB containment via
+#   ShowcaseWorkspace.tags.contains(tags) requires CONTRACT(E1)-2 (JSONB array
+#   column). Frontend sends ONE tag at a time (chip filter) — a single
+#   `tags` param serializes fine through api()'s params.
+
+# GOTCHA — q search: mirror dimensions ILIKE (case-insensitive, escape % and _
+#   if the precedent does; check dimensions/service.py before writing).
+#   Search NAME only (workspace_id prefixes are copy-paste handles, not search).
+
+# GOTCHA — strict-mode policy: the new health/response models are response
+#   models -> plain BaseModel, NO ConfigDict(strict=True). The AST walker
+#   (app/core/tests/test_strict_mode_policy.py) only inspects strict=True
+#   request models — keep it that way.
+
+# GOTCHA — agents GET /agents/sessions/{id} 404s via plain HTTPException (not
+#   NotFoundError) — irrelevant to the probe (status code only), but do NOT
+#   "fix" the agents slice as a drive-by.
+
+# GOTCHA — an EXPIRED-but-existing agent session returns 200 (row exists) ->
+#   "alive". That is correct link-health semantics (the row is the link
+#   target); the artifacts card blurb already says "the recorded session has
+#   likely expired".
+
+# GOTCHA — ReplayConfirmDialog destructive styling: AlertDialogAction renders
+#   buttonVariants default; pass className="bg-destructive text-destructive-
+#   foreground hover:bg-destructive/90" (semantic tokens — NEVER raw colors
+#   like bg-red-500). Copy the shared-dialog open-state pattern from
+#   WorkspacePanel.tsx:191-216 (pendingX state, one dialog for all rows).
+
+# GOTCHA — confirm-dialog flow ordering: the confirmed replay must run the
+#   EXISTING handleReplayWorkspace body (setScenario BEFORE start() — the
+#   picker-desync gotcha from E4 still applies). Refactor: handleReplayWorkspace
+#   becomes "setPendingReplay(ws)"; a new executeReplay(ws) holds the old body
+#   + the CONTRACT(E1)-5 replayed_from_workspace_id field.
+
+# GOTCHA — lineage walking: a deleted ancestor's GET returns 404 (ApiError
+#   .status === 404) — render "(original deleted)" and STOP the walk; never
+#   throw. Implement as one useQuery whose queryFn loops (await api(...) per
+#   ancestor, depth cap 5), queryKey ['workspaces', id, 'lineage'] — N
+#   serial fetches inside one query keeps cache + loading states simple.
+
+# GOTCHA — useWorkspaces signature change (limit -> params object) touches its
+#   existing call sites + use-workspaces.test.ts — update them in the same
+#   commit; keep queryKey shape ['workspaces', paramsObject] so the blanket
+#   invalidateQueries({queryKey: ['workspaces']}) keeps matching everything.
+
+# GOTCHA — pnpm tsc --noEmit is VACUOUS (solution-style tsconfig, zero files)
+#   and `tsc -b` fails on dev with PRE-EXISTING errors (known issue — memory
+#   [[frontend-tsc-noemit-gate-vacuous]]). Do NOT chase those. JS gates that
+#   must be green: pnpm lint && pnpm test --run. Optionally verify ONLY your
+#   new files compile via their vitest imports.
+
+# GOTCHA — every shadcn primitive needed (alert-dialog, dialog, dropdown-menu,
+#   checkbox, input, textarea, select, table, tooltip, badge, card, button) is
+#   ALREADY in frontend/src/components/ui/ (verified 2026-06-12). Do NOT run
+#   `shadcn add`. If you believe a new primitive is required, stop and recheck
+#   (.claude/rules/shadcn-ui.md; memory [[shadcn-cli-version-pin]]).
+
+# GOTCHA — never call crypto.randomUUID directly (issue #332; ESLint guard) —
+#   safeRandomUUID from @/lib/uuid-utils if any client id is needed.
+
+# GOTCHA — repo has mixed CRLF/LF; Write/Edit emit LF. New files fine; for
+#   showcase.tsx / WorkspacePanel.tsx / routes.py edits run `git diff --stat`
+#   and confirm surgical line counts before committing.
+
+# GOTCHA — mypy --strict AND pyright --strict gate merge: full annotations on
+#   the new probe module (TypedDict/dataclass or Pydantic for probe targets),
+#   `-> None` on tests, annotated fixtures.
+
+# COORDINATION — E3 (#409), E4 (#410), E5 (#411), E6 (#412) are open parallel
+#   epics. Shared-file risk: schemas.py / routes.py / showcase.tsx /
+#   API_CONTRACTS.md. Keep every edit additive + self-contained; rebase on dev
+#   before the PR.
+
+# RUNTIME-VERIFICATION LOG (per prp-create step 3):
+#   - demo routes/handlers + line refs — read routes.py (2026-06-12)
+#   - list/count signatures + ordering — read workspace.py:174-234
+#   - ShowcaseWorkspace pre-E1 columns — read models.py:59-89
+#   - response-model non-strict precedent — read schemas.py:88-95,169-213
+#   - ASGITransport in-process pattern — read pipeline.py:127-204
+#   - scenario tags containment + GIN — read scenarios/service.py:462-465, models.py:74,97
+#   - dimensions search/sort params — grep dimensions/routes.py:65-105
+#   - probe targets exist: /registry/runs/{run_id} (registry/routes.py:200),
+#     /registry/aliases/{alias_name} (:503), /jobs/{job_id} (jobs/routes.py:219),
+#     /batch/{batch_id} (batch/routes.py:55), /agents/sessions/{session_id}
+#     (agents/routes.py:80), /scenarios/{scenario_id} (scenarios/routes.py:198)
+#   - RunUpdate partial-update shape — read registry/schemas.py:113-121
+#   - frontend: WorkspacePanel AlertDialog pattern (191-216), run-compare
+#     useSearchParams pattern (87-109), installed ui primitives (ls), api.ts
+#     ApiError usage (WorkspacePanel.tsx:97)
+#   - E1 #407 OPEN / unmerged as of 2026-06-12 — CONTRACT(E1) tags mark every
+#     dependency; no third-party API claims beyond in-repo working patterns
+#     (httpx ASGITransport, sqlalchemy .contains, TanStack useQuery/useMutation
+#     — all already exercised in this repo; .nulls_last is standard
+#     SQLAlchemy 2.0 API but has NO in-repo precedent — verify at impl time).
+```
+
+## Implementation Blueprint
+
+### Data models and structure
+
+```python
+# app/features/demo/schemas.py — APPEND (response models; NOT strict)
+
+RefHealthStatus = Literal["alive", "dead", "unknown"]
+RefType = Literal["model_run", "scenario_plan", "alias", "batch", "agent_session", "job"]
+
+
+class WorkspaceRefHealth(BaseModel):
+    """Liveness of one soft reference recorded on a workspace (E2, #408)."""
+
+    key: str = Field(..., description="created_objects key, e.g. 'winning_run_id' or 'scenario_plan_ids[0]'.")
+    ref_type: RefType = Field(..., description="Kind of referenced object.")
+    ref_id: str = Field(..., description="The recorded soft-reference id.")
+    status: RefHealthStatus = Field(..., description="alive (2xx) / dead (404) / unknown (other).")
+    probe_path: str = Field(..., description="The public API path probed.")
+
+
+class WorkspaceHealthResponse(BaseModel):
+    """Per-workspace link-health summary (E2, #408)."""
+
+    workspace_id: str
+    workspace_status: str = Field(..., description="running / completed / failed.")
+    partial_run: bool = Field(..., description="True when workspace_status != 'completed'.")
+    references: list[WorkspaceRefHealth] = Field(default_factory=list)
+    alive: int = Field(..., ge=0)
+    dead: int = Field(..., ge=0)
+    unknown: int = Field(..., ge=0)
+    checked_at: datetime = Field(default_factory=_utc_now)
+```
+
+```python
+# app/features/demo/link_health.py — NEW (sketch; CRITICAL details only)
+
+@dataclass(frozen=True)
+class _ProbeTarget:
+    key: str          # e.g. "scenario_plan_ids[1]"
+    ref_type: str     # RefType value
+    ref_id: str
+    probe_path: str   # e.g. f"/registry/runs/{ref_id}"
+
+def build_probe_targets(ws: ShowcaseWorkspace) -> list[_ProbeTarget]:
+    # created_objects keys (workspace.py:_collect_created_objects:82-103):
+    #   winning_run_id / v2_run_id / stale_alias_run_id -> /registry/runs/{id}
+    #   scenario_plan_ids[i]                            -> /scenarios/{id}
+    #   alias                                           -> /registry/aliases/{name}
+    #   batch_id                                        -> /batch/{id}
+    #   agent_session_id                                -> /agents/sessions/{id}
+    # CONTRACT(E1)-6: job_ids JSONB slot [i]            -> /jobs/{id}
+    # NON-probeable keys (v2_model_path, scenario_artifact_key,
+    # train_model_types) are SKIPPED — no HTTP identity to check.
+    ...
+
+async def probe_workspace_links(app: FastAPI, ws: ShowcaseWorkspace) -> WorkspaceHealthResponse:
+    targets = build_probe_targets(ws)
+    async with httpx.AsyncClient(
+        transport=httpx.ASGITransport(app=app, raise_app_exceptions=False),
+        base_url="http://demo.internal",
+        timeout=httpx.Timeout(10.0, connect=5.0),
+    ) as client:
+        results = await asyncio.gather(
+            *(_probe_one(client, t) for t in targets), return_exceptions=False
+        )  # _probe_one NEVER raises: try/except httpx.HTTPError/OSError -> "unknown"
+    # classify: 200<=s<300 alive; s==404 dead; else unknown
+    # partial_run = ws.status != WORKSPACE_STATUS_COMPLETED
+    ...
+```
+
+```typescript
+// frontend/src/types/api.ts — extend the workspace block (806-831)
+
+// CONTRACT(E1)-4 — verify E1 added these; add additively if not:
+export interface WorkspaceListItem {
+  /* existing fields ... */
+  archived: boolean
+  pinned: boolean
+  tags: string[]
+  replayed_from_workspace_id: string | null
+}
+export interface WorkspaceDetail extends WorkspaceListItem {
+  /* existing fields ... */
+  notes: string | null
+  config_schema_version: number
+}
+
+// E2 (#408) — lifecycle PATCH body (CONTRACT(E1)-3 shape; adapt to E1 names):
+export interface WorkspaceUpdate {
+  name?: string | null
+  notes?: string | null
+  tags?: string[]
+  archived?: boolean
+  pinned?: boolean
+}
+
+export interface WorkspaceListParams {
+  limit?: number
+  offset?: number
+  q?: string
+  tags?: string
+  include_archived?: boolean
+  sort_by?: 'created_at' | 'name' | 'seed' | 'status'
+  sort_order?: 'asc' | 'desc'
+}
+
+export type RefHealthStatus = 'alive' | 'dead' | 'unknown'
+export interface WorkspaceRefHealth {
+  key: string
+  ref_type: 'model_run' | 'scenario_plan' | 'alias' | 'batch' | 'agent_session' | 'job'
+  ref_id: string
+  status: RefHealthStatus
+  probe_path: string
+}
+export interface WorkspaceHealth {
+  workspace_id: string
+  workspace_status: 'running' | 'completed' | 'failed'
+  partial_run: boolean
+  references: WorkspaceRefHealth[]
+  alive: number
+  dead: number
+  unknown: number
+  checked_at: string
+}
+```
+
+### List of tasks (dependency order)
+
+```yaml
+Task 1 — gate, branch & E1 reconciliation:
+  VERIFY: gh issue view 407 --json state  -> MUST be closed (E1 merged) before continuing
+  RUN: git switch dev && git pull && git switch -c feat/showcase-completion-e2-safe-replay-lifecycle
+  VERIFY: gh issue view 408 --json state   # open
+  RECONCILE every CONTRACT(E1) tag against the merged code:
+    - read app/features/demo/models.py    -> column names (CONTRACT(E1)-1/-2)
+    - read app/features/demo/schemas.py   -> PATCH body + GET response fields (CONTRACT(E1)-3/-4)
+    - read app/features/demo/routes.py    -> PATCH route exists
+    - grep replayed_from app/features/demo/ -> provenance mechanism (CONTRACT(E1)-5)
+    - read PRPs/PRP-showcase-completion-E1-*.md (whatever E1's PRP file is named)
+  ADAPT all names below to E1's reality; note any E1 gap in the PR body.
+
+Task 2 — MODIFY app/features/demo/workspace.py (filters + sort):
+  - EXTEND list_workspaces(db, *, limit=50, offset=0, q=None, tags=None,
+      include_archived=False, sort_by=None, sort_order="desc"):
+      # base stmt; if not include_archived: .where(~ShowcaseWorkspace.archived)
+      # if q: .where(ShowcaseWorkspace.name.ilike(f"%{q}%"))   (name only)
+      # if tags: .where(ShowcaseWorkspace.tags.contains(tags)) (CONTRACT(E1)-2)
+      # sort: allow-list {created_at,name,seed,status}; unknown -> created_at
+      #   desc; name uses .nulls_last(); ALWAYS ORDER BY pinned.desc() first,
+      #   then the sort expr, then id.desc() tiebreak
+  - EXTEND count_workspaces(db, *, q=None, tags=None, include_archived=False)
+      # SAME where-chain as list (scenarios/service.py:462-465 precedent) —
+      # extract a shared _apply_filters(stmt, ...) helper to keep them in sync
+  - Update module docstring (E2 routes the filters).
+
+Task 3 — MODIFY app/features/demo/schemas.py:
+  - APPEND WorkspaceRefHealth + WorkspaceHealthResponse (blueprint above);
+    docstring notes: response models, NOT strict (StepEvent precedent @88-95).
+  - CONTRACT(E1)-4 defensive check: if E1 did not expose archived/pinned/tags/
+    replayed_from_workspace_id on WorkspaceListItem (+notes/
+    config_schema_version on WorkspaceDetailResponse), ADD them here
+    additively (from_attributes picks them up from the ORM row).
+
+Task 4 — CREATE app/features/demo/link_health.py:
+  - build_probe_targets(ws) + probe_workspace_links(app, ws) per the blueprint.
+  - MIRROR pipeline._Client transport flags exactly (raise_app_exceptions=False).
+  - _probe_one catches (httpx.HTTPError, OSError) -> "unknown"; NEVER raises.
+  - Full --strict annotations; module docstring states the no-cross-slice-
+    import rationale (Decision 3) and the 2xx/404/other classification table.
+
+Task 5 — MODIFY app/features/demo/routes.py:
+  - EXTEND GET /workspaces signature with q / tags / include_archived /
+    sort_by / sort_order Query params (mirror dimensions/routes.py:65-75 +
+    scenarios/routes.py:180 styles; document the allow-list + silent fallback
+    in the docstring); pass through to workspace.list_workspaces /
+    count_workspaces (same filter args to BOTH).
+  - ADD GET /workspaces/{workspace_id}/health -> WorkspaceHealthResponse:
+      # async def get_workspace_health(workspace_id: str, request: Request,
+      #                                db: AsyncSession = Depends(get_db)):
+      #   row = await workspace.get_workspace(db, workspace_id)
+      #   if row is None: raise NotFoundError(message=f"Workspace not found: {workspace_id}")
+      #   return await link_health.probe_workspace_links(request.app, row)
+      # Place between the GET detail (@110) and DELETE (@138). No path
+      # collision: /workspaces/{id}/health is more specific than /workspaces/{id}.
+  - Update the module docstring route inventory.
+
+Task 6 — backend tests:
+  - CREATE app/features/demo/tests/test_link_health.py (unit, no DB):
+      # build a THROWAWAY FastAPI stub app with routes returning 200 / 404 /
+      # 500 at the probed paths; construct a ShowcaseWorkspace instance
+      # in-memory (not persisted) with created_objects covering every key +
+      # job_ids slot; assert classification alive/dead/unknown + counts +
+      # partial_run on status='failed'; assert non-probeable keys skipped;
+      # assert empty created_objects -> empty references, partial_run logic.
+  - MODIFY app/features/demo/tests/test_routes.py:
+      UNIT (monkeypatch app.features.demo.routes.workspace / .link_health):
+        - list passes q/tags/include_archived/sort args through (capture kwargs)
+        - list rejects bad limit (existing) — keep green
+        - health 404 on missing workspace (problem+json content-type)
+        - health 200 happy path (monkeypatched probe returns canned response)
+      INTEGRATION (@pytest.mark.integration, db_session):
+        - seed rows: named/unnamed, archived, pinned, tagged ->
+          default list hides archived; include_archived=true shows it;
+          q matches name substring case-insensitively; tags containment;
+          sort_by=name asc with NULLS LAST; pinned row first regardless of sort;
+          total respects filters
+        - health integration: insert a workspace whose created_objects carry
+          one REAL reference (insert a scenario_plan row via its ORM, or use a
+          bogus-vs-real registry pair) + one bogus id -> assert alive + dead
+  - MODIFY app/features/demo/tests/test_workspace.py: filter unit coverage of
+    _apply_filters where practical (or fold into the integration tests above).
+
+Task 7 — MODIFY frontend/src/types/api.ts:
+  - Lifecycle fields per CONTRACT(E1)-4 (verify-or-add), WorkspaceUpdate,
+    WorkspaceListParams, WorkspaceRefHealth/WorkspaceHealth (blueprint above).
+  - DemoRunRequest: verify E1 added replayed_from_workspace_id?: string
+    (CONTRACT(E1)-5); add if missing.
+
+Task 8 — MODIFY frontend/src/hooks/use-workspaces.ts (+ test):
+  - useWorkspaces(params: WorkspaceListParams = {}, enabled = true):
+      queryKey ['workspaces', params]; api('/demo/workspaces', { params })
+      # update existing call site: WorkspacePanel.tsx:77 (the sole useWorkspaces
+      # caller — showcase.tsx never calls it directly)
+  - ADD usePatchWorkspace():
+      mutationFn: ({workspaceId, update}: {workspaceId: string; update: WorkspaceUpdate}) =>
+        api<WorkspaceDetail>(`/demo/workspaces/${workspaceId}`, { method: 'PATCH', body: update })
+      onSuccess: invalidate ['workspaces']   # blanket key matches list+detail
+  - ADD useWorkspaceHealth(workspaceId: string, enabled = true):
+      queryKey ['workspaces', workspaceId, 'health']; staleTime 30_000
+  - ADD useWorkspaceLineage(workspaceId: string | null):
+      one useQuery; queryFn walks replayed_from_workspace_id via sequential
+      api<WorkspaceDetail>() calls, depth cap 5; a 404 (ApiError.status===404)
+      terminates the walk with a {deleted: true} sentinel entry; returns
+      Array<{workspace_id, name, deleted}> oldest-last.
+  - MODIFY use-workspaces.test.ts: params serialization, PATCH invalidation,
+    lineage walk incl. 404 termination (mock api module).
+
+Task 9 — CREATE frontend/src/components/demo/ReplayConfirmDialog.tsx (+ test):
+  - Props: { workspace: WorkspaceListItem | null,        # null = closed
+             requestPreview: DemoRunRequest | null,      # built by the page
+             onConfirm: () => void, onCancel: () => void }
+  - AlertDialog (open={workspace !== null}; onOpenChange close -> onCancel) —
+    copy the shared-dialog pattern from WorkspacePanel.tsx:191-216.
+  - Body: 3-column table (Field / Recorded / Will send) over seed, scenario,
+    reset, skip_seed, name, preservation, replayed_from; per-row mismatch
+    highlight (font-semibold text-destructive on the "Will send" cell when
+    values differ — defensive; verbatim replay normally matches).
+  - reset=true -> warning block (AlertTriangle + "Replaying this workspace
+    WIPES the database and reseeds it.") + AlertDialogAction
+    className="bg-destructive text-destructive-foreground hover:bg-destructive/90"
+    label "Replay & wipe database"; else label "Replay".
+  - Footer hint: "Want to change the config first? Use Load instead." (muted).
+  - data-testid="replay-confirm" on the action (WorkspacePanel test precedent).
+  - Test: renders preview values; destructive copy/label only when reset;
+    confirm fires onConfirm once; cancel fires onCancel; mismatch highlight.
+
+Task 10 — CREATE frontend/src/components/demo/WorkspaceEditDialog.tsx (+ test):
+  - Props: { workspace: WorkspaceListItem | null, onClose: () => void }
+  - Dialog (ui/dialog.tsx — form dialog, not AlertDialog) with: name Input
+    (reuse WORKSPACE_NAME_PATTERN from showcase.tsx:26 — export it from a
+    shared location, e.g. components/demo/workspace-name.ts, instead of
+    duplicating), notes Textarea, tags Input (comma-separated -> trimmed
+    string[]; render current tags as chips above the input).
+  - Save -> usePatchWorkspace().mutate({workspaceId, update}); toast on
+    success/failure (sonner pattern WorkspacePanel.tsx:88-99); close on success.
+  - Send ONLY changed fields (partial update — CONTRACT(E1)-3 semantics).
+  - Test: pattern violation disables Save with inline hint; save sends only
+    dirty fields; success closes + toasts (mock usePatchWorkspace).
+
+Task 11 — CREATE frontend/src/components/demo/WorkspaceLineageStrip.tsx (+ test):
+  - Props: { workspaceId: string, onLoadAncestor: (id: string) => void }
+  - useWorkspaceLineage(workspaceId); render breadcrumb: current ← parent ←
+    … oldest; ancestors as Button variant="link" size="sm" (click ->
+    onLoadAncestor); deleted sentinel renders muted "(original deleted)";
+    depth-cap overflow renders trailing "…". Render nothing (null) when the
+    workspace has no replayed_from_workspace_id.
+  - Test: chain render order, deleted sentinel, null when no lineage.
+
+Task 12 — MODIFY frontend/src/components/demo/WorkspacePanel.tsx (+ test):
+  - Toolbar row above the list: search Input (icon lucide Search; applies as
+    `q` on Enter/debounce), "Show archived" Checkbox, sort Select
+    (Newest/Oldest/Name/Status -> sort_by+sort_order pairs), active-tag chip
+    (clearable) when a tag filter is set.
+  - Panel owns the list-params state and calls useWorkspaces(params).
+  - Row additions: leading multi-select Checkbox; Pin icon button (lucide Pin
+    / PinOff, fires usePatchWorkspace toggle); archived rows: opacity-60 +
+    outline Badge "archived"; replay Badge (outline, "replay") when
+    replayed_from_workspace_id != null; tags as clickable chips (sets the tag
+    filter); DropdownMenu (MoreHorizontal): Pin/Unpin, Archive/Unarchive,
+    Edit details…, Delete… (Delete keeps the existing pendingDelete dialog).
+  - Replay button now calls a NEW prop onRequestReplay(ws) (the page owns the
+    confirm dialog) — RENAME the old onReplay prop to make the break explicit.
+  - Selection footer: "N selected" + Delete selected (AlertDialog confirm ->
+    sequential `for (const id of selected) await deleteWorkspace.mutateAsync(id)`
+    with per-failure collection -> one summary toast; clear selection) +
+    Compare button (disabled unless exactly 2; useNavigate ->
+    `${ROUTES.SHOWCASE_COMPARE}?a=${id1}&b=${id2}`).
+  - Keep the component lean: extract WorkspaceToolbar + WorkspaceRow as
+    file-local components if the file passes ~300 lines.
+  - Tests: search/sort/archived params flow into useWorkspaces (mock + assert
+    last call args); multi-select count + delete-selected confirm calls N
+    mutateAsync; compare disabled at 1 and 3 selections; pin/archive fire
+    PATCH mutations; replay fires onRequestReplay (NOT start).
+
+Task 13 — MODIFY frontend/src/components/demo/WorkspaceArtifactsPanel.tsx (+ test):
+  - Props += { health?: WorkspaceHealth | null }
+  - buildCards gains the refId per card; a card whose refId matches a
+    health.references entry with status==='dead' renders AlertTriangle
+    (h-3 w-3 text-destructive) beside the label + title tooltip "This object
+    no longer exists — it was deleted after the run." ('unknown' -> no marker).
+  - Header chip row: `✓ {alive} live` (text-success) + `✕ {dead} dead`
+    (text-destructive, only when dead>0) + outline Badge "partial run" when
+    health.partial_run (tooltip: "This run never completed — artifacts may be
+    missing."). Skeleton/silent when health undefined (query in flight/disabled).
+  - Test: dead marker on matching card; summary chip counts; partial-run badge.
+
+Task 14 — MODIFY frontend/src/pages/showcase.tsx:
+  - State += pendingReplay: WorkspaceListItem | null.
+  - handleReplayWorkspace(ws) -> setPendingReplay(ws)  (no start()).
+  - NEW executeReplay(ws): the post-E1 body (showcase.tsx:174-186 today —
+    setScenario first; E1 shifts these anchors and adds
+    replayed_from_workspace_id: ws.workspace_id, which executeReplay PRESERVES
+    — CONTRACT(E1)-5, preserve-not-add); clear pendingReplay.
+  - buildReplayRequest(ws): pure helper producing the DemoRunRequest preview
+    passed to the dialog AND used by executeReplay (single source — the diff
+    can never lie about what's sent). Export for unit testing.
+  - Mount <ReplayConfirmDialog workspace={pendingReplay}
+      requestPreview={pendingReplay && buildReplayRequest(pendingReplay)}
+      onConfirm={() => pendingReplay && executeReplay(pendingReplay)}
+      onCancel={() => setPendingReplay(null)} />
+  - Health: const health = useWorkspaceHealth(selectedWorkspaceId ?? '',
+      !!selectedWorkspaceId); pass health.data into WorkspaceArtifactsPanel.
+  - Lineage: mount <WorkspaceLineageStrip workspaceId={selectedWorkspaceId}
+      onLoadAncestor={(id) => { /* fetch list item via detail + handleLoad */ }} />
+      inside the loaded-workspace block (@448-450 region); simplest
+    onLoadAncestor: setSelectedWorkspaceId(id) + repopulate controls from the
+    lineage entry's detail (the strip's hook already has the details — pass
+    the full WorkspaceDetail up instead of just the id if cleaner).
+  - WorkspacePanel prop rename: onRequestReplay={handleReplayWorkspace}.
+
+Task 15 — CREATE frontend/src/pages/workspace-compare.tsx (+ test) + routing:
+  - MODIFY frontend/src/lib/constants.ts: SHOWCASE_COMPARE: '/showcase/compare'
+    (beside SHOWCASE @4).
+  - MODIFY frontend/src/App.tsx: lazy WorkspaceComparePage + <Route> (mirror
+    RunComparePage @21, @119-126). '/showcase/compare' and '/showcase' are
+    distinct paths — no nesting needed.
+  - Page mirrors run-compare.tsx: useSearchParams a/b (@87-109 pattern);
+    pickers = Select over useWorkspaces({limit: 100, include_archived: true})
+    items (label: name ?? id.slice(0,8) · scenario · status); two
+    useWorkspace(a/b) detail queries; render:
+      * config table — seed/scenario/reset/skip_seed/name/tags; mismatch rows
+        highlighted (font-semibold)
+      * results table — winner_model_type, winner_wape (DeltaCell-style
+        sign-only delta — copy the component from run-compare.tsx:33-54
+        file-locally), wall_clock_s
+      * created-objects matrix — union of soft-reference keys × (A: ✓/—,
+        B: ✓/—)
+      * lineage note — "B is a replay of A" (or inverse) when
+        replayed_from_workspace_id links them
+      * partial-run outline Badge per side when status !== 'completed'
+    Missing/invalid id -> that side renders the picker + muted "select a
+    workspace" (no crash; ApiError 404 -> same fallback).
+  - Test: renders diff for two mocked details; mismatch highlight; lineage
+    note; 404 side falls back to picker state.
+
+Task 16 — barrel + docs:
+  - MODIFY frontend/src/components/demo/index.ts — export the three new
+    components.
+  - MODIFY docs/_base/API_CONTRACTS.md:
+      * GET /demo/workspaces row: append "E2 (#408) — `q` name search, `tags`
+        containment filter, `include_archived` (default false), allow-listed
+        `sort_by`/`sort_order`; pinned rows first; `total` respects filters"
+      * NEW row: | demo | GET | `/demo/workspaces/{workspace_id}/health` |
+        E2 (#408) — probe the workspace's soft references in-process; per-ref
+        alive/dead/unknown + counts + `partial_run`; `404` when missing |
+  - MODIFY docs/_base/RUNBOOKS.md § "Showcase workspace — …":
+      * item 1: replace "there is deliberately no confirm dialog" with the E2
+        reality (every panel Replay confirms; reset=true gets destructive
+        copy; the DESTRUCTIVE row marker stays)
+      * item 3/4: one-line additions — multi-select delete = N metadata-only
+        singles; dead links now SURFACE via the health summary instead of
+        silently dangling
+  - VERIFY (not edit) DOMAIN_MODEL.md replayed_from note was updated by E1.
+
+Task 17 — gates, dogfood, commits, PR:
+  - Backend gates + integration suite (Validation Loop below).
+  - Frontend: cd frontend && pnpm lint && pnpm test --run.
+  - Browser dogfood via the webapp-testing skill (CLAUDE.md workflow step 4):
+    seeded stack -> save 3 workspaces (one reset=true, one tagged, one
+    replayed) -> search/sort/archive/pin -> replay with confirm (destructive
+    variant) -> lineage chain -> compare page -> delete a referenced scenario
+    plan -> reload workspace -> dead-link warning + health chip.
+  - git diff --stat (CRLF surgical-diff check on edited files).
+  - COMMITS (reference #408, no AI trailer), e.g.:
+      feat(api): add workspace list filters and link-health endpoint (#408)
+      feat(ui): add replay confirmation with config diff to showcase (#408)
+      feat(ui): add workspace lifecycle controls and lineage rendering (#408)
+      feat(ui): add two-workspace compare page (#408)
+      test(api): cover workspace filters and link-health probes (#408)
+      docs(api): document workspace lifecycle and health contracts (#408)
+  - PR into dev; title `feat(api,ui): showcase-completion E2 — safe replay &
+    workspace lifecycle (#408)`; body notes the replay-policy-picker deferral
+    (Decision 1) + any CONTRACT(E1) reconciliation deltas.
+```
+
+### Integration Points
+
+```yaml
+DATABASE: none in E2 — reads the E1-migrated table; NO new migration.
+
+CONFIG: none — no new settings or env vars (probe timeout is a module constant).
+
+ROUTES: existing demo router only (app/main.py wiring unchanged): extended GET
+  /demo/workspaces + new GET /demo/workspaces/{id}/health. PATCH is E1's.
+
+FRONTEND ROUTES: one new React Router page at ROUTES.SHOWCASE_COMPARE
+  ('/showcase/compare'); registered in App.tsx beside the existing pages.
+
+DOCS: API_CONTRACTS.md + RUNBOOKS.md (Task 16). Full doc sweep belongs to the
+  E7 release gate — keep E2's edits additive and minimal.
+```
+
+## Validation Loop
+
+### Level 1: Syntax & Style
+
+```bash
+uv run ruff check . && uv run ruff format --check .
+uv run mypy app/ && uv run pyright app/
+cd frontend && pnpm lint
+# Expected: clean. Both Python type checkers are --strict and gate merge.
+# (pnpm tsc --noEmit is vacuous; tsc -b fails with PRE-EXISTING errors — do
+# not chase them. lint + vitest are the JS gates.)
+```
+
+### Level 2: Unit Tests (no DB)
+
+```bash
+uv run pytest app/features/demo -v -m "not integration"
+uv run pytest app/core/tests/test_strict_mode_policy.py -v   # AST walker still green
+cd frontend && pnpm test --run
+# New/changed: test_link_health (stub-app probe classification), test_routes
+# filter/health unit tests, use-workspaces hooks, ReplayConfirmDialog,
+# WorkspaceEditDialog, WorkspaceLineageStrip, WorkspacePanel rework,
+# WorkspaceArtifactsPanel health markers, workspace-compare page.
+```
+
+### Level 3: Integration (real Postgres)
+
+```bash
+docker compose up -d && uv run alembic upgrade head
+uv run pytest app/features/demo -v -m integration
+# List filters against seeded rows (archived hidden / shown, q, tags,
+# sort + pinned-first, filtered total) + health probe (real + bogus refs).
+```
+
+### Level 4: Manual smoke + browser dogfood (seeded local stack, uvicorn :8123)
+
+```bash
+# 1. Filtered list + health round-trip
+curl -s "http://localhost:8123/demo/workspaces?q=demo&sort_by=name&sort_order=asc" | python3 -m json.tool | head -30
+curl -s "http://localhost:8123/demo/workspaces?include_archived=true" | python3 -c "import sys,json; d=json.load(sys.stdin); print(d['total'])"
+WS_ID=$(curl -s -X POST http://localhost:8123/demo/run -H 'Content-Type: application/json' \
+  -d '{"skip_seed": true, "preservation": "keep", "workspace_name": "e2-smoke"}' \
+  | python3 -c "import sys,json; print(json.load(sys.stdin)['workspace_id'])")
+curl -s "http://localhost:8123/demo/workspaces/${WS_ID}/health" | python3 -m json.tool
+curl -s -o /dev/null -w "%{http_code} %{content_type}\n" \
+  http://localhost:8123/demo/workspaces/deadbeefdeadbeefdeadbeefdeadbeef/health   # 404 problem+json
+
+# 2. Dead-link proof: delete a referenced scenario plan, re-probe
+#    (pick a scenario_plan_id from the workspace detail's created_objects)
+curl -s -X DELETE http://localhost:8123/scenarios/<plan-id> -o /dev/null -w "%{http_code}\n"
+curl -s "http://localhost:8123/demo/workspaces/${WS_ID}/health" \
+  | python3 -c "import sys,json; print([r for r in json.load(sys.stdin)['references'] if r['status']=='dead'])"
+
+# 3. Browser dogfood (webapp-testing skill / agent-browser):
+#    /showcase -> save workspaces -> toolbar search/sort/show-archived ->
+#    pin (row jumps first) -> archive (vanishes until toggle) -> Edit details
+#    (rename + tags chips) -> Replay -> confirm dialog shows the diff table ->
+#    a reset=true workspace shows destructive copy + red button -> confirmed
+#    replay goes green, new row carries the "replay" badge -> Load it ->
+#    lineage strip shows the chain -> select 2 rows -> Compare page diff ->
+#    multi-select 2 -> Delete selected -> rows gone, created objects intact ->
+#    loaded workspace with the deleted plan shows the dead-link warning + chip.
+```
+
+## Final validation Checklist
+
+- [ ] All five gates green: `uv run ruff check . && uv run ruff format --check . && uv run mypy app/ && uv run pyright app/ && uv run pytest -v -m "not integration"`
+- [ ] Integration suite green: `uv run pytest -v -m integration` (fresh docker-compose DB)
+- [ ] Frontend gates green: `cd frontend && pnpm lint && pnpm test --run`
+- [ ] No replay path bypasses the confirm dialog; reset=true shows destructive variant (vitest + dogfood)
+- [ ] List filters: archived hidden by default, q/tags/sort behave, pinned-first, filtered total (route tests + curl)
+- [ ] Health endpoint classifies alive/dead/unknown; dead-link warning + partial-run chip render (integration + dogfood step 2/3)
+- [ ] Lineage chain renders incl. deleted-ancestor sentinel
+- [ ] Compare page deep-links `?a=&b=` and degrades gracefully on bad ids
+- [ ] Multi-select delete = N single DELETEs; **no new bulk endpoint in the diff**
+- [ ] Legacy list calls + all pre-existing demo tests unchanged-green
+- [ ] CONTRACT(E1) reconciliation notes in the PR body; replay-policy deferral noted
+- [ ] `git diff --stat` surgical (no CRLF whole-file noise)
+- [ ] docs/_base/API_CONTRACTS.md + RUNBOOKS.md updated additively
+- [ ] Commits `type(scope): description (#408)`, no AI trailer; PR into dev; browser dogfood evidence per `.claude/rules/ui-design.md`
+
+---
+
+## Anti-Patterns to Avoid
+
+- ❌ Don't start before E1 (#407) merges; don't re-implement E1 surface (migration, PATCH, provenance write).
+- ❌ Don't import another feature slice from `app/features/demo/` — link health is in-process HTTP only.
+- ❌ Don't add a bulk-delete endpoint or any "wipe everything" operation — N singles, period.
+- ❌ Don't add a replay-policy picker (exact/safe-keep/modified) — explicitly deferred (Decision 1).
+- ❌ Don't make health/response models strict — strict mode is request-body policy.
+- ❌ Don't probe health for every list row — loaded workspace only.
+- ❌ Don't let a probe exception 500 the health route — classify as `unknown`.
+- ❌ Don't mutate the original workspace row on replay — replay still creates a NEW row (provenance points back).
+- ❌ Don't duplicate the name pattern regex — share it between run controls and the edit dialog.
+- ❌ Don't run `shadcn add` — every needed primitive is installed; don't use raw colors — semantic tokens only.
+- ❌ Don't call `crypto.randomUUID` directly — `safeRandomUUID` (ESLint-enforced).
+- ❌ Don't chase pre-existing `tsc -b` errors — lint + vitest are the JS gates.
+
+## Confidence Score
+
+**7.5/10** for one-pass implementation success. The backend half (list filters
++ health endpoint) is a composition of three verified in-repo precedents
+(dimensions search/sort, scenarios tags containment, pipeline ASGITransport)
+with clear test shapes. The deductions: (a) E2 is authored against a frozen
+but UNMERGED E1 contract — seven CONTRACT(E1) points must reconcile against
+E1's real merged shape, and any naming/shape divergence costs an adaptation
+pass (mitigated by Task 1's reconciliation gate and verify-or-add fallbacks);
+(b) the WorkspacePanel rework is the single largest UI delta of the showcase
+initiative so far (toolbar + badges + dropdown + multi-select + confirm
+rerouting in one component) where an interaction miss costs an iteration; and
+(c) four parallel epics share `schemas.py` / `routes.py` / `showcase.tsx`,
+so rebase friction is plausible even with additive-only edits.
diff --git a/PRPs/PRP-showcase-completion-E3-seed-config-scope.md b/PRPs/PRP-showcase-completion-E3-seed-config-scope.md
new file mode 100644
index 00000000..e5a0df6e
--- /dev/null
+++ b/PRPs/PRP-showcase-completion-E3-seed-config-scope.md
@@ -0,0 +1,1080 @@
+name: "PRP — Showcase Completion E3: Advanced Seed Config MVP + Store/Product Scope Selection (issue #409)"
+description: |
+
+## Purpose
+
+Implement Parallel epic E3 of the showcase-completion initiative (umbrella #406):
+an additive, allow-listed nested override schema on the seeder HTTP contract
+(7 curated knobs), an additive `seed_overrides` field on `DemoRunRequest` / the
+WS start frame, a store/product focus-pair selector with pre-run preview on the
+Showcase page, frontend + backend validation of every knob, and persistence of
+overrides + user-selected scope into the workspace row (E1 #407 story slots) so
+replay honors them verbatim.
+
+**Execution gate:** this epic is Parallel after Foundation — implementation
+starts ONLY after E1 #407 merges to `dev` (its migration ships the
+`seed_overrides` / `user_scope` JSONB story slots E3 writes into). Every
+dependency on E1's surface is tagged `CONTRACT(E1):` below; re-verify each tag
+against the merged E1 code before starting Task 1.
+
+## Core Principles
+
+1. **Context is King**: every file reference below was verified against the live code on 2026-06-12 (branch dev @ bdf85f6, post-E4/#404 merge — PRE-E1-#407; line numbers will drift slightly after E1 merges, re-anchor by symbol name).
+2. **Validation Loops**: each level is executable as written.
+3. **Information Dense**: patterns cite exact file:line (or symbol when post-E1 drift is likely).
+4. **Progressive Success**: shared override schema → seeder contract → demo start frame → pipeline consumption → workspace persistence → frontend → docs → browser dogfood.
+5. **Global rules**: follow CLAUDE.md / AGENTS.md; all five backend CI gates must pass; UI work follows `.claude/rules/ui-design.md` + `.claude/rules/shadcn-ui.md`.
+
+---
+
+## Goal
+
+A user on `/showcase` ticking **Re-seed first** can open an **Advanced seed
+config** panel and turn 7 curated knobs (store count, product count, window
+days, sparsity, promotion intensity, stockout intensity, noise sigma) before
+running; independently, the user can pick an explicit **store/product focus
+pair** (with a pre-run preview of the selected entities and the seeded window)
+that the pipeline models instead of the auto-discovered first pair. Both the
+overrides and the scope persist into a kept workspace row and are re-submitted
+verbatim on Replay. A start frame without the new fields behaves
+byte-identically to today.
+
+**Deliverable** (all additive; ZERO migrations — E1 #407 owns the schema):
+
+- `app/shared/seeder/overrides.py` — NEW: `SeederOverrides` Pydantic model (the single shared allow-list, `extra="forbid"`), importable by both the seeder and demo slices through `app/shared/` (vertical-slice-legal).
+- `app/features/seeder/schemas.py` — `GenerateParams.overrides: SeederOverrides | None = None` (additive nested optional object on the EXISTING endpoint — decision rationale below).
+- `app/features/seeder/service.py` — `_apply_seed_overrides(config, overrides)` applied LAST in `_build_config_from_params` (wins over the legacy scalar `stores`/`products`/`sparsity`), mapping each knob onto its `SeederConfig` sub-dataclass via `dataclasses.replace`.
+- `app/features/demo/schemas.py` — `DemoRunRequest.seed_overrides: SeederOverrides | None` + `DemoRunRequest.user_scope: UserScope | None` (NEW small model) + two cross-field validators.
+- `app/features/demo/pipeline.py` — `DemoContext` carries both; `step_seed` forwards `overrides` to `POST /seeder/generate`; `step_status` honors `user_scope` (validate via `/dimensions/*/{id}`; warn + fallback to discovery on a dangling pair).
+- `app/features/demo/workspace.py` — `create_workspace` writes the two E1 story slots; list/detail response schemas expose them (replay reads list rows).
+- `frontend/src` — `SeedConfigPanel.tsx` + `ScopeSelector.tsx` (composed from installed shadcn primitives), `lib/workspace-replay.ts` pure replay-frame builder, `types/api.ts` additions, `showcase.tsx` wiring.
+- Tests: seeder schema/route/service tests (incl. out-of-bounds 422 + unknown-knob 422), demo schema JSON-path tests, pipeline `_RecordingClient` forwarding tests, workspace slot persistence tests, replay-verbatim regression (backend integration + frontend pure-helper test), component vitests.
+- Docs: `docs/_base/API_CONTRACTS.md` (3 rows), `docs/_base/RUNBOOKS.md` (new incident entry + workspace-section update), `docs/_base/DOMAIN_MODEL.md` (slot schema documentation).
+
+**Success definition**: all Success Criteria check off, the five backend gates +
+frontend lint/test are green, and a real-browser dogfood shows: an
+overridden re-seed run (e.g. 8 stores × 20 products, promo 0.3) goes green with
+the seed card echoing the overrides; a scope-selected run models the chosen
+pair; a kept run replays both verbatim.
+
+## Why
+
+- Umbrella #406: today the showcase accepts only `seed`/`scenario`/`reset`/`skip_seed`; the preset's behavioral character (noise, promos, stockouts, sparsity) is take-it-or-leave-it, and the modeled grain is always the first discovered `(store, product)` pair (`app/features/demo/pipeline.py:582-631`) — the operator cannot tell the story of a specific SKU.
+- The seeder HTTP contract already accepts 25+ FLAT scalar/flag fields (`app/features/seeder/schemas.py:78-298`) — the umbrella's top risk is that surface growing unbounded. A curated nested object with `extra="forbid"` is the documented mitigation: 7 knobs, mechanically allow-listed, everything else stays preset-driven.
+- E1 #407 reserves `seed_overrides` + `user_scope` JSONB story slots on `showcase_workspace` precisely so this epic's config survives into Replay — without persistence, replay of an overridden run would silently regenerate different data.
+- E3 is Parallel after Foundation: it can land independently of E2 #408 / E4 #410 / E5 #411 / E6 #412 (no shared files beyond additive edits to `showcase.tsx` / `workspace.py` — coordinate merge order if simultaneous).
+
+## What
+
+### Open question resolved — seeder override contract shape
+
+**DECISION: expand `GenerateParams` with an additive nested optional object
+(`overrides: SeederOverrides | None = None`). NO new endpoint.** Rationale,
+researched against the current code:
+
+1. **The layering already exists.** `_build_config_from_params` (`app/features/seeder/service.py:202-247`) is a layered override pipeline: preset → scalar dims/window/sparsity → `_apply_phase1_overrides` (:74-137) → `_apply_phase2_overrides` (:139-199). A `_apply_seed_overrides` applied last is a fourth layer in an established pattern — a new endpoint would have to reimplement or call into this exact function anyway.
+2. **A new endpoint duplicates load-bearing guards.** `POST /seeder/generate` carries `_check_seeder_enabled()` (production guard, `routes.py:21-33`), the ValueError→400 / Exception→500 RFC 7807 envelope (`routes.py:114-136`), and the seeder-is-the-only-bulk-mutation-path invariant. A second generate-shaped endpoint doubles that audit surface for zero contract benefit.
+3. **Back-compat is free.** Absent field = `None` = byte-identical behavior — the exact precedent the Phase 1/Phase 2 field comments in `schemas.py:121-123,175-177` already promise and test.
+4. **Nested (not more flat scalars) is the allow-list mechanism.** `ConfigDict(extra="forbid")` on the nested model makes an unknown knob a 422 — the umbrella's "contract grows unbounded" mitigation becomes machine-enforced, and the 7 curated knobs stay visually distinct from the 25+ legacy scalars.
+5. **One schema serves both slices.** The demo start frame forwards the same object verbatim; placing `SeederOverrides` in `app/shared/seeder/overrides.py` lets `app/features/seeder/schemas.py` and `app/features/demo/schemas.py` both import it without a cross-slice import (precedent: `demo/schemas.py:16` already imports `ScenarioPreset` from `app/shared/seeder/config`).
+
+Trade-off accepted: `extra="forbid"` means a FUTURE knob sent by a newer client
+to an older backend errors loudly instead of being ignored. That asymmetry vs.
+the top-level start frame (unknown TOP-LEVEL keys remain ignored) is
+deliberate — silent knob-dropping would fake-honor a config the run never used.
+
+### Allow-listed knob → config-field mapping (the complete MVP surface)
+
+| Knob (wire name) | Type / bounds | Maps to (via `dataclasses.replace`) | Preset reference values |
+|---|---|---|---|
+| `stores` | `int`, ge=1 le=100 | `config.dimensions.stores` (`DimensionConfig.stores`, `app/shared/seeder/config.py:118`) | demo profiles 3–5; scalar `GenerateParams.stores` caps 100 |
+| `products` | `int`, ge=1 le=500 | `config.dimensions.products` (`DimensionConfig.products`, config.py:119) | demo profiles 10–25; scalar caps 500 |
+| `window_days` | `int`, ge=75 le=365 | `config.start_date = config.end_date - timedelta(days=window_days)` (end_date untouched) | ≥75 keeps the `historical_backfill` gate clear (`pipeline.py` gate = `3*(14+1)+30 = 75`); ≤365 = `DEFAULT_SEED_SPAN_DAYS` |
+| `sparsity` | `float`, ge=0.0 le=0.9 | `config.sparsity = replace(config.sparsity, missing_combinations_pct=v)` (`SparsityConfig.missing_combinations_pct`, config.py:141) — `replace` PRESERVES the preset's `random_gaps_*` fields | sparse preset uses 0.5; 1.0 would seed zero series (hard-fail), hence the 0.9 cap |
+| `promotion_intensity` | `float`, ge=0.0 le=0.5 | `config.retail = replace(config.retail, promotion_probability=v)` (`RetailPatternConfig.promotion_probability`, config.py:101) | preset max 0.25 (holiday_rush); 0.5 cap = 2× headroom |
+| `stockout_intensity` | `float`, ge=0.0 le=0.5 | `config.retail = replace(config.retail, stockout_probability=v)` (config.py:102) | preset max 0.25 (stockout_heavy); higher values risk NaN-WAPE (documented expected-fail, mirrors sparse) |
+| `noise_sigma` | `float`, ge=0.0 le=0.5 | `config.time_series = replace(config.time_series, noise_sigma=v)` (`TimeSeriesConfig.noise_sigma`, config.py:72) | preset max 0.4 (high_variance) |
+
+Precedence (document in the field description AND a service test): nested
+`overrides` is applied LAST in `_build_config_from_params` and therefore WINS
+over the legacy scalar `stores` / `products` / `sparsity` when both are sent.
+`window_days` recomputes `start_date` from the (scalar-or-default) `end_date`.
+The pipeline keeps sending `sparsity=0.0` as the scalar (preserves preset
+character per the `if params.sparsity > 0` guard at `service.py:225-226`);
+`overrides.sparsity` is the only way the demo overrides sparsity.
+
+### `seed_overrides` / `user_scope` slot schemas (THIS PRP's contract to define)
+
+E1 #407 reserves the slots; the JSON inside them is defined HERE:
+
+```jsonc
+// showcase_workspace.seed_overrides  (JSONB; NULL when the run had none)
+// = SeederOverrides.model_dump(mode="json", exclude_none=True) — SPARSE:
+//   only operator-set knobs appear; {} never stored (None instead).
+{
+  "stores": 8,                  // int 1..100, optional
+  "products": 20,               // int 1..500, optional
+  "window_days": 120,           // int 75..365, optional
+  "sparsity": 0.3,              // float 0.0..0.9, optional
+  "promotion_intensity": 0.3,   // float 0.0..0.5, optional
+  "stockout_intensity": 0.1,    // float 0.0..0.5, optional
+  "noise_sigma": 0.25           // float 0.0..0.5, optional
+}
+
+// showcase_workspace.user_scope  (JSONB; NULL when no pair was picked)
+// = UserScope.model_dump(mode="json") — both keys always present when non-null:
+{
+  "store_id": 12,               // int ge=1 — REAL discovered id (sequences
+  "product_id": 47              // int ge=1    never reset; ids are NOT 1-based)
+}
+```
+
+Replay semantics: the slots record the REQUESTED config (replay-verbatim
+contract, mirrors the E1 seed/scenario/reset/skip_seed columns). The EFFECTIVE
+grain a run actually modeled is already recorded separately by
+`finalize_workspace` into the `store_id` / `product_id` columns
+(`workspace.py:136-137`) — when a replayed `user_scope` dangles (warn+fallback,
+below), the two will legitimately differ; that divergence is visible, not
+hidden.
+
+### User-visible behavior
+
+- **Advanced seed config panel** (`/showcase`): a collapsible "Advanced seed config" section appears under the run controls, enabled ONLY while **Re-seed first** is ticked (overrides are meaningless on `skip_seed=true` and the backend rejects the combination). 7 controls with the bounds above; a "live summary" line echoes the effective config (e.g. "8 stores × 20 products × 120 days · promo 0.30"); a caveat notes high sparsity/stockout values can legitimately fail the backtest (NaN WAPE — same documented semantics as the `sparse` preset). `window_days` control is disabled with an explanatory tooltip when the `holiday_rush` preset is selected (calendar-pinned window).
+- **Store/product focus-pair selector**: two dropdowns (stores, products — fed by `GET /dimensions/stores` / `GET /dimensions/products`, `page_size=100`) plus a pre-run preview card showing the chosen store (code/name/region/type), product (sku/name/category/brand) and the currently seeded window (from `GET /seeder/status`). Works WITHOUT re-seeding (scope selection on the existing dataset is the primary use). Ticking **Reset database** clears the selection with a caveat ("a wipe re-issues ids — re-pick after the run"), because Postgres sequences never reset (memory anchor: seeder-does-not-reset-id-sequences).
+- **Run**: the start frame carries `seed_overrides` (only when re-seeding and ≥1 knob set) and `user_scope` (when a pair is picked). The seed step card echoes the overridden dims; the status step card says "user-selected pair" vs "discovered pair".
+- **Replay** of a kept run re-submits recorded `seed_overrides` + `user_scope` verbatim alongside the existing 4 config fields. Load repopulates the panel + selector.
+- **Legacy behavior**: a start frame without the new fields is byte-identical to today (contract test).
+
+### Technical requirements
+
+- All new request fields are additive `Optional` with `None` defaults; the WS start frame keeps ignoring unknown TOP-LEVEL keys (`DemoRunRequest` default `extra=ignore`); the nested models use `extra="forbid"` (allow-list enforcement).
+- `SeederOverrides` and `UserScope` carry `ConfigDict(strict=True, extra="forbid")`. All fields are JSON-native (`int`/`float`) → NO `Field(strict=False)` override needed and the strict-mode AST policy test (`app/core/tests/test_strict_mode_policy.py`) stays green. Runtime-verified on pydantic 2.12.5: a nested-model field under a `strict=True` parent validates from the JSON-parsed dict (FastAPI's `validate_python` path) — see verification log.
+- All config is start-frame-time. NOTHING is configurable mid-run — the pipeline is strictly linear under the module-level `asyncio.Lock` (design invariant from umbrella #406; do not add any mid-run mutation channel).
+- The demo slice must not import `app/features/seeder/*` — `SeederOverrides` lives in `app/shared/seeder/overrides.py`; `UserScope` lives in `app/features/demo/schemas.py` (demo-only concept). `pipeline.py` may import both (`app.shared.*` + own-slice schemas are already imported at `pipeline.py:43-45`).
+- The seeder stays the only bulk-mutation path; no new wipe semantics; `_check_seeder_enabled` untouched.
+- E3 ships ZERO Alembic migrations. CONTRACT(E1): the `seed_overrides` + `user_scope` JSONB slots exist on `showcase_workspace` (E1 #407 migration) before this epic executes.
+
+### Success Criteria
+
+- [ ] `POST /seeder/generate` accepts `{"overrides": {"stores": 8, "promotion_intensity": 0.3}}` → 201, and the generated config reflects the knobs (service unit test); `{"overrides": {"stores": 0}}` → 422; `{"overrides": {"bogus_knob": 1}}` → 422; a body WITHOUT `overrides` produces a byte-identical `SeederConfig` to today (regression test).
+- [ ] `DemoRunRequest.model_validate({...})` JSON-path tests: `seed_overrides` with `skip_seed=true` → ValidationError; `window_days` with `scenario="holiday_rush"` → ValidationError; legacy 4-field frame still validates; `user_scope` happy path.
+- [ ] `step_seed` forwards `overrides` in the `/seeder/generate` POST body (`_RecordingClient` assertion); `step_status` uses a valid `user_scope` pair (asserts the GET-by-id calls + ctx fields), and WARNS + falls back to discovery on a 404 pair.
+- [ ] A `preservation="keep"` run records `seed_overrides` + `user_scope` into the E1 story slots; `GET /demo/workspaces` list items AND `/{id}` detail expose both; the e2e replay regression (`tests/test_e2e_demo.py::test_demo_replay_same_config_twice` extended or sibling test) proves a replayed row carries identical slot JSON.
+- [ ] Frontend: panel renders 7 bounded controls only when Re-seed is ticked; selector previews the chosen pair; `workspaceToRunRequest(ws)` unit test proves replay-verbatim including the new fields; `pnpm lint && pnpm test --run` green; no NEW `tsc -b` errors in touched files.
+- [ ] Legacy start frames byte-identical (backend contract test + existing demo tests untouched-green).
+- [ ] Backend gates green: `uv run ruff check . && uv run ruff format --check . && uv run mypy app/ && uv run pyright app/ && uv run pytest -v -m "not integration"`.
+- [ ] Docs updated additively: API_CONTRACTS (seeder + demo + WS rows), RUNBOOKS (new showcase incident entry + workspace-section note), DOMAIN_MODEL (slot schemas under the `showcase_workspace` aggregate).
+- [ ] Real-browser dogfood (Level 4) performed.
+
+## All Needed Context
+
+### Documentation & References
+
+```yaml
+# MUST READ — codebase patterns (verified 2026-06-12, dev @ bdf85f6 — PRE-E1;
+# re-anchor line numbers by symbol after E1 #407 merges)
+
+- file: app/features/seeder/schemas.py
+  why: |
+    GenerateParams at 78-298 — the contract to extend. Note the Phase 1
+    comment block at 121-123 ("All flags default off so existing scenarios
+    remain byte-identical") — copy that promise onto the new field. The model
+    is plain BaseModel (NO ConfigDict(strict=True)) — do NOT add strict mode
+    to GenerateParams itself (it has date fields start_date/end_date; only
+    the NEW nested SeederOverrides model is strict).
+    ChangepointEventParam at 51-64 is the existing nested-model-in-params
+    precedent (list[ChangepointEventParam] at 153-156).
+
+- file: app/features/seeder/service.py
+  why: |
+    _build_config_from_params at 202-247 — THE integration point. Scalar
+    overrides at 218-226 (dataclasses.replace on dimensions; sparsity only
+    when > 0); _apply_phase1_overrides at 74-137 and _apply_phase2_overrides
+    at 139-199 are the mutate-config-in-place pattern to mirror for
+    _apply_seed_overrides. APPLY THE NEW LAYER LAST (after :241) so nested
+    wins over scalars. from dataclasses import replace already imported (:7).
+
+- file: app/shared/seeder/config.py
+  why: |
+    The override targets: TimeSeriesConfig.noise_sigma :72,
+    RetailPatternConfig.promotion_probability/stockout_probability :101-102,
+    DimensionConfig.stores/products :118-119,
+    SparsityConfig.missing_combinations_pct :141 (+ random_gaps fields to
+    PRESERVE via replace). ScenarioPreset :37-47. holiday_rush pinned window
+    :553-579 (the reason window_days is rejected for that preset).
+    DEFAULT_SEED_SPAN_DAYS=365 :10. NO Pydantic here — config.py stays
+    dataclasses; the new Pydantic model goes in a NEW sibling module
+    app/shared/seeder/overrides.py.
+
+- file: app/features/seeder/routes.py
+  why: |
+    POST /seeder/generate at 85-136 — NO route-code change needed (the body
+    model change flows through); read for the _check_seeder_enabled guard
+    (21-33) and the error envelope you must NOT duplicate (the
+    no-new-endpoint rationale).
+
+- file: app/features/demo/schemas.py
+  why: |
+    DemoRunRequest at 29-85 — the model to extend. The model_validator
+    _workspace_name_requires_keep at 80-85 is the EXACT cross-field-rule
+    pattern for the two new validators. The docstring at 30-38 explains the
+    strict-mode policy; scenario's strict=False override at 59-63 (enum) —
+    nested BaseModel fields need NO such override (runtime-verified).
+    WorkspaceListItem at 169-190 / WorkspaceDetailResponse at 192-203 — add
+    seed_overrides + user_scope to BOTH (replay reads LIST rows:
+    showcase.tsx:174-186). CONTRACT(E1): E1's PRP may already have surfaced
+    the story slots on these response models — if so, verify shape
+    (dict[str, Any] | None) and skip the duplicate edit.
+
+- file: app/features/demo/pipeline.py
+  why: |
+    DemoContext at 212-263 — add seed_overrides/user_scope fields (follow the
+    PRP-38/39/40 additive-Optional comment style). step_seed at 541-579 —
+    extend the POST body; _SCENARIO_SEED_PROFILE at 513-538 supplies the
+    defaults overrides partially replace. step_status at 582-631 — the
+    first-pair discovery to branch around for user_scope (its docstring
+    already states ids are NOT 1-based). run_pipeline ctx construction at
+    2646-2651 — thread the two new req fields. StepStatus literal includes
+    "warn" (schemas.py:19) and only "fail" stops the run (:2729-2738) — the
+    warn+fallback path is safe. CRITICAL header rule :18-19: pipeline must
+    NOT import app.features.* outside its own slice — app.shared.* is fine.
+
+- file: app/features/demo/workspace.py
+  why: |
+    create_workspace at 46-79 — add the two slot writes on the
+    ShowcaseWorkspace(...) constructor; warn-and-continue contract at 10-13
+    (a slot-write failure must never break the run — the try/except already
+    guarantees it). finalize_workspace at 106-155 — NO change for the slots
+    (recorded at create); note store_id/product_id columns at 136-137 record
+    the EFFECTIVE grain (divergence-visible design).
+    CONTRACT(E1): E1 refactors create_workspace to write its new columns —
+    rebase this edit onto E1's merged version.
+
+- file: app/features/demo/models.py
+  why: |
+    ShowcaseWorkspace ORM — E3 does NOT edit this file. CONTRACT(E1): after
+    E1 merges it carries seed_overrides/user_scope as JSONB story slots;
+    verify the exact attribute names/types there before writing
+    workspace.py code. (Assumed shape: nullable JSONB columns mirroring the
+    created_objects precedent at 77-79.)
+
+- file: app/features/demo/tests/test_pipeline.py
+  why: |
+    _RecordingClient at 1025-1068 (records (method, path, json_body) per
+    call, canned responses keyed by (method, path-prefix)); _as_client cast
+    at 1070+. Reuse for: overrides-forwarding, user_scope GET-by-id calls,
+    warn+fallback (404 canned response).
+
+- file: app/features/demo/tests/test_schemas.py
+  why: |
+    The JSON-path test conventions: test_demo_run_request_json_path_keep_
+    with_name :67, test_demo_run_request_legacy_frame_still_validates :75,
+    test_demo_run_request_workspace_name_requires_keep :83 — mirror all
+    three shapes for the new fields.
+
+- file: app/features/seeder/tests/test_routes.py
+  why: |
+    Route-test harness: client fixture :15 (TestClient + mocked settings,
+    seeder_allow_production=True), TestGenerate :96 — add overrides 201 /
+    422-bounds / 422-unknown-knob cases here. test_generate_validation_error
+    :157 is the 422 pattern.
+
+- file: app/features/seeder/tests/test_service.py
+  why: |
+    Service-test patterns for _build_config_from_params — add: knob→field
+    mapping, precedence-over-scalars, window_days math, preset-character
+    preservation (e.g. sparse preset's random_gaps survive an overrides.
+    sparsity replace), and the no-overrides byte-identical regression.
+
+- file: tests/test_e2e_demo.py
+  why: |
+    test_demo_replay_same_config_twice at 561-609 — the replay-regression
+    guard to extend (or sibling): a keep-run with seed_overrides+user_scope,
+    replayed, must produce a second row with identical slot JSON.
+
+- file: frontend/src/pages/showcase.tsx
+  why: |
+    Wiring surface. handleRun start frame at 139-156 (conditional-spread
+    pattern for optional fields — reuse for seed_overrides/user_scope);
+    handleLoadWorkspace at 160-168 (repopulate panel+selector);
+    handleReplayWorkspace at 174-186 (REPLACE its inline object with the new
+    workspaceToRunRequest helper); controls block at 269-363 (panel +
+    selector land after the existing checkboxes); reset checkbox at 301-311
+    (hook the scope-clearing caveat here).
+
+- file: frontend/src/types/api.ts
+  why: |
+    DemoRunRequest at 778-788 (+ seed_overrides?/user_scope?);
+    WorkspaceListItem at 806-816 and WorkspaceDetail at 819-825 (+ both
+    fields, `| null`); add SeedOverrides + UserScope interfaces near the
+    demo block. WARNING: MIXED CRLF/LF line endings — surgical edits only;
+    verify `git diff --stat` stays small.
+
+- file: frontend/src/hooks/use-stores.ts
+  why: |
+    useStores at 16-43 (TanStack Query over /dimensions/stores with
+    page/page_size/enabled) — the selector's data source; use-products.ts
+    mirrors it (useProducts :16, useProduct :45). page_size hard cap is 100
+    (app/features/dimensions/routes.py:62,187).
+
+- file: frontend/src/hooks/use-seeder.ts
+  why: useSeederStatus :15 — the seeded-window source for the preview card.
+
+- file: frontend/src/hooks/use-demo-pipeline.ts
+  why: |
+    start(req) at 241-249 sends the req object as the WS start frame
+    verbatim — generic over the widened DemoRunRequest; NO change needed
+    (read to confirm). RunHistoryStrip replays stored req objects, so
+    localStorage replays inherit the new fields for free.
+
+- file: frontend/src/components/demo/ScenarioPicker.test.tsx
+  why: |
+    The vitest + @testing-library/react + afterEach(cleanup) harness pattern
+    for the two new component test files.
+
+- file: frontend/src/components/ui/
+  why: |
+    Installed primitives: collapsible.tsx, select.tsx, slider.tsx, input.tsx,
+    badge.tsx, card.tsx, tooltip.tsx, checkbox.tsx — the panel + selector
+    compose from these; NO new shadcn install required. If one becomes
+    necessary anyway: pin `pnpm dlx shadcn@4.7.0 add ...` (5.x writes a stub
+    pnpm-workspace.yaml and skips the component) and use per-component
+    @radix-ui/react-X imports, never the radix barrel.
+
+- file: docs/_base/RUNBOOKS.md
+  why: |
+    "Showcase page (/showcase) pipeline fails at step X" — numbered entries
+    end at 28; append entry 29 (overrides/scope incident matrix) in the same
+    bold-trigger/Cause/Fix format. The "Showcase workspace —
+    preserve/restore/replay/delete semantics" section's "Explicitly out of
+    scope" list says advanced seed configuration is NOT implemented — E3
+    DELIVERS it: rewrite that bullet (move seed_overrides/user_scope to the
+    documented surface; phase-level config stays out of scope).
+
+- file: docs/_base/API_CONTRACTS.md
+  why: |
+    Rows to extend additively: the /seeder/* row (mention the overrides
+    object on POST /seeder/generate), POST /demo/run, and the WS
+    /demo/stream start-frame bullet (E1/E2 notes were just added — append an
+    "E3 (#409)" note, don't disturb them).
+
+- file: docs/_base/DOMAIN_MODEL.md
+  why: |
+    showcase_workspace aggregate section — document the seed_overrides /
+    user_scope slot JSON schemas (the umbrella's "JSONB story slots become a
+    junk drawer" mitigation requires documented slot schemas here).
+
+- file: PRPs/PRP-showcase-workspace-E2-preset-exposure.md
+  why: |
+    Closest predecessor (preset exposure + seed profiles) — its gotcha block
+    (holiday_rush pinning, seeder precedence, sparse NaN-WAPE, frontend tsc
+    gate) all recur in E3; this PRP inherits and extends them.
+
+# Issue / initiative context
+- url: https://github.com/w7-mgfcode/ForecastLabAI/issues/409
+  why: The epic this PRP implements (Parallel after Foundation E1 #407).
+- url: https://github.com/w7-mgfcode/ForecastLabAI/issues/406
+  why: |
+    Umbrella — Approach ("all configuration is start-frame-time", "no new
+    router outside existing slices"), Risks table row 1 (the allow-list
+    mitigation this PRP implements), out-of-scope list (NO mid-run controls,
+    NO embedded scenario-builder).
+- url: https://github.com/w7-mgfcode/ForecastLabAI/issues/407
+  why: |
+    Foundation epic whose contract is GIVEN: JSONB story slots incl.
+    seed_overrides + user_scope; columns replayed_from_workspace_id /
+    archived / pinned / notes / tags / config_schema_version; PATCH
+    /demo/workspaces/{id}. E3 builds on, never re-decides, this surface.
+
+# External references
+- url: https://docs.pydantic.dev/latest/concepts/strict_mode/
+  why: |
+    Strict-mode semantics for nested models: a model-typed field validates
+    dict input using the NESTED model's own config — confirmed empirically
+    (verification log) so no doc-faith is required. NOTE: the docs site
+    301-redirects and anchors have drifted; the runtime verification in the
+    Known Gotchas log is the authoritative claim, not this URL.
+- url: https://docs.pydantic.dev/latest/api/config/#pydantic.config.ConfigDict.extra
+  why: extra="forbid" → unknown nested keys raise ValidationError (the 422 allow-list mechanism).
+```
+
+### Current Codebase tree (relevant subset, pre-E1)
+
+```bash
+app/shared/seeder/
+├── config.py                 # dataclasses; override TARGETS (no Pydantic here)
+├── core.py / generators/     # consume SeederConfig — untouched by E3
+app/features/seeder/
+├── schemas.py                # GenerateParams @78 (25+ flat fields)
+├── service.py                # _build_config_from_params @202; _apply_phaseN @74/@139
+├── routes.py                 # POST /generate @85 (guard @21; no route change)
+└── tests/                    # test_routes.py, test_service.py, test_schemas.py
+app/features/demo/
+├── schemas.py                # DemoRunRequest @29; Workspace* responses @169
+├── pipeline.py               # DemoContext @212; step_seed @541; step_status @582; run_pipeline @2618
+├── workspace.py              # create_workspace @46; finalize_workspace @106
+├── models.py                 # ShowcaseWorkspace (E1 adds the story slots — not edited here)
+└── tests/                    # test_pipeline.py (_RecordingClient @1025), test_schemas.py, test_workspace.py
+tests/test_e2e_demo.py        # replay regression @561
+frontend/src/
+├── pages/showcase.tsx        # handleRun @139; handleLoad @160; handleReplay @174; controls @269
+├── types/api.ts              # DemoRunRequest @778; WorkspaceListItem @806 (MIXED CRLF/LF)
+├── hooks/use-stores.ts, use-products.ts, use-seeder.ts, use-demo-pipeline.ts
+└── components/demo/          # ScenarioPicker, WorkspacePanel, ... (+ index.ts barrel)
+```
+
+### Desired Codebase tree (files added/modified)
+
+```bash
+app/shared/seeder/overrides.py            # NEW — SeederOverrides (strict, extra=forbid, 7 knobs)
+app/shared/seeder/tests/test_overrides.py # NEW — bounds, forbid, JSON-path, sparse-dump tests
+app/features/seeder/schemas.py            # MOD — GenerateParams.overrides: SeederOverrides | None
+app/features/seeder/service.py            # MOD — _apply_seed_overrides, wired LAST in _build_config_from_params
+app/features/seeder/tests/test_service.py # MOD — mapping/precedence/window/byte-identical tests
+app/features/seeder/tests/test_routes.py  # MOD — 201-with-overrides, 422-bounds, 422-unknown-knob
+app/features/demo/schemas.py              # MOD — UserScope; DemoRunRequest fields + validators; Workspace* responses
+app/features/demo/pipeline.py             # MOD — DemoContext fields; step_seed forward; step_status scope branch
+app/features/demo/workspace.py            # MOD — create_workspace writes both slots
+app/features/demo/tests/test_schemas.py   # MOD — JSON-path + validator tests
+app/features/demo/tests/test_pipeline.py  # MOD — forwarding + scope + warn/fallback tests
+app/features/demo/tests/test_workspace.py # MOD — slot persistence tests
+tests/test_e2e_demo.py                    # MOD — replay-verbatim regression incl. slots (integration)
+frontend/src/types/api.ts                 # MOD — SeedOverrides, UserScope, DemoRunRequest, Workspace* (surgical)
+frontend/src/lib/workspace-replay.ts      # NEW — workspaceToRunRequest(ws) pure helper
+frontend/src/lib/workspace-replay.test.ts # NEW — replay-verbatim FE regression
+frontend/src/components/demo/SeedConfigPanel.tsx        # NEW — collapsible 7-knob panel
+frontend/src/components/demo/SeedConfigPanel.test.tsx   # NEW
+frontend/src/components/demo/ScopeSelector.tsx          # NEW — pair selector + preview card
+frontend/src/components/demo/ScopeSelector.test.tsx     # NEW
+frontend/src/components/demo/index.ts     # MOD — export the two new components (match barrel style)
+frontend/src/pages/showcase.tsx           # MOD — wiring (state, panel, selector, start frames)
+docs/_base/API_CONTRACTS.md               # MOD — seeder overrides + /demo/run + WS start-frame E3 notes
+docs/_base/RUNBOOKS.md                    # MOD — showcase incident 29 + workspace-section scope update
+docs/_base/DOMAIN_MODEL.md                # MOD — slot schemas on the showcase_workspace aggregate
+```
+
+### Known Gotchas & Library Quirks
+
+```python
+# CRITICAL — EXECUTION ORDER: do not start until E1 #407 is merged to dev.
+#   E3 writes JSONB slots that E1's migration creates. First action of Task 1:
+#   re-read app/features/demo/models.py + workspace.py on the post-E1 dev and
+#   re-anchor every CONTRACT(E1) tag in this PRP.
+
+# CRITICAL — pydantic strict + nested models (runtime-verified 2026-06-12 on
+#   pydantic 2.12.5; re-run on lib upgrade):
+#   uv run python -c "
+#   from pydantic import BaseModel, ConfigDict, Field
+#   class N(BaseModel):
+#       model_config = ConfigDict(strict=True, extra='forbid')
+#       stores: int | None = Field(default=None, ge=1, le=100)
+#   class P(BaseModel):
+#       model_config = ConfigDict(strict=True)
+#       seed_overrides: N | None = None
+#   print(P.model_validate({'seed_overrides': {'stores': 5}}))          # OK — dict→model under strict
+#   P.model_validate({'seed_overrides': {'stores': 999}})               # ValidationError (bounds)
+#   "
+#   and N.model_validate({'stores': 5, 'bogus': 1}) → ValidationError (forbid).
+#   Conclusions baked into the design: NO Field(strict=False) needed on the
+#   nested field; extra='forbid' IS the allow-list; FastAPI's validate_python
+#   path (the JSON dict) works. All knobs are int/float → the strict-mode AST
+#   policy test (app/core/tests/test_strict_mode_policy.py) does not fire.
+
+# CRITICAL — do NOT add ConfigDict(strict=True) to GenerateParams itself: it
+#   has date fields (start_date/end_date) and is deliberately non-strict today.
+#   Only the NEW nested models are strict.
+
+# CRITICAL — seeder override precedence (service.py:213-226 + the new layer):
+#   preset → scalar stores/products/window/sparsity → phase1 → phase2 →
+#   overrides (LAST, wins). Use dataclasses.replace for every sub-config so
+#   preset-customized sibling fields survive (e.g. sparse preset's
+#   random_gaps_per_series when overrides.sparsity is set; scenario-customized
+#   region/category lists when overrides.stores is set — same reason the
+#   existing scalar override at :218-222 uses replace).
+
+# CRITICAL — holiday_rush is CALENDAR-PINNED (config.py:553-579): its
+#   HolidayConfig spikes are fixed 2024 dates. seed_overrides.window_days on
+#   scenario='holiday_rush' must be REJECTED at DemoRunRequest validation
+#   (clear ValueError message), not silently ignored — a shifted window
+#   silently drops every holiday spike. Direct /seeder/generate callers who
+#   combine them are out of scope (the preset docstring already documents
+#   explicit-dates-to-shift).
+
+# CRITICAL — seed_overrides requires skip_seed=False. The seed step is skipped
+#   on skip_seed=true (pipeline.py:543-544) so overrides would be a silent
+#   no-op; reject in a model_validator (mirror _workspace_name_requires_keep,
+#   schemas.py:80-85). The frontend enforces the same by gating the panel on
+#   the Re-seed checkbox.
+
+# CRITICAL — ids are NOT 1-based (step_status docstring, pipeline.py:585-587;
+#   memory anchor seeder-does-not-reset-id-sequences). The scope selector MUST
+#   be fed from live /dimensions data, never synthesized ids. user_scope can
+#   dangle after reset+reseed → step_status WARN + fallback to discovery (the
+#   replay path of a reset=true workspace would otherwise hard-fail forever).
+#   "warn" does NOT stop the run (only "fail" does — pipeline.py:2729-2738).
+
+# CRITICAL — high stockout_intensity / sparsity overrides can legitimately
+#   FAIL the backtest (all-NaN WAPE → step_backtest FAIL by design; same
+#   semantics as the sparse preset, RUNBOOKS incident 28). Do NOT add a
+#   graceful-skip; ship the panel caveat + runbook entry 29 instead.
+
+# CRITICAL — workspace writes stay warn-and-continue (workspace.py:10-13).
+#   The slot writes go INSIDE the existing try/except in create_workspace; a
+#   failure yields workspace_id=None and a green run, never an exception.
+
+# GOTCHA — replay reads WorkspaceListItem (the LIST row — showcase.tsx:174):
+#   seed_overrides/user_scope must be on the LIST response, not detail-only.
+#   CONTRACT(E1): if E1 already exposed the slots detail-only, ADD them to the
+#   list item here (cheap; sparse JSONB).
+
+# GOTCHA — frontend type gates: `pnpm tsc --noEmit` is vacuous (solution-style
+#   tsconfig) and `pnpm tsc -b` fails with ~24 PRE-EXISTING errors on dev,
+#   none in demo components. Gate on `pnpm lint && pnpm test --run` plus:
+#   cd frontend && pnpm tsc -b 2>&1 | grep -E "SeedConfigPanel|ScopeSelector|workspace-replay|types/api|pages/showcase"  # expect empty
+
+# GOTCHA — frontend/src/types/api.ts has MIXED CRLF/LF line endings; repo-wide
+#   files are inconsistently CRLF/LF. Keep edits surgical; check
+#   `git diff --stat` before committing (Edit/Write emit LF — avoid whole-file
+#   noise diffs).
+
+# GOTCHA — shadcn: compose from INSTALLED primitives (collapsible, select,
+#   slider, input, badge, tooltip — frontend/src/components/ui/). Semantic
+#   tokens only (text-muted-foreground, border-primary, text-destructive for
+#   the reset caveat — mirrors showcase.tsx:309). Never raw colors.
+
+# GOTCHA — mypy --strict AND pyright --strict gate every backend edit. The
+#   DemoContext additions need full annotations (SeederOverrides | None);
+#   pipeline.py imports them from app.shared.seeder.overrides (NOT from the
+#   seeder feature slice — vertical-slice rule, pipeline.py:18-19).
+
+# GOTCHA — step_seed currently derives the detail line from profile dims
+#   (pipeline.py:577). With overrides, compute effective stores/products =
+#   override-or-profile for BOTH the POST scalars and the detail string so
+#   the card tells the truth; keep scalar sparsity=0.0 (preset-character
+#   guard); the nested object carries the operator's sparsity.
+
+# CONVENTION — commits (every one references #409; no AI trailer; scopes from
+#   .claude/rules/commit-format.md — seeder slice ⊂ `data`, demo slice ⊂ `api`):
+#   feat(data): add allow-listed nested seed overrides to seeder contract (#409)
+#   feat(api): thread seed overrides and user scope through demo pipeline (#409)
+#   feat(ui): add advanced seed config panel and scope selector to showcase (#409)
+#   test(api): cover replay-verbatim seed overrides and scope slots (#409)
+#   docs(docs): document seed override contract and workspace slots (#409)
+#   docs(repo): track showcase completion e3 prp (#409)
+#   Branch off dev: feat/showcase-completion-e3-seed-config-scope (49 chars ≤ 50).
+
+# RUNTIME-VERIFICATION LOG (per prp-create step 3):
+#   - pydantic 2.12.5 nested-strict + extra=forbid + bounds behavior verified
+#     with the command in the CRITICAL block above (all four assertions pass).
+#   - Seeder precedence semantics read directly from service.py:202-247 (not
+#     inferred); the `if params.sparsity > 0` guard confirmed at :225-226.
+#   - dimensions page_size cap 100 confirmed at app/features/dimensions/
+#     routes.py:62 and :187.
+#   - `pnpm tsc -b` pre-existing-failure state re-confirmed by the E2 PRP log
+#     (2026-06-12); no demo-component errors.
+#   - No other third-party API claims — everything else cites in-repo code.
+```
+
+## Implementation Blueprint
+
+### Data models and structure
+
+```python
+# app/shared/seeder/overrides.py  (NEW)
+"""Curated, allow-listed seed-override schema (E3, issue #409).
+
+Shared between the seeder slice (GenerateParams.overrides) and the demo slice
+(DemoRunRequest.seed_overrides) — app/shared is the sanctioned cross-slice
+home (vertical-slice rule). extra='forbid' IS the allow-list: any knob not
+listed here is a 422 at the HTTP boundary (umbrella #406 risk mitigation —
+the full 25+ knob surface stays preset-driven).
+"""
+from pydantic import BaseModel, ConfigDict, Field
+
+class SeederOverrides(BaseModel):
+    # strict=True catches JSON-native coercion bugs ("5" → 5); every field is
+    # int/float so no Field(strict=False) override is needed (security-patterns.md).
+    model_config = ConfigDict(strict=True, extra="forbid")
+
+    stores: int | None = Field(default=None, ge=1, le=100, description="Store count → DimensionConfig.stores; wins over the scalar `stores` param.")
+    products: int | None = Field(default=None, ge=1, le=500, description="Product count → DimensionConfig.products; wins over the scalar `products` param.")
+    window_days: int | None = Field(default=None, ge=75, le=365, description="Seeded window length; start_date = end_date - window_days. >=75 keeps the showcase historical_backfill gate clear. Rejected on the calendar-pinned holiday_rush preset (demo surface).")
+    sparsity: float | None = Field(default=None, ge=0.0, le=0.9, description="Missing (store,product) grain fraction → SparsityConfig.missing_combinations_pct; preserves the preset's gap config. 1.0 disallowed (zero series).")
+    promotion_intensity: float | None = Field(default=None, ge=0.0, le=0.5, description="→ RetailPatternConfig.promotion_probability (preset max 0.25).")
+    stockout_intensity: float | None = Field(default=None, ge=0.0, le=0.5, description="→ RetailPatternConfig.stockout_probability. High values can legitimately NaN-WAPE-fail the backtest (documented).")
+    noise_sigma: float | None = Field(default=None, ge=0.0, le=0.5, description="→ TimeSeriesConfig.noise_sigma (preset max 0.4).")
+
+    def is_empty(self) -> bool:
+        """True when no knob is set ({} on the wire) — treated as None everywhere."""
+        return not self.model_dump(exclude_none=True)
+```
+
+```python
+# app/features/demo/schemas.py — additions (demo-only concept stays in-slice)
+class UserScope(BaseModel):
+    """Operator-selected (store, product) focus pair (E3, issue #409).
+
+    Ids are REAL discovered ids (sequences never reset — ids are not 1-based);
+    step_status validates them and warn-falls-back to discovery when dangling.
+    """
+    model_config = ConfigDict(strict=True, extra="forbid")
+    store_id: int = Field(..., ge=1)
+    product_id: int = Field(..., ge=1)
+
+# DemoRunRequest — two additive Optional fields + two validators:
+#   seed_overrides: SeederOverrides | None = None   (import from app.shared.seeder.overrides)
+#   user_scope: UserScope | None = None
+#
+# @model_validator(mode="after") _seed_overrides_require_reseed:
+#   if self.seed_overrides is not None and not self.seed_overrides.is_empty()
+#      and self.skip_seed:
+#       raise ValueError("seed_overrides requires skip_seed=false (Re-seed first)")
+#   # normalize: an empty overrides object collapses to None
+#   if self.seed_overrides is not None and self.seed_overrides.is_empty():
+#       self.seed_overrides = None      # NOTE: model_validator(after) may mutate self
+#
+# @model_validator(mode="after") _window_days_forbidden_on_holiday_rush:
+#   if (self.seed_overrides is not None
+#       and self.seed_overrides.window_days is not None
+#       and self.scenario is ScenarioPreset.HOLIDAY_RUSH):
+#       raise ValueError("window_days cannot override the calendar-pinned holiday_rush window")
+#
+# WorkspaceListItem (+ WorkspaceDetailResponse inherits):
+#   seed_overrides: dict[str, Any] | None = Field(default=None, ...)
+#   user_scope: dict[str, Any] | None = Field(default=None, ...)
+#   (from_attributes=True already set — ORM JSONB maps straight through.
+#    CONTRACT(E1): skip if E1's PRP already added them; ensure LIST exposure.)
+```
+
+```python
+# app/features/seeder/service.py — the new layer (mirror _apply_phase2_overrides)
+def _apply_seed_overrides(config: SeederConfig, overrides: SeederOverrides | None) -> None:
+    """Apply the curated nested overrides LAST — wins over scalar params.
+
+    dataclasses.replace is field-precise: preset-customized sibling fields
+    (region/category lists, random_gaps_*) survive every knob.
+    """
+    if overrides is None:
+        return
+    if overrides.stores is not None or overrides.products is not None:
+        config.dimensions = replace(
+            config.dimensions,
+            stores=overrides.stores if overrides.stores is not None else config.dimensions.stores,
+            products=overrides.products if overrides.products is not None else config.dimensions.products,
+        )
+    if overrides.window_days is not None:
+        config.start_date = config.end_date - timedelta(days=overrides.window_days)
+    if overrides.sparsity is not None:
+        config.sparsity = replace(config.sparsity, missing_combinations_pct=overrides.sparsity)
+    if overrides.promotion_intensity is not None or overrides.stockout_intensity is not None:
+        config.retail = replace(
+            config.retail,
+            promotion_probability=(overrides.promotion_intensity
+                                   if overrides.promotion_intensity is not None
+                                   else config.retail.promotion_probability),
+            stockout_probability=(overrides.stockout_intensity
+                                  if overrides.stockout_intensity is not None
+                                  else config.retail.stockout_probability),
+        )
+    if overrides.noise_sigma is not None:
+        config.time_series = replace(config.time_series, noise_sigma=overrides.noise_sigma)
+# Wire-in (one line, AFTER _apply_phase2_overrides at :241):
+#   _apply_seed_overrides(config, params.overrides)
+```
+
+```python
+# app/features/demo/pipeline.py — step changes (sketch)
+
+# DemoContext additions (after workspace_name, with an E3 #409 comment):
+#   seed_overrides: SeederOverrides | None = None
+#   user_scope: UserScope | None = None
+# run_pipeline ctx construction: thread req.seed_overrides / req.user_scope.
+
+# step_seed — effective dims + verbatim forward:
+#   stores = ctx.seed_overrides.stores if (ctx.seed_overrides and ctx.seed_overrides.stores) else profile.stores
+#   products = ... same for products ...
+#   window: if ctx.seed_overrides and ctx.seed_overrides.window_days:
+#       seed_end = datetime.now(UTC).date(); seed_start = seed_end - timedelta(days=ctx.seed_overrides.window_days)
+#   elif profile.window is not None: ... (existing pinned branch; validator already
+#       guarantees window_days is never set on holiday_rush)
+#   json_body gains: **({"overrides": ctx.seed_overrides.model_dump(exclude_none=True)}
+#                      if ctx.seed_overrides else {})
+#   detail line + data echo the effective dims and "overrides" keys applied.
+
+# step_status — user-scope branch BEFORE first-pair discovery:
+#   if ctx.user_scope is not None:
+#       try:
+#           store_body = await client.request("status[scope-store]", "GET",
+#               f"/dimensions/stores/{ctx.user_scope.store_id}")
+#           product_body = await client.request("status[scope-product]", "GET",
+#               f"/dimensions/products/{ctx.user_scope.product_id}")
+#       except _StepError:
+#           scope_warn = ("user_scope (store=%d, product=%d) not found — fell back "
+#                         "to discovered pair" % (...))   # WARN, never fail (replay safety)
+#       else:
+#           ctx.store_id, ctx.product_id = ctx.user_scope.store_id, ctx.user_scope.product_id
+#           -> return ("pass", f"... store_id={..} product_id={..} (user-selected)",
+#                      {..., "user_scope_applied": True})
+#   # fallback / no-scope path: existing discovery (582-631) unchanged; when the
+#   # scope dangled return ("warn", scope_warn + discovery detail,
+#   #                       {..., "user_scope_applied": False}).
+```
+
+```python
+# app/features/demo/workspace.py — create_workspace constructor additions
+#   (INSIDE the existing try; attribute names per the merged E1 model —
+#    CONTRACT(E1): assumed `seed_overrides` / `user_scope` nullable JSONB):
+#   seed_overrides=(req.seed_overrides.model_dump(mode="json", exclude_none=True)
+#                   if req.seed_overrides else None),
+#   user_scope=(req.user_scope.model_dump(mode="json") if req.user_scope else None),
+```
+
+```tsx
+// frontend/src/lib/workspace-replay.ts (NEW) — replay-verbatim in ONE place
+import type { DemoRunRequest, WorkspaceListItem } from '@/types/api'
+
+/** Build the verbatim replay start frame for a saved workspace (E4 semantics
+ *  + E3 #409 slots). Omits absent optionals so legacy rows replay byte-
+ *  identically to today. */
+export function workspaceToRunRequest(ws: WorkspaceListItem): DemoRunRequest {
+  return {
+    seed: ws.seed,
+    scenario: ws.scenario,
+    reset: ws.reset,
+    skip_seed: ws.skip_seed,
+    preservation: 'keep',
+    // CONTRACT(E1): replay provenance — post-E1, handleReplayWorkspace's inline
+    // object sends this field (an E1 frozen success criterion); this helper
+    // REPLACES that object and must preserve it or lineage silently regresses.
+    replayed_from_workspace_id: ws.workspace_id,
+    ...(ws.name ? { workspace_name: ws.name } : {}),
+    ...(ws.seed_overrides ? { seed_overrides: ws.seed_overrides } : {}),
+    ...(ws.user_scope ? { user_scope: ws.user_scope } : {}),
+  }
+}
+
+// types/api.ts additions (surgical):
+//   export interface SeedOverrides { stores?: number; products?: number;
+//     window_days?: number; sparsity?: number; promotion_intensity?: number;
+//     stockout_intensity?: number; noise_sigma?: number }
+//   export interface UserScope { store_id: number; product_id: number }
+//   DemoRunRequest += seed_overrides?: SeedOverrides; user_scope?: UserScope
+//   WorkspaceListItem += seed_overrides: SeedOverrides | null; user_scope: UserScope | null
+
+// SeedConfigPanel.tsx — props: { value: SeedOverrides | null; onChange(v: SeedOverrides | null): void;
+//   disabled?: boolean; windowLocked?: boolean /* holiday_rush */ }
+//   <Collapsible> "Advanced seed config"; Inputs (stores 1..20 UI-range, products 1..50,
+//   window_days 75..365) + Sliders (sparsity 0..0.9 step .05, promo/stockout 0..0.5,
+//   noise 0..0.5); live summary line; NaN-WAPE caveat <Badge>; emits null when all unset.
+//   UI ranges are TIGHTER than the API bounds (laptop-scale); the API bounds are the law.
+
+// ScopeSelector.tsx — props: { value: UserScope | null; onChange(v: UserScope | null): void;
+//   disabled?: boolean }
+//   two shadcn <Select>s fed by useStores/useProducts({ page: 1, pageSize: 100 });
+//   preview <Card>: store code/name/region/type · product sku/name/category/brand ·
+//   seeded window from useSeederStatus(); "Clear" button → onChange(null).
+
+// showcase.tsx wiring:
+//   const [seedOverrides, setSeedOverrides] = useState<SeedOverrides | null>(null)
+//   const [userScope, setUserScope] = useState<UserScope | null>(null)
+//   - panel rendered when `reseed` ticked (windowLocked={scenario === 'holiday_rush'});
+//     unticking Re-seed clears overrides (validator parity).
+//   - ticking Reset database clears userScope + shows the re-pick caveat
+//     (text-destructive, mirrors :309).
+//   - handleRun spread: ...(reseed && seedOverrides ? { seed_overrides: seedOverrides } : {}),
+//                       ...(userScope ? { user_scope: userScope } : {})
+//   - handleLoadWorkspace: setSeedOverrides(ws.seed_overrides ?? null); setUserScope(ws.user_scope ?? null)
+//   - handleReplayWorkspace: start(workspaceToRunRequest(ws))  // replaces the inline object
+```
+
+### List of tasks (dependency order)
+
+```yaml
+Task 0 — E1 gate & re-anchor (BLOCKING):
+  VERIFY: gh issue view 407 --json state   # must be CLOSED (E1 merged)
+  RUN: git switch dev && git pull
+  READ on the post-E1 dev: app/features/demo/models.py (slot attribute names/types),
+    app/features/demo/workspace.py (create_workspace shape), app/features/demo/schemas.py
+    (whether E1 surfaced slots on Workspace* responses), frontend/src/types/api.ts,
+    AND frontend/src/pages/showcase.tsx handleReplayWorkspace — E1 wires
+    replayed_from_workspace_id into the inline replay object that Task 9's
+    workspaceToRunRequest replaces; confirm the helper preserves it.
+  RESOLVE every CONTRACT(E1) tag in this PRP against reality; adjust attribute
+    names below if E1's PRP chose different ones (e.g. a single story JSONB).
+  RUN: git switch -c feat/showcase-completion-e3-seed-config-scope
+  VERIFY: gh issue view 409 --json state   # open
+
+Task 1 — CREATE app/shared/seeder/overrides.py (+ tests):
+  - SeederOverrides per the blueprint (strict, extra=forbid, 7 bounded knobs, is_empty()).
+  - CREATE app/shared/seeder/tests/test_overrides.py (the shared/seeder/tests dir exists):
+      bounds (each knob low/high rejection), unknown-knob forbid, JSON-path
+      model_validate({...}) happy path, model_dump(exclude_none=True) sparseness,
+      is_empty() truth table.
+  - Optionally re-export from app/shared/seeder/__init__.py (match how
+    ScenarioPreset/SeederConfig are exported there — service.py:32 imports them
+    from the package).
+
+Task 2 — MODIFY app/features/seeder/schemas.py + service.py:
+  - GenerateParams: ADD `overrides: SeederOverrides | None = Field(default=None,
+    description="Curated nested overrides (E3 #409); applied LAST — wins over the
+    scalar stores/products/sparsity. Absent = byte-identical legacy behavior.")`
+    (import from app.shared.seeder.overrides; do NOT touch strict-mode config).
+  - service.py: ADD _apply_seed_overrides (blueprint); CALL it after
+    _apply_phase2_overrides(config, params) in _build_config_from_params.
+  - timedelta already imported in service.py (:8).
+
+Task 3 — seeder tests:
+  - test_service.py: (a) each knob maps to its config field; (b) overrides.stores
+    beats params.stores (precedence); (c) window_days math
+    (config.start_date == config.end_date - timedelta(days=N)); (d) sparse-preset
+    character preserved (overrides.sparsity set → random_gaps_per_series still 3);
+    (e) REGRESSION: params without overrides → config equal to today's output.
+  - test_routes.py (TestGenerate class): 201 with {"overrides": {"stores": 8,
+    "promotion_intensity": 0.3}}; 422 on {"overrides": {"stores": 0}};
+    422 on {"overrides": {"bogus_knob": 1}} (extra=forbid).
+
+Task 4 — MODIFY app/features/demo/schemas.py:
+  - ADD UserScope; ADD DemoRunRequest.seed_overrides / .user_scope; ADD the two
+    model_validators (blueprint). Update the class docstring's strict-mode note
+    (nested models are JSON-native — cite the runtime verification).
+  - ADD seed_overrides/user_scope to WorkspaceListItem (Detail inherits) —
+    CONTRACT(E1): skip/merge if E1 already exposed them; ensure LIST exposure.
+
+Task 5 — demo schema tests (app/features/demo/tests/test_schemas.py):
+  - JSON-path: DemoRunRequest.model_validate({"skip_seed": False,
+    "seed_overrides": {"stores": 8}}) OK; seed_overrides + skip_seed True →
+    ValidationError; empty overrides {} normalizes to None; window_days +
+    scenario "holiday_rush" → ValidationError; user_scope happy path +
+    extra-key forbid + ge=1 bounds; LEGACY 4-field frame still validates
+    (extend test_demo_run_request_legacy_frame_still_validates' sibling).
+  - WorkspaceListItem from_attributes round-trip with slot dicts and with NULLs.
+
+Task 6 — MODIFY app/features/demo/pipeline.py:
+  - DemoContext: + seed_overrides / user_scope (typed, E3 #409 comment block).
+  - run_pipeline: thread req.seed_overrides / req.user_scope into ctx (:2646-2651).
+  - step_seed: effective dims + window_days branch + "overrides" body key
+    (blueprint); detail/data echo.
+  - step_status: user-scope validate/adopt/warn-fallback branch (blueprint);
+    data gains "user_scope_applied".
+
+Task 7 — pipeline tests (test_pipeline.py, _RecordingClient @1025):
+  - test_step_seed_forwards_seed_overrides: ctx with overrides; assert POST
+    /seeder/generate body["overrides"] == {"stores": 8, ...}, body["stores"] == 8
+    (effective), sparsity scalar stays 0.0.
+  - test_step_seed_window_days_overrides_profile_window: 120-day delta between
+    posted start/end.
+  - test_step_status_honors_user_scope: canned 200s for
+    /dimensions/stores/{id} + /dimensions/products/{id}; assert ctx.store_id/
+    product_id == scope, status "pass", data["user_scope_applied"] is True.
+  - test_step_status_dangling_scope_warns_and_falls_back: canned 404 for the
+    store GET + normal discovery responses; assert status "warn",
+    ctx ids == discovered pair, data["user_scope_applied"] is False.
+  - test_run_pipeline_threads_new_fields (ctx construction).
+
+Task 8 — MODIFY app/features/demo/workspace.py + tests:
+  - create_workspace: write both slots (blueprint; INSIDE the try —
+    warn-and-continue intact).
+  - test_workspace.py: keep-run with overrides+scope persists sparse JSON;
+    keep-run without them persists NULLs; create failure still returns None
+    (existing warn-and-continue test stays green).
+  - tests/test_e2e_demo.py (integration): extend test_demo_replay_same_config_
+    twice (or add a sibling test_demo_replay_preserves_seed_overrides_and_scope):
+    keep-run with seed_overrides + user_scope (skip_seed=False so the validator
+    passes — use the smallest overrides, e.g. {"stores": 3, "products": 10},
+    to keep wall-clock sane); replay via a second run with the row's recorded
+    config; assert both rows' seed_overrides/user_scope JSON identical.
+
+Task 9 — frontend types + replay helper:
+  - types/api.ts: SeedOverrides + UserScope interfaces; DemoRunRequest +2
+    optional fields; WorkspaceListItem +2 nullable fields (surgical — CRLF trap).
+  - CREATE lib/workspace-replay.ts + workspace-replay.test.ts:
+    legacy row (null slots) → frame WITHOUT the E3 keys (seed_overrides/
+    user_scope) but ALWAYS WITH replayed_from_workspace_id = ws.workspace_id
+    (CONTRACT(E1): deep-equal to the POST-E1 inline object, not the pre-E1
+    shape); slotted row → frame includes both E3 keys verbatim; named/unnamed.
+
+Task 10 — CREATE SeedConfigPanel.tsx + ScopeSelector.tsx (+ tests, + barrel):
+  - Blueprint above; compose from installed primitives; semantic tokens only.
+  - SeedConfigPanel.test.tsx: renders 7 controls; emits a sparse object (only
+    touched knobs); emits null when cleared; disabled state; windowLocked
+    disables the window control; caveat badge visible at high stockout/sparsity.
+  - ScopeSelector.test.tsx: renders options from mocked useStores/useProducts
+    (mock the hooks via vi.mock — keep the harness light per
+    test-requirements.md); selection fires onChange with real ids; preview
+    shows store/product names; Clear → onChange(null).
+  - components/demo/index.ts: export both (match barrel style).
+
+Task 11 — MODIFY frontend/src/pages/showcase.tsx:
+  - State + wiring per the blueprint; handleReplayWorkspace uses
+    workspaceToRunRequest; handleLoadWorkspace repopulates panel + selector;
+    Reset-database tick clears userScope (+ caveat); Re-seed untick clears
+    seedOverrides.
+
+Task 12 — docs:
+  - API_CONTRACTS.md: seeder row — "E3 (#409) — POST /seeder/generate accepts an
+    additive Optional `overrides` object (allow-listed knobs: stores, products,
+    window_days, sparsity, promotion_intensity, stockout_intensity, noise_sigma;
+    `extra=forbid` → unknown knob 422; applied last, wins over the scalar
+    stores/products/sparsity)". POST /demo/run row + WS start-frame bullet —
+    "E3 (#409) — additive Optional `seed_overrides` (same object; requires
+    skip_seed=false; window_days rejected on holiday_rush) and `user_scope`
+    ({store_id, product_id}; validated by the status step, warn+fallback on a
+    dangling pair); both persist to the workspace row and replay verbatim."
+  - RUNBOOKS.md: showcase incident 29 — overrides/scope failure matrix:
+    (a) 422 "seed_overrides requires skip_seed=false" → tick Re-seed first;
+    (b) 422 window_days on holiday_rush → expected, pinned window;
+    (c) status step ⚠️ "user_scope ... not found" → expected after reset/reseed
+    (ids re-issued; sequences never reset) — re-pick the pair;
+    (d) backtest ❌ NaN WAPE on high stockout/sparsity overrides → documented
+    expected outcome (mirrors incident 28's sparse row).
+    Workspace section: move "advanced seed configuration" out of the
+    "Explicitly out of scope" list (now shipped: seed_overrides + user_scope;
+    phase-level config remains out of scope) and note replay-verbatim covers
+    the two new slots.
+  - DOMAIN_MODEL.md: showcase_workspace aggregate — document both slot JSON
+    schemas (the table above) + the requested-vs-effective-grain distinction.
+
+Task 13 — gates, dogfood, commit, PR:
+  - Validation Loop below (all levels).
+  - Level 4 browser dogfood (mandatory per .claude/rules/ui-design.md).
+  - git diff --stat surgical check (types/api.ts CRLF trap).
+  - Commits per the convention block; PR into dev titled
+    "feat(api,ui): showcase advanced seed config and scope selection (#409)".
+```
+
+### Integration Points
+
+```yaml
+DATABASE: none in E3 — the seed_overrides/user_scope JSONB slots ship in E1
+  #407's migration. CONTRACT(E1): verify slots exist before Task 1.
+CONFIG: none — no new settings or env vars.
+ROUTES: none new — POST /seeder/generate, POST /demo/run, WS /demo/stream all
+  extend via request-model changes only (umbrella: "no new router outside
+  existing slices").
+SHARED: app/shared/seeder/overrides.py is the one new module — the sanctioned
+  cross-slice seam (both slices already import app/shared/seeder).
+WS CONTRACT: start frame gains two additive optional keys; event stream shape
+  unchanged (step data dicts gain echo keys only).
+WORKSPACE ROW: create_workspace writes the slots; finalize untouched;
+  PATCH /demo/workspaces/{id} (E1) deliberately NOT extended — overrides/scope
+  are immutable run records, not patchable metadata.
+FRONTEND: 2 new components + 1 lib helper + types + showcase wiring; WorkspacePanel /
+  RunHistoryStrip / use-demo-pipeline are generic over the widened types (no edits).
+```
+
+## Validation Loop
+
+### Level 1: Syntax & Style
+
+```bash
+uv run ruff check . && uv run ruff format --check .
+uv run mypy app/ && uv run pyright app/          # both --strict, gate merge
+cd frontend && pnpm lint
+# Types: no NEW errors mentioning touched files (pre-existing tsc -b failures exist on dev):
+cd frontend && pnpm tsc -b 2>&1 | grep -E "SeedConfigPanel|ScopeSelector|workspace-replay|types/api|pages/showcase" ; echo "exit=$? (1 = no matches = good)"
+```
+
+### Level 2: Unit Tests
+
+```bash
+uv run pytest app/shared/seeder app/features/seeder app/features/demo -v -m "not integration"
+cd frontend && pnpm test --run src/components/demo/ src/lib/
+cd frontend && pnpm test --run                      # full frontend suite
+```
+
+### Level 3: Integration (real Postgres — E1's migrated schema)
+
+```bash
+docker compose up -d && uv run alembic upgrade head
+# CAVEAT: destructive seeder tests pollute the shared DB mid-suite — reset to a
+# fresh DB before trusting Level-3 results (DROP/CREATE DATABASE, never `down -v`).
+uv run pytest -v -m integration -k "demo or seeder"   # incl. the replay-slot regression
+# Manual contract probes:
+curl -s -X POST localhost:8123/seeder/generate -H 'content-type: application/json' \
+  -d '{"scenario":"demo_minimal","stores":3,"products":10,"overrides":{"promotion_intensity":0.3,"noise_sigma":0.25}}' | head -c 300
+curl -s -X POST localhost:8123/seeder/generate -H 'content-type: application/json' \
+  -d '{"overrides":{"bogus":1}}' -o /dev/null -w '%{http_code}\n'        # 422
+curl -s -X POST localhost:8123/demo/run -H 'content-type: application/json' \
+  -d '{"skip_seed":true,"seed_overrides":{"stores":5}}' -o /dev/null -w '%{http_code}\n'  # 422
+```
+
+### Level 4: Browser dogfood (uvicorn :8123 + vite :5173)
+
+```bash
+uv run uvicorn app.main:app --port 8123 &
+cd frontend && ./node_modules/.bin/vite --host 0.0.0.0 &   # bypasses pnpm 11 depsStatusCheck
+# Real browser (webapp-testing / agent-browser; on this host Playwright needs
+# executable_path=/snap/bin/chromium):
+#  1. /showcase: tick "Re-seed first" → Advanced seed config panel appears;
+#     untick → panel collapses and overrides clear.
+#  2. Set stores=8, products=20, promo=0.3 → Run: green; seed card detail
+#     echoes "8 stores x 20 products"; /seeder/status confirms dims.
+#  3. Pick a focus pair in the ScopeSelector (preview shows names + window) →
+#     Run (skip_seed): status card says "(user-selected)"; train/backtest
+#     Inspect links target the chosen pair.
+#  4. Save as workspace + Run → workspace panel row → Replay: the replayed run
+#     uses the same overrides + scope (status card user-selected; second
+#     workspace row's slots identical — check GET /demo/workspaces).
+#  5. Tick "Reset database" → scope selection clears with the caveat.
+#  6. Pick holiday_rush + Re-seed → window_days control disabled (tooltip).
+#  7. Legacy path: no overrides, no scope → run is indistinguishable from today.
+```
+
+## Final validation Checklist
+
+- [ ] Backend gates: `uv run ruff check . && uv run ruff format --check . && uv run mypy app/ && uv run pyright app/ && uv run pytest -v -m "not integration"`
+- [ ] Frontend: `pnpm lint && pnpm test --run` green; no NEW tsc -b errors in touched files
+- [ ] Seeder: overrides 201 / bounds 422 / unknown-knob 422 / no-overrides byte-identical (tests enforce)
+- [ ] Demo validators: seed_overrides×skip_seed and window_days×holiday_rush rejected; legacy frame green (JSON-path tests)
+- [ ] Pipeline: overrides forwarded; user_scope honored; dangling scope WARNS + falls back (tests enforce)
+- [ ] Workspace: slots persisted sparse/NULL; replay-verbatim regression green (integration)
+- [ ] Replay helper: workspaceToRunRequest covers legacy + slotted rows (FE test)
+- [ ] Browser dogfood (Level 4) performed in a real browser — not just tests
+- [ ] `git diff --stat` surgical (types/api.ts CRLF trap)
+- [ ] API_CONTRACTS + RUNBOOKS 29 + workspace-section + DOMAIN_MODEL slot schemas updated additively
+- [ ] Commits reference #409, scopes from the allow-list, no AI trailer; PR into dev
+- [ ] Every CONTRACT(E1) tag was re-verified against the merged E1 code (Task 0)
+
+---
+
+## Assumptions (explicit — no user clarification was available)
+
+1. **CONTRACT(E1):** `showcase_workspace` carries `seed_overrides` and `user_scope` as TWO separate nullable JSONB columns (precedent: `created_objects` / `result_summary`, `models.py:77-81`). If E1's PRP instead nests all story slots under one JSONB column, only the `create_workspace` write and the response-schema mapping change (Task 0 re-anchor).
+2. **CONTRACT(E1):** E1's migration ships the slots; E3 ships ZERO migrations. If E1 somehow deferred a slot, E3 must STOP and add it to E1, not ship its own migration.
+3. **CONTRACT(E1):** `config_schema_version` semantics are E1's; populating reserved slots does NOT bump it (assumed to stay at E1's initial value). E3 writes nothing to that column.
+4. **CONTRACT(E1):** workspace API responses — E3 requires `seed_overrides` + `user_scope` on the LIST item (replay reads list rows, `showcase.tsx:174-186`). If E1 exposed them detail-only (or not at all), E3 adds them to `WorkspaceListItem`; if E1 already added them, Task 4 merges instead of duplicating.
+5. **CONTRACT(E1):** replay provenance (`replayed_from_workspace_id`) is written by the E1/E2 replay surface; E3's replay-verbatim test must tolerate (not assert away) that column being populated.
+6. **CONTRACT(E1):** `PATCH /demo/workspaces/{id}` exists (E1) and is deliberately NOT extended by E3 — overrides/scope are immutable run records.
+7. Knob names (`promotion_intensity`, `stockout_intensity`, `noise_sigma`, `window_days`) are this PRP's choice — business-friendly on the wire, mapped to the internal dataclass names in one documented table. Renaming costs a constant, not a rework.
+8. Bounds are this PRP's choice (table above), justified against preset reference values; the UI constrains tighter (laptop-scale) than the API.
+9. `user_scope` dangling resolution = WARN + fallback (not fail): chosen so replay of a `reset=true` workspace can never hard-fail forever; divergence stays visible via the requested-slot vs effective-columns split.
+10. The seeder-side field is named `overrides` (the slice context makes `seed_` redundant); the demo-side field is `seed_overrides` (epic-specified name). The pipeline maps one to the other in `step_seed`.
+
+## Anti-Patterns to Avoid
+
+- ❌ Don't create a new seeder endpoint — the decision above is final for E3; the nested object rides the existing contract.
+- ❌ Don't widen the knob allow-list beyond the 7 — the umbrella names this the top risk; everything else stays preset-driven (`extra="forbid"` enforces it).
+- ❌ Don't add any mid-run configuration channel — all config is start-frame-time; the single-`asyncio.Lock` linear pipeline is a design invariant.
+- ❌ Don't import `app/features/seeder/*` from the demo slice (or vice versa) — the shared schema lives in `app/shared/seeder/overrides.py`.
+- ❌ Don't add `ConfigDict(strict=True)` to `GenerateParams` (it has date fields) — only the new nested models are strict.
+- ❌ Don't make a dangling `user_scope` fail the run — warn + fallback (replay safety); equally, don't silently adopt it without validation.
+- ❌ Don't let a workspace slot write break the pipeline — slot writes stay inside the warn-and-continue try/except.
+- ❌ Don't ship a migration — E1 owns the schema.
+- ❌ Don't NaN-WAPE-proof the backtest for extreme overrides — document the expected fail (runbook 29), mirroring the sparse-preset decision in E2/#391.
+- ❌ Don't hand-roll new UI primitives or install shadcn components when collapsible/select/slider/input/badge/tooltip already exist; if forced, pin `shadcn@4.7.0`.
+- ❌ Don't ship the UI without a real-browser check — `.claude/rules/ui-design.md` makes that a hard requirement.
+
+## Confidence Score
+
+**8/10** for one-pass implementation success. Every backend change extends a
+verified, line-cited in-repo pattern (the seeder's layered override pipeline,
+the `DemoRunRequest` cross-field validators, `_RecordingClient` step tests, the
+warn-and-continue workspace writes), the pydantic strict/nested/forbid
+behavior was runtime-verified rather than assumed, and the riskiest judgment
+calls (contract shape, knob mapping, bounds, dangling-scope semantics, slot
+schemas) are decided with rationale and pinned by tests. The −2: (a) this PRP
+is authored PRE-E1 — six CONTRACT(E1) tags must survive a cross-check against
+the merged E1 code, and attribute-name drift there would touch 3 files (Task 0
+exists precisely to absorb this); (b) the two new frontend components are the
+usual UI-iteration surface (styling/dogfood may need a second pass), and
+`showcase.tsx` is a merge hotspot shared with parallel epics E2/E4/E5.
diff --git a/PRPs/PRP-showcase-completion-E4-run-config-phase-controls.md b/PRPs/PRP-showcase-completion-E4-run-config-phase-controls.md
new file mode 100644
index 00000000..85826e58
--- /dev/null
+++ b/PRPs/PRP-showcase-completion-E4-run-config-phase-controls.md
@@ -0,0 +1,820 @@
+name: "PRP — showcase-completion E4: run-config phase controls (model set + backtest params in start frame)"
+issue: "#410 (epic) · umbrella #406 · depends on E1 #407 (Foundation — MUST be merged first)"
+branch: "feat/showcase-run-config-phase-controls (off dev)"
+description: |
+  Start-frame-time run configuration for the showcase pipeline: a model-family
+  picker (baselines + feature-aware, with opt-in lightgbm/xgboost/random_forest
+  toggles surfaced ONLY when the matching `forecast_enable_*` flag is on),
+  backtest configuration (horizon, split strategy, min train size, n_splits,
+  gap, ranking metric WAPE/MAE/RMSE), a train-candidate preview before launch,
+  and the chosen config echoed into the workspace row and visible on the run.
+  NO mid-run re-entry — the linear single-`asyncio.Lock` pipeline is preserved;
+  all configuration happens in the start frame.
+
+## Core Principles
+
+1. **Context is King** — every file/line cited below was verified on 2026-06-12.
+2. **Validation Loops** — Levels 1–4 below are executable; Level 4 browser dogfood is MANDATORY (UI work, `.claude/rules/ui-design.md`).
+3. **Additive only** — a legacy start frame (no new fields) behaves **byte-identically** to today. This is a frozen umbrella #406 success criterion.
+4. **Global rules** — CLAUDE.md / AGENTS.md / `.claude/rules/*` apply. Commits: `feat(api,db): … (#410)` for backend+migration, `feat(ui): … (#410)` for frontend (or one `feat(api,ui): … (#410)`).
+
+---
+
+## Goal
+
+**Feature Goal**: An operator on `/showcase` can, before launching a run, (a) pick which forecasting models the pipeline trains/backtests, (b) tune the backtest split (horizon / strategy / n_splits / min_train_size / gap) and the winner-ranking metric (WAPE / MAE / RMSE), (c) see a train-candidate preview of exactly what will run, and (d) find that config recorded on the saved workspace row and honored verbatim on Replay.
+
+**Deliverable** (all additive):
+
+- `app/features/demo/schemas.py` — new `DemoBacktestConfig`; `DemoRunRequest` gains `train_model_types: list[str] | None` + `backtest: DemoBacktestConfig | None`; `WorkspaceListItem` gains `run_config: dict | None`.
+- `app/shared/model_taxonomy.py` — public `KNOWN_MODEL_TYPES` frozenset (validation allow-list source of truth).
+- `app/features/demo/models.py` + one Alembic migration — nullable `run_config` JSONB column on `showcase_workspace` (a **replay-input column** like `seed`/`scenario`, NOT an E1 story slot — see Decision D1).
+- `app/features/demo/workspace.py` — `create_workspace` records `run_config`.
+- `app/features/demo/pipeline.py` — `DemoContext` carries the resolved run config; `step_train` / `step_backtest` / `step_v2_train` honor it; `_select_winner` gains a metric parameter; `pipeline_complete` echoes the config.
+- `app/features/model_selection/` — `CandidateModelInfo` gains `enabled: bool` (settings overlay in the service; `capabilities.py` stays pure) so the frontend knows which opt-in toggles to surface.
+- `frontend/` — `RunConfigPanel` (collapsible advanced section on `/showcase`): model picker (reuses `CandidateModelPicker` with an enabled-filtered catalog), `DemoBacktestSettingsForm` (mirrors the champion-selector form), train-candidate preview; start-frame wiring with a dirty-only inclusion rule; Load/Replay honor `run_config`; WorkspacePanel shows a config summary.
+- Docs: `docs/_base/API_CONTRACTS.md`, `docs/_base/DOMAIN_MODEL.md`, `docs/_base/RUNBOOKS.md` additive notes.
+- Tests at every layer (schema, taxonomy drift-lock, pipeline, workspace, migration, catalog overlay, vitest).
+
+**Success Definition**: all Success Criteria check off; five CI gates green; integration suite green; a Level-4 dogfood run launches a custom-config run from `/showcase`, the preview matched what ran, the workspace row carries `run_config`, and Replay re-runs it verbatim.
+
+## Why
+
+- Umbrella #406 success criterion: *"The start frame accepts model-set + backtest config; the chosen config is echoed into the workspace row and visible on the run."*
+- Today `DEMO_MODEL_TYPES` (`pipeline.py:67`) hard-codes 3 baselines and `DEMO_HORIZON`/`DEMO_BACKTEST_SPLITS`/`DEMO_MIN_TRAIN_SIZE` (`pipeline.py:54-56`) hard-code the split — the showcase cannot demonstrate the 11-model zoo (PRP-36) or metric-driven champion selection it actually ships.
+- Replay (E4 #393) is verbatim-by-design; without recording the run config, a custom run could not be replayed faithfully — breaking the workspace story the whole umbrella is about.
+- Brainstorm Round 5 (`.flow/brainstorm-log.md`): mid-run/per-phase re-run was explicitly DEFERRED ("re-architects locked linear pipeline") — start-frame-only is the negotiated scope. Do not add mid-run controls.
+
+## What
+
+### User-visible behavior
+
+1. `/showcase` controls card gains a collapsible **"Run configuration (advanced)"** section (collapsed by default — untouched = legacy behavior):
+   - **Model picker**: checkboxes grouped by family (Baseline / Additive / Tree-based), fed by `GET /model-selection/models`. Opt-in models (`lightgbm`, `xgboost`, `random_forest`) appear **only when** their `forecast_enable_*` flag is on (new catalog `enabled` field). Default selection: `naive`, `seasonal_naive`, `moving_average` (the legacy trio). Cap 10, min 1.
+   - **Backtest settings**: ranking metric (WAPE default / MAE / RMSE), horizon (1–90, default 14), and an "Advanced split settings" collapsible: strategy (expanding/sliding), splits (2–20, default 3), min train (≥7, default 30), gap (0–30, default 0). Inline validation mirrors backend bounds; soft warning when `min_train_size + n_splits×(horizon+gap)` exceeds the scenario's seeded window.
+   - **Train-candidate preview**: a read-only chip list of exactly which models will train (selection, plus `prophet_like (V2)` appended on `showcase_rich`), with family badges and a count.
+2. The WS start frame / `POST /demo/run` body carry `train_model_types` + `backtest` **only when the operator changed something** (dirty-only rule → untouched UI sends a byte-identical legacy frame).
+3. The pipeline trains/backtests the selected models; the winner is the best **configured metric**; `pipeline_complete.data.run_config` echoes the config; the train/backtest step cards show what was requested.
+4. On `preservation="keep"` runs the workspace row records `run_config`; the Saved-workspaces panel shows a compact config summary; **Load** repopulates the controls; **Replay** re-submits it verbatim.
+5. A request naming a disabled/unknown model fails fast with an actionable message (422 on unknown at validation; a clear `fail` step detail on disabled-flag models).
+
+### Technical requirements
+
+- Pydantic v2 strict-mode policy respected (all new request fields JSON-native; nested model validated from a plain dict — add the JSON-path test).
+- Vertical-slice rule: the demo slice NEVER imports `app/features/model_selection` (or any sibling) in Python — the model allow-list comes from `app/shared/model_taxonomy.py`; the frontend talks to the catalog over HTTP.
+- Migration forward-only, applies + downgrades cleanly on a fresh DB.
+- Workspace writes stay warn-and-continue (must never break a green run).
+
+### Success Criteria
+
+- [ ] `DemoRunRequest` accepts `train_model_types` + `backtest` (additive Optional); a legacy frame validates byte-identically (existing `test_demo_run_request_legacy_frame_still_validates` extended).
+- [ ] Unknown model type → 422 / WS `error` event; duplicate model types rejected; `gap >= horizon` rejected; selection size 1–10 enforced.
+- [ ] A run with `train_model_types=["naive","seasonal_average"]` trains exactly those models; `step_backtest` sends the configured `split_config` and picks the winner by the configured metric (unit-asserted against the canned `_Client` request bodies).
+- [ ] A disabled opt-in model in the selection fails the `train` step with a detail naming the flag (`forecast_enable_lightgbm=false …`).
+- [ ] `GET /model-selection/models` items carry `enabled`; `enabled=false` exactly when the matching `forecast_enable_*` flag is off (lightgbm/xgboost/random_forest), `true` for all always-on models.
+- [ ] `showcase_workspace.run_config` records the config on keep-runs (NULL when defaults were used); migration up+down clean.
+- [ ] `/showcase` advanced section renders the picker (opt-ins hidden when disabled), backtest form, and preview; Load/Replay honor `run_config`; untouched controls send a legacy frame (vitest-asserted).
+- [ ] `pipeline_complete.data.run_config` echo present on custom runs, absent (None) on legacy runs.
+- [ ] All five CI gates green; integration tests green; Level-4 dogfood evidence captured.
+
+## All Needed Context
+
+### Documentation & References
+
+```yaml
+# ── The work order ───────────────────────────────────────────────────────────
+- issue: "#410"
+  why: Epic scope (verbatim). Parallel after Foundation E1 #407.
+- issue: "#406"
+  why: Umbrella — approach ("additive-only delta", "start-frame-time only"), success criteria, risk table.
+- file: PRPs/PRP-showcase-completion-E1-metadata-provenance-backbone.md
+  why: |
+    The Foundation this epic builds on. CRITICAL: E1 defines six JSONB story
+    slots (seed_overrides, user_scope, approval_events, rag_events, job_ids,
+    phase_summaries) — NONE of them is a run-config slot, and E1 assigns no
+    slot to E4. See Decision D1 below. Also the migration-task pattern to
+    MIRROR (down_revision discovery, up/down test) and config_schema_version
+    semantics (lines 25-70, 228-236, 560-610 of that PRP).
+
+# ── Backend: demo slice (primary surface) ────────────────────────────────────
+- file: app/features/demo/schemas.py
+  why: |
+    DemoRunRequest (lines 29-86) — the additive-field pattern to MIRROR exactly:
+    PRP-38 scenario field (enum-on-wire strict=False override, lines 51-63),
+    E1 #390 preservation/workspace_name + model_validator (lines 64-85).
+    WorkspaceListItem lines 169-189 (from_attributes response pattern).
+- file: app/features/demo/pipeline.py
+  why: |
+    THE file. Constants to make configurable: DEMO_HORIZON=14,
+    DEMO_BACKTEST_SPLITS=3, DEMO_MIN_TRAIN_SIZE=30 (54-56), DEMO_MODEL_TYPES
+    (67). _model_config_payload (271-286) — extend. _select_winner (446-460)
+    — hard-codes "wape"; gains metric param. step_train (669-703) — gather
+    over DEMO_MODEL_TYPES, train tail = date_end - DEMO_HORIZON. step_backtest
+    (731-836) — two branches (SHOWCASE_RICH single-call include_baselines=True
+    at 743-788 vs legacy per-model loop at 789-818); split_config bodies at
+    753-760/801-808. step_v2_train (998-1090) — V2 train tail also uses
+    DEMO_HORIZON (1021). run_pipeline (2618-2771) — ctx construction (2646),
+    create_workspace keep-branch (2655-2657), pipeline_complete data (2758-2770).
+    DemoContext dataclass (212-264) — where resolved config fields land.
+- file: app/features/demo/workspace.py
+  why: |
+    create_workspace (46-79) — records replay inputs at insert time; E4 adds
+    run_config here (NOT in finalize: it is an input, known before step 1).
+    Warn-and-continue pattern is load-bearing.
+- file: app/features/demo/models.py
+  why: ShowcaseWorkspace ORM (37-89). run_config column lands next to the
+       "Run configuration -- replay inputs" block (line 65-69 comment).
+- file: app/features/demo/routes.py
+  why: WS start-frame parse (166-194) — ValidationError → one error event +
+       close. No route changes needed beyond what schemas give for free
+       (POST /demo/run + WS validate via DemoRunRequest).
+- file: app/features/demo/tests/test_schemas.py
+  why: Test naming + the legacy-frame contract test to extend
+       (test_demo_run_request_legacy_frame_still_validates, line 75).
+- file: app/features/demo/tests/test_pipeline.py
+  why: Canned-_Client mocking pattern for step unit tests (assert on captured
+       request bodies — exactly how the split_config assertion should work).
+- file: app/features/demo/tests/test_workspace.py
+  why: Integration-test pattern for create/finalize roundtrips.
+
+# ── Backend: contracts the pipeline drives ───────────────────────────────────
+- file: app/features/forecasting/schemas.py
+  why: |
+    TrainRequest (441-525): store/product/dates/config (+feature_frame_version,
+    feature_groups — leave at V1 defaults for E4 training). ModelConfig is a
+    discriminated union; ALL 11 members validate from a minimal
+    {"model_type": X} payload (runtime-verified, see Gotchas). season_length
+    default 7 (line 86), window_size default 7 (line 107).
+- file: app/features/forecasting/routes.py
+  why: Flag gates — lightgbm (line 76-81) and xgboost (82-86) raise
+       BadRequestError 400 "… is disabled. Set forecast_enable_…". NOTE:
+       random_forest is gated deeper (forecasting/models.py:1761) — another
+       reason step_train must pre-check flags itself for a clean message.
+- file: app/core/config.py
+  why: "forecast_enable_lightgbm / forecast_enable_xgboost /
+       forecast_enable_random_forest — all default False (lines 118-120)."
+- file: app/features/backtesting/schemas.py
+  why: |
+    SplitConfig (24-73) — the canonical bounds DemoBacktestConfig MUST mirror:
+    strategy Literal["expanding","sliding"] def "expanding"; n_splits 2-20
+    def 5; min_train_size ge=7 def 30; gap 0-30 def 0; horizon 1-90 def 14;
+    validator horizon > gap. BacktestConfig (81-108): split_config +
+    model_config_main + include_baselines + store_fold_details.
+    aggregated_metrics keys: mae, smape, wape, bias, rmse (PRP-36;
+    rmse verified at app/features/backtesting/metrics.py:349).
+- file: app/shared/model_taxonomy.py
+  why: _MODEL_FAMILY_MAP — the 11 known model types. E4 adds public
+       KNOWN_MODEL_TYPES here (one-way import app/features/* → app/shared OK).
+
+# ── Backend: model catalog (flag exposure) ───────────────────────────────────
+- file: app/features/model_selection/capabilities.py
+  why: build_model_catalog (line 126) — pure/static by design (module
+       docstring). Do NOT read settings here; overlay in the service.
+- file: app/features/model_selection/service.py
+  why: get_model_catalog (113-119) — thin pass-through; the enabled overlay
+       goes here (model_copy(update={"enabled": …}) per item).
+- file: app/features/model_selection/schemas.py
+  why: CandidateModelInfo (412-429) + ModelCatalogResponse (431) — add
+       `enabled: bool = True` (additive, defaulted for back-compat).
+- file: app/features/model_selection/routes.py
+  why: GET /model-selection/models (74-86) — no route change; response model
+       picks up the new field automatically.
+
+# ── Frontend ─────────────────────────────────────────────────────────────────
+- file: frontend/src/pages/showcase.tsx
+  why: |
+    453 lines. Start-frame construction handleRun (139-156) — the
+    spread-only-when-set pattern for byte-compat; handleLoadWorkspace
+    (160-168) + handleReplayWorkspace (174-186) — must consume run_config;
+    controls card (257-371) — the advanced section slots after the
+    workspace-name block (line 362).
+- file: frontend/src/components/champion-selector/candidate-model-picker.tsx
+  why: REUSE this component (family-grouped checkbox grid, cap badge,
+       extra/feature-aware badges). Feed it an enabled-filtered catalog.
+- file: frontend/src/components/champion-selector/backtest-settings-form.tsx
+  why: MIRROR for DemoBacktestSettingsForm — Field helper, metric Select,
+       Collapsible advanced split knobs, splitConfigErrors display. Differences:
+       horizon is EDITABLE here (champion's is locked), metric list is
+       wape/mae/rmse (champion's is wape/smape/mae/bias).
+- file: frontend/src/components/champion-selector/split-config.ts
+  why: splitConfigErrors — REUSE as-is (field names match DemoBacktestConfig).
+- file: frontend/src/hooks/use-model-selection.ts
+  why: useModelCatalog (line 30) — REUSE for the picker's data.
+- file: frontend/src/types/api.ts
+  why: DemoRunRequest (778-788), WorkspaceListItem (805-815),
+       CandidateModelInfo (1279-1290), SplitConfig comment block (~1262-1268).
+- file: frontend/src/hooks/use-demo-pipeline.ts
+  why: start(req) serializes DemoRunRequest verbatim into the WS start frame —
+       no hook change needed; the dirty-only rule lives in showcase.tsx.
+- file: frontend/src/components/demo/ScenarioPicker.tsx
+  why: Disabled-while-running prop pattern; the scenario value feeds the
+       preview (windowDays map mirrors pipeline.py _SCENARIO_SEED_PROFILE,
+       513-538: demo_minimal/sparse/holiday_rush = 92d window, others = 180d).
+- file: frontend/src/components/demo/WorkspacePanel.tsx
+  why: Row layout to extend with the compact run-config summary line/badge.
+- file: frontend/src/components/demo/RunHistoryStrip.test.tsx
+  why: Representative vitest + RTL pattern for the new component tests.
+
+# ── Project docs to update (additive) ────────────────────────────────────────
+- file: docs/_base/API_CONTRACTS.md
+  why: DemoRunRequest/WS start-frame field docs + catalog `enabled` +
+       workspace run_config (follow the existing E1/E2/PRP-38 annotation style).
+- file: docs/_base/DOMAIN_MODEL.md
+  why: showcase_workspace aggregate — document run_config as a replay-input
+       column (explicitly NOT a story slot; D1 rationale).
+- file: docs/_base/RUNBOOKS.md
+  why: § Showcase runbook — two new numbered incidents (disabled-model fail;
+       aggressive split → NaN/insufficient-fold fail is a documented outcome,
+       sparse-preset precedent in incident 28).
+```
+
+### Current Codebase tree (relevant subset)
+
+```bash
+app/
+├── core/config.py                      # forecast_enable_* flags (118-120)
+├── shared/model_taxonomy.py            # ModelFamily + _MODEL_FAMILY_MAP (11 types)
+└── features/
+    ├── demo/
+    │   ├── models.py                   # ShowcaseWorkspace (89 lines)
+    │   ├── schemas.py                  # DemoRunRequest / StepEvent / Workspace* (213)
+    │   ├── pipeline.py                 # orchestrator + steps (2771)
+    │   ├── workspace.py                # create/finalize/list/get/delete helpers
+    │   ├── routes.py                   # POST /demo/run, WS /demo/stream, workspaces CRUD
+    │   ├── service.py                  # run lock + sync/stream wrappers
+    │   └── tests/                      # test_{schemas,pipeline,workspace,models,routes}.py
+    ├── forecasting/{schemas,routes,models}.py   # TrainRequest, flag gates
+    ├── backtesting/schemas.py          # SplitConfig / BacktestConfig
+    └── model_selection/
+        ├── capabilities.py             # build_model_catalog (pure)
+        ├── service.py                  # get_model_catalog pass-through
+        ├── schemas.py                  # CandidateModelInfo / ModelCatalogResponse
+        └── routes.py                   # GET /model-selection/models
+alembic/versions/                       # head TODAY = 324a2fa37fcc; E1 #407 adds one on top
+frontend/src/
+├── pages/showcase.tsx                  # controls card + start frame + load/replay
+├── hooks/{use-demo-pipeline,use-model-selection,use-workspaces}.ts
+├── components/demo/                    # ScenarioPicker, WorkspacePanel, … (+ tests)
+├── components/champion-selector/       # candidate-model-picker, backtest-settings-form, split-config
+└── types/api.ts                        # DemoRunRequest, WorkspaceListItem, CandidateModelInfo
+```
+
+### Desired Codebase tree (files added/changed)
+
+```bash
+app/shared/model_taxonomy.py                          # MODIFY: + KNOWN_MODEL_TYPES frozenset
+app/shared/tests/test_model_taxonomy.py               # MODIFY (or create if missing): drift-lock test
+app/features/demo/schemas.py                          # MODIFY: + DemoBacktestConfig; DemoRunRequest fields; WorkspaceListItem.run_config
+app/features/demo/models.py                           # MODIFY: + run_config JSONB column
+alembic/versions/<rev>_add_showcase_workspace_run_config.py   # CREATE: add/drop run_config
+app/features/demo/workspace.py                        # MODIFY: create_workspace records run_config
+app/features/demo/pipeline.py                         # MODIFY: ResolvedRunConfig, ctx, steps, winner metric, echo
+app/features/model_selection/schemas.py               # MODIFY: CandidateModelInfo.enabled
+app/features/model_selection/service.py               # MODIFY: enabled settings-overlay
+app/features/model_selection/tests/test_capabilities.py  # MODIFY: overlay unit tests (patched settings)
+app/features/demo/tests/test_schemas.py               # MODIFY: new-field + legacy-frame + JSON-path tests
+app/features/demo/tests/test_pipeline.py              # MODIFY: selection/flag/split/metric/echo tests
+app/features/demo/tests/test_workspace.py             # MODIFY: run_config persistence (integration)
+app/features/demo/tests/test_models.py                # MODIFY: column roundtrip
+frontend/src/types/api.ts                             # MODIFY: DemoBacktestConfig, DemoRunRequest, WorkspaceListItem, CandidateModelInfo.enabled
+frontend/src/components/demo/run-config-utils.ts      # CREATE: defaults, isDefault*, buildTrainPlan, windowDays
+frontend/src/components/demo/run-config-utils.test.ts # CREATE
+frontend/src/components/demo/DemoBacktestSettingsForm.tsx       # CREATE (mirror champion form)
+frontend/src/components/demo/DemoBacktestSettingsForm.test.tsx  # CREATE
+frontend/src/components/demo/RunConfigPanel.tsx       # CREATE: collapsible section composing picker+form+preview
+frontend/src/components/demo/RunConfigPanel.test.tsx  # CREATE
+frontend/src/pages/showcase.tsx                       # MODIFY: state + dirty-rule + load/replay + panel mount
+frontend/src/components/demo/WorkspacePanel.tsx       # MODIFY: config summary line
+docs/_base/{API_CONTRACTS,DOMAIN_MODEL,RUNBOOKS}.md   # MODIFY: additive notes
+```
+
+### Design Decisions (locked — do not re-litigate during implementation)
+
+```text
+D1 — run_config is a DEDICATED nullable JSONB COLUMN, not an E1 story slot.
+     E1 (#407) defines six slots and assigns writers for all of them to E3/E5/
+     "later epics" — none is a run-config slot. The model set + backtest params
+     are REPLAY INPUTS (same class as the existing seed/scenario/reset/skip_seed
+     columns, models.py:65-69), not run-story output. So: one additive column
+     `run_config JSONB NULL`, written by create_workspace at insert time,
+     consumed by Load/Replay. config_schema_version is NOT bumped — E1 defines
+     it as the STORY-SLOT schema marker; run_config presence is detectable by
+     NULL-check and carries its own documented shape in DOMAIN_MODEL.md.
+     NOTE: the E1 PRP (~line 230) loosely names "E4 #410 run-config echo" as a
+     candidate writer for job_ids/phase_summaries — this PRP supersedes that
+     phrasing: neither slot is run-config-shaped; job_ids/phase_summaries
+     writing stays with the later parallel epics (E2 #408 / E5 #411).
+
+D2 — E1 #407 MUST merge before this epic's migration is authored.
+     E4's migration down_revision = the head AT IMPLEMENTATION TIME (E1's
+     revision). Discover with `uv run alembic heads` — do NOT hardcode
+     324a2fa37fcc (that is today's pre-E1 head).
+
+D3 — Flag exposure rides the EXISTING catalog endpoint.
+     CandidateModelInfo gains `enabled: bool = True`; the model_selection
+     SERVICE overlays get_settings() (lightgbm→forecast_enable_lightgbm,
+     xgboost→forecast_enable_xgboost, random_forest→forecast_enable_random_forest,
+     everything else True). capabilities.build_model_catalog stays pure/static
+     (its module docstring is a contract). No new /config endpoint.
+
+D4 — Selection semantics in step_backtest:
+     • train_model_types is None → BOTH branches byte-identical to today
+       (SHOWCASE_RICH single call include_baselines=True; legacy per-model loop).
+     • train_model_types provided → ONE unified per-model loop over
+       selection ∪ ({prophet_like} when scenario==SHOWCASE_RICH), each call
+       include_baselines=False; bucketed_aggregated_metrics captured from the
+       prophet_like call's main_model_results when present. prophet_like is
+       appended because v2_train trains/registers it unconditionally on
+       SHOWCASE_RICH — it must stay in the competition or the V2 story breaks.
+
+D5 — Winner metric: Literal["wape","mae","rmse"], default "wape", all
+     lower-is-better; _select_winner(results, metric=…) skips missing/NaN.
+     (smape/bias deliberately excluded — issue #410 names WAPE/MAE/RMSE.)
+
+D6 — Flag enforcement is fail-fast in step_train (clear detail naming the
+     flag), NOT in the Pydantic schema. Settings reads inside schemas caused
+     the documented ".env-bleed" test incidents (RUNBOOKS § Settings tests);
+     schemas validate only against the static KNOWN_MODEL_TYPES allow-list.
+
+D7 — Dirty-only start-frame inclusion: showcase.tsx omits train_model_types /
+     backtest keys when they equal the defaults (legacy trio + default split).
+     Untouched UI ⇒ byte-identical legacy frame (umbrella criterion).
+
+D8 — The configured horizon drives ONLY the modeling steps: step_train /
+     step_v2_train train-tail reservation and step_backtest split_config.
+     Planning/scenario steps keep DEMO_HORIZON (out of scope; document).
+```
+
+### Known Gotchas & Library Quirks
+
+```python
+# VERIFIED 2026-06-12 (re-run these on library/schema upgrades):
+#
+# 1. ALL 11 ModelConfig union members validate from a minimal {"model_type": X}:
+#    uv run python -c "
+#    from pydantic import TypeAdapter
+#    from app.features.forecasting.schemas import TrainRequest
+#    ta = TypeAdapter(TrainRequest.model_fields['config'].annotation)
+#    [ta.validate_python({'model_type': t}) for t in (
+#      'naive','seasonal_naive','moving_average','weighted_moving_average',
+#      'seasonal_average','trend_regression_baseline','regression',
+#      'prophet_like','lightgbm','xgboost','random_forest')]"
+#    → _model_config_payload can fall back to {"model_type": t} for new types.
+#    KEEP the explicit seasonal_naive(season_length=7)/moving_average(window_size=7)
+#    branches — they match schema defaults but are load-bearing for config_hash
+#    stability of existing registry rows.
+#
+# 2. "rmse" IS in aggregated_metrics (backtesting/metrics.py:349, PRP-36) —
+#    alongside mae/smape/wape/bias. Do not invent other keys.
+#
+# 3. forecast_enable_{lightgbm,xgboost,random_forest} all default False
+#    (app/core/config.py:118-120). lightgbm/xgboost are gated at the train
+#    ROUTE (BadRequestError 400, routes.py:76-86); random_forest only deep in
+#    the model factory (models.py:1761) → without the D6 pre-check a
+#    random_forest request fails uglier. Also: flag ON but extra NOT installed
+#    still ImportErrors (catalog requires_extra badge covers the UI hint).
+#
+# 4. Pydantic strict mode: ConfigDict(strict=True) on DemoRunRequest is fine
+#    for the new fields — list[str] and a nested BaseModel validated from a
+#    JSON dict are allowed under strict (strict forbids primitive coercion,
+#    not dict→model validation). STILL add the JSON-path test
+#    (Model.model_validate({...nested dict...})) per the repo strict-mode
+#    policy (docs/_base/SECURITY.md, test_strict_mode_policy.py precedent).
+#    All new fields are JSON-native → no Field(strict=False) needed anywhere.
+#
+# 5. Demo windows are finite: demo_minimal/sparse = 92d, holiday_rush = 92d
+#    pinned, others = 180d (pipeline.py:513-538 _SCENARIO_SEED_PROFILE). An aggressive
+#    split (e.g. h=28, n_splits=5, min_train=60) CANNOT fit → backtest NaN /
+#    splitter error → step fail. This is a DOCUMENTED OUTCOME (same policy as
+#    the sparse preset's expected-fail, RUNBOOKS incident 28) — the backend
+#    must NOT silently clamp; the frontend shows the soft warning.
+#
+# 6. Feature-aware models (regression/prophet_like/…) train fine through
+#    POST /forecasting/train with V1 defaults (feature_frame_version=1) — the
+#    service builds the feature frame internally. Do NOT set
+#    feature_frame_version=2 in step_train; V2 stays step_v2_train's job.
+#    Expect noticeably longer wall-clock when selected (no budget gate change).
+#
+# 7. step_register reuses _model_config_payload(winner) and the winner's
+#    train_results model_path — both work unchanged for any selected winner.
+#    BUT registry _find_duplicate accumulation across repeated identical runs
+#    is a known trap (RUNBOOKS showcase incident 2) — unchanged risk profile,
+#    just more reachable configs now. No action; aware.
+#
+# 8. The demo slice may NOT import app/features/model_selection (vertical-slice
+#    rule). The model allow-list source is app/shared/model_taxonomy.py.
+#    Add KNOWN_MODEL_TYPES there + a drift-lock test asserting it equals
+#    _MODEL_FAMILY_MAP.keys() (precedent: forecasting's
+#    test_model_family_map_covers_every_known_model_type).
+#
+# 9. capabilities.build_model_catalog is PURE by contract (docstring: "No DB,
+#    no I/O… deterministic and unit-tested directly"). The enabled overlay
+#    belongs in ModelSelectionService.get_model_catalog (D3) via
+#    item.model_copy(update={"enabled": …}).
+#
+# 10. WS error path: a ValidationError on the start frame becomes ONE error
+#     StepEvent then close (routes.py:188-191) — new-field validation failures
+#     surface there for free; assert it in test_routes.py.
+#
+# 11. Repo quirks: mixed CRLF/LF — check `git diff --stat` for whole-file
+#     noise before committing. `pnpm tsc --noEmit` is VACUOUS (solution-style
+#     tsconfig) — rely on `pnpm lint` + `pnpm test --run` + the real `tsc -b`
+#     only informationally (it has pre-existing failures on dev). A stale
+#     uvicorn can squat :8123 during Level 3/4 — check `ps etime` first.
+#     NEVER `docker compose down -v` (kills the Ollama models volume).
+```
+
+## Implementation Blueprint
+
+### Data models and structure
+
+```python
+# ── app/shared/model_taxonomy.py (additive) ──────────────────────────────────
+KNOWN_MODEL_TYPES: frozenset[str] = frozenset(_MODEL_FAMILY_MAP)
+# Public allow-list for request validation across slices. Drift-locked by test.
+
+# ── app/features/demo/schemas.py (additive) ──────────────────────────────────
+class DemoBacktestConfig(BaseModel):
+    """Backtest knobs for the showcase pipeline (E4 #410).
+
+    Bounds MIRROR app/features/backtesting/schemas.py:SplitConfig exactly —
+    the pipeline forwards them verbatim into POST /backtesting/run.
+    """
+    model_config = ConfigDict(strict=True)
+
+    horizon: int = Field(default=14, ge=1, le=90)
+    strategy: Literal["expanding", "sliding"] = "expanding"
+    n_splits: int = Field(default=3, ge=2, le=20)        # demo default 3, NOT SplitConfig's 5
+    min_train_size: int = Field(default=30, ge=7)
+    gap: int = Field(default=0, ge=0, le=30)
+    metric: Literal["wape", "mae", "rmse"] = "wape"      # D5
+
+    @model_validator(mode="after")
+    def _gap_lt_horizon(self) -> DemoBacktestConfig:
+        if self.gap >= self.horizon:
+            raise ValueError(f"horizon ({self.horizon}) must be greater than gap ({self.gap})")
+        return self
+
+class DemoRunRequest(BaseModel):
+    ...existing fields unchanged...
+    # E4 (#410): additive run-config. None → legacy DEMO_MODEL_TYPES +
+    # legacy split constants, byte-identical behavior.
+    train_model_types: list[str] | None = Field(default=None, min_length=1, max_length=10)
+    backtest: DemoBacktestConfig | None = None
+
+    @field_validator("train_model_types")
+    @classmethod
+    def _known_unique_models(cls, v: list[str] | None) -> list[str] | None:
+        if v is None:
+            return v
+        unknown = [m for m in v if m not in KNOWN_MODEL_TYPES]
+        if unknown:
+            raise ValueError(f"Unknown model type(s): {unknown!r}. Valid: {sorted(KNOWN_MODEL_TYPES)}")
+        if len(set(v)) != len(v):
+            raise ValueError("train_model_types contains duplicates")
+        return v
+
+class WorkspaceListItem(BaseModel):
+    ...existing...
+    # E4 (#410): replay-input echo; None on default-config / pre-E4 rows.
+    run_config: dict[str, Any] | None = Field(default=None)
+
+# ── app/features/demo/models.py (additive column) ────────────────────────────
+# E4 (#410) — replay-input column (NOT an E1 story slot, see PRP D1):
+# {"train_model_types": [...], "backtest": {...}} via model_dump(mode="json");
+# NULL when the run used defaults.
+run_config: Mapped[dict[str, Any] | None] = mapped_column(JSONB, nullable=True)
+
+# ── app/features/demo/pipeline.py (resolved config) ──────────────────────────
+@dataclass(frozen=True)
+class ResolvedRunConfig:
+    """req.train_model_types/backtest with legacy defaults filled in."""
+    model_types: tuple[str, ...] = DEMO_MODEL_TYPES
+    horizon: int = DEMO_HORIZON
+    strategy: str = "expanding"
+    n_splits: int = DEMO_BACKTEST_SPLITS
+    min_train_size: int = DEMO_MIN_TRAIN_SIZE
+    gap: int = 0
+    metric: str = "wape"
+    customized: bool = False     # True when the request carried either field
+
+# DemoContext gains: run_config: ResolvedRunConfig = field(default_factory=ResolvedRunConfig)
+
+# ── app/features/model_selection/schemas.py (additive) ───────────────────────
+class CandidateModelInfo(BaseModel):
+    ...existing...
+    enabled: bool = True   # E4 #410 — forecast_enable_* overlay (service-set)
+```
+
+### Tasks (dependency-ordered)
+
+```yaml
+Task 0 — PRE-FLIGHT (read-only):
+  - VERIFY E1 #407 is merged: `gh issue view 407 --json state` + `uv run alembic heads`
+    (head must be E1's revision, NOT 324a2fa37fcc). If E1 is not merged: STOP —
+    this epic is Parallel-after-Foundation.
+  - RE-RUN the three verification commands in Known Gotchas 1-3.
+  - READ: PRPs/PRP-showcase-completion-E1-metadata-provenance-backbone.md (slot
+    contract), pipeline.py:40-90/440-470/660-840, schemas.py (demo), showcase.tsx.
+
+Task 1 — shared taxonomy allow-list:
+  MODIFY app/shared/model_taxonomy.py:
+    - ADD `KNOWN_MODEL_TYPES: frozenset[str] = frozenset(_MODEL_FAMILY_MAP)` below the map,
+      with a docstring naming it the cross-slice request-validation allow-list.
+  CREATE/EXTEND app/shared/tests/test_model_taxonomy.py:
+    - test_known_model_types_matches_family_map (drift-lock: == set(_MODEL_FAMILY_MAP)).
+    - test_known_model_types_contains_demo_trio.
+
+Task 2 — demo schemas:
+  MODIFY app/features/demo/schemas.py:
+    - ADD DemoBacktestConfig (exact shape above; module placement after DemoRunRequest's
+      dependencies — define BEFORE DemoRunRequest).
+    - ADD train_model_types + backtest to DemoRunRequest (after workspace_name block,
+      comment-tagged "E4 (#410)"); field_validator as above; import KNOWN_MODEL_TYPES
+      from app.shared.model_taxonomy.
+    - ADD run_config to WorkspaceListItem (detail inherits).
+  EXTEND app/features/demo/tests/test_schemas.py (mirror existing naming):
+    - test_demo_run_request_run_config_defaults_none
+    - test_demo_run_request_accepts_model_selection_json_path  # model_validate on plain dicts
+    - test_demo_run_request_rejects_unknown_model_type
+    - test_demo_run_request_rejects_duplicate_model_types
+    - test_demo_run_request_rejects_empty_and_oversized_selection  # [] and 11 entries
+    - test_demo_backtest_config_defaults_and_bounds              # n_splits=1→err, gap>=horizon→err
+    - test_demo_run_request_legacy_frame_still_validates         # EXTEND: assert new fields None
+    - test_workspace_list_item_run_config_round_trip
+
+Task 3 — ORM column + migration:
+  MODIFY app/features/demo/models.py: run_config column (snippet above) inside the
+    "Run configuration -- replay inputs" block; extend class docstring Attributes.
+  CREATE alembic/versions/<rev>_add_showcase_workspace_run_config.py:
+    - revision = autogen id; down_revision = OUTPUT OF `uv run alembic heads` (D2).
+    - upgrade: op.add_column("showcase_workspace", sa.Column("run_config",
+      postgresql.JSONB(astext_type=sa.Text()), nullable=True))
+    - downgrade: op.drop_column. MIRROR the E1 migration's structure/comments.
+  EXTEND app/features/demo/tests/test_models.py: run_config JSONB roundtrip +
+    NULL-default assertions (integration-marked, same pattern as existing).
+
+Task 4 — workspace write:
+  MODIFY app/features/demo/workspace.py:
+    - ADD module-level `def _run_config_payload(req: DemoRunRequest) -> dict[str, Any] | None`:
+      returns None when BOTH fields are None; else
+      {"train_model_types": req.train_model_types,
+       "backtest": req.backtest.model_dump(mode="json") if req.backtest else None}.
+    - create_workspace: pass run_config=_run_config_payload(req) into ShowcaseWorkspace(...).
+  EXTEND app/features/demo/tests/test_workspace.py:
+    - test_create_workspace_records_run_config (custom req → JSONB persisted verbatim)
+    - test_create_workspace_run_config_null_on_defaults
+
+Task 5 — pipeline:
+  MODIFY app/features/demo/pipeline.py:
+    - ADD ResolvedRunConfig dataclass (near DemoContext) + a
+      `def _resolve_run_config(req: DemoRunRequest) -> ResolvedRunConfig` helper.
+    - DemoContext: ADD run_config field (default_factory=ResolvedRunConfig).
+    - run_pipeline (2646): ctx = DemoContext(..., run_config=_resolve_run_config(req)).
+    - _model_config_payload (271): ADD fallback branch
+      `if model_type in KNOWN_MODEL_TYPES: return {"model_type": model_type}`
+      BEFORE the raise; keep existing explicit branches untouched (Gotcha 1).
+    - _select_winner (446): signature → (backtest_results, metric="wape");
+      replace metrics.get("wape") with metrics.get(metric). ONE production call
+      site (pipeline.py:820); existing tests keep passing via the default.
+    - step_train (669): iterate ctx.run_config.model_types; train tail uses
+      ctx.run_config.horizon; PREPEND fail-fast flag check (D6):
+        settings = get_settings()
+        _FLAG_BY_MODEL = {"lightgbm": settings.forecast_enable_lightgbm,
+                          "xgboost": settings.forecast_enable_xgboost,
+                          "random_forest": settings.forecast_enable_random_forest}
+        disabled = [m for m in ctx.run_config.model_types if _FLAG_BY_MODEL.get(m) is False]
+        if disabled: return ("fail", f"model(s) {disabled} requested but the matching "
+                             "forecast_enable_* flag is off — enable it or deselect", {...})
+      step data: ADD "requested_models": list(ctx.run_config.model_types).
+    - step_backtest (731): implement D4. Extract one
+      `_backtest_body(ctx, model_type, *, include_baselines)` helper building the
+      request body from ctx.run_config (split_config: strategy/n_splits/
+      min_train_size/gap/horizon). Branching:
+        if not ctx.run_config.customized: → EXISTING two branches verbatim.
+        else: loop over models = list(ctx.run_config.model_types)
+              + ([SHOWCASE_V2_MODEL_TYPE] if scenario is SHOWCASE_RICH and not
+                 already in selection else [])
+              each include_baselines=False; capture bucketed metrics from the
+              SHOWCASE_V2_MODEL_TYPE call when present.
+      winner = _select_winner(ctx.backtest_results, ctx.run_config.metric)
+      step data: ADD "metric": ctx.run_config.metric.
+    - step_v2_train (1021): train tail uses ctx.run_config.horizon (D8).
+    - run_pipeline pipeline_complete data (2758): ADD
+      "run_config": ({"train_model_types": ..., "backtest": {...}} if customized else None).
+  EXTEND app/features/demo/tests/test_pipeline.py (canned-_Client pattern):
+    - test_resolve_run_config_defaults_and_custom
+    - test_model_config_payload_minimal_fallback_for_all_known_types
+    - test_select_winner_honors_metric (+ NaN/missing skip per metric)
+    - test_step_train_trains_selected_models (capture POSTed bodies)
+    - test_step_train_fails_fast_on_disabled_flag (patch get_settings)
+    - test_step_backtest_sends_configured_split_config (assert body verbatim)
+    - test_step_backtest_custom_selection_appends_prophet_like_on_showcase_rich
+    - test_step_backtest_legacy_path_unchanged_when_not_customized
+    - test_pipeline_complete_echoes_run_config (+ None on legacy run)
+
+Task 6 — catalog enabled overlay:
+  MODIFY app/features/model_selection/schemas.py: CandidateModelInfo.enabled: bool = True
+    (comment: "E4 #410 — runtime forecast_enable_* overlay; service-set").
+  MODIFY app/features/model_selection/service.py get_model_catalog:
+    base = build_model_catalog(); settings = get_settings()
+    flag = {"lightgbm": settings.forecast_enable_lightgbm,
+            "xgboost": settings.forecast_enable_xgboost,
+            "random_forest": settings.forecast_enable_random_forest}
+    return ModelCatalogResponse(
+        models=[m.model_copy(update={"enabled": flag.get(m.model_type, True)}) for m in base.models],
+        default_candidate_model_types=base.default_candidate_model_types)
+  EXTEND model_selection tests (mirror existing catalog tests):
+    - test_catalog_enabled_false_when_flags_off (default settings)
+    - test_catalog_enabled_true_when_flag_on (patched settings)
+    - test_capabilities_stays_pure (build_model_catalog items default enabled=True)
+
+Task 7 — frontend types:
+  MODIFY frontend/src/types/api.ts:
+    - ADD `export interface DemoBacktestConfig` (horizon/strategy/n_splits/
+      min_train_size/gap + `metric: DemoRankingMetric`) and
+      `export type DemoRankingMetric = 'wape' | 'mae' | 'rmse'`.
+    - DemoRunRequest: + `train_model_types?: string[]`, `backtest?: DemoBacktestConfig`
+      (comment-tagged E4 #410, mirror the E1 comment style at 783-787).
+    - WorkspaceListItem: + `run_config?: Record<string, unknown> | null`.
+    - CandidateModelInfo: + `enabled: boolean`.
+
+Task 8 — frontend run-config building blocks:
+  CREATE frontend/src/components/demo/run-config-utils.ts:
+    - DEFAULT_TRAIN_MODELS = ['naive','seasonal_naive','moving_average']
+    - DEFAULT_BACKTEST: DemoBacktestConfig = {horizon:14, strategy:'expanding',
+      n_splits:3, min_train_size:30, gap:0, metric:'wape'}
+    - isDefaultSelection(models) / isDefaultBacktest(cfg) (order-insensitive for models)
+    - buildTrainPlan(models, scenario): {model_type, family?, v2?}[] — appends
+      'prophet_like (V2)' marker on showcase_rich (skip if already selected)
+    - windowDaysFor(scenario): 92 for demo_minimal/sparse/holiday_rush, 180 others
+      (source of truth: pipeline.py _SCENARIO_SEED_PROFILE:513-538 — keep a sync comment)
+    - splitFitWarning(cfg, scenario): string | null when
+      min_train_size + n_splits*(horizon+gap) > windowDaysFor(scenario)
+  CREATE run-config-utils.test.ts covering all of the above.
+  CREATE frontend/src/components/demo/DemoBacktestSettingsForm.tsx:
+    - MIRROR champion-selector/backtest-settings-form.tsx structure (Field
+      helper, metric Select, Collapsible advanced knobs), DIFFERENCES:
+      editable horizon Input (1-90), metrics wape/mae/rmse, REUSE
+      splitConfigErrors from '@/components/champion-selector/split-config'
+      (field names align), plus the splitFitWarning line (amber, non-blocking).
+  CREATE DemoBacktestSettingsForm.test.tsx (mirror champion form's test).
+  CREATE frontend/src/components/demo/RunConfigPanel.tsx:
+    - Collapsible "Run configuration (advanced)" (collapsed default; chevron
+      pattern from backtest-settings-form.tsx:125-137); props: scenario,
+      disabled, selection+onSelectionChange, backtest+onBacktestChange.
+    - Inside: CandidateModelPicker fed `{...catalog, models: catalog.models
+      .filter(m => m.enabled)}` from useModelCatalog() (REUSE both);
+      DemoBacktestSettingsForm; train-candidate preview (Badge chips from
+      buildTrainPlan + count line).
+    - "Reset to defaults" ghost button (restores DEFAULT_* values).
+  CREATE RunConfigPanel.test.tsx:
+    - opt-in models hidden when enabled=false (mock catalog)
+    - preview appends prophet_like on showcase_rich only
+    - reset restores defaults
+
+Task 9 — showcase page wiring:
+  MODIFY frontend/src/pages/showcase.tsx:
+    - state: trainModels (DEFAULT_TRAIN_MODELS), backtestCfg (DEFAULT_BACKTEST).
+    - handleRun: spread-only-when-dirty (D7), mirroring the existing
+      preservation spread (149-154):
+        ...(isDefaultSelection(trainModels) ? {} : {train_model_types: trainModels}),
+        ...(isDefaultBacktest(backtestCfg) ? {} : {backtest: backtestCfg}),
+    - handleLoadWorkspace: when ws.run_config present, repopulate
+      trainModels/backtestCfg (fallback to defaults for missing keys);
+      when absent, reset to defaults.
+    - handleReplayWorkspace: forward ws.run_config fields verbatim into start()
+      (same omit-when-null rule).
+    - Mount <RunConfigPanel/> inside the controls CardContent below the
+      flex-wrap control row (after line 363), disabled={isRunning}.
+    - Run button disabled when trainModels.length === 0 (picker enforces ≥1
+      anyway via toggle, belt-and-braces).
+  MODIFY frontend/src/components/demo/WorkspacePanel.tsx:
+    - rows with run_config render a compact summary line, e.g.
+      "custom: 4 models · rmse · 5×h21" (Badge 'custom config' + muted text).
+  EXTEND showcase/WorkspacePanel vitest specs:
+    - untouched controls → start() called WITHOUT the new keys (dirty rule)
+    - changed metric → start() includes backtest
+    - replay of a run_config workspace forwards it verbatim
+    - WorkspacePanel renders the custom-config badge only when run_config set.
+
+Task 10 — docs sweep (docs(docs): … (#410) or fold into the feat commits):
+  - docs/_base/API_CONTRACTS.md: POST /demo/run + WS /demo/stream rows — E4
+    (#410) additive fields (shape, defaults, validation, dirty-rule note);
+    GET /demo/workspaces run_config field; GET /model-selection/models
+    `enabled` field.
+  - docs/_base/DOMAIN_MODEL.md: showcase_workspace — run_config replay-input
+    column + D1 rationale sentence ("NOT a story slot; config_schema_version
+    unaffected").
+  - docs/_base/RUNBOOKS.md § Showcase: two numbered incidents — (a) train step
+    fails "forecast_enable_* flag is off"; (b) custom split too aggressive for
+    the seeded window → backtest fail is a documented outcome (cite incident 28
+    sparse precedent).
+```
+
+### Integration Points
+
+```yaml
+DATABASE:
+  - migration: add nullable JSONB run_config to showcase_workspace
+  - NO index (read path is by workspace_id; config is display/replay payload)
+CONFIG:
+  - none added; READS forecast_enable_* via get_settings() in step_train (D6)
+    and model_selection service (D3). Never os.environ.
+ROUTES:
+  - none added. DemoRunRequest changes flow through POST /demo/run + WS
+    /demo/stream automatically; catalog field flows through GET /model-selection/models.
+FRONTEND DATA:
+  - useModelCatalog() (existing) powers the picker; no new hooks.
+COMMITS (every one references #410, no AI trailers):
+  - feat(api,db): showcase run-config start-frame contract + workspace column (#410)
+  - feat(api): honor run config in demo pipeline + catalog enabled overlay (#410)
+  - feat(ui): showcase run-config panel, preview, and replay wiring (#410)
+  - docs(docs): document showcase run-config contract (#410)
+```
+
+## Validation Loop
+
+### Level 1 — Syntax & Style (after every task)
+
+```bash
+uv run ruff check . && uv run ruff format --check .
+uv run mypy app/ && uv run pyright app/          # both --strict, both gate merge
+cd frontend && pnpm lint                          # NOTE: pnpm tsc --noEmit is vacuous (memory)
+```
+
+### Level 2 — Unit tests (no DB)
+
+```bash
+uv run pytest app/shared/tests/ app/features/demo/tests/ app/features/model_selection/tests/ -v -m "not integration"
+uv run pytest -v -m "not integration"             # full unit suite before push
+cd frontend && pnpm test --run                    # vitest incl. new specs
+```
+
+### Level 3 — Integration (real Postgres; respect [[fresh-stack-gate-procedure]] — no `down -v`)
+
+```bash
+docker compose up -d && uv run alembic upgrade head
+uv run alembic downgrade -1 && uv run alembic upgrade head   # migration round-trip
+uv run pytest app/features/demo/tests/ -v -m integration
+# Live contract probe (backend on :8123 — kill stale uvicorn first, check ps etime):
+curl -s -X POST http://localhost:8123/demo/run -H 'Content-Type: application/json' -d '{
+  "skip_seed": true, "preservation": "keep", "workspace_name": "e4-probe",
+  "train_model_types": ["naive", "seasonal_average"],
+  "backtest": {"horizon": 14, "n_splits": 3, "min_train_size": 30, "gap": 0,
+               "strategy": "expanding", "metric": "rmse"}}' | python3 -m json.tool
+# Expect: steps green, winner picked by rmse, data.run_config echoed, workspace_id set.
+curl -s "http://localhost:8123/demo/workspaces?limit=1" | python3 -m json.tool   # run_config on the row
+curl -s http://localhost:8123/model-selection/models | python3 -c "
+import json,sys; [print(m['model_type'], m['enabled']) for m in json.load(sys.stdin)['models']]"
+# Error paths:
+curl -s -X POST http://localhost:8123/demo/run -d '{"train_model_types":["bogus"]}' \
+  -H 'Content-Type: application/json' | head -c 300    # 422 problem+json
+```
+
+### Level 4 — Browser dogfood (MANDATORY — UI change; webapp-testing / agent-browser per ui-design.md; [[playwright-dogfood-needs-snap-chromium]] on this host)
+
+```bash
+# Backend :8123 + vite :5173 up, then drive /showcase:
+# 1. Expand "Run configuration (advanced)" — opt-in models absent with default flags.
+# 2. Select naive + seasonal_average, metric RMSE → preview shows 2 chips (+V2 only on showcase_rich).
+# 3. Tick "Save as workspace", name e4-dogfood, Run → pipeline green, train card
+#    shows the 2 requested models, summary winner consistent with RMSE.
+# 4. Saved-workspaces panel: row shows the custom-config badge; Load repopulates
+#    the panel controls; Replay re-runs verbatim (watch the WS frame in devtools).
+# 5. Run once with UNTOUCHED controls → WS start frame has NO new keys (devtools).
+# Capture screenshots for the PR.
+```
+
+## Final Validation Checklist
+
+- [ ] `uv run ruff check . && uv run ruff format --check .` clean
+- [ ] `uv run mypy app/ && uv run pyright app/` clean (strict)
+- [ ] `uv run pytest -v -m "not integration"` green
+- [ ] `uv run pytest -v -m integration` green on a fresh stack (reset first — [[integration-suite-shared-state-pollution]])
+- [ ] Migration upgrade + downgrade + re-upgrade clean on fresh DB
+- [ ] `cd frontend && pnpm lint && pnpm test --run` green
+- [ ] Level-3 curl probes match expectations (incl. 422 path)
+- [ ] Level-4 dogfood evidence captured (screenshots + WS frame byte-compat check)
+- [ ] Legacy-frame byte-compat test extended and green (umbrella criterion)
+- [ ] Docs updated (API_CONTRACTS, DOMAIN_MODEL, RUNBOOKS)
+- [ ] `git diff --stat` shows no CRLF whole-file noise
+- [ ] Commits `type(scope): … (#410)`, no AI trailers; PR into dev
+
+## Anti-Patterns to Avoid
+
+- ❌ Don't add mid-run / per-phase re-entry of any kind — explicitly DEFERRED scope (brainstorm Round 5); the single `asyncio.Lock` linear stream is preserved.
+- ❌ Don't write run_config into an E1 story slot or bump `config_schema_version` (D1).
+- ❌ Don't import `app/features/model_selection` (or any sibling slice) from the demo slice — allow-list lives in `app/shared/model_taxonomy.py`.
+- ❌ Don't read settings inside Pydantic schemas (`.env`-bleed incident class) — flags are enforced in `step_train` and overlaid in the catalog service.
+- ❌ Don't make `capabilities.build_model_catalog` impure — overlay in the service.
+- ❌ Don't clamp/auto-fix an aggressive split server-side — fail honestly (sparse-preset policy precedent).
+- ❌ Don't send the new start-frame keys when the controls are untouched — byte-compat is a frozen criterion.
+- ❌ Don't hand-roll new UI primitives — reuse `CandidateModelPicker`, mirror `BacktestSettingsForm`, shadcn components only (`.claude/rules/shadcn-ui.md`).
+- ❌ Don't weaken or touch `test_leakage.py`, merged migrations, or the champion-selector's existing behavior beyond the additive `enabled` field.
+
+---
+
+## Confidence Score: 8.5/10
+
+One-pass implementation likelihood. **+** Every contract was read and runtime-verified today (minimal model-config payloads, rmse key, flag names/defaults, SplitConfig bounds, catalog purity, start-frame parse path); the additive-field, migration, and byte-compat patterns have three shipped precedents in this exact slice (PRP-38 scenario, E1 #390, E2 #391); the frontend reuses two existing, tested components. **−0.5** D4's unified-loop branch in `step_backtest` is the one genuinely new control-flow path (showcase_rich + custom selection interplay with bucketed metrics). **−0.5** Pre-flight dependency: E1 #407 must merge first and its final migration revision id is unknowable today (mitigated by the `alembic heads` instruction in Task 0/D2).
diff --git a/PRPs/PRP-showcase-completion-E5-agent-rag-story-capture.md b/PRPs/PRP-showcase-completion-E5-agent-rag-story-capture.md
new file mode 100644
index 00000000..f522467b
--- /dev/null
+++ b/PRPs/PRP-showcase-completion-E5-agent-rag-story-capture.md
@@ -0,0 +1,1185 @@
+name: "PRP — Showcase Completion E5: Agent/HITL + RAG Story Capture (issue #411)"
+description: |
+
+## Purpose
+
+Implement Parallel epic E5 of the showcase-completion initiative (umbrella #406):
+persist the HITL approval story (decision approved/rejected/timed_out, action ids,
+tool-call summary, transcript summary) into the workspace row's `approval_events`
+slot; add a **Reject** button to the Showcase HITL step card alongside Approve —
+and make both genuinely clickable by streaming the intermediate
+`awaiting_approval` event DURING the decision window (today it flushes only after
+the step ends, so the button can never render in time); render approval history
+on Showcase and `/ops`; capture RAG events (probe/index/retrieve with provider
+state) into `rag_events`; and mark on replay whether the knowledge/agent story
+was reproduced. Capture is warn-and-continue — it must never fail a green
+pipeline. **No widening of `agent_require_approval`. No agents-slice changes.**
+
+## Core Principles
+
+1. **Context is King**: every reference below was verified against live code on 2026-06-12 (branch `dev` @ `bdf85f6`).
+2. **Validation Loops**: each level is executable as written.
+3. **Information Dense**: patterns cite exact file:line.
+4. **Progressive Success**: hitl relay module → pipeline capture → workspace writes → routes → frontend → tests → docs.
+5. **Global rules**: follow CLAUDE.md / AGENTS.md; all five CI gates must pass; all changes ADDITIVE.
+
+---
+
+## ⛔ BLOCKED BY — E1 #407 (Foundation)
+
+This epic writes the `approval_events` + `rag_events` JSONB story slots that the
+E1 migration (`PRPs/PRP-showcase-completion-E1-metadata-provenance-backbone.md`)
+creates, reads `replayed_from_workspace_id` for the reproduction marker, and
+follows E1's frozen Decisions (slot-per-column, soft references, documented slot
+schema, `config_schema_version` bump rule). **Do not start until E1 #407 is
+merged to `dev`.** Verify before branching:
+
+```bash
+gh issue view 407 --json state          # must be CLOSED
+grep -n "approval_events\|rag_events\|replayed_from_workspace_id" app/features/demo/models.py
+# all three column names must exist on ShowcaseWorkspace
+```
+
+If E1 landed with deviations from its PRP (column names, slot shapes, response
+fields), **the merged code wins** — re-anchor the blueprint below to it.
+
+---
+
+## Goal
+
+A `showcase_rich` keep-run records its agent and knowledge story on the
+workspace row, the operator can genuinely approve OR reject the HITL action
+from the step card, and the story is visible afterwards:
+
+- **`approval_events` capture**: `step_agent_hitl_flow` appends one entry per
+  resolved approval (operator approve, operator reject, window-lapse
+  auto-approve, hard timeout) carrying the E1-frozen base keys plus E5's
+  documented additive keys (auto_approved, reason, execution_status,
+  tool_call_summary, transcript_summary, tokens_used, tool_calls_count).
+  `finalize_workspace` writes the list to the row (warn-and-continue).
+- **`rag_events` capture**: the three knowledge steps
+  (`embedding_provider_probe`, `rag_index_subset`, `rag_retrieve_probe`) append
+  one entry each — event kind, status, detail, count, provider state, timestamp.
+- **Interactive Reject (and a real Approve)**: a new in-demo decision relay —
+  `POST /demo/hitl-decision` + a single-slot in-memory store — makes the
+  PIPELINE the sole caller of `/agents/sessions/{id}/approve`. The step card's
+  Approve/Reject buttons relay operator intent through the demo slice; the
+  pipeline forwards the real decision to the agents HITL gate. The decision
+  window grows 3 s → 10 s so a human can actually click.
+- **Timely intermediate events**: `run_pipeline` drains the intermediate-event
+  sink concurrently with the in-flight step (today it drains only after the
+  step returns — `pipeline.py:2701-2715` — so the FE sees `awaiting_approval`
+  only after the auto-approve already fired).
+- **Approval history surfaces**: `GET /demo/approval-events` flattens recent
+  workspaces' `approval_events` newest-first; the `/ops` page renders it as an
+  "Approval History" table (frontend-only — no ops-slice backend change); the
+  Showcase loaded-workspace view renders the full story (approval events + RAG
+  events + reproduction marker).
+- **Replay reproduction marker**: on a replay keep-run
+  (`replayed_from_workspace_id` set), `finalize_workspace` compares the source
+  row's story slots against the new run's capture and records
+  `result_summary.story_reproduction = {"agent": ..., "knowledge": ...,
+  "source_workspace_id": ...}` with values
+  `reproduced | not_reproduced | not_applicable | unknown`.
+
+A run/request without the new surfaces behaves byte-identically (ephemeral
+runs, `demo_minimal`/`sparse` runs, legacy WS frames). **No Alembic migration**
+— E1 shipped every column E5 touches.
+
+**Deliverable** (all additive):
+
+- `app/features/demo/hitl.py` — NEW single-slot in-memory decision relay
+  (register / wait / resolve / clear), safe under the single-flight pipeline lock.
+- `app/features/demo/pipeline.py` — DemoContext `approval_events`/`rag_events`
+  accumulators; `step_agent_hitl_flow` rework (decision window, relay wait,
+  reject path, event entry); RAG-event appends in the three knowledge steps;
+  concurrent intermediate-event drain in `run_pipeline`.
+- `app/features/demo/workspace.py` — `finalize_workspace` writes both slots +
+  `story_reproduction`; NEW `list_approval_events` helper.
+- `app/features/demo/schemas.py` — `HitlDecisionRequest`,
+  `ApprovalEventItem`, `ApprovalEventsResponse`.
+- `app/features/demo/routes.py` — `POST /demo/hitl-decision`,
+  `GET /demo/approval-events`.
+- `app/features/demo/models.py` — `config_schema_version` ORM default 1 → 2
+  (slot-shape delta; E1 Decision 6 rule) + slot-schema comment delta.
+- Frontend — `HitlDecisionButtons` (Approve + Reject) on the step card;
+  `WorkspaceStoryPanel` on Showcase; "Approval History" section on `/ops`;
+  `use-approval-events` hook; types.
+- Tests: hitl-relay unit tests, HITL-step path tests, drain-ordering test,
+  RAG-event capture tests, route tests, finalize/reproduction integration
+  tests, FE component/hook tests.
+- Docs: `docs/_base/API_CONTRACTS.md`, `docs/_base/DOMAIN_MODEL.md` (slot-schema
+  v2 delta), `docs/_base/RUNBOOKS.md` (HITL incidents 23-25 + workspace section).
+
+**Success definition**: all Success Criteria below check off; five CI gates
+green; integration suite green; a manual `showcase_rich` keep-run lets the
+operator click **Reject** within the 10 s window, the run stays green, the
+workspace row carries the rejected `approval_events` entry + three `rag_events`
+entries, `/ops` lists the event, and a Replay of that workspace records a
+`story_reproduction` marker.
+
+## Why
+
+- Umbrella #406 success criterion: "HITL approval decisions (approve AND the
+  new Reject path) and RAG events are captured on the workspace row and
+  rendered as history on Showcase and /ops".
+- The workspace row today records WHAT a run created but not the agent/HITL or
+  knowledge STORY — the demo's most distinctive moments are unrecoverable
+  after the run ends (RUNBOOKS § Showcase workspace, "Explicitly out of scope":
+  "RAG-event and approval-decision capture on the workspace row" — this epic).
+- The PRP-41 Approve button is effectively decorative: the intermediate
+  `awaiting_approval` event is buffered in a plain list that `run_pipeline`
+  drains only AFTER the step function returns (`pipeline.py:2660-2715`), and the
+  step auto-approves after a 3 s sleep — so the browser learns about the
+  approval window only once it has closed. E5's Reject button is meaningless
+  without fixing this.
+- No approval audit trail exists anywhere today: `AgentService.approve_action`
+  clears `pending_action`, logs, and returns — nothing durable records the
+  decision (`app/features/agents/service.py:825-907`). E5 is the first capture
+  (brainstorm Round 5, `.flow/brainstorm-log.md`).
+
+## What
+
+### User-visible behavior
+
+- The HITL step card on `/showcase` (scenario `showcase_rich`) shows **Approve**
+  and **Reject** buttons while awaiting, with a live "auto-approve in Ns"
+  countdown (10 s window). Either click resolves the action; no click
+  auto-approves at window end. A reject keeps the pipeline GREEN — the step
+  passes with detail `rejected by operator`, and the gated `save_scenario`
+  never executes (no scenario_plan row is written).
+- `POST /demo/hitl-decision` accepts `{action_id, decision: "approved"|"rejected",
+  reason?}`; `404 application/problem+json` when no matching action is pending;
+  `409` when the action was already decided; `422` on a malformed body.
+- `GET /demo/approval-events?limit=N` returns recent approval events flattened
+  across saved workspaces, newest-workspace-first; `200` + empty list when none.
+- The `/ops` page gains an "Approval History" card (table: decision badge, tool,
+  workspace, transcript snippet, when). The Showcase loaded-workspace view gains
+  a story panel: approval events, RAG events (with provider state), and — on
+  replay rows — a "story reproduced / not reproduced" marker.
+- Ephemeral runs and `demo_minimal` / `sparse` runs are unchanged; legacy WS
+  start frames are byte-identical (no new request fields on `DemoRunRequest`).
+
+### Technical requirements
+
+- **No agents-slice changes.** The pipeline remains the only writer of the
+  approve POST in the showcase path; `agent_require_approval` is untouched;
+  no agents migration, no `AgentSession` column. (The durable per-session
+  approval audit is deliberately deferred — see Decisions D8.)
+- **No Alembic migration** — E1 (#407) shipped `approval_events`, `rag_events`,
+  `replayed_from_workspace_id`, `config_schema_version`.
+- **Warn-and-continue invariant**: all capture writes ride inside the existing
+  `finalize_workspace` try/except (`workspace.py:147-154`); a capture failure
+  must never break a green run. ctx accumulators always append in-memory (cheap,
+  cannot fail); only the DB write is fallible.
+- **Single-flight safety**: the in-memory decision relay is correct because at
+  most one pipeline runs per process (`service.py:19` `_pipeline_lock`) and the
+  HITL step registers at most one pending action per run. The relay is
+  module-level state in the demo slice (precedent: `_pipeline_lock`).
+- **Vertical slice**: all backend changes inside `app/features/demo/`; the
+  `/ops` approval-history surface is FRONTEND-ONLY (the ops page queries the
+  demo endpoint — no ops-slice import of demo code, no cross-slice edge).
+- RFC 7807 errors only — `NotFoundError` / `ConflictError` from
+  `app/core/exceptions.py` (demo routes precedent, `routes.py:34,76,134`).
+- Pydantic v2 `ConfigDict(strict=True, extra="forbid")` on `HitlDecisionRequest`
+  (HTTP-only body; all fields JSON-native → no `Field(strict=False)`; the AST
+  policy walker `app/core/tests/test_strict_mode_policy.py` only fires on
+  date/datetime/time/UUID/Decimal).
+- `StepEvent` data additions are additive dict keys only (legacy clients ignore
+  unknown keys — the WS forward-compat contract).
+
+### Success Criteria
+
+- [ ] `run_pipeline` yields buffered intermediate events while the step is
+  still executing: an orchestrator-level test proves the `awaiting_approval`
+  event is received BEFORE the HITL step's terminal `step_complete` in wall
+  time (not just stream order).
+- [ ] Operator approve within the window → approve POST `approved=true`,
+  `approval_events` entry `decision="approved"`, `auto_approved=false`.
+- [ ] Operator reject within the window → approve POST `approved=false`, step
+  terminal `pass` with detail `rejected by operator`, entry
+  `decision="rejected"` (+ optional `reason`), pipeline green, NO scenario_plan
+  row written by the agent.
+- [ ] No decision → auto-approve at 10 s, entry `decision="approved"`,
+  `auto_approved=true`. Hard timeout (90 s) → entry `decision="timed_out"`,
+  step skips (existing semantics preserved).
+- [ ] Each knowledge step appends exactly one `rag_events` entry on every
+  outcome path (pass / warn / skip / auth-skip), carrying `provider` state.
+- [ ] `finalize_workspace` writes both slots (NULL when empty — never `[]`),
+  and on a replay row writes `result_summary.story_reproduction` with the
+  documented values incl. `unknown` for a dangling source.
+- [ ] `POST /demo/hitl-decision`: 204 happy path; 404 no-pending; 409
+  already-decided; 422 bad body (problem+json each).
+- [ ] `GET /demo/approval-events`: 200 + empty list on empty table; flattened
+  entries carry `workspace_id` / `workspace_name`.
+- [ ] FE: Reject button renders alongside Approve, both POST the demo relay via
+  `lib/api.ts` `api()` (not bare `fetch`), countdown reads
+  `data.decision_window_s`; `/ops` Approval History table renders; Showcase
+  story panel renders events + reproduction marker.
+- [ ] Legacy byte-compat: `DemoRunRequest` unchanged; `demo_minimal`/`sparse`
+  emit no relay events and write no slots; `config_schema_version` ORM default
+  is 2 (new rows) while old rows keep 1.
+- [ ] `uv run ruff check . && uv run ruff format --check . && uv run mypy app/
+  && uv run pyright app/ && uv run pytest -v -m "not integration"` green;
+  integration suite green; `cd frontend && pnpm lint && pnpm test --run` green
+  with no NEW `tsc -b` errors vs the dev baseline.
+
+## Decisions (the open questions this PRP resolves)
+
+> Frozen for execution. E7 (release gate) authors: consume, don't re-decide.
+
+1. **D1 — Decision relay: the pipeline is the SOLE approver.** The FE buttons
+   POST `/demo/hitl-decision` (demo slice, in-memory single-slot store); the
+   HITL step waits on the relay up to the window, then POSTs
+   `/agents/sessions/{id}/approve` with the operator's decision (or
+   `approved=true` on window lapse). Rationale: `approve_action` persists NO
+   decision record (`agents/service.py:868-871` clears `pending_action`, logs,
+   returns), so the PRP-41 pattern — FE pre-empts the agents endpoint directly
+   and the pipeline absorbs the 4xx as "executed" (`pipeline.py:2357-2366`) —
+   cannot distinguish an FE approve from an FE reject. Routing intent through
+   the demo slice gives the pipeline ground truth with zero agents-slice
+   changes and zero migrations. The 4xx-absorb stays as belt-and-braces for an
+   operator curl-ing `/agents/.../approve` directly mid-run (recorded as
+   `decision="approved"`, `execution_status="external_4xx"` — honest about the
+   residual ambiguity).
+2. **D2 — Concurrent intermediate-event drain in `run_pipeline`.** Replace
+   `await fn(ctx, client)` with `task = asyncio.ensure_future(fn(ctx, client))`
+   + a `while` loop that `asyncio.wait({task}, timeout=0.25)` and flushes the
+   sink each tick (stamping index/phase fields exactly as the existing
+   post-step drain does, `pipeline.py:2707-2714`). This is NOT a pipeline
+   re-architecture: steps still run strictly one at a time under the same lock;
+   only event flushing overlaps the in-flight step. The post-step drain block
+   stays (final flush). Exception mapping moves to `task.result()` inside the
+   same try/except ladder (`pipeline.py:2681-2699`).
+3. **D3 — Decision window 10 s.** `_APPROVAL_DISPLAY_DELAY_S = 3.0`
+   (`pipeline.py:317`) is replaced by `_APPROVAL_DECISION_WINDOW_S = 10.0`.
+   3 s is unclickable by a human; 10 s keeps the showcase brisk (well under the
+   90 s hard timeout and the 180 s soft budget) and is emitted to the FE as
+   `data.decision_window_s` so the countdown never hardcodes it.
+4. **D4 — Slot-schema delta ⇒ `config_schema_version` ORM default 1 → 2.**
+   E5 widens the E1-frozen `approval_events.decision` enum
+   (`"approved"|"rejected"` → `+"timed_out"`), adds additive entry keys, and
+   adds `"probe"` to the `rag_events.event` enum + additive keys. Per E1
+   Decision 6 ("any epic that changes a documented slot shape bumps the ORM
+   default and documents the delta") this is a bump; E3's PRP explicitly does
+   NOT bump (its CONTRACT(E1) note: populating verbatim ≠ shape change), so no
+   collision is expected — but if another parallel epic bumped first, take the
+   next integer and update DOMAIN_MODEL accordingly. ORM `default=` only —
+   `server_default` stays `text("1")` (no migration; old rows legitimately
+   read 1).
+5. **D5 — Reject keeps the pipeline GREEN.** A human rejection is a SUCCESSFUL
+   demonstration of the HITL gate, not an error: terminal `("pass", "rejected
+   by operator", {..., "approval_decision": "rejected"})`. Only transport/5xx
+   failures keep the existing skip semantics. `step_cleanup` still closes the
+   session either way.
+6. **D6 — Approval history endpoint lives in the DEMO slice.** "Render on
+   /ops" is a frontend statement: the ops PAGE queries
+   `GET /demo/approval-events`. Putting the endpoint in the ops slice would
+   force a cross-slice demo import for data the demo slice owns
+   (`showcase_workspace.approval_events`). Flattening is Python-side over the
+   newest ≤50 rows with a non-NULL slot — a low-cardinality audit table; no
+   `jsonb_array_elements` SQL needed.
+7. **D7 — Reproduction marker lives in `result_summary`** (an existing
+   demo-owned JSONB whose shape is not E1-frozen), NOT a new column and not a
+   slot entry: `{"story_reproduction": {"agent": V, "knowledge": V,
+   "source_workspace_id": str}}` with `V ∈ reproduced | not_reproduced |
+   not_applicable | unknown`. `agent`: source row had ≥1 approval event →
+   compare with the new run (`reproduced`/`not_reproduced`); source had none →
+   `not_applicable`; source row missing (soft reference dangles) → `unknown`.
+   `knowledge`: same logic over `rag_events` entries whose `event` is
+   `index`/`retrieve` with `status != "skip"`. Computed inside
+   `finalize_workspace` (one extra `get`-by-id select in the same session,
+   inside the existing warn-and-continue try).
+8. **D8 — No durable approval audit on `agent_session` (deferred).** The
+   architecturally complete fix (an `approval_history` JSONB on the agents
+   aggregate) needs an agents migration + schema surface — out of this epic
+   per the umbrella approach ("additive-only delta on the existing demo +
+   seeder slices") and the epic's own scope line ("No widening of
+   agent_require_approval"; agents untouched). If E7 review wants it, it is a
+   follow-up issue, not scope creep here.
+
+### Assumptions (explicit, decided without user input)
+
+- `tool_call_summary` carries `{"description": str, "arguments_keys":
+  list[str]}` from `pending_action` — argument KEYS only, never values
+  (security-patterns.md: never echo full payloads; values may embed
+  user-supplied text).
+- `transcript_summary` is the agent's chat `message` truncated to 200 chars
+  (precedent: the #335 failure-detail 300-char cap).
+- The relay rejects decisions for an `action_id` that is not the registered
+  one with 404 (not 409): a mismatched id is "nothing pending under that id".
+- `GET /demo/approval-events` scans the newest 50 workspace rows with a
+  non-NULL slot and caps the flattened list at `limit` (1-200, default 50).
+  No offset/pagination — audit-glance surface, not a browse API.
+- The live-run Showcase surface for history is the step card itself (terminal
+  detail + `HitlFlowSummary`); the story PANEL renders for loaded workspaces.
+- FE buttons disable after either click; 404/409 responses are absorbed
+  silently (the auto-approve raced) — same UX contract as the PRP-41 button.
+
+## All Needed Context
+
+### Documentation & References
+
+```yaml
+# MUST READ — codebase patterns (verified 2026-06-12, dev @ bdf85f6)
+
+- file: PRPs/PRP-showcase-completion-E1-metadata-provenance-backbone.md
+  why: |
+    THE frozen upstream contract: slot columns + per-slot documented schemas
+    (approval_events / rag_events base keys), Decision 2 (slot-per-column),
+    Decision 5 (E5 writes these two slots), Decision 6 (config_schema_version
+    bump rule), and "Notes for parallel-epic PRP authors" (warn-and-continue
+    for pipeline-time slot writes; HTTP writes go through caller-owned-session
+    helpers). Re-verify the merged code matches before relying on line numbers.
+
+- file: app/features/demo/pipeline.py
+  why: |
+    THE file you rework. _Client.__init__ event_sink @136-155 and
+    yield_event @163-174 (plain-list sink, silently dropped when None);
+    DemoContext @213-263 (PRP-41 approval fields @254-257 — the comment style
+    your new accumulator fields follow); _llm_key_present @289;
+    HITL constants @314-322 (_APPROVAL_DISPLAY_DELAY_S=3.0 @317 — replaced;
+    _APPROVAL_HARD_TIMEOUT_S=90.0 @318 — kept; _HITL_PROMPT @319);
+    _embedding_provider_reachable @390; _is_embedding_auth_error @431;
+    step_embedding_provider_probe @1449-1468; step_rag_index_subset
+    @1471-1525 (note the auth-skip path @1493-1501); step_rag_retrieve_probe
+    @1528-1576 (warn-on-zero-hits @1552-1560); step_agent_hitl_flow
+    @2192-2394 (every outcome path you extend — skip-no-key @2222, skip-no-
+    pending @2269, intermediate event @2295-2318, display sleep @2320-2324,
+    hard-timeout @2326-2341, approve POST + 4xx absorb @2343-2377, terminal
+    @2381-2394); _phase_table @2528 (knowledge steps @2589-2593, agents step
+    @2560-2564 — registry unchanged in E5); run_pipeline @2618-2771
+    (intermediate_events buffer @2660-2663, step await + except ladder
+    @2681-2699, post-step drain @2701-2715 — THE BLOCK D2 generalizes,
+    finalize hook @2744-2747).
+
+- file: app/features/demo/workspace.py
+  why: |
+    finalize_workspace @106-155 — the warn-and-continue write you extend with
+    the two slot assignments + story_reproduction (whole-value assignment,
+    inside the existing try). get_workspace @158-171 (reuse the select shape
+    for the source-row read INSIDE finalize's own session — do NOT call
+    get_workspace, it takes a caller-owned session). list_workspaces @174-196
+    — the newest-first select your list_approval_events mirrors (add
+    .where(ShowcaseWorkspace.approval_events.isnot(None))).
+    CONTRACT in module docstring @10-13: create/finalize swallow all errors.
+
+- file: app/features/demo/service.py
+  why: |
+    _pipeline_lock @19 — the single-flight guarantee that makes the in-memory
+    relay safe. PipelineBusyError @22 + the 409 mapping (routes.py:74-77) —
+    the error-translation pattern the hitl-decision route mirrors.
+
+- file: app/features/demo/routes.py
+  why: |
+    Router you extend. delete_showcase_workspace @138-163 — NotFoundError
+    shape; run_demo_pipeline @74-77 — ConflictError shape; list_showcase_
+    workspaces @80-107 — Query(ge/le) param + list response shape for
+    GET /demo/approval-events; WS handler @166-194 (unchanged in E5).
+
+- file: app/features/demo/schemas.py
+  why: |
+    DemoRunRequest @29-85 (UNCHANGED in E5 — no new request fields);
+    StepEvent @88-127 (data is dict[str, Any] — additive keys free);
+    WorkspaceListItem @169-189 / WorkspaceDetailResponse @192-203 — E1 adds
+    the slot fields to Detail; E5 only READS them. New models follow the
+    response-model split: plain BaseModel, from_attributes only where built
+    from ORM rows (ApprovalEventItem is built from dicts — no from_attributes).
+
+- file: app/features/demo/models.py
+  why: |
+    ShowcaseWorkspace — after E1: config_schema_version (ORM default you bump
+    to 2), approval_events / rag_events slot columns + the documented
+    per-slot schema comments (extend with the E5 delta; DOMAIN_MODEL carries
+    the authoritative copy).
+
+- file: app/features/agents/schemas.py
+  why: |
+    PendingAction @170-190 (action_id / action_type / description / arguments
+    — the tool_call_summary source); ApprovalRequest @192-206 (action_id,
+    approved: bool, reason ≤500 — REJECT ALREADY EXISTS in the agents API;
+    the pipeline just sends approved=false); ApprovalResponse @208+ (status:
+    "executed"|"rejected"|"expired" — mapped into execution_status; NOTE an
+    approved-but-failed execution also reports "rejected", see
+    frontend/src/lib/approval-report.ts:10-16).
+
+- file: app/features/agents/service.py
+  why: |
+    approve_action @825-907 — READ ONLY: proves no decision is persisted
+    (pending_action cleared @868, status → ACTIVE @869, returns the response)
+    and that a consumed action raises NoApprovalPendingError → the 400 the
+    4xx-absorb handles. DO NOT MODIFY (D8).
+
+- file: app/features/demo/tests/test_pipeline.py
+  why: |
+    _make_hitl_client @1838-1921 — THE fake-client harness for HITL step
+    tests; extend it (approve body capture, decision injection).
+    test_agent_hitl_flow_happy_path @1959 + the FOUR monkeypatches of
+    _APPROVAL_DISPLAY_DELAY_S @1973/2047/2063/2081 — every one must move to
+    _APPROVAL_DECISION_WINDOW_S (set 0.0 so tests don't sleep). Phase-table
+    test @629-674 pins the 24-row layout (unchanged).
+
+- file: app/features/demo/tests/conftest.py
+  why: |
+    client fixture (ASGITransport, monkeypatched-service unit route tests) +
+    db_session fixture (integration; wipes showcase_workspace on teardown) —
+    reuse both; do not invent new fixtures.
+
+- file: frontend/src/components/demo/demo-step-card.tsx
+  why: |
+    ApproveButton @371-421 — REPLACED by HitlDecisionButtons. Note the bare
+    relative fetch(approvalUrl) @393 — only works when SPA origin == API
+    origin; the replacement MUST use lib/api.ts api() (API_BASE_URL-prefixed,
+    frontend/src/lib/api.ts:3,23-26). Render condition @496-505 (keep shape;
+    swap component). HitlFlowSummary mount @494.
+
+- file: frontend/src/pages/showcase.tsx
+  why: |
+    Page wiring: loadedWorkspace detail query @128-131; WorkspaceArtifacts
+    Panel mount @448-450 — mount WorkspaceStoryPanel beside it (same
+    `phase !== 'running' && loadedWorkspace` guard). handleReplayWorkspace
+    @174-186 (E1 adds replayed_from_workspace_id here — E5 does not touch it).
+
+- file: frontend/src/pages/ops.tsx
+  why: |
+    "Needs Attention" section @394-446 — THE Card+Table pattern (empty-state
+    paragraph, StatusBadge, formatWhen) the Approval History section mirrors.
+    Place the new section directly after Needs Attention.
+
+- file: frontend/src/hooks/use-workspaces.ts + frontend/src/hooks/use-ops.ts
+  why: |
+    TanStack patterns: queryKey arrays, api<T>() calls, refetchInterval
+    choices. use-approval-events.ts mirrors useWorkspaces (no polling — the
+    table changes only when a run finishes; document that in the hook docstring
+    like useRetrainingCandidates does).
+
+- file: frontend/src/types/api.ts
+  why: |
+    StepEvent @760 / DemoRunRequest @778 / WorkspaceListItem @806 /
+    WorkspaceDetail @819 — add ApprovalEventItem / ApprovalEventsResponse and
+    (if E1 did not already) the WorkspaceDetail slot fields E5 reads
+    (approval_events, rag_events). Comment style: `// E5 (#411) — ...`.
+
+- file: frontend/src/lib/approval-report.ts
+  why: |
+    Documents the executed/rejected/expired semantics of ApprovalResponse
+    (incl. approved-but-execution-failed → "rejected") — the mapping
+    execution_status follows.
+
+- file: docs/_base/DOMAIN_MODEL.md
+  why: |
+    § showcase_workspace — E1 documents the frozen slot schemas; E5 appends
+    the v2 delta (decision enum widening, additive keys, "probe" event,
+    story_reproduction in result_summary) and the config_schema_version=2
+    note. Authoritative slot-schema copy lives HERE.
+
+- file: docs/_base/RUNBOOKS.md
+  why: |
+    Incidents 23-25 (agent_hitl_flow) — update for the 10 s window, the
+    Reject path, and the relay endpoint; § Showcase workspace — trim
+    "RAG-event and approval-decision capture" from the out-of-scope list.
+
+- file: PRPs/PRP-showcase-completion-E3-seed-config-scope.md
+  why: |
+    Parallel-epic coordination: E3 also extends DemoContext and touches
+    create_workspace-time writes. Expect textual merge conflicts in
+    DemoContext / workspace.py if E3 lands first — both additions are
+    independent; resolve by keeping both blocks. E3's CONTRACT(E1) note
+    (line 1031) confirms E3 does NOT bump config_schema_version — E5 does (D4).
+
+# Issue / initiative context
+- url: https://github.com/w7-mgfcode/ForecastLabAI/issues/411
+  why: The epic this PRP implements.
+- url: https://github.com/w7-mgfcode/ForecastLabAI/issues/406
+  why: Umbrella — success criteria, out-of-scope list, warn-and-continue risk row.
+- url: https://github.com/w7-mgfcode/ForecastLabAI/issues/407
+  why: Foundation epic (BLOCKING) — frozen column/slot/endpoint contract.
+
+# Exemplar PRPs (style + validation-gate conventions)
+- file: PRPs/PRP-41-showcase-agent-ops-polish.md
+  why: Authored the HITL step + intermediate-event sink E5 reworks.
+- file: PRPs/ai_docs/prp-41-contract-probe-report.md
+  why: Verified agents HITL contracts (chat pending_approval shape, approve 400 on consumed action).
+```
+
+### Current Codebase tree (relevant subset, post-E1)
+
+```bash
+app/features/demo/
+├── models.py          # ShowcaseWorkspace + E1 columns (approval_events/rag_events/config_schema_version/replayed_from_workspace_id)
+├── pipeline.py        # _Client sink @136; DemoContext @213; HITL constants @314; knowledge steps @1449-1576; step_agent_hitl_flow @2192; run_pipeline @2618
+├── workspace.py       # create @46; finalize @106; get @158; list @174; delete @199; count @224 (+ E1 update_workspace)
+├── schemas.py         # DemoRunRequest @29; StepEvent @88; Workspace* @169-213 (+ E1 WorkspaceUpdateRequest, slot fields on Detail)
+├── routes.py          # POST /run @51; GET/PATCH/DELETE /workspaces @80-163; WS /stream @166
+├── service.py         # _pipeline_lock @19 (UNCHANGED)
+└── tests/             # conftest, test_models, test_pipeline (_make_hitl_client @1838), test_routes, test_schemas, test_workspace
+frontend/src/
+├── components/demo/demo-step-card.tsx   # ApproveButton @371; render condition @496
+├── components/demo/WorkspaceArtifactsPanel.tsx
+├── pages/showcase.tsx                   # loadedWorkspace @128; panels @244-450
+├── pages/ops.tsx                        # Needs Attention table @394-446
+├── hooks/use-workspaces.ts / use-ops.ts
+├── lib/api.ts                           # api<T>() @23 (API_BASE_URL @3)
+└── types/api.ts                         # StepEvent @760; Workspace* @806-830
+```
+
+### Desired Codebase tree (files added/modified)
+
+```bash
+app/features/demo/
+├── hitl.py                       # NEW — single-slot decision relay (register/wait/resolve/clear)
+├── pipeline.py                   # MOD — ctx accumulators; HITL step rework; rag-event appends; D2 drain
+├── workspace.py                  # MOD — finalize slot writes + story_reproduction; NEW list_approval_events
+├── schemas.py                    # MOD — HitlDecisionRequest; ApprovalEventItem; ApprovalEventsResponse
+├── routes.py                     # MOD — POST /hitl-decision; GET /approval-events
+├── models.py                     # MOD — config_schema_version ORM default 2; slot-comment delta
+└── tests/
+    ├── test_hitl.py              # NEW — relay unit tests (asyncio)
+    ├── test_pipeline.py          # MOD — HITL paths, rag capture, drain-ordering, constant rename
+    ├── test_routes.py            # MOD — hitl-decision 204/404/409/422; approval-events 200
+    ├── test_schemas.py           # MOD — HitlDecisionRequest (+ JSON path); response models
+    └── test_workspace.py         # MOD — finalize slots + story_reproduction (integration)
+frontend/src/
+├── components/demo/demo-step-card.tsx      # MOD — HitlDecisionButtons (Approve+Reject, api(), countdown)
+├── components/demo/demo-step-card.test.tsx # MOD — Reject render + POST body
+├── components/demo/WorkspaceStoryPanel.tsx       # NEW — approval + rag events + reproduction marker
+├── components/demo/WorkspaceStoryPanel.test.tsx  # NEW
+├── components/demo/index.ts                # MOD — export
+├── pages/showcase.tsx                      # MOD — mount WorkspaceStoryPanel
+├── pages/ops.tsx                           # MOD — Approval History section
+├── hooks/use-approval-events.ts            # NEW — useApprovalEvents
+├── hooks/use-approval-events.test.ts       # NEW
+├── hooks/index.ts                          # MOD — export
+└── types/api.ts                            # MOD — ApprovalEventItem/Response (+ Detail slot fields if E1 didn't)
+docs/_base/API_CONTRACTS.md                 # MOD — 2 endpoints + WS data-key notes
+docs/_base/DOMAIN_MODEL.md                  # MOD — slot-schema v2 delta + story_reproduction
+docs/_base/RUNBOOKS.md                      # MOD — HITL incidents + out-of-scope trim
+```
+
+### Known Gotchas & Library Quirks
+
+```python
+# CRITICAL — intermediate events flush only AFTER the step returns today
+#   (run_pipeline drains the list sink post-await, pipeline.py:2701-2715).
+#   PRP-41's Approve button therefore never renders during the window. D2's
+#   concurrent drain is LOAD-BEARING for this whole epic — implement and test
+#   it FIRST (Task 3) or every FE-interaction test downstream is meaningless.
+
+# CRITICAL — the relay wait must use asyncio primitives, not polling sleeps:
+#   asyncio.wait_for(event.wait(), timeout=...) raises TimeoutError on lapse
+#   (stdlib-verified 2026-06-12:
+#   uv run python -c "import asyncio
+#   async def m():
+#       ev=asyncio.Event()
+#       async def r(): await asyncio.sleep(0.05); ev.set()
+#       t=asyncio.ensure_future(r())
+#       await asyncio.wait_for(ev.wait(), timeout=1.0); print(True); await t
+#   asyncio.run(m())"  -> True).
+
+# CRITICAL — warn-and-continue: ALL new finalize_workspace logic (slot writes,
+#   source-row read, story_reproduction) goes INSIDE the existing try block
+#   (workspace.py:126-146). Never add a second commit path; never let a
+#   malformed source row raise out.
+
+# CRITICAL — JSONB whole-value assignment: build the full list on ctx, then
+#   row.approval_events = ctx.approval_events or None. NEVER append to a
+#   loaded row's JSONB in place (invisible to SQLAlchemy without
+#   flag_modified). Empty list -> None (E1: NULL = "slot never written").
+
+# CRITICAL — the relay is process-global mutable state. It is safe ONLY
+#   because service._pipeline_lock enforces one run at a time. Guard anyway:
+#   register() overwrites any stale slot (a crashed run must not wedge the
+#   next one) and the step clears it in a finally block.
+
+# CRITICAL — do NOT modify app/features/agents/** (D8) and do NOT touch
+#   agent_require_approval. The reject path is expressed entirely through the
+#   EXISTING ApprovalRequest.approved=false contract (agents/schemas.py:192).
+
+# GOTCHA — FE ApproveButton today uses bare fetch(approvalUrl) with a
+#   RELATIVE url (demo-step-card.tsx:393) — breaks when VITE_API_BASE_URL
+#   points off-origin. HitlDecisionButtons must use api() from lib/api.ts.
+
+# GOTCHA — tests monkeypatch pipeline._APPROVAL_DISPLAY_DELAY_S at FOUR
+#   sites: test_pipeline.py:1973, 2047, 2063, 2081. Renaming the constant
+#   without sweeping all four fails loudly (monkeypatch.setattr
+#   AttributeError). The "auto-approve in 3 s" detail string lives only in
+#   pipeline.py:2306 (no test asserts it); the FE countdown copy at
+#   demo-step-card.tsx:415 computes from the 90 s HARD timeout, not the
+#   window — replace it with the decision_window_s countdown.
+#   Grep "_APPROVAL_DISPLAY_DELAY_S\|auto-approve in" before renaming.
+
+# GOTCHA — StepEvent attribute stamping (ev.step_index = index) relies on
+#   Pydantic validate_assignment being OFF (default) — existing production
+#   behavior (pipeline.py:2708); keep the drained-event stamping identical.
+
+# GOTCHA — D2's task wrapper changes exception flow: _StepError raised inside
+#   the step now surfaces from task.result(). Keep the EXACT except ladder
+#   (_StepError -> fail / httpx.HTTPError|OSError -> transport fail /
+#   Exception -> unexpected fail).
+#   CRITICAL sub-case: the Stop button closes the WebSocket -> the async
+#   generator is CLOSED, which throws **GeneratorExit** (a BaseException) into
+#   the frame at D2's new mid-step `yield ev` suspension point. Neither
+#   `except asyncio.CancelledError` nor `except Exception` catches it, so the
+#   in-flight step task would be orphaned ("Task was destroyed but it is
+#   pending") with the _Client context exiting underneath it. The drain loop
+#   MUST therefore sit inside `try/finally: if not task.done(): task.cancel()`
+#   — cancellation on ANY exit (GeneratorExit, CancelledError, or a raise),
+#   never only on CancelledError. Verify the Stop path by hand in Level 4.
+
+# GOTCHA — parallel-epic merge friction: E3 (#409) extends DemoContext and
+#   the create-time workspace write; E2/E4 may touch finalize for
+#   job_ids/phase_summaries. All additions are disjoint — resolve conflicts
+#   by keeping both hunks; re-run the full demo test file after any rebase.
+
+# GOTCHA — repo has mixed CRLF/LF line endings; run `git diff --stat` before
+#   committing (Edit/Write emit LF — whole-file noise diffs are a regression).
+
+# GOTCHA — frontend type gate: `pnpm tsc --noEmit` is vacuous and `tsc -b`
+#   already fails on dev with pre-existing errors. Gate on "no NEW errors vs
+#   the dev baseline" + `pnpm lint` + `pnpm test --run`.
+
+# GOTCHA — mypy --strict AND pyright --strict gate merge: full annotations
+#   incl. `-> None` on tests, typed module-level relay state
+#   (e.g. _slot: _PendingDecision | None), and dataclass field types.
+
+# CONVENTION — branch: feat/showcase-completion-e5-agent-rag-story (off dev,
+#   slug ≤50). Commits reference #411: feat(api): ... (#411) for slice code,
+#   feat(ui): ... (#411) for frontend, docs(repo)/docs(docs): ... (#411).
+#   NO AI trailer (hook-enforced).
+
+# RUNTIME-VERIFICATION LOG (per prp-create step 3 — re-run on upgrade):
+#   1. asyncio.Event + wait_for timeout semantics — verified 2026-06-12
+#      (command above, prints True).
+#   2. No NEW third-party API claims: httpx-ASGITransport step client,
+#      SQLAlchemy JSONB whole-value writes, Pydantic strict-literal bodies,
+#      and TanStack query/mutation shapes are all existing production code in
+#      this repo (pipeline.py / workspace.py / schemas.py / use-workspaces.ts).
+#   3. agents approve contract probed in PRPs/ai_docs/prp-41-contract-probe-
+#      report.md (approved=false path + 400-on-consumed) — re-verify only if
+#      the agents slice changes upstream.
+```
+
+## Implementation Blueprint
+
+### Data shapes (documented slot-schema v2 — authoritative copy goes to DOMAIN_MODEL)
+
+```python
+# approval_events entry (list[dict], append-only). E1-frozen base keys first;
+# E5 additive keys below the marker. decision enum WIDENED in v2.
+{
+    "action_id": str,
+    "tool_name": str,                  # pending_action.action_type
+    "decision": "approved" | "rejected" | "timed_out",
+    "decided_at": str,                 # iso8601 UTC
+    "session_id": str,
+    # -- E5 (#411) additive, config_schema_version >= 2 --
+    "auto_approved": bool,             # True when the window lapsed
+    "reason": str | None,              # operator-supplied (Reject), <=500
+    "execution_status": str | None,    # ApprovalResponse.status: executed|rejected|expired;
+                                       # "external_4xx" on the absorbed pre-empt edge; None on timed_out
+    "tool_call_summary": {"description": str, "arguments_keys": list[str]},
+    "transcript_summary": str,         # agent chat message, <=200 chars
+    "tokens_used": int,
+    "tool_calls_count": int,
+}
+
+# rag_events entry (list[dict], append-only). "probe" event ADDED in v2.
+{
+    "event": "probe" | "index" | "retrieve" | "skip",
+    "status": "pass" | "warn" | "skip",   # E5 additive
+    "detail": str,
+    "count": int,                      # chunks indexed / results returned / 0
+    "occurred_at": str,                # iso8601 UTC
+    "provider": str | None,            # E5 additive — embedding provider name
+    "reachable": bool | None,          # E5 additive — probe only
+}
+
+# result_summary additive key (replay keep-runs only):
+{"story_reproduction": {"agent": V, "knowledge": V, "source_workspace_id": str}}
+# V ∈ "reproduced" | "not_reproduced" | "not_applicable" | "unknown"
+```
+
+```python
+# app/features/demo/hitl.py — NEW. Single-slot in-memory decision relay.
+# Safe because service._pipeline_lock enforces one pipeline per process and
+# the HITL step registers at most one action per run (precedent for
+# module-level state: service.py:19).
+"""HITL decision relay for the showcase pipeline (E5, issue #411). ..."""
+from __future__ import annotations
+import asyncio
+from dataclasses import dataclass, field
+from typing import Literal
+
+Decision = Literal["approved", "rejected"]
+ResolveOutcome = Literal["applied", "already_decided", "not_found"]
+
+@dataclass
+class _PendingDecision:
+    action_id: str
+    event: asyncio.Event = field(default_factory=asyncio.Event)
+    decision: Decision | None = None
+    reason: str | None = None
+
+_slot: _PendingDecision | None = None   # module-level; one pipeline at a time
+
+def register(action_id: str) -> None:
+    """Open the decision window; overwrites any stale slot from a dead run."""
+    global _slot
+    _slot = _PendingDecision(action_id=action_id)
+
+def resolve(action_id: str, decision: Decision, reason: str | None = None) -> ResolveOutcome:
+    """Record the operator's decision; called by POST /demo/hitl-decision."""
+    if _slot is None or _slot.action_id != action_id:
+        return "not_found"
+    if _slot.decision is not None:
+        return "already_decided"
+    _slot.decision = decision
+    _slot.reason = reason
+    _slot.event.set()
+    return "applied"
+
+async def wait_for_decision(action_id: str, timeout: float) -> tuple[Decision, str | None] | None:
+    """Block up to ``timeout`` for an operator decision; None on lapse."""
+    if _slot is None or _slot.action_id != action_id:
+        return None
+    try:
+        await asyncio.wait_for(_slot.event.wait(), timeout=timeout)
+    except TimeoutError:
+        return None
+    if _slot.decision is None:   # defensive: set() without decision
+        return None
+    return (_slot.decision, _slot.reason)
+
+def clear() -> None:
+    """Close the window (step's finally)."""
+    global _slot
+    _slot = None
+```
+
+```python
+# app/features/demo/pipeline.py — DemoContext additions (after workspace_name,
+# E3-comment style):
+    # E5 (#411) -- story-capture accumulators. Appended by step_agent_hitl_flow
+    # and the knowledge steps on SHOWCASE_RICH; finalize_workspace persists
+    # them to the workspace slots (empty -> slot stays NULL).
+    approval_events: list[dict[str, Any]] = field(default_factory=list)
+    rag_events: list[dict[str, Any]] = field(default_factory=list)
+
+# Constants — REPLACE _APPROVAL_DISPLAY_DELAY_S (line 317):
+_APPROVAL_DECISION_WINDOW_S = 10.0   # D3 — operator decision window
+
+# RAG-event helper (near the knowledge steps):
+def _record_rag_event(ctx, *, event, status, detail, count=0, provider=None, reachable=None) -> None:
+    ctx.rag_events.append({... per the v2 shape, "occurred_at": datetime.now(UTC).isoformat()})
+# Call once on EVERY return path of the three knowledge steps:
+#   probe   -> event="probe",   status="pass", provider=, reachable=
+#   index   -> event="index"|"skip", count=total_chunks
+#   retrieve-> event="retrieve"|"skip", status="warn" on zero hits, count=results_count
+# (provider for index/retrieve: reuse the probe's provider via a ctx echo or
+#  re-read get_settings().rag_embedding_provider — settings read is simplest.)
+
+# step_agent_hitl_flow rework (between the intermediate event and terminal):
+#   - intermediate event data ADDS: "decision_window_s": _APPROVAL_DECISION_WINDOW_S
+#     and "decision_url": "/demo/hitl-decision"; detail becomes
+#     f"awaiting approval (auto-approve in {int(_APPROVAL_DECISION_WINDOW_S)} s)"
+#   - hitl.register(action_id) BEFORE yielding the intermediate event;
+#     try/finally hitl.clear() around everything after registration.
+#   - replace the sleep @2320-2324 with:
+#       remaining = max(0.0, _APPROVAL_DECISION_WINDOW_S - (time.monotonic() - started_at))
+#       operator = await hitl.wait_for_decision(action_id, timeout=remaining)
+#   - keep the hard-timeout check @2326-2341; on timed_out ALSO append the
+#     approval_events entry (decision="timed_out", execution_status=None).
+#   - approved = operator is None or operator[0] == "approved"
+#     reason = operator[1] if operator else None
+#     POST /approve with {"action_id": ..., "approved": approved,
+#     **({"reason": reason} if reason else {})}
+#   - 4xx absorb stays; record execution_status="external_4xx" on that edge.
+#   - append the approval_events entry on EVERY resolved path, then terminal:
+#       reject -> ("pass", "rejected by operator", {..., "approval_decision": "rejected"})
+#       approve -> existing pass shape (+ "auto_approved" key in data)
+
+# run_pipeline D2 drain — replace `status, detail, data = await fn(ctx, client)`
+# (and its except ladder) with:
+    task = asyncio.ensure_future(fn(ctx, client))
+    try:
+        while True:
+            done, _ = await asyncio.wait({task}, timeout=0.25)
+            for ev in intermediate_events:           # same stamping as today
+                ev.step_index = index; ev.total_steps = total
+                ev.phase_index = phase_index; ev.phase_total = phase_total
+                ev.phase_name = phase_name
+                yield ev                              # NEW suspension point —
+                                                      # generator close lands HERE
+            intermediate_events.clear()
+            if done:
+                break
+        status, detail, data = task.result()
+    except _StepError as exc: ...                    # EXACT existing ladder
+    finally:
+        # LOAD-BEARING (quality-gate Finding 3): the Stop button closes the
+        # async generator, throwing GeneratorExit (BaseException) into the
+        # mid-step `yield ev` above — no except clause sees it. The finally
+        # is the ONLY hook that runs on every exit path; without it the
+        # in-flight step task is orphaned while _Client closes under it.
+        if not task.done():
+            task.cancel()
+# The existing post-task drain block @2701-2715 stays as the final flush.
+```
+
+```python
+# app/features/demo/workspace.py — inside finalize_workspace's try, after
+# row.result_summary assignment (@141-145):
+            row.approval_events = ctx.approval_events or None   # E5 (#411)
+            row.rag_events = ctx.rag_events or None
+            summary: dict[str, Any] = {... existing three keys ...}
+            if row.replayed_from_workspace_id:                  # D7 marker
+                src = (await db.execute(select(ShowcaseWorkspace).where(
+                    ShowcaseWorkspace.workspace_id == row.replayed_from_workspace_id
+                ))).scalar_one_or_none()
+                summary["story_reproduction"] = _story_reproduction(src, ctx)
+            row.result_summary = summary
+
+def _story_reproduction(src: ShowcaseWorkspace | None, ctx: DemoContext) -> dict[str, Any]:
+    """D7 — compare the source row's story slots against this run's capture."""
+    if src is None:
+        return {"agent": "unknown", "knowledge": "unknown",
+                "source_workspace_id": None}
+    def _verdict(source_had: bool, new_has: bool) -> str:
+        if not source_had: return "not_applicable"
+        return "reproduced" if new_has else "not_reproduced"
+    src_knowledge = any(e.get("event") in ("index", "retrieve") and e.get("status") != "skip"
+                        for e in (src.rag_events or []))
+    new_knowledge = any(e.get("event") in ("index", "retrieve") and e.get("status") != "skip"
+                        for e in ctx.rag_events)
+    return {
+        "agent": _verdict(bool(src.approval_events), bool(ctx.approval_events)),
+        "knowledge": _verdict(src_knowledge, new_knowledge),
+        "source_workspace_id": src.workspace_id,
+    }
+
+async def list_approval_events(db: AsyncSession, *, limit: int = 50) -> list[dict[str, Any]]:
+    """Flatten approval_events across the newest rows that carry the slot."""
+    result = await db.execute(
+        select(ShowcaseWorkspace)
+        .where(ShowcaseWorkspace.approval_events.isnot(None))
+        .order_by(ShowcaseWorkspace.created_at.desc(), ShowcaseWorkspace.id.desc())
+        .limit(50)
+    )
+    events: list[dict[str, Any]] = []
+    for row in result.scalars():
+        for entry in row.approval_events or []:
+            events.append({"workspace_id": row.workspace_id,
+                           "workspace_name": row.name, **entry})
+            if len(events) >= limit:
+                return events
+    return events
+```
+
+```python
+# app/features/demo/schemas.py — new models.
+class HitlDecisionRequest(BaseModel):
+    """Operator decision relay for the showcase HITL step (E5, #411). ..."""
+    model_config = ConfigDict(strict=True, extra="forbid")
+    action_id: str = Field(..., min_length=1, description="Pending action to decide.")
+    decision: Literal["approved", "rejected"] = Field(..., description="Operator decision.")
+    reason: str | None = Field(default=None, max_length=500,
+                               description="Optional reason (mirrors agents ApprovalRequest.reason).")
+
+class ApprovalEventItem(BaseModel):
+    """One flattened approval event (built from JSONB dicts — tolerant typing)."""
+    workspace_id: str
+    workspace_name: str | None = None
+    action_id: str | None = None
+    tool_name: str | None = None
+    decision: str | None = None
+    decided_at: str | None = None
+    session_id: str | None = None
+    auto_approved: bool | None = None
+    reason: str | None = None
+    execution_status: str | None = None
+    transcript_summary: str | None = None
+
+class ApprovalEventsResponse(BaseModel):
+    events: list[ApprovalEventItem]
+    total: int   # number returned (flattened cap), not a table count
+```
+
+```python
+# app/features/demo/routes.py — two routes.
+@router.post("/hitl-decision", status_code=status.HTTP_204_NO_CONTENT,
+             summary="Relay an operator decision to the in-flight HITL step", ...)
+async def submit_hitl_decision(body: HitlDecisionRequest) -> None:
+    outcome = hitl.resolve(body.action_id, body.decision, body.reason)
+    if outcome == "not_found":
+        raise NotFoundError(message=f"No pending HITL action: {body.action_id}")
+    if outcome == "already_decided":
+        raise ConflictError(f"Action already decided: {body.action_id}")
+
+@router.get("/approval-events", response_model=ApprovalEventsResponse,
+            summary="Recent HITL approval events across saved workspaces", ...)
+async def list_hitl_approval_events(
+    db: AsyncSession = Depends(get_db),
+    limit: int = Query(default=50, ge=1, le=200),
+) -> ApprovalEventsResponse:
+    events = await workspace.list_approval_events(db, limit=limit)
+    return ApprovalEventsResponse(
+        events=[ApprovalEventItem.model_validate(e) for e in events],
+        total=len(events),
+    )
+```
+
+```tsx
+// frontend — HitlDecisionButtons replaces ApproveButton (demo-step-card.tsx):
+//  - props: actionId, decisionWindowS (from step.data.decision_window_s ?? 10)
+//  - api('/demo/hitl-decision', { method: 'POST', body: { action_id, decision } })
+//    via lib/api.ts (NOT bare fetch); absorb 404/409 silently, surface 5xx.
+//  - Approve: variant "default"; Reject: variant "destructive" + size "sm";
+//    both disable after either click ("Approving…"/"Rejecting…").
+//  - countdown: `auto-approve in ${remaining}s` ticking from decisionWindowS.
+// WorkspaceStoryPanel (new): Card titled "Run story"; sections —
+//  Approval history (decision StatusBadge + tool + transcript snippet + when),
+//  Knowledge events (event/status/provider/count), Reproduction marker chips
+//  (from result_summary.story_reproduction; render only when present).
+//  Render nothing when the workspace has no slots (legacy rows).
+// ops.tsx: "Approval History" Card+Table after Needs Attention, fed by
+//  useApprovalEvents() (hooks/use-approval-events.ts; queryKey
+//  ['demo','approval-events',limit]; no polling); empty-state paragraph.
+```
+
+### List of tasks (dependency order)
+
+```yaml
+Task 0 — preconditions:
+  VERIFY: gh issue view 407 --json state -> CLOSED; the three E1 columns exist
+    on ShowcaseWorkspace; re-anchor blueprint line numbers if E1/E3 moved code.
+  RUN: git switch dev && git pull && git switch -c feat/showcase-completion-e5-agent-rag-story
+
+Task 1 — CREATE app/features/demo/hitl.py:
+  - IMPLEMENT the relay per blueprint (typed module state, register/resolve/
+    wait_for_decision/clear, structlog on resolve)
+  - CREATE tests/test_hitl.py: resolve-before-wait, wait-then-resolve,
+    timeout->None, wrong-action->not_found, double-resolve->already_decided,
+    register-overwrites-stale-slot, clear()
+
+Task 2 — MODIFY app/features/demo/models.py:
+  - config_schema_version ORM default 1 -> 2 (server_default UNCHANGED)
+  - EXTEND the slot-schema comments with the v2 delta (blueprint shapes)
+
+Task 3 — MODIFY pipeline.py run_pipeline (D2 drain) — FIRST pipeline change:
+  - task wrapper + 0.25s asyncio.wait flush loop per blueprint; preserve the
+    exact except ladder; `finally: if not task.done(): task.cancel()` so a
+    generator close (Stop button -> GeneratorExit at the mid-step yield)
+    never orphans the in-flight step task
+  - ADD orchestrator tests: (a) a stub step that emits an intermediate event
+    then blocks on an asyncio.Event; assert the intermediate event is yielded
+    while the step is still pending, then release and assert terminal order;
+    (b) close the generator (aclose()) while the stub step is in-flight and
+    assert the step task ends cancelled (no "destroyed but pending" warning)
+
+Task 4 — MODIFY pipeline.py HITL step + constants:
+  - REPLACE _APPROVAL_DISPLAY_DELAY_S with _APPROVAL_DECISION_WINDOW_S = 10.0
+    (sweep tests: monkeypatches @1973/2047/2063 + "auto-approve in 3 s" asserts)
+  - DemoContext: + approval_events / rag_events accumulators (E5 comment block)
+  - step_agent_hitl_flow: hitl.register before the intermediate event;
+    intermediate data += decision_window_s + decision_url; wait_for_decision;
+    reject path (approved=false POST, terminal pass "rejected by operator");
+    approval_events entry on every resolved path (incl. timed_out);
+    try/finally hitl.clear()
+  - EXTEND _make_hitl_client: capture approve POST json_body; tests for
+    operator-approve / operator-reject (resolve via hitl.resolve before the
+    wait) / window-lapse auto-approve / hard-timeout entry / skip paths
+    append nothing
+
+Task 5 — MODIFY pipeline.py knowledge steps:
+  - _record_rag_event helper + one call per return path of probe/index/retrieve
+  - tests: each path appends the right entry (pass/skip/auth-skip/warn),
+    provider populated, demo_minimal run leaves ctx.rag_events empty
+
+Task 6 — MODIFY workspace.py:
+  - finalize_workspace: slot writes + story_reproduction per blueprint
+    (all inside the existing try); _story_reproduction helper
+  - NEW list_approval_events helper
+  - tests/test_workspace.py (@integration): finalize writes slots (and NULL
+    when empty); replay row vs source-with-story -> reproduced; source-empty
+    -> not_applicable; dangling source -> unknown; list_approval_events
+    flattens newest-first and respects limit
+
+Task 7 — MODIFY schemas.py + routes.py:
+  - HitlDecisionRequest / ApprovalEventItem / ApprovalEventsResponse
+  - POST /demo/hitl-decision (204/404/409) + GET /demo/approval-events
+  - tests/test_schemas.py: JSON-dict path (security-patterns.md § strict mode):
+    HitlDecisionRequest.model_validate({"action_id": "a", "decision": "rejected"});
+    extra-key 422; bad decision literal 422; reason >500 422
+  - tests/test_routes.py: decision 204 (hitl registered via monkeypatch/
+    direct register) / 404 / 409 / 422; approval-events 200 empty + populated
+    (monkeypatch workspace.list_approval_events for the unit-shaped test,
+    follow the file's existing convention)
+
+Task 8 — frontend:
+  - types/api.ts: ApprovalEventItem/ApprovalEventsResponse (+ WorkspaceDetail
+    approval_events/rag_events fields IF E1 didn't add them); `// E5 (#411)` comments
+  - demo-step-card.tsx: HitlDecisionButtons per blueprint (replace
+    ApproveButton; keep the @496-505 render condition shape); update
+    demo-step-card.test.tsx (Reject renders, POST body, countdown text)
+  - hooks/use-approval-events.ts + test; export from hooks/index.ts
+  - components/demo/WorkspaceStoryPanel.tsx + test; export from index.ts;
+    mount in showcase.tsx beside WorkspaceArtifactsPanel (@448-450 guard)
+  - pages/ops.tsx: Approval History section after Needs Attention (@446),
+    mirroring its Card+Table+empty-state pattern
+  - GATES: pnpm lint && pnpm test --run; tsc -b no NEW errors
+
+Task 9 — docs (additive):
+  - API_CONTRACTS.md: two new demo rows (POST /demo/hitl-decision incl.
+    204/404/409 semantics; GET /demo/approval-events); WS /demo/stream section:
+    intermediate HITL event now streams DURING the window and data gains
+    decision_window_s/decision_url; note the 10 s window and the Reject path
+  - DOMAIN_MODEL.md § showcase_workspace: slot-schema v2 delta (decision enum,
+    additive keys, "probe" event), config_schema_version=2 note,
+    story_reproduction in result_summary
+  - RUNBOOKS.md: incidents 23-25 — window now 10 s, Reject button exists, a
+    rejected run is GREEN by design; § Showcase workspace — trim "RAG-event
+    and approval-decision capture" from the out-of-scope list
+
+Task 10 — gates, dogfood, PR:
+  - full Validation Loop (Levels 1-4); git diff --stat (CRLF noise check)
+  - COMMITS (reference #411, no AI trailer), e.g.:
+      feat(api): add hitl decision relay and story capture to demo pipeline (#411)
+      feat(ui): add reject button, run story panel and ops approval history (#411)
+      docs(docs): document approval and rag story capture contracts (#411)
+  - PR into dev; title:
+      feat(api,ui): showcase-completion E5 — agent/hitl + rag story capture (#411)
+```
+
+### Integration Points
+
+```yaml
+DATABASE: none — no migration (E1 shipped the columns). ORM default bump only.
+CONFIG: none — no new settings; the window is a module constant emitted to the FE.
+ROUTES: two additions on the existing demo router — no app/main.py change.
+AGENTS SLICE: untouched (D8). The pipeline keeps using the public
+  /agents/sessions/{id}/approve contract (approved=true|false + reason).
+OPS SLICE: untouched — /ops approval history is a frontend section over the
+  demo endpoint.
+FRONTEND: one replaced component, one new panel, one new ops section, one hook.
+PARALLEL EPICS: E3 touches DemoContext + create-time writes; E2/E4 may write
+  job_ids/phase_summaries in finalize — keep-both merge resolution; whoever
+  lands after a slot-shape change rebases the config_schema_version default.
+```
+
+## Validation Loop
+
+### Level 1: Syntax & Style
+
+```bash
+uv run ruff check . && uv run ruff format --check .
+uv run mypy app/ && uv run pyright app/
+```
+
+### Level 2: Unit Tests (no DB)
+
+```bash
+uv run pytest app/features/demo -v -m "not integration"
+uv run pytest app/core/tests/test_strict_mode_policy.py -v
+# Key new/updated cases (see Tasks 1,3,4,5,7):
+#   test_hitl.py — relay semantics incl. timeout + already_decided
+#   test_pipeline.py — drain-ordering (intermediate BEFORE step completion);
+#     HITL operator-approve / operator-reject / auto-approve / timed-out
+#     entries; reject terminal is pass + green; rag-event appends per path;
+#     demo_minimal leaves both accumulators empty
+#   test_routes.py — hitl-decision 204/404/409/422; approval-events 200
+#   test_schemas.py — HitlDecisionRequest JSON path + extra=forbid
+```
+
+### Level 3: Integration (real Postgres)
+
+```bash
+docker compose up -d && uv run alembic upgrade head
+uv run pytest app/features/demo -v -m integration
+# test_workspace.py — finalize writes approval_events/rag_events (NULL when
+# empty); story_reproduction matrix (reproduced / not_applicable / unknown);
+# list_approval_events flatten + limit
+```
+
+### Level 4: Manual smoke (seeded local stack, uvicorn :8123 + vite)
+
+```bash
+# 1. showcase_rich keep-run from /showcase with "Save as workspace" ticked.
+#    During the agents phase the HITL card must show Approve + Reject with a
+#    ticking "auto-approve in Ns" — click REJECT. Expect: step flips to pass
+#    with "rejected by operator"; run finishes GREEN.
+# 2. Verify capture:
+curl -s "http://localhost:8123/demo/approval-events?limit=5" | python3 -m json.tool
+#    -> one entry, decision="rejected", workspace_id set
+docker exec forecastlab-postgres psql -U forecastlab -d forecastlab -c \
+  "SELECT name, jsonb_array_length(approval_events) AS approvals, \
+          jsonb_array_length(rag_events) AS rag, config_schema_version \
+   FROM showcase_workspace ORDER BY created_at DESC LIMIT 1;"
+#    -> approvals=1, rag=3, config_schema_version=2
+# 3. Decision relay error paths:
+curl -s -X POST http://localhost:8123/demo/hitl-decision \
+  -H 'Content-Type: application/json' \
+  -d '{"action_id": "bogus", "decision": "approved"}' | python3 -m json.tool   # 404 problem+json
+# 4. Replay the kept workspace (Replay button) and let it auto-approve; then:
+#    GET /demo/workspaces/{new_id} -> result_summary.story_reproduction.agent
+#    == "reproduced" (source had an approval event; replay produced one too).
+# 5. /ops page shows the Approval History table; Showcase Load on the kept
+#    workspace renders the story panel (events + reproduction chips).
+# 6. Regression: run demo_minimal — no buttons, no relay calls, slots NULL.
+```
+
+## Final validation Checklist
+
+- [ ] Five gates green: `uv run ruff check . && uv run ruff format --check . && uv run mypy app/ && uv run pyright app/ && uv run pytest -v -m "not integration"`
+- [ ] Integration suite green on a fresh docker-compose DB (reset first if the shared DB is polluted)
+- [ ] Drain-ordering test proves intermediate events stream mid-step; Stop button still cancels cleanly (CancelledError passthrough)
+- [ ] Reject path: green run, entry captured, no scenario_plan written by the agent
+- [ ] Slots NULL on empty capture; `config_schema_version`=2 on new rows, old rows still 1
+- [ ] `POST /demo/hitl-decision` 204/404/409/422 and `GET /demo/approval-events` verified (Levels 2+4)
+- [ ] story_reproduction matrix covered (reproduced / not_reproduced / not_applicable / unknown)
+- [ ] Frontend: `pnpm lint` + `pnpm test --run` green; no NEW `tsc -b` errors; manual browser pass (Level 4 steps 1, 5)
+- [ ] No agents-slice diff; no migration; `git diff --stat` surgical (no CRLF noise)
+- [ ] Docs updated (API_CONTRACTS, DOMAIN_MODEL v2 slot delta, RUNBOOKS trim)
+- [ ] Commits `feat(api)/feat(ui)/docs(...): ... (#411)`, no AI trailer; PR into dev
+
+---
+
+## Anti-Patterns to Avoid
+
+- ❌ Don't touch `app/features/agents/**` or `agent_require_approval` — the reject path uses the EXISTING `approved=false` contract (D1/D8).
+- ❌ Don't let a reject (or any capture failure) fail the pipeline — reject is `pass`; capture rides warn-and-continue.
+- ❌ Don't re-architect the pipeline beyond the D2 drain loop — steps stay strictly sequential under the single lock; no mid-run frame reading on the WS.
+- ❌ Don't make the FE call `/agents/sessions/{id}/approve` directly anymore — the relay is the single intent channel (keep emitting `approval_url` for back-compat, but the buttons use the relay).
+- ❌ Don't echo tool-call argument VALUES or full transcripts into `approval_events` — keys + 200-char summary only.
+- ❌ Don't write `[]` into a slot — empty capture leaves the column NULL (E1: NULL = "never written").
+- ❌ Don't add a migration or change `server_default` for the version bump — ORM `default=` only.
+- ❌ Don't put the approval-history endpoint in the ops slice — demo owns the data; /ops renders client-side.
+- ❌ Don't validate that the replay source row exists — dangles are designed; `unknown` is the honest verdict.
+- ❌ Don't mutate row JSONB in place — whole-value assignment only.
+- ❌ Don't add list pagination/filtering to approval-events — audit glance, not a browse API (E7 can extend).
+
+## Notes for the release-gate epic (E7)
+
+- E5 bumps the documented slot schema to v2 (D4) — the DOMAIN_MODEL delta is
+  the authoritative copy; verify E2/E4/E6 didn't race the same default.
+- The D2 drain generalizes intermediate-event streaming for ANY step — if a
+  later epic wants mid-step progress (e.g. batch sub-job ticks), the plumbing
+  now exists; document it if used.
+- The deferred durable approval audit on `agent_session` (D8) is a candidate
+  follow-up issue if the chat surface (non-showcase) needs history too.
+
+## Confidence Score
+
+**8/10** for one-pass implementation success. Every write path has a verified
+in-repo precedent: the slot columns + warn-and-continue hook (E1/`workspace.py`),
+the module-level single-flight state (`service.py:19`), the HITL step's fake-client
+test harness (`test_pipeline.py:1838`), the agents `approved=false` contract
+(`agents/schemas.py:192` — no agents change needed), and the ops Card+Table /
+TanStack patterns. The two judgment calls with real risk are resolved and frozen:
+D1 (relay; eliminates the unknowable FE-pre-empt decision) and D2 (concurrent
+drain; the one structural change, contained to a single loop body with a
+dedicated ordering test). The −2: (a) D2 touches the orchestrator's exception/
+cancellation flow — the Stop-button path (`WebSocketDisconnect` → generator
+close → CancelledError) must be re-verified by hand in Level 4; (b) this PRP is
+written against E1's PRP rather than E1's merged code, and E3 may land in
+parallel — Task 0's re-anchoring step and the keep-both merge note mitigate but
+can't eliminate rebase friction.