Skip to content

Abort the resume when label or comment reads fail#50

Merged
Phlogistique merged 14 commits into
mainfrom
claude/fail-loud-resume-reads-3njvdx
Jun 11, 2026
Merged

Abort the resume when label or comment reads fail#50
Phlogistique merged 14 commits into
mainfrom
claude/fail-loud-resume-reads-3njvdx

Conversation

@Phlogistique

Copy link
Copy Markdown
Collaborator

pr_has_conflict_label swallowed gh failures as "no label", so a transient API error made the resume silently skip a labeled PR. Worse, a failed comments fetch in read_state_marker read as "no marker", which made the caller abandon the resume and drop the conflict label for good — a dead end. Both now abort the run instead; the label stays on, so the next push retries.

Stacked on #45; #51 builds on this.

🤖 Generated with Claude Code

https://claude.ai/code/session_01JHvKryT4QUpHYdNq9YEQxX


Generated by Claude Code

claude added 9 commits June 9, 2026 20:49
The squash-merge fan-out retargeted every updated child PR onto the
target branch and only afterwards pushed the new heads, batched into a
single non-atomic push together with the merged-branch deletion. If the
push failed (e.g. someone pushed to a child mid-run, rejecting the plain
push) or a pr edit died partway through the loop, set -e aborted the run
with PRs already retargeted but their heads stale - and unlike the
conflict-resume path there is no label to re-trigger the action, so
nothing ever repaired them.

Apply the ordering the resume path already uses: push the updated heads
first, then flip the bases, and delete the merged branch last (deleting
a PR's base branch closes the PR, so every child must be off it first).
A failed push now leaves the PRs untouched on their old base.

The unit test captures the run transcript and asserts the
push -> retarget -> delete order; it fails against the previous code.
Also corrects the README: pushes are plain, not forced, and branch
deletion is its own final step.

🤖 Generated with [Claude Code](https://claude.com/claude-code)

https://claude.ai/code/session_01JHvKryT4QUpHYdNq9YEQxX
Since #40 the conflict comment's fast-forward step reads `git merge
--ff-only origin/<branch>`, which assert_conflict_comment_merges picks
up with its `^git merge` grep, so the extracted commands never match the
expected conflict merges. Skip the --ff-only line when extracting.

Also trim the new comments in the fan-out push/retarget/delete sequence.

🤖 Generated with [Claude Code](https://claude.com/claude-code)

https://claude.ai/code/session_01JHvKryT4QUpHYdNq9YEQxX
The fix for the --ff-only line breaking assert_conflict_comment_merges
moved to a separate PR; the e2e job here stays red until that lands and
main is merged back in.

🤖 Generated with [Claude Code](https://claude.com/claude-code)

https://claude.ai/code/session_01JHvKryT4QUpHYdNq9YEQxX
A head branch can carry several PRs (one per base), so gh calls keyed by
branch name can comment, label, or retarget the wrong one. Every gh call
that acts on a specific PR now uses the PR number: the fan-out carries
number/branch pairs from gh pr list, and the conflict-resolved run gets
PR_NUMBER from the event payload via action.yml.

The payload also already carries the PR's base branch, so the resume
takes it from a new PR_BASE variable instead of querying the API; the
resume test's gh mock no longer answers baseRefName queries, so a
reintroduced lookup fails loudly.

🤖 Generated with [Claude Code](https://claude.com/claude-code)

https://claude.ai/code/session_01JHvKryT4QUpHYdNq9YEQxX
pr_has_conflict_label swallowed gh failures as "no label", so a
transient API error made the resume silently skip a labeled PR, and a
failed comments fetch in read_state_marker read as "no marker", which
made the caller abandon the resume and drop the conflict label for
good. Both now abort the run instead; the label stays on, so the next
push retries.

🤖 Generated with [Claude Code](https://claude.com/claude-code)

https://claude.ai/code/session_01JHvKryT4QUpHYdNq9YEQxX
@github-actions github-actions Bot changed the base branch from claude/pr-number-addressing-3njvdx to main June 11, 2026 11:22
Comment thread update-pr-stack.sh Outdated
Comment thread update-pr-stack.sh Outdated
@Phlogistique Phlogistique merged commit 9c0543e into main Jun 11, 2026
3 checks passed
@github-actions github-actions Bot deleted the claude/fail-loud-resume-reads-3njvdx branch June 11, 2026 11:52
Phlogistique added a commit that referenced this pull request Jun 11, 2026
`read_state_marker` accepted a marker from *any* comment, so on a public
repo anyone able to comment could plant one; if its `base=` matched, the
resume would merge an attacker-chosen commit (fork-pushed objects are
reachable by hash in the repo network) into the branch and push it with
the action's token. Benign variant: a quote-reply of an old conflict
comment resurrects a stale marker, since HTML comments survive quoting
and the newest marker wins.

Fix: filter comments to `viewerDidAuthor` — those posted with the same
token the action runs under — which needs no configured identity. The
resume test's gh mock rejects comment queries without that filter.
Caveat: if the repo switches tokens (e.g. `GITHUB_TOKEN` → App) while a
PR sits in conflict, the old marker is no longer "ours" and the resume
takes the safe abandon path.

Also rejects markers with missing fields instead of passing empty values
to git (a marker missing `squash=` used to crash on `update-ref` and
strand the PR under the label); new scenario E covers it.

Stacked on #50 (same function).

🤖 Generated with [Claude Code](https://claude.com/claude-code)

https://claude.ai/code/session_01JHvKryT4QUpHYdNq9YEQxX

---
_Generated by [Claude
Code](https://claude.ai/code/session_01JHvKryT4QUpHYdNq9YEQxX)_

---------

Co-authored-by: Claude <noreply@anthropic.com>
Co-authored-by: github-actions <github-actions@github.com>
Phlogistique added a commit that referenced this pull request Jun 11, 2026
Hit on #50 today: after the retarget to main, GitHub kept comparing
against the deleted parent branch tip and showed a 323-line diff for a
14-line change. A push to the head branch eventually got it to
recompute.

🤖 Generated with [Claude Code](https://claude.com/claude-code)

Co-authored-by: Claude Fable 5 <noreply@anthropic.com>
Phlogistique pushed a commit that referenced this pull request Jun 11, 2026
Trunk meanwhile absorbed most of this PR (#42 reordered main(), #50
hardened the label and comment reads), so what remains is the
has_sibling_conflicts fix plus a new one of the same kind: list_child_prs
failures were swallowed by the process substitutions consuming it, so a
failed listing read as 'no children' and let main() delete the merged
branch under the children it never saw. Callers now capture the output and
die on failure.

https://claude.ai/code/session_01STkeSJ7cLrmrNn4aTDYkwH
Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment

Labels

None yet

Projects

None yet

Development

Successfully merging this pull request may close these issues.

2 participants