feat: Smart retry policies with safe rebase for failed jobs

## Source

Audit report — Section 9: Phase 3 Roadmap

## Description

When a job fails, the only option is manual intervention (inspect, fix, relaunch). There is no automated retry mechanism that can safely rebase the job's work onto the latest base branch and retry.

## Proposed Solution

1. **Retry policy configuration:** Per-job or global config for max retries, backoff strategy
2. **Safe rebase before retry:** When retrying, rebase the job's branch onto the latest base/integration branch to pick up any changes from other jobs
3. **Conflict detection:** If rebase conflicts, pause and notify instead of blindly retrying
4. **Retry context:** Pass failure context to the retried agent (what failed, error output) so it can adapt
5. **Retry limits:** Configurable max retries with exponential backoff to prevent infinite loops

## Relationship to Other Issues

- Builds on #23 (failure artifact capture) for retry context
- Related to (but distinct from) closed #39 (autopilot pause on error) — #39 was about graceful degradation, this is about automated recovery
- Benefits from #32 (sync against local base) for the rebase step

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

feat: Smart retry policies with safe rebase for failed jobs #51

Source

Description

Proposed Solution

Relationship to Other Issues

Metadata

Assignees

Labels

Projects

Milestone

Relationships

Development

feat: Smart retry policies with safe rebase for failed jobs #51

Description

Source

Description

Proposed Solution

Relationship to Other Issues

Metadata

Metadata

Assignees

Labels

Projects

Milestone

Relationships

Development

Issue actions