agent-runner

中文 · English

agent-runner

A restart-on-exit supervisor for autonomous coding CLIs. Tested with Claude Code and aider out of the box; any prompt-arg CLI via custom config. Spawn the agent round-after-round under defenses that prevent the failure modes that bite in production: stuck rounds, orphan commits, OAuth burn loops, full disks, runaway memory.

┌──────────────────────────────────────────┐
│ Layer 3: The Witness (monitor)           │  10 detectors + auto-stop
├──────────────────────────────────────────┤
│ Layer 2: The Loop (serve, ~60 LOC)       │  signal-trapping restart loop
├──────────────────────────────────────────┤
│ Layer 1: The Round (round)               │  one agent invocation
└──────────────────────────────────────────┘

Install

pip install cli-agent-runner

The installed CLI command is agent-runner (the PyPI distribution name is prefixed for namespace disambiguation; the import name and command are not).

Quick start

cd your-project
agent-runner init                 # scaffold agent-runner.toml + prompts/main.md
$EDITOR agent-runner.toml         # point agent.command at your CLI
agent-runner install --monitor    # systemd user units for serve + monitor
agent-runner status               # confirm running
agent-runner peek                 # snapshot of project state
agent-runner monitor              # live anomaly detection

Full walkthrough: docs/quickstart.md.

13 verbs

Lifecycle	Observation
`init` / `install` / `uninstall`	`peek` — state snapshot
`start` / `stop` / `kill` / `cancel`	`watch` — peek in a refresh loop
`restart` / `status`	`monitor` — 10 detectors, alerts, auto-stop
`round` / `serve`

Verb reference: docs/commands.md.

Defenses (built in)

11 named defenses, structured as data — see agent-runner peek --select defenses. Each carries the historical incident it codifies and the invariant test that guards it. Highlights:

round_timeout_s — hard wall, never the agent's word on when to stop
process_group_isolation — kill the round, not just the parent
orphan_stash_idempotency_s — no 3-stashes-per-second pile-ups
sha_locked_stash — stash@{N} indices drift; SHAs don't
set_diff_classification — line-set comparison, not unified-diff +/- scan
startup_smoke_check — refuse to run with a clearly-truncated prompt

Full list and rationale: docs/architecture.md.

Monitor: 10 detectors

Notify only: timeout_rate, hung, orphan_chain, disk_warning, mem_pressure, smoke_fail_rate, network_fail, rate_limit_active.

Auto-stop the service (continuing is harmful):

oauth_fail — burning API quota on auth-rejected rounds
disk_critical — writing to a near-full disk risks corruption

Runs locally or against a remote host via ssh:

agent-runner monitor                  # local, 30s poll
agent-runner monitor --host pi        # remote, 60s poll
agent-runner monitor --json | jq -c   # pipe to downstream consumers

Note: Remote monitor (monitor --host <alias>) relies on your local ~/.ssh/config for StrictHostKeyChecking and other ssh policy. Use accept-new or yes to avoid MITM exposure.

Documentation

docs/quickstart.md — 5-step install + first round
docs/commands.md — verb reference
docs/configuration.md — agent-runner.toml schema
docs/runbook.md — operator troubleshooting (OAuth, disk, orphan)
docs/architecture.md — 3-layer model, defenses-as-data

Development

git clone https://github.com/wan9yu/cli-agent-runner.git
cd cli-agent-runner
python3 -m venv .venv && source .venv/bin/activate
pip install -e ".[dev]"

./build.sh check                          # full local-CI sweep
./build.sh test                           # unit + integration only
AGENT_RUNNER_E2E_PI=1 ./build.sh e2e      # opt-in pi e2e (needs ssh alias `pi`)

Some docs/*.md blocks are generated from code — ./build.sh docs rewrites the  regions, and ./build.sh check verifies they are fresh.

POSIX-only (Linux, macOS). Tested under Python 3.11+ on x86_64 and aarch64.

License

Apache License 2.0.

Name		Name	Last commit message	Last commit date
Latest commit History 385 Commits
.githooks		.githooks
.github		.github
agent_runner		agent_runner
deploy		deploy
docs		docs
tests		tests
.codecov.yml		.codecov.yml
.gitignore		.gitignore
.vulture-whitelist.py		.vulture-whitelist.py
CHANGELOG.md		CHANGELOG.md
CODE_OF_CONDUCT.md		CODE_OF_CONDUCT.md
CONTRIBUTING.md		CONTRIBUTING.md
LICENSE		LICENSE
README.md		README.md
README.zh.md		README.zh.md
SECURITY.md		SECURITY.md
build.sh		build.sh
pyproject.toml		pyproject.toml

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Repository files navigation

agent-runner

Install

Quick start

13 verbs

Defenses (built in)

Monitor: 10 detectors

Documentation

Development

License

About

Uh oh!

Releases 22

Packages

Uh oh!

Contributors

Uh oh!

Languages

Folders and files

Latest commit

History

Repository files navigation

agent-runner

Install

Quick start

13 verbs

Defenses (built in)

Monitor: 10 detectors

Documentation

Development

License

About

Topics

Resources

License

Code of conduct

Contributing

Security policy

Uh oh!

Stars

Watchers

Forks

Releases 22

Packages 0

Uh oh!

Contributors

Uh oh!

Languages

Packages