diff --git a/README.md b/README.md new file mode 100644 index 0000000..6d314a3 --- /dev/null +++ b/README.md @@ -0,0 +1,184 @@ +# agentcrumbs + +AI agents can read your code but they can't see what happened at runtime. agentcrumbs fixes that. Agents drop structured traces inline as they write code. When something breaks, the agent queries those traces and sees exactly what ran, with what data, in what order. + +Crumbs are development-only. They get stripped before merge and cost nothing when disabled. + +``` +Service A ──┐ ┌── $ agentcrumbs tail +Service B ──┤── fetch() ──> Collector :8374 ──┤── $ agentcrumbs query --since 5m +Service C ──┘ (fire & forget) └── ~/.agentcrumbs/crumbs.jsonl +``` + +## Getting started + +```bash +npm install agentcrumbs +npx @tanstack/intent install +``` + +Then tell your agent: **"Run the agentcrumbs/init skill."** + +The init skill scans your repo, discovers services and modules, and builds a **namespace catalog** that gets written to your agent config (CLAUDE.md, .cursorrules, etc.). Without the catalog, every agent invents its own namespace names: `auth`, `auth-service`, `authService`, `authentication`, all pointing at the same thing. The catalog locks it down. Every agent, every session, same names. + +After init, the agent knows which namespaces to use and how to drop crumbs correctly. + +## Agent skills + +agentcrumbs ships with [@tanstack/intent](https://tanstack.com/blog/from-docs-to-agents) skills inside the npm package. Running `npx @tanstack/intent install` wires them into your agent config so the agent learns correct usage patterns, common mistakes to avoid, and the namespace catalog for your project. + +| Skill | What it teaches | +| --- | --- | +| `agentcrumbs/init` | Scans repo, discovers namespaces, writes config | +| `agentcrumbs/core` | `trail()`, `crumb()`, markers, env var, noop guarantee | +| `agentcrumbs/scopes-and-context` | `scope()`, `wrap()`, `child()`, `snapshot()`, `assert()` | +| `agentcrumbs/sessions-and-tags` | `session()`, tags, grouping and filtering | +| `agentcrumbs/cli` | `collect`, `tail`, `query`, `strip`, `session` | + +Skills travel with the package version. The agent always has docs matching the installed code. + +## How it works + +The agent writes crumbs as part of the code it's implementing: + +```typescript +import { trail } from "agentcrumbs"; // @crumbs +const crumb = trail("auth-service"); // @crumbs + +export async function handleLogin(token: string) { + crumb("login attempt", { tokenPrefix: token.slice(0, 8) }); // @crumbs + + const user = await validateToken(token); + + crumb("login success", { userId: user.id }); // @crumbs + return user; +} +``` + +When something goes wrong, the agent starts the collector and queries the trail: + +```bash +agentcrumbs collect --quiet & +AGENTCRUMBS=1 node app.js +agentcrumbs query --since 5m --ns auth-service +``` + +``` +auth-service login attempt +0ms { tokenPrefix: "eyJhbGci" } +auth-service token decode ok +3ms { userId: "u_8f3k" } +auth-service permissions check +8ms { roles: [] } +auth-service rejected: no roles +8ms { status: 401 } +``` + +Now the agent knows: the token is valid, but the user has no roles. The fix is in role assignment, not token validation. + +## Workflow + +Crumbs live on your feature branch. They never ship to main. + +1. **Agent writes code with crumbs.** As it implements a feature, it drops crumbs at every decision point. +2. **Something breaks.** The agent starts the collector, re-runs the failing code with `AGENTCRUMBS=1`, and queries the trail. +3. **Agent reads the trail.** It sees what actually executed, in what order, with what data. Fixes the root cause instead of guessing. +4. **Strip before merge.** `agentcrumbs strip` removes all crumb code. Clean diff, clean main. +5. **CI enforces it.** `agentcrumbs strip --check` exits 1 if any `@crumbs` markers are found. + +## The noop guarantee + +When a namespace is disabled, `trail()` returns a pre-built frozen noop function. There is no `if (enabled)` check on every call. The function itself is the noop. + +The only cost is the function call itself, which V8 will likely inline after warmup. For hot paths with expensive arguments, gate on `crumb.enabled`. + +## API overview + +All methods are documented in detail at [docs.agentcrumbs.dev/api](https://docs.agentcrumbs.dev/api). + +| Method | Purpose | +| --- | --- | +| `trail(namespace)` | Create a trail function for a namespace | +| `crumb(msg, data?, options?)` | Drop a crumb with message and optional data | +| `crumb.scope(name, fn)` | Wrap a function with entry/exit/error tracking | +| `crumb.child(context)` | Create a child trail with inherited context | +| `crumb.wrap(name, fn)` | Wrap any function with automatic scope tracking | +| `crumb.time(label)` / `crumb.timeEnd(label)` | Measure operation duration | +| `crumb.snapshot(label, obj)` | Capture a point-in-time deep clone | +| `crumb.assert(condition, msg)` | Debug-only assertion (emits crumb, never throws) | +| `crumb.session(name)` | Group crumbs into logical sessions | + +## Crumb markers + +Mark crumb lines with `// @crumbs` (single line) or `// #region @crumbs` / `// #endregion @crumbs` (block) so they can be stripped before merge. See the [markers docs](https://docs.agentcrumbs.dev/markers) for details and examples. + +## Environment variable + +Everything is controlled by a single `AGENTCRUMBS` environment variable. + +| Value | Effect | +| --- | --- | +| `1`, `*`, `true` | Enable all namespaces | +| `auth-service` | Exact namespace match | +| `auth-*` | Wildcard match | +| `auth-*,api-*` | Multiple patterns (comma or space separated) | +| `* -internal-*` | Match all except excluded patterns | +| `{"ns":"*","port":9999}` | JSON config with full control | + +JSON config fields: `ns` (namespace filter, required), `port` (collector port, default 8374), `format` (`"pretty"` or `"json"`, default `"pretty"`). + +## CLI + +Common commands for reference. Run `agentcrumbs --help` for the full list. + +```bash +# Collector +agentcrumbs collect --quiet & # Start in background +agentcrumbs collect --port 9999 # Custom port + +# Live tail +agentcrumbs tail # All namespaces +agentcrumbs tail --ns auth-service # Filter by namespace +agentcrumbs tail --tag perf # Filter by tag + +# Query +agentcrumbs query --since 5m # Last 5 minutes +agentcrumbs query --ns auth-service --since 1h +agentcrumbs query --tag root-cause +agentcrumbs query --json --limit 50 + +# Strip +agentcrumbs strip --dry-run # Preview removals +agentcrumbs strip # Remove all crumb code +agentcrumbs strip --check # CI gate (exits 1 if markers found) + +# Utilities +agentcrumbs stats # Crumb counts, file size +agentcrumbs clear # Delete stored crumbs +``` + +Time units: `s` (seconds), `m` (minutes), `h` (hours), `d` (days). + +## Multi-service architecture + +All services write to the same collector. `agentcrumbs tail` shows interleaved output with namespace-colored labels. See the [multi-service docs](https://docs.agentcrumbs.dev/multi-service) for setup patterns. + +## Cross-language compatibility + +The collector is language-agnostic. Any language with HTTP support can send crumbs: + +```bash +curl -X POST http://localhost:8374/crumb \ + -H "Content-Type: application/json" \ + -d '{"ts":"2026-01-01T00:00:00Z","ns":"shell","msg":"hello","type":"crumb","dt":0,"pid":1}' +``` + +## Runtime compatibility + +Zero runtime dependencies. Node.js built-in modules only: `node:http`, `node:async_hooks`, `node:crypto`, `node:fs`, `node:util`. + +Verified compatible with **Node.js 18+** and **Bun**. + +## Docs + +Full documentation at [docs.agentcrumbs.dev](https://docs.agentcrumbs.dev). + +## License + +MIT diff --git a/docs/src/app/layout.tsx b/docs/src/app/layout.tsx index fc37ffe..ad48d58 100644 --- a/docs/src/app/layout.tsx +++ b/docs/src/app/layout.tsx @@ -8,7 +8,7 @@ export const metadata: Metadata = { template: '%s | agentcrumbs', default: 'agentcrumbs', }, - description: 'Debug tracing for AI agents', + description: 'Debug mode for any agent.', }; export default function RootLayout({ children }: { children: ReactNode }) { diff --git a/docs/src/app/page.tsx b/docs/src/app/page.tsx index 50500c3..f09dd95 100644 --- a/docs/src/app/page.tsx +++ b/docs/src/app/page.tsx @@ -96,7 +96,7 @@ export default function HomePage() {
For Claude Code, Cursor, Copilot, and any agent
-

Debug mode for
AI agents

+

Debug mode for
any agent

Agents can read your code but they can't see what happened at runtime. agentcrumbs lets them drop structured traces inline while writing code, then query those traces when something breaks. Stripped before merge, zero cost when off.

diff --git a/packages/agentcrumbs/package.json b/packages/agentcrumbs/package.json index 4b85a4a..352e123 100644 --- a/packages/agentcrumbs/package.json +++ b/packages/agentcrumbs/package.json @@ -1,7 +1,7 @@ { "name": "agentcrumbs", "version": "0.2.0", - "description": "Debug tracing for agents. Drop crumbs, follow the trail.", + "description": "Debug mode for any agent.", "type": "module", "main": "./dist/index.js", "types": "./dist/index.d.ts",