sql/backfill: report panics in backfill goroutines by rafiss · Pull Request #169062 · cockroachdb/cockroach

rafiss · 2026-04-24T15:33:27Z

The index backfiller and the MVCC index merger each spawn goroutines that, until now, had no panic recovery. A panic inside the goroutine spawned by indexBackfiller.Run (or one re-thrown from a ctxgroup worker by g.Wait) would tear down the SQL pod with no Sentry report and no CRDB-formatted log entry — only the Go runtime's bare stderr dump.

Switch the indexBackfiller goroutine to stopper.RunAsyncTaskEx so that the stopper's recover wrapper reports the panic to Sentry before re-panicking. The MVCC index merger keeps its bare goroutine because Run depends on g.Wait returning before the deferred memory monitor cleanup runs (a refused stopper task would force an early return with workers still using the bound account); instead it gets a
defer logcrash.RecoverAndReportPanic so the same Sentry visibility applies.

Both changes are defense in depth: the SQL pod still crashes after a panic in either path, but now the crash is observable instead of silent.

Informs: #169059
Epic: none

Release note: None

The index backfiller and the MVCC index merger each spawn goroutines that, until now, had no panic recovery. A panic inside the goroutine spawned by indexBackfiller.Run (or one re-thrown from a ctxgroup worker by g.Wait) would tear down the SQL pod with no Sentry report and no CRDB-formatted log entry — only the Go runtime's bare stderr dump. Switch the indexBackfiller goroutine to stopper.RunAsyncTaskEx so that the stopper's recover wrapper reports the panic to Sentry before re-panicking. The MVCC index merger keeps its bare goroutine because Run depends on g.Wait returning before the deferred memory monitor cleanup runs (a refused stopper task would force an early return with workers still using the bound account); instead it gets a defer logcrash.RecoverAndReportPanic so the same Sentry visibility applies. Both changes are defense in depth: the SQL pod still crashes after a panic in either path, but now the crash is observable instead of silent. Informs: cockroachdb#169059 Epic: none Release note: None Co-Authored-By: roachdev-claude <roachdev-claude-bot@cockroachlabs.com>

blathers-crl · 2026-04-24T15:33:31Z

It looks like your PR touches production code but doesn't add or edit any test code. Did you consider adding tests to your PR?

_{🦉 Hoot! I am a Blathers, a bot for CockroachDB. My owner is dev-inf.}

trunk-io · 2026-04-24T15:33:31Z

😎 Merged successfully - details.

cockroach-teamcity · 2026-04-24T15:33:40Z

This change is

spilchen

nice find

@spilchen made 2 comments.
Reviewable status: complete! 1 of 0 LGTMs obtained (waiting on mw5h and rafiss).

pkg/sql/rowexec/indexbackfiller.go line 578 at r1 (raw file):

	// we loop over progCh, which is closed only after the goroutine returns.
	if startErr := ib.flowCtx.Stopper().RunAsyncTaskEx(ctx, stop.TaskOpts{
		TaskName: "indexBackfiller-runBackfill",

nit: we don't typically use camel case for task names. I'm fine if you want to ignore this, I just thought it looked a bit odd.

rafiss

TFTR!

/trunk merge

@rafiss made 2 comments.
Reviewable status: complete! 1 of 0 LGTMs obtained (waiting on mw5h and spilchen).

pkg/sql/rowexec/indexbackfiller.go line 578 at r1 (raw file):

Previously, spilchen wrote…

nit: we don't typically use camel case for task names. I'm fine if you want to ignore this, I just thought it looked a bit odd.

we use camel case (and also a few other non-standard formats) for this in a few other places:

cockroach/pkg/backup/backup_processor.go

Line 210 in 02c2c76

TaskName: "backupDataProcessor.runBackupProcessor",

cockroach/pkg/backup/generative_split_and_scatter_processor.go

Line 366 in 06ecabb

TaskName: "generativeSplitAndScatter-worker",

cockroach/pkg/kv/kvclient/kvcoord/txn_interceptor_committer.go

Line 517 in 4f02123

TaskName: "txnCommitter: making txn commit explicit",

since there's no unified convention, i'll keep this as is

blathers-crl · 2026-04-27T16:39:47Z

Encountered an error creating backports. Some common things that can go wrong:

The backport branch might have already existed.
There was a merge conflict.
The backport branch contained merge commits.

You might need to create your backport manually using the backport tool.

merge conflict cherry-picking 5fdf1a2 to blathers/backport-release-25.4-169062

Backport to branch 25.4.x failed. See errors above.

merge conflict cherry-picking 5fdf1a2 to blathers/backport-release-26.1-169062

Backport to branch 26.1.x failed. See errors above.

merge conflict cherry-picking 5fdf1a2 to blathers/backport-release-26.2-169062

Backport to branch 26.2.x failed. See errors above.

_{🦉 Hoot! I am a Blathers, a bot for CockroachDB. My owner is dev-inf.}

rafiss requested review from mw5h and spilchen April 24, 2026 15:33

rafiss requested a review from a team as a code owner April 24, 2026 15:33

rafiss added backport-25.4.x Flags PRs that need to be backported to 25.4 backport-26.1.x Flags PRs that need to be backported to 26.1 backport-26.2.x Flags PRs that need to be backported to 26.2 labels Apr 24, 2026

spilchen approved these changes Apr 24, 2026

View reviewed changes

rafiss commented Apr 27, 2026

View reviewed changes

trunk-io Bot merged commit ac091d7 into cockroachdb:master Apr 27, 2026
29 checks passed

celeste-cockroachdb Bot added the target-release-26.3.0 label Apr 27, 2026

blathers-crl Bot added the backport-failed label Apr 27, 2026

This was referenced Apr 28, 2026

release-26.2: sql/backfill: report panics in backfill goroutines #169206

Merged

release-26.1: sql/backfill: report panics in backfill goroutines #169207

Merged

release-25.4: sql/backfill: report panics in backfill goroutines #169209

Merged

rafiss deleted the backfill-handle-panic branch May 1, 2026 20:57

rafiss removed the backport-failed label May 1, 2026

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

sql/backfill: report panics in backfill goroutines#169062

sql/backfill: report panics in backfill goroutines#169062
trunk-io[bot] merged 1 commit intocockroachdb:masterfrom
rafiss:backfill-handle-panic

rafiss commented Apr 24, 2026

Uh oh!

blathers-crl Bot commented Apr 24, 2026

Uh oh!

trunk-io Bot commented Apr 24, 2026 •

edited

Loading

Uh oh!

cockroach-teamcity commented Apr 24, 2026

Uh oh!

spilchen left a comment

Uh oh!

rafiss left a comment

Uh oh!

Uh oh!

blathers-crl Bot commented Apr 27, 2026

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

3 participants

Conversation

rafiss commented Apr 24, 2026

Uh oh!

blathers-crl Bot commented Apr 24, 2026

Uh oh!

trunk-io Bot commented Apr 24, 2026 • edited Loading Uh oh! There was an error while loading. Please reload this page.

Uh oh!

Uh oh!

cockroach-teamcity commented Apr 24, 2026

Uh oh!

spilchen left a comment

Choose a reason for hiding this comment

Uh oh!

rafiss left a comment

Choose a reason for hiding this comment

Uh oh!

Uh oh!

blathers-crl Bot commented Apr 27, 2026

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

3 participants

trunk-io Bot commented Apr 24, 2026 •

edited

Loading