[fix][ml] Track all pending read callbacks for timeouts by Technoboy- · Pull Request #26081 · apache/pulsar

Technoboy- · 2026-06-24T08:44:26Z

Motivation

When managed ledger read-entry timeout is enabled, ManagedLedgerImpl only keeps the most recent ReadEntryCallbackWrapper in lastReadCallback. If multiple reads are pending and an older read hangs, a newer read can overwrite that callback, so the older operation is no longer checked by checkReadTimeout().

That can leave a cursor read pending indefinitely and block follow-up cursor operations such as reset/mark-delete progress.

Modifications

Replace the single lastReadCallback with a pending-read callback map keyed by read operation id.
Remove callbacks from the pending map when they are recycled by success, failure, or timeout paths.
Iterate all pending callbacks during read-timeout checks so each timed-out read can fail independently.
Add a regression test covering concurrent read-entry timeouts.

Verifying this change

./gradlew :managed-ledger:test --tests org.apache.bookkeeper.mledger.impl.ManagedLedgerTest.testManagedLedgerWithReadEntryTimeOut --tests org.apache.bookkeeper.mledger.impl.ManagedLedgerTest.testManagedLedgerWithConcurrentReadEntryTimeOut

void-ptr974 · 2026-06-25T15:27:24Z

I think the current priority-queue approach can still retain too much state after reads complete.

The wrapper is added before entryCache.asyncReadEntry(...), so cache hits are also inserted into the timeout queue. On normal completion, the callback reference is cleared, but the queue node remains until its timeout deadline is polled. With read-entry timeout enabled, the queue size can become proportional to read rate * timeout seconds, rather than the number of reads that are actually still pending.

A bucketed timeout structure may be a better fit here: group reads by timeout bucket, and keep an inner map from readOpCount to callback. The wrapper can keep a direct reference to its bucket, so normal completion removes itself in average O(1), while timeout checks only process expired buckets.

// bucketId -> (readOpCount -> callback wrapper)
private final ConcurrentLongHashMap<ConcurrentLongHashMap<ReadEntryCallbackWrapper>> readTimeoutBuckets =
        ConcurrentLongHashMap.<ConcurrentLongHashMap<ReadEntryCallbackWrapper>>newBuilder().build();

static final class ReadEntryCallbackWrapper implements ReadEntryCallback, ReadEntriesCallback {
    volatile ConcurrentLongHashMap<ReadEntryCallbackWrapper> timeoutBucket;
}

Fix managed ledger read timeout tracking

5a5e79f

Technoboy- marked this pull request as ready for review June 24, 2026 10:02

Technoboy- self-assigned this Jun 24, 2026

lhotari reviewed Jun 24, 2026

View reviewed changes

Comment thread managed-ledger/src/main/java/org/apache/bookkeeper/mledger/impl/ManagedLedgerImpl.java Outdated

Use priority queue for read timeouts

9ba177d

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Uh oh!

[fix][ml] Track all pending read callbacks for timeouts#26081

[fix][ml] Track all pending read callbacks for timeouts#26081
Technoboy- wants to merge 2 commits into
apache:masterfrom
Technoboy-:codex/fix-managed-ledger-read-timeout-tracking

Technoboy- commented Jun 24, 2026

Uh oh!

Uh oh!

void-ptr974 commented Jun 25, 2026

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

4 participants

Uh oh!

Conversation

Technoboy- commented Jun 24, 2026

Motivation

Modifications

Verifying this change

Uh oh!

Uh oh!

void-ptr974 commented Jun 25, 2026

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

4 participants