Handle multiple updates to the same record in a transaction #3722
Conversation
…latest IndexWriter changes

# Conflicts:
#	fdb-record-layer-lucene/src/test/java/com/apple/foundationdb/record/lucene/LuceneIndexMaintenanceTest.java
/**
 * Try to find the document for the given record in the segment index.
 * This method would first try to find the document using teh existing reader. If it can't, it will refresh the reader
Suggested change:
- * This method would first try to find the document using teh existing reader. If it can't, it will refresh the reader
+ * This method would first try to find the document using the existing reader. If it can't, it will refresh the reader
Done.
 * writer may cache the changes in NRT and the reader (created earlier) can't see them. Refreshing the reader from the
 * writer can alleviate this. If the index can't find the document with the refreshed reader, null is returned.
 * Note that the refresh of the reader will do so at the {@link com.apple.foundationdb.record.lucene.directory.FDBDirectoryWrapper}
 * and so has impact on the entire directory.
Please consider rewording.
Done.
if (newReader != null) {
    // previous reader instantiated but then writer changed
    readersToClose.add(writerReader);
    writerReader = LazyCloseable.supply(() -> newReader);
What happens if this is touched concurrently?
Added a test that updates the readers concurrently.
I still feel like there is a race condition, but it's possible that it doesn't work out in practice.
We start with writerReader0:
- threadA calls openIfChanged, resulting in newReader1
- threadB changes the writer -- this may not be possible to do without also creating a new writerReader, given the way this method is used; your test, I believe, does cause getWriterReader to be called when it saves
- threadC calls openIfChanged, resulting in newReader2
- threadA adds writerReader0 to readersToClose
- threadC adds writerReader0 to readersToClose
- threadA sets writerReader to writerReader1 and returns it
- threadC sets writerReader to writerReader2 and returns it
In this case, none of the code here will close writerReader1. So it might be worthwhile to change it so that we add the created reader to readersToClose, and we don't close writerReader directly.
i.e. in the constructor, call:
readersToClose.add(writerReader);
and do so again here, with a local:
newWriterReader = LazyCloseable.supply(() -> newReader);
readersToClose.add(newWriterReader);
writerReader = newWriterReader;
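A minimal, self-contained sketch of the suggested ownership scheme (plain JDK types stand in for LazyCloseable and the Lucene readers; all names here are illustrative, not the actual record-layer API): every reader handle is registered in readersToClose at the moment it is created, so the close list, not the writerReader field, owns the readers. A lost publication race then only loses a pointer swap, never an unclosed reader.

```java
import java.io.Closeable;
import java.util.List;
import java.util.concurrent.CopyOnWriteArrayList;

// Toy stand-in for a reader: records whether it was closed.
final class StubReader implements Closeable {
    volatile boolean closed = false;
    @Override public void close() { closed = true; }
}

final class ReaderHolder implements Closeable {
    // Every handle ever created is added to this list at creation time,
    // so close() cannot miss a reader whose publication lost a race.
    private final List<StubReader> readersToClose = new CopyOnWriteArrayList<>();
    private volatile StubReader writerReader;

    ReaderHolder() {
        writerReader = new StubReader();
        readersToClose.add(writerReader);   // constructor registers the first handle
    }

    // Equivalent of the openIfChanged path: register the new handle
    // *before* publishing it; never close writerReader directly.
    StubReader refresh() {
        StubReader newWriterReader = new StubReader();
        readersToClose.add(newWriterReader);
        writerReader = newWriterReader;
        return newWriterReader;
    }

    @Override public void close() {
        for (StubReader r : readersToClose) {
            r.close();                      // closes winners and losers alike
        }
    }
}
```

Even if two threads refresh concurrently and one publication overwrites the other, both handles are already on the list, so both are closed when the holder is closed.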
Also, do you need to set writerReader to volatile?
Yeah, I think there is a possibility of a race condition. I believe you only need two threads making a change to the writer to expose this.
Done.
I don't think it is necessary to make the writerReader field volatile. In each thread, if the delete failed and the thread caused a new reader to be created, that new reader should have the changed document cached; if it does not, the document will not be visible in other threads either, so a stale cached value in a thread is acceptable in that situation.
The issue at hand is that when running multiple update operations in a single transaction, the partition's document counts and the PK-segment index may get into an inconsistent state. The root cause is that the first update in the transaction clears the doc from the Lucene index and the PK index. Since the changes are not flushed, the IndexWriter has them cached in the NRT cache. The second record update then does not find the record in the PK index (the segment has changed but the IndexReader does not yet reflect that), so the delete is skipped, including the update of the partition count. Note that the update does attempt a delete-by-query that actually removes the doc from the Lucene index, but since we cannot tell whether that delete matched anything, the partition count is not updated.
The solution is to refresh the DirectoryReader when doing an update, so that any previously written changes show up. The refresh operation uses DirectoryReader.openIfChanged, which is more resource-efficient than a brand-new open call.

Resolves #3704
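The failure mode described above can be modeled without Lucene. The toy sketch below (plain JDK, all names illustrative, not the Lucene API) mimics a writer whose buffered NRT changes are invisible to an already-open snapshot reader until a new snapshot is taken from the writer, which is why the second update in the transaction cannot find the document the first update rewrote:

```java
import java.util.HashMap;
import java.util.Map;

// Toy NRT model: the writer buffers changes; readers are point-in-time snapshots.
final class ToyWriter {
    private final Map<String, String> committed = new HashMap<>(); // flushed segments
    private final Map<String, String> buffered = new HashMap<>();  // NRT cache stand-in

    void update(String key, String value) { buffered.put(key, value); }
    void delete(String key) { buffered.put(key, null); }           // null marks a delete

    // Analogue of taking a fresh reader from the writer: a snapshot
    // of committed state with the buffered changes applied on top.
    ToyReader openReader() {
        Map<String, String> view = new HashMap<>(committed);
        for (Map.Entry<String, String> e : buffered.entrySet()) {
            if (e.getValue() == null) view.remove(e.getKey());
            else view.put(e.getKey(), e.getValue());
        }
        return new ToyReader(view);
    }
}

final class ToyReader {
    private final Map<String, String> view;
    ToyReader(Map<String, String> view) { this.view = view; }
    String find(String key) { return view.get(key); }  // null = "not found"
}
```

A reader opened before an in-transaction update keeps seeing the old state; only a reader re-opened from the writer (the role DirectoryReader.openIfChanged plays in the real fix, returning null when nothing changed) observes the buffered changes.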
refreshtheDirectoryReaderwhen doing an update, so that any previously written changes are showing up. The refresh operation usesDirectoryReader.openIfChangedthat is more efficient in resources than using a brand newopencall.Resolve #3704