fix: Retry creation of multiplexed session #4288

sakthivelmanii · 2026-01-06T08:11:56Z

Description:

Currently, when multiplexed session creation fails with any error, we store the exception in the session reference and re-throw that error to all subsequent requests. This causes the library to stall, since no further requests will be processed successfully. It is generally expected that any RPC request can fail because of CPU, network, GFE, or other factors.

Proposed solution:
We will retry creation of the multiplexed session for the duration of waitForMinSessions. If a client has cold start issues (higher CPU/memory usage during initial start), they can increase the waitForMinSessions time to give the client longer to retry.
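
As a rough illustration of the idea (not the code in this PR), a bounded retry loop could look like the sketch below. The class name `RetryUntilDeadline`, the `call` method, and the fixed pause between attempts are all hypothetical; the real change lives in MultiplexedSessionDatabaseClient and may differ in every detail.

```java
import java.time.Duration;
import java.util.function.Supplier;

// Hypothetical helper for illustration only; not part of the Spanner client library.
final class RetryUntilDeadline {
  private RetryUntilDeadline() {}

  /**
   * Calls {@code action} until it succeeds or the wait budget {@code maxWait} is spent.
   * When the budget runs out, the last failure is re-thrown to the caller.
   */
  static <T> T call(Supplier<T> action, Duration maxWait, Duration pause) {
    final long deadlineNanos = System.nanoTime() + maxWait.toNanos();
    while (true) {
      try {
        return action.get();
      } catch (RuntimeException e) {
        // Give up if there is no room left for another pause before the deadline.
        if (deadlineNanos - System.nanoTime() <= pause.toNanos()) {
          throw e;
        }
        try {
          Thread.sleep(pause.toMillis());
        } catch (InterruptedException interrupted) {
          // Restore the interrupt flag and surface the last failure.
          Thread.currentThread().interrupt();
          throw e;
        }
      }
    }
  }
}
```

In this sketch, `maxWait` plays the role of the waitForMinSessions duration: a larger value gives a cold-starting client more attempts before the failure is surfaced.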

gemini-code-assist · 2026-01-06T08:12:16Z

Summary of Changes

Hello @sakthivelmanii, I'm Gemini Code Assist¹! I'm currently reviewing this pull request and will post my feedback shortly. In the meantime, here's a summary to help you and other reviewers quickly get up to speed!

This pull request addresses a critical reliability issue in the Spanner client library where transient failures during the initial creation of multiplexed sessions could lead to a complete stall. By introducing a retry mechanism for specific transient error codes (DEADLINE_EXCEEDED, RESOURCE_EXHAUSTED, UNAVAILABLE) during the waitForMinSessions period, the client library becomes more resilient to temporary network or resource issues. This change significantly improves the robustness of the Spanner client, especially during application startup or under fluctuating load conditions, ensuring that operations can proceed after transient hiccups.

Highlights

Multiplexed Session Creation Retry Logic: Implemented a retry mechanism for the creation of multiplexed sessions in the Spanner client library. This prevents the library from stalling indefinitely if initial session creation fails due to transient errors.
Configurable Retry Duration: The retry duration for multiplexed session creation is now configurable via the waitForMinSessions option in SessionPoolOptions, allowing users to adjust for cold start issues or other transient network conditions.
Specific Retryable Error Codes: Retries are specifically triggered for DEADLINE_EXCEEDED, RESOURCE_EXHAUSTED, and UNAVAILABLE error codes, ensuring that only transient, retryable failures lead to retries, while permanent errors are propagated immediately.
Enhanced Test Coverage: Added comprehensive unit tests to validate the new retry logic under various scenarios, including successful retries, non-retryable exceptions, and different delay configurations.

¹ Review the Privacy Notices, Generative AI Prohibited Use Policy, Terms of Service, and learn how to configure Gemini Code Assist in GitHub here. Gemini can make mistakes, so double check it and use code with caution.

gemini-code-assist

Code Review

This pull request introduces a retry mechanism for creating multiplexed sessions, which is a great improvement for handling transient errors during client startup. The changes are well-structured, and the new tests cover the retry logic effectively.

I have a few suggestions to further enhance the implementation:

The retry loop in maybeWaitForSessionCreation can be made more efficient by exiting immediately on success or non-retryable errors.
Using an EnumSet for RETRYABLE_ERROR_CODES would be more performant.
The new test cases in MultiplexedSessionDatabaseClientMockServerTest have some code duplication that could be refactored into a helper method for better maintainability.
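
To illustrate the EnumSet suggestion, here is a minimal sketch. It uses a local stand-in enum named `Code` rather than the library's own error-code type, and `isRetryable` is a hypothetical helper; only the three retryable codes come from the summary above.

```java
import java.util.Collections;
import java.util.EnumSet;
import java.util.Set;

// Illustration only: Code is a local stand-in, not the library's error-code enum.
final class RetryableCodes {
  private RetryableCodes() {}

  enum Code {
    OK,
    DEADLINE_EXCEEDED,
    RESOURCE_EXHAUSTED,
    UNAVAILABLE,
    PERMISSION_DENIED,
    NOT_FOUND
  }

  // EnumSet stores membership as a bit vector, so contains() is a cheap bit test.
  static final Set<Code> RETRYABLE_ERROR_CODES =
      Collections.unmodifiableSet(
          EnumSet.of(Code.DEADLINE_EXCEEDED, Code.RESOURCE_EXHAUSTED, Code.UNAVAILABLE));

  // Hypothetical helper: true only for the transient codes listed above.
  static boolean isRetryable(Code code) {
    return RETRYABLE_ERROR_CODES.contains(code);
  }
}
```

With only three codes the performance difference from a HashSet is likely negligible in practice; the EnumSet mainly makes the intent explicit.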

Overall, this is a solid contribution that improves the robustness of the client library.

...e-cloud-spanner/src/main/java/com/google/cloud/spanner/MultiplexedSessionDatabaseClient.java

gemini-code-assist · 2026-01-06T08:15:03Z

...r/src/test/java/com/google/cloud/spanner/MultiplexedSessionDatabaseClientMockServerTest.java

+    Spanner testSpanner =
+        SpannerOptions.newBuilder()
+            .setProjectId("test-project")
+            .setChannelProvider(channelProvider)
+            .setCredentials(NoCredentials.getInstance())
+            .setSessionPoolOption(
+                SessionPoolOptions.newBuilder()
+                    .setUseMultiplexedSession(true)
+                    .setUseMultiplexedSessionForRW(true)
+                    .setUseMultiplexedSessionPartitionedOps(true)
+                    .setWaitForMinSessionsDuration(Duration.ofSeconds(1))
+                    .setFailOnSessionLeak()
+                    .build())
+            .build()
+            .getService();


There's significant code duplication across the new test cases for creating Spanner instances with different SessionPoolOptions. To improve maintainability and reduce boilerplate, consider extracting this logic into a private helper method.

For example:

private Spanner createTestSpanner(SessionPoolOptions sessionPoolOptions) {
  return SpannerOptions.newBuilder()
      .setProjectId("test-project")
      .setChannelProvider(channelProvider)
      .setCredentials(NoCredentials.getInstance())
      .setSessionPoolOption(sessionPoolOptions)
      .build()
      .getService();
}

This helper could then be called from each test, passing in the specific SessionPoolOptions required for that test case. This would make the tests cleaner and easier to read.

sakthivelmanii · 2026-01-06T15:45:57Z

/gemini review

gemini-code-assist

Code Review

This pull request introduces a valuable retry mechanism for creating multiplexed sessions, making the Spanner client more resilient to transient failures, particularly during cold starts. The implementation correctly extracts the session creation logic for reuse within a new retry loop in maybeWaitForSessionCreation, and the accompanying tests are comprehensive. I have one suggestion to improve the clarity of an error message in the case of a timeout, but otherwise, this is a solid and well-executed enhancement.

...e-cloud-spanner/src/main/java/com/google/cloud/spanner/MultiplexedSessionDatabaseClient.java

olavloite · 2026-01-07T08:53:32Z

...r/src/test/java/com/google/cloud/spanner/MultiplexedSessionDatabaseClientMockServerTest.java

+    mockSpanner.setCreateSessionExecutionTime(
+        SimulatedExecutionTime.ofMinimumAndRandomTimeAndExceptions(
+            600,
+            0,
+            Arrays.asList(
+                Status.DEADLINE_EXCEEDED
+                    .withDescription(
+                        "CallOptions deadline exceeded after 22.986872393s. "
+                            + "Name resolution delay 6.911918521 seconds. [closed=[], "
+                            + "open=[[connecting_and_lb_delay=32445014148ns, was_still_waiting]]]")
+                    .asRuntimeException(),
+                Status.DEADLINE_EXCEEDED
+                    .withDescription(
+                        "CallOptions deadline exceeded after 22.986872393s. "
+                            + "Name resolution delay 6.911918521 seconds. [closed=[], "
+                            + "open=[[connecting_and_lb_delay=32445014148ns, was_still_waiting]]]")
+                    .asRuntimeException())));


If I understand this correctly, then it will wait for 600ms and then return a DEADLINE_EXCEEDED error based on the exceptions. This will again trigger a retry, which will time out. Can we:

Lower the wait time to keep the tests as quick as possible

Use a different retryable error code for the exceptions to show that the timeout error is really coming from the retry and not the exceptions that are being returned
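
The second point can be sketched without the mock server. Everything below is hypothetical (`TimeoutOriginSketch`, `CodedException`, `retryUntil`), and it assumes the retry loop reports DEADLINE_EXCEEDED when its wait budget runs out. Because the injected failures use UNAVAILABLE, a DEADLINE_EXCEEDED seen by the test can only have come from the loop itself.

```java
import java.time.Duration;
import java.util.function.Supplier;

// Illustration only: a stand-in retry loop, not the client library's implementation.
final class TimeoutOriginSketch {
  private TimeoutOriginSketch() {}

  enum Code {
    DEADLINE_EXCEEDED,
    UNAVAILABLE
  }

  static final class CodedException extends RuntimeException {
    final Code code;

    CodedException(Code code, String message, Throwable cause) {
      super(message, cause);
      this.code = code;
    }
  }

  /** Retries UNAVAILABLE failures until maxWait elapses, then throws DEADLINE_EXCEEDED. */
  static <T> T retryUntil(Supplier<T> action, Duration maxWait, Duration pause) {
    final long deadlineNanos = System.nanoTime() + maxWait.toNanos();
    CodedException last = null;
    while (deadlineNanos - System.nanoTime() > 0) {
      try {
        return action.get();
      } catch (CodedException e) {
        if (e.code != Code.UNAVAILABLE) {
          throw e; // non-retryable in this sketch: propagate immediately
        }
        last = e;
      }
      try {
        Thread.sleep(pause.toMillis());
      } catch (InterruptedException interrupted) {
        Thread.currentThread().interrupt();
        break;
      }
    }
    // The timeout is raised by the loop; the injected error is kept as the cause.
    throw new CodedException(
        Code.DEADLINE_EXCEEDED, "Timed out waiting for session creation", last);
  }
}
```

A test built this way can assert on the outer code (DEADLINE_EXCEEDED) and on the cause (UNAVAILABLE) separately, and a short `maxWait` keeps it fast.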

olavloite · 2026-01-07T08:54:30Z

...r/src/test/java/com/google/cloud/spanner/MultiplexedSessionDatabaseClientMockServerTest.java

+  public void testRetryWithDelayInExceptionWithInSessionCreationWaitTime() {
+    mockSpanner.setCreateSessionExecutionTime(
+        SimulatedExecutionTime.ofMinimumAndRandomTimeAndExceptions(
+            200,


Can we lower this value to keep the tests as fast as possible?

sakthivelmanii requested review from a team as code owners January 6, 2026 08:11

product-auto-label bot added size: l Pull request size is large. api: spanner Issues related to the googleapis/java-spanner API. labels Jan 6, 2026

sakthivelmanii changed the title ~~fix(spanner): Retry creation of multiplexed session~~ fix: Retry creation of multiplexed session Jan 6, 2026

gemini-code-assist bot reviewed Jan 6, 2026

product-auto-label bot added size: m Pull request size is medium. and removed size: l Pull request size is large. labels Jan 6, 2026

sakthivelmanii force-pushed the retry_multiplexed_session_till_session_wait_time branch from f86aa85 to 9f585a8 Compare January 6, 2026 08:58

product-auto-label bot added size: l Pull request size is large. and removed size: m Pull request size is medium. labels Jan 6, 2026

sakthivelmanii force-pushed the retry_multiplexed_session_till_session_wait_time branch 4 times, most recently from 04eab47 to f9260d1 Compare January 6, 2026 11:37

gemini-code-assist bot reviewed Jan 6, 2026

sakthivelmanii force-pushed the retry_multiplexed_session_till_session_wait_time branch 6 times, most recently from eaab0fb to 8fea7db Compare January 6, 2026 17:15

fix(spanner): Retry creation of multiplexed session

26ddd73

sakthivelmanii force-pushed the retry_multiplexed_session_till_session_wait_time branch from 8fea7db to 26ddd73 Compare January 6, 2026 17:30

olavloite reviewed Jan 7, 2026

Reduce wait time to run tests faster

103b5bf

olavloite approved these changes Jan 7, 2026

sakthivelmanii merged commit 735e29e into main Jan 7, 2026
60 of 62 checks passed

sakthivelmanii deleted the retry_multiplexed_session_till_session_wait_time branch January 7, 2026 15:10

release-please bot mentioned this pull request Jan 7, 2026

chore(main): release 6.106.0 #4286

Merged
