improve: log diagnostic for remote deployment junit extension #3306
csviri wants to merge 4 commits into operator-framework:main
Conversation
Force-pushed ddc23d3 to 5bb595d
Pull request overview
Adds richer diagnostics when a cluster-deployed operator times out during deployment, to make remote E2E failures easier to debug.
Changes:
- Catch `KubernetesClientTimeoutException` during `waitUntilReady` and emit diagnostics before rethrowing.
- Add `logDiagnosticInfo(...)` to report deployments, pods, container statuses, related events, and recent pod logs on timeout.
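The overall shape of the change can be sketched as follows. This is a minimal, self-contained illustration of the catch-and-diagnose pattern, not the extension's real code: `DeployDiagnostics`, `DeploymentTimeoutException`, and the stand-in `waitUntilReady`/`logDiagnosticInfo` bodies are hypothetical; the real extension uses the fabric8 client's `KubernetesClientTimeoutException` and queries the cluster for the listed resources.

```java
// Sketch only: class and method bodies are stand-ins for the real
// extension code, which talks to a Kubernetes cluster via fabric8.
public class DeployDiagnostics {

  static class DeploymentTimeoutException extends RuntimeException {
    DeploymentTimeoutException(String msg) { super(msg); }
  }

  static final StringBuilder LOG = new StringBuilder();

  // Stand-in for the real waitUntilReady: always times out here.
  static void waitUntilReady() {
    throw new DeploymentTimeoutException("operator not ready after 60s");
  }

  // Stand-in for logDiagnosticInfo: the real method reports deployments,
  // pods, container statuses, related events, and recent pod logs.
  static void logDiagnosticInfo() {
    LOG.append("diagnostics: deployments, pods, events, logs\n");
  }

  static void deploy() {
    try {
      waitUntilReady();
    } catch (DeploymentTimeoutException e) {
      logDiagnosticInfo(); // emit diagnostics first...
      throw e;             // ...then rethrow so the deployment still fails loudly
    }
  }

  public static void main(String[] args) {
    try {
      deploy();
    } catch (DeploymentTimeoutException e) {
      System.out.println(LOG.toString().trim());
      System.out.println("rethrown: " + e.getMessage());
      // prints:
      // diagnostics: deployments, pods, events, logs
      // rethrown: operator not ready after 60s
    }
  }
}
```

The key point is the ordering: diagnostics are logged before the exception is rethrown, so a remote E2E failure report contains the cluster state at the moment of the timeout.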
```java
" Could not retrieve logs for pod '{}': {}",
pod.getMetadata().getName(),
logEx.getMessage());
```
When pod log retrieval fails, this only logs the exception message and drops the stack trace, which makes diagnosing client/auth/network issues harder. Consider logging the exception itself as the last argument (similar to the diagEx handling below) so the full cause is available when needed.
```diff
-" Could not retrieve logs for pod '{}': {}",
-pod.getMetadata().getName(),
-logEx.getMessage());
+" Could not retrieve logs for pod '{}'",
+pod.getMetadata().getName(),
+logEx);
```
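The reviewer's point rests on how loggers treat a `Throwable` argument: passing only `logEx.getMessage()` records one line, while passing the exception object preserves the full stack trace, including the cause chain. The difference can be shown with a small self-contained example (`LogDetail`, `messageOnly`, and `fullTrace` are illustrative names, not project code):

```java
import java.io.PrintWriter;
import java.io.StringWriter;

public class LogDetail {

  // What logging only getMessage() captures: the top-level message, no cause.
  static String messageOnly(Throwable t) {
    return String.valueOf(t.getMessage());
  }

  // What passing the Throwable to the logger preserves: the full stack
  // trace, including any "Caused by:" chain.
  static String fullTrace(Throwable t) {
    StringWriter sw = new StringWriter();
    t.printStackTrace(new PrintWriter(sw));
    return sw.toString();
  }

  public static void main(String[] args) {
    Exception cause = new IllegalStateException("connection reset");
    Exception logEx = new RuntimeException("failed to fetch pod logs", cause);
    System.out.println(messageOnly(logEx).contains("connection reset")); // prints: false
    System.out.println(fullTrace(logEx).contains("connection reset"));   // prints: true
  }
}
```

With SLF4J specifically, a `Throwable` passed as the last argument after the placeholders is logged with its stack trace rather than substituted into a `{}`, which is why the suggested change also drops the second placeholder.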
Now it fails; this seems to be an issue I also observed with the operations sample PR: it seems the CRD is not applied in some cases, will investigate further. This might actually be a junit extension issue - maybe with a newer version of junit? - since we never observed this before. Or maybe related to your recent changes @xstefank?

@csviri there is only one test in TomcatOperatorE2ETest, so I don't think deletion of the CRD is the issue.
Will merge.

No, not the deletion; I just thought it might ring a bell, nvm.
I will do a nicer opt-out from that feature for E2Es |
xstefank left a comment
@csviri all tests passed with the CRD being deleted after the test class is executed in my PR, so I don't understand why you think it caused failures in this PR. Local and cluster tests run separately, so they should both restart the operator and redeploy the CRD. I don't think leaving some CRDs behind, as proposed in this PR, is a good idea.
@xstefank they are running against the same cluster; there is probably an issue with the cluster test runner regarding the CRD, I will take a look at that later - for now this fixes the issue. Also, this API to turn off CRD deletion might help others too. Will continue on this in this PR.
@xstefank btw, it would be great to handle the deletion of CRDs uniformly, also for ClusterDeployedOperatorExtension, by implementing it in AbstractOperatorExtension - would you care to create a PR for that?
@xstefank I addressed the CRD issue we discussed in a separate PR. Removed the deletion opt-out, but left the option there for the future, in case someone needs it.
Signed-off-by: Attila Mészáros <a_meszaros@apple.com>
The delete-CRD flag I will move to a separate PR.
also adds an option to not delete CRDs
Signed-off-by: Attila Mészáros <a_meszaros@apple.com>