NO-JIRA: Fix interval filtering bug in watchnodes tests#31320
Conversation
|
Pipeline controller notification For optional jobs, comment This repository is configured in: automatic mode |
|
@mdbooth: This pull request explicitly references no jira issue. DetailsIn response to this:
Instructions for interacting with me using PR comments are available here. If you have questions or suggestions related to my behavior, please file an issue against the openshift-eng/jira-lifecycle-plugin repository. |
WalkthroughThe ChangesInterval Overlap Predicate Fix with Grace Period Parametrization
Estimated code review effort🎯 2 (Simple) | ⏱️ ~12 minutes Possibly related PRs
Suggested reviewers
🚥 Pre-merge checks | ✅ 14 | ❌ 1❌ Failed checks (1 warning)
✅ Passed checks (14 passed)
✏️ Tip: You can configure your own custom pre-merge checks in the settings. ✨ Finishing Touches🧪 Generate unit tests (beta)
Comment |
|
[APPROVALNOTIFIER] This PR is NOT APPROVED This pull-request has been approved by: mdbooth The full list of commands accepted by this bot can be found here. DetailsNeeds approval from an approver in each of these files:Approvers can indicate their approval by writing |
d846c68 to
18a4f0a
Compare
|
Unit test flaked because it couldn't reach google.com. /test unit |
|
Scheduling required tests: |
|
Confirmed e2e-gcp-ovn-upgrade failed due to #31319 |
18a4f0a to
c9c4c4d
Compare
|
/pipeline auto |
|
The |
|
Scheduling required tests: |
|
e2e-gcp-ovn-upgrade failed due to a timing issue: the network CO set Progressing=False 8 seconds before the last node became NodeReady=false. This looks like a bug in CNO. Possible fixes:
Preference is for the former. |
|
/test e2e-vsphere-ovn e2e-aws-ovn-fips |
|
/testwith openshift/origin/main/e2e-gcp-ovn-upgrade openshift/cluster-network-operator#3034 |
A bug in intervalStartDuring meant that it would always return true if the target had event had ever occurred, not just within a specified window.
c9c4c4d to
e3a4331
Compare
|
@mdbooth: The following tests failed, say
Full PR test history. Your PR dashboard. DetailsInstructions for interacting with me using PR comments are available here. If you have questions or suggestions related to my behavior, please file an issue against the kubernetes-sigs/prow repository. I understand the commands that are listed here. |
A bug in intervalStartDuring meant that it would always return true if the target had event had ever occurred, not just within a specified window.
The test adds a 30s grace period to the window where the network ClusterOperator reports Progressing=True. This is for 2 reasons:
Firstly, CNO reports Progressing=False before the last pod becomes available, so that Node will always be NodeReady=False outside the target window. openshift/cluster-network-operator#3034 is a proposed fix for this.
Secondly, even if the CNO fix is applied, without any grace period this test would be likely to flake under load if kubelet does not update NodeReady fast enough.
Summary by CodeRabbit
Bug Fixes
Tests