OCPBUGS-90086: [release-4.22] clear stale EtcdRecoveryActive failure condition when etcd is healthy#8806
Conversation
|
Pipeline controller notification For optional jobs, comment This repository is configured in: LGTM mode |
|
@vsolanki12: This pull request references Jira Issue OCPBUGS-90086, which is invalid:
Comment The bug has been updated to refer to the pull request using the external bug tracker. DetailsIn response to this:
Instructions for interacting with me using PR comments are available here. If you have questions or suggestions related to my behavior, please file an issue against the openshift-eng/jira-lifecycle-plugin repository. |
|
Skipping CI for Draft Pull Request. |
|
[APPROVALNOTIFIER] This PR is NOT APPROVED This pull-request has been approved by: vsolanki12 The full list of commands accepted by this bot can be found here. DetailsNeeds approval from an approver in each of these files:Approvers can indicate their approval by writing |
|
Important Review skippedAuto reviews are disabled on base/target branches other than the default branch. Please check the settings in the CodeRabbit UI or the ⚙️ Run configurationConfiguration used: Repository YAML (base), Central YAML (inherited) Review profile: CHILL Plan: Enterprise Run ID: You can disable this status message by setting the Use the checkbox below for a quick retry:
✨ Finishing Touches🧪 Generate unit tests (beta)
Thanks for using CodeRabbit! It's free for OSS, and your support helps us grow. If you like it, consider giving us a shout-out. Comment |
Codecov Report❌ Patch coverage is
Additional details and impacted files@@ Coverage Diff @@
## release-4.22 #8806 +/- ##
================================================
+ Coverage 35.45% 35.59% +0.14%
================================================
Files 767 767
Lines 93724 93771 +47
================================================
+ Hits 33226 33381 +155
+ Misses 57785 57673 -112
- Partials 2713 2717 +4
🚀 New features to boost your workflow:
|
|
/uncc |
|
/jira refresh |
|
@vsolanki12: This pull request references Jira Issue OCPBUGS-90086, which is invalid:
Comment DetailsIn response to this:
Instructions for interacting with me using PR comments are available here. If you have questions or suggestions related to my behavior, please file an issue against the openshift-eng/jira-lifecycle-plugin repository. |
|
/jira refresh |
|
@vsolanki12: This pull request references Jira Issue OCPBUGS-90086, which is invalid:
Comment DetailsIn response to this:
Instructions for interacting with me using PR comments are available here. If you have questions or suggestions related to my behavior, please file an issue against the openshift-eng/jira-lifecycle-plugin repository. |
|
/cc @bryan-cox @sdminonne |
75a3a26 to
821f069
Compare
…when etcd is healthy When an etcd recovery job fails and etcd later recovers on its own, the EtcdRecoveryActive condition retains the stale failure reason and message. The console renders this as a red error icon even though the cluster is healthy (Available=True, Degraded=False, EtcdAvailable=True). Clear the stale EtcdRecoveryActive condition by resetting its reason and message when etcd quorum is available and no recovery job is running. Add unit tests covering all condition state transitions. Signed-off-by: Vimal Solanki <vsolanki@redhat.com>
821f069 to
dfc89c3
Compare
|
@vsolanki12: all tests passed! Full PR test history. Your PR dashboard. DetailsInstructions for interacting with me using PR comments are available here. If you have questions or suggestions related to my behavior, please file an issue against the kubernetes-sigs/prow repository. I understand the commands that are listed here. |
Summary
Backport of #8406 to release-4.22.
EtcdRecoveryJobFailedcan transition toAsExpectedwhen both useStatus=FalseEtcdRecoveryJobFailedcondition when etcd is fully healthy and no pods are failingCherry-pick required manual conflict resolution because #8309 refactored the monolithic function on main but was not backported to release branches.
Original PR
JIRA