Skip to content

DAOS-18674 chk: CHK engine talks with local control plane#17698

Draft
Nasf-Fan wants to merge 1 commit intoNasf-Fan/DAOS-18361_1from
Nasf-Fan/DAOS-18674_1
Draft

DAOS-18674 chk: CHK engine talks with local control plane#17698
Nasf-Fan wants to merge 1 commit intoNasf-Fan/DAOS-18361_1from
Nasf-Fan/DAOS-18674_1

Conversation

@Nasf-Fan
Copy link
Contributor

CHK engine will report interaction and repair result to the local control plane on its own node via dRPC (upcall) instead of to CHK leader. Correspondingly, local control plane will downcall to CHK engine to trigger CHK repair.

Cleanup related useless logic.

Test-tag: recovery

Steps for the author:

  • Commit message follows the guidelines.
  • Appropriate Features or Test-tag pragmas were used.
  • Appropriate Functional Test Stages were run.
  • At least two positive code reviews including at least one code owner from each category referenced in the PR.
  • Testing is complete. If necessary, forced-landing label added and a reason added in a comment.

After all prior steps are complete:

  • Gatekeeper requested (daos-gatekeeper added as a reviewer).

@github-actions
Copy link

Ticket title is 'Enhance CHK upcall to handle MS leader switch'
Status is 'In Progress'
https://daosio.atlassian.net/browse/DAOS-18674

@Nasf-Fan Nasf-Fan force-pushed the Nasf-Fan/DAOS-18674_1 branch 3 times, most recently from c4e6de4 to 0e6ee19 Compare March 13, 2026 06:24
@daosbuild3
Copy link
Collaborator

Test stage Functional Hardware Medium Verbs Provider MD on SSD completed with status FAILURE. https://jenkins-3.daos.hpc.amslabs.hpecorp.net//job/daos-stack/job/daos/view/change-requests/job/PR-17698/4/execution/node/1322/log

@daosbuild3
Copy link
Collaborator

Test stage Functional Hardware Medium MD on SSD completed with status FAILURE. https://jenkins-3.daos.hpc.amslabs.hpecorp.net//job/daos-stack/job/daos/view/change-requests/job/PR-17698/4/execution/node/1312/log

@Nasf-Fan Nasf-Fan force-pushed the Nasf-Fan/DAOS-18674_1 branch from 0e6ee19 to 70446c8 Compare March 13, 2026 15:08
CHK engine will report interaction and repair result to the local
control plane on its own node via dRPC (upcall) instead of to CHK
leader. Correspondingly, local control plane will downcall to CHK
engine to trigger CHK repair.

Cleanup related useless logic.

Test-tag: recovery

Signed-off-by: Fan Yong <fan.yong@hpe.com>
@Nasf-Fan Nasf-Fan force-pushed the Nasf-Fan/DAOS-18674_1 branch from 70446c8 to bce615d Compare March 13, 2026 15:12
@daosbuild3
Copy link
Collaborator

Test stage Functional Hardware Medium Verbs Provider MD on SSD completed with status FAILURE. https://jenkins-3.daos.hpc.amslabs.hpecorp.net//job/daos-stack/job/daos/view/change-requests/job/PR-17698/6/execution/node/1312/log

Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment

Labels

None yet

Development

Successfully merging this pull request may close these issues.

2 participants