Fix eigenvalue monitor logging #7987
💡 Codex Review
Here are some automated review suggestions for this pull request.
Reviewed commit: b858ce5ea4
    - name: Bump version
      run: |
        python release/bump_patch_version.py --current_version ${{ env.RELEASE_VERSION }}
Revalidate master version before bumping patch
In the updated release workflow, version validation runs on the release tag before master is checked out, but the bump step still writes version.txt from ${{ env.RELEASE_VERSION }}. If master has advanced (for example, it already contains a higher version) by the time Checkout master for version bump runs, python release/bump_patch_version.py --current_version ${{ env.RELEASE_VERSION }} can overwrite master with a lower next patch version and open a downgrade PR. Add a validation step after checking out master (or derive the bump from master's current version.txt) before creating the PR.
Addressed in be529dc3.
After checking out master, the workflow now reads MASTER_VERSION from master's current version.txt and uses that value for the post-release patch bump. This avoids creating a downgrade PR if master has advanced beyond the released tag version.
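The approach described in this reply can be sketched roughly as follows. This is a minimal illustration, not the contents of release/bump_patch_version.py: the function names are hypothetical, and it assumes version.txt holds a plain MAJOR.MINOR.PATCH string.

```python
from pathlib import Path
from typing import Tuple


def parse_version(text: str) -> Tuple[int, int, int]:
    """Parse 'MAJOR.MINOR.PATCH' into a tuple of ints (format is an assumption)."""
    major, minor, patch = (int(part) for part in text.strip().split("."))
    return major, minor, patch


def read_version(path: Path) -> str:
    """Read the version string from a version.txt-style file."""
    return path.read_text(encoding="utf-8").strip()


def next_patch_version(master_version: str) -> str:
    """Derive the post-release bump from master's current version."""
    major, minor, patch = parse_version(master_version)
    return f"{major}.{minor}.{patch + 1}"


def would_downgrade(candidate: str, master_version: str) -> bool:
    """True if writing `candidate` would not move master's version forward."""
    return parse_version(candidate) <= parse_version(master_version)


# Illustrative values: the release tag is 0.18.2, but master already moved to 0.19.0.
released, master = "0.18.2", "0.19.0"
stale_bump = next_patch_version(released)  # bump derived from the tag: 0.18.3
safe_bump = next_patch_version(master)     # bump derived from master: 0.19.1
print(stale_bump, would_downgrade(stale_bump, master))
print(safe_bump, would_downgrade(safe_bump, master))
```

Comparing parsed tuples rather than raw strings matters here, since a string comparison would rank "0.9.0" above "0.10.0".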
b858ce5 to be529dc
Hi @heurry, I would suggest separating this PR into two, each fixing a different issue.
Also, I see that eigenvalue-based MoQ is temporarily disabled. Hi @sfc-gh-truwase, do you know whether this functionality is still needed?
Signed-off-by: heurry <restart12212022@163.com>
be529dc to 198e52f
Thanks for the review. I updated this PR to address only the eigenvalue monitor logging issue (#7983). The release workflow/version check changes have been removed from this PR, and the diff is now limited to … I also updated the PR title and description to match the narrowed scope.
@delock thanks for the good catch. @heurry thanks for the PR, but unfortunately it applies to a feature that is disabled because the overall HybridEngine has not been maintained. Are you interested in using HybridEngine?
Summary
Fixes #7983.
This PR fixes eigenvalue monitor logging by generating monitor events from the local block_eigenvalue values instead of the nonexistent self.ev_values, and by converting the dict_values view into an indexable list before event generation.
The release workflow/version check change from the earlier revision was removed so this PR now addresses only the eigenvalue monitor issue.
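The dict_values problem named in the summary can be reproduced in isolation. The sketch below is not the DeepSpeed implementation: the helper name, the event tuple layout (tag, value, step), the tag string, and the shape of the eigenvalue entries are all assumptions made for illustration.

```python
from typing import Any, Dict, List, Tuple

Event = Tuple[str, float, int]


def build_eigenvalue_events(block_eigenvalue: Dict[Any, Tuple[float, int]],
                            global_samples: int) -> List[Event]:
    """Build (tag, value, step) monitor events from per-block eigenvalues.

    Hypothetical helper: assumes each dict value is an (eigenvalue, layer_id) pair.
    """
    # dict.values() returns a view that supports len() and iteration but not
    # indexing, so materialize it as a list before using positional access.
    ev_values = list(block_eigenvalue.values())
    return [(f"Train/Eigenvalues/ModelBlockParam_{i}", ev_values[i][0], global_samples)
            for i in range(len(ev_values))]


# Indexing the view directly fails, which is why the conversion is needed.
sample = {"block_0": (0.5, 0), "block_1": (0.25, 1)}
try:
    sample.values()[0]
except TypeError as exc:
    print("view is not indexable:", exc)

for event in build_eigenvalue_events(sample, global_samples=128):
    print(event)
```

Iterating with enumerate over the view would avoid the list copy entirely; the list conversion is shown because it matches the fix the summary describes.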
Changes
- Add _get_eigenvalue_monitor_events() for focused eigenvalue monitor event generation.
- Replace the self.ev_values monitor logging path with the helper.
Tests
- git diff --check master
- python -m py_compile deepspeed/runtime/engine.py tests/unit/runtime/test_engine.py
- PYTEST_DISABLE_PLUGIN_AUTOLOAD=1 python -m pytest tests/unit/runtime/test_engine.py -v