-
-
Notifications
You must be signed in to change notification settings - Fork 11.6k
[Bugfix][EPLB] Disabled shared expert overlap when EPLB is enabled #28377
New issue
Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.
By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.
Already on GitHub? Sign in to your account
[Bugfix][EPLB] Disabled shared expert overlap when EPLB is enabled #28377
Conversation
There was a problem hiding this comment.
Choose a reason for hiding this comment
The reason will be displayed to describe this comment to others. Learn more.
Code Review
This pull request introduces a workaround to fix a correctness issue with Expert Parallelism Load Balancing (EPLB) by disabling shared expert overlap when EPLB is enabled. The change is straightforward and effectively addresses the issue described. I have one suggestion to improve a comment's clarity to ensure it accurately reflects the code's logic, which will enhance future maintainability.
yewentao256
left a comment
There was a problem hiding this comment.
Choose a reason for hiding this comment
The reason will be displayed to describe this comment to others. Learn more.
Nice catch, thanks for the work!
Signed-off-by: Sage Moore <sage@neuralmagic.com>
Co-authored-by: gemini-code-assist[bot] <176961590+gemini-code-assist[bot]@users.noreply.github.com> Signed-off-by: Sage Moore <sagemoore@utexas.edu> Signed-off-by: Sage Moore <sage@neuralmagic.com>
Co-authored-by: Wentao Ye <44945378+yewentao256@users.noreply.github.com> Signed-off-by: Sage Moore <sagemoore@utexas.edu> Signed-off-by: Sage Moore <sage@neuralmagic.com>
Signed-off-by: Wentao Ye <44945378+yewentao256@users.noreply.github.com> Signed-off-by: Sage Moore <sage@neuralmagic.com>
d949b89 to
bd2c846
Compare
…llm-project#28377) Signed-off-by: Sage Moore <sage@neuralmagic.com> Signed-off-by: Sage Moore <sagemoore@utexas.edu> Signed-off-by: Wentao Ye <44945378+yewentao256@users.noreply.github.com> Co-authored-by: gemini-code-assist[bot] <176961590+gemini-code-assist[bot]@users.noreply.github.com> Co-authored-by: Wentao Ye <44945378+yewentao256@users.noreply.github.com> Signed-off-by: xuebwang-amd <xuebwang@amd.com>
Purpose
EPLB is currently broken on main after #28164. This PR works around the issue by disabling shared expert overlap when EPLB is enabled.
Test Plan
lm eval
Test Result
Server command
Before
After