Fix: Perform sigmoid calculation in fp32 for aux loss stability #2765

CodersAcademy006 · 2025-12-26T11:46:02Z

This PR updates compute_routing_scores_for_aux_loss to perform sigmoid routing score calculations in float32, matching the existing behavior of the softmax path.

Fixes #2741

Currently, the softmax routing path explicitly casts to float32 to avoid underflow/overflow in BF16/FP16. However, the sigmoid path performs operations in the input dtype.

While sigmoid is bounded [0, 1], the subsequent normalization (scores / sum) involves accumulation that can suffer from precision loss in BF16, especially with a large number of experts. This change aligns both methods to use high-precision accumulation for auxiliary loss stability.

copy-pr-bot · 2025-12-26T11:46:07Z

This pull request requires additional validation before any workflows can run on NVIDIA's runners.

Pull request vetters can view their responsibilities here.

Contributors can view more details about this message here.

Fix: Perform sigmoid calculation in fp32 for aux loss stability

fc17827

CodersAcademy006 requested review from a team as code owners December 26, 2025 11:46

github-actions bot requested a review from Phlip79 December 26, 2025 11:46

github-actions bot added the community-request label Dec 26, 2025

Merge branch 'main' into fix/moe-sigmoid-fp32

1ced23a

yaox12 approved these changes Jan 4, 2026

View reviewed changes

yaox12 added the Expert Review Apply this label to indicate that your PR is ready for expert review. label Jan 4, 2026

Merge branch 'main' into fix/moe-sigmoid-fp32

5b77b88

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Uh oh!

Fix: Perform sigmoid calculation in fp32 for aux loss stability #2765

Fix: Perform sigmoid calculation in fp32 for aux loss stability #2765

Uh oh!

CodersAcademy006 commented Dec 26, 2025

Uh oh!

copy-pr-bot bot commented Dec 26, 2025

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

2 participants

Fix: Perform sigmoid calculation in fp32 for aux loss stability #2765

Are you sure you want to change the base?

Fix: Perform sigmoid calculation in fp32 for aux loss stability #2765

Uh oh!

Conversation

CodersAcademy006 commented Dec 26, 2025

Uh oh!

copy-pr-bot bot commented Dec 26, 2025

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

2 participants