Changing RMS layer norm to accept DTensors. #982
Conversation
Force-pushed from ff318e5 to 23fc67f
Can you run the example e2e training script just using FSDP2 (since TP won't work until all kernels are tp-compatible) to ensure the perf/loss is correct?
    # needs to be gathered to a local tensor to compute
    # RMS layer norm on each TP worker.
    # TODO: support CP.
    X = X.full_tensor()
For my understanding, pytorch native TP keeps activations as DTensors and lets subsequent ops decide what to do?
That is my understanding as well.
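For illustration, here is a minimal sketch of the gather-then-compute pattern discussed above. It assumes a recent PyTorch where DTensor is importable from torch.distributed.tensor, and the eager-mode RMS norm is only a stand-in for Liger's Triton kernel; the function name is made up for this example.

```python
# Hedged sketch: gather a sharded DTensor input before computing RMS norm
# locally on each TP worker. Regular tensors fall through unchanged.
import torch

try:
    from torch.distributed.tensor import DTensor  # PyTorch >= 2.4
except ImportError:
    from torch.distributed._tensor import DTensor  # older releases

def rms_norm_forward(X: torch.Tensor, W: torch.Tensor, eps: float = 1e-6) -> torch.Tensor:
    if isinstance(X, DTensor):
        # Under tensor parallelism the activations (and their gradients) may
        # arrive sharded; gather to a full local tensor so each TP worker
        # normalizes over the complete hidden dimension.
        X = X.full_tensor()
    # Eager RMS norm, standing in for the Triton kernel.
    variance = X.pow(2).mean(-1, keepdim=True)
    return W * (X * torch.rsqrt(variance + eps))
```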
src/liger_kernel/utils.py (outdated diff)
    return PEFT_AVAILABLE


    def infer_backend():
Suggest renaming to infer_comm_backend
Changed as suggested.
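For context, a purely hypothetical sketch of what a helper with that name might look like; the actual function added to src/liger_kernel/utils.py is not shown in this thread and may differ.

```python
# Hypothetical illustration only; not the implementation from this PR.
import torch

def infer_comm_backend() -> str:
    """Guess a torch.distributed communication backend from the available device."""
    return "nccl" if torch.cuda.is_available() else "gloo"
```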
Force-pushed from 80279eb to 75a54f8
I did not see any benchmarks using FSDP in the scripts folder. I think we can say with a very high degree of confidence that this PR will not impact FSDP, since if it did, the op would have crashed earlier (e.g. if any inputs were DTensors). Conversely, this PR only affects DTensor inputs; behavior for regular tensors remains unchanged.
Force-pushed from 75a54f8 to 9af46d9
Force-pushed from 9af46d9 to b445975
Summary
Changing RMS layer norm to accept DTensors.
Details
RMS layer norm parameters are NOT sharded under typical tensor parallelism implementations, but the inputs (and their gradients) may arrive sharded as DTensors. This PR resolves the issue by gathering the input tensors and performing the full computation on each device.
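To make the scenario concrete, here is an illustrative snippet (not from the PR) of how an activation can become a sharded DTensor under PyTorch native parallelism and then be handed to the norm layer; the mesh size and shapes are arbitrary, and the import paths assume a recent PyTorch.

```python
# Illustrative only: producing a sharded DTensor activation (assumes
# torch.distributed is initialized and two CUDA devices are available).
import torch
from torch.distributed.device_mesh import init_device_mesh
from torch.distributed.tensor import Shard, distribute_tensor

mesh = init_device_mesh("cuda", (2,))                      # 1-D mesh of 2 GPUs
x = torch.randn(8, 4096, device="cuda")
x_dt = distribute_tensor(x, mesh, placements=[Shard(0)])   # sharded along dim 0
# With this change, the RMS norm op can accept x_dt directly: it gathers the
# shards via full_tensor() and runs the kernel on the full local tensor.
```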
Related Issues:
Testing Done
- make test to ensure correctness
- make checkstyle to ensure code style
- make test-convergence to ensure convergence