[FSDP2/Megatron-FSDP/DCP] If model parameters are DTensors, optimizer states should also be DTensors.#2795
Merged
vthumbe1503 merged 17 commits intoNVIDIA:mainfrom Apr 4, 2026
Merged
Commits
Commits on Mar 31, 2026
- committed
- authored andcommitted
- committed
- andcommitted
- committed
- committed
- authored andcommitted
- committed
- authored andcommitted
- committed
- committed
- committed