[FSDP2/Megatron-FSDP/DCP] If model parameters are DTensors, optimizer states should also be DTensors. #2795

Merged
vthumbe1503 merged 17 commits into NVIDIA:main from cspades:cye/fused-adam-dcp
Apr 4, 2026

Commits

Commits on Mar 31, 2026

Commits on Apr 3, 2026

Commits on Apr 4, 2026