
[Feature]: add ViT activation_offload for InternS1#1619

Open
NengXu001 wants to merge 1 commit into InternLM:main from NengXu001:main

Conversation

@NengXu001
Contributor

Reduce InternS1 training memory by roughly 10 GB via ViT activation offloading:

  1. Added support for offloading activations in the modeling_vision module.
  2. Added fields so that VisionConfig and the MoE modules can dynamically see each other's layer depths.
  3. Updated the MoE activation offloading arguments to include the necessary vision parameters.
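The offloading idea behind item 1 can be sketched with PyTorch's built-in `torch.autograd.graph.save_on_cpu` context manager: activations saved for backward inside the context live on the CPU and are copied back to the device only when backward needs them. This is a minimal illustration of the technique, not the PR's actual implementation; `vit_block` is an illustrative stand-in, as the real InternS1 modeling_vision layers are more involved.

```python
import torch
from torch.autograd.graph import save_on_cpu

# Illustrative stand-in for a ViT transformer block (hypothetical, not InternS1 code).
vit_block = torch.nn.Sequential(
    torch.nn.Linear(64, 256),
    torch.nn.GELU(),
    torch.nn.Linear(256, 64),
)

x = torch.randn(4, 16, 64, requires_grad=True)

# Tensors saved for backward inside this context are moved to CPU and
# copied back on demand during backward, trading memory for transfer time.
with save_on_cpu(pin_memory=False):
    y = vit_block(x)

y.sum().backward()
```

On a GPU run, the memory savings come at the cost of host/device copies, which is the trade-off the PR's ~10 GB figure reflects.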

Collaborator

@HAOCHENYE HAOCHENYE left a comment


We should enhance the implementation of activation offload to make it more general-purpose, so that it can serve multiple models without needing to be aware of layer counts.

use_mask_token: bool = False
use_mean_pooling: bool = True
attn_impl: Literal["flash_attention", "flex_attention", "eager_attention"] = "flash_attention"
text_hidden_layers: int = 0
Collaborator


We should not modify other modules to work around functional deficiencies in ActivationOffload itself. I understand that, from VisionConfig's perspective, it should not need to be aware of text_hidden_layers.
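The general-purpose direction the reviewer is pointing at is achievable with PyTorch's `torch.autograd.graph.saved_tensors_hooks`: a wrapper can offload any module's saved activations without that module knowing anything about other components' layer counts. A minimal sketch under that assumption (`run_with_offload` and the helper names are hypothetical, not xtuner/InternS1 API):

```python
import torch

def pack_to_cpu(t: torch.Tensor):
    # Move the activation saved for backward to CPU, remembering its device.
    return t.device, t.detach().to("cpu")

def unpack_from_cpu(packed):
    # Restore the activation to its original device when backward needs it.
    device, t = packed
    return t.to(device)

def run_with_offload(module: torch.nn.Module, x: torch.Tensor) -> torch.Tensor:
    # Works for any module: no awareness of vision vs. text layer depths.
    with torch.autograd.graph.saved_tensors_hooks(pack_to_cpu, unpack_from_cpu):
        return module(x)

block = torch.nn.Sequential(torch.nn.Linear(8, 8), torch.nn.ReLU())
x = torch.randn(2, 8, requires_grad=True)
out = run_with_offload(block, x)
out.sum().backward()
```

With this shape, VisionConfig would not need a `text_hidden_layers` field at all; the offload wrapper is applied per-module from the outside.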

@HAOCHENYE HAOCHENYE added the npu label Mar 25, 2026