@yiliu30 yiliu30 commented Dec 4, 2025

No description provided.

yiliu30 and others added 21 commits November 20, 2025 20:57
Signed-off-by: yiliu30 <yi4.liu@intel.com>
@yiliu30 yiliu30 added the llmc label Dec 4, 2025

def _preprocess_block_inputs(self, inputs, first_input_name="input_ids"):
    input_ids, input_others = self._split_inputs(inputs, first_input_name)
    clear_memory(device_list=self.device_list)
@wenhuach21 wenhuach21 Dec 4, 2025

I don't know why clear_memory is needed here. @n1ck-guo do you know?

input_ids, input_others = normalize_input(inputs)
self.normalize_decoding_layer_inputs_(inputs)
block_inputs = self.inputs[self.quant_block_list[0][0]]
decoding_layer_first_input_name = "hidden_states"
@wenhuach21 wenhuach21 Dec 4, 2025

Does the original code make this assumption? Is there any case where first_input_name is not hidden_states?

Contributor Author

The original code replaces hidden_states with input_ids.
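
For readers following the thread, here is a minimal sketch of the renaming described above. It assumes the cached block inputs are a plain dict keyed by argument name, which this conversation does not confirm. `rename_first_input` and `split_inputs` are hypothetical helpers written for illustration; they are not functions from this repository.

```python
from typing import Any, Dict, Tuple


def rename_first_input(
    inputs: Dict[str, Any],
    old_name: str = "hidden_states",
    new_name: str = "input_ids",
) -> Dict[str, Any]:
    """Hypothetical: return a copy in which the entry under old_name is stored under new_name."""
    renamed = dict(inputs)  # shallow copy so the caller's dict is left untouched
    if old_name in renamed:
        renamed[new_name] = renamed.pop(old_name)
    return renamed


def split_inputs(
    inputs: Dict[str, Any], first_input_name: str
) -> Tuple[Any, Dict[str, Any]]:
    """Hypothetical: separate the primary input from the remaining keyword inputs."""
    if first_input_name not in inputs:
        raise KeyError(f"expected key {first_input_name!r} in block inputs")
    others = {k: v for k, v in inputs.items() if k != first_input_name}
    return inputs[first_input_name], others


# Placeholder values stand in for tensors.
cached = {"hidden_states": [[0.1, 0.2]], "attention_mask": [[1, 1]]}
legacy = rename_first_input(cached)
first, others = split_inputs(legacy, "input_ids")
```

Under these assumptions, splitting on "hidden_states" directly would make the renaming step unnecessary, which appears to be what the new `decoding_layer_first_input_name` variable is for.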

@yiliu30 yiliu30 added this to the 0.9.3 milestone Dec 4, 2025
@wenhuach21 wenhuach21 self-requested a review December 5, 2025 02:19
@chensuyue chensuyue merged commit 152f1a0 into main Dec 5, 2025
26 checks passed
@chensuyue chensuyue deleted the fix-attn-mask branch December 5, 2025 08:06
5 participants