fix: resolve issue #13811 by Zhu1116 · Pull Request #13813 · huggingface/diffusers

Zhu1116 · 2026-05-26T13:11:33Z

What does this PR do?

I found an issue in the cond ids processing in train_dreambooth_lora_flux2_img2img.py in diffusers 0.38.0, around lines 1703-1709 of the package. This issue also exists in the main branch.

With the original code:

cond_model_input_list = [cond_model_input[i].unsqueeze(0) for i in range(cond_model_input.shape[0])]
cond_model_input_ids = Flux2Pipeline._prepare_image_ids(cond_model_input_list).to(
    device=cond_model_input.device
)
cond_model_input_ids = cond_model_input_ids.view(
    cond_model_input.shape[0], -1, model_input_ids.shape[-1]
)

When batch size is 2, the output cond_model_input_ids looks like this:

tensor([[[10,  0,  0,  0],
         [10,  0,  1,  0],
         [10,  0,  2,  0],
         ...,
         [10, 31, 29,  0],
         [10, 31, 30,  0],
         [10, 31, 31,  0]],

        [[20,  0,  0,  0],
         [20,  0,  1,  0],
         [20,  0,  2,  0],
         ...,
         [20, 31, 29,  0],
         [20, 31, 30,  0],
         [20, 31, 31,  0]]], device='cuda:0')

However, cond ids within the same batch should not be different.
Flux2Pipeline._prepare_image_ids is designed for multiple conditioning images from the same sample.

With the fixed code:

model_input_ids = Flux2Pipeline._prepare_latent_ids(model_input).to(device=model_input.device)
cond_model_input_ids = Flux2Pipeline._prepare_image_ids([cond_model_input[0:1]]).to(
    device=cond_model_input.device
)
cond_model_input_ids = cond_model_input_ids.expand(
    cond_model_input.shape[0], -1, -1
)

The output becomes correct (same cond ids for the whole batch):

tensor([[[10,  0,  0,  0],
         [10,  0,  1,  0],
         [10,  0,  2,  0],
         ...,
         [10, 31, 29,  0],
         [10, 31, 30,  0],
         [10, 31, 31,  0]],

        [[10,  0,  0,  0],
         [10,  0,  1,  0],
         [10,  0,  2,  0],
         ...,
         [10, 31, 29,  0],
         [10, 31, 30,  0],
         [10, 31, 31,  0]]], device='cuda:0')

I'm sorry if I have misunderstood the underlying logic.

Before submitting

This PR fixes a typo or improves the docs (you can dismiss the other checks if that's the case).
Did you read the contributor guideline?
Did you read our philosophy doc (important for complex PRs)?
Was this discussed/approved via a GitHub issue or the forum? Please add a link to it if that's the case.
Did you make sure to update the documentation with your changes? Here are the
documentation guidelines, and
here are tips on formatting docstrings.
Did you write any new necessary tests?

Who can review?

Anyone in the community is free to review the PR once the tests have passed. Feel free to tag
members/contributors who may be interested in your PR.

fix: resolve issue huggingface#13811

0392cd8

github-actions Bot added fixes-issue examples size/S PR with diff < 50 LOC and removed fixes-issue labels May 26, 2026

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

fix: resolve issue #13811#13813

fix: resolve issue #13811#13813
Zhu1116 wants to merge 1 commit into
huggingface:mainfrom
Zhu1116:fix/issue-13811

Zhu1116 commented May 26, 2026

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

1 participant

Conversation

Zhu1116 commented May 26, 2026

What does this PR do?

Before submitting

Who can review?

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

1 participant