Allow any multiple of 64 during chunked prefill #169

JRosenkranz · 2025-11-13T04:40:40Z

This PR will allow prompts with any multiple of 64 during chunked prefill while ensuring prefill size chunks

Signed-off-by: Antoni Viros i Martin <aviros@ibm.com>

…atch Signed-off-by: Joshua Rosenkranz <jmrosenk@us.ibm.com>

Signed-off-by: Joshua Rosenkranz <jmrosenk@us.ibm.com>

JRosenkranz · 2025-11-13T13:55:59Z

This PR will replace #160

Signed-off-by: Joshua Rosenkranz <jmrosenk@us.ibm.com>

JRosenkranz · 2025-11-13T14:15:22Z

bot:test
TEST_FILE=test_scripts.py

ani300 · 2025-11-13T15:19:27Z

aiu_fms_testing_utils/scripts/drive_paged_programs.py

+                        # add the valid prompt size to the end since it will already exist in the above enforce_sizes
+                        possible_seq_lengths = possible_seq_lengths + [
+                            valid_prompt_shape[1]
+                        ]


why are we adding this but we weren't before?

Technically it is not needed, but if we cycle through the sequence lengths under a certain program, there is no reason why we are skipping the largest size. This is an artifact of the prior PR which had all sequences of prefill_chunk size. I think this can be removed in this PR, though it makes sense to add it in general

ani300 · 2025-11-13T15:22:33Z

aiu_fms_testing_utils/utils/paged.py

+                        if chunk_j == 0:
+                            chunk_start = 0
+                            chunk_end = prefill_chunk_size - required_extra_pads
+                        else:
+                            required_extra_pads = 0
+                            chunk_start = chunk_end
+                            chunk_end += prefill_chunk_size


I know what chunk_start and chunk_end mean here, but I don't think they're the best names for these variables, as they are more of a mapping between the original sequence and its chunk partition. I don't know what would be a better name, maybe just a comment explaining what they are?

ani300 · 2025-11-13T15:26:53Z

aiu_fms_testing_utils/utils/paged.py

+                        position_ids_seq_chunk = kwargs["position_ids"][seq_i][
+                            chunk_start:chunk_end
+                        ]
+                        if required_extra_pads > 0:


I'm 50-50 on whether it's cleaner to centralize all the "if required_extra_pads > 0" into a single one or keep it as is. I was thinking maybe create all the {property}_seq_chunk first, then do all the padding, and finally turn them into unsqueezed tensors might make the code cleaner, but idk

ani300

left some comments on code clarity for future us, but the logic looks good if test_scripts passes

Signed-off-by: Joshua Rosenkranz <jmrosenk@us.ibm.com>

JRosenkranz · 2025-11-13T16:56:30Z

bot:test
TEST_FILE=test_scripts.py

Signed-off-by: Joshua Rosenkranz <jmrosenk@us.ibm.com>

JRosenkranz · 2025-11-14T00:37:16Z

bot:test
TEST_FILE=test_scripts.py

aiu_fms_testing_utils/utils/paged.py

Signed-off-by: Joshua Rosenkranz <jmrosenk@us.ibm.com>

JRosenkranz · 2025-11-14T15:21:00Z

bot:test
TEST_FILE=test_scripts.py

ani300 and others added 11 commits October 30, 2025 01:27

Add padding to chunk size and update env vars

5b4d826

Signed-off-by: Antoni Viros i Martin <aviros@ibm.com>

Fix the DPP warmup

ff9f9a4

Signed-off-by: Antoni Viros i Martin <aviros@ibm.com>

address PR comments

edc6d6d

Signed-off-by: Antoni Viros i Martin <aviros@ibm.com>

Ruff and dynamic

5ffd7e3

Signed-off-by: Antoni Viros i Martin <aviros@ibm.com>

remove import

c5fab85

Signed-off-by: Antoni Viros i Martin <aviros@ibm.com>

fixed incorrect chunk sizes in all sequences but the largest in the b…

2851133

…atch Signed-off-by: Joshua Rosenkranz <jmrosenk@us.ibm.com>

Merge branch 'main' into chunked_padding

60c90a1

Merge branch 'custom_dataset_fix' into chunked_padding

fd2935f

Merge branch 'main' into chunked_padding

b541d7a

fixed logic to allow any multiple of 64 in chunked prefill

3510df6

Signed-off-by: Joshua Rosenkranz <jmrosenk@us.ibm.com>

compile/inference is now completing

6fa414c

Signed-off-by: Joshua Rosenkranz <jmrosenk@us.ibm.com>

JRosenkranz marked this pull request as ready for review November 13, 2025 13:55

JRosenkranz requested a review from ani300 November 13, 2025 13:55

added tests

13df639

Signed-off-by: Joshua Rosenkranz <jmrosenk@us.ibm.com>

ani300 reviewed Nov 13, 2025

View reviewed changes

ani300 approved these changes Nov 13, 2025

View reviewed changes

JRosenkranz added 2 commits November 13, 2025 16:52

addressed PR comments

cc46501

Signed-off-by: Joshua Rosenkranz <jmrosenk@us.ibm.com>

disable cache for chunk prefill in test_scripts

c28aed8

Signed-off-by: Joshua Rosenkranz <jmrosenk@us.ibm.com>

adding pad block for chunked prefill;

40f88e1

Signed-off-by: Joshua Rosenkranz <jmrosenk@us.ibm.com>

ani300 reviewed Nov 14, 2025

View reviewed changes

aiu_fms_testing_utils/utils/paged.py Outdated Show resolved Hide resolved

removed unnecessary %

b6175ac

Signed-off-by: Joshua Rosenkranz <jmrosenk@us.ibm.com>

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Uh oh!

Allow any multiple of 64 during chunked prefill #169

Allow any multiple of 64 during chunked prefill #169

Uh oh!

JRosenkranz commented Nov 13, 2025

Uh oh!

JRosenkranz commented Nov 13, 2025

Uh oh!

JRosenkranz commented Nov 13, 2025

Uh oh!

ani300 Nov 13, 2025

Uh oh!

JRosenkranz Nov 13, 2025

Uh oh!

ani300 Nov 13, 2025

Uh oh!

ani300 Nov 13, 2025

Uh oh!

ani300 left a comment

Uh oh!

JRosenkranz commented Nov 13, 2025

Uh oh!

JRosenkranz commented Nov 14, 2025

Uh oh!

Uh oh!

JRosenkranz commented Nov 14, 2025

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

3 participants

Allow any multiple of 64 during chunked prefill #169

Are you sure you want to change the base?

Allow any multiple of 64 during chunked prefill #169

Uh oh!

Conversation

JRosenkranz commented Nov 13, 2025

Uh oh!

JRosenkranz commented Nov 13, 2025

Uh oh!

JRosenkranz commented Nov 13, 2025

Uh oh!

ani300 Nov 13, 2025

Choose a reason for hiding this comment

Uh oh!

JRosenkranz Nov 13, 2025

Choose a reason for hiding this comment

Uh oh!

ani300 Nov 13, 2025

Choose a reason for hiding this comment

Uh oh!

ani300 Nov 13, 2025

Choose a reason for hiding this comment

Uh oh!

ani300 left a comment

Choose a reason for hiding this comment

Uh oh!

JRosenkranz commented Nov 13, 2025

Uh oh!

JRosenkranz commented Nov 14, 2025

Uh oh!

Uh oh!

JRosenkranz commented Nov 14, 2025

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

3 participants