Refactor get valid prompts - for memory optimization #170

New issue

Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.

Sign up for GitHub

By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.

Already on GitHub? Sign in to your account

Jump to bottom

Open

Ssukriti wants to merge 6 commits into main from refactor_get_prompts

+113 −107

Ssukriti commented Nov 14, 2025

Currently on main, we generate all valid prompts for all programs in a large list called valid_prompts. After which, we start validating programs using the list of prompts. This can cause large memory consumption if size of prompts is large. We were running into out of memory with 128k size prompts.

This PR makes the generation of prompts a iterator instead. Prompts will be generated for a program and the program will be validated, before going to the next program. If prompts could not be generated for a program, the program validation is skipped just like today.

This helps save a lot of memory. Users will see change that prompt extraction for all programs will not happen upfront, and will happen as needed for a program

Ssukriti added 2 commits

November 14, 2025 13:55


          refactor get valid prompts

c156101

Signed-off-by: Sukriti-Sharma4 <sukriti.sharma4@ibm.com>


          black fmt

ba345b1

Signed-off-by: Sukriti-Sharma4 <sukriti.sharma4@ibm.com>

Ssukriti marked this pull request as ready for review

November 17, 2025 20:52

Author

Ssukriti commented Nov 17, 2025

I tested with 32*32 RAG factoid dataset on granite model and got comparable logs as the main branch

Kept refactor to the minimal for the memory fix, as there is another branch with a whole refactor

Ssukriti requested a review from JRosenkranz

November 17, 2025 20:54

Ssukriti added 3 commits

November 17, 2025 15:58


          rebase on main

22c13ea

Signed-off-by: Sukriti-Sharma4 <sukriti.sharma4@ibm.com>


          merge main

0ce59bf

Signed-off-by: Sukriti-Sharma4 <sukriti.sharma4@ibm.com>


          merge main

952d432

Signed-off-by: Sukriti-Sharma4 <sukriti.sharma4@ibm.com>

Ssukriti commented

View reviewed changes

aiu_fms_testing_utils/scripts/drive_paged_programs.py

    
                                  (

              def get_program_prompt_list():

                  if custom_shape:

                      prompt_found = 0

Author

Ssukriti Nov 17, 2025 •

edited

Loading

since its hard to see git diff changes made in this PR:,
change 1 - use prompt_found flag as we are yielding instead of storing in list

Ssukriti commented

View reviewed changes

aiu_fms_testing_utils/scripts/drive_paged_programs.py

    
                                      pad_multiple=pad_multiple,

                                  )

                                  prompt_found = 1

                                  yield (

Author

Ssukriti Nov 17, 2025

change2: yield instead of list, flag set before yield

Ssukriti commented

View reviewed changes

aiu_fms_testing_utils/scripts/drive_paged_programs.py

    
                                  )

                              ]

                                  break

                          if prompt_found:

Author

Ssukriti Nov 17, 2025

change3: see flag instead of length of list

Ssukriti commented

View reviewed changes

aiu_fms_testing_utils/scripts/drive_paged_programs.py

    
                                      )

                                      valid_prompts.append(

                                          (

                                          used_keys.add(program_seq_key[0])

Author

Ssukriti Nov 17, 2025

change 4: used_keys.add(program_seq_key[0]) before yield and then yield

Ssukriti commented

View reviewed changes

aiu_fms_testing_utils/scripts/drive_paged_programs.py

    
                  input_ids,

                  extra_kwargs,

                  sample_key,

              ) in get_program_prompt_list():

Author

Ssukriti Nov 17, 2025

change 5: call function to yield instead of list


          ruff format

0aa793f

Signed-off-by: Sukriti-Sharma4 <sukriti.sharma4@ibm.com>

Author

Ssukriti commented Nov 18, 2025

rebased on latest main and tested

Contributor

JRosenkranz commented Nov 18, 2025

bot:test
TEST_FILE=test_scripts.py

Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment

Labels

None yet