FEAT: Scenario DatasetConfiguration #1288

rlundeen2 · 2025-12-29T19:24:33Z

This PR introduces DatasetConfiguration class that is passed to scenarios in initialize_async to address several pain points.

Big available default datasets

It was tough to change the default datasets - some were too big and some too small. As an example, Foundry used harm_bench, which is too big for the default. So it would randomly select 4 by default. But how could users run against all 100? There was no way to configure this. On the flip side garak.encoding_scenario had a large dataset that took a very long time to run, but there was no way to make it small by default and still have the entire dataset available.

This change allows scenarios to be configured with a default (e.g. name=harm_bench, max=4). But users can easily configure both the dataset name or max differently.

It also helps with our end to end tests, which were taking a million years because encoding scenario had so many datasets (this pr reduces to only 3 by default)

Allows dataset params from the front end

This also allows users to specify the dataset names they want to use from pyrit_scan and pyrit_shell. Previously, users had to use the dataset defaults.

--dataset-names DATASET_NAMES [DATASET_NAMES ...]
Dataset names to load for the scenario (overrides scenario defaults)
--max-dataset-size MAX_DATASET_SIZE
Maximum number of seed groups to use (randomly samples if dataset is larger)

Deprecates Incompatible Parameters

objective, seed_prompts are now deprecated as initialization parameters for scenarios

Tests

Unit tests added!
End to end tests passed 2c281be

…_12_28_scenario_dataset

rlundeen2 added 12 commits December 28, 2025 15:39

Adding DatasetConfiguration

efbc8fe

updating tests

b59d651

updated

1ea0297

pre-commit

975c30e

test fixc

726b8ef

Merge branch 'main' into users/rlundeen/2025_12_28_scenario_dataset

4088a8e

fixing max

a73bb1f

Fixing bug with system prompt

f2cd912

fixing tests

f95d4ff

adding validation for prompt_target prepended_conversation combos

504f7bd

Merge branch 'users/rlundeen/2025_12_29_bug' into users/rlundeen/2025…

e179dc5

…_12_28_scenario_dataset

adding debugging to openaitarget

2c281be

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Uh oh!

FEAT: Scenario DatasetConfiguration #1288

FEAT: Scenario DatasetConfiguration #1288

rlundeen2 commented Dec 29, 2025 •

edited

Loading

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

1 participant

FEAT: Scenario DatasetConfiguration #1288

Are you sure you want to change the base?

FEAT: Scenario DatasetConfiguration #1288

Conversation

rlundeen2 commented Dec 29, 2025 • edited Loading Uh oh! There was an error while loading. Please reload this page.

Uh oh!

Big available default datasets

Allows dataset params from the front end

Deprecates Incompatible Parameters

Tests

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

1 participant

rlundeen2 commented Dec 29, 2025 •

edited

Loading