Update the benchmark to load datasets from multiple S3 buckets when None is specified by R-Palazzo · Pull Request #607 · sdv-dev/SDGym

R-Palazzo · 2026-05-25T19:49:55Z

Resolve #604
CU-86b9zwpcu

This PR includes:

Relying on SDV download_demo() to get sdv_dataset. This works for public datasets or for users with SDV-Enterprise installed. Otherwise, because on SDV there is some validation to allow access to the public bucket only, I reused the SDV logic here, so it works if the S3 client is created with valid credentials (from input keys or env variables).
Because we rely on download_demo(), we no longer save the dataset locally.
Improvement in the _generate_job_arg_list: Now we're only passing dataset_info inside it. The dataset_info includes everything necessary to then download the data and metadata during execution.
Added an integration test with a private dataset. Also tested it on AWS here and GCP here.

sdv-team · 2026-05-25T19:50:00Z

Task linked: CU-86b9zwpcu SDGym - Update the benchmark to load datasets from multiple S3 buckets when None is specified #604

codecov · 2026-05-25T19:59:07Z

Codecov Report

❌ Patch coverage is 95.87629% with 4 lines in your changes missing coverage. Please review.
✅ Project coverage is 85.91%. Comparing base (61e71cb) to head (e4bfb1d).

Files with missing lines	Patch %	Lines
sdgym/benchmark.py	92.45%	4 Missing ⚠️

Additional details and impacted files

@@            Coverage Diff             @@
##             main     #607      +/-   ##
==========================================
+ Coverage   85.67%   85.91%   +0.24%     
==========================================
  Files          40       40              
  Lines        3679     3749      +70     
==========================================
+ Hits         3152     3221      +69     
- Misses        527      528       +1

Flag	Coverage Δ
integration	`43.85% <76.28%> (+0.25%)`	⬆️
unit	`81.72% <95.87%> (+0.23%)`	⬆️

Flags with carried forward coverage won't be shown. Click here to find out more.

☔ View full report in Codecov by Sentry.
📢 Have feedback on the report? Share it here.

🚀 New features to boost your workflow:

❄️ Test Analytics: Detect flaky tests, report on failures, and find test suite problems.

amontanez24 · 2026-05-26T16:51:49Z


+def _get_dataset_bucket_mapping(modality, buckets, s3_client, skip_inaccessible=False):
+    """Map SDV demo dataset names to the bucket they should be loaded from."""
+    dataset_buckets = {}


nitpick: can we call dataset_to_bucket

Yes, done in c349d70

R-Palazzo · 2026-05-28T09:54:00Z

 """Main SDGym benchmarking module."""

 import functools
-import gzip


Since the dataset is no longer included in the job_arg_list, we don't need to compress it. Also, it now contains one job by default, and from my tests, it's roughly 1KB.

sarahmish · 2026-06-01T14:07:48Z

+def _get_s3_client_from_result_writer(result_writer):
+    if isinstance(result_writer, S3ResultsWriter):
+        return result_writer.s3_client
+
+    return None


Just want to note that in the future, we should setup the s3 client independently from using the same credentials as writing the results.

R-Palazzo requested review from amontanez24 and frances-h May 25, 2026 19:49

R-Palazzo self-assigned this May 25, 2026

R-Palazzo requested a review from a team as a code owner May 25, 2026 19:49

R-Palazzo removed the request for review from a team May 25, 2026 19:50

R-Palazzo commented May 26, 2026

View reviewed changes

Comment thread pyproject.toml

R-Palazzo force-pushed the issue-604-2-private-bucket branch 2 times, most recently from 95be93a to 9a91033 Compare May 27, 2026 14:53

R-Palazzo mentioned this pull request May 27, 2026

Update the benchmark to launch one instance per (dataset, synthesizer) pair #611

Open

amontanez24 reviewed May 27, 2026

View reviewed changes

R-Palazzo force-pushed the issue-604-2-private-bucket branch from 725c22c to 5555441 Compare May 28, 2026 09:27

R-Palazzo commented May 28, 2026

View reviewed changes

R-Palazzo requested a review from amontanez24 May 28, 2026 09:54

frances-h approved these changes May 28, 2026

View reviewed changes

Comment thread sdgym/benchmark.py Outdated

R-Palazzo requested a review from sarahmish May 29, 2026 09:11

sarahmish reviewed Jun 1, 2026

View reviewed changes

R-Palazzo force-pushed the issue-604-2-private-bucket branch from b97fa0d to 0786528 Compare June 1, 2026 17:08

R-Palazzo requested a review from sarahmish June 1, 2026 17:09

sarahmish reviewed Jun 1, 2026

View reviewed changes

Comment thread sdgym/datasets.py

Comment thread sdgym/datasets.py

R-Palazzo requested a review from sarahmish June 1, 2026 17:29

R-Palazzo added 8 commits June 1, 2026 19:16

load dataset only once

6425fd1

use downlad_demo from sdv

d998966

update _resolve_dataset

d03f8f5

fix lint

75bc0a6

cleaning

16485c4

add validation

b1c68ff

move dataset loading to execution

815397b

fix tests

2a7b68f

R-Palazzo added 6 commits June 1, 2026 19:16

rename _get_dataset_bucket_mapping -> dataset_to_bucket

82faa73

add tests

40edcda

stop compressing job_arg_list

3ed9edc

undo pip install command

4cf0001

remove dataset_name from JobArgs

2731ccf

add logging

4919876

R-Palazzo force-pushed the issue-604-2-private-bucket branch from 4bd6e85 to 4907bb3 Compare June 1, 2026 18:33

improve _load_sdv_demo_dataset

e4bfb1d

R-Palazzo force-pushed the issue-604-2-private-bucket branch from 4907bb3 to e4bfb1d Compare June 1, 2026 18:35

sarahmish approved these changes Jun 1, 2026

View reviewed changes

R-Palazzo merged commit 6228ecb into main Jun 1, 2026
51 of 57 checks passed

R-Palazzo deleted the issue-604-2-private-bucket branch June 1, 2026 19:55

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Update the benchmark to load datasets from multiple S3 buckets when None is specified#607

Update the benchmark to load datasets from multiple S3 buckets when None is specified#607
R-Palazzo merged 15 commits into
mainfrom
issue-604-2-private-bucket

R-Palazzo commented May 25, 2026 •

edited

Loading

Uh oh!

sdv-team commented May 25, 2026

Uh oh!

codecov Bot commented May 25, 2026 •

edited

Loading

Uh oh!

Uh oh!

amontanez24 May 26, 2026

Uh oh!

R-Palazzo May 28, 2026

Uh oh!

R-Palazzo May 28, 2026

Uh oh!

Uh oh!

sarahmish Jun 1, 2026

Uh oh!

Uh oh!

Uh oh!

Uh oh!

Uh oh!

Uh oh!

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

5 participants

Conversation

R-Palazzo commented May 25, 2026 • edited Loading Uh oh! There was an error while loading. Please reload this page.

Uh oh!

Uh oh!

sdv-team commented May 25, 2026

Uh oh!

codecov Bot commented May 25, 2026 • edited Loading Uh oh! There was an error while loading. Please reload this page.

Uh oh!

Codecov Report

Uh oh!

Uh oh!

amontanez24 May 26, 2026

Choose a reason for hiding this comment

Uh oh!

R-Palazzo May 28, 2026

Choose a reason for hiding this comment

Uh oh!

R-Palazzo May 28, 2026

Choose a reason for hiding this comment

Uh oh!

Uh oh!

sarahmish Jun 1, 2026

Choose a reason for hiding this comment

Uh oh!

Uh oh!

Uh oh!

Uh oh!

Uh oh!

Uh oh!

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

5 participants

R-Palazzo commented May 25, 2026 •

edited

Loading

codecov Bot commented May 25, 2026 •

edited

Loading