feat(examples): Add end-to-end reproduction of Boag et al. 2018 mistrust pipeline on MIMIC-III by vtewari2 · Pull Request #962 · sunlabuiuc/PyHealth

vtewari2 · 2026-04-11T15:35:06Z

Summary

Adds a complete, runnable end-to-end example that reproduces the medical
mistrust classification pipeline from:

Boag et al. "Racial Disparities and Mistrust in End-of-Life Care."
MLHC 2018. arXiv:1808.03827
Original code: github.com/wboag/eol-mistrust

The example ties together all PyHealth components introduced in PRs #960 and
#961 into a single runnable script, and serves as a reference implementation
for reproducing the paper's supervised mistrust classifiers using modern
Python tooling.

Background

Boag et al. 2018 establishes that racial disparities in aggressive
end-of-life ICU care are better explained by medical mistrust than by
race alone. The paper trains L1-regularised logistic regression models on
interpersonal CHARTEVENTS features to produce continuous mistrust scores,
then uses those scores to stratify treatment duration disparities.

Key findings reproduced by this pipeline:

Black EOL patients receive ~879 min more mechanical ventilation than White
patients (p=0.009)
Mistrust stratification amplifies this gap to ~2,319 min (3× larger)
Noncompliance mistrust score AUC-ROC: 0.667
Autopsy-consent mistrust score AUC-ROC: 0.531
LogisticRegression + all mistrust features improves mortality AUC from
0.629 (baseline demographics) to 0.661

New: `examples/mistrust_prediction/mistrust_mimic3_logistic_regression.py`

What it does

MIMIC-III v1.4
└── MIMIC3Dataset(tables=["CHARTEVENTS", "NOTEEVENTS"])
│
├── build_interpersonal_itemids(D_ITEMS.csv.gz)
│ → {itemid: label} (~168 entries)
│
├── MistrustNoncomplianceMIMIC3.set_task()
│ → interpersonal_features (sequence) + noncompliance (binary)
│ → LogisticRegression(l1_lambda=2.62e-4)
│ → Trainer.train() + evaluate() → AUC-ROC
│
└── MistrustAutopsyMIMIC3.set_task()
→ interpersonal_features (sequence) + autopsy_consent (binary)
→ LogisticRegression(l1_lambda=1.43e-2)
→ Trainer.train() + evaluate() → AUC-ROC

L1 lambda derivation (paper-equivalent)

The original paper uses sklearn LogisticRegression(C=0.1, penalty='l1').
The PyHealth equivalent uses l1_lambda = 1 / (C × n_train):

Task	n_train (70%)	l1_lambda
Noncompliance	~38,157	`2.62e-4`
Autopsy consent	~697	`1.43e-2`

Modes

# Full run against real MIMIC-III data (requires PhysioNet access)
python mistrust_mimic3_logistic_regression.py \
    --mimic3_root /path/to/physionet.org/files/mimiciii/1.4

# Smoke-test — no data access needed (synthetic MIMIC-III)
python mistrust_mimic3_logistic_regression.py --synthetic

Expected output (real MIMIC-III v1.4)

Task: NoncompliantMistrust
  Test metrics:
    roc_auc : 0.667

Task: AutopsyConsentMistrust
  Test metrics:
    roc_auc : 0.531
Dependencies

- Requires PR #1 (l1_lambda in LogisticRegression)
- Requires PR #2 (MistrustNoncomplianceMIMIC3, MistrustAutopsyMIMIC3,
build_interpersonal_itemids)
- MIMIC-III v1.4 with PhysioNet credentialed access (or --synthetic for
smoke-test)


References

┌─────────────────────────────┬─────────────────────────────────────────────┐
│          Resource           │                    Link                     │
├─────────────────────────────┼─────────────────────────────────────────────┤
│ Paper (MLHC 2018)           │ https://arxiv.org/abs/1808.03827            │
├─────────────────────────────┼─────────────────────────────────────────────┤
│ Original code               │ https://github.com/wboag/eol-mistrust       │
  ├─────────────────────────────┼─────────────────────────────────────────────┤
│ PyHealth LogisticRegression │ pyhealth/models/logistic_regression.py      │
├─────────────────────────────┼─────────────────────────────────────────────┤
│ PyHealth Mistrust Tasks     │ pyhealth/tasks/mistrust_mimic3.py           │
├─────────────────────────────┼─────────────────────────────────────────────┤
│ ```                         │                                             │
└─────────────────────────────┴─────────────────────────────────────────────┘

## PRs
Merge order

PR #960 -> PR #961 -> PR #962 (this PR)
Merging out of order will cause import errors in the example script.

…-III Add examples/mistrust_prediction/mistrust_mimic3_logistic_regression.py, a complete reproduction of the Boag et al. 2018 mistrust classifier pipeline using PyHealth: - Loads MIMIC-III via MIMIC3Dataset (CHARTEVENTS + NOTEEVENTS tables) - Builds interpersonal itemid map from D_ITEMS via build_interpersonal_itemids() - Runs MistrustNoncomplianceMIMIC3 and MistrustAutopsyMIMIC3 tasks - Trains LogisticRegression with L1 regularisation (l1_lambda matched to sklearn C=0.1: 2.62e-4 for noncompliance, 1.43e-2 for autopsy) - Evaluates AUC-ROC via pyhealth.trainer.Trainer - --synthetic flag for smoke-test without PhysioNet data access - Documents expected AUC targets (0.667 noncompliance, 0.531 autopsy) and paper-equivalent hyperparameter derivation Paper: arXiv:1808.03827 | Data: MIMIC-III v1.4 (PhysioNet) Co-Authored-By: Varun Tewari <vtewari2@illinois.edu>

vtewari2 mentioned this pull request Apr 12, 2026

feat: Reproduce Boag et al. 2018 medical mistrust pipeline in PyHealth #964

Open

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

feat(examples): Add end-to-end reproduction of Boag et al. 2018 mistrust pipeline on MIMIC-III#962

feat(examples): Add end-to-end reproduction of Boag et al. 2018 mistrust pipeline on MIMIC-III#962
vtewari2 wants to merge 1 commit intosunlabuiuc:masterfrom
vtewari2:pr/uiuccs598dlh/paper-pipeline/eol-mistrust-boag-2018

vtewari2 commented Apr 11, 2026 •

edited

Loading

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

1 participant

Conversation

vtewari2 commented Apr 11, 2026 • edited Loading Uh oh! There was an error while loading. Please reload this page.

Uh oh!

Summary

Background

New: examples/mistrust_prediction/mistrust_mimic3_logistic_regression.py

What it does

L1 lambda derivation (paper-equivalent)

Modes

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

1 participant

vtewari2 commented Apr 11, 2026 •

edited

Loading

New: `examples/mistrust_prediction/mistrust_mimic3_logistic_regression.py`