Skip to content

Conversation

@tristan-f-r
Copy link
Contributor

@tristan-f-r tristan-f-r commented Jun 17, 2025

(TLDR): This primarily integrates CI, which runs the yeast-osmotic-stress, hiv, and responsenet datasets, displaying them on a little webpage.

This supersedes #5 and #11 (and, consequently, #7 and #6) by integrating the RN data and the synthetic pathway data into this pull request (for testing the PRA workflow). Note: This PR contains scripts (including from #5 and #11) that process biological data and should be reviewed.

Organization

This closes #21, with the following organization:

.
├── configs
├── datasets
├── spras
└── web
  • configs is our SPRAS configs, split by data type (DMMM, PRA)
  • datasets is each category of datasets, and is mainly meant for heuristic organization (e.g. hiv and yeast-osmotic-stress are still separate, but synthetic networks that come from the same database would be categorized in the same dataset folder. This isn't an all-encompassing standard and should be broken when necessary)
  • spras as a submodule (it is easier to make commits to spras-benchmarking than it is to spras)
  • web for web output

Eventually, there will be an analysis folder for further analysis not yet supported by SPRAS, making SPRAS-benchmarking double-serve as a staging area for analysis.

oliverfanderson and others added 30 commits October 31, 2024 14:59
R script for parsing human interactome based on experimental evidence values
R script for taking PANTHER pathways and parsing into STRING ID edge list
Now can access a list of TFs and Receptors to process a PANTHER pathway into sources and targets.
Used the function to generate Source and Target sets for 14 PANTHER pathways
…ss the data to make the combined networks (still in progress)
@annaritz
Copy link
Contributor

There is a lot here - I don't have the technical chops to do a full code review without talking through the PR with @tristan-f-r and @AMINOexe. I made a few comments, and look forward to talking more.

One question for @agitter is whether we should perform this merge as a first-attempt at the repo (since the repo is so new), knowing that not all details have been thought out yet. It would help us get a sense of how to run multiple datasets, and would be helpful to have this merged when we think about file storage solutions.

@ntalluri
Copy link
Collaborator

ntalluri commented Jul 1, 2025

The synthetic pathways dataset and Gabes ResponseNet dataset still need a review over the code in general. I think we should open PRs again to review those once this is merged rather than trying to review those in this PR.

@tristan-f-r
Copy link
Contributor Author

tristan-f-r commented Jul 1, 2025

[Waited for CI to pass] I've removed the unreviewed datasets and am splitting it to a separate PR.

@tristan-f-r tristan-f-r merged commit 02a8b4a into Reed-CompBio:main Jul 1, 2025
@tristan-f-r tristan-f-r mentioned this pull request Jul 1, 2025
Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment

Labels

None yet

Projects

None yet

Development

Successfully merging this pull request may close these issues.

Proposed Repo Structure

6 participants