update create subset files to enrich for rnaseq batch correction #297
Add this suggestion to a batch that can be applied as a single commit.
This suggestion is invalid because no changes were made to the code.
Suggestions cannot be applied while the pull request is closed.
Suggestions cannot be applied while viewing a subset of changes.
Only one suggestion per line can be applied in a batch.
Add this suggestion to a batch that can be applied as a single commit.
Applying suggestions on deleted lines is not supported.
You must change the existing code in this line in order to create a valid suggestion.
Outdated suggestions cannot be applied.
This suggestion has been applied or marked resolved.
Suggestions cannot be applied from pending reviews.
Suggestions cannot be applied on multi-line comments.
Suggestions cannot be applied while the pull request is queued to merge.
Suggestion cannot be applied right now. Please check back later.
Purpose/implementation Section
What scientific question is your analysis addressing?
Update create-subset-files module to enrich expression RDS files with samples suitable to fully test the rnaseq batch correction module
What was your approach?
1). Create a script,
00-enrich-batch-correction-examples.Rmd
to determine a list of bs_ids required for tumor-only and tumor-normal RNA-Seq batch correction analyses as described in the issue ticket and discussion2) Include randomly selected samples from each category (
subtypes
,cancer groups
, andnormal subgroups
)in the RNA-Seq RDS subset filesWhat GitHub issue does your pull request address?
Directions for reviewers. Tell potential reviewers what kind of feedback you are soliciting.
Which areas should receive a particularly close look?
00-enrich-batch-correction-examples.Rmd
script in the module. The updated histologies files include MYCN status values in themolecular_subtype
column (mostly from the clinical data while themycn-molecular-subtyping
module is being reviewed) required for selecting Neuroblastoma samples for batchIs there anything that you want to discuss further?
Sample selected:
5 MYCN amp
and5 MYCN non-amp
5 poly-A
and5 stranded
5 poly-A
and5 stranded
10 Brain Cortex
and10 Brain Cerebellum
Is the analysis in a mature enough form that the resulting figure(s) and/or table(s) are ready for review?
Yes
Results
What is your summary of the results?
../../data/testing/v11
Reproducibility Checklist
Documentation Checklist
README
and it is up to date.analyses/README.md
and the entry is up to date.