-
Notifications
You must be signed in to change notification settings - Fork 14
New issue
Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.
By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.
Already on GitHub? Sign in to your account
update mutational signatures module #300
Conversation
Hi @rjcorb! Based on some offline discussions with the Shlien lab, Sharon, and you - I think we should try the following approach for subsetting v3 mutational signatures pan-cancer:
Please see the code here for the mapping file and example script for commands for signature inclusion. |
@jharenza the analysis has been re-run with the filtered v3 signature subset. As expected, we see average SBS3 exposure levels increase in the filtered vs. unfiltered analysis (attached). Let me now if there are any other comparisons you'd like to see between runs. |
@rjcorb let's add this as an agenda item to discuss next week |
…iatricOpenTargets/OpenPedCan-analysis into update-mutational-signatures
@jharenza the module has been re-run on v12 data and is ready for review. The |
There was a problem hiding this comment.
Choose a reason for hiding this comment
The reason will be displayed to describe this comment to others. Learn more.
Hi @rjcorb the docker image is merged and this is ready for a rerun.
Two immediate things.
- analyses/mutational-signatures/plots/nature/bubble_matrix_nature_mutation_sig.png has non-signature rows
- I am wondering if we should save the images within a dataframe/rds file instead of committing them. I saw somewhere, but cannot find, where there is a way now to put images into a dataframe... just thinking of how to save some space. For now, perhaps you can put the broad histology plots in scratch?
@jharenza I have updated the module with requested changes |
going to merge this without a second GA, since the only merge since was docs |
Purpose/implementation Section
What scientific question is your analysis addressing?
The mutational signatures analysis module was updated to generate signature weight matrices from the latest versions of multiple mutational signature databases.
What was your approach?
01-known_signatures.Rmd
was updated to generate per-sample mutational signature weight matrices of COSMIC (v2 and v3.3) and Nature single-base substitution (SBS) signatures.02-cosmic_dbs_signatures.Rmd
generates signature weight matrices of COSMIC v3.3 double-base substitution (DBS) signatures.03-fit_cns_signatures.R
determines the relative contributions of 8 known (adult) CNS signatures in the OpenPedCan data (considers both WXS and WGS) usingdeconstructSigs::whichSignatures()
.04-explore_hypermutators.Rmd
investigates whether hyper-mutant and/or ultra-hypermutant samples have enrichment of signatures 3, 18, and/or MMR.What GitHub issue does your pull request address?
Ticket #453
Directions for reviewers. Tell potential reviewers what kind of feedback you are soliciting.
Which areas should receive a particularly close look?
Is there anything that you want to discuss further?
Is the analysis in a mature enough form that the resulting figure(s) and/or table(s) are ready for review?
Results
What types of results are included (e.g., table, figure)?
{DATABASE}_signature_exposures.tsv
files store a sample x signature weight matrix of mutational signatures from each database.{DATABASE}_signature_results.tsv
files store number of signature mutations per Mb for each sample and signature.fitted_exposure_signal-cns-deconstructSigs.rds
anddeconstructSigs_cns_exposure_merged.tsv
store CNS mutational signature results and results merged with histology data, respectively.hypermutator_sig_matrix.tsv
includes CNS mutational signature weights for samples defined as hypermutants.sig_matrix_by_molecular_subtype.tsv
stores CNS mutational signature weights and includes molecular subtype to assess TP53 mutant mutational signatures.bubble_matrix_{DATABASE}_mutation_sig.png
; plots showing prevalence of each mutational signature in each histology group by database queried.{HISTOLOGY_GROUP}_{DATABASE}_mutation_sig.png
; grouped bar charts showing mutational signature weights be histology group and database.{BS_ID}_{DATABASE}_mutation_sig.png
; per-sample barplots showing contributions of top 5 mutational signatures and error, by database queried.What is your summary of the results?
Reproducibility Checklist
Documentation Checklist
README
and it is up to date.analyses/README.md
and the entry is up to date.