Skip to content
This repository has been archived by the owner on Jun 21, 2023. It is now read-only.

Proposed Analysis: apply TP53 & NF1 classifiers to PBTA data #165

Closed
jaclyn-taroni opened this issue Oct 23, 2019 · 9 comments
Closed

Proposed Analysis: apply TP53 & NF1 classifiers to PBTA data #165

jaclyn-taroni opened this issue Oct 23, 2019 · 9 comments
Labels
in progress Someone is working on this issue, but feel free to propose an alternative approach! proposed analysis transcriptomic Related to or requires transcriptomic data

Comments

@jaclyn-taroni
Copy link
Member

jaclyn-taroni commented Oct 23, 2019

Scientific goals

Identify samples that have TP53 or NF1 inactivation using gene expression data.

Proposed methods

  • This would use a pre-existing classifier (trained on TCGA data) that was described in Knijnenburg et al.
  • It seems appropriate to keep the poly-A and stranded samples separate for this.

Required input data

Gene expression data. I believe relative abundance data at the gene-level is what was used to train the classifier. In this cohort that would correspond to:

pbta-gene-expression-rsem-fpkm.polya.rds
pbta-gene-expression-rsem-fpkm.stranded.rds

Proposed timeline

I think this could be accomplished in 2 weeks.

Relevant literature

@cgreene
Copy link
Collaborator

cgreene commented Oct 24, 2019

From a conversation with @gwaygenomics and @PichaiRaman I have some additional info that may be helpful:

Hi Pichai and Casey,

Probably the easiest way to apply the TP53 classifier is by swapping in PBTA data in this script: https://github.com/marislab/pdx-classification/blob/master/1.apply-classifier.ipynb

This is the analysis done for the PPTC PDX paper.

Thanks!
Greg

@jharenza jharenza changed the title Proposed Analysis: apply TP53 classifier to PBTA data Proposed Analysis: apply TP53 & NF1 classifiers to PBTA data Oct 24, 2019
@jharenza
Copy link
Collaborator

jharenza commented Oct 24, 2019

Updated to add NF1 since that classifier also works well in pediatric data and both can be accomplished at the same time.

@jaclyn-taroni
Copy link
Member Author

Relevant pub for NF1: Way et al. BMC Genomics. 2017.

@jaclyn-taroni jaclyn-taroni added the transcriptomic Related to or requires transcriptomic data label Oct 26, 2019
@jharenza
Copy link
Collaborator

jharenza commented Dec 4, 2019

@kgaonkar6 will work on this!

@jharenza jharenza added the in progress Someone is working on this issue, but feel free to propose an alternative approach! label Dec 4, 2019
@gwaybio
Copy link
Collaborator

gwaybio commented Jan 9, 2020

perhaps important to note distinction here:

Relevant pub for NF1: Way et al. BMC Genomics. 2017.

This demonstrated proof of concept for NF1 loss classification in Glioblastoma specifically.

Initially described in Knijnenburg et al. Cell Reports. 2018.

This trained an TP53 alteration classifier using pancancer data (so many more cancer types than just glioblastoma)

Applied in Rokita et al. bioRxiv. 2019.

This used the Knijnenburg et al classifier for the TP53 analysis. But the NF1 and Ras coefficients were built in the original PanCancer classifier paper. The current PBTA analysis classifier also uses the Knijnenburg coefficients for the TP53 analysis and the original PanCan classifier coefficients in the NF1 analysis.


Edit:

Specified references for TP53 and NF1 classifiers 👀

@jaclyn-taroni
Copy link
Member Author

Once #128 has been completed, we will want to include CNV data when evaluating the results of these classifiers. I will also note that currently there is a very low number of NF1 alterations in the poly-A data so the AUROC results may be a bit misleading (see discussion on #385) and we may want to add confidence intervals to the plot (@cgreene found this package - https://rdrr.io/cran/pROC/man/ci.auc.html).

@jaclyn-taroni
Copy link
Member Author

Addressed via the linked pull requests. Closing in favor of filing an updated analysis ticket if needed.

@kgaonkar6
Copy link
Collaborator

Hi @jaclyn-taroni Do we need an updated analysis ticket for this analysis which includes CNV data ?

@kgaonkar6 kgaonkar6 mentioned this issue Mar 26, 2020
5 tasks
@jaclyn-taroni
Copy link
Member Author

Good idea @kgaonkar6

Sign up for free to subscribe to this conversation on GitHub. Already have an account? Sign in.
Labels
in progress Someone is working on this issue, but feel free to propose an alternative approach! proposed analysis transcriptomic Related to or requires transcriptomic data
Projects
None yet
Development

No branches or pull requests

5 participants