Skip to content
New issue

Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.

By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.

Already on GitHub? Sign in to your account

Project 2: Neutral tumor evolution testing #2

Open
tdelhomme opened this issue Feb 26, 2018 · 7 comments
Open

Project 2: Neutral tumor evolution testing #2

tdelhomme opened this issue Feb 26, 2018 · 7 comments
Assignees
Labels
collaborative project 💡 :octocat: GitHub collaborative project description and discussion on. public TCGA data This project is using only public TCGA data

Comments

@tdelhomme
Copy link
Member

tdelhomme commented Feb 26, 2018

Test neutral tumor evolution model described in Williams et al. on TCGA data.

Data: public MAF files

Script: R script, which output regression coefficient and slope of the model, by sample.

Guidelines: Loop on files

Project source code and documentation is hosted here.

@tdelhomme tdelhomme added the collaborative project 💡 :octocat: GitHub collaborative project description and discussion on. label Feb 26, 2018
@tdelhomme
Copy link
Member Author

tdelhomme commented Feb 26, 2018

@NoemieL, you should have an R script computing this on a VCF file, you just should adapt this to MAF file. Be carefull, you have one MAF per cohort.
@aurelieGabriel would give a presentation on TCGA data and so also on MAF format.

@mfoll
Copy link
Member

mfoll commented Feb 27, 2018

Note that Williams et al. selected only samples with a tumor purity >70%. Estimates of tumor purity for most TCGA cohorts are available in COSMIC under "ASCAT Ploidy and Purity Estimates".

@mfoll
Copy link
Member

mfoll commented Feb 27, 2018

One difficulty is that you have to filter on allelic fraction (>0.12 & <0.25 for example), and MAF files don't always contain this information. VCF do, but are protected and are not filtered with the same QC. Maybe check in which cohort the allelic fraction is available in the MAF file with @aurelieGabriel?

@aurelieGabriel
Copy link
Collaborator

aurelieGabriel commented Feb 27, 2018

In every MAF file the following columns are now reported: t_depth (Read depth across the locus in tumor BAM) and t_alt_count (Read depth supporting the variant allele in tumor BAM).

@mfoll
Copy link
Member

mfoll commented Feb 27, 2018

Good news! (It was not the case before).

@mfoll mfoll added the public TCGA data This project is using only public TCGA data label Feb 28, 2018
@mfoll
Copy link
Member

mfoll commented Mar 2, 2018

Tumor purity are also available in the biospecimen/slides tables available on TCGA data portal like here for LUAD for example

@mfoll
Copy link
Member

mfoll commented Mar 2, 2018

To run a R script in your docker container:

  • you need to make it work as a command line using Rscript.
  • then you need to put it somewhere on GitHub
  • when you create your Dockerfile, you will need to (you should use the biocontainer base image and then conda):
    • install R
    • install the R packages in R using RScript (see here for example)
    • download your R script and copy it somewhere in the docker image (see here for example)

Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment
Labels
collaborative project 💡 :octocat: GitHub collaborative project description and discussion on. public TCGA data This project is using only public TCGA data
Projects
None yet
Development

No branches or pull requests

4 participants