Explore, compare, and evaluate Bioconductor packages related to genomic copy number analysis
Genomic amplifications and deletions are found in most (all?) tumor genomes. A common practice today is to do low coverage DNA sequencing (0.5x, for instance) of a tumor genome, and a matched normal genome (from the same subject). Judicious comparison of the the two sequence genomes illuminates structural changes in the tumor.
Copy number changes in tumors vary from broad (an entire chromosome arm) to focal (i.e., a 10kb amplification, loss of heterozygosity or gain). Detection methods should be sensitive enough to detect these very different phenomena in noisy low-coverage data.
Our purpose here is to provide
- A tumor/normal single chromosome pair of bam files (with accompanying index files)
- A reference analysis, using the popular SeqSeg matlab program from the Broad Institute
- A tutorial on the exploratory data analysis of these files using "native" Bioconductor capabilities
- Demonstrate (and evaluate) the capabilities of many of the Bioconductor copy number analysis packages
Bioconductor Packages
Non Biocondcutor packages
- Alkan, C., et al. (2011). "Genome structural variation discovery and genotyping." Nat Rev Genet 12(5): 363-376.
- Duan J, Zhang J-G, Deng H-W, Wang Y-P (2013) Comparative Studies of Copy Number Variation Detection Methods for Next-Generation Sequencing Technologies. PLoS ONE 8(3): e59128. doi:10.1371/journal.pone.0059128
- http://s3.amazonaws.com/copy-number-analysis/tumorA.chr4.bam
- http://s3.amazonaws.com/copy-number-analysis/tumorA.chr4.bam.bai
- http://s3.amazonaws.com/copy-number-analysis/normalA.chr4.bam
- http://s3.amazonaws.com/copy-number-analysis/normalA.chr4.bam.bai
Use, e.g.,
download.file(url="http://s3.amazonaws.com/copy-number-analysis/tumorA.chr4.bam.bai",
destfile="tumorA.chr4.bam.bai")
We have done some primary Exploratory Data Analysis on the Normal and Tumor Sample Datasets.