This repository collects pipelines, codes, and some intermediate results for the study of mosaic SNV/Indels for sperm, blood, and other samples of a control cohort. Raw data of this study is available here and here. The first and second dataset can be accessed through SRA Run Selector.
Pipelines for pre-processing of the bams.
Codes for depth of coverage and insertsize distribution.
Pipeline for population analysis, and codes for plot.
Pipelines for MuTect2 (paired mode) and Strelka2 (somatic mode) variant calling from WGS data
Pipelines for MuTect2 (single mode) has a "Leave One Out" version for the YA cohort, and a "Full Panel of Normal" version for the AA and ASD cohort. The MuTect2 (single mode) result is followed by MosaicForecast, and the variant annotation pipeline.
Codes to plot the calls of different methods on simulated variants.
Codes and data for different CIRCOS plots.
Pipelines for alignment, processing, and germline variant calling of TAS reads.
Pipelines for AF quantification and variant anntations.
Codes to filter and annotate on TAS data.
3. Pipelines for the data analysis, variant filtering, comprehensive annotations, and statistical analysis
After variant calling from different strategies, variants were annotated and filtered by a python script and positive mosaic variants as well as the corresponding samples and additional information were annotated.
Codes for permutation analysis from gnomAD and codes for plotting the permutation result.
UpSet plot is generated from an online tool.
Codes for the estimation of accumulation of mutations through a stepwise exponential regression regression model.
Codes for the analysis of accuracy of number of variants and estimate limit of sampling with age in different groups.
📧 Xiaoxu Yang: u6055394@utah.edu, xiaoxuyanglab@gmail.com,xiy010@health.ucsd.edu
📧 Martin Breuss: martin.breuss@cuanschutz.edu
📧 Joseph Gleeson: jogleeson@health.ucsd.edu
Yang X & Breuss MW, et al., Gleeson JG. Developmental and temporal characteristics of clonal sperm mosaicism. 2021. (Cell, DOI:10.1016/j.cell.2021.07.024, PMID:34388390)