This repository contains complete computational methods including source code, processed data outputs and explanatory notes. It is an accompaniment to the paper, as published in:
Jia Zhang, Zoe T Richards, Arne A S Adam, Cheong Xin Chan, Chuya Shinzato, James Gilmour, Luke Thomas, Jan M Strugnell, David J Miller, Ira Cooke, Evolutionary responses of a reef-building coral to climate change at the end of the last glacial maximum, Molecular Biology and Evolution, 2022;, msac201, https://doi.org/10.1093/molbev/msac201
This repository has also been archived on Zenodo:
Sample sequencing and data processing
- Sampling sites and sequencing data
- Variant Calling and quality control
- Phasing haplotypes
- Building pseudo-chromosomes
Population genetics of the coral host
- Basic population genetic statistics
- Population structure analysis
- FineStructure analysis
- Outgroup analysis using Japanese A. digitifera
- IBD and HBD segments
- Demographic history based on SMC
- Demographic model fitting using fastsimcoal
- Demographic model validation by simulation
- Demographic model without a bottleneck
- All .est and .tpl files used for alternative demographic models
Symbiodiniaceae profiles
Signatures of selection in the coral host
- Selection analysis based on EHH and allele frequency
- Comparison between EHH and allele-frequency-based approaches
- Identification of the top 1% of regions putatively under selection
- GO Enrichment of genes under selection
- Estimating the TMRCA for selective sweeps
- Phylogenetic tree of haem peroxidases
Miscellaneous
- Functional annotation of genes
- Checking the mislocated sample using radseq data
- Estimating the age of individual alleles
All of the sections above are provided as processed markdown files. Clicking the link should display a web readable page with text, a few select commands and plots and tables. The code used to generate these pages is provided in the corresponding .Rmd file. If you would like to run the code in these files yourself you will need to;
Checkout this repository
git clone https://github.com/bakeronit/acropora_digitifera_wgs.git
Download additional data not hosted on github due to size constraints
cd acropora_digitifera_wgs
wget 'http://data.qld.edu.au/public/Q5999/bakeronit/acropora_digitifera_wgs/data_essential.tgz' -O data_essential.tgz
tar -zxvf data_essential.tgz
Open the project file in RStudio and open the desired file. After installing any required R packages the code should run and produce plots and tables identical to those shown in the web links above.
In general the code provided here is for illustration purposes only. In many cases the commands we provide should work as-is on your system. In some cases however, we have provided scripts that include strategies for parallelisation (eg via gnu parallel, or using a PBS job management system). These are highly platform specific and you will need to modify them to run on your own system.