- LiftOver
- Phasing
- GenotypeHarmonizer
- Imputation
- CreateStats
During this step, the genomic assembly of the data is converted from one genomebuild to another. At this moment, there are two LiftOver options:
- hg18 -> hg19
- hg38 -> hg19
Tools used by the LiftOver step:
- LiftOver, version 20161011
- PLINK, version 1.9
During this step, the haplotype structure of the data is determined.
Tools used by the Phasing step:
- SHAPEIT, v2.r837-static
During this step, quality control is performed and the data is aligned to the reference data.
Tools used by the GenotypeHarmonizer step:
- GenotypeHarmonizer, version 1.4.18
During this step, the data is split into many chunks, in order to impute the data properly.
Tools used by the Imputation step:
- IMPUTE4, version r265
During this step, statistics are measured for the imputed data.
Tools used by the Imputation step:
- QCTOOL, version 1.4
mkdir /groups/${GROUP}/${tmpDir}/generatedscripts/${PROJECT}
cp datasheet.csv /groups/${GROUP}/${tmpDir}/generatedscripts/${PROJECT}/
- study: Name of your project
- rawdata: Location of the input data (should be in .bed, .bim, .fam format per chromosome)
- genomeBuild: The genomebuild of your data
- referenceGenome: The reference genome used for imputation, currently supported: gonl and 1000G
- run: Name of the run (e.g. run01)
cp generate_template.sh /groups/${GROUP}/${tmpDir}/generatedscripts/${PROJECT}/
sh generate_template.sh
Your jobs are generated and can be found here: /groups/${GROUP}/${tmpDir}/projects/${PROJECT}/${RUN}/jobs/
sh submit.sh
Your results can be found here: /groups/${GROUP}/${tmpDir}/projects/${PROJECT}/${RUN}/results/
NOTE: In case of a crash, the temporary results can be found here: /groups/${GROUP}/${tmpDir}/tmp/${PROJECT}/