4.2.1
Dragen
- added new protocols
- PreparedDragenData
- CombineDragenSampleData
- ReheaderVcf
- new workflow_DRAGEN
- new generate_DragenScripts
Bucketsystem
From this version on the folders Samplesheets
, tmp
, project
and generatedscripts
have a subfolder with the name of the pipeline (e.g. NGS_DNA, NGS_Demultiplexing,VIP, GAP etc). This has been implemented due to the fact that one machine does both the rawdata handling (AGCT/NGS_Demultiplexing) and the variantcalling (GAP/NGS_DNA). Data archiving is still on a seperate machine (chaperone)
Getting rawdata from permanent storage changed
Diagnostic clusters cannot talk directly to permanent storage anymore (i.e. permanent storage can push only).
- Created new protocol: CheckRawDataOnTmp
this protocol comes instead of CopyPrmToTmpData, it will check whether the rawdata is already available, if not than a file will be created in thelogs
folder as${logsDir}/${project}/${project}.data.requested
, this filetype will be checked by a new NGS_Automated protocol (copyRawDataToTmp) and will push the data to the diagnostic cluster
coverage calculations per gVCF (instead of per BAM)
This script will use gVCF files as input, due to all kind of possible rearrangements of the reads during the variantcalling the bam files created by the pipeline are not always the end product. Therefore it is more accurate to calculate coverage based on the variantcaller created gVCF files.
End product is not changed, still in this format:
"Index\tChr\tChr Position Start\tChr Position End\tAverage Counts\tDescription\tReference Length\tCDS\tContig"
But additional file(s) are created to calculate percentages/bases with zero coverage, dp<20, GQ values (.{bedfile}.CoverageOutput.csv
)
Diploid chrX
- Male chromosome X will be called diploid
This means that there is no need to have 2 seperate chr X batches (nonPar and Par))
updated tool versions:
bcfTools 1.16
BEDTools 2.30.0
CADD-v1.4
GATK 4.2.4.1
gVCF2BED 1.0.0 (new tool)
HTSlib 1.16
jq/1.6
multiQC-1.12
ngs-utils 23.04.1
picard 2.26.0
Python 3.10.4
R 4.2.1
SAMTools 1.16.1
vcfAnno v0.3.3
Others
- moved unused protocols to deprecated and removed them from the workflow.csv
vcfToTable
CoverageCalculations
XHMM
Convading
DecisionTree