-
Notifications
You must be signed in to change notification settings - Fork 417
New issue
Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.
By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.
Already on GitHub? Sign in to your account
Fix GenomicsDB bug with mismatched intervals, remove duplicated variants from VQSR vcfs, add VQSR CI test #1173
Fix GenomicsDB bug with mismatched intervals, remove duplicated variants from VQSR vcfs, add VQSR CI test #1173
Conversation
|
Should some of these improvements perhaps be "mirrored" or "copied" over into the sentieon-subworkflows? |
sure, but in a separate PR imo. and not all is relevant due to the different structure |
Tests are failing because of the generated csv file. We should decide which vcf files go in there and by extension get annotated: This is the current state for the joint_germline track. No gvcfs, but the output of genotypegvcf and then the one following recalibration.
|
code is looking good, still I think we need to fix some tests |
Agree with @maxulysse, code is an improvement. Anything we can help with re: tests? |
fixing the samplesheet stuff at the moment. Should fix the tests, plus I added a sneaky docker tag in one one. Let me push things and get back to you on the tests |
@nf-core-bot fix linting |
List of changes:
Adding at least stub test for VQSR (see Pytest for VQSR-flow #1027 ), attempts to get some actual teeny tiny data to run through failed
Removing duplicate entries in joint germline VQSR vcf by following instructions described here fixing Joint Genotype Calling Recalibrated VCFs Duplicate Entries; VEP is (Intentionally) Eating Variants #966 For joint variant calling, produce a final VCF file with filtered variants (those that PASS filters) #1102. opting for this approach for now as it reduces the number of computational steps that need to be done.
Group on actual interval files and add relevant meta information to fix occasional grouping mismatches (GenomicsDBImport : Mismatching intervals for input-vcf-files and bed-interval-file #1137)
some channel renaming (Rename channels genotype_intervals #1042)
Refactor single sample filtering (Refactor joint germline calling #1053)
PR checklist
nf-core lint
).nextflow run . -profile test,docker --outdir <OUTDIR>
).docs/usage.md
is updated.docs/output.md
is updated.CHANGELOG.md
is updated.README.md
is updated (including new tool citations and authors/contributors).