Duplicate variants in VCF outputs causing bcftools consensus error #53
Comments
Thanks for the report! This can happen occasionally, and it's not clear what can easily be done about it. It typically occurs when there are two competing edits in your VCF file. Can you check the pass and fail VCFs around position 27967 and see what variants are reported?
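One way to do the check suggested above is to list the records that fall within a window of the position in question. This is a minimal Python sketch, assuming an uncompressed, tab-delimited VCF; the function name `records_near` and the default window size are illustrative and not part of the ARTIC pipeline.

```python
def records_near(vcf_lines, pos, window=50):
    """Return (POS, REF, ALT) for records within `window` bases of `pos`."""
    hits = []
    for line in vcf_lines:
        # Skip header lines and blank lines.
        if line.startswith("#") or not line.strip():
            continue
        fields = line.rstrip("\n").split("\t")
        record_pos = int(fields[1])  # column 2 of a VCF record is POS
        if abs(record_pos - pos) <= window:
            hits.append((record_pos, fields[3], fields[4]))  # POS, REF, ALT
    return hits
```

The function accepts any iterable of lines, so an open file handle works directly; for a gzip-compressed VCF, opening it with `gzip.open(path, "rt")` should give the same kind of iterable.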
Thanks for the quick response! Fails vcf has nothing but the one variant:
Pass vcf has a duplicate at that spot along with all variants after it. These duplicates are also seen in merged.vcf:
OK, this probably isn't the issue I was thinking of; it more likely relates to the duplicate indel at 28250. I'll check whether this is the case, and if so it should be easy to fix. In the meantime, if you remove one of the duplicate lines at 28250 in the VCF, you might find that bcftools consensus can be run manually to generate a consensus genome.
You are correct: removing one of the duplicate lines at 28250 allowed me to generate a consensus genome! Thanks!
@will-rowe I think we need an enhancement, which is to systematically remove duplicate variants present in both pools before calling
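As a rough illustration of the enhancement proposed here, duplicate records could be dropped by keeping only the first record seen for each variant. This is a sketch under stated assumptions, not how ARTIC implements or would implement it: the function name `drop_duplicate_records` is hypothetical, a duplicate is assumed to mean an identical (CHROM, POS, REF, ALT) combination, and keeping the first occurrence is an arbitrary choice.

```python
def drop_duplicate_records(vcf_lines):
    """Keep the first record per (CHROM, POS, REF, ALT); pass headers through."""
    seen = set()
    kept = []
    for line in vcf_lines:
        # Header and blank lines are copied unchanged.
        if line.startswith("#") or not line.strip():
            kept.append(line)
            continue
        fields = line.rstrip("\n").split("\t")
        # Assumption: two records are duplicates when these four columns match,
        # regardless of QUAL, FILTER, or INFO.
        key = (fields[0], fields[1], fields[3], fields[4])
        if key in seen:
            continue
        seen.add(key)
        kept.append(line)
    return kept
```

Because the key ignores QUAL and INFO, two calls of the same indel from different pools collapse into one record even when their scores differ. Whether the first record is the right one to keep would depend on how the pools are merged.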
I've been messing around a bit more with bcftools consensus and even if I do not remove any of the duplicates, I can still create a consensus file with the normal command:
So something earlier in the pipeline must be causing the problem that leads up to this one.
Is there any chance it could be related to this issue? According to the release notes it was resolved in bcftools 1.11, but ARTIC is using a lower version...
Hi,
I've run into what seems to be a rare issue (the first time I've seen it in hundreds of samples) where there are duplicate variants in my *.pass.vcf.gz and *.merged.vcf files, which causes bcftools consensus to fail with an error output of:
I also get the following output from vcf_merge:
Issue is similar to #21 from the looks of it although I checked and I am on the latest version so it must be something else.
For reference I'm currently using the following:
Sorry I don't know too much more, I'll keep looking at it in the meantime. Thanks for any help you can offer and sorry if I missed a solution somewhere,
Darian