Invalid VCF produced by cnvpytor view #178

Closed
archmageirvine opened this issue Apr 23, 2023 · 1 comment

Comments

@archmageirvine
Contributor

When I do

cnvpytor -root file.pytor -view 1000
set print_filename output.vcf
print calls

The resulting VCF is invalid in at least three ways:

  1. The header line for GT contains a trailing ; that should not be there:
##FORMAT=<ID=GT,Number=1,Type=String,Description="Genotype">;
##FORMAT=<ID=CN,Number=1,Type=Integer,Description="Copy number genotype for imprecise events">

  2. The REF column is all missing (.), but that is not permitted in VCF v4.1 (use N instead).

  3. The CN FORMAT field is defined to be Integer, but some records are coming out with floating-point values in that field:

1       145097001       CNVpytor_dup79  .       <DUP>   .       PASS    END=145326000;IMPRECISE;SVLEN=229000;SVTYPE=DUP;pytorRD=2.514962864006279;pytorP1=0.0;pytorP2=0.0;pytorP3=0.0;pytorP4=0.0;pytorQ0=0.1908020972780009;pytorPN=0.0;pytorDG=286400.0;pytorCL=rd_mean_shift GT:CN   ./1:2.51
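The three problems listed above can be detected mechanically. The following is a minimal sketch, not part of cnvpytor: the function name `find_vcf_problems` is hypothetical, and it assumes tab-delimited records and a single `##FORMAT=<ID=CN,...>` header line.

```python
import re


def find_vcf_problems(lines):
    """Return (line_number, message) pairs for the three issues described above.

    Hypothetical helper; assumes records are tab-delimited.
    """
    problems = []
    cn_type = None  # declared Type of the CN FORMAT field, once seen
    for lineno, raw in enumerate(lines, start=1):
        line = raw.rstrip("\n")
        if line.startswith("##"):
            # Structured header lines (##KEY=<...>) should end at the closing '>'.
            if "=<" in line and not line.endswith(">"):
                problems.append((lineno, "header line does not end with '>'"))
            match = re.match(r"##FORMAT=<ID=CN,.*?Type=(\w+)", line)
            if match:
                cn_type = match.group(1)
            continue
        if not line or line.startswith("#"):
            continue  # skip blank lines and the #CHROM column header
        fields = line.split("\t")
        if len(fields) < 8:
            continue  # not a complete record; ignored in this sketch
        if fields[3] == ".":
            problems.append((lineno, "REF is '.'"))
        if cn_type == "Integer" and len(fields) >= 10:
            keys = fields[8].split(":")
            if "CN" in keys:
                idx = keys.index("CN")
                for sample in fields[9:]:
                    values = sample.split(":")
                    if idx < len(values) and values[idx] != ".":
                        if not re.fullmatch(r"-?\d+", values[idx]):
                            problems.append(
                                (lineno, "CN declared Integer but value is " + values[idx])
                            )
    return problems
```

Calling it as `find_vcf_problems(open("output.vcf"))` would list the offending lines; an empty result only means that none of these three checks fired, not that the file is fully valid.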

All of these I can manually work around by correcting the problems with sed etc., but it would be nice to have these fixed in the tool itself.
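For anyone who needs the workaround in the meantime, here is a rough Python equivalent of the sed-style corrections. It is only a sketch: `repair_vcf_lines` is a hypothetical name, records are assumed to be tab-delimited, and redeclaring CN as Float (rather than rounding the values to integers) is an assumption of this example.

```python
def repair_vcf_lines(lines):
    """Return corrected copies of the given VCF lines (without trailing newlines).

    Hypothetical workaround helper, not part of cnvpytor.
    """
    fixed = []
    for raw in lines:
        line = raw.rstrip("\n")
        if line.startswith("##"):
            # 1. Drop a stray ';' after the closing '>' of a header line.
            if line.endswith(">;"):
                line = line[:-1]
            # 3. Declare CN as Float so values such as 2.51 match the header
            #    (assumption: keep the values and change the declared type).
            if line.startswith("##FORMAT=<ID=CN,"):
                line = line.replace("Type=Integer", "Type=Float", 1)
        elif line and not line.startswith("#"):
            fields = line.split("\t")
            # 2. Replace a missing REF ('.') with 'N'; REF is the fourth column.
            if len(fields) > 3 and fields[3] == ".":
                fields[3] = "N"
                line = "\t".join(fields)
        fixed.append(line)
    return fixed
```

A typical use would be to read `output.vcf`, pass the lines through `repair_vcf_lines`, and write the result joined with newlines to a new file.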

@arpanda
Member

arpanda commented Apr 24, 2023

We fixed points 1 and 3 previously. This is the current VCF header for the GT and CN tags:

##FORMAT=<ID=GT,Number=1,Type=String,Description="Genotype">
##FORMAT=<ID=CN,Number=1,Type=Float,Description="Copy number genotype for imprecise events">

The REF column is all missing ., but that is not permitted in VCF v4.1 (use N instead).

Thanks for pointing this out. I have updated the REF column to N.

-Arijit
