-
Notifications
You must be signed in to change notification settings - Fork 596
New issue
Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.
By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.
Already on GitHub? Sign in to your account
Feature request for VariantsToTable: option to preserve numeric genotypes #8160
Comments
@bensprung So I thought this would be a trivial change. It turns out that encoding the Genotype as something like |
Well, no worries, thanks so much for working on it! |
* Expose two public methods in VCFEncoder writeGtField and encodeGtField * Supports broadinstitute/gatk#8160 but seems like a useful thing to be able to do in general * minor breaking change in VCFEncoder, made methods formatVCFField and buildAlleleStrings static It is unlikely anyone overrides either of these methods so it should not be a problem.
Blocked on samtools/htsjdk#1648 |
* Expose two public methods in VCFEncoder writeGtField and encodeGtField * Supports broadinstitute/gatk#8160 but seems like a useful thing to be able to do in general * minor breaking change in VCFEncoder, made methods formatVCFField and buildAlleleStrings static It is unlikely anyone overrides either of these methods so it should not be a problem.
* Expose the ability to encode a Genotoype into a GT field by exposing two public methods in VCFEncoder: writeGtField and encodeGtField * Supports broadinstitute/gatk#8160 but seems like a useful thing to be able to do in general * minor breaking change in VCFEncoder, made methods formatVCFField and buildAlleleStrings static It is unlikely anyone overrides either of these methods so it should not be a problem.
The necesssary change went into htsjdk so we can do this now whenever we pick up a new release of htsjdk. |
* add an new option to VariantsToTable to allow output VCF style numeric GT fields previously it always output the actual bases of the Allele in the GT spot * resolves #8160 * updates htsjdk to 3.0.5
* add an new option to VariantsToTable to allow output VCF style numeric GT fields previously it always output the actual bases of the Allele in the GT spot * resolves #8160 * updates htsjdk to 3.0.5
Following the discussion here, I'd like to request a feature for VariantsToTable. Currently, when genotypes are specified via
-GF GT
, the numeric genotypes (e.g.0/1
) from the input VCF are replaced in the output TSV by the actual alleles (e.g.CTACCCT/AACCCA
). I can see why that behavior would be desirable sometimes, but in cases where there are a lot of long variants and a lot of samples, this makes the resulting TSV file quite a bit more hefty than it needs to be (since that info is already in the REF and ALT fields). Therefore I think an option to output the original numeric genotypes from the VCF would be useful. Thanks!The text was updated successfully, but these errors were encountered: