Convert gvcf/vcf files to transposed fasta format (tfasta, indexed and compressed).
Jordi Leno-Colorado, Sebastian E. Ramos-Onsins, Ahmed Hafez
gVCF2tFasta -v input.vcf(.gz) -r reference.fa(.gz) -c ploidy [-o outputname] [-n chromosomes.txt] [-i 0/1]
Note: Structural Variants are considered as missing data (N)
-h Help and exit
-v Input VCF file
-r Reference Fasta file
-c Number of allele copies per position and individual (ploidy)
-o Optional Output compressed tFasta filename, Default same as input vcf file
-n Optional File with chromosome(s) to convert and its length, Default use all sequences as in reference fasta file
-i Imputation (Only use with VCF files, not gVCF files):
0 if missing data in VCF is equal to N in tFasta
1 if missing data in VCF is equal to reference fasta in tFasta
Default value is 0
gVCF2tFasta accepts both uncompressed and compressed files (gzip). The input VCF file must sorted. If the fasta file is compressed, it must be compressed with bgzip. An index file (.fai) is created for the reference fasta file if it does not exist.
The output tfasta file is compressed (bgzip) and indexed.
On linux, you can install htslib using the package manager of your distribution. For example, on Ubuntu, you can install htslib using the following command:
$ sudo apt-get install libhts-dev
On macOS, you can install htslib using Homebrew:
$ brew install htslib
Or you can build htslib from source. To do so, follow these steps:
$ git clone https://github.com/samtools/htslib.git
$ cd htslib
$ git checkout 1.20
$ autoreconf -i # Build the configure script and install files it uses
$ ./configure # Optional but recommended, for choosing extra functionality
$ make
## make install or sudo make install as required
$ make install
For more information, see the htslib GITHUB
$ git clone https://github.com/sramosonsins/gVCF2tFasta.git
$ cd gVCF2tFasta
## configure cmake
$ cmake -B ./build -DCMAKE_CXX_COMPILER=g++ -DCMAKE_C_COMPILER=gcc -DCMAKE_BUILD_TYPE=Release -S ./
# build
$ cmake --build ./build --config Release
Or you can try:
sh ./build.sh
$ ./build/gVCF2tFasta -h
$ sh run_examples_gVCF2tFasta.sh