From 48965cc58e12d668da4450346a9c9c78699348e2 Mon Sep 17 00:00:00 2001 From: Joyjit Daw Date: Sun, 26 May 2024 18:47:42 -0700 Subject: [PATCH] specify bgz compression required --- README.md | 2 +- 1 file changed, 1 insertion(+), 1 deletion(-) diff --git a/README.md b/README.md index b5311c640..04d0b9c88 100644 --- a/README.md +++ b/README.md @@ -290,7 +290,7 @@ To correct reads, run: $ dorado correct reads.fastq(.gz) > corrected_reads.fasta ``` -Dorado correct only supports FASTX(.gz) as the input and generates a FASTA file as output. An index file is generated for the input FASTX file in the same folder unless one is already present. Please ensure that the folder with the input file is writeable by the `dorado` process and has sufficient disk space (no more than 10GB should be necessary for a whole genome dataset). +Dorado correct only supports FASTX(.gz) as the input and generates a FASTA file as output. The input can be uncompressed or compressed with `bgz`. An index file is generated for the input FASTX file in the same folder unless one is already present. Please ensure that the folder with the input file is writeable by the `dorado` process and has sufficient disk space (no more than 10GB should be necessary for a whole genome dataset). The error correction tool is both compute and memory intensive. As a result, it is best run on a system with multiple high performance CPU cores ( > 64 cores), large system memory ( > 256GB) and a modern GPU with a large VRAM ( > 32GB).