GROOT
is a tool to type Antibiotic Resistance Genes (ARGs) in metagenomic samples (a.k.a. Resistome Profiling). It combines variation graph representation of gene sets with an LSH indexing scheme to allow for fast classification of metagenomic reads. Subsequent hierarchical local alignment of classified reads against graph traversals facilitates accurate reconstruction of full-length gene sequences using a simple scoring scheme.
GROOT
will output an ARG alignment file (in BAM format) that contains the graph traversals possible for each query read; the alignment file is then used by GROOT
to generate a resistome profile.
Since version 0.4, GROOT
will also output the variation graphs which had reads align. These graphs are in GFA format, allowing you to visualise graph alignments using Bandage and determine which variants of a given ARG type are dominant in your metagenomes. Read the documentation for more info.
Since version 1.0.0, GROOT
has had a partial re-write (merging features and changes from my baby groot project). It now uses the excellent LSH Ensemble library as the LSH index, enabling containment search for read seeding. I've also improved my dev know-how and GROOT
is now more efficient. However, these changes have meant that I've needed to change some of the CLI, so please read the docs.
Check out the releases to download a binary. Alternatively, install using Bioconda or compile the software from source.
conda install -c bioconda groot
brew install brewsci/bio/groot
GROOT
is written in Go (v1.14) - to compile from source you will first need the Go tool chain. Once you have it, try something like this to compile:
# Clone this repository
git clone https://github.com/will-rowe/groot.git
# Go into the repository and get the package dependencies
cd groot
go get -d -t -v ./...
# Run the unit tests
go test -v ./...
# Compile the program
go build ./
# Call the program
./groot --help
GROOT
is called by typing groot, followed by the subcommand you wish to run. There are three main subcommands: index, align and report. This quick start will show you how to get things running but it is recommended to follow the documentation.
# Get a pre-clustered ARG database
groot get -d arg-annot
# Create graphs and index
groot index -m arg-annot.90 -i grootIndex -w 100
# Align reads and report
groot align -i grootIndex -f reads.fq | groot report
note: it's recommended to index the graph using a window size ~= your maximum expected read length, so for 100bp reads, use
-w 100
Please readthedocs for more extensive documentation and a tutorial.
GROOT
has now been published in Bioinformatics: