-
Notifications
You must be signed in to change notification settings - Fork 32
The tormes report files
under development
Once TORMES analysis ends, the results will be summarized in an interactive html-file, that is automatically performed by the tormes-report
script included in TORMES environment. The report file (tormes_report.html) can be viewed in any web browser (if some browser return problems, please try to open it with a different one) and does not require any other data to be visualized, so the results can be easily visualized by different people just by sharing the tormes_report.html file. All the tables and figures included in the report file are downloadable.
However, probably the report is not showing exactly what you want or need. Additionally, you may have extra results, figures and tables that you would like to include before sharing with others. Or perhaps, you might want to modify some of the tables and/or figures that are already present in the report.
This is totally possible!
Some experience in R and R markdowns is needed, but we will try to make it easy through this wiki. Let's go step by step:
After TORMES run, all the results generated and intermediate files will be contained in the output directory specified when running TORMES (see https://github.com/nmquijada/tormes#output for further information).
Inside this output directory, you will find a zipped directory: report_files.tgz. This directory contains all the files that were required to render the tormes_report.html. The information contained in this directory is redundant, as several results from other directories (antibiotic_resistance_genes, mlst, etc.) are copied into this directory. This is done intentionally, as the unzipped report_files.tgz can be used as the working directory for the user to easily modify the report. The directory is zipped in order to ocupy the less space as possible.
For unzipping report_files.tgz, type:
tar xzf report_files.tgz
Now, a directory (report_files) will appear and the zipped report_files.tgz will remain too. By looking to the content of the report_files directory, you will find the following files (I will sort them in this wiki according to their appearence possition in the tormes_report.html):
-
metadata.txt: the same information that was parsed to the
-m/--metadata
option in TORMES).
- sequencing_assembly_report.txt: contains the information regarding the sequencing and the assembly, generated by Quast and custom scripts.
- taxonomic-identification-kraken2.txt: contains the taxonomic identification results based on k-mers generated by Kraken2.
- taxonomic-identification-16S-rRNA.RDP.txt: contains the taxonomic identification results based on the 16S rRNA gene generated by the RDP Classifier.
- mlst.tab: contains the data generated by mlst.
- summary_statistics.txt: contains the number of core, soft, shell and cloud genes identified in your set of isolates, based on their prevalence within the different isolates. Generated by Roary.
- pangenome.png: image that summarizes the gene composition of each isolate. Generated by roary2svg.pl.
- accessory_binary_genes.fa.newick: newick file with the information of the pangenome tree based on presence/absence of genes between the isolates. Generated by Roary.
- core_gene_alignment.newick: newick file with the information of pangenome tree based on core-genome-SNP distances. Generated by Roary.
- *_argannot.tab: one file per isolate. Contains the results of the antibiotic resistance genes screening of that isolate against the ARG-ANNOT database. Generated by ABRicate.
- *_card.tab: une file per isolate. Contains the results of the antibiotic resistance genes screening of that isolate against the CARD database. Generated by ABRicate.
- *_resfinder.tab: une file per isolate. Contains the results of the antibiotic resistance genes screening of that isolate against the RESFINDER database. Generated by ABRicate.
- argannot_summary.tab: matrix containing the presence/absence of the antibiotic resistance genes (from the ARG-ANNOT database) in the different isolates (coverage and identity > 90%). The value correspond to the % of identity. Generated by ABRicate and custom scripts included in tormes-report.
- card_summary.tab: matrix containing the presence/absence of the antibiotic resistance genes (from the CARD database) in the different isolates (coverage and identity > 90%). The value correspond to the % of identity. Generated by ABRicate and custom scripts included in tormes-report.
- resfinder_summary.tab: matrix containing the presence/absence of the antibiotic resistance genes (from the RESFINDER database) in the different isolates (coverage and identity > 90%). The value correspond to the % of identity. Generated by ABRicate and custom scripts included in tormes-report.
- *_vfdb.tab: one file per isolate. Contains the results of the virulence genes screening of that isolate against the VFDB database. Generated by ABRicate.
Additionally, if used the option -g/--genera Salmonella
when running TORMES, you will also get:
- serotyping.txt: serotyping results by using SISTR
- *_PointFinder_results.txt: one file per isolate. Contain the results of point mutations known related to antimicrobial resistant. Generated with PointFinder
- *_plasmids.tab: one file per isolate. Contains the results of the plasmid replicons screening of that isolate against the PlasmidFinder database. Generated by ABRicate.
And finally, three additional files generated by TORMES:
- render_report.sh
- tormes_report.html
- tormes_report.Rmd