GitHub - britolab/PRO-seq: Scripts and R code used to process sequencing data and generate figures for E. coli & microbiome PRO-seq

PRO-seq and RNAseq in E. coli and microbiomes

Much of the code in this repo was not used in the final analyses. The most important scripts are listed below.

metagenome assembly

run_clean_assemble_bin.sh is the master script for read QC, metagenome assembly, contig binning, bin QC, and taxonomic assignment. Parts of this script are hard-coded to work with the Cornell BioHPC SGE scheduler and the Brito Lab server structure.

bin annotation

Genes were called in metagenomic bins using run_prokka.sh. gtf annotations output by prokka can be converted to R objects using gtf2tibble.R.

transcript mapping

The main scripts for cleaning up transcript reads and mapping those reads to references can be found in the Danko Lab proseq2.0 repo. Once you have bam files, per-base coverage reports can be generated with get_pileup_correct.sh.

data processing and visualization

EC_peaks.rmd and Stool_PRO-seq.Rmd contain the R code used for the E. coli and human microbiome analyses, respectively. The Rmarkdown documents are ordered by main sections (#) and subsections (##/###).

Post-review, analyses were conducted in separate notebooks. These notebooks can be found in data_processing_and_figures.

data availability

E. coli sequencing reads: https://www.ncbi.nlm.nih.gov/sra/PRJNA800038
microbiome sequencing reads: https://www.ncbi.nlm.nih.gov/sra/PRJNA800070

Name		Name	Last commit message	Last commit date
Latest commit History 9 Commits
data_processing_and_figures		data_processing_and_figures
metagenome_assembly		metagenome_assembly
miscellaneous		miscellaneous
LICENSE		LICENSE
README.md		README.md

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Repository files navigation

PRO-seq and RNAseq in E. coli and microbiomes

metagenome assembly

bin annotation

transcript mapping

data processing and visualization

data availability

About

Releases

Packages

Languages

License

britolab/PRO-seq

Folders and files

Latest commit

History

Repository files navigation

PRO-seq and RNAseq in E. coli and microbiomes

metagenome assembly

bin annotation

transcript mapping

data processing and visualization

data availability

About

Resources

License

Stars

Watchers

Forks

Releases

Packages 0

Languages

Packages