Skip to content

Detection of chromatin melting and condensation events in regions of interest from GAM data

License

Notifications You must be signed in to change notification settings

DominikSzabo1/MELTRONIC

Folders and files

NameName
Last commit message
Last commit date

Latest commit

 

History

42 Commits
 
 
 
 
 
 
 
 

Repository files navigation

MELTRONIC

a statistical framework to detect chromatin melting and condensation genome wide from GAM data

DOI:10.5281/zenodo.10528545

System requirements

Developed and tested with R version 3.6.0 Planting of a Tree.
Please refer to the R project website for installation and basic usage examples. Typical installation time on a desktop computer is ~15 minutes.

library(data.table)
library(dplyr)
library(argparser)
library(stringr)

Software was developed and tested using data.table package version 1.12.0, dplyr package version 0.7.8, argparser package version 0.7.1 and stringr package version 1.4.1. No non-standard hardware is required for execution of the scripts.

Available command line applications:

  • matrix_wide_to_long.R:
    Converts a square matrix into a long matrix for IS calculation. Accepts wildcards for processing of multiple chromosomes.
  • long_matrix_to_IS.R:
    Calculates insulation scores at multiple distances (default 100kb - 1Mb, steps of 100kb).
  • MELTRONIC.R:
    Compares insulation score (IS) distributions over regions of interest. Was applied to 120 kb sliding windows accross the entire genome and and long genes for the preparation of the manuscript.

Expected output files can be found in the /data folder.
Matrix conversion, IS calculation and MELTRONIC score calculation for the entire mouse genome takes about two hours on a standard desktop computer.

type

Rscript command_line_apps/matrix_wide_to_long.R --help 

for explanations

Available Snakefiles:

  • Snakefile_wide_to_melt_long_genes:
    Example Snakefile for calculation of melting scores of long protein coding genes (> 200 kb). Includes reshaping of square NPMI matrices to long NPMI matrices, ultra-fast calculation of insulation scores using 10 square sizes, and calculation of melting scores.
  • Snakefile_wide_to_MELTRONIC:
    Example Snakefile for calculation of melting scores in 120 kb sliding windows across the entire mouse genome (autosomes + chrX):. Includes reshaping of square NPMI matrices to long NPMI matrices, ultra-fast calculation of insulation scores using 10 square sizes, and calculation of melting scores.

Snakemake workflow manager was used to ensure reproducibility of analyses, but is not required for melting score calculations.
Please refer to the Snakemake website for installation and basic usage explanations.
Please adjust file paths etc before executing the snakefiles.

Developed and maintained by Dominik Szabó .
Please get in touch for questions and issues: dominik.szabo at mdc-berlin.de

About

Detection of chromatin melting and condensation events in regions of interest from GAM data

Resources

License

Stars

Watchers

Forks

Releases

No releases published

Packages

No packages published

Languages