Link to paper: 10.1038/s41467-020-19921-4
Figure. Illustration of four genes (CDC6, RIO2, NSP1, EXG1) that carry a group of motif co-occurrence rules with a common motif (NHP6B transcription factor binding site, blue line) in their promoter region. The genes diverge in possessing 2 to 4 other DNA motifs (red lines) across the remaining regulatory regions (promoter, 5' and 3' UTRs and terminator), which repurpose the expression of these genes across a range of almost 3 orders of magnitude of expression levels. Red lines in the histogram denote the specific expression levels of the genes.This repository contains scripts to reproduce the analysis and figures. The data is available at , extract the archive to a folder named 'data'.
Dependencies are provided in the conda environment.yml file in the 'docs' folder.