Integrating gene expression and biological knowledge for drug discovery and repurposing

These materials was delivered as part of the ISCB Academy webinar series on April 22, 2022 at 8:00 AM EDT.

Cite as:

Mahmoud Ahmed, & Trang Huyen Lai. (2022, May 10). Integrating gene expression and biological knowledge for drug discovery and repurposing. Zenodo. https://doi.org/10.5281/zenodo.6534994

Integrating gene expression and biological knowledge for drug discovery and repurposing

Description

Screening for potential cancer therapies using existing large datasets of drug perturbations requires expertise and resources not available to all. This is often a barrier for lab scientists to tap into these valuable resources. To address these issues, one can take advantage of prior knowledge especially those coded in standard formats such as causal biological networks (CBN). Large datasets can be converted into appropriate structures, analyzed once and the results made freely available in easy-to-use formats. In this three parts tutorial, we will give a full description of one large scale analysis of using this approach, one case study of building a network of metastasis suppressors from scratch, and a walkthrough example code to perform and adapt these tools for different use cases

Detailed description

Part One: a talk describing the construction of a database for cancer-cell-specific perturbations of biological networks (LINPS). We pre-computed cancer-cell-specific perturbation amplitudes of several biological networks and made the output available in a database with an interactive web interface.
Part Two: a talk describing the building of a functional network model of the metastasis suppressor RKIP and its regulators in breast cancer cells. In this case study, we applied text mining and a manual literature search to extract known interactions between several metastasis suppressors and their regulators. Then we adopted a reverse causal reasoning approach to evaluate and prioritize pathways that are most consistent and responsive to drugs that inhibit cell growth. We further validated some of the predicted regulatory links in the breast cancer cell line MCF7 experimentally and highlighted the points of uncertainty in our model.

Part Three: a code walkthrough encoding directed interactions into the biological expression language (BEL), computing the network perturbation amplitudes (NPA), and interpreting the output.

Pre-requisites

Knowledge of the mechanisms of transcriptional regulation
Familiarity with high-throughput gene expression data
Basic knowledge of R and Bioconductor
Familiarity with Docker and Git (This is only required to execute the code on a local machine)

Time

Three half-hour parts. With breaks and time for discussion for up to two hours.

Resources

The Shiny app (Link)
The database (link) and app (link) source code
Published articles on the database (link) and the case study (link) (recommended reads)
The tutorial source code (Link)

Instructors bio

Mahmoud Ahmed (MahShaaban) is working as a postdoc at Gyeongsang National University, South Korea, where he received his PhD in 2021. He studied the role of autophagy in development (adipocyte differentiation) and disease (cancer) using high-throughput data. Mahmoud developed open-source R/Bioconductors packages and Shiny applications to implement methods for analyzing gene expression data and integrating it with other kinds of data. His recent work focuses on understanding cancer evolution, biological network perturbations, and drug repurposing.
Trang Huyen Lai (Tranghuyenlai) is a PhD candidate in the department of Convergence Medical Science, Gyeongsang National University, School of Medicine, South Korea. Trang is primarily interested in the mechanisms and regulatory pathways of genes involved in autophagy and cancer using molecular technology and experimental assays. Her current research topic is investigating the role of Raf kinase inhibitory protein (RKIP) in metastasis and its regulators in breast cancer.

Setup

To follow through the code, you can choose one of the following options.

Download the materials and the docker environment using the following code

# clone this repo
git clone --recurse-submodules https://github.com/MahShaaban/integrating_knowledge_data

# pull the docker image
docker pull mahshaaban/integrating_knowledge_data:latest

# run the docker container
docker run -it --rm \
    -p 8787:8787 -e PASSWORD=rstudio \
    -v $(pwd)/integrating_knowledge_data:/home/rstudio \
    mahshaaban/integrating_knowledge_data:latest

Alternatively, create and run a project on RStudio Cloud using this repo (The free tier works fine)
- Log in your RStudio Cloud account
- New Project > New Project from Git Repository
- Copy & paste this URL "https://github.com/MahShaaban/integrating_knowledge_data"

There you need to start by installing the required packages

# to run the code in part 3
install.packages('igraph')
remotes::install_github('MahShaaban/NPAModels')
remotes::install_github('MahShaaban/NPA')

# to render rmarkdown files
install.packages(c("base64enc", "htmltools", "markdown", "rmarkdown"))

You can clone the repo, and install the required packages on your local machine using the code above.

Name		Name	Last commit message	Last commit date
Latest commit History 12 Commits
.github/workflows		.github/workflows
62429e30b8762535c08cf128 @ 65e5fbb		62429e30b8762535c08cf128 @ 65e5fbb
62429e45b876255fa68cf41b @ ef649d8		62429e45b876255fa68cf41b @ ef649d8
Part_3_code-walkthrough		Part_3_code-walkthrough
.Rbuildignore		.Rbuildignore
.gitmodules		.gitmodules
CITATION.cff		CITATION.cff
CITATION.md		CITATION.md
CODE_OF_CONDUCT.md		CODE_OF_CONDUCT.md
DESCRIPTION		DESCRIPTION
Dockerfile		Dockerfile
LICENSE		LICENSE
Part_1_slides.pdf		Part_1_slides.pdf
Part_2_slides.pdf		Part_2_slides.pdf
README.md		README.md
integrating_knowledge_data.Rproj		integrating_knowledge_data.Rproj

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Repository files navigation

Cite as:

Integrating gene expression and biological knowledge for drug discovery and repurposing

Description

Detailed description

Pre-requisites

Time

Resources

Instructors bio

Setup

About

Packages

Languages

License

ISCB-Academy/integrating_knowledge_data

Folders and files

Latest commit

History

Repository files navigation

Cite as:

Integrating gene expression and biological knowledge for drug discovery and repurposing

Description

Detailed description

Pre-requisites

Time

Resources

Instructors bio

Setup

About

Topics

Resources

License

Code of conduct

Stars

Watchers

Forks

Packages 0

Languages

Packages