Skip to content

BiodataAnalysisGroup/lncAPNet

Folders and files

NameName
Last commit message
Last commit date

Latest commit

 

History

21 Commits
 
 
 
 
 
 
 
 
 
 
 
 

Repository files navigation

lncAPNet

Abstract

Transcriptomic analyses elucidate complexities in diseases like Chronic Lymphocytic Leukemia (CLL), revealing specific molecules, cellular traits, and signalling patterns influencing clinical outcomes. The status of the variable region of IGHV in leukemia cells is crucial for classifying CLL cases, providing insights into their clinical condition and prognosis. Patients can be stratified based on the mutational status of the IGHV genes, with mutated CLL (M-CLL) exhibiting a more indolent course than the more aggressive unmutated CLL (U-CLL) cases. Long non-coding RNAs (lncRNAs) play diverse roles in cancer but are often overlooked due to limitations in computational approaches. To address this, we propose lncAPNet [long non-coding Activity PASNet], a biologically informed Deep Learning model-based framework for supervised patient classification, instructed by lncRNA-mRNA-pathways interactions with eXplainable Artificial Intelligence (XAI) read-outs. This pipeline utilizes NetBID2 for network-based integration of omics data, and PASNet [Pathway Associated Sparse deep neural Network] for the deep learning analysis, further building on lncrnalyzer and EnrichR knowledge graphs as input for the biological priors. Complementing these, we deploy custom code integrating SHAP for interpretability, by combining a bespoke graph architecture representing lncRNA-mRNA-pathway interactions, facilitating a comprehensive analysis of regulatory mechanisms. The proposed framework is being tested by applying it into two extensive CLL transcriptomic datasets; data from ICGC-CLLES project (257 samples) are used as the training-validation dataset, with the produced model tested on the BloodCancerMultiOmics2017 dataset (113 samples). Preliminary results showed that U-CLL cases are easily differentiated from M-CLL cases as expected, additionally revealing both biological ground-truth pathways, as well as insofar unknown but potential prognostic lncRNAs in disease.

Poster

ECCB2024

Framework steps

lncAPNet framework on the ICGC and BCMO datasets aims to identify lncRNAs that act as molecular drivers

  • Step1:

    Run SJARACNe/NetBID2 for Network Reconstruction and Differential Activity Analysis

a. Prepare and preprocess the ICGC and BCMO datasets, focusing on both mRNA and lncRNA expression data.

b. Use SJARACNe to reconstruct gene regulatory networks (GRNs), identifying interactions between mRNAs and lncRNAs.

c. Run NetBID2 to estimate the activity levels of genes and lncRNAs as driver molecules.

d. Perform Differential Activity analysis to highlight lncRNAs and mRNAs that are potential drivers influencing key biological pathways, especially focusing on lncRNAs as potential master regulators.

  • Step2:

    Match Pathways with Drivers and Use PASNet for Interpretability

a. Match identified drivers (including lncRNAs and mRNAs) to biological pathways using pathway databases EnrichR-KG, lncRNAlyzr (Gene Ontology, KEGG, Reactome, Wikipathways).

b. Run PASNet to integrate biological pathway information and train a deep learning model on the ICGC dataset (70-30 split for training/validation), and then test on the BCMO dataset.

c. Utilize SHAP values for deep learning explainability, identifying which lncRNAs and pathways are most important in the model’s predictions.

  • Step3:

    Post-hoc Network Analysis for Biological Explainability

a. Perform a Post-hoc network analysis to explore the connections between identified lncRNA/mRNA drivers and their related pathways.

b. Use network visualization tools to map these relationships, focusing on the role of lncRNAs as drivers.

c. Interpret the network to uncover how lncRNAs function as key regulators of biological processes.

About

This repository Created for ECCB2024 - Turku, Finland

Resources

License

Stars

Watchers

Forks

Releases

No releases published

Packages

No packages published