Skip to content

xCompass-AI/CellPolaris

Folders and files

NameName
Last commit message
Last commit date

Latest commit

 

History

44 Commits
 
 
 
 
 
 
 
 
 
 

Repository files navigation

CellPolaris

Abstract

Changes in cell fate are determined by a gene regulatory network (GRN), a sophisticated system that regulates gene expression with precise spatiotemporal patterns. However, accurately capturing causal and context specific gene regulations by integrating multi-omics data at bulk and single cell level remains challenging. In this study, we introduce CellPolaris, a computational system that leverages transfer learning algorithms to generate tissue or cell-type-specific GRNs. The system employs high-confidence GRNs generated from collected transcriptomic data and paired ATAC-Seq data for training. Once trained, the model only requires transcriptomic data as input to infer the GRNs of the corresponding states. Benchmarking tests demonstrated the reliability of CellPolaris in GRN construction. Notably, to assess the model's scalability, we incorporated GRNs from other species such as mouse and zebrafish, which slightly improved the performance of CellPolaris in predicting GRNs for human cells. Subsequently, based on the GRNs, CellPolaris can predict master transcription factors (TFs) involved in cell fate transitions and simulate the impact of TF perturbations on developmental processes. The results show that the top-ranked master regulatory factors identified by CellPolaris largely overlap with the combinations of factors that have successfully achieved cell fate conversion in selected reprogramming experimental systems. Simultaneously, CellPolaris predicts the changes in target genes (TGs) with in silico knockout of TFs and then simulates the effects of these knockouts on round spermatid differentiation. In summary, CellPolaris provides an approach for utilizing transcriptomic data to construct GRNs by transfer learning, thereby identifying potential TFs involved in cell fate transitions and deciphering the underlying mechanisms of developmental regulation.

Workflow

image Figure 1. Schematic overview of CellPolaris design. a Generation of a generalized transfer model using PECA2 tool to construct a GRN database from ATAC-Seq and corresponding cell state RNA-Seq data. This model enables cross-species and cross-tissue analysis. The trained model can take RNA-Seq data as input and generate corresponding GRNs, which can be utilized for downstream applications. b Prediction of cell fate regulatory factors based on GRN analysis: Comparison of GRN differences between source and target cells during cell fate transitions. Transcription factor nodes in the differential networks were scored and ranked. c Simulation of the impact of gene perturbation on cell differentiation pathways based on GRN analysis: A probability graphical model of gene expression regulation was constructed using GRN analysis. Simulated knockout genes were set to zero, and the expression changes of neighboring node genes were inferred to deduce changes in cell states.

1 Use transfer_learning to get GRN:

Dataset

To download the dataset required to run the transfer learning code from the link below: https://www.scidb.cn/en/s/VNvY3e

Dependency

conda env create -f ./transfer_learning/environment.yml

Running

For cross-tissue/period

python ./transfer_learning/scripts_for_execute/script_train_tissue_period.py

For cross-cluster

python ./transfer_learning/scripts_for_execute/script_train_multi_to_multi.py

And to enhance domain-specific edges learning

python  ./transfer_learning/scripts_for_execute/script_train_multi_to_multi_enhance_specific.py

Using Drosophila and zebrafish to enhance the human GRN learning, change the 'mouse' related content to 'fly'/'zebrafish' according to the annotations in transfer_learning/load_dataset/dataset.py; transfer_learning/load_dataset/load_each_peca/loader.py; transfer_learning/load_dataset/load_each_peca/implementation/sc_human.py

and then run

python  ./transfer_learning/scripts_for_execute/script_train_multi_to_multi_with_fly_or_zebrafiosh.py

2 Use model_PGM to get deltaX:

Dependency

Creating an Environment from a YAML File

conda env create -f environment_pgm.yml

Running

python ./model_PGM/run.py

3 Use plot to get final results:

Running

python ./plot/CellCruise.py

Troubleshooting

If you encounter any problems during installation or use of CellPolaris, please contact us by email cuiwentao@cnic.cn. We will help you as soon as possible.

Citation

If you use this code for your research, please cite our paper CellPolaris: Decoding Cell Fate through Generalization Transfer Learning of Gene Regulatory Networks

@article{feng2023cellpolaris,
  title={CellPolaris: Decoding Cell Fate through Generalization Transfer Learning of Gene Regulatory Networks},
  author={Feng, Guihai and Qin, Xin and Zhang, Jiahao and Huang, Wuliang and Zhang, Yiyang and Cui, Wentao and Li, Shirui and Chen, Yao and Liu, Wenhao and Tian, Yao and others},
  journal={bioRxiv},
  pages={2023--09},
  year={2023},
  publisher={Cold Spring Harbor Laboratory}
}