Single-Cell-Perturbations

History

Name		Name	Last commit message	Last commit date
parent directory ..
versions		versions
README.md		README.md
output.txt		output.txt

README.md

Single-Cell Gene Expression Prediction

Overview

This repository contains code for predicting how small molecules change gene expression in different cell types. The goal is to accelerate drug discovery and basic biology research by developing methods to accurately predict chemical perturbations in new cell types.

Data

The dataset used for this project can be found here. It includes various data files, such as adata_obs_meta.csv, adata_train.parquet, de_train.parquet, and more. These files are used for training and evaluation.

Getting Started

Prerequisites

Python 3.7 or higher
LightGBM
Scikit-learn
PyArrow

Installation

Clone the repository:

git clone https://github.com/spmfte/single-cell-gene-expression-prediction.git

Navigate to the project directory:

cd single-cell-gene-expression-prediction

Install the required packages:
```
pip install -r requirements.txt
```

Usage

Run the Jupyter notebook pert30.ipynb for a step-by-step walkthrough of the project.
Modify the code as needed for your specific use case and dataset.

Data Exploration

In the Jupyter notebook, we perform data exploration, visualize the dataset, and analyze its characteristics.

Preprocessing

We preprocess the data by handling missing values and scaling the features to prepare it for model development.

Model Development

We train a LightGBM regression model to predict chemical perturbations' impact on gene expression in different cell types.

Evaluation

The model's performance is evaluated using root mean squared error (RMSE) on a test dataset.

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Files

Single-Cell-Perturbations

Single-Cell-Perturbations

README.md

Single-Cell Gene Expression Prediction

Overview

Data

Getting Started

Prerequisites

Installation

Usage

Data Exploration

Preprocessing

Model Development

Evaluation

Acknowledgments

Files

Single-Cell-Perturbations

Directory actions

More options

Directory actions

More options

Latest commit

History

Single-Cell-Perturbations

Folders and files

parent directory

README.md

Single-Cell Gene Expression Prediction

Overview

Data

Getting Started

Prerequisites

Installation

Usage

Data Exploration

Preprocessing

Model Development

Evaluation

Acknowledgments