CellWalkR is an R package that integrates single-cell open chromatin (scATAC-seq) data with cell type labels and bulk epigenetic data to identify cell type-specific regulatory regions. A GPU implementation and downsampling strategies enable thousands of cells to be processed in seconds. CellWalkR’s user-friendly interface provides interactive analysis and visualization of cell labels and regulatory region mappings.
Install CellWalkR for R using devtools as follows:
$ R
> install.packages("devtools")
> devtools::install_github("PFPrzytycki/CellWalkR")
For a guide to using CellWalkR, see the provided vignette, which covers the following:
- Data Pre-processing
- Getting Started with CellWalkR
- Building a Network
- Tuning Label Edges
- Making a cellWalk Object
- Adding Filters
- Downstream Analysis
- Interactive Visualizaiton
- Adding a Second Set of Labels
- Detecting Doublets
If you use CellWalkR please cite:
-
Przytycki, P.F., Pollard, K.S. “CellWalkR: An R Package for integrating and visualizing single-cell and bulk data to resolve regulatory elements.” Bioinformatics (2022). https://doi.org/10.1093/bioinformatics/btac150
-
Przytycki, P.F., Pollard, K.S. “CellWalker integrates single-cell and bulk data to resolve regulatory elements across cell types in complex tissues.” Genome Biology (2021). https://doi.org/10.1186/s13059-021-02279-1
CellWalkR can also be run on AWS which vastly simplifies the process of running on GPUs using TensorFlow. Using GPUs allows the code to run more than 15 times faster. For a guide to running CellWalkR on AWS using GPUs see this vignette.