DropletQC

This is a simple R package to calculate, for every requested cell barcode in a provided scRNA-seq BAM file, the nuclear fraction score:

nuclear fraction = intronic reads / (intronic  reads  +  exonic  reads)

The score captures the proportion of reads from intronic regions. These RNA fragments originate from unspliced (nuclear) pre-mRNA, hence the name “nuclear fraction”. This score can be used to help identify:

“Empty” droplets containing ambient RNA: low nuclear fraction score and low UMI count
Droplets containing damaged cells: high nuclear fraction score and low UMI count

Installation

You can install DropletQC with:

# install.packages("devtools")
devtools::install_github("powellgenomicslab/DropletQC", build_vignettes = TRUE)

Calculating the nuclear fraction

There are two functions which can be used to calculate the nuclear fraction; nuclear_fraction_tags and nuclear_fraction_annotation.

If your BAM file contains region tags which identify aligned reads as intronic or exonic, such as those produced by 10x Genomics’ Cell Ranger software, then the simplest and fastest way to calculate the nuclear fraction is to point nuclear_fraction_tags to the directory:

library(DropletQC)
nf1 <- nuclear_fraction_tags(
    outs = system.file("extdata", "outs", package = "DropletQC"),
     tiles = 1, cores = 1, verbose = FALSE)
head(nf1)
#>                    nuclear_fraction
#> AAAAGTCACTTACTTG-1        0.9032698
#> AAAAGTGGATCTCTAA-1        0.4032761
#> AAAGCAGTTACGAAGA-1        0.3957704
#> AACGACTTCAATATGT-1        0.4004525
#> AACGGCGTCATCTGGA-1        0.8845109
#> AAGCAGGGGTCGCGAA-1        0.3929376

Alternatively, you can point nuclear_fraction_annotation to a gene annotation, BAM and barcode files:

nf2 <- nuclear_fraction_annotation(
 annotation_path = system.file("extdata/outs/chr1.gff3",package = "DropletQC"),
 bam = system.file("extdata/outs/possorted_genome_bam.bam",package = "DropletQC"),
 barcodes = system.file("extdata/outs/filtered_feature_bc_matrix/barcodes.tsv.gz",package = "DropletQC"),
 tiles = 1, cores = 1, verbose = FALSE)
head(nf2)
#>                    nuclear_fraction
#> AAAAGTCACTTACTTG-1        0.9032698
#> AAAAGTGGATCTCTAA-1        0.4032761
#> AAAGCAGTTACGAAGA-1        0.3957704
#> AACGACTTCAATATGT-1        0.4004525
#> AACGGCGTCATCTGGA-1        0.8845109
#> AAGCAGGGGTCGCGAA-1        0.3929376

This method is more flexible, as it makes no assumptions about how your BAM file was produced - but it will take longer. Take care that the provided barcodes match the barcode structure in the BAM file.

Identifying empty droplets and damaged cells

Once the nuclear fraction score has been calculated, the identify_empty_drops and identify_damaged_cells functions can be used to assist in identifying each these populations. Empty or damaged cells are flagged, not removed.

More information

For a detailed discussion see our manuscript:

DropletQC: improved identification of empty droplets and damaged cells in single-cell RNA-seq data

For more information about the functions included in the package, including tips on how to assess the nuclear fraction score using real-world examples, see the package vignette.

Name		Name	Last commit message	Last commit date
Latest commit History 61 Commits
R		R
data		data
docs		docs
inst/extdata/outs		inst/extdata/outs
man		man
vignettes		vignettes
.Rbuildignore		.Rbuildignore
.gitignore		.gitignore
DESCRIPTION		DESCRIPTION
LICENSE		LICENSE
LICENSE.md		LICENSE.md
NAMESPACE		NAMESPACE
README.Rmd		README.Rmd
README.md		README.md
_pkgdown.yml		_pkgdown.yml
dropletQC.Rproj		dropletQC.Rproj

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Licenses found

Repository files navigation

DropletQC

Installation

Calculating the nuclear fraction

Identifying empty droplets and damaged cells

More information

About

Licenses found

Releases 1

Packages

Languages

License

Licenses found

powellgenomicslab/DropletQC

Folders and files

Latest commit

History

Repository files navigation

DropletQC

Installation

Calculating the nuclear fraction

Identifying empty droplets and damaged cells

More information

About

Topics

Resources

License

Licenses found

Stars

Watchers

Forks

Releases 1

Packages 0

Languages

Packages