You signed in with another tab or window. Reload to refresh your session.You signed out in another tab or window. Reload to refresh your session.You switched accounts on another tab or window. Reload to refresh your session.Dismiss alert
NMF (Nonnegative Matrix Factorization) is a mathematical technique with a long history in the field of genomics for the analysis of bulk RNA sequencing data1,2, and it has been widely adopted as a powerful dimensionality reduction tool for single-cell data as well3,4,5–10. It is unique among matrix factorization methods in identifying overlapping patterns in high-throughput data to define biological processes that can co-occur across cell types or biological conditions11.
What is CoGAPS?
CoGAPS (Coordinated Gene Activity across Pattern Subsets) is a Bayesian NMF (Nonnegative Matrix Factorization) algorithm. It can be used to perform sparse matrix factorization on any data, and when this data represents biomolecules, to do gene set analysis. CoGAPS improves on other enrichment measurement methods by combining a Markov chain Monte Carlo (MCMC) matrix factorization algorithm (GAPS) with a threshold-independent statistic inferring activity on gene sets.
Who should use CoGAPS?
Anyone can use CoGAPS; no machine learning experience is required.
What kind of data does CoGAPS work on?
CoGAPS can be used to perform sparse matrix factorization on any data. And when this data represents biomolecules, to do gene set analysis.
How do I cite CoGAPS?
If you use the CoGAPS package for your analysis, please cite:
Fertig EJ, Ding J, Favorov AV, Parmigiani G, Ochs MF (2010). “CoGAPS: an integrated R/C++ package to identify overlapping patterns of activation of biological processes from expression data.” Bioinformatics, 26(21), 2792–2793.
If you use the gene set statistic, please cite Ochs et al. (2009)