DP_means

Dirichlet Process K-means

Description

DP K-means is a Bayesian nonparametric extension of the K-means algorithm, based on a small-variance asymptotics (SVA) approximation of the Dirichlet process mixture model.
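For orientation, the SVA limit in the referenced Kulis and Jordan paper yields a K-means-style objective with a penalty on the number of clusters. The notation below is an assumption made for this sketch, not taken from this repository: x is a data point, \ell_c is the set of points in cluster c, \mu_c is its mean, and k is the number of clusters.

```latex
\min_{\{\ell_c\}_{c=1}^{k}} \; \sum_{c=1}^{k} \sum_{x \in \ell_c} \lVert x - \mu_c \rVert^2 + \lambda k,
\qquad
\mu_c = \frac{1}{\lvert \ell_c \rvert} \sum_{x \in \ell_c} x
```

A larger lambda makes each extra cluster more expensive and so tends to produce fewer clusters.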

It does not require prior knowledge of the number of clusters K. The cluster penalty parameter lambda is set from the data by taking the maximum distance to the centers produced by k-means++ initialization. Normalized Mutual Information (NMI) is used to compare posterior cluster assignments with the ground truth.
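The steps described above can be sketched in plain Python. This is a minimal illustration under stated assumptions, not the code in this repository: the function names (`dp_means`, `lambda_from_kpp`, `nmi`) are hypothetical, lambda is treated as a squared distance, the k-means++ seeding takes a rough guess `k` for the number of seeds, and NMI is normalized by the geometric mean of the two entropies.

```python
import math
import random
from collections import Counter


def sq_dist(a, b):
    """Squared Euclidean distance between two equal-length sequences."""
    return sum((x - y) ** 2 for x, y in zip(a, b))


def mean(points):
    """Coordinate-wise mean of a non-empty list of points."""
    n = len(points)
    return [sum(col) / n for col in zip(*points)]


def lambda_from_kpp(data, k, seed=0):
    """Assumed heuristic: pick k seeds k-means++ style and return the largest
    squared distance from any point to its nearest seed."""
    rng = random.Random(seed)
    centers = [list(rng.choice(data))]
    d2 = [sq_dist(x, centers[0]) for x in data]
    while len(centers) < k and sum(d2) > 0:
        # Sample the next seed with probability proportional to d2.
        idx = rng.choices(range(len(data)), weights=d2)[0]
        centers.append(list(data[idx]))
        d2 = [min(d, sq_dist(x, centers[-1])) for d, x in zip(d2, data)]
    return max(d2)


def dp_means(data, lam, max_iter=100):
    """DP-means sketch: a point farther than lam (squared distance) from
    every center opens a new cluster centered on itself."""
    centers = [mean(data)]  # start from a single global cluster
    labels = [0] * len(data)
    for _ in range(max_iter):
        changed = False
        for i, x in enumerate(data):
            dists = [sq_dist(x, c) for c in centers]
            best = min(range(len(centers)), key=dists.__getitem__)
            if dists[best] > lam:
                centers.append(list(x))
                best = len(centers) - 1
            if labels[i] != best:
                labels[i] = best
                changed = True
        # Drop empty clusters, relabel compactly, and recompute the means.
        used = sorted(set(labels))
        remap = {old: new for new, old in enumerate(used)}
        labels = [remap[l] for l in labels]
        centers = [
            mean([x for x, l in zip(data, labels) if l == c])
            for c in range(len(used))
        ]
        if not changed:
            break
    return labels, centers


def nmi(a, b):
    """NMI of two labelings, normalized by sqrt(H(a) * H(b)) (an assumption)."""
    n = len(a)
    ca, cb, cab = Counter(a), Counter(b), Counter(zip(a, b))
    mi = sum(
        nij / n * math.log(nij * n / (ca[i] * cb[j]))
        for (i, j), nij in cab.items()
    )
    ha = -sum(c / n * math.log(c / n) for c in ca.values())
    hb = -sum(c / n * math.log(c / n) for c in cb.values())
    if ha == 0 or hb == 0:
        return 1.0 if ha == hb else 0.0
    return mi / math.sqrt(ha * hb)
```

A typical call would be `labels, centers = dp_means(data, lambda_from_kpp(data, k=3))` followed by `nmi(labels, truth)`, where `data` is a list of coordinate lists and `truth` holds the ground-truth labels. The number of clusters returned depends on lambda rather than on `k`, which only controls how many seeds the heuristic draws.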

Reference

B. Kulis and M. I. Jordan, "Revisiting k-means: New Algorithms via Bayesian Nonparametrics", ICML 2012.

Dependencies

Matlab 2015a
Python 3.11.2
Eigen3