K-Means in Haskell

Performs K-means clustering on a list of tab-separated numeric vectors given a number K of clusters, a list of tab-separated numberic vectors of starting centroids, and a number of maximum iterations

Sample usage:

ghc --make kmeans.hs
./kmeans 2 yeast.dat 10 yeast_centroids.dat

Out-of-the-box, this should print out:

K          = 2
Iterations = 10
Num Points = 2467
Group 0: 1496
Group 1: 971
Group 0: 1589
Group 1: 878
Group 0: 1659
Group 1: 808
Group 0: 1706
Group 1: 761
Group 0: 1743
Group 1: 724
Group 0: 1757
Group 1: 710
Group 0: 1772
Group 1: 695
Group 0: 1782
Group 1: 685
Group 0: 1793
Group 1: 67
Group 0: 1802
Group 1: 665

TODO

optionally create random centroids (either by taking K centroids at random from data points, or by generating K random centroids in some semi-smart manner)
write out to file the classification of each point (e.g. %(point)\t%(class)\n format)

Name		Name	Last commit message	Last commit date
Latest commit History 7 Commits
.gitignore		.gitignore
README.md		README.md
kmeans.hs		kmeans.hs
yeast.dat		yeast.dat
yeast_centroids.dat		yeast_centroids.dat
yeast_experiments.txt		yeast_experiments.txt
yeast_gene_names.txt		yeast_gene_names.txt

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Repository files navigation

K-Means in Haskell

TODO

About

Releases

Packages

Languages

aotimme/kmeans-haskell

Folders and files

Latest commit

History

Repository files navigation

K-Means in Haskell

TODO

About

Resources

Stars

Watchers

Forks

Releases

Packages 0

Languages

Packages