The objective of this repo was to compare the use various clustering techniques on data of the fatty acid composition of various Italian Olive oils.
The clustering techniques used were:
- K-Means
- Gaussian Mixture Models (GMM)
- HDBSCAN
The output from this analysis can be read on this article in RPubs.
The data used is from a paper by Forina, Armanino, Lanteri, Tiscornia (1983) Classification of Olive Oils from their Fatty Acid Composition, in Martens and Russwurm (ed) Food Research and Data Anlysis. I thank Prof. Michele Forina, University of Genova, Italy for making this dataset available.
You can reproduce the environment by using the renv
package and the renv.lock
file available in this repo.
For questions or feedback please contact me via LinkedIn or email.
For code fixes feel free to raise an issue.