GitHub - adityagandhamal/clustering-algorithm: K-Means Clustering Algorithm experimented on the famous iris dataset

About Project

In this project, I have done an experiment with K-Means Clustering Algorithm (or Unsupervised Machine Learning) on the famous iris dataset. As we know, the iris dataset has 3 target labels viz. iris setosa, iris versicolor and iris virginica labeled as 0, 1 and 2 respectively, I have given the clustering algorithm only the features of the iris dataset to get the predicted labels and to compare with the true ones.

Data Used

The data used is the iris dataset available in the sklearn library itself. I have worked with only the petal features(length and width) in this experiment.

Model training and predictions

I have trained the model(KMeans) with the data and have plotted the predictions along with their predicted labels(using colors as the respective clusters) with the help of Matplotlib.

Plot depicting the training data.

Plot depicting the predictions on the same data using K-Means Clustering Algorithm.

Further improvements

I have applied the Elbow Method to find out the optimum value for n_clusters and further even scaled down the features using MinMaxScaler.

Elbow Method to determine the optimum value for n_clusters

Libraries Used

SciKit Learn
Numpy
Matplotlib

Name		Name	Last commit message	Last commit date
Latest commit History 14 Commits
Experiment.ipynb		Experiment.ipynb
README.md		README.md

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Repository files navigation

About Project

Data Used

Model training and predictions

Plot depicting the training data.

Plot depicting the predictions on the same data using K-Means Clustering Algorithm.

Further improvements

Elbow Method to determine the optimum value for n_clusters

Libraries Used

About

Releases

Languages

adityagandhamal/clustering-algorithm

Folders and files

Latest commit

History

Repository files navigation

About Project

Data Used

Model training and predictions

Plot depicting the training data.

Plot depicting the predictions on the same data using K-Means Clustering Algorithm.

Further improvements

Elbow Method to determine the optimum value for n_clusters

Libraries Used

About

Topics

Resources

Stars

Watchers

Forks

Releases

Languages