Clustering-Geolocation-Data-Intelligently-in-Python

This is a Coursera Guided Project that I completed, with the following learning objectives:

How to visualize and understand geographical data in an interactive way with Python.
How the K-Means algorithm works, and some of the shortcomings it has.
Density-based clustering approaches, and how to deal with any outliers they may classify.

I initially completed the project on Coursera's hands-on platform "Rhyme", but later downloaded the Jupyter Notebook and saved my progress.

The following Python modules/functions are used in the project:

matplotlib for plotting and charting the outcomes.
pandas for storing and manipulating data.
NumPy for numerical data manipulation.
hdbscan and DBSCAN for density-based spatial clustering (HDBSCAN being the hierarchical variant).
sklearn functionality such as KMeans and silhouette_score, along with KNeighborsClassifier.
folium for visualizing maps and coordinates.

The project is divided into 7 tasks:

Task 1: An introduction to the problem, as well as basic exploratory data analysis and visualizations.

Task 2: Visualizing geographical data in a more meaningful and interactive way.

Task 3: Methods of evaluating the strength of a clustering algorithm.
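One such measure is the silhouette score listed among the modules above. The sketch below, on synthetic data rather than the project's dataset, compares a sensible labelling with an arbitrary one; scores closer to 1 indicate tighter, better separated clusters.

```python
import numpy as np
from sklearn.metrics import silhouette_score

rng = np.random.default_rng(0)
# Two synthetic, well-separated blobs standing in for (lat, lon) pairs.
a = rng.normal(loc=[0.0, 0.0], scale=0.1, size=(50, 2))
b = rng.normal(loc=[5.0, 5.0], scale=0.1, size=(50, 2))
X = np.vstack([a, b])

good_labels = np.array([0] * 50 + [1] * 50)  # matches the true grouping
bad_labels = np.tile([0, 1], 50)             # alternates labels regardless of position

good = silhouette_score(X, good_labels)
bad = silhouette_score(X, bad_labels)
print(f"good labelling: {good:.3f}, arbitrary labelling: {bad:.3f}")
```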

Task 4: Theory behind K-Means, and how to use it for our problem.
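A minimal sketch of how K-Means could be applied, assuming synthetic blobs in place of the real coordinates: it fits `KMeans` for several candidate values of k and keeps the one with the highest silhouette score. The range of k and the data are illustrative choices, not the notebook's settings.

```python
import numpy as np
from sklearn.cluster import KMeans
from sklearn.metrics import silhouette_score

rng = np.random.default_rng(42)
# Three synthetic blobs; the true number of groups is therefore 3.
centres = np.array([[0.0, 0.0], [4.0, 4.0], [0.0, 4.0]])
X = np.vstack([rng.normal(c, 0.2, size=(60, 2)) for c in centres])

scores = {}
for k in range(2, 7):
    model = KMeans(n_clusters=k, n_init=10, random_state=0).fit(X)
    scores[k] = silhouette_score(X, model.labels_)

# Pick the k whose clustering scored best.
best_k = max(scores, key=scores.get)
print(best_k, scores)
```

Note that K-Means needs k up front and assigns every point to some cluster, which is one of the shortcomings the density-based methods in the later tasks try to address.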

Task 5: Introduction to density-based clustering approaches, and how to use DBSCAN.
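The following sketch shows DBSCAN on synthetic data (an assumption; the project's data and parameters are not reproduced here). Points that do not belong to any dense region receive the label `-1`, which scikit-learn uses for noise.

```python
import numpy as np
from sklearn.cluster import DBSCAN

rng = np.random.default_rng(1)
dense_a = rng.normal([0.0, 0.0], 0.05, size=(40, 2))
dense_b = rng.normal([3.0, 3.0], 0.05, size=(40, 2))
lonely = np.array([[10.0, 10.0], [-10.0, 5.0]])  # isolated points, far from both blobs
X = np.vstack([dense_a, dense_b, lonely])

# eps: neighbourhood radius; min_samples: neighbours needed to form a dense core.
labels = DBSCAN(eps=0.3, min_samples=5).fit_predict(X)

n_clusters = len(set(labels) - {-1})  # exclude the noise label
n_noise = int(np.sum(labels == -1))
print(n_clusters, n_noise)
```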

Task 6: Introduction to HDBSCAN, to alleviate constraints of classical DBSCAN.

Task 7: A simple method to address outliers classified by density-based models.
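One simple way to do this, sketched here under the assumption that KNeighborsClassifier is used for this step, is to train a nearest-neighbour classifier on the points that did receive a cluster and let it assign labels to the noise points. The data, the `hybrid` name, and `n_neighbors=1` are illustrative choices.

```python
import numpy as np
from sklearn.cluster import DBSCAN
from sklearn.neighbors import KNeighborsClassifier

rng = np.random.default_rng(3)
blob_a = rng.normal([0.0, 0.0], 0.05, size=(40, 2))
blob_b = rng.normal([3.0, 3.0], 0.05, size=(40, 2))
strays = np.array([[0.6, 0.6], [2.4, 2.4]])  # nearer to blob_a and blob_b respectively
X = np.vstack([blob_a, blob_b, strays])

labels = DBSCAN(eps=0.3, min_samples=5).fit_predict(X)
is_noise = labels == -1

hybrid = labels.copy()
if is_noise.any() and (~is_noise).any():
    # Train on clustered points only, then predict a cluster for each outlier.
    knn = KNeighborsClassifier(n_neighbors=1)
    knn.fit(X[~is_noise], labels[~is_noise])
    hybrid[is_noise] = knn.predict(X[is_noise])

print(int(is_noise.sum()), "outliers reassigned")
```

After this step every point carries a cluster label, at the cost of forcing genuinely isolated points into their nearest cluster.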

At the end of this project I found that I need to work more on:

K-Means Algorithm.
Density-based clustering approaches with HDBSCAN.
Data visualization skills, to a lesser extent.

memeghaj10/Clustering-Geolocation-Data-Intelligently-in-Python
