Skip to content

Notebook about the customer segmentation problem using machine learning models in this case Unsupervised models like K-Prototypes since the dataset has categorical and numerical data

Notifications You must be signed in to change notification settings

erikqtrs/customer_segmentation_bank

Folders and files

NameName
Last commit message
Last commit date

Latest commit

 

History

6 Commits
 
 
 
 
 
 
 
 
 
 
 
 

Repository files navigation

Customer Segemntation

This repository represents a project where I create a cluster model, for clustering the customers of a bank that take credits.

The goal is investigate what group of customers take the highest credit amounts, in this case we have a dataset with categorical and numerical data so, I used K-Means Clustering with OneHot Encoding for categorical data,

Also I used a K_Prototypes algorith beacause this algorithm permit using categorical and numerical datawithout encoding categorical data.

The result was that the group with ages between 20 and 68 years take credits with highest money amounts.

Enviroment

  • Python
  • Scikit-Learn
  • Plotly
  • Seaborn
  • Gower distances algorithm
  • Kmodes package for K-Prototype algorithm
  • Prince library for Factorial analysis mixed data (PCA for numerical and categorical )

If you want reproduce the repository

If you want use the repository you can make a git clone or download the repository

**You can see the notebook here

About

Notebook about the customer segmentation problem using machine learning models in this case Unsupervised models like K-Prototypes since the dataset has categorical and numerical data

Resources

Stars

Watchers

Forks

Releases

No releases published

Packages

No packages published