Customer Segemntation

This repository represents a project where I create a cluster model, for clustering the customers of a bank that take credits.

The goal is investigate what group of customers take the highest credit amounts, in this case we have a dataset with categorical and numerical data so, I used K-Means Clustering with OneHot Encoding for categorical data,

Also I used a K_Prototypes algorith beacause this algorithm permit using categorical and numerical datawithout encoding categorical data.

The result was that the group with ages between 20 and 68 years take credits with highest money amounts.

Enviroment

Python
Scikit-Learn
Plotly
Seaborn
Gower distances algorithm
Kmodes package for K-Prototype algorithm
Prince library for Factorial analysis mixed data (PCA for numerical and categorical )

If you want reproduce the repository

If you want use the repository you can make a git clone or download the repository

**You can see the notebook here

Name		Name	Last commit message	Last commit date
Latest commit History 6 Commits
notebooks		notebooks
.gitignore		.gitignore
Procfile		Procfile
README.md		README.md
german_credit_data.csv		german_credit_data.csv
requirements.txt		requirements.txt

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Repository files navigation

Customer Segemntation

Enviroment

If you want reproduce the repository

About

Releases

Packages

Languages

erikqtrs/customer_segmentation_bank

Folders and files

Latest commit

History

Repository files navigation

Customer Segemntation

Enviroment

If you want reproduce the repository

About

Resources

Stars

Watchers

Forks

Releases

Packages 0

Languages

Packages