The task of this project is to employ different techniques to train and evaluate models with unbalanced classes using the imbalanced-learn and scihit-learn libraries to build and evaluate models using resampling. The credit card dataset is from LendingClub, a peer-to-peer lending services company.
The tasks for this project are to predict credit risk using:
- The Resampling Models
- The SMOTEEN Algorithm
- Ensemble Classifiers
Files:
- credit_risk_resampling.ipynb
- credit_risk_ensemble.ipynb
- LoanStats_2019Q1
- 6 machine learning models
- 3 scores
- Balanced Accuracy
- Precision
- Recall
- 3 measures for analysis
- Accuracy Score
- Confusion Matrix
- Imbalanced Classification Report
RESAMPLING:
- Naive Random Oversampling
- SMOTE Oversampling
- Cluster Centroid Undersampling
- SMOTEENN Combo Sampling (Over and Under)
ENSEMBLE: