Skip to content

I use the credit card fraud binary classification problem to show how resampling can change modeling outcomes for unbalanced datasets.

Notifications You must be signed in to change notification settings

samarakoon-ryan/resampling-unbalanced-datasets

Folders and files

NameName
Last commit message
Last commit date

Latest commit

 

History

21 Commits
 
 
 
 

Repository files navigation

Resampling Unbalanced Datasets using the Credit Card Fraud Dataset

In this project I will show an example of how resampling can be useful for unbalanced datasets in binary classification problems. I'll be using a logistic regression model to demonstrate this. I'm aware that there are a vast amount of tools and libraries to deal with resampling and I strongly recommend that you use a combination of these methods to deal with unbalanced datasets. Here I would like to simply demonstrate the pros and cons of resampling using Sklearn's resample().

dataset: https://www.kaggle.com/datasets/yashpaloswal/fraud-detection-credit-card

About

I use the credit card fraud binary classification problem to show how resampling can change modeling outcomes for unbalanced datasets.

Resources

Stars

Watchers

Forks

Releases

No releases published

Packages

No packages published