This Study/Project deals with predicting fraudulent credit card transactions. The data is taken from famous Kaggle dataset: https://www.kaggle.com/mlg-ulb/creditcardfraud Since the data is highly imbalanced, we have approcahed it in following ways;
- Random Forest Classification with class weight.
- Random Forest Classification with under sampling.
- Random Forest Classification with over sampling.
We have also used unsupervised ML algorithms for anomaly detection.
- Local Outlier Factor
- Isolation Forest
We have tried to understand different model evaluation measures such as accuracy, confusion matrix, precision, recall and f1 score.