Fraud Detection Case Study

Our team built a Gradient Boosted Model that triages fraudulent events. Our model produces an F1 score (weighted precision and recall) of 0.66.

We identified these factors as key indicators of fraud, and used our available data to find appropriate proxies:

File Structure

Data: The data used in this case study is confidential, and therefore not included in this repository.

Model: Our model is in the home directory, under model.py. Predictions are generated via predict.py

A few notebooks can also be found outlining our feature engineering process, and our model development process.