Finding_Donors_for_CharityML. Project Overview:

First project for Introduction to ML with TensorFlow Nanodegree Program at Udacity

Optimized several different supervised learners to predict highest donation yield (3.7x fscore (0.75 vs 0.2 naive predictor) +15% accuracy (0.869 vs 0.752 naive predictor).

Business Understanding
Data Understanding: Explored data collected from the 1994 US Census with 45222 observations and 13 variables + target. This dataset is a modified version of the dataset published in the paper "Scaling Up the Accuracy of Naive-Bayes Classifiers: a Decision-Tree Hybrid", by Ron Kohavi. You may find this paper online, with the original dataset hosted on UCI.
Data Preparation: Normalized numerical features, transformed skewed continuous features plus one-hot encoded categorical variables.
Data Modeling: Compared and Optimized different ensemble methods using GridCV.
Results Evaluation: Discussed effects of feature selection.

Code and Resources Used

Python Version: 3.8.5
Packages: pandas, numpy, sklearn, matplotlib, seaborn

Repo Walk-Through

visuals.py: A few auxiliary plot functions.
finding_donors.ipynb: Main and only notebook.

Name		Name	Last commit message	Last commit date
Latest commit History 8 Commits
README.md		README.md
census.csv		census.csv
finding_donors.ipynb		finding_donors.ipynb
test_census.csv		test_census.csv
visuals.py		visuals.py

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Repository files navigation

Finding_Donors_for_CharityML. Project Overview:

Code and Resources Used

Repo Walk-Through

About

Releases

Packages

Languages

montsebenito/Finding_Donors_for_CharityML

Folders and files

Latest commit

History

Repository files navigation

Finding_Donors_for_CharityML. Project Overview:

Code and Resources Used

Repo Walk-Through

About

Resources

Stars

Watchers

Forks

Releases

Packages 0

Languages

Packages