Project: Finding Donors for CharityML

Part of Udacity Data Scientist Nanodegree

About

The project aims to evaluate and select the optimal supervised learning algorithm available that is adequate to accurately model individuals' income using data collected from the 1994 U.S. Census. In addition to accurately predicts whether an individual makes more than $50,000. Understanding an individual's income can help a non-profit better understand how large of a donation to request.

Based on the accuracy and f-score and the training time the best model is Random Forest Classifier (RFC). Since we are dealing with a classfication problem using Random Forest would be optimal and fast and easy to communicate results to the stakeholders.

Steps

preprocessing

Transforming Skewed Continuous Features
Normalizing Numerical Features
One-hot Encoding

Implement performance metrics to evaluate the potential algorithms
Choosing the Best Model & Model Tuning

Install

This project requires Python 3.x and the following Python libraries installed:

You will also need to have software installed to run and execute an iPython Notebook

Data

The modified census dataset consists of approximately 32,000 data points, with each datapoint having 13 features. This dataset is a modified version of the dataset published in the paper "Scaling Up the Accuracy of Naive-Bayes Classifiers: a Decision-Tree Hybrid", by Ron Kohavi. You may find this paper online, with the original dataset hosted on UCI.

Name		Name	Last commit message	Last commit date
Latest commit History 7 Commits
__pycache__		__pycache__
README.md		README.md
census.csv		census.csv
example_submission.csv		example_submission.csv
finding_donors.ipynb		finding_donors.ipynb
report.html		report.html
test_census.csv		test_census.csv
visuals.py		visuals.py

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Repository files navigation

Project: Finding Donors for CharityML

Part of Udacity Data Scientist Nanodegree

About

Steps

Install

Data

About

Releases

Packages

Languages

athlatif/FindingDonorsProject

Folders and files

Latest commit

History

Repository files navigation

Project: Finding Donors for CharityML

Part of Udacity Data Scientist Nanodegree

About

Steps

Install

Data

About

Resources

Stars

Watchers

Forks

Releases

Packages 0

Languages

Packages