Understanding (big) data with machine-learning.
Material for Geneva's Humanitarian Big Data.
Slides: Geneva's Humanitarian Big Data
Try it live: (no installing needed)
By the end you will know:
- the basics of scikit-learn
- how to use Decision Trees and Random Forests
- how to use cross-validation to measure performance
- that there are many metrics by which to measure performance
- Visual explanation of Decision trees
- Neural network playground
- Gathering Statistics with Privacy, aided by the Flip of a Coin
- Unbiased performance estimates
- Download pre-trained models
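As a taste of the goals listed above, here is a minimal sketch that fits a decision tree and a random forest with scikit-learn and scores both with cross-validation on two different metrics. It is not taken from the course notebooks: the synthetic dataset, the hyperparameters, and the choice of accuracy and ROC AUC as metrics are all assumptions made for illustration, and it assumes a scikit-learn version that provides `cross_validate`.

```python
from sklearn.datasets import make_classification
from sklearn.ensemble import RandomForestClassifier
from sklearn.model_selection import cross_validate
from sklearn.tree import DecisionTreeClassifier

# Synthetic binary classification data stands in for a real dataset
# (an assumption for illustration only).
X, y = make_classification(
    n_samples=500, n_features=10, n_informative=5, random_state=0
)

# Hyperparameters are arbitrary illustrative choices, not recommendations.
models = {
    "decision tree": DecisionTreeClassifier(max_depth=3, random_state=0),
    "random forest": RandomForestClassifier(n_estimators=100, random_state=0),
}

results = {}
for name, model in models.items():
    # 5-fold cross-validation: each fold is held out once for testing,
    # so every score is computed on data the model was not fitted on.
    scores = cross_validate(model, X, y, cv=5, scoring=["accuracy", "roc_auc"])
    results[name] = {
        "accuracy": scores["test_accuracy"].mean(),
        "roc_auc": scores["test_roc_auc"].mean(),
    }
    print(
        f"{name}: accuracy={results[name]['accuracy']:.3f}, "
        f"roc_auc={results[name]['roc_auc']:.3f}"
    )
```

The two metrics need not rank the models the same way, which is why it helps to decide which metric matters for your problem before comparing models.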
Anaconda is a Python distribution that is easy to install and includes a large number of commonly used libraries. Download Anaconda and make sure to get the Python 3.5 version.
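Once Anaconda is installed, a quick way to check that the environment is ready is to import a few libraries and print their versions. The sketch below is only a suggestion: the list of packages is an assumption about what the material needs, so adjust it to match the notebooks you plan to run.

```python
import importlib
import sys

# Packages assumed to be needed; this list is a guess, not taken from the course.
packages = ("numpy", "scipy", "matplotlib", "sklearn")

print("Python", sys.version.split()[0])

versions = {}
for name in packages:
    try:
        module = importlib.import_module(name)
        versions[name] = getattr(module, "__version__", "unknown")
    except ImportError:
        # Record missing packages instead of stopping at the first failure.
        versions[name] = None

for name, version in versions.items():
    print(name, version if version is not None else "is NOT installed")
```

If any package is reported as not installed, it can usually be added with `conda install` followed by the package name (for scikit-learn the conda package is called `scikit-learn`, not `sklearn`).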
Geneva's Humanitarian Big Data by Tim Head is licensed under a Creative Commons Attribution-NonCommercial-ShareAlike 4.0 International License.