Investigate a Dataset Project

This is a Python-based project completed to meet the requirements of Udacity's Data Analyst Nanodegree program. This repo contains the written report for the project, as well as the Jupyter notebook (in HTML format) used for the data analysis.

Project Overview

For this project, I had to select one of five datasets from Udacity's dataset collection and perform the entire data analysis process on it. I had to record all my work and analytical steps in a Jupyter notebook, and could optionally present them in a written report as well.

The data analysis was done in Python, with additional libraries such as NumPy, pandas, Matplotlib and seaborn.

Files/Software Used

This project used the following files/software:

  • European Soccer Database file downloaded from Kaggle
  • DB Browser for SQLite
  • Anaconda Powershell Prompt
  • Jupyter Notebook
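
As a rough illustration of how a SQLite file like the one listed above can be inspected from Python, here is a minimal sketch using only the standard library's sqlite3 module. It is not taken from the project's notebook. The helper names list_tables and count_rows, the demo table Team and its rows are hypothetical, and the in-memory database only stands in for the downloaded file so the snippet runs on its own.

```python
import sqlite3


def list_tables(conn):
    """Return the names of all user tables in a SQLite database, sorted."""
    rows = conn.execute(
        "SELECT name FROM sqlite_master "
        "WHERE type = 'table' AND name NOT LIKE 'sqlite_%' "
        "ORDER BY name"
    ).fetchall()
    return [name for (name,) in rows]


def count_rows(conn, table):
    """Return the number of rows in a table, rejecting unknown table names."""
    if table not in list_tables(conn):
        raise ValueError(f"unknown table: {table}")
    (count,) = conn.execute(f'SELECT COUNT(*) FROM "{table}"').fetchone()
    return count


# Hypothetical stand-in for the downloaded database file. To inspect the real
# file, replace ":memory:" with its path (the file name is an assumption here).
demo = sqlite3.connect(":memory:")
demo.execute("CREATE TABLE Team (id INTEGER PRIMARY KEY, name TEXT)")
demo.executemany(
    "INSERT INTO Team (name) VALUES (?)",
    [("Example FC",), ("Sample United",)],
)
demo.commit()

for table in list_tables(demo):
    print(table, count_rows(demo, table))
```

The same connection object can be passed to pandas.read_sql_query to load a query result into a DataFrame for analysis.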

Installation

Non-GitHub Users

If you are not familiar with GitHub, don't worry, I've got you covered. Follow these steps to access the project:

  1. Click on the green Code button, which will open up a small menu;
  2. Within the menu, select the Download ZIP option;
  3. Download the repo file;
  4. Open the zipped repo file;
  5. Unzip/Extract the project files into a new folder.

GitHub Users

For GitHub users, please clone the project's repository into a local directory of your choosing.

git clone <remote-URL>

Contributions

Investigate a Dataset is a project that was completed as part of Udacity's Data Analyst Nanodegree program. As such, it's a simple project that doesn't require any user contributions.

Credits

The resources used to complete this project are listed in the Bibliography section of the written report.