Skip to content

Exploratory analysis on 2019 Kaggle Machine Learning and Data Science Survey dataset.

License

Notifications You must be signed in to change notification settings

evertonbin/kaggle-survey

Repository files navigation

Table of Contents

  1. Installation
  2. Project Motivation
  3. File Descriptions
  4. Results
  5. Licensing, Authors, and Acknowledgements

Installation

All the necessary libraries to run the code were already available in Anaconda distribution of Python. This script was written using Python version 3.*.

Project Motivation

For developing this project, I was interested in using Kaggle data from its 2019 Machine Learning and Data Science Survey to better understand:

  1. What is the educational background of those acting in Data Science's fields and which courses do they usually attend?
  2. Which are the main activities that they perform in their companies? Do machine learning tasks play a big role in their daily activities?
  3. Which are the most used tools when it comes to programming languages, frameworks and databases?
  4. How well are they paid for their work in Data Science and what aspects may affect their incomes?

File Descriptions

  1. Notebook - Jupyter Notebook (English and Portuguese versions) with the script developed for answering the questions presented above. Along with the script, there are markdown cells contextualizing the adopted steps.
  2. Html file - the html file is a version of the notebook with a functional version of table of contents to facilitate the navigation through the file.
  3. Csv file - a copy of the dataset used in the exploratory analysis. The original one can be found in the Kaggle website through the link in the Licesing section below.
  4. Licensing - MIT License covering this project.

Results

The main findings of the code answering the proposed questions can be found at the Medium post available here.

Licensing, Authors, Acknowledgements

Credits must give credit to the Kaggle platform for making the data available. You can find the Licensing for the data, other descriptive information adn the original dataset at the Kaggle Survey's page - link available here.

About

Exploratory analysis on 2019 Kaggle Machine Learning and Data Science Survey dataset.

Topics

Resources

License

Stars

Watchers

Forks

Packages

No packages published