This notebook has a lot of interactive visualizations that aren't displayed on GitHub!
Please, click here to see this notebook on Kaggle, the way the author intended it to be seen.
This notebook is a study on the Data Science Job Salaries dataset, containing information on salary, company size and location, remote work, employee location, and many other related to people working in the Data Science field throughout the world.
The goal of this notebook is to do an EDA on the data, trying to understand how the feature salary in usd relates to other features and try to understand their relevance when it comes to achieving higher yearly earnings.
After doing an EDA, I've used PyCaret regression lib to predict employee salaries in usd according to the features analyzed in the dataset.
- Pandas
- Numpy
- Plotly
- Pycaret
Luís Fernando Torres