DataCamp Exam for Data Scientist José María Martín 2024-08-06
1 Setup
1.1 Packages
1.2 Import Data
1.3 Code
2 Data Analysis Framework
2.1 Question/Problem Statement
2.2 Structure of Information
2.3 Workflow
3 Preliminary Analysis
3.1 Load Data
3.2 Data Validation
3.2.1 Missing Values
3.2.2 Text Variables Validation
3.2.3 Numeric Variables Validation
3.3 Exploratory Analysis
3.3.1 Dataset Overview
3.3.2 Variable Classification
3.3.3 Target Description
3.3.4 Numeric Variables Exploration
3.3.5 Categorical Variables
3.3.6 Outliers
4 Pre-Processing Analysis
4.1 Correlation
4.1.1 High Correlated Variables vs Target Variable
4.1.2 ANOVA and post-hoc tests
4.2 Transformations
5 Machine Leaning Models Analysis
5.1 Workflow
5.2 Results
5.3 Average Predictions
5.4 Residuals
5.5 Accuracy
6 sessioninfo::session_info()