adinaamzarescu/Predictive-Modeling

Santa - Data Analysis and Predictive Modeling

Copyright 2024 Adina Amzarescu

Description

The "Santa" project works through a data analysis and predictive modeling workflow in a single notebook. It covers Exploratory Data Analysis (EDA), Logistic Regression (implemented both manually and with scikit-learn), and Decision Tree modeling, and compares the resulting models by F1 score.

Installation

To set up the project, install the necessary Python libraries:

pip install numpy pandas matplotlib seaborn scikit-learn

Usage

Open and run the Santa.ipynb notebook to reproduce the data analysis and modeling. The notebook is organized into sections that cover EDA, model implementation, and evaluation in that order.

Detailed Features

Exploratory Data Analysis (EDA)

  • Visualization and Statistical Analysis: Uses matplotlib and seaborn for visualizing data and understanding underlying patterns.
  • Correlation Studies: Analyzes correlations between different variables, particularly focusing on their relationship with the target variable.
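The correlation study described above can be sketched roughly as follows. This is a minimal illustration, not the notebook's code: the helper name `correlations_with_target`, the column names, and the synthetic data are all assumptions, since this README does not describe the dataset.

```python
import numpy as np
import pandas as pd


def correlations_with_target(df: pd.DataFrame, target: str) -> pd.Series:
    """Pearson correlation of each numeric column with `target`, strongest first."""
    numeric = df.select_dtypes("number")
    corr = numeric.corr()[target].drop(target)
    # Order by absolute strength so strong negative correlations are not buried.
    order = corr.abs().sort_values(ascending=False).index
    return corr.reindex(order)


# Synthetic stand-in data (hypothetical columns, not the project's dataset).
rng = np.random.default_rng(0)
n = 500
signal = rng.normal(size=n)   # feature that drives the target
noise = rng.normal(size=n)    # feature unrelated to the target
target = (signal + 0.3 * rng.normal(size=n) > 0).astype(int)

demo_df = pd.DataFrame({"signal": signal, "noise": noise, "target": target})
ranked = correlations_with_target(demo_df, "target")
print(ranked)
```

The full correlation matrix (`demo_df.corr()`) could also be passed to `seaborn.heatmap` for the visual overview that the EDA section mentions.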

Logistic Regression

  • Manual and Scikit-Learn Implementations: Implements logistic regression both manually and with scikit-learn.
  • Comparative Evaluation: Compares the performance of manual and scikit-learn implementations in different scenarios.
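To make the idea of a manual implementation concrete, here is one way it might look, using batch gradient descent on the log-loss. This is a sketch under assumptions: the function names, learning rate, iteration count, and synthetic data are illustrative, and the notebook may use a different optimization scheme.

```python
import numpy as np


def sigmoid(z):
    """Map real-valued scores to probabilities in (0, 1)."""
    return 1.0 / (1.0 + np.exp(-z))


def fit_logistic(X, y, lr=0.1, n_iters=2000):
    """Fit weights and bias by batch gradient descent on the mean log-loss."""
    n_samples, n_features = X.shape
    w = np.zeros(n_features)
    b = 0.0
    for _ in range(n_iters):
        p = sigmoid(X @ w + b)       # predicted probabilities
        error = p - y                # gradient of the log-loss w.r.t. the scores
        w -= lr * (X.T @ error) / n_samples
        b -= lr * error.mean()
    return w, b


def predict(X, w, b, threshold=0.5):
    """Turn probabilities into 0/1 labels."""
    return (sigmoid(X @ w + b) >= threshold).astype(int)


# Synthetic, linearly separable data (an assumption for illustration only).
rng = np.random.default_rng(0)
X_demo = rng.normal(size=(200, 2))
y_demo = (X_demo[:, 0] + X_demo[:, 1] > 0).astype(int)

w, b = fit_logistic(X_demo, y_demo)
train_accuracy = (predict(X_demo, w, b) == y_demo).mean()
print(f"weights={w}, bias={b:.3f}, train accuracy={train_accuracy:.3f}")
```

The scikit-learn counterpart is `sklearn.linear_model.LogisticRegression`. Note that it applies L2 regularization by default, so its coefficients will generally not match an unregularized manual fit exactly.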

Decision Tree Modeling

  • Implementation using Scikit-Learn: Explores decision tree modeling for predictive analysis using scikit-learn's API.
  • Model Evaluation: Focuses on training and evaluating the decision tree model, assessing its predictive power and accuracy.
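A decision tree workflow of the kind described above might look like the following sketch. The synthetic dataset from `make_classification`, the split ratio, and `max_depth=4` are assumptions chosen for illustration; the notebook's data and hyperparameters are not stated in this README.

```python
from sklearn.datasets import make_classification
from sklearn.metrics import accuracy_score
from sklearn.model_selection import train_test_split
from sklearn.tree import DecisionTreeClassifier

# Synthetic stand-in for the project's dataset (hypothetical).
X, y = make_classification(
    n_samples=400, n_features=6, n_informative=3, random_state=0
)

# Hold out a test set so the score reflects unseen data.
X_train, X_test, y_train, y_test = train_test_split(
    X, y, test_size=0.25, random_state=0, stratify=y
)

# Limiting depth is one common way to reduce overfitting.
tree = DecisionTreeClassifier(max_depth=4, random_state=0)
tree.fit(X_train, y_train)

test_accuracy = accuracy_score(y_test, tree.predict(X_test))
print(f"depth={tree.get_depth()}, test accuracy={test_accuracy:.3f}")
```

Comparing accuracy on the training split with accuracy on the test split is a quick check for overfitting, which unpruned trees are prone to.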

Comparative Evaluation

  • F1 Score Analysis: Compares different models based on F1 score to evaluate their performance and reliability.
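For reference, the F1 score is the harmonic mean of precision and recall. The sketch below computes it from scratch and uses it to rank two sets of predictions. The model names and label vectors are made up for illustration and are not results from the notebook.

```python
def f1_score_binary(y_true, y_pred, positive=1):
    """F1 = 2 * precision * recall / (precision + recall) for one positive class."""
    pairs = list(zip(y_true, y_pred))
    tp = sum(1 for t, p in pairs if t == positive and p == positive)
    fp = sum(1 for t, p in pairs if t != positive and p == positive)
    fn = sum(1 for t, p in pairs if t == positive and p != positive)
    if tp == 0:
        return 0.0  # no true positives: precision and recall are zero or undefined
    precision = tp / (tp + fp)
    recall = tp / (tp + fn)
    return 2 * precision * recall / (precision + recall)


# Hypothetical labels and predictions, for illustration only.
y_true = [1, 0, 1, 1, 0, 0, 1, 0]
predictions = {
    "model_a": [1, 0, 1, 0, 0, 0, 1, 0],
    "model_b": [1, 1, 1, 1, 0, 1, 0, 0],
}

scores = {name: f1_score_binary(y_true, y_pred) for name, y_pred in predictions.items()}
best_model = max(scores, key=scores.get)

for name, score in scores.items():
    print(f"{name}: F1 = {score:.3f}")
print("best:", best_model)
```

In practice `sklearn.metrics.f1_score(y_true, y_pred)` returns the same quantity for binary labels. F1 is a reasonable comparison metric when the classes are imbalanced, where plain accuracy can be misleading.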

Technical Aspects

  • Python Libraries: Uses numpy, pandas, matplotlib, seaborn, and scikit-learn.
  • Structured Approach: The notebook is structured into various sections, each focusing on different aspects of data analysis and modeling.

License

This project is licensed under the MIT License. See the LICENSE file for more details.
