Clean and Analyze NBA Player Data in PostgreSQL

This project cleans and analyzes NBA player data using PostgreSQL. The raw data comes from a CSV file with per-player information such as name, height, weight, points per game, and rebounds per game.

The steps involved in the project are as follows:

Data Cleaning: The raw data is cleaned using Python and the Pandas library. The data is inspected for missing values, duplicates, and inconsistencies, and the cleaned data is stored in a new CSV file.
Database Creation: A new database is created in PostgreSQL and the cleaned data is loaded into it. The database schema is designed to reflect the structure of the data.
Data Analysis: SQL queries are used to analyze the data in the database. The queries answer specific questions, such as which players have the highest points per game, which have the highest rebounds per game, and what the average height of players in the league is.
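
As a rough illustration of the Data Cleaning step, the sketch below inspects a small in-memory CSV for duplicates, missing values, and inconsistent entries using Pandas. The column names (player_name, height_cm, weight_kg, ppg, rpg), the sample rows, and the clean_players helper are all hypothetical and are not taken from the project's notebooks. Dropping every row with a missing value is only one possible policy.

```python
import io

import pandas as pd

# Hypothetical raw rows; the real CSV's columns and values may differ.
RAW_CSV = """player_name,height_cm,weight_kg,ppg,rpg
 Alice Example ,198,95,21.4,5.1
Alice Example,198,95,21.4,5.1
Bob Sample,211,,14.0,10.2
Cara Demo,unknown,88,9.5,3.3
Dan Test,203,100,18.2,7.7
"""

NUMERIC_COLS = ["height_cm", "weight_kg", "ppg", "rpg"]


def clean_players(raw: pd.DataFrame) -> pd.DataFrame:
    """Return a cleaned copy of the raw player table (illustrative only)."""
    df = raw.copy()
    # Strip stray whitespace so otherwise identical names compare equal.
    df["player_name"] = df["player_name"].str.strip()
    # Coerce numeric columns; entries that cannot be parsed become NaN.
    df[NUMERIC_COLS] = df[NUMERIC_COLS].apply(pd.to_numeric, errors="coerce")
    # Remove exact duplicate rows, then rows with any missing value.
    df = df.drop_duplicates().dropna()
    return df.reset_index(drop=True)


raw = pd.read_csv(io.StringIO(RAW_CSV))
cleaned = clean_players(raw)

# Print the cleaned table as CSV text; pass a file path to to_csv to save it.
print(cleaned.to_csv(index=False))
```

In this sample, the duplicated Alice Example row collapses to one, and the rows with a missing weight or an unparseable height are dropped.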

Technologies Used

Python
Pandas
PostgreSQL
SQL

How to Use

To run this project, follow these steps:

Clone the repository to your local machine.
Install the necessary dependencies using pip install -r requirements.txt.
Run the Jupyter notebook Clean_and_Analyze_NBA_Data.ipynb to clean the data and create the database.
Use any SQL client to connect to the PostgreSQL database and run queries to analyze the data.
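
The kind of analysis queries described earlier can be sketched as follows. To keep the example runnable without a database server, it uses Python's built-in sqlite3 module as a stand-in for PostgreSQL. The players table, its columns, and the sample rows are hypothetical. The SELECT statements use plain SQL that should also run on PostgreSQL, although a PostgreSQL driver such as psycopg2 uses %s rather than ? for parameter placeholders.

```python
import sqlite3

# In-memory SQLite database used only as a stand-in for PostgreSQL.
conn = sqlite3.connect(":memory:")

# Hypothetical schema; the project's actual schema may differ.
conn.execute(
    """
    CREATE TABLE players (
        player_name TEXT PRIMARY KEY,
        height_cm   REAL,
        weight_kg   REAL,
        ppg         REAL,
        rpg         REAL
    )
    """
)

# Hypothetical sample rows standing in for the cleaned CSV data.
conn.executemany(
    "INSERT INTO players VALUES (?, ?, ?, ?, ?)",
    [
        ("Alice Example", 198, 95, 21.4, 5.1),
        ("Bob Sample", 211, 110, 14.0, 10.2),
        ("Dan Test", 203, 100, 18.2, 7.7),
    ],
)

# Players with the highest points per game.
top_scorers = conn.execute(
    "SELECT player_name, ppg FROM players ORDER BY ppg DESC LIMIT 2"
).fetchall()

# Players with the highest rebounds per game.
top_rebounders = conn.execute(
    "SELECT player_name, rpg FROM players ORDER BY rpg DESC LIMIT 2"
).fetchall()

# Average height across all players in the table.
avg_height = conn.execute("SELECT AVG(height_cm) FROM players").fetchone()[0]

print(top_scorers)
print(top_rebounders)
print(avg_height)

conn.close()
```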

Future Improvements

This project can be improved in the following ways:

Adding more data to the database to provide a more comprehensive analysis.
Creating a web application to display the analysis results.
Automating the data cleaning and database creation process.

LeventSoykan/Clean_-_Analyze_NBA_Player_Data_In_PostgreSQL
