Capstone Project: COVID-19 World Vaccination Progress

About

I performed basic exploratory data analysis (EDA) with the Pandas, NumPy and Matplotlib libraries on a Kaggle dataset covering COVID-19 world vaccination progress from December 2020 to April 2021. The work included data cleaning, wrangling and visualization to test several hypotheses about the data.

Install

This project requires Python 3.x and the following Python libraries:

Pandas
NumPy
Matplotlib
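
As a quick sanity check before opening the notebook, the short sketch below reports which of the libraries listed above cannot be found in the current environment. The helper name `missing_packages` is hypothetical and not part of this project; the import names (`pandas`, `numpy`, `matplotlib`) are the standard ones for these libraries.

```python
from importlib.util import find_spec

# Import names of the libraries listed above.
REQUIRED = ("pandas", "numpy", "matplotlib")


def missing_packages(names=REQUIRED):
    """Return the names from `names` that cannot be found in this environment."""
    return [name for name in names if find_spec(name) is None]


if __name__ == "__main__":
    missing = missing_packages()
    if missing:
        print("Missing libraries:", ", ".join(missing))
    else:
        print("All required libraries are available.")
```

Any library reported as missing can then be installed with your usual package manager.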

Data Dictionary

The data dictionary for this dataset (country_vaccinations.csv) was obtained from: https://www.kaggle.com/gpreda/covid-world-vaccination-progress

Country: the country for which the vaccination information is provided.
ISO_code: the ISO code for the country.
Date: the date of the data entry; some dates have only the daily vaccinations, others only the (cumulative) total.
Total_vaccinations: the absolute number of immunizations (doses) administered in the country.
People_vaccinated: the number of people who have received at least one dose. Depending on the immunization scheme, a person receives one or more (typically 2) doses, so at any given moment the number of vaccinations may be larger than the number of people vaccinated.
People_fully_vaccinated: the number of people who have received the entire set of doses required by the immunization scheme (typically 2). At any given moment, some people will have received one dose and a smaller number will have received all doses in the scheme.
Daily_vaccinations_raw: for a given data entry, the number of vaccinations for that date and country.
Daily_vaccinations: for a given data entry, the number of vaccinations for that date and country.
Total_vaccinations_per_hundred: ratio (in percent) between the number of vaccinations and the total population of the country, up to that date.
People_vaccinated_per_hundred: ratio (in percent) between the population immunized and the total population of the country, up to that date.
People_fully_vaccinated_per_hundred: ratio (in percent) between the population fully immunized and the total population of the country, up to that date.
Daily_vaccinations_per_million: ratio (in ppm) between the number of vaccinations and the total population of the country for the current date.
Vaccines: total number of vaccines used in the country (up to date).
Source_name: source of the information (national authority, international organization, local organization, etc.).
Source_website: website of the source of the information.
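
To make the dictionary concrete, here is a minimal sketch of how these columns might be loaded and interpreted with Pandas. It is not the notebook's actual code. The sample rows, the country name, the population figure and the helper names are all made up for illustration, and the lowercase column names are an assumption about the CSV header; adjust them to match your copy of country_vaccinations.csv.

```python
import io

import pandas as pd

# Tiny made-up sample standing in for country_vaccinations.csv.
# Column names (lowercase) and all values are assumptions for illustration.
SAMPLE_CSV = """country,iso_code,date,total_vaccinations,people_vaccinated,people_fully_vaccinated,daily_vaccinations
Examplestan,EXA,2021-01-01,100,100,,
Examplestan,EXA,2021-01-02,,,,150
Examplestan,EXA,2021-01-03,400,300,100,150
"""

CUMULATIVE = ["total_vaccinations", "people_vaccinated", "people_fully_vaccinated"]


def load_vaccinations(source):
    """Read the CSV and parse the date column into datetimes."""
    return pd.read_csv(source, parse_dates=["date"])


def latest_totals(df):
    """Forward-fill the cumulative columns per country and keep each country's latest row.

    Some dates carry only daily figures, so cumulative totals can be missing.
    """
    df = df.sort_values(["country", "date"]).copy()
    df[CUMULATIVE] = df.groupby("country")[CUMULATIVE].ffill()
    return df.groupby("country").tail(1).set_index("country")


def per_hundred(count, population):
    """Ratio in percent, as used by the *_per_hundred columns."""
    return 100 * count / population


def per_million(count, population):
    """Ratio in parts per million, as used by Daily_vaccinations_per_million."""
    return 1_000_000 * count / population


df = load_vaccinations(io.StringIO(SAMPLE_CSV))
latest = latest_totals(df)
print(latest[CUMULATIVE])

# Hypothetical population; the dataset itself has no population column.
print(per_hundred(latest.loc["Examplestan", "total_vaccinations"], 1000))
```

In this sample, total vaccinations exceed the number of people vaccinated because some people have received a second dose, which is the situation described under People_vaccinated. To run the sketch on the real file, pass its path to `load_vaccinations` instead of the in-memory sample.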
