Used data analytics and visualization to draw valuable insights for the patients no-show scenarios
A medical appointments data set is used, which consists of patient's details (nearly 100k) and their presence based on the schedule
Data set can be downloaded here: https://www.kaggle.com/datasets/marwandiab/medical-appointment-no-shows-dataset
Libraries used: numpy, pandas, seaborn, matplolib (all are in-built with jupyter notebook)
Analysed the dataset to estimate the factors that affect patients to miss their appointment. Following factors/columns are considered:
- gender
- age
- time difference between schedule and appointment
- sms sent/not sent
- scholarship
- neighbourhood
Processes involved:
- Data wrangling
- Data cleaning
- Data analysis
While many conclusions were drawn from the above factors, yet data can be improved detailed to enhance detailed analytics.