In this project, we will be exploring data that looks at how certain diagnostic factors affect the diabetes outcome of women patients.
I will use EDA skills to help inspect, clean, and validate the data.
Note: This dataset is from the National Institute of Diabetes and Digestive and Kidney Diseases.
The dataset contains the following columns:
Pregnancies
: Number of times pregnantGlucose
: Plasma glucose concentration per 2 hours in an oral glucose tolerance testBloodPressure
: Diastolic blood pressureSkinThickness
: Triceps skinfold thicknessInsulin
: 2-Hour serum insulinBMI
: Body mass indexDiabetesPedigreeFunction
: Diabetes pedigree functionAge
: Age (years)Outcome
: Class variable (0 or 1)