VisionInsight

Objective

The objective of this project was to develop a machine learning model capable of predicting whether a person has diabetes based on a set of medical variables.

Project structure

Data collection
Data preprocessing and cleaning
Exploratoy data analysis (EDA)
Modeling and evaluation
- Models
  - Logistic Regression
  - K-Nearest Neighbors
  - Support Vector Machine
- Evaluation
  - F1-Score

Question

Is it possible to accurately predict whether a patient has diabetes using diagnostic variables such as number of pregnancies, BMI, insulin levels and age?

Findings

Random Forest emerged as the best-performing model, achieving an F1-Score of 0.7234.
The EDA revealed no strong correlation between the number of pregnancies and diabetes outcome.
Glucose levels and BMI had a stronger relationship with the target variable.

Dataset

UCI Machine Learning & Collaborator. (n.d.). Pima Indians Diabetes Database. Kaggle. https://www.kaggle.com/datasets/uciml/pima-indians-diabetes-database

Author

José Habacuc Soto Hernández - SWE Student

GitHub: https://github.com/habacucsoto
Portfolio: https://habacuc.dev

Name		Name	Last commit message	Last commit date
Latest commit History 3 Commits
README.md		README.md
archive2.zip		archive2.zip
main.ipynb		main.ipynb
main.py		main.py
requirements.txt		requirements.txt
trained_model.pkl		trained_model.pkl

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Repository files navigation

VisionInsight

Objective

Project structure

Question

Findings

Dataset

Author

About

Releases

Packages

Languages

habacucsoto/visioninsight

Folders and files

Latest commit

History

Repository files navigation

VisionInsight

Objective

Project structure

Question

Findings

Dataset

Author

About

Resources

Stars

Watchers

Forks

Releases

Packages 0

Languages

Packages