Skip to content

Internship / Field attachment: Personal machine learning project on building a Diabetes disease prediction model with implementation of a web application.

Notifications You must be signed in to change notification settings

Group4Day2019/Diabetes-disease-prediction-uisng-a-web-tool

Repository files navigation

ML_Internship2021

Personal machine learning project on building a Diabetes disease prediction model with implementation of a web application.

web app: https://share.streamlit.io/group4day2019/ml_internship2021/main/merged.py

Early Stage Diabetes Prediction.

About Data

This dataset contains the sign and symptpom data of newly diabetic or would be diabetic patient.This has been collected using direct questionnaires from the patients of Sylhet Diabetes Hospital in Sylhet, Bangladesh and approved by a doctor.

Features of the dataset: Diabetes_data_upload.csv

The dataset consist of total 15 features and one target variable named class.

  1. Age: Age in years ranging from (20years to 65 years)
  2. Gender: Male / Female
  3. Polyuria: Yes / No
  4. Polydipsia: Yes/ No
  5. Sudden weight loss: Yes/ No
  6. Weakness: Yes/ No
  7. Polyphagia: Yes/ No
  8. Genital Thrush: Yes/ No
  9. Visual blurring: Yes/ No
  10. Itching: Yes/ No
  11. Irritability: Yes/No
  12. Delayed healing: Yes/ No
  13. Partial Paresis: Yes/ No
  14. Muscle stiffness: yes/ No
  15. Alopecia: Yes/ No
  16. Obesity: Yes/ No

Class: Positive / Negative

Relevant Papers:

Likelihood Prediction of Diabetes at Early Stage Using Data Mining Techniques [Web Link] Authors and affiliations M. M. Faniqul IslamEmail Rahatara Ferdousi Sadikur Rahman Humayra Yasmin Bushra

Citation Request:

Islam, MM Faniqul, et al. 'Likelihood prediction of diabetes at early stage using data mining techniques.' Computer Vision and Machine Intelligence in Medical Image Analysis. Springer, Singapore, 2020. 113-125.

Islam, MM Faniqul, et al. 'Likelihood prediction of diabetes at early stage using data mining techniques.' Computer Vision and Machine Intelligence in Medical Image Analysis. Springer, Singapore, 2020. 113-125

Pima Indians dataset.

Diabetes.csv

Source: https://www.kaggle.com/uciml/pima-indians-diabetes-database.

Data description: The Pima Indians Diabetes Dataset consists of several medical parameters and one dependent parameter (outcome) of binary values. The data-set is mainly for Female gender of at least 21 years old of Pima Indian heritage and the description of the data-set is as follows;

  • 9 columns with 8 independent parameters, 1 outcome parameter with uniquely identified 768 observations having 268 positive for diabetes (1) and 500 negative for diabetes (0). The 9 columns are the following. Pregnancies: Number of times pregnant. Glucose: Oral glucose tolerant test result. Blood pressure: Diastolic blood pressure values in mmHg skin thickness: triceps skin fold thickness in mmInsulin: 2-Hour serum insulin (mu U/ml) BMI: Body mass index. DiabetesPedigreeFunction : Diabetes Pedigree function. Age: Age in years. Outcome: Class 1 indicates person with diabetes, 0 indicates other.

About

Internship / Field attachment: Personal machine learning project on building a Diabetes disease prediction model with implementation of a web application.

Topics

Resources

Stars

Watchers

Forks

Releases

No releases published

Packages

No packages published