Mon 22 Jun 2020
Group C
Team Members Are:
In 2019, 3.6 million Malaysians had diabetes disease, highest in Asia and one of the highest in the world. An alarming 7 million adults (31.3%) aged 18 and above, both diagnosed and undiagnosed are estimated to be affected by diabetes in Malaysia by 2025.
Our core objective is to develop an interactive and free app by integrating machine learning model using R programming to predict if a particular observation is at risk of developing diabetes. The PIMA Indians Diabetes dataset(csv format) used in this project is downloaded from Kaggle and is originally from the National Institute of Diabetes and Digestive and Kidney Diseases.
Data Downloaded from: https://www.kaggle.com/uciml/pima-indians-diabetes-database
To reach our goals of predicting Diabetes from the dataset under consideration we have performed the following activities:
After data preparation and EDA, the cleaned output was taken on which the below machine learning models were implemented using R to check the accuracy & value of different parameters. Finally, the model with highest performance was picked and incorporated into DiabetesApp
While we have chosen the model with higher accuracy among the ones under evaluation, yet this application is for an initial state prediction and for raising awareness beforehand. The results cannot be considered final. Therefore users are adviced to consult a physician and take necessary steps for an actual diagnosis.