22nd June 2020
Diabetes is a chronic condition in which the body develops a
resistance to insulin, a hormone which converts food into glucose.
Diabetes affect many people worldwide and is normally divided into
Type 1 and Type 2 diabetes. Both have different characteristics.
In this project, we have tried to analyze and considered a model with high
accuracy on the PIMA Indian Diabetes dataset to predict if a particular
observation is at a risk of developing diabetes, given the independent factors.
However, to scrutinize the prediction outcome, here we have developed
an interactive Shiny App using R language which will help to checkout the result depends
on the feature values you have set.
Dataset considered: https://www.kaggle.com/uciml/pima-indians-diabetes-database
To prepare data prior analysis and model implementation, we have carried out below task:
Once we are done with the data preparation and EDA then considered the cleaned output and implemented below machine learning model using R to check the accuracy. And finally selected the one with maximum accuracy and incorporated that in Shiny App
With the explosion rate of diabetes growing exponentially every year, we believe a prediction mechanism as done in the project along with an application will help to resist its speed by creating awareness, diagnosing at an early stage and thereby help to keep the situation under control