review

read data diabetes
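
The report does not show how the data was read. The column names in the output further down (glucose, mass, pedigree, age, pressure, pregnant, triceps, insulin) and the neg/pos class labels match the `PimaIndiansDiabetes` data set in the `mlbench` package, so the sketch below assumes that source. The object name `diabetes_df` is hypothetical.

```r
# Assumption: the data is mlbench's PimaIndiansDiabetes; the report does not name its source.
library(mlbench)

data(PimaIndiansDiabetes)
diabetes_df <- PimaIndiansDiabetes   # hypothetical name for the working copy

str(diabetes_df)                     # 8 numeric predictors plus the factor `diabetes`
table(diabetes_df$diabetes)          # class counts for neg / pos
```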

cross validation

## 
##      neg      pos 
## 0.650974 0.349026
## 
##       neg       pos 
## 0.6513158 0.3486842
## 
##       neg       pos 
## 0.6510417 0.3489583
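
The three tables above look like class proportions after a stratified hold-out split rather than k-fold cross validation: the values are consistent with 401/616, 99/152 and 500/768 negatives, that is, a training set, a test set and the full data. A minimal sketch of such a split, assuming `caret::createDataPartition` and the `mlbench` data; the seed and the 80% fraction are assumptions and the exact row counts may differ from the report.

```r
# Assumptions: mlbench data, caret for the stratified split, p = 0.8, arbitrary seed.
library(mlbench)
library(caret)

data(PimaIndiansDiabetes)
set.seed(123)                                    # hypothetical seed

# Sample row indices within each class so both sets keep the class balance.
train_idx <- createDataPartition(PimaIndiansDiabetes$diabetes, p = 0.8, list = FALSE)
train_set <- PimaIndiansDiabetes[train_idx, ]
test_set  <- PimaIndiansDiabetes[-train_idx, ]

prop.table(table(train_set$diabetes))            # training proportions
prop.table(table(test_set$diabetes))             # test proportions
prop.table(table(PimaIndiansDiabetes$diabetes))  # full-data proportions
```

Near-identical proportions in all three tables are what a stratified split is meant to produce.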

modeling

  1. naive bayes

predict

confusion matrix

## Accuracy 
##     0.75
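
The fit, predict and confusion matrix steps for naive Bayes could look roughly like the sketch below. It assumes `e1071::naiveBayes` and `caret::confusionMatrix`; the report does not say which implementation was used, and the split, seed and object names are hypothetical, so the accuracy will not necessarily reproduce 0.75.

```r
# Assumptions: mlbench data, e1071 naive Bayes, caret for the split and the confusion matrix.
library(mlbench)
library(caret)
library(e1071)

data(PimaIndiansDiabetes)
set.seed(123)                                    # hypothetical seed
train_idx <- createDataPartition(PimaIndiansDiabetes$diabetes, p = 0.8, list = FALSE)
train_set <- PimaIndiansDiabetes[train_idx, ]
test_set  <- PimaIndiansDiabetes[-train_idx, ]

nb_fit  <- naiveBayes(diabetes ~ ., data = train_set)
nb_pred <- predict(nb_fit, newdata = test_set)   # returns class labels by default

# Rows are predictions, columns are the true classes.
nb_cm <- confusionMatrix(nb_pred, test_set$diabetes, positive = "pos")
nb_cm$table
nb_cm$overall["Accuracy"]
```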
  2. decision tree

predict

##  Accuracy 
## 0.6710526
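
A comparable sketch for the decision tree, assuming `rpart` with default settings. The report does not name the tree package or its tuning, and the split and seed here are hypothetical.

```r
# Assumptions: mlbench data, rpart classification tree with default control parameters.
library(mlbench)
library(caret)
library(rpart)

data(PimaIndiansDiabetes)
set.seed(123)                                    # hypothetical seed
train_idx <- createDataPartition(PimaIndiansDiabetes$diabetes, p = 0.8, list = FALSE)
train_set <- PimaIndiansDiabetes[train_idx, ]
test_set  <- PimaIndiansDiabetes[-train_idx, ]

tree_fit  <- rpart(diabetes ~ ., data = train_set, method = "class")
tree_pred <- predict(tree_fit, newdata = test_set, type = "class")

tree_cm <- confusionMatrix(tree_pred, test_set$diabetes, positive = "pos")
tree_cm$overall["Accuracy"]
```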
  3. random forest

## rf variable importance
## 
##          Overall
## glucose  100.000
## mass      49.388
## pedigree  29.281
## age       21.953
## pressure  11.606
## pregnant   7.495
## triceps    2.478
## insulin    0.000

##  Accuracy 
## 0.7631579
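
The "rf variable importance" table with a single `Overall` column scaled to 0-100 is the format printed by `caret::varImp` on a `train` object, so the sketch below assumes the forest was fitted through `caret::train(method = "rf")`, which needs the `randomForest` package installed. The resampling settings, seed and object names are hypothetical.

```r
# Assumptions: mlbench data, caret::train with method = "rf", 5-fold CV for tuning mtry.
library(mlbench)
library(caret)

data(PimaIndiansDiabetes)
set.seed(123)                                    # hypothetical seed
train_idx <- createDataPartition(PimaIndiansDiabetes$diabetes, p = 0.8, list = FALSE)
train_set <- PimaIndiansDiabetes[train_idx, ]
test_set  <- PimaIndiansDiabetes[-train_idx, ]

rf_fit <- train(diabetes ~ ., data = train_set, method = "rf",
                trControl = trainControl(method = "cv", number = 5))

rf_imp <- varImp(rf_fit)                         # importance rescaled so the top variable is 100
rf_imp

rf_pred <- predict(rf_fit, newdata = test_set)
rf_cm   <- confusionMatrix(rf_pred, test_set$diabetes, positive = "pos")
rf_cm$overall["Accuracy"]
```

Because the importance is rescaled, the 0.000 for insulin in the table above means it ranked lowest, not that it carries no information.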

## [1] 0.8151325
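
The unlabelled value above may be an area under the ROC curve on the test set; that reading is a guess, as is the choice of model. A minimal sketch of computing an AUC with `ROCR`, here for a random forest fitted directly with `randomForest`; the split, seed and object names are hypothetical.

```r
# Assumptions: the value is a test-set AUC; mlbench data; randomForest model; ROCR for the AUC.
library(mlbench)
library(caret)
library(randomForest)
library(ROCR)

data(PimaIndiansDiabetes)
set.seed(123)                                    # hypothetical seed
train_idx <- createDataPartition(PimaIndiansDiabetes$diabetes, p = 0.8, list = FALSE)
train_set <- PimaIndiansDiabetes[train_idx, ]
test_set  <- PimaIndiansDiabetes[-train_idx, ]

rf_model <- randomForest(diabetes ~ ., data = train_set)

# Predicted probability of the positive class for each test row.
pos_prob <- predict(rf_model, newdata = test_set, type = "prob")[, "pos"]

# ROCR treats the second factor level ("pos") as the positive class.
pred_obj <- prediction(pos_prob, test_set$diabetes)
auc      <- performance(pred_obj, measure = "auc")@y.values[[1]]
auc
```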

david

8/22/2019