Leena Chan
The following R code loads the "Diabetes 130-US hospitals for years 1999-2008" dataset from the UC Irvine Machine Learning Repository (https://archive.ics.uci.edu/ml/datasets/Diabetes+130-US+hospitals+for+years+1999-2008). It reads the .csv file into a data frame and prints a summary of every column.
# Use read.csv() to import the data into the data frame diabetesData
diabetesData <- read.csv("C:\\Users\\idea\\Documents\\HHA551 - R\\dataset_diabetes\\dataset_diabetes\\diabetic_data.csv", header = TRUE)
summary(diabetesData)
## encounter_id patient_nbr race
## Min. : 12522 Min. : 135 ? : 2273
## 1st Qu.: 84961194 1st Qu.: 23413221 AfricanAmerican:19210
## Median :152388987 Median : 45505143 Asian : 641
## Mean :165201646 Mean : 54330401 Caucasian :76099
## 3rd Qu.:230270888 3rd Qu.: 87545950 Hispanic : 2037
## Max. :443867222 Max. :189502619 Other : 1506
##
## gender age weight
## Female :54708 [70-80):26068 ? :98569
## Male :47055 [60-70):22483 [75-100) : 1336
## Unknown/Invalid: 3 [50-60):17256 [50-75) : 897
## [80-90):17197 [100-125): 625
## [40-50): 9685 [125-150): 145
## [30-40): 3775 [25-50) : 97
## (Other): 5302 (Other) : 97
## admission_type_id discharge_disposition_id admission_source_id
## Min. :1.000 Min. : 1.000 Min. : 1.000
## 1st Qu.:1.000 1st Qu.: 1.000 1st Qu.: 1.000
## Median :1.000 Median : 1.000 Median : 7.000
## Mean :2.024 Mean : 3.716 Mean : 5.754
## 3rd Qu.:3.000 3rd Qu.: 4.000 3rd Qu.: 7.000
## Max. :8.000 Max. :28.000 Max. :25.000
##
## time_in_hospital payer_code medical_specialty
## Min. : 1.000 ? :40256 ? :49949
## 1st Qu.: 2.000 MC :32439 InternalMedicine :14635
## Median : 4.000 HM : 6274 Emergency/Trauma : 7565
## Mean : 4.396 SP : 5007 Family/GeneralPractice: 7440
## 3rd Qu.: 6.000 BC : 4655 Cardiology : 5352
## Max. :14.000 MD : 3532 Surgery-General : 3099
## (Other): 9603 (Other) :13726
## num_lab_procedures num_procedures num_medications number_outpatient
## Min. : 1.0 Min. :0.00 Min. : 1.00 Min. : 0.0000
## 1st Qu.: 31.0 1st Qu.:0.00 1st Qu.:10.00 1st Qu.: 0.0000
## Median : 44.0 Median :1.00 Median :15.00 Median : 0.0000
## Mean : 43.1 Mean :1.34 Mean :16.02 Mean : 0.3694
## 3rd Qu.: 57.0 3rd Qu.:2.00 3rd Qu.:20.00 3rd Qu.: 0.0000
## Max. :132.0 Max. :6.00 Max. :81.00 Max. :42.0000
##
## number_emergency number_inpatient diag_1 diag_2
## Min. : 0.0000 Min. : 0.0000 428 : 6862 276 : 6752
## 1st Qu.: 0.0000 1st Qu.: 0.0000 414 : 6581 428 : 6662
## Median : 0.0000 Median : 0.0000 786 : 4016 250 : 6071
## Mean : 0.1978 Mean : 0.6356 410 : 3614 427 : 5036
## 3rd Qu.: 0.0000 3rd Qu.: 1.0000 486 : 3508 401 : 3736
## Max. :76.0000 Max. :21.0000 427 : 2766 496 : 3305
## (Other):74419 (Other):70204
## diag_3 number_diagnoses max_glu_serum A1Cresult
## 250 :11555 Min. : 1.000 >200: 1485 >7 : 3812
## 401 : 8289 1st Qu.: 6.000 >300: 1264 >8 : 8216
## 276 : 5175 Median : 8.000 None:96420 None:84748
## 428 : 4577 Mean : 7.423 Norm: 2597 Norm: 4990
## 427 : 3955 3rd Qu.: 9.000
## 414 : 3664 Max. :16.000
## (Other):64551
## metformin repaglinide nateglinide chlorpropamide
## Down : 575 Down : 45 Down : 11 Down : 1
## No :81778 No :100227 No :101063 No :101680
## Steady:18346 Steady: 1384 Steady: 668 Steady: 79
## Up : 1067 Up : 110 Up : 24 Up : 6
##
##
##
## glimepiride acetohexamide glipizide glyburide
## Down : 194 No :101765 Down : 560 Down : 564
## No :96575 Steady: 1 No :89080 No :91116
## Steady: 4670 Steady:11356 Steady: 9274
## Up : 327 Up : 770 Up : 812
##
##
##
## tolbutamide pioglitazone rosiglitazone acarbose
## No :101743 Down : 118 Down : 87 Down : 3
## Steady: 23 No :94438 No :95401 No :101458
## Steady: 6976 Steady: 6100 Steady: 295
## Up : 234 Up : 178 Up : 10
##
##
##
## miglitol troglitazone tolazamide examide citoglipton
## Down : 5 No :101763 No :101727 No:101766 No:101766
## No :101728 Steady: 3 Steady: 38
## Steady: 31 Up : 1
## Up : 2
##
##
##
## insulin glyburide.metformin glipizide.metformin
## Down :12218 Down : 6 No :101753
## No :47383 No :101060 Steady: 13
## Steady:30849 Steady: 692
## Up :11316 Up : 8
##
##
##
## glimepiride.pioglitazone metformin.rosiglitazone metformin.pioglitazone
## No :101765 No :101764 No :101765
## Steady: 1 Steady: 2 Steady: 1
##
##
##
##
##
## change diabetesMed readmitted
## Ch:47011 No :23403 <30:11357
## No:54755 Yes:78363 >30:35545
## NO :54864
##
##
##
##
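In the summary above, `?` appears as its own level in race, weight, payer_code, and medical_specialty, which suggests the file uses `?` as a placeholder for missing values. The per-level counts also indicate that the text columns were read as factors; from R 4.0.0 onward `read.csv` no longer does this by default. The following is a minimal sketch, not part of the original analysis, of how both points could be handled. It assumes that `?` really does mean "missing", and it uses a tiny made-up CSV (`csvText`, `diabetesSample`, and `missingCounts` are hypothetical names) so that it runs without the downloaded file.

```r
# Tiny stand-in for diabetic_data.csv: hypothetical rows, not copied from the real file
csvText <- "race,gender,weight,time_in_hospital
Caucasian,Female,?,3
?,Male,[75-100),5
AfricanAmerican,Female,?,2"

# na.strings = "?" converts the placeholder to NA (assumption: "?" means missing).
# stringsAsFactors = TRUE keeps text columns as factors, so summary() reports
# counts per level on R >= 4.0.0 as well.
diabetesSample <- read.csv(text = csvText, header = TRUE,
                           na.strings = "?", stringsAsFactors = TRUE)

# Number of missing values in each column
missingCounts <- colSums(is.na(diabetesSample))
print(missingCounts)

# Missing values are now reported as NA's instead of as a "?" level
summary(diabetesSample)
```

To apply the same idea to the full dataset, the `text = csvText` argument would be replaced by the path to diabetic_data.csv used earlier.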