Leena Chan

The following R code loads the Diabetes dataset found at UC Irvine’s Machine Learning Repository – https://archive.ics.uci.edu/ml/datasets/Diabetes+130-US+hospitals+for+years+1999-2008. It reads the .csv file, loads it into a dataframe and produces a summary of the data.

#use read.csv function to import data into dataframe diabetesData0
diabetesData <- read.csv("C:\\Users\\idea\\Documents\\HHA551 - R\\dataset_diabetes\\dataset_diabetes\\diabetic_data.csv", header = TRUE)

summary(diabetesData)
##   encounter_id        patient_nbr                     race      
##  Min.   :    12522   Min.   :      135   ?              : 2273  
##  1st Qu.: 84961194   1st Qu.: 23413221   AfricanAmerican:19210  
##  Median :152388987   Median : 45505143   Asian          :  641  
##  Mean   :165201646   Mean   : 54330401   Caucasian      :76099  
##  3rd Qu.:230270888   3rd Qu.: 87545950   Hispanic       : 2037  
##  Max.   :443867222   Max.   :189502619   Other          : 1506  
##                                                                 
##              gender           age              weight     
##  Female         :54708   [70-80):26068   ?        :98569  
##  Male           :47055   [60-70):22483   [75-100) : 1336  
##  Unknown/Invalid:    3   [50-60):17256   [50-75)  :  897  
##                          [80-90):17197   [100-125):  625  
##                          [40-50): 9685   [125-150):  145  
##                          [30-40): 3775   [25-50)  :   97  
##                          (Other): 5302   (Other)  :   97  
##  admission_type_id discharge_disposition_id admission_source_id
##  Min.   :1.000     Min.   : 1.000           Min.   : 1.000     
##  1st Qu.:1.000     1st Qu.: 1.000           1st Qu.: 1.000     
##  Median :1.000     Median : 1.000           Median : 7.000     
##  Mean   :2.024     Mean   : 3.716           Mean   : 5.754     
##  3rd Qu.:3.000     3rd Qu.: 4.000           3rd Qu.: 7.000     
##  Max.   :8.000     Max.   :28.000           Max.   :25.000     
##                                                                
##  time_in_hospital   payer_code                 medical_specialty
##  Min.   : 1.000   ?      :40256   ?                     :49949  
##  1st Qu.: 2.000   MC     :32439   InternalMedicine      :14635  
##  Median : 4.000   HM     : 6274   Emergency/Trauma      : 7565  
##  Mean   : 4.396   SP     : 5007   Family/GeneralPractice: 7440  
##  3rd Qu.: 6.000   BC     : 4655   Cardiology            : 5352  
##  Max.   :14.000   MD     : 3532   Surgery-General       : 3099  
##                   (Other): 9603   (Other)               :13726  
##  num_lab_procedures num_procedures num_medications number_outpatient
##  Min.   :  1.0      Min.   :0.00   Min.   : 1.00   Min.   : 0.0000  
##  1st Qu.: 31.0      1st Qu.:0.00   1st Qu.:10.00   1st Qu.: 0.0000  
##  Median : 44.0      Median :1.00   Median :15.00   Median : 0.0000  
##  Mean   : 43.1      Mean   :1.34   Mean   :16.02   Mean   : 0.3694  
##  3rd Qu.: 57.0      3rd Qu.:2.00   3rd Qu.:20.00   3rd Qu.: 0.0000  
##  Max.   :132.0      Max.   :6.00   Max.   :81.00   Max.   :42.0000  
##                                                                     
##  number_emergency  number_inpatient      diag_1          diag_2     
##  Min.   : 0.0000   Min.   : 0.0000   428    : 6862   276    : 6752  
##  1st Qu.: 0.0000   1st Qu.: 0.0000   414    : 6581   428    : 6662  
##  Median : 0.0000   Median : 0.0000   786    : 4016   250    : 6071  
##  Mean   : 0.1978   Mean   : 0.6356   410    : 3614   427    : 5036  
##  3rd Qu.: 0.0000   3rd Qu.: 1.0000   486    : 3508   401    : 3736  
##  Max.   :76.0000   Max.   :21.0000   427    : 2766   496    : 3305  
##                                      (Other):74419   (Other):70204  
##      diag_3      number_diagnoses max_glu_serum A1Cresult   
##  250    :11555   Min.   : 1.000   >200: 1485    >7  : 3812  
##  401    : 8289   1st Qu.: 6.000   >300: 1264    >8  : 8216  
##  276    : 5175   Median : 8.000   None:96420    None:84748  
##  428    : 4577   Mean   : 7.423   Norm: 2597    Norm: 4990  
##  427    : 3955   3rd Qu.: 9.000                             
##  414    : 3664   Max.   :16.000                             
##  (Other):64551                                              
##   metformin     repaglinide     nateglinide     chlorpropamide 
##  Down  :  575   Down  :    45   Down  :    11   Down  :     1  
##  No    :81778   No    :100227   No    :101063   No    :101680  
##  Steady:18346   Steady:  1384   Steady:   668   Steady:    79  
##  Up    : 1067   Up    :   110   Up    :    24   Up    :     6  
##                                                                
##                                                                
##                                                                
##  glimepiride    acetohexamide    glipizide      glyburide    
##  Down  :  194   No    :101765   Down  :  560   Down  :  564  
##  No    :96575   Steady:     1   No    :89080   No    :91116  
##  Steady: 4670                   Steady:11356   Steady: 9274  
##  Up    :  327                   Up    :  770   Up    :  812  
##                                                              
##                                                              
##                                                              
##  tolbutamide     pioglitazone   rosiglitazone    acarbose     
##  No    :101743   Down  :  118   Down  :   87   Down  :     3  
##  Steady:    23   No    :94438   No    :95401   No    :101458  
##                  Steady: 6976   Steady: 6100   Steady:   295  
##                  Up    :  234   Up    :  178   Up    :    10  
##                                                               
##                                                               
##                                                               
##    miglitol      troglitazone     tolazamide     examide     citoglipton
##  Down  :     5   No    :101763   No    :101727   No:101766   No:101766  
##  No    :101728   Steady:     3   Steady:    38                          
##  Steady:    31                   Up    :     1                          
##  Up    :     2                                                          
##                                                                         
##                                                                         
##                                                                         
##    insulin      glyburide.metformin glipizide.metformin
##  Down  :12218   Down  :     6       No    :101753      
##  No    :47383   No    :101060       Steady:    13      
##  Steady:30849   Steady:   692                          
##  Up    :11316   Up    :     8                          
##                                                        
##                                                        
##                                                        
##  glimepiride.pioglitazone metformin.rosiglitazone metformin.pioglitazone
##  No    :101765            No    :101764           No    :101765         
##  Steady:     1            Steady:     2           Steady:     1         
##                                                                         
##                                                                         
##                                                                         
##                                                                         
##                                                                         
##  change     diabetesMed readmitted 
##  Ch:47011   No :23403   <30:11357  
##  No:54755   Yes:78363   >30:35545  
##                         NO :54864  
##                                    
##                                    
##                                    
##