Project Name: Does oral and dental hygiene plays a role in glaucoma?

Team Member: Hui Han, Jun Pan

Motivation: 1n 2017, Dr. Casella reported that oral and dental hygiene plays an important role in glaucoma (January 2017, OptemeryTimes.com).  This pilot study (119 cases) does well to point out the fact that the elevated presence of certain bacterial species in the oral cavity can serve as a catalyst for a pro-inflammatory response on the part of the immune system, which will lead to glaucoma.  However, there is few report about the correlation between glaucoma and dental/oral hygiene.   Thus, it is necessary to study the relationship between dental/oral hygiene on a bigger scale database.

Data Source: Korea National Health and Nutrition Examination Survey (KNHANES http://knhanes.cdc.go.kr) has a total of 13,831 participants with age 40 or above.  This survey uses a complex, stratified, multistage, probability-cluster survey. There are more than 500 papers published on professional journal since 1998.  The data based which we are going to use are the following years (2008, 2010, and 2011).

Data Science Workflow: (1) read.csv files; (2) combine, clean, subset, transformation; (3) demograhic information to describe the study population, logistic regression analyses will be used to evaluate the factor of dental/oral hygiene for glaucoma.  The result will be adjust by possible impact of other factors such as smoking, diabetes, and  hypertenstion.

The final results will be published on professinal medical journal next year. 
require(rvest)
## Loading required package: rvest
## Loading required package: xml2
require(dplyr)
## Loading required package: dplyr
## 
## Attaching package: 'dplyr'
## The following objects are masked from 'package:stats':
## 
##     filter, lag
## The following objects are masked from 'package:base':
## 
##     intersect, setdiff, setequal, union
require(stringr)
## Loading required package: stringr
require(tidyr)
## Loading required package: tidyr
require(dplyr)
require(ggplot2)
## Loading required package: ggplot2
data2008 <- read.csv("https://raw.githubusercontent.com/johnpannyc/DATA-607-Final-Project-/master/Complete%20Year%202008%20Glaucoma.csv")
data2010 <- read.csv("https://raw.githubusercontent.com/johnpannyc/DATA-607-Final-Project-/master/Complete%20Year%202010%20Glaucoma.csv")
data2011 <- read.csv("https://raw.githubusercontent.com/johnpannyc/DATA-607-Final-Project-/27db2de104e0dc35765409fd5edcc016a3a5d768/Complete%20Year%202011%20Glaucoma.csv")
dim(data2008)
## [1] 2418  562
dim(data2010)
## [1] 4280  555
dim(data2011)
## [1] 4386  561
head(data2011)
##           ID   ID_fam year region town_t apt_t  psu sex age age_month incm
## 1 A951929601 A9519296 2011      1      1     1 A951   2  56   ISBLANK    3
## 2 A951930002 A9519300 2011      1      1     1 A951   2  70   ISBLANK    2
## 3 A951930201 A9519302 2011      1      1     1 A951   1  42   ISBLANK    4
## 4 A951930202 A9519302 2011      1      1     1 A951   2  42   ISBLANK    4
## 5 A951930801 A9519308 2011      1      1     1 A951   2  47   ISBLANK    3
## 6 A951932301 A9519323 2011      1      1     1 A951   2  63   ISBLANK    1
##   ho_incm edu occp    wt_hs  wt_itvex       wt_hm      wt_pft  wt_ex1
## 1       3   3    1 7661.228 13586.119 31541.74686 6059.906636 ISBLANK
## 2       1   1    7 7661.228  3981.011 107471.7318 9195.151729 ISBLANK
## 3       4   3    5 7661.228 12597.407 30456.09323 13977.56203 ISBLANK
## 4       4   4    7 7661.228  8003.439     ISBLANK  11160.9696 ISBLANK
## 5       4   4    7 7661.228 14768.027     ISBLANK     ISBLANK ISBLANK
## 6       1   1    6 7661.228  4709.705     ISBLANK 5083.051121 ISBLANK
##        wt_ntr      wt_tot     wt_pfhm     wt_pfnt     wt_hmnt   wt_pfhmnt
## 1 6400.330178 16005.99577 14883.92246 6101.928222 31173.47473 14905.86465
## 2 3824.995304 4179.593539 163186.4917 9594.885057  113576.975 175129.3538
## 3  13890.4606 17499.93106 41632.06961 18282.14762 33874.63636 51274.66011
## 4 8445.844108 8783.223965     ISBLANK 10742.20747     ISBLANK     ISBLANK
## 5 13876.54749 17096.83208     ISBLANK     ISBLANK     ISBLANK     ISBLANK
## 6 4975.889577 4557.730987     ISBLANK 5312.639892     ISBLANK     ISBLANK
##   wt_ex1nt wt_ex1pf wt_ex1hm wt_ex1pfnt wt_ex1hmnt wt_tot1 wt_tot1nt
## 1  ISBLANK  ISBLANK  ISBLANK    ISBLANK    ISBLANK ISBLANK   ISBLANK
## 2  ISBLANK  ISBLANK  ISBLANK    ISBLANK    ISBLANK ISBLANK   ISBLANK
## 3  ISBLANK  ISBLANK  ISBLANK    ISBLANK    ISBLANK ISBLANK   ISBLANK
## 4  ISBLANK  ISBLANK  ISBLANK    ISBLANK    ISBLANK ISBLANK   ISBLANK
## 5  ISBLANK  ISBLANK  ISBLANK    ISBLANK    ISBLANK ISBLANK   ISBLANK
## 6  ISBLANK  ISBLANK  ISBLANK    ISBLANK    ISBLANK ISBLANK   ISBLANK
##   kstrata kstrata0 E_ex E_Q_EX E_Q_SUN E_Q_FAM E_Q_FAM1 E_Q_RM E_Q_EXC
## 1     512      517    1      3       1       2   888888      4       8
## 2     512      517    1      3       1       1        6      4       8
## 3     512      517    1      3       2       2   888888      4       8
## 4     512      517    1      5       1       1        2      3       8
## 5     512      517    1      1       1       1        2      2       8
## 6     512      517    1      2       1       1        2      3       8
##   E_Q_EXC1 E_BL_V E_BL E_DMH E_PEO E_PEO_1 E_DES_dg E_DES_ds E_DH2_dg
## 1   888888     89    2     9     2  888888        2        2        2
## 2   888888     89    2     9     1       2        1        1        1
## 3   888888    112    2     9     2  888888        2        2        2
## 4   888888     97    2     9     2  888888        2        1        2
## 5   888888     87    2     9     2  888888        2        2        2
## 6   888888    102    2     9     2  888888        2        2        2
##   E_AMD_dg E_DH3_dg E_DH3_dt E_CO E_CO_1 E_CR_M E_CR_1 E_DR_DSPH E_DR_DCYL
## 1        2        2        8    1      8      1      2      0.50     -0.50
## 2        2        2        8    1      8      1      2      0.75     -0.75
## 3        2        2        8    1      8      1      2      0.25     -0.25
## 4        2        2        8    1      8      1      2     -1.00     -0.75
## 5        2        2        8    1      8      2      2     -5.50     -0.50
## 6        2        2        8    1      8      1      4      1.75     -0.25
##   E_DR_A E_DR_2_1 E_DR_3_1 E_CL_M E_CL_1 E_DL_DSPH E_DL_DCYL E_DL_A
## 1     78       88       88      1      2      0.75     -0.25    107
## 2     13       88       88      1      2     -0.25     -0.50     13
## 3     54       88       88      1      1      0.00     -0.50     77
## 4     12       88       88      1      3     -0.75     -0.25    167
## 5     13       88       88      2      1     -5.50     -1.00    165
## 6     74        1       88      1      4      1.00     -0.25    110
##   E_DL_2_1 E_DL_3_1 E_VS_DS E_VS_MYO E_VS_HYPER E_VS_AST E_SQ_1 E_SQ_2
## 1       88       88       0        0          0        0      3      1
## 2       88       88       0        0          0        1      3      1
## 3       88       88       0        0          0        0      3      1
## 4        1       88       0        1          0        1      3      1
## 5       88       88       0        1          0        1      3      1
## 6        1       88       0        0          1        0      3      1
##   E_SQ_3 E_SQ_3_OTH E_SQ_4 E_VS_SQ E_MRD_R E_MRD_L E_LF_R E_LF_L E_VS_MRD
## 1      1    ISBLANK      1       0       1       1      1      1        0
## 2      1    ISBLANK      1       0       3       3      2      2        0
## 3      1    ISBLANK      1       0       4       2      2      1        1
## 4      1    ISBLANK      1       0       2       2      1      1        0
## 5      1    ISBLANK      1       0       1       1      1      1        0
## 6      1    ISBLANK      1       0       3       3      2      2        0
##   E_TR_Y E_TR_C E_TL_Y E_TL_C E_VS_TY E_VS_TY_C E_WR E_WR_T E_WR_C E_WR_L
## 1      2      8      2      8       0         0    2      8      8  888.8
## 2      3      8      3      8       1         0    2      8      8  888.8
## 3      2      8      2      8       0         0    2      8      8  888.8
## 4      2      8      2      8       0         0    2      8      8  888.8
## 5      2      8      2      8       0         0    2      8      8  888.8
## 6      2      8      2      8       0         0    2      8      8  888.8
##   E_WR_R E_WL E_WL_T E_WL_C E_WL_L E_WL_R E_VS_WY E_GR_P E_GR_1 E_GR_2
## 1      8    2      8      8  888.8      8       0     16      0      0
## 2      8    2      8      8  888.8      8       0     12      0      0
## 3      8    2      8      8  888.8      8       0     20      0      0
## 4      8    2      8      8  888.8      8       0     11      0      0
## 5      8    2      8      8  888.8      8       0     14      0      0
## 6      8    2      8      8  888.8      8       0     16      0      0
##   E_GR_3 E_GR_4 E_GL_P E_GL_1 E_GL_2 E_GL_3 E_GL_4 E_AMR_RT E_AMR_LT
## 1      0      3     16      0      0      0      3        0        0
## 2      0      3     12      0      0      0      3        0        0
## 3      0      3     19      0      0      0      3        0        0
## 4      0      3     11      0      0      0      3        0        0
## 5      0      3     15      0      0      0      3        0        0
## 6      0      3     17      0      0      0      3        0        0
##   E_NMFP_RC E_NMFP_LC E_AMD_ELY_R E_AMD_LATE_R E_AMD_WET_R E_AMD_DRY_R
## 1   ISBLANK   ISBLANK           0            0           0           0
## 2   ISBLANK   ISBLANK           0            0           0           0
## 3   ISBLANK   ISBLANK           0            0           0           0
## 4   ISBLANK   ISBLANK           0            0           0           0
## 5   ISBLANK   ISBLANK           0            0           0           0
## 6   ISBLANK   ISBLANK           0            0           0           0
##   E_AMD_ELY_L E_AMD_LATE_L E_AMD_WET_L E_AMD_DRY_L E_DRP_R E_DRP_L E_GRV
## 1           0            0           0           0       0       0   0.4
## 2           0            0           0           0       0       0   0.3
## 3           0            0           0           0       0       0   0.4
## 4           0            0           0           0       0       0   0.4
## 5           0            0           0           0       0       0   0.4
## 6           0            0           0           0       0       0   0.2
##   Vertical.CD.Ratio.Difference E_GRH Horizontal.CD.Ratio.Difference
## 1                            0   0.4                              0
## 2                            0   0.3                              0
## 3                            0   0.4                              0
## 4                            0   0.4                              0
## 5                            0   0.4                              0
## 6                            0   0.2                              0
##   E_GR_ISNT E_GR_B E_GR_F1 E_GR_F2 E_GLV E_GLH E_GL_ISNT E_GL_B E_GL_F1
## 1         0      0       0       0   0.4   0.4         0      0       0
## 2         0      0       0       0   0.3   0.3         0      0       0
## 3         0      0       0       0   0.4   0.4         0      0       0
## 4         0      0       0       0   0.4   0.4         0      0       0
## 5         0      0       0       0   0.4   0.4         0      0       0
## 6         0      0       0       0   0.2   0.2         0      0       0
##   E_GL_F2 E_VS_DM E_VS_ELY E_VS_LATE E_VS_WET E_VS_DRY E_GRT1 E_GRT2
## 1       0 ISBLANK        0         0        0        0      8      8
## 2       0 ISBLANK        0         0        0        0      8      8
## 3       0 ISBLANK        0         0        0        0      8      8
## 4       0 ISBLANK        0         0        0        0      8      8
## 5       0 ISBLANK        0         0        0        0      8      8
## 6       0 ISBLANK        0         0        0        0      8      8
##   E_GRT3 E_GRT4 E_GRT5 E_GRT6 E_GRT7 E_GRT8 E_GRT9 E_GRT10 E_GRT11 E_GRT12
## 1      8      8      8      8      8      8      8       8       8       8
## 2      8      8      8      8      8      8      8       8       8       8
## 3      8      8      8      8      8      8      8       8       8       8
## 4      8      8      8      8      8      8      8       8       8       8
## 5      8      8      8      8      8      8      8       8       8       8
## 6      8      8      8      8      8      8      8       8       8       8
##   E_GRT13 E_GRT14 E_GRT15 E_GRT16 E_GRT17 E_GRT18 E_GLT1 E_GLT2 E_GLT3
## 1       8       8       8       8       8       8      8      8      8
## 2       8       8       8       8       8       8      8      8      8
## 3       8       8       8       8       8       8      8      8      8
## 4       8       8       8       8       8       8      8      8      8
## 5       8       8       8       8       8       8      8      8      8
## 6       8       8       8       8       8       8      8      8      8
##   E_GLT4 E_GLT5 E_GLT6 E_GLT7 E_GLT8 E_GLT9 E_GLT10 E_GLT11 E_GLT12
## 1      8      8      8      8      8      8       8       8       8
## 2      8      8      8      8      8      8       8       8       8
## 3      8      8      8      8      8      8       8       8       8
## 4      8      8      8      8      8      8       8       8       8
## 5      8      8      8      8      8      8       8       8       8
## 6      8      8      8      8      8      8       8       8       8
##   E_GLT13 E_GLT14 E_GLT15 E_GLT16 E_GLT17 E_GLT18 E_GRS E_GRF E_GRF_P
## 1       8       8       8       8       8       8  8888     8    8888
## 2       8       8       8       8       8       8  8888     8    8888
## 3       8       8       8       8       8       8  8888     8    8888
## 4       8       8       8       8       8       8  8888     8    8888
## 5       8       8       8       8       8       8  8888     8    8888
## 6       8       8       8       8       8       8  8888     8    8888
##   E_GRP E_GRP_P E_GLS E_GLF E_GLF_P E_GLP E_GLP_P E_DR_1 E_DR_2 E_DR_3
## 1     8    8888  8888     8    8888     8    8888      0      0      0
## 2     8    8888  8888     8    8888     8    8888      0      0      0
## 3     8    8888  8888     8    8888     8    8888      0      0      0
## 4     8    8888  8888     8    8888     8    8888      0      0      0
## 5     8    8888  8888     8    8888     8    8888      0      0      0
## 6     8    8888  8888     8    8888     8    8888      0      0      0
##   E_DR_4 E_DR_5 E_DR_6 E_DR_7 E_DR_8 E_DR_9 E_DR_10 E_DR_11 E_DR_12
## 1      0      0      0      0      0      0       0       0       0
## 2      0      0      0      0      0      0       0       0       0
## 3      0      0      0      0      0      0       0       0       0
## 4      0      0      0      0      0      0       0       0       0
## 5      0      0      0      0      0      0       0       0       0
## 6      0      0      0      0      0      0       0       0       0
##   E_DR_13 E_DR_14 E_DL_1 E_DL_2 E_DL_3 E_DL_4 E_DL_5 E_DL_6 E_DL_7 E_DL_8
## 1       0       0      0      0      0      0      0      0      0      0
## 2       0       0      0      0      0      0      0      0      0      0
## 3       0       0      0      0      0      0      0      0      0      0
## 4       0       0      0      0      0      0      0      0      0      0
## 5       0       0      0      0      0      0      0      0      0      0
## 6       0       0      0      0      0      0      0      0      0      0
##   E_DL_9 E_DL_10 E_DL_11 E_DL_12 E_DL_13 E_DL_14       ID.1 ID_fam.1
## 1      0       0       0       0       0       0 A951929601 A9519296
## 2      0       0       0       0       0       0 A951930002 A9519300
## 3      0       0       0       0       0       0 A951930201 A9519302
## 4      0       0       0       0       0       0 A951930202 A9519302
## 5      0       0       0       0       0       0 A951930801 A9519308
## 6      0       0       0       0       0       0 A951932301 A9519323
##   year.1 region.1 town_t.1 apt_t.1 psu.1 sex.1 age.1 age_month.1 incm.1
## 1   2011        1        1       1  A951     2    56     ISBLANK      3
## 2   2011        1        1       1  A951     2    70     ISBLANK      2
## 3   2011        1        1       1  A951     1    42     ISBLANK      4
## 4   2011        1        1       1  A951     2    42     ISBLANK      4
## 5   2011        1        1       1  A951     2    47     ISBLANK      3
## 6   2011        1        1       1  A951     2    63     ISBLANK      1
##   ho_incm.1 edu.1 occp.1  wt_hs.1 wt_itvex.1    wt_pft.1     wt_hm.1
## 1         3     3      1 7661.228  13586.119 6059.906636 31541.74686
## 2         1     1      7 7661.228   3981.011 9195.151729 107471.7318
## 3         4     3      5 7661.228  12597.407 13977.56203 30456.09323
## 4         4     4      7 7661.228   8003.439  11160.9696     ISBLANK
## 5         4     4      7 7661.228  14768.027     ISBLANK     ISBLANK
## 6         1     1      6 7661.228   4709.705 5083.051121     ISBLANK
##   wt_ex1.1    wt_ntr.1    wt_tot.1   wt_pfhm.1   wt_pfnt.1   wt_hmnt.1
## 1  ISBLANK 6400.330178 16005.99577 14883.92246 6101.928222 31173.47473
## 2  ISBLANK 3824.995304 4179.593539 163186.4917 9594.885057  113576.975
## 3  ISBLANK  13890.4606 17499.93106 41632.06961 18282.14762 33874.63636
## 4  ISBLANK 8445.844108 8783.223965     ISBLANK 10742.20747     ISBLANK
## 5  ISBLANK 13876.54749 17096.83208     ISBLANK     ISBLANK     ISBLANK
## 6  ISBLANK 4975.889577 4557.730987     ISBLANK 5312.639892     ISBLANK
##   wt_pfhmnt.1 wt_ex1nt.1 wt_ex1pf.1 wt_ex1hm.1 wt_ex1pfnt.1 wt_ex1hmnt.1
## 1 14905.86465    ISBLANK    ISBLANK    ISBLANK      ISBLANK      ISBLANK
## 2 175129.3538    ISBLANK    ISBLANK    ISBLANK      ISBLANK      ISBLANK
## 3 51274.66011    ISBLANK    ISBLANK    ISBLANK      ISBLANK      ISBLANK
## 4     ISBLANK    ISBLANK    ISBLANK    ISBLANK      ISBLANK      ISBLANK
## 5     ISBLANK    ISBLANK    ISBLANK    ISBLANK      ISBLANK      ISBLANK
## 6     ISBLANK    ISBLANK    ISBLANK    ISBLANK      ISBLANK      ISBLANK
##   wt_tot1.1 wt_tot1nt.1 kstrata.1 kstrata0.1 O_DTD O_DTP O_DID O_DIP
## 1   ISBLANK     ISBLANK       512        517     0     0     0     0
## 2   ISBLANK     ISBLANK       512        517     0     0     0     0
## 3   ISBLANK     ISBLANK       512        517     0     0     0     0
## 4   ISBLANK     ISBLANK       512        517     0     0     0     0
## 5   ISBLANK     ISBLANK       512        517     0     0     0     0
## 6   ISBLANK     ISBLANK       512        517     0     1     0     1
##   O_DFTD O_DMFTP O_DFID O_DMFIP O_pain O_TMJ O_ortho O_55B O_55D O_55O
## 1      0       3      0       1      0     0       0     9     9     9
## 2      0       4      0       1      0     0       0     9     9     9
## 3      0       6      0       1      0     0       0     9     9     9
## 4      0       7      0       1      0     0       0     9     9     9
## 5      0       6      0       1      1     1       0     9     9     9
## 6      0      14      0       1      1     0       0     9     9     9
##   O_55M O_55L O_54B O_54D O_54O O_54M O_54L O_53B O_53D O_53M O_53L O_52B
## 1     9     9     9     9     9     9     9     9     9     9     9     9
## 2     9     9     9     9     9     9     9     9     9     9     9     9
## 3     9     9     9     9     9     9     9     9     9     9     9     9
## 4     9     9     9     9     9     9     9     9     9     9     9     9
## 5     9     9     9     9     9     9     9     9     9     9     9     9
## 6     9     9     9     9     9     9     9     9     9     9     9     9
##   O_52D O_52M O_52L O_51B O_51D O_51M O_51L O_61B O_61M O_61D O_61L O_62B
## 1     9     9     9     9     9     9     9     9     9     9     9     9
## 2     9     9     9     9     9     9     9     9     9     9     9     9
## 3     9     9     9     9     9     9     9     9     9     9     9     9
## 4     9     9     9     9     9     9     9     9     9     9     9     9
## 5     9     9     9     9     9     9     9     9     9     9     9     9
## 6     9     9     9     9     9     9     9     9     9     9     9     9
##   O_62M O_62D O_62L O_63B O_63M O_63D O_63L O_64B O_64M O_64O O_64D O_64L
## 1     9     9     9     9     9     9     9     9     9     9     9     9
## 2     9     9     9     9     9     9     9     9     9     9     9     9
## 3     9     9     9     9     9     9     9     9     9     9     9     9
## 4     9     9     9     9     9     9     9     9     9     9     9     9
## 5     9     9     9     9     9     9     9     9     9     9     9     9
## 6     9     9     9     9     9     9     9     9     9     9     9     9
##   O_65B O_65M O_65O O_65D O_65L O_18B O_18D O_18O O_18M O_18L O_17B O_17D
## 1     9     9     9     9     9     8     8     8     8     8     0     0
## 2     9     9     9     9     9     8     8     8     8     8     5     5
## 3     9     9     9     9     9     5     5     5     5     5     0     0
## 4     9     9     9     9     9     5     5     5     5     5     0     0
## 5     9     9     9     9     9     8     8     8     8     8     0     0
## 6     9     9     9     9     9     8     8     8     8     8     1     1
##   O_17O O_17M O_17L O_16B O_16D O_16O O_16M O_16L O_15B O_15D O_15O O_15M
## 1     0     0     0     3     3     3     3     3     0     0     0     0
## 2     5     5     5     5     5     5     5     5     7     7     7     7
## 3     3     0     3     0     0     0     0     0     0     0     0     0
## 4     3     0     3     0     0     3     0     3     0     0     0     0
## 5     3     3     0     0     3     3     0     0     0     0     0     0
## 6     1     1     1     4     4     4     4     4     4     4     4     4
##   O_15L O_14B O_14D O_14O O_14M O_14L O_13B O_13D O_13M O_13L O_12B O_12D
## 1     0     0     0     0     0     0     7     7     7     7     7     7
## 2     7     5     5     5     5     5     5     5     5     5     7     7
## 3     0     0     0     0     0     0     0     0     0     0     0     0
## 4     0     0     0     0     0     0     0     0     0     0     0     0
## 5     0     0     0     0     0     0     0     0     0     0     0     0
## 6     4     0     0     0     0     0     0     0     0     0     0     0
##   O_12M O_12L O_11B O_11D O_11M O_11L O_21B O_21M O_21D O_21L O_22B O_22M
## 1     7     7     7     7     7     7     7     7     7     7     7     7
## 2     7     7     7     7     7     7     7     7     5     5     5     5
## 3     0     0     0     0     0     0     0     0     0     0     0     0
## 4     0     0     0     0     0     0     0     0     0     0     0     0
## 5     0     0     0     0     0     0     0     0     0     0     0     0
## 6     0     0     0     0     0     0     0     0     0     0     0     0
##   O_22D O_22L O_23B O_23M O_23D O_23L O_24B O_24M O_24O O_24D O_24L O_25B
## 1     7     7     7     7     0     0     0     0     0     0     0     0
## 2     5     5     5     5     5     5     5     5     5     5     5     5
## 3     0     0     0     0     0     0     0     0     0     0     0     0
## 4     0     0     0     0     0     0     0     0     0     0     0     0
## 5     0     0     0     0     0     0     0     0     0     0     0     0
## 6     4     4     4     4     4     4     4     4     4     4     4     4
##   O_25M O_25O O_25D O_25L O_26B O_26M O_26O O_26D O_26L O_27B O_27M O_27O
## 1     0     0     3     3     3     3     3     4     4     4     4     4
## 2     5     5     5     5     5     5     5     7     7     7     7     7
## 3     0     0     0     0     0     0     0     0     0     3     0     3
## 4     0     0     0     0     3     0     3     0     0     3     0     3
## 5     0     0     0     0     0     0     0     0     0     0     0     0
## 6     4     4     4     4     4     4     4     4     4     4     4     4
##   O_27D O_27L O_28B O_28M O_28O O_28D O_28L O_48L O_48D O_48O O_48M O_48B
## 1     8     8     8     8     8     8     8     8     8     8     0     0
## 2     8     8     8     8     8     8     8     8     8     8     7     7
## 3     5     5     5     5     5     5     5     5     5     5     0     0
## 4     5     5     5     5     5     5     5     5     5     5     0     0
## 5     0     0     0     0     0     8     8     8     8     8     7     7
## 6     8     8     8     8     8     8     8     8     8     8     0     0
##   O_47L O_47D O_47O O_47M O_47B O_46L O_46D O_46O O_46M O_46B O_45L O_45D
## 1     0     0     0     0     0     0     0     0     0     0     0     0
## 2     7     7     7     4     4     4     4     4     4     4     4     4
## 3     3     0     0     0     0     0     0     0     0     0     0     0
## 4     0     0     3     0     0     3     0     3     0     0     0     0
## 5     7     7     7     4     4     4     4     4     7     7     7     7
## 6     3     0     3     3     3     3     3     3     3     3     3     3
##   O_45O O_45M O_45B O_44L O_44D O_44O O_44M O_44B O_43L O_43D O_43M O_43B
## 1     0     0     0     0     0     0     0     0     0     0     7     7
## 2     4     7     7     7     7     7     0     0     0     0     0     0
## 3     0     0     0     0     0     0     0     0     0     0     0     0
## 4     0     0     0     0     0     0     0     0     0     0     0     0
## 5     7     0     0     0     0     0     0     0     0     0     0     0
## 6     3     0     0     0     0     0     7     7     7     7     5     5
##   O_42L O_42D O_42M O_42B O_41L O_41D O_41M O_41B O_31L O_31M O_31D O_31B
## 1     7     7     7     7     7     7     7     7     7     7     7     7
## 2     0     0     0     0     0     0     0     0     0     0     0     0
## 3     0     0     0     0     0     0     0     0     0     0     0     0
## 4     0     0     0     0     0     0     0     0     0     0     0     0
## 5     0     0     0     0     0     0     0     0     0     0     0     0
## 6     5     5     5     5     5     5     5     5     5     5     5     5
##   O_32L O_32M O_32D O_32B O_33L O_33M O_33D O_33B O_34L O_34M O_34O O_34D
## 1     7     7     0     0     0     0     0     0     0     0     0     0
## 2     0     0     0     0     0     0     7     7     7     7     7     4
## 3     0     0     0     0     0     0     0     0     0     0     0     0
## 4     0     0     0     0     0     0     0     0     0     0     0     0
## 5     0     0     0     0     0     0     0     0     0     0     0     0
## 6     5     5     7     7     7     7     7     7     7     7     7     4
##   O_34B O_35L O_35M O_35O O_35D O_35B O_36L O_36M O_36O O_36D O_36B O_37L
## 1     0     0     0     0     0     0     0     0     0     0     0     0
## 2     4     4     4     4     4     4     4     4     4     4     4     7
## 3     0     0     0     3     3     0     0     0     3     3     0     0
## 4     0     0     0     0     0     0     0     0     0     0     0     0
## 5     0     0     0     3     3     0     3     3     3     3     3     3
## 6     4     4     4     4     4     4     4     4     4     4     4     4
##   O_37M O_37O O_37D O_37B O_38L O_38M O_38O O_38D O_38B O_85L O_85D O_85O
## 1     0     0     0     0     8     8     8     8     8     9     9     9
## 2     7     7     7     7     8     8     8     8     8     9     9     9
## 3     0     3     0     0     5     5     5     5     5     9     9     9
## 4     0     3     0     0     5     5     5     5     5     9     9     9
## 5     3     3     3     3     8     8     8     8     8     9     9     9
## 6     4     4     4     4     8     8     8     8     8     9     9     9
##   O_85M O_85B O_84L O_84D O_84O O_84M O_84B O_83L O_83D O_83M O_83B O_82L
## 1     9     9     9     9     9     9     9     9     9     9     9     9
## 2     9     9     9     9     9     9     9     9     9     9     9     9
## 3     9     9     9     9     9     9     9     9     9     9     9     9
## 4     9     9     9     9     9     9     9     9     9     9     9     9
## 5     9     9     9     9     9     9     9     9     9     9     9     9
## 6     9     9     9     9     9     9     9     9     9     9     9     9
##   O_82D O_82M O_82B O_81L O_81D O_81M O_81B O_71L O_71M O_71D O_71B O_72L
## 1     9     9     9     9     9     9     9     9     9     9     9     9
## 2     9     9     9     9     9     9     9     9     9     9     9     9
## 3     9     9     9     9     9     9     9     9     9     9     9     9
## 4     9     9     9     9     9     9     9     9     9     9     9     9
## 5     9     9     9     9     9     9     9     9     9     9     9     9
## 6     9     9     9     9     9     9     9     9     9     9     9     9
##   O_72M O_72D O_72B O_73L O_73M O_73D O_73B O_74L O_74M O_74O O_74D O_74B
## 1     9     9     9     9     9     9     9     9     9     9     9     9
## 2     9     9     9     9     9     9     9     9     9     9     9     9
## 3     9     9     9     9     9     9     9     9     9     9     9     9
## 4     9     9     9     9     9     9     9     9     9     9     9     9
## 5     9     9     9     9     9     9     9     9     9     9     9     9
## 6     9     9     9     9     9     9     9     9     9     9     9     9
##   O_75L O_75M O_75O O_75D O_75B O_TN55 O_TN54 O_TN53 O_TN52 O_TN51 O_TN61
## 1     9     9     9     9     9      0      0      0      0      0      0
## 2     9     9     9     9     9      0      0      0      0      0      0
## 3     9     9     9     9     9      0      0      0      0      0      0
## 4     9     9     9     9     9      0      0      0      0      0      0
## 5     9     9     9     9     9      0      0      0      0      0      0
## 6     9     9     9     9     9      0      0      0      0      0      0
##   O_TN62 O_TN63 O_TN64 O_TN65 O_TN18 O_TN17 O_TN16 O_TN15 O_TN14 O_TN13
## 1      0      0      0      0      0      0      0      0      0      0
## 2      0      0      0      0      0      0      0      0      0      0
## 3      0      0      0      0      0      0      0      0      0      0
## 4      0      0      0      0      0      0      0      0      0      0
## 5      0      0      0      0      0      0      0      0      0      0
## 6      0      0      0      0      0      3      0      0      0      0
##   O_TN12 O_TN11 O_TN21 O_TN22 O_TN23 O_TN24 O_TN25 O_TN26 O_TN27 O_TN28
## 1      0      0      0      0      0      0      0      0      0      0
## 2      0      0      0      0      0      0      0      0      0      0
## 3      0      0      0      0      0      0      0      0      0      0
## 4      0      0      0      0      0      0      0      0      0      0
## 5      0      0      0      0      0      0      0      0      0      0
## 6      0      0      0      0      0      0      0      0      0      0
##   O_TN48 O_TN47 O_TN46 O_TN45 O_TN44 O_TN43 O_TN42 O_TN41 O_TN31 O_TN32
## 1      0      0      0      0      0      0      0      0      0      0
## 2      0      0      0      0      0      0      0      0      0      0
## 3      0      0      0      0      0      0      0      0      0      0
## 4      0      0      0      0      0      0      0      0      0      0
## 5      0      0      0      0      0      0      0      0      0      0
## 6      0      0      0      0      0      0      0      0      0      0
##   O_TN33 O_TN34 O_TN35 O_TN36 O_TN37 O_TN38 O_TN85 O_TN84 O_TN83 O_TN82
## 1      0      0      0      0      0      0      0      0      0      0
## 2      0      0      0      0      0      0      0      0      0      0
## 3      0      0      0      0      0      0      0      0      0      0
## 4      0      0      0      0      0      0      0      0      0      0
## 5      0      0      0      0      0      0      0      0      0      0
## 6      0      0      0      0      0      0      0      0      0      0
##   O_TN81 O_TN71 O_TN72 O_TN73 O_TN74 O_TN75 O_PROS_U O_PROS_L O_IMP_U
## 1      0      0      0      0      0      0        0        0       0
## 2      0      0      0      0      0      0        4        2       0
## 3      0      0      0      0      0      0        0        0       0
## 4      0      0      0      0      0      0        0        0       0
## 5      0      0      0      0      0      0        0        1       0
## 6      0      0      0      0      0      0        1        2       1
##   O_IMP_L O_BR_N_U O_BR_N_L O_DENT_U O_DENT_L     O_F Glaucoma
## 1       0        0        0        0        0 ISBLANK        0
## 2       0        0        0        0        0 ISBLANK        0
## 3       0        0        0        0        0 ISBLANK        0
## 4       0        0        0        0        0 ISBLANK        0
## 5       0        0        0        0        0 ISBLANK        0
## 6       1        4        0        0        0 ISBLANK        0

Using the dim command, we can see that the KNHANES data is based on larger population collection many health relation informations.

we already did some preliminary analysis.

Here is the demographic information of the study population: “https://github.com/johnpannyc/DATA-607-Final-Project-/blob/master/demographic%20information%20of%20KNHANES.png

Here is the oral health variable information “https://github.com/johnpannyc/DATA-607-Final-Project-/blob/master/oral%20health%20variables.png