ReadSASFiles.R

Author

Kementari Whitcher

Step 1 - Install the package “haven”.

library(haven)
library(tidyverse)
── Attaching core tidyverse packages ──────────────────────── tidyverse 2.0.0 ──
✔ dplyr     1.1.4     ✔ readr     2.1.5
✔ forcats   1.0.0     ✔ stringr   1.5.1
✔ ggplot2   3.5.1     ✔ tibble    3.2.1
✔ lubridate 1.9.3     ✔ tidyr     1.3.1
✔ purrr     1.0.2     
── Conflicts ────────────────────────────────────────── tidyverse_conflicts() ──
✖ dplyr::filter() masks stats::filter()
✖ dplyr::lag()    masks stats::lag()
ℹ Use the conflicted package (<http://conflicted.r-lib.org/>) to force all conflicts to become errors
library(dplyr)
acqL<-read_xpt("ACQ_L.xpt")
head(acqL)
# A tibble: 6 × 5
    SEQN ACD010A ACD010B ACD010C ACD040
   <dbl>   <dbl>   <dbl>   <dbl>  <dbl>
1 130378       1      NA       9     NA
2 130379       1      NA      NA     NA
3 130380      NA      NA      NA      2
4 130381       1      NA      NA     NA
5 130383      NA      NA      NA      3
6 130384      NA      NA      NA      1
summary(acqL)
      SEQN           ACD010A          ACD010B         ACD010C     
 Min.   :130378   Min.   : 1.000   Min.   :8       Min.   :9      
 1st Qu.:133362   1st Qu.: 1.000   1st Qu.:8       1st Qu.:9      
 Median :136362   Median : 1.000   Median :8       Median :9      
 Mean   :136351   Mean   : 1.023   Mean   :8       Mean   :9      
 3rd Qu.:139327   3rd Qu.: 1.000   3rd Qu.:8       3rd Qu.:9      
 Max.   :142310   Max.   :99.000   Max.   :8       Max.   :9      
                  NA's   :2773     NA's   :11346   NA's   :10683  
     ACD040     
 Min.   :1.000  
 1st Qu.:2.000  
 Median :3.000  
 Mean   :3.143  
 3rd Qu.:5.000  
 Max.   :9.000  
 NA's   :9047   

Can we read one from the website?

demoL<-read_xpt("https://wwwn.cdc.gov/Nchs/Data/Nhanes/Public/2021/DataFiles/DEMO_L.xpt")
head(demoL)
# A tibble: 6 × 27
    SEQN SDDSRVYR RIDSTATR RIAGENDR RIDAGEYR RIDAGEMN RIDRETH1 RIDRETH3 RIDEXMON
   <dbl>    <dbl>    <dbl>    <dbl>    <dbl>    <dbl>    <dbl>    <dbl>    <dbl>
1 130378       12        2        1       43       NA        5        6        2
2 130379       12        2        1       66       NA        3        3        2
3 130380       12        2        2       44       NA        2        2        1
4 130381       12        2        2        5       NA        5        7        1
5 130382       12        2        1        2       NA        3        3        2
6 130383       12        1        2        3       NA        2        2       NA
# ℹ 18 more variables: RIDEXAGM <dbl>, DMQMILIZ <dbl>, DMDBORN4 <dbl>,
#   DMDYRUSR <dbl>, DMDEDUC2 <dbl>, DMDMARTZ <dbl>, RIDEXPRG <dbl>,
#   DMDHHSIZ <dbl>, DMDHRGND <dbl>, DMDHRAGZ <dbl>, DMDHREDZ <dbl>,
#   DMDHRMAZ <dbl>, DMDHSEDZ <dbl>, WTINT2YR <dbl>, WTMEC2YR <dbl>,
#   SDMVSTRA <dbl>, SDMVPSU <dbl>, INDFMPIR <dbl>
summary(demoL)
      SEQN           SDDSRVYR     RIDSTATR        RIAGENDR        RIDAGEYR    
 Min.   :130378   Min.   :12   Min.   :1.000   Min.   :1.000   Min.   : 0.00  
 1st Qu.:133361   1st Qu.:12   1st Qu.:1.000   1st Qu.:1.000   1st Qu.:13.00  
 Median :136344   Median :12   Median :2.000   Median :2.000   Median :37.00  
 Mean   :136344   Mean   :12   Mean   :1.742   Mean   :1.533   Mean   :38.32  
 3rd Qu.:139327   3rd Qu.:12   3rd Qu.:2.000   3rd Qu.:2.000   3rd Qu.:62.00  
 Max.   :142310   Max.   :12   Max.   :2.000   Max.   :2.000   Max.   :80.00  
                                                                              
    RIDAGEMN        RIDRETH1        RIDRETH3        RIDEXMON       RIDEXAGM    
 Min.   : 0.00   Min.   :1.000   Min.   :1.000   Min.   :1.00   Min.   :  0.0  
 1st Qu.: 6.00   1st Qu.:3.000   1st Qu.:3.000   1st Qu.:1.00   1st Qu.: 66.0  
 Median :11.00   Median :3.000   Median :3.000   Median :2.00   Median :122.0  
 Mean   :11.63   Mean   :3.105   Mean   :3.321   Mean   :1.52   Mean   :121.9  
 3rd Qu.:17.00   3rd Qu.:4.000   3rd Qu.:4.000   3rd Qu.:2.00   3rd Qu.:179.5  
 Max.   :24.00   Max.   :5.000   Max.   :7.000   Max.   :2.00   Max.   :239.0  
 NA's   :11556                                   NA's   :3073   NA's   :9146   
    DMQMILIZ        DMDBORN4        DMDYRUSR         DMDEDUC2    
 Min.   :1.000   Min.   :1.000   Min.   : 1.000   Min.   :1.000  
 1st Qu.:2.000   1st Qu.:1.000   1st Qu.: 3.000   1st Qu.:3.000  
 Median :2.000   Median :1.000   Median : 6.000   Median :4.000  
 Mean   :1.917   Mean   :1.157   Mean   : 7.335   Mean   :3.805  
 3rd Qu.:2.000   3rd Qu.:1.000   3rd Qu.: 6.000   3rd Qu.:5.000  
 Max.   :7.000   Max.   :2.000   Max.   :99.000   Max.   :9.000  
 NA's   :3632    NA's   :19      NA's   :10058    NA's   :4139   
    DMDMARTZ         RIDEXPRG        DMDHHSIZ        DMDHRGND    
 Min.   : 1.000   Min.   :1.000   Min.   :1.000   Min.   :1.000  
 1st Qu.: 1.000   1st Qu.:2.000   1st Qu.:2.000   1st Qu.:1.000  
 Median : 1.000   Median :2.000   Median :3.000   Median :2.000  
 Mean   : 1.778   Mean   :2.237   Mean   :3.243   Mean   :1.564  
 3rd Qu.: 2.000   3rd Qu.:3.000   3rd Qu.:4.000   3rd Qu.:2.000  
 Max.   :99.000   Max.   :3.000   Max.   :7.000   Max.   :2.000  
 NA's   :4141     NA's   :10430                   NA's   :7818   
    DMDHRAGZ       DMDHREDZ        DMDHRMAZ        DMDHSEDZ    
 Min.   :1.00   Min.   :1.000   Min.   :1.000   Min.   :1.000  
 1st Qu.:2.00   1st Qu.:2.000   1st Qu.:1.000   1st Qu.:2.000  
 Median :2.00   Median :2.000   Median :1.000   Median :2.000  
 Mean   :2.54   Mean   :2.171   Mean   :1.381   Mean   :2.275  
 3rd Qu.:3.00   3rd Qu.:3.000   3rd Qu.:2.000   3rd Qu.:3.000  
 Max.   :4.00   Max.   :3.000   Max.   :3.000   Max.   :3.000  
 NA's   :7809   NA's   :8187    NA's   :7913    NA's   :9806   
    WTINT2YR         WTMEC2YR         SDMVSTRA        SDMVPSU     
 Min.   :  4584   Min.   :     0   Min.   :173.0   Min.   :1.000  
 1st Qu.: 14332   1st Qu.:     0   1st Qu.:176.0   1st Qu.:1.000  
 Median : 21670   Median : 21718   Median :180.0   Median :1.000  
 Mean   : 27404   Mean   : 27404   Mean   :179.9   Mean   :1.492  
 3rd Qu.: 33831   3rd Qu.: 38341   3rd Qu.:184.0   3rd Qu.:2.000  
 Max.   :170968   Max.   :227108   Max.   :187.0   Max.   :2.000  
                                                                  
    INDFMPIR    
 Min.   :0.000  
 1st Qu.:1.180  
 Median :2.500  
 Mean   :2.708  
 3rd Qu.:4.500  
 Max.   :5.000  
 NA's   :2041