A little guidance for the following…
Dataset: Midlife in the United States (MIDUS). There are three waves, denoted MIDUS1, MIDUS2, and MIDUS3.
Variables: DV: Use of completementary and alternative medicine (CAM). There are dummies for each of 15 CAM modalities indicating any use in the past year, and a summed index that represents the number of CAMs used in the past year. TBD: Latent classes (LCA) and factors. I spent about a week on LCA and have some output we can look over, but it became clear I would need more time to come up with a class structure I was confident in.
IVs: Income, education, gender, and race.
This document is organized into three sections: single variable distributions (CAM dummies, CAM summed index, income, education, gender, and race), bivariate distributions (CAM summed index by each IV), and missing case patterns.
## MIDUS1_recode[camslist]
##
## 15 Variables 7108 Observations
## --------------------------------------------------------------------------------
## aCamEverAcupuncture
## n missing distinct
## 6279 829 2
##
## Value No Yes
## Frequency 6198 81
## Proportion 0.987 0.013
## --------------------------------------------------------------------------------
## aCamEverBiofeedback
## n missing distinct
## 6272 836 2
##
## Value No Yes
## Frequency 6224 48
## Proportion 0.992 0.008
## --------------------------------------------------------------------------------
## aCamEverChiropractor
## n missing distinct
## 6282 826 2
##
## Value No Yes
## Frequency 5523 759
## Proportion 0.88 0.12
## --------------------------------------------------------------------------------
## aCamEverEnergy
## n missing distinct
## 6271 837 2
##
## Value No Yes
## Frequency 6180 91
## Proportion 0.985 0.015
## --------------------------------------------------------------------------------
## aCamEverMoveTherapy
## n missing distinct
## 6275 833 2
##
## Value No Yes
## Frequency 5162 1113
## Proportion 0.82 0.18
## --------------------------------------------------------------------------------
## aCamEverHerbTherapy
## n missing distinct
## 6269 839 2
##
## Value No Yes
## Frequency 5951 318
## Proportion 0.949 0.051
## --------------------------------------------------------------------------------
## aCamEverMegaVitamins
## n missing distinct
## 6269 839 2
##
## Value No Yes
## Frequency 5976 293
## Proportion 0.953 0.047
## --------------------------------------------------------------------------------
## aCamEverHomeopathy
## n missing distinct
## 6250 858 2
##
## Value No Yes
## Frequency 6105 145
## Proportion 0.977 0.023
## --------------------------------------------------------------------------------
## aCamEverHypnosis
## n missing distinct
## 6276 832 2
##
## Value No Yes
## Frequency 6198 78
## Proportion 0.988 0.012
## --------------------------------------------------------------------------------
## aCamEverImageTech
## n missing distinct
## 6270 838 2
##
## Value No Yes
## Frequency 6075 195
## Proportion 0.969 0.031
## --------------------------------------------------------------------------------
## aCamEverMassage
## n missing distinct
## 6263 845 2
##
## Value No Yes
## Frequency 5725 538
## Proportion 0.914 0.086
## --------------------------------------------------------------------------------
## aCamEverPray
## n missing distinct
## 6263 845 2
##
## Value No Yes
## Frequency 4360 1903
## Proportion 0.7 0.3
## --------------------------------------------------------------------------------
## aCamEverMeditate
## n missing distinct
## 6273 835 2
##
## Value No Yes
## Frequency 5430 843
## Proportion 0.87 0.13
## --------------------------------------------------------------------------------
## aCamEverSpecialDiet
## n missing distinct
## 6279 829 2
##
## Value No Yes
## Frequency 5591 688
## Proportion 0.89 0.11
## --------------------------------------------------------------------------------
## aCamEverSpiritHeal
## n missing distinct
## 6266 842 2
##
## Value No Yes
## Frequency 6060 206
## Proportion 0.967 0.033
## --------------------------------------------------------------------------------
## MIDUS2_recode[camslist]
##
## 15 Variables 7354 Observations
## --------------------------------------------------------------------------------
## bCamEverAcupuncture
## n missing distinct
## 3980 3374 2
##
## Value No Yes
## Frequency 3898 82
## Proportion 0.979 0.021
## --------------------------------------------------------------------------------
## bCamEverBiofeedback
## n missing distinct
## 3969 3385 2
##
## Value No Yes
## Frequency 3932 37
## Proportion 0.991 0.009
## --------------------------------------------------------------------------------
## bCamEverChiropractor
## n missing distinct
## 3981 3373 2
##
## Value No Yes
## Frequency 3302 679
## Proportion 0.83 0.17
## --------------------------------------------------------------------------------
## bCamEverEnergy
## n missing distinct
## 3956 3398 2
##
## Value No Yes
## Frequency 3860 96
## Proportion 0.976 0.024
## --------------------------------------------------------------------------------
## bCamEverMoveTherapy
## n missing distinct
## 3973 3381 2
##
## Value No Yes
## Frequency 3369 604
## Proportion 0.85 0.15
## --------------------------------------------------------------------------------
## bCamEverHerbTherapy
## n missing distinct
## 3977 3377 2
##
## Value No Yes
## Frequency 3582 395
## Proportion 0.901 0.099
## --------------------------------------------------------------------------------
## bCamEverMegaVitamins
## n missing distinct
## 3974 3380 2
##
## Value No Yes
## Frequency 3522 452
## Proportion 0.89 0.11
## --------------------------------------------------------------------------------
## bCamEverHomeopathy
## n missing distinct
## 3968 3386 2
##
## Value No Yes
## Frequency 3801 167
## Proportion 0.958 0.042
## --------------------------------------------------------------------------------
## bCamEverHypnosis
## n missing distinct
## 3970 3384 2
##
## Value No Yes
## Frequency 3929 41
## Proportion 0.99 0.01
## --------------------------------------------------------------------------------
## bCamEverImageTech
## n missing distinct
## 3956 3398 2
##
## Value No Yes
## Frequency 3799 157
## Proportion 0.96 0.04
## --------------------------------------------------------------------------------
## bCamEverMassage
## n missing distinct
## 3950 3404 2
##
## Value No Yes
## Frequency 3268 682
## Proportion 0.83 0.17
## --------------------------------------------------------------------------------
## bCamEverPray
## n missing distinct
## 3977 3377 2
##
## Value No Yes
## Frequency 2200 1777
## Proportion 0.55 0.45
## --------------------------------------------------------------------------------
## bCamEverMeditate
## n missing distinct
## 3961 3393 2
##
## Value No Yes
## Frequency 2999 962
## Proportion 0.76 0.24
## --------------------------------------------------------------------------------
## bCamEverSpecialDiet
## n missing distinct
## 3969 3385 2
##
## Value No Yes
## Frequency 3801 168
## Proportion 0.958 0.042
## --------------------------------------------------------------------------------
## bCamEverSpiritHeal
## n missing distinct
## 3972 3382 2
##
## Value No Yes
## Frequency 3765 207
## Proportion 0.948 0.052
## --------------------------------------------------------------------------------
## MIDUS3_recode[camslist]
##
## 15 Variables 7108 Observations
## --------------------------------------------------------------------------------
## cCamEverAcupuncture
## n missing distinct
## 2867 4241 2
##
## Value No Yes
## Frequency 2763 104
## Proportion 0.964 0.036
## --------------------------------------------------------------------------------
## cCamEverBiofeedback
## n missing distinct
## 2857 4251 2
##
## Value No Yes
## Frequency 2823 34
## Proportion 0.988 0.012
## --------------------------------------------------------------------------------
## cCamEverChiropractor
## n missing distinct
## 2874 4234 2
##
## Value No Yes
## Frequency 2367 507
## Proportion 0.82 0.18
## --------------------------------------------------------------------------------
## cCamEverEnergy
## n missing distinct
## 2840 4268 2
##
## Value No Yes
## Frequency 2765 75
## Proportion 0.974 0.026
## --------------------------------------------------------------------------------
## cCamEverMoveTherapy
## n missing distinct
## 2865 4243 2
##
## Value No Yes
## Frequency 2317 548
## Proportion 0.81 0.19
## --------------------------------------------------------------------------------
## cCamEverHerbTherapy
## n missing distinct
## 2858 4250 2
##
## Value No Yes
## Frequency 2643 215
## Proportion 0.925 0.075
## --------------------------------------------------------------------------------
## cCamEverMegaVitamins
## n missing distinct
## 2863 4245 2
##
## Value No Yes
## Frequency 2578 285
## Proportion 0.9 0.1
## --------------------------------------------------------------------------------
## cCamEverHomeopathy
## n missing distinct
## 2860 4248 2
##
## Value No Yes
## Frequency 2727 133
## Proportion 0.953 0.047
## --------------------------------------------------------------------------------
## cCamEverHypnosis
## n missing distinct
## 2861 4247 2
##
## Value No Yes
## Frequency 2830 31
## Proportion 0.989 0.011
## --------------------------------------------------------------------------------
## cCamEverImageTech
## n missing distinct
## 2849 4259 2
##
## Value No Yes
## Frequency 2735 114
## Proportion 0.96 0.04
## --------------------------------------------------------------------------------
## cCamEverMassage
## n missing distinct
## 2817 4291 2
##
## Value No Yes
## Frequency 2274 543
## Proportion 0.81 0.19
## --------------------------------------------------------------------------------
## cCamEverPray
## n missing distinct
## 2855 4253 2
##
## Value No Yes
## Frequency 1658 1197
## Proportion 0.58 0.42
## --------------------------------------------------------------------------------
## cCamEverMeditate
## n missing distinct
## 2834 4274 2
##
## Value No Yes
## Frequency 2143 691
## Proportion 0.76 0.24
## --------------------------------------------------------------------------------
## cCamEverSpecialDiet
## n missing distinct
## 2860 4248 2
##
## Value No Yes
## Frequency 2685 175
## Proportion 0.939 0.061
## --------------------------------------------------------------------------------
## cCamEverSpiritHeal
## n missing distinct
## 2858 4250 2
##
## Value No Yes
## Frequency 2710 148
## Proportion 0.948 0.052
## --------------------------------------------------------------------------------
## Min. 1st Qu. Median Mean 3rd Qu. Max. NA's
## 2 16 17 17 18 32 814
## median mean SE.mean CI.mean.0.95 var std.dev
## 17.000 17.104 0.023 0.044 3.206 1.791
## coef.var
## 0.105
## Warning: Removed 814 rows containing non-finite values (stat_density).
## Min. 1st Qu. Median Mean 3rd Qu. Max. NA's
## 1 16 17 17 18 31 3342
## median mean SE.mean CI.mean.0.95 var std.dev
## 17.000 17.432 0.038 0.074 5.689 2.385
## coef.var
## 0.137
## Warning: Removed 3342 rows containing non-finite values (stat_density).
## MIDUS3_recode$cCamSum
## n missing distinct Info Mean Gmd .05 .10
## 2899 4209 27 0.927 17.42 2.329 16 16
## .25 .50 .75 .90 .95
## 16 17 18 20 22
##
## lowest : 1 2 4 6 8, highest: 28 29 30 31 32
## median mean SE.mean CI.mean.0.95 var std.dev
## 17.000 17.419 0.048 0.094 6.733 2.595
## coef.var
## 0.149
## Warning: Removed 4209 rows containing non-finite values (stat_density).
## Min. 1st Qu. Median Mean 3rd Qu. Max. NA's
## 0 29000 55000 73380 95000 300000 998
## median mean SE.mean CI.mean.0.95 var
## 55000.00 73379.62 830.03 1627.15 4209495124.10
## std.dev coef.var
## 64880.62 0.88
## Warning: Removed 998 rows containing non-finite values (stat_density).
## Min. 1st Qu. Median Mean 3rd Qu. Max. NA's
## 0 29000 57500 71364 95443 300000 3500
## x
## median 57500.00
## mean 71363.52
## SE.mean 973.95
## CI.mean.0.95 1909.51
## var 3655822857.30
## std.dev 60463.40
## coef.var 0.85
## Warning: Removed 3500 rows containing non-finite values (stat_density).
## Min. 1st Qu. Median Mean 3rd Qu. Max. NA's
## 0 34000 68000 87920 121250 300000 4396
## x
## median 68000.00
## mean 87919.71
## SE.mean 1421.75
## CI.mean.0.95 2787.82
## var 5481942578.04
## std.dev 74040.14
## coef.var 0.84
## Warning: Removed 4396 rows containing non-finite values (stat_density).
##
## Less than High School
## 681
## High School or GED
## 2060
## Some college/Associates/2-year college or vocational school
## 2173
## Bachelors
## 1437
## Graduate or Professional Degree
## 744
## <NA>
## 13
##
## Less than High School
## 309
## High School or GED
## 1327
## Some college/Associates/2-year college or vocational school
## 1484
## Bachelors
## 1109
## Graduate or Professional Degree
## 727
## <NA>
## 2398
##
## Less than High School
## 170
## High School or GED
## 785
## Some college/Associates/2-year college or vocational school
## 953
## Bachelors
## 794
## Graduate or Professional Degree
## 581
## <NA>
## 3825
## MIDUS1_recode$aGender
## n missing distinct
## 7106 2 2
##
## Value (1) Male (2) Female
## Frequency 3440 3666
## Proportion 0.48 0.52
## MIDUS2_recode$bGender
## n missing distinct
## 4963 2391 2
##
## Value (1) MALE (2) FEMALE
## Frequency 2316 2647
## Proportion 0.47 0.53
## MIDUS3_recode$cGender
## n missing distinct
## 3294 3814 2
##
## Value (1) MALE (2) FEMALE
## Frequency 1484 1810
## Proportion 0.45 0.55
##
## White Black Other <NA>
## 5600 321 255 932
##
## Non-Hispanic White Non-Hispanic Black Hispanic Other
## 4392 225 151 164
## <NA>
## 2422
##
## Non-Hispanic White Non-Hispanic Black Hispanic Other
## 2853 121 98 191
## <NA>
## 3845
# Bivariate Distributions
## Warning: Removed 1022 rows containing missing values (geom_point).
##
## Pearson's product-moment correlation
##
## data: x and y
## t = -0.9, df = 6084, p-value = 0.4
## alternative hypothesis: true correlation is not equal to 0
## 95 percent confidence interval:
## -0.036 0.014
## sample estimates:
## cor
## -0.011
## Warning: Removed 3523 rows containing missing values (geom_point).
##
## Pearson's product-moment correlation
##
## data: x and y
## t = 1, df = 3829, p-value = 0.2
## alternative hypothesis: true correlation is not equal to 0
## 95 percent confidence interval:
## -0.010 0.053
## sample estimates:
## cor
## 0.022
## Warning: Removed 4416 rows containing missing values (geom_point).
##
## Pearson's product-moment correlation
##
## data: x and y
## t = 4, df = 2690, p-value = 0.00004
## alternative hypothesis: true correlation is not equal to 0
## 95 percent confidence interval:
## 0.041 0.116
## sample estimates:
## cor
## 0.079
## Warning: Removed 814 rows containing non-finite values (stat_bin).
## Warning: Removed 814 rows containing non-finite values (stat_bin).
## Warning: Removed 3342 rows containing non-finite values (stat_bin).
## Warning: Removed 3342 rows containing non-finite values (stat_bin).
## Warning: Removed 4209 rows containing non-finite values (stat_bin).
## Warning: Removed 4209 rows containing non-finite values (stat_bin).
##
## (1) Male (2) Female
## 2 1 2
## 3 3 0
## 4 1 2
## 8 0 1
## 10 0 1
## 11 1 0
## 12 1 3
## 13 2 4
## 14 4 10
## 15 23 20
## 16 1645 1375
## 17 641 808
## 18 314 457
## 19 180 282
## 20 78 134
## 21 45 73
## 22 18 60
## 23 14 26
## 24 4 16
## 25 5 12
## 26 2 11
## 27 4 3
## 28 0 5
## 30 1 1
## 32 0 1
## Warning: Removed 814 rows containing non-finite values (stat_bin).
## Warning: Removed 814 rows containing non-finite values (stat_bin).
##
## (1) MALE (2) FEMALE
## 1 1 0
## 2 2 7
## 3 1 0
## 4 5 5
## 5 0 1
## 6 1 5
## 7 0 1
## 8 1 1
## 9 0 1
## 10 1 2
## 11 0 1
## 13 2 1
## 14 1 2
## 15 25 26
## 16 839 729
## 17 416 459
## 18 216 348
## 19 137 208
## 20 60 159
## 21 34 102
## 22 20 61
## 23 10 31
## 24 13 24
## 25 4 13
## 26 3 15
## 27 2 6
## 28 2 2
## 29 1 3
## 30 0 1
## 31 1 0
## Warning: Removed 3342 rows containing non-finite values (stat_bin).
## Warning: Removed 3342 rows containing non-finite values (stat_bin).
##
## (1) MALE (2) FEMALE
## 1 0 2
## 2 4 10
## 4 2 6
## 6 0 3
## 8 0 2
## 10 1 1
## 12 1 0
## 13 0 2
## 14 5 5
## 15 13 20
## 16 607 527
## 17 277 339
## 18 165 262
## 19 94 145
## 20 61 101
## 21 18 56
## 22 13 38
## 23 10 27
## 24 12 18
## 25 5 16
## 26 1 8
## 27 1 9
## 28 1 4
## 29 2 1
## 30 1 1
## 31 0 1
## 32 1 0
## Warning: Removed 4209 rows containing non-finite values (stat_bin).
## Warning: Removed 4209 rows containing non-finite values (stat_bin).
##
## White Black Other
## 2 1 2 0
## 3 2 0 0
## 4 1 2 0
## 8 1 0 0
## 10 0 0 1
## 11 0 0 1
## 12 1 1 1
## 13 6 0 0
## 14 13 0 1
## 15 34 6 1
## 16 2669 169 115
## 17 1294 65 53
## 18 706 28 27
## 19 415 22 18
## 20 181 10 11
## 21 102 6 7
## 22 68 4 5
## 23 31 3 6
## 24 18 0 2
## 25 14 0 2
## 26 11 0 2
## 27 7 0 0
## 28 3 1 1
## 30 2 0 0
## 32 1 0 0
## Warning: Removed 814 rows containing non-finite values (stat_bin).
## Warning: Removed 814 rows containing non-finite values (stat_bin).
##
## Non-Hispanic White Non-Hispanic Black Hispanic Other
## 1 1 0 0 0
## 2 8 1 0 0
## 3 1 0 0 0
## 4 9 0 0 1
## 5 1 0 0 0
## 6 4 0 1 1
## 7 1 0 0 0
## 8 2 0 0 0
## 9 0 0 0 1
## 10 2 1 0 0
## 11 0 1 0 0
## 13 1 1 0 1
## 14 2 1 0 0
## 15 46 1 2 1
## 16 1400 57 47 53
## 17 805 28 15 23
## 18 516 21 12 14
## 19 308 10 11 11
## 20 199 5 6 6
## 21 119 5 5 7
## 22 71 3 5 2
## 23 41 0 0 0
## 24 27 6 4 0
## 25 15 1 1 0
## 26 16 1 1 0
## 27 7 0 0 1
## 28 4 0 0 0
## 29 4 0 0 0
## 30 1 0 0 0
## 31 0 0 1 0
## Warning: Removed 3342 rows containing non-finite values (stat_bin).
## Warning: Removed 3342 rows containing non-finite values (stat_bin).
##
## Non-Hispanic White Non-Hispanic Black Hispanic Other
## 1 2 0 0 0
## 2 14 0 0 0
## 4 7 1 0 0
## 6 1 1 1 0
## 8 2 0 0 0
## 10 2 0 0 0
## 12 1 0 0 0
## 13 2 0 0 0
## 14 9 0 1 0
## 15 28 1 1 3
## 16 977 47 29 74
## 17 541 16 13 40
## 18 373 10 16 23
## 19 213 7 10 9
## 20 139 7 2 11
## 21 63 2 4 3
## 22 43 3 1 3
## 23 35 1 1 0
## 24 25 2 0 2
## 25 19 0 0 2
## 26 8 0 0 1
## 27 10 0 0 0
## 28 3 2 0 0
## 29 3 0 0 0
## 30 2 0 0 0
## 31 1 0 0 0
## 32 1 0 0 0
## Warning: Removed 4209 rows containing non-finite values (stat_bin).
## Warning: Removed 4209 rows containing non-finite values (stat_bin).
## # A tibble: 21 x 3
## variable n_miss pct_miss
## <chr> <int> <dbl>
## 1 aHHIncome 998 14.0
## 2 aCamEverOtherHeal 957 13.5
## 3 aRace 932 13.1
## 4 aCamEverHomeopathy 858 12.1
## 5 aCamEverMassage 845 11.9
## 6 aCamEverPray 845 11.9
## 7 aCamEverSpiritHeal 842 11.8
## 8 aCamEverHerbTherapy 839 11.8
## 9 aCamEverMegaVitamins 839 11.8
## 10 aCamEverImageTech 838 11.8
## 11 aCamEverEnergy 837 11.8
## 12 aCamEverBiofeedback 836 11.8
## 13 aCamEverMeditate 835 11.7
## 14 aCamEverMoveTherapy 833 11.7
## 15 aCamEverHypnosis 832 11.7
## 16 aCamEverAcupuncture 829 11.7
## 17 aCamEverSpecialDiet 829 11.7
## 18 aCamEverChiropractor 826 11.6
## 19 aCamSum 814 11.5
## 20 aEducation 13 0.183
## 21 aGender 2 0.0281
## # A tibble: 21 x 3
## variable n_miss pct_miss
## <chr> <int> <dbl>
## 1 bCamEverOtherHeal 3558 48.4
## 2 bHHIncome 3500 47.6
## 3 bCamEverMassage 3404 46.3
## 4 bCamEverEnergy 3398 46.2
## 5 bCamEverImageTech 3398 46.2
## 6 bCamEverMeditate 3393 46.1
## 7 bCamEverHomeopathy 3386 46.0
## 8 bCamEverBiofeedback 3385 46.0
## 9 bCamEverSpecialDiet 3385 46.0
## 10 bCamEverHypnosis 3384 46.0
## 11 bCamEverSpiritHeal 3382 46.0
## 12 bCamEverMoveTherapy 3381 46.0
## 13 bCamEverMegaVitamins 3380 46.0
## 14 bCamEverHerbTherapy 3377 45.9
## 15 bCamEverPray 3377 45.9
## 16 bCamEverAcupuncture 3374 45.9
## 17 bCamEverChiropractor 3373 45.9
## 18 bCamSum 3342 45.4
## 19 bRace 2422 32.9
## 20 bEducation 2398 32.6
## 21 bGender 2391 32.5
## # A tibble: 21 x 3
## variable n_miss pct_miss
## <chr> <int> <dbl>
## 1 cHHIncome 4396 61.8
## 2 cCamEverOtherHeal 4310 60.6
## 3 cCamEverMassage 4291 60.4
## 4 cCamEverMeditate 4274 60.1
## 5 cCamEverEnergy 4268 60.0
## 6 cCamEverImageTech 4259 59.9
## 7 cCamEverPray 4253 59.8
## 8 cCamEverBiofeedback 4251 59.8
## 9 cCamEverHerbTherapy 4250 59.8
## 10 cCamEverSpiritHeal 4250 59.8
## 11 cCamEverHomeopathy 4248 59.8
## 12 cCamEverSpecialDiet 4248 59.8
## 13 cCamEverHypnosis 4247 59.7
## 14 cCamEverMegaVitamins 4245 59.7
## 15 cCamEverMoveTherapy 4243 59.7
## 16 cCamEverAcupuncture 4241 59.7
## 17 cCamEverChiropractor 4234 59.6
## 18 cCamSum 4209 59.2
## 19 cRace 3845 54.1
## 20 cEducation 3825 53.8
## 21 cGender 3814 53.7