数据的字段如下:
## [1] "姓名" "住院号" "性别" "性别1" "住院天数"
## [6] "肺心" "支气管扩张" "哮喘" "呼吸衰竭" "高血压"
## [11] "冠心病" "诊断" "年龄" "吸烟史1" "职业"
## [16] "知识水平" "身高" "体重" "BMI" "SAA"
## [21] "PCT" "CRP" "WBC" "NE" "PH"
## [26] "PCO2" "PO2" "IL-6" "HCY" "MMSE"
## [31] "MMSE结果" "MoCA" "MoCA结果" "ADL 结果" "GDS结果"
## [36] "GDS" "肺功能" "舒张试验" "肺功能结果" "FVC"
## [41] "FEV1" "FEV1/FVC"
数据的摘要如下:
## 姓名 住院号 性别 性别1
## Length:100 Min. :203574 Min. :0.00 Length:100
## Class :character 1st Qu.:675792 1st Qu.:0.00 Class :character
## Mode :character Median :863193 Median :1.00 Mode :character
## Mean :780172 Mean :0.73
## 3rd Qu.:924278 3rd Qu.:1.00
## Max. :938234 Max. :1.00
##
## 住院天数 肺心 支气管扩张 哮喘 呼吸衰竭
## Min. : 3.000 Min. :0.00 Min. :0.00 Min. :0.00 Min. :0.0
## 1st Qu.: 6.000 1st Qu.:0.00 1st Qu.:0.00 1st Qu.:0.00 1st Qu.:0.0
## Median : 7.000 Median :0.00 Median :0.00 Median :0.00 Median :0.0
## Mean : 7.091 Mean :0.11 Mean :0.19 Mean :0.12 Mean :0.2
## 3rd Qu.: 8.000 3rd Qu.:0.00 3rd Qu.:0.00 3rd Qu.:0.00 3rd Qu.:0.0
## Max. :29.000 Max. :1.00 Max. :1.00 Max. :1.00 Max. :1.0
## NA's :1
## 高血压 冠心病 诊断 年龄
## Min. :0.00 Min. :0.00 Length:100 Min. :47.00
## 1st Qu.:0.00 1st Qu.:0.00 Class :character 1st Qu.:66.75
## Median :0.00 Median :0.00 Mode :character Median :74.00
## Mean :0.22 Mean :0.04 Mean :73.10
## 3rd Qu.:0.00 3rd Qu.:0.00 3rd Qu.:79.00
## Max. :1.00 Max. :1.00 Max. :89.00
##
## 吸烟史1 职业 知识水平 身高
## Length:100 Length:100 Length:100 Min. :1.350
## Class :character Class :character Class :character 1st Qu.:1.518
## Mode :character Mode :character Mode :character Median :1.585
## Mean :1.582
## 3rd Qu.:1.650
## Max. :1.750
##
## 体重 BMI SAA PCT
## Min. :35.00 Min. :14.45 Length:100 Length:100
## 1st Qu.:44.75 1st Qu.:17.96 Class :character Class :character
## Median :50.00 Median :20.48 Mode :character Mode :character
## Mean :51.78 Mean :20.66
## 3rd Qu.:59.00 3rd Qu.:22.81
## Max. :96.00 Max. :34.01
##
## CRP WBC NE PH
## Length:100 Min. : 4.000 Min. :41.50 Min. :7.267
## Class :character 1st Qu.: 6.080 1st Qu.:62.05 1st Qu.:7.395
## Mode :character Median : 7.855 Median :71.90 Median :7.425
## Mean : 8.546 Mean :70.97 Mean :7.420
## 3rd Qu.: 9.928 3rd Qu.:78.70 3rd Qu.:7.453
## Max. :38.140 Max. :98.20 Max. :7.493
##
## PCO2 PO2 IL-6 HCY
## Min. : 3.090 Min. : 6.80 Min. : 0.010 Min. : 6.20
## 1st Qu.: 4.683 1st Qu.:11.78 1st Qu.: 4.805 1st Qu.:11.35
## Median : 5.510 Median :13.15 Median : 11.060 Median :13.09
## Mean : 5.882 Mean :13.99 Mean : 36.696 Mean :13.43
## 3rd Qu.: 6.525 3rd Qu.:15.93 3rd Qu.: 29.672 3rd Qu.:15.85
## Max. :12.800 Max. :25.90 Max. :842.320 Max. :21.39
## NA's :1
## MMSE MMSE结果 MoCA MoCA结果 ADL 结果
## Min. :0.00 Min. : 7.00 Min. :0.00 Min. : 4.00 Min. :18.00
## 1st Qu.:0.00 1st Qu.:21.00 1st Qu.:0.00 1st Qu.:19.00 1st Qu.:20.00
## Median :0.00 Median :27.00 Median :1.00 Median :24.50 Median :20.00
## Mean :0.47 Mean :24.89 Mean :0.53 Mean :22.45 Mean :23.82
## 3rd Qu.:1.00 3rd Qu.:30.00 3rd Qu.:1.00 3rd Qu.:28.00 3rd Qu.:23.00
## Max. :1.00 Max. :30.00 Max. :1.00 Max. :30.00 Max. :56.00
##
## GDS结果 GDS 肺功能 舒张试验
## Min. :0.00 Min. : 0.00 Min. :1.00 Min. :0.00
## 1st Qu.:0.00 1st Qu.: 3.00 1st Qu.:2.00 1st Qu.:0.00
## Median :0.00 Median : 5.00 Median :3.00 Median :2.00
## Mean :0.18 Mean : 6.44 Mean :2.98 Mean :1.22
## 3rd Qu.:0.00 3rd Qu.: 8.00 3rd Qu.:4.00 3rd Qu.:2.00
## Max. :2.00 Max. :30.00 Max. :4.00 Max. :2.00
##
## 肺功能结果 FVC FEV1 FEV1/FVC
## Length:100 Length:100 Min. :17.12 Min. :35.86
## Class :character Class :character 1st Qu.:29.74 1st Qu.:43.66
## Mode :character Mode :character Median :39.64 Median :51.66
## Mean :42.92 Mean :52.70
## 3rd Qu.:54.77 3rd Qu.:60.51
## Max. :86.43 Max. :85.51
##
问题一:结局变量是哪个(些)?
问题二:自变量是哪些?
问题三:研究目的是什么?
回复: 胡主任,早上好,感谢您帮你看数据,关于提出的问题,答案如下:1.结局变量是MMSE、MOCA其中一个阳性。2.自变量是CRP、PCT、SAA、IL6、HCY这些炎症因子。3.目的是探讨这些炎症因子在COPD病人中是否能推测存在认知功能障碍。
结局变量和自变量单独筛选出来
## # A tibble: 6 × 7
## 姓名 住院号 MMSE MMSE结果 MoCA MoCA结果 MMSEoCA
## <chr> <dbl> <dbl> <dbl> <dbl> <dbl> <dbl>
## 1 杨景轩 928966 1 20 1 18 1
## 2 曾柏香 530683 0 28 1 22 1
## 3 林福云 918460 0 29 1 23 1
## 4 张桂芳 282806 1 25 1 20 1
## 5 吴旭进 919761 0 29 0 28 0
## 6 彭成帝 927078 1 21 1 18 1
## # A tibble: 6 × 7
## 姓名 住院号 CRP PCT SAA `IL-6` HCY
## <chr> <dbl> <chr> <chr> <chr> <dbl> <dbl>
## 1 彭秀琼 923480 17.2 0.15 201.01 19.8 16.2
## 2 倪木江 923791 11.7 0.02 <5 10.7 18.7
## 3 许亚桂 856950 19.61 0.04 123.57 15.5 16.3
## 4 潘瑞新 923881 40.270000000000003 0.16 196.74 0.01 13.4
## 5 黄美蓉 458240 <5 <0.02 <5 0.01 12.7
## 6 张亚福 912179 25.14 0.04 96.57 89.1 15.5
将结果值中的”<“符号剔除,<5看作5
## Warning in mask$eval_all_mutate(quo): NAs introduced by coercion
## Warning in mask$eval_all_mutate(quo): NAs introduced by coercion
## # A tibble: 6 × 12
## 姓名 住院号 CRP PCT SAA `IL-6` HCY MMSE MMSE结果 MoCA MoCA结果
## <chr> <dbl> <dbl> <dbl> <dbl> <dbl> <dbl> <dbl> <dbl> <dbl> <dbl>
## 1 彭秀琼 923480 17.2 0.15 201. 19.8 16.2 1 20 1 19
## 2 倪木江 923791 11.7 0.02 5 10.7 18.7 0 30 0 28
## 3 许亚桂 856950 19.6 0.04 124. 15.5 16.3 0 30 0 28
## 4 潘瑞新 923881 40.3 0.16 197. 0.01 13.4 0 30 0 29
## 5 黄美蓉 458240 5 0.02 5 0.01 12.7 0 30 0 29
## 6 张亚福 912179 25.1 0.04 96.6 89.1 15.5 0 30 0 28
## # … with 1 more variable: MMSEoCA <dbl>
## # A tibble: 6 × 12
## 姓名 住院号 CRP PCT SAA `IL-6` HCY MMSE MMSE结果 MoCA MoCA结果
## <chr> <dbl> <dbl> <dbl> <dbl> <dbl> <dbl> <dbl> <dbl> <dbl> <dbl>
## 1 杨景轩 928966 9.58 0.02 5 0.01 8.07 1 20 1 18
## 2 曾柏香 530683 10.7 0.15 13.0 7.16 13.5 0 28 1 22
## 3 林福云 918460 5 0.02 5 0.01 10.4 0 29 1 23
## 4 张桂芳 282806 9.67 0.03 9.21 33.0 14.1 1 25 1 20
## 5 吴旭进 919761 5 0.04 7.23 8.24 15.6 0 29 0 28
## 6 彭成帝 927078 17.8 0.09 15.3 13.4 17.8 1 21 1 18
## # … with 1 more variable: MMSEoCA <dbl>
查看并删除有缺失值的行
## # A tibble: 3 × 12
## 姓名 住院号 CRP PCT SAA `IL-6` HCY MMSE MMSE结果 MoCA MoCA结果
## <chr> <dbl> <dbl> <dbl> <dbl> <dbl> <dbl> <dbl> <dbl> <dbl> <dbl>
## 1 张茂光 859129 141. 0.73 NA 842. 11.7 1 22 1 21
## 2 骆中强 925529 102. 0.09 279. 74.5 NA 0 30 0 28
## 3 邓国珍 580686 NA 0.69 290. 266. 10.3 1 18 1 24
## # … with 1 more variable: MMSEoCA <dbl>
## 姓名 住院号 CRP PCT
## Length:97 Min. :203574 Min. : 5.00 Min. :0.0200
## Class :character 1st Qu.:676317 1st Qu.: 5.00 1st Qu.:0.0200
## Mode :character Median :867257 Median : 11.70 Median :0.0300
## Mean :779916 Mean : 26.04 Mean :0.1263
## 3rd Qu.:924273 3rd Qu.: 25.14 3rd Qu.:0.0600
## Max. :938234 Max. :181.19 Max. :5.7500
## SAA IL-6 HCY MMSE
## Min. : 5.00 Min. : 0.01 Min. : 6.20 Min. :0.0000
## 1st Qu.: 5.00 1st Qu.: 4.76 1st Qu.:11.37 1st Qu.:0.0000
## Median : 13.07 Median : 10.88 Median :13.10 Median :0.0000
## Mean : 52.59 Mean : 25.63 Mean :13.48 Mean :0.4639
## 3rd Qu.: 69.84 3rd Qu.: 29.44 3rd Qu.:15.86 3rd Qu.:1.0000
## Max. :235.83 Max. :330.15 Max. :21.39 Max. :1.0000
## MMSE结果 MoCA MoCA结果 MMSEoCA
## Min. : 7.00 Min. :0.0000 Min. : 4.00 Min. :0.0000
## 1st Qu.:21.00 1st Qu.:0.0000 1st Qu.:19.00 1st Qu.:0.0000
## Median :27.00 Median :1.0000 Median :25.00 Median :1.0000
## Mean :24.94 Mean :0.5258 Mean :22.39 Mean :0.5464
## 3rd Qu.:30.00 3rd Qu.:1.0000 3rd Qu.:28.00 3rd Qu.:1.0000
## Max. :30.00 Max. :1.0000 Max. :30.00 Max. :1.0000
逻辑回归, MMSEoCA
##
## predicted_labels 0 1
## 0 23 14
## 1 21 39
##
## Call:
## glm(formula = MMSEoCA ~ CRP + PCT + SAA + `IL-6` + HCY, family = binomial,
## data = d)
##
## Deviance Residuals:
## Min 1Q Median 3Q Max
## -1.8741 -1.1612 0.5668 1.1546 1.5090
##
## Coefficients:
## Estimate Std. Error z value Pr(>|z|)
## (Intercept) -0.069698 0.901144 -0.077 0.938
## CRP 0.004449 0.009494 0.469 0.639
## PCT 4.134405 3.505425 1.179 0.238
## SAA -0.005294 0.004464 -1.186 0.236
## `IL-6` 0.009867 0.008845 1.115 0.265
## HCY -0.006382 0.064531 -0.099 0.921
##
## (Dispersion parameter for binomial family taken to be 1)
##
## Null deviance: 133.63 on 96 degrees of freedom
## Residual deviance: 126.35 on 91 degrees of freedom
## AIC: 138.35
##
## Number of Fisher Scoring iterations: 7
逻辑回归,MMSE
##
## predicted_labels 0 1
## 0 47 34
## 1 5 11
##
## Call:
## glm(formula = MMSE ~ CRP + PCT + SAA + `IL-6` + HCY, family = binomial,
## data = d)
##
## Deviance Residuals:
## Min 1Q Median 3Q Max
## -1.5769 -1.0781 -0.9746 1.2708 1.4016
##
## Coefficients:
## Estimate Std. Error z value Pr(>|z|)
## (Intercept) 0.012088 0.863837 0.014 0.989
## CRP 0.007205 0.008776 0.821 0.412
## PCT 2.651574 3.061433 0.866 0.386
## SAA -0.002245 0.004162 -0.539 0.590
## `IL-6` -0.002455 0.006046 -0.406 0.685
## HCY -0.027139 0.063031 -0.431 0.667
##
## (Dispersion parameter for binomial family taken to be 1)
##
## Null deviance: 133.96 on 96 degrees of freedom
## Residual deviance: 129.93 on 91 degrees of freedom
## AIC: 141.93
##
## Number of Fisher Scoring iterations: 6
逻辑回归,MoCA
##
## predicted_labels 0 1
## 0 25 24
## 1 21 27
##
## Call:
## glm(formula = MoCA ~ CRP + PCT + SAA + `IL-6` + HCY, family = binomial,
## data = d)
##
## Deviance Residuals:
## Min 1Q Median 3Q Max
## -1.8436 -1.1589 0.4052 1.1728 1.5558
##
## Coefficients:
## Estimate Std. Error z value Pr(>|z|)
## (Intercept) -0.189455 0.890265 -0.213 0.831
## CRP 0.004204 0.009311 0.451 0.652
## PCT 5.217923 3.593370 1.452 0.146
## SAA -0.005636 0.004475 -1.259 0.208
## `IL-6` 0.004711 0.007922 0.595 0.552
## HCY 0.001084 0.064063 0.017 0.987
##
## (Dispersion parameter for binomial family taken to be 1)
##
## Null deviance: 134.21 on 96 degrees of freedom
## Residual deviance: 127.26 on 91 degrees of freedom
## AIC: 139.26
##
## Number of Fisher Scoring iterations: 7
检验自变量在不同水平是否存在差异, CRP
##
## Welch Two Sample t-test
##
## data: group1 and group2
## t = -1.0657, df = 93.344, p-value = 0.2893
## alternative hypothesis: true difference in means is not equal to 0
## 95 percent confidence interval:
## -22.430879 6.763298
## sample estimates:
## mean of x mean of y
## 21.75545 29.58925
检验自变量在不同水平是否存在差异, PCT
##
## Welch Two Sample t-test
##
## data: group1 and group2
## t = -1.2934, df = 53.07, p-value = 0.2015
## alternative hypothesis: true difference in means is not equal to 0
## 95 percent confidence interval:
## -0.35825942 0.07735033
## sample estimates:
## mean of x mean of y
## 0.04954545 0.19000000
检验自变量在不同水平是否存在差异, SAA
##
## Welch Two Sample t-test
##
## data: group1 and group2
## t = -0.2035, df = 91.338, p-value = 0.8392
## alternative hypothesis: true difference in means is not equal to 0
## 95 percent confidence interval:
## -32.52391 26.47900
## sample estimates:
## mean of x mean of y
## 50.94000 53.96245
检验自变量在不同水平是否存在差异, HCY
##
## Welch Two Sample t-test
##
## data: group1 and group2
## t = -0.077724, df = 91.803, p-value = 0.9382
## alternative hypothesis: true difference in means is not equal to 0
## 95 percent confidence interval:
## -1.416042 1.309387
## sample estimates:
## mean of x mean of y
## 13.44818 13.50151
检验自变量在不同水平是否存在差异, IL-6
##
## Welch Two Sample t-test
##
## data: group1 and group2
## t = -1.7796, df = 74.451, p-value = 0.07923
## alternative hypothesis: true difference in means is not equal to 0
## 95 percent confidence interval:
## -30.28945 1.70851
## sample estimates:
## mean of x mean of y
## 17.82500 32.11547