Rows: 8000 Columns: 10
── Column specification ────────────────────────────────────────────────────────
Delimiter: ","
dbl (10): sex, indigeneity, high_school, partnered, remoteness, language, sm...
ℹ Use `spec()` to retrieve the full column specification for this data.
ℹ Specify the column types or set `show_col_types = FALSE` to quiet this message.
#Using the mathcit function from MatchIt to match each smoker with a non-smoker (1 to 1 matching) based on#sex, indigeneity status, high school completion, marital status (partnered or not),#region of residence (major cities, inner regional, outer regional), language background (English speaking Yes/No) #and risky alcohol drinking (Yes/No)match_obj <-matchit(smoker ~ sex + indigeneity + high_school + partnered + remoteness + language + risky_alcohol + age,data = currentDataset, method ="nearest", distance ="glm",ratio =1,replace =FALSE)summary(match_obj)
Call:
matchit(formula = smoker ~ sex + indigeneity + high_school +
partnered + remoteness + language + risky_alcohol + age,
data = currentDataset, method = "nearest", distance = "glm",
replace = FALSE, ratio = 1)
Summary of Balance for All Data:
Means Treated Means Control Std. Mean Diff. Var. Ratio eCDF Mean
distance 0.1850 0.1130 0.6310 2.0638 0.2035
sex 0.4938 0.4421 0.1035 . 0.0518
indigeneity 0.0524 0.0175 0.1565 . 0.0349
high_school 0.4220 0.6378 -0.4370 . 0.2158
partnered 0.4630 0.6913 -0.4578 . 0.2283
remoteness 0.6119 0.4553 0.1969 1.2318 0.0522
language 0.9579 0.9130 0.2234 . 0.0449
risky_alcohol 0.6427 0.5411 0.2120 . 0.1016
age 51.6057 53.7824 -0.1676 0.8214 0.0441
eCDF Max
distance 0.3204
sex 0.0518
indigeneity 0.0349
high_school 0.2158
partnered 0.2283
remoteness 0.0921
language 0.0449
risky_alcohol 0.1016
age 0.1020
Summary of Balance for Matched Data:
Means Treated Means Control Std. Mean Diff. Var. Ratio eCDF Mean
distance 0.1850 0.1845 0.0042 1.0213 0.0002
sex 0.4938 0.4887 0.0103 . 0.0051
indigeneity 0.0524 0.0390 0.0599 . 0.0133
high_school 0.4220 0.4045 0.0353 . 0.0175
partnered 0.4630 0.4713 -0.0165 . 0.0082
remoteness 0.6119 0.6078 0.0052 1.0051 0.0014
language 0.9579 0.9671 -0.0460 . 0.0092
risky_alcohol 0.6427 0.6561 -0.0279 . 0.0133
age 51.6057 51.9723 -0.0282 0.9175 0.0145
eCDF Max Std. Pair Dist.
distance 0.0103 0.0053
sex 0.0051 0.3019
indigeneity 0.0133 0.2627
high_school 0.0175 0.1767
partnered 0.0082 0.1977
remoteness 0.0021 0.3563
language 0.0092 0.0665
risky_alcohol 0.0133 0.1350
age 0.0370 0.2881
Sample Sizes:
Control Treated
All 7026 974
Matched 974 974
Unmatched 6052 0
Discarded 0 0
#plotting the balance between smokers and non-smokersplot(match_obj, type ="jitter", interactive =FALSE)