In this exercise I use most of the variables considered by Massoglia et al. (2014), get estimates of the effect of incarceration on mortality, and examine an initial imputation model.
The variables I considered were (I = time-invariant, V = time-varying):
- Gender (I)
- Age (I)
- Race (I)
- Income (V)
- Education (V)
- Parents’ education (I)
- Delinquency (1980) (I)
- Working (V)
- Health issues (related to work) (V)
- Incarceration (V)
- Married (V)
I kept records from 1980 onward for respondents aged 18 or older, which gives a sample of 12,625 respondents. During that period 661 of them died. The median age at death was 41.
Just to get an idea of the missing data:
countmis(org)
clincome_adj          job      healthw      cdeltot      married      rprison        cpedu        cedui
       0.214        0.066        0.059        0.058        0.058        0.041        0.037        0.000
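The definition of `countmis` is not shown in this report. A minimal sketch of what such a helper might look like, assuming it returns the proportion of missing values per column, sorted from most to least missing (the name `count_missing` and the toy data are hypothetical, not the author's code):

```r
# Hypothetical stand-in for countmis(): proportion of NA values per column,
# rounded and sorted in decreasing order. Works on a data.frame or data.table.
count_missing <- function(dt, digits = 3) {
  props <- sapply(dt, function(col) mean(is.na(col)))
  sort(round(props, digits), decreasing = TRUE)
}

# Toy data for illustration only (not the NLSY file)
toy <- data.frame(
  income  = c(NA, 10, NA, 30),
  job     = c(1, NA, 0, 1),
  married = c(0, 1, 1, 0)
)
res <- count_missing(toy)
print(res)
```

In the toy data, `income` has two missing values out of four, so it comes first, mirroring how `clincome_adj` leads the output above.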
Income has the highest proportion of missing values (21.4%). I defined multilevel imputation models that use both time-invariant and time-varying variables, plus age and year. As an example, the income imputation model is:
\[
\begin{aligned}
income ={} & \alpha + year + age + edu + parents\_edu + prison + married \\
& + health + delinquency + job + dropout + death + \delta_r + \epsilon_i
\end{aligned}
\]
Death and dropout are time-invariant variables. One of the most important predictors of income missingness is incarceration. For this exercise, I generated 10 imputations. Final versions of this exercise should use more iterations and imputations (e.g., 30 iterations, 60 imputations). In any case, mixing doesn’t look bad.
I explored differences by gender:
# explore differences by gender
x <- org[, .(male   = max(male, na.rm = TRUE),
             prison = max(rprison, na.rm = TRUE),
             death  = max(died, na.rm = TRUE),
             age    = max(agei, na.rm = TRUE)), by = id]
# these numbers are different because of age >= 18 and year >= 1980
table(x[, .(prison, death, male)])
, , male = 0

      death
prison    0    1
     0 5936  239
     1   61   17

, , male = 1

      death
prison    0    1
     0 5386  341
     1  581   64
# odds ratio male
a <- table(x[male == 1, .(prison, death)])
(a[2,2]*a[1,1]) / (a[2,1]*a[1,2])
[1] 1.739866
# odds ratio female
b <- table(x[male == 0, .(prison, death)])
(b[2,2]*b[1,1]) / (b[2,1]*b[1,2]) # huge
[1] 6.921737
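The two odds ratios above are point estimates only. As a rough check on how much uncertainty surrounds them, here is a sketch that recomputes them from the printed counts and adds a Wald 95% interval on the log scale. The function name `odds_ratio_ci` is hypothetical, and the Wald interval is my choice for illustration, not something used in the analysis itself:

```r
# Odds ratio and Wald confidence interval from a 2x2 table laid out as
# rows = prison (0, 1), columns = death (0, 1).
odds_ratio_ci <- function(tab, level = 0.95) {
  log_or <- log((tab[2, 2] * tab[1, 1]) / (tab[2, 1] * tab[1, 2]))
  se     <- sqrt(sum(1 / tab))          # standard error of the log odds ratio
  z      <- qnorm(1 - (1 - level) / 2)
  exp(c(or = log_or, lower = log_or - z * se, upper = log_or + z * se))
}

# Counts copied from the tables above (matrix() fills column by column)
men   <- matrix(c(5386, 581, 341, 64), nrow = 2,
                dimnames = list(prison = 0:1, death = 0:1))
women <- matrix(c(5936, 61, 239, 17), nrow = 2,
                dimnames = list(prison = 0:1, death = 0:1))

or_men   <- odds_ratio_ci(men)
or_women <- odds_ratio_ci(women)
print(round(rbind(men = or_men, women = or_women), 2))
```

Because only 17 incarcerated women died, the standard error for women is dominated by that single cell, so their interval is considerably wider than the one for men.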
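# time between first incarceration event and death, by gender
x <- org[, .(id, rprison, died, male, agei, stop)]
# first incarceration record per respondent: stop time and age
a <- x[rprison == 1, .(prison = min(stop), age = min(agei)), .(id, male)]
# first death record per respondent
b <- x[died == 1, .(death = min(stop)), .(id, male)]
#anyDuplicated(b)
#anyDuplicated(a)
setkey(a, id, male)
setkey(b, id, male)
# keep every incarcerated respondent; death is NA for those who did not die
c <- b[a]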
# only 81 cases of people who have been incarcerated and died in the sample
length(c[!is.na(death), death - prison])
[1] 81
mean(c[!is.na(death) & male == 1, age]) # men are younger at first incarceration
[1] 26.6875
mean(c[!is.na(death) & male == 0, age]) # women are older at first incarceration
[1] 29.23529
# mean and median number of years between first incarceration event and death
# men
mean(c[!is.na(death) & male == 1, death - prison])
[1] 14.15625
median(c[!is.na(death) & male == 1, death - prison])
[1] 13.5
hist(c[!is.na(death) & male == 1, death - prison], main = "Men: Distribution time between incarceration and death")

# women
mean(c[!is.na(death) & male == 0, death - prison])
[1] 13.29412
median(c[!is.na(death) & male == 0, death - prison])
[1] 12
hist(c[!is.na(death) & male == 0, death - prison], main = "Women: Distribution time between incarceration and death")

Let’s explore the imputations:



Then, I examine the distribution of the imputed variables by age and year. Here I show only income and prison; the rest of the imputed variables look right. Gray lines represent imputed datasets and the red line represents the observed data.
Income imputations are far below the observed values, I guess mainly due to the effect of prison on income missingness. It would probably be a good idea to use a lagged version of income to improve the imputation.
Incarceration looks fine. After 2004, the NLSY recorded non-response due to incarceration, which is why there are no missing records during those years.




Pooled Models
A simple model without time-varying adjustments (different from the naive model, but similar to the MSM):
Multiple imputation results:
MIcombine.default(models)
results se (lower upper) missInfo
prison 0.20705581 0.13817298 -0.06376533 0.477876963 1 %
male 0.38692541 0.08416393 0.22196705 0.551883776 0 %
magef19 -0.15800652 0.12487920 -0.40276526 0.086752223 0 %
magef20 0.17021554 0.12056974 -0.06609681 0.406527898 0 %
magef21 0.06794871 0.13167489 -0.19012934 0.326026767 0 %
magef22 0.40046214 0.11793874 0.16930644 0.631617850 0 %
magef23 0.52363598 0.21263395 0.10688108 0.940390883 0 %
raceblack 0.41827848 0.12071222 0.18168590 0.654871061 1 %
racenon-hispanic/non-black 0.20973827 0.12495792 -0.03518295 0.454659488 2 %
cedui -0.07346525 0.01896565 -0.11064084 -0.036289667 3 %
cpedu -0.02032035 0.01438131 -0.04853588 0.007895176 9 %
clincome_adj -0.10112303 0.01398543 -0.12855172 -0.073694337 7 %
cdeltot 0.13429863 0.06417320 0.00845311 0.260144156 6 %
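The pooled tables in this section are printed by `MIcombine`, which combines the per-imputation fits using Rubin's rules. To make the pooling step concrete, here is a minimal sketch for a single coefficient. The estimates are made up, the name `pool_rubin` is hypothetical, and the sketch leaves out the degrees-of-freedom correction that a full implementation applies:

```r
# Rubin's rules for one coefficient across m imputed datasets.
pool_rubin <- function(est, se) {
  m       <- length(est)
  q_bar   <- mean(est)              # pooled point estimate
  within  <- mean(se^2)             # average within-imputation variance
  between <- var(est)               # between-imputation variance
  total   <- within + (1 + 1 / m) * between
  c(estimate     = q_bar,
    se           = sqrt(total),
    frac_between = (1 + 1 / m) * between / total)
}

# Hypothetical estimates from m = 5 imputations (not taken from the models here)
est <- c(0.20, 0.22, 0.19, 0.21, 0.23)
se  <- c(0.14, 0.14, 0.13, 0.14, 0.15)
pooled <- pool_rubin(est, se)
print(round(pooled, 3))
```

The `frac_between` value is the share of total variance that comes from disagreement between imputations. It approximates what the `missInfo` column reports, which is why variables with more missing data, such as income, show larger percentages there.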
Adding time-varying covariates.
Multiple imputation results:
MIcombine.default(models)
results se (lower upper) missInfo
prison 0.05844335 0.13637113 -0.20884505 0.325731758 1 %
male 0.52322639 0.08484666 0.35692992 0.689522855 0 %
magef19 -0.18304748 0.12497753 -0.42799893 0.061903976 0 %
magef20 0.14054481 0.12068848 -0.09600026 0.377089885 0 %
magef21 0.06108670 0.13177843 -0.19719429 0.319367686 0 %
magef22 0.39196654 0.11812839 0.16043914 0.623493938 0 %
magef23 0.52082845 0.21309940 0.10316129 0.938495610 0 %
raceblack 0.25216301 0.12067392 0.01564484 0.488681169 1 %
racenon-hispanic/non-black 0.25761083 0.12464618 0.01330041 0.501921249 2 %
cedui -0.03627544 0.01919190 -0.07389332 0.001342446 2 %
cpedu -0.02003647 0.01432104 -0.04813576 0.008062817 9 %
clincome_adj -0.01975688 0.01642951 -0.05199882 0.012485049 10 %
healthw 0.83907566 0.10570916 0.63188895 1.046262377 0 %
job -0.76537225 0.10104180 -0.96342877 -0.567315725 3 %
married -0.83233637 0.09413722 -1.01684304 -0.647829702 1 %
cdeltot 0.08505121 0.06561958 -0.04366846 0.213770884 8 %
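Once time-varying covariates are included, there is no detectable effect of prison on mortality: the coefficient drops to 0.058 and its pooled interval (-0.209 to 0.326) includes zero.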
Because of time-varying confounding, I used a marginal structural model (MSM) adjustment. This seems to correct for the adjustment of mediators. The estimated effect of prison is larger (0.28 versus 0.06) but less precise and not statistically significant. As in Massoglia et al. (2014), the interaction between prison and gender suggests a huge positive effect for women. The imputations I am using are not completely suitable: the imputation model should include the interaction with gender. The coefficient for women is too big to be credible (a hazard ratio of about 5, roughly similar to the odds ratio for women computed above).
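The weights behind the MSM adjustment are not shown in this report. As an illustration of the general idea only, here is a simplified sketch of stabilized inverse probability weights on simulated person-period data. All variable names, the simulated data, and the model formulas are assumptions of mine. A real application would condition on exposure and covariate history (for example, lagged values), which this sketch skips:

```r
set.seed(1)
n_id <- 200   # simulated respondents
n_t  <- 5     # periods per respondent

# Simulated person-period data, sorted by id and then period
d <- data.frame(id = rep(seq_len(n_id), each = n_t),
                t  = rep(seq_len(n_t), times = n_id))
d$male   <- rep(rbinom(n_id, 1, 0.5), each = n_t)   # time-invariant
d$job    <- rbinom(nrow(d), 1, 0.6)                  # time-varying confounder
d$prison <- rbinom(nrow(d), 1, plogis(-2 + 0.8 * d$male - 1.0 * d$job))

# Numerator model: exposure given time-invariant covariates only
num_fit <- glm(prison ~ male + factor(t), family = binomial, data = d)
# Denominator model: exposure given time-invariant and time-varying covariates
den_fit <- glm(prison ~ male + factor(t) + job, family = binomial, data = d)

# Probability of the exposure value actually observed, under each model
p_num <- ifelse(d$prison == 1, fitted(num_fit), 1 - fitted(num_fit))
p_den <- ifelse(d$prison == 1, fitted(den_fit), 1 - fitted(den_fit))

# Stabilized weight: cumulative product of the ratio within each respondent
d$sw <- ave(p_num / p_den, d$id, FUN = cumprod)
print(summary(d$sw))
```

The outcome model is then fitted with `sw` as weights and without the time-varying covariates, so that mediators such as work or marriage are handled through the weights rather than through regression adjustment.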
Multiple imputation results:
MIcombine.default(modelsMSM)
results se (lower upper) missInfo
prison 0.281064758 0.19620595 -0.10471927 0.666848785 16 %
magef19 -0.194962483 0.13907104 -0.46766175 0.077736781 6 %
magef20 -0.031991808 0.15602230 -0.33789005 0.273906438 5 %
magef21 0.070066585 0.14210109 -0.20847563 0.348608804 3 %
magef22 0.365676881 0.12695691 0.11683424 0.614519521 2 %
magef23 0.493597994 0.22522961 0.05213245 0.935063539 2 %
male 0.469803620 0.09460129 0.28436504 0.655242200 3 %
raceblack 0.236583066 0.13763483 -0.03329556 0.506461692 6 %
racenon-hispanic/non-black 0.304874084 0.13600363 0.03825046 0.571497703 4 %
cedui -0.004109759 0.02197381 -0.04718881 0.038969294 4 %
cpedu -0.028272743 0.01627431 -0.06020967 0.003664184 10 %
clincome_adj -0.014768386 0.01983132 -0.05377729 0.024240514 17 %
healthw 0.791785692 0.10899082 0.57813693 1.005434451 3 %
job -0.727400945 0.10834780 -0.93979810 -0.515003792 4 %
married -0.856636235 0.10068798 -1.05398434 -0.659288128 1 %
cdeltot 0.082107232 0.09021496 -0.09655979 0.260774252 29 %
Multiple imputation results:
MIcombine.default(modelsMSM)
results se (lower upper) missInfo
prison 1.634382867 0.29486277 1.05591949 2.212846246 8 %
male 0.542145740 0.09349000 0.35887475 0.725416725 4 %
magef19 -0.216069895 0.13893477 -0.48852445 0.056384660 6 %
magef20 -0.050113070 0.15652347 -0.35697616 0.256750023 5 %
magef21 0.065940045 0.14231506 -0.21302992 0.344910010 3 %
magef22 0.363537763 0.12661104 0.11537048 0.611705043 2 %
magef23 0.502423222 0.22282963 0.06566337 0.939183076 2 %
raceblack 0.253268805 0.13667027 -0.01474066 0.521278274 6 %
racenon-hispanic/non-black 0.311047923 0.13601157 0.04439313 0.577702713 5 %
cedui -0.003172991 0.02201871 -0.04634187 0.039995890 5 %
cpedu -0.028772638 0.01615455 -0.06047258 0.002927308 10 %
clincome_adj -0.016393007 0.01983752 -0.05542020 0.022634183 17 %
healthw 0.785154809 0.10936140 0.57077585 0.999533771 3 %
job -0.729676066 0.10866812 -0.94269926 -0.516652871 4 %
married -0.855741824 0.10053569 -1.05279202 -0.658691624 1 %
cdeltot 0.087528616 0.08868364 -0.08810911 0.263166342 29 %
prison:male -1.553770291 0.36463179 -2.27040549 -0.837135093 15 %
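To read the interaction model on the hazard ratio scale, the coefficients can be exponentiated. This sketch uses only the two pooled coefficients printed above and assumes, as the text does, that they are log hazard ratios:

```r
# Pooled coefficients copied from the interaction model above
b_prison      <- 1.634382867    # effect of prison when male = 0
b_prison_male <- -1.553770291   # change in that effect when male = 1

hr_women <- exp(b_prison)                   # hazard ratio for women
hr_men   <- exp(b_prison + b_prison_male)   # hazard ratio for men
print(round(c(women = hr_women, men = hr_men), 2))
# women   men
#  5.13  1.08
```

The implied hazard ratio is about 5.1 for women and about 1.1 for men. An interval for the men's ratio would need the covariance between the two coefficients, which the printed table does not include.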
