In this exercise I use most of the variables considered by Massoglia et al. (2014), estimate the effect of incarceration on mortality, and examine an initial imputation model.

The variables I considered were:

I restricted the data to records from 1980 onward and to respondents aged 18 or older, which leaves a sample of 12,625 respondents. During that period 661 people died. The median age at death was 41.

Just to get an idea of the missing data:

countmis(org)
clincome_adj          job      healthw      cdeltot      married      rprison        cpedu        cedui 
       0.214        0.066        0.059        0.058        0.058        0.041        0.037        0.000 
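
The `countmis` helper is not shown in these notes. A minimal sketch of what such a function might look like, assuming it simply returns the proportion of missing values per column in decreasing order (the name `count_missing` and the toy data are hypothetical, not the NLSY records):

```r
# Hypothetical stand-in for countmis(): proportion of NA per column, largest first
count_missing <- function(dat, digits = 3) {
  prop <- vapply(dat, function(col) mean(is.na(col)), numeric(1))
  round(sort(prop, decreasing = TRUE), digits)
}

# Toy data for illustration only
toy <- data.frame(
  income = c(NA, 10, NA, 12),
  job    = c(1, NA, 0, 1),
  edu    = c(12, 16, 12, 14)
)
count_missing(toy)
```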

Income has the highest proportion of missing values (21.4%). I defined multilevel imputation models that use both time-invariant and time-varying variables, plus age and year. As an example, the income imputation model is:

\[\begin{aligned} income = {} & \alpha + year + age + edu + parents\_edu + prison + married \\ & + health + delinquency + job + dropout + death + \delta_r + \epsilon_i \end{aligned}\]

Death and dropout are time-invariant variables. Incarceration is one of the most important predictors of income missingness! For this exercise, I generated 10 imputations. Final versions of this exercise should use more iterations and imputations (e.g., 30 iterations, 60 imputations). Anyway, mixing does not look bad.
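
One simple way to check whether incarceration predicts income missingness is a logistic regression of a missingness indicator on the incarceration indicator. The sketch below uses simulated data, and the missingness mechanism is an assumption made only for illustration. With the real person-year data, the other covariates of the imputation model would normally enter as well.

```r
set.seed(1)
n <- 2000
prison <- rbinom(n, 1, 0.1)

# Assumption for illustration: income is missing more often when prison == 1
p_miss <- plogis(-1.5 + 1.5 * prison)
income <- rnorm(n, mean = 10)
income[rbinom(n, 1, p_miss) == 1] <- NA

# Regress the missingness indicator on incarceration
miss_income <- as.integer(is.na(income))
fit <- glm(miss_income ~ prison, family = binomial)
coef(summary(fit))
```

A positive and sizeable `prison` coefficient would indicate that income is missing more often in person-years with incarceration.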

I explored differences by gender:

# explore differences by gender
x <- org[, .(male = max(male, na.rm = TRUE), 
              prison = max(rprison, na.rm = TRUE), 
              death = max(died, na.rm = TRUE), age = max(agei, na.rm = TRUE)), id]
# these numbers are different because of age >= 18 and year >= 1980
table(x[, .(prison, death, male)])
, , male = 0

      death
prison    0    1
     0 5936  239
     1   61   17

, , male = 1

      death
prison    0    1
     0 5386  341
     1  581   64
# odds ratio male
a <- table(x[male == 1, .(prison, death)])
(a[2,2]*a[1,1]) / (a[2,1]*a[1,2])
[1] 1.739866
# odds ratio female
b <- table(x[male == 0, .(prison, death)])
(b[2,2]*b[1,1]) / (b[2,1]*b[1,2]) # huge
[1] 6.921737
# time between first incarceration event and death by gender!
x <- org[, .(id, rprison, died, male, agei, stop)]
a <- x[rprison == 1, .(prison = min(stop), age = min(agei)), .(id, male)]
b <- x[died == 1, .(death = min(stop)), .(id, male)]
#anyDuplicated(b)
#anyDuplicated(a)
setkey(a, id, male)
setkey(b, id, male)
c <- b[a]
# only 81 cases of people who have been incarcerated and died in the sample
length(c[!is.na(death), death - prison])
[1] 81
mean(c[!is.na(death) & male == 1, age]) # men are younger
[1] 26.6875
mean(c[!is.na(death) & male == 0, age]) # women are older
[1] 29.23529
# mean and median of years between first incarceration event and death
# men
mean(c[!is.na(death) & male == 1, death - prison])
[1] 14.15625
median(c[!is.na(death) & male == 1, death - prison])
[1] 13.5
hist(c[!is.na(death) & male == 1, death - prison], main = "Men: Distribution time between incarceration and death")

# women
mean(c[!is.na(death) & male == 0, death - prison])
[1] 13.29412
median(c[!is.na(death) & male == 0, death - prison])
[1] 12
hist(c[!is.na(death) & male == 0, death - prison], main = "Women: Distribution time between incarceration and death")
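
The odds ratio for women rests on only 17 deaths among incarcerated women, so it helps to look at its uncertainty. A rough sketch, using the counts from the tables above and a Wald interval on the log odds ratio (the helper `or_wald` is hypothetical, and the Wald approximation can be poor with small cells):

```r
# Odds ratio with a Wald confidence interval from a 2x2 table
# (rows: prison 0/1, columns: death 0/1)
or_wald <- function(tab, level = 0.95) {
  est <- (tab[2, 2] * tab[1, 1]) / (tab[2, 1] * tab[1, 2])
  se  <- sqrt(sum(1 / tab))              # standard error of log(OR)
  z   <- qnorm(1 - (1 - level) / 2)
  c(or = est,
    lower = exp(log(est) - z * se),
    upper = exp(log(est) + z * se))
}

# Counts copied from the tables above (filled column by column)
lab   <- list(prison = c("0", "1"), death = c("0", "1"))
women <- matrix(c(5936, 61, 239, 17), nrow = 2, dimnames = lab)
men   <- matrix(c(5386, 581, 341, 64), nrow = 2, dimnames = lab)

round(rbind(men = or_wald(men), women = or_wald(women)), 2)
```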

Let’s explore the imputations:

Then I examine the distribution of the imputed variables by age and year. I show only income and prison here; the remaining imputed variables look right. Gray lines represent the imputed datasets and the red line the observed data.

Income imputations are far below the observed values, I guess mainly due to the effect of prison on income missingness. A lagged version of income would probably improve the imputation.
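
A lagged income variable could be built within respondent as sketched below. This is only an illustration on toy data. The column name `year` is an assumption (the person-year file may index time differently), and `lag_income` is a hypothetical name.

```r
library(data.table)

# Toy person-year data for illustration only
toy <- data.table(
  id           = c(1, 1, 1, 2, 2),
  year         = c(1980, 1981, 1982, 1980, 1981),
  clincome_adj = c(9.1, NA, 9.4, 8.7, 8.9)
)

# Sort within respondent, then take the previous record's income
setorder(toy, id, year)
toy[, lag_income := shift(clincome_adj, n = 1L, type = "lag"), by = id]
toy
```

The lag inherits the missing values of income and is always missing for the first record of each respondent, so it would itself need to be handled in the imputation model.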

Incarceration looks fine. After 2004, the NLSY recorded non-response due to incarceration, which is why there are no missing records in those years.

Pooled Models

A simple model removing time-varying adjustments (different from the naive model, but similar to MSM):

Multiple imputation results:
      MIcombine.default(models)
                               results         se      (lower       upper) missInfo
prison                      0.20705581 0.13817298 -0.06376533  0.477876963      1 %
male                        0.38692541 0.08416393  0.22196705  0.551883776      0 %
magef19                    -0.15800652 0.12487920 -0.40276526  0.086752223      0 %
magef20                     0.17021554 0.12056974 -0.06609681  0.406527898      0 %
magef21                     0.06794871 0.13167489 -0.19012934  0.326026767      0 %
magef22                     0.40046214 0.11793874  0.16930644  0.631617850      0 %
magef23                     0.52363598 0.21263395  0.10688108  0.940390883      0 %
raceblack                   0.41827848 0.12071222  0.18168590  0.654871061      1 %
racenon-hispanic/non-black  0.20973827 0.12495792 -0.03518295  0.454659488      2 %
cedui                      -0.07346525 0.01896565 -0.11064084 -0.036289667      3 %
cpedu                      -0.02032035 0.01438131 -0.04853588  0.007895176      9 %
clincome_adj               -0.10112303 0.01398543 -0.12855172 -0.073694337      7 %
cdeltot                     0.13429863 0.06417320  0.00845311  0.260144156      6 %
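
For reference, `MIcombine` pools the per-imputation fits with Rubin's rules. Writing \(\hat{Q}_j\) and \(U_j\) for the estimate and its squared standard error in imputed dataset \(j\) (symbols introduced here for illustration), and with \(m = 10\) imputations in this exercise:

```latex
\[
\bar{Q} = \frac{1}{m}\sum_{j=1}^{m} \hat{Q}_j, \qquad
\bar{U} = \frac{1}{m}\sum_{j=1}^{m} U_j, \qquad
B = \frac{1}{m-1}\sum_{j=1}^{m} \left(\hat{Q}_j - \bar{Q}\right)^2, \qquad
T = \bar{U} + \left(1 + \frac{1}{m}\right) B
\]
```

The `results` column corresponds to \(\bar{Q}\) and `se` to \(\sqrt{T}\). The `missInfo` column grows as the between-imputation variance \(B\) becomes large relative to \(T\).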

Adding time-varying covariates.

Multiple imputation results:
      MIcombine.default(models)
                               results         se      (lower       upper) missInfo
prison                      0.05844335 0.13637113 -0.20884505  0.325731758      1 %
male                        0.52322639 0.08484666  0.35692992  0.689522855      0 %
magef19                    -0.18304748 0.12497753 -0.42799893  0.061903976      0 %
magef20                     0.14054481 0.12068848 -0.09600026  0.377089885      0 %
magef21                     0.06108670 0.13177843 -0.19719429  0.319367686      0 %
magef22                     0.39196654 0.11812839  0.16043914  0.623493938      0 %
magef23                     0.52082845 0.21309940  0.10316129  0.938495610      0 %
raceblack                   0.25216301 0.12067392  0.01564484  0.488681169      1 %
racenon-hispanic/non-black  0.25761083 0.12464618  0.01330041  0.501921249      2 %
cedui                      -0.03627544 0.01919190 -0.07389332  0.001342446      2 %
cpedu                      -0.02003647 0.01432104 -0.04813576  0.008062817      9 %
clincome_adj               -0.01975688 0.01642951 -0.05199882  0.012485049     10 %
healthw                     0.83907566 0.10570916  0.63188895  1.046262377      0 %
job                        -0.76537225 0.10104180 -0.96342877 -0.567315725      3 %
married                    -0.83233637 0.09413722 -1.01684304 -0.647829702      1 %
cdeltot                     0.08505121 0.06561958 -0.04366846  0.213770884      8 %

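With time-varying covariates in the model, the prison coefficient drops to 0.058 and its confidence interval (-0.209, 0.326) includes zero: no detectable effect of prison on mortality.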

Because of time-varying confounding, I used the MSM adjustment. This seems to correct for the over-adjustment introduced by conditioning on mediators. The effects are larger but less precise and not statistically significant. As in Massoglia et al. (2014), the interaction between prison and gender suggests a huge positive effect for women. The imputations I am using are not completely suitable: the imputation model should include the interaction with gender. The coefficient for women is too big to be credible (a hazard ratio of about 5, similar to the odds ratios computed above).
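
As a reminder of how MSM weights are typically built, here is a minimal sketch of stabilized inverse probability weights on simulated person-period data. It is not the weight model used for the results in this section: the covariates, the exposure models and all variable names are assumptions for illustration, and a full specification would usually also condition on exposure history.

```r
set.seed(2)
n_id <- 300
n_t  <- 5
d <- data.frame(id = rep(seq_len(n_id), each = n_t),
                t  = rep(seq_len(n_t), times = n_id))
d$male   <- rep(rbinom(n_id, 1, 0.5), each = n_t)  # baseline covariate
d$job    <- rbinom(nrow(d), 1, 0.6)                # time-varying confounder
d$prison <- rbinom(nrow(d), 1, plogis(-2 + 0.8 * d$male - 1 * d$job))

# Numerator model: baseline covariates only
num <- glm(prison ~ male + t, family = binomial, data = d)
# Denominator model: baseline plus time-varying covariates
den <- glm(prison ~ male + t + job, family = binomial, data = d)

# Probability of the exposure value actually observed in each person-period
p_obs <- function(fit, a) {
  p <- fitted(fit)
  ifelse(a == 1, p, 1 - p)
}
d$ratio <- p_obs(num, d$prison) / p_obs(den, d$prison)

# Stabilized weight: cumulative product of the ratios within person over time
d <- d[order(d$id, d$t), ]
d$sw <- ave(d$ratio, d$id, FUN = cumprod)
summary(d$sw)
```

The weights would then enter the outcome model as case weights. Stabilized weights with a mean far from 1 or with extreme values usually point to a problem in the exposure models.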

Multiple imputation results:
      MIcombine.default(modelsMSM)
                                results         se      (lower       upper) missInfo
prison                      0.281064758 0.19620595 -0.10471927  0.666848785     16 %
magef19                    -0.194962483 0.13907104 -0.46766175  0.077736781      6 %
magef20                    -0.031991808 0.15602230 -0.33789005  0.273906438      5 %
magef21                     0.070066585 0.14210109 -0.20847563  0.348608804      3 %
magef22                     0.365676881 0.12695691  0.11683424  0.614519521      2 %
magef23                     0.493597994 0.22522961  0.05213245  0.935063539      2 %
male                        0.469803620 0.09460129  0.28436504  0.655242200      3 %
raceblack                   0.236583066 0.13763483 -0.03329556  0.506461692      6 %
racenon-hispanic/non-black  0.304874084 0.13600363  0.03825046  0.571497703      4 %
cedui                      -0.004109759 0.02197381 -0.04718881  0.038969294      4 %
cpedu                      -0.028272743 0.01627431 -0.06020967  0.003664184     10 %
clincome_adj               -0.014768386 0.01983132 -0.05377729  0.024240514     17 %
healthw                     0.791785692 0.10899082  0.57813693  1.005434451      3 %
job                        -0.727400945 0.10834780 -0.93979810 -0.515003792      4 %
married                    -0.856636235 0.10068798 -1.05398434 -0.659288128      1 %
cdeltot                     0.082107232 0.09021496 -0.09655979  0.260774252     29 %

Adding the interaction between prison and gender:

Multiple imputation results:
      MIcombine.default(modelsMSM)
                                results         se      (lower       upper) missInfo
prison                      1.634382867 0.29486277  1.05591949  2.212846246      8 %
male                        0.542145740 0.09349000  0.35887475  0.725416725      4 %
magef19                    -0.216069895 0.13893477 -0.48852445  0.056384660      6 %
magef20                    -0.050113070 0.15652347 -0.35697616  0.256750023      5 %
magef21                     0.065940045 0.14231506 -0.21302992  0.344910010      3 %
magef22                     0.363537763 0.12661104  0.11537048  0.611705043      2 %
magef23                     0.502423222 0.22282963  0.06566337  0.939183076      2 %
raceblack                   0.253268805 0.13667027 -0.01474066  0.521278274      6 %
racenon-hispanic/non-black  0.311047923 0.13601157  0.04439313  0.577702713      5 %
cedui                      -0.003172991 0.02201871 -0.04634187  0.039995890      5 %
cpedu                      -0.028772638 0.01615455 -0.06047258  0.002927308     10 %
clincome_adj               -0.016393007 0.01983752 -0.05542020  0.022634183     17 %
healthw                     0.785154809 0.10936140  0.57077585  0.999533771      3 %
job                        -0.729676066 0.10866812 -0.94269926 -0.516652871      4 %
married                    -0.855741824 0.10053569 -1.05279202 -0.658691624      1 %
cdeltot                     0.087528616 0.08868364 -0.08810911  0.263166342     29 %
prison:male                -1.553770291 0.36463179 -2.27040549 -0.837135093     15 %
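
To read the interaction model on the hazard ratio scale, the coefficients can be exponentiated. The sketch below copies the two coefficients from the table above. In that model `prison` is the effect for women (`male = 0`), and the effect for men is the sum of `prison` and `prison:male`.

```r
# Pooled coefficients copied from the interaction model above
b_prison      <- 1.634382867    # prison (women, male = 0)
b_interaction <- -1.553770291   # prison:male

hr_women <- exp(b_prison)                  # hazard ratio for women
hr_men   <- exp(b_prison + b_interaction)  # hazard ratio for men
round(c(women = hr_women, men = hr_men), 2)
```

This gives a hazard ratio of roughly 5.1 for women and 1.1 for men. A standard error for the men's effect would also need the covariance between the two coefficients, which the table does not report.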
