About
Overview
This ADA uses data from the project “The effect of intranasal oxytocin on neural functioning in widow(er)s” (#1503744420, PI: Mary-Frances O’Connor, Co-PI: Brian Arizmendi). It was a double-blind randomized crossover study, designed to address conflicting accounts of motivated behavior in complicated grief using a novel approach-avoidance task variant (grief AAT; gAAT) that allows for comparison between participant-provided photos and nomothetic grief stimuli. Participants were 40 bereaved adults ages 55-80, who experienced the death of a spouse or long-term romantic partner between six months and three years prior to their participation.
Task
Participants attended two identical experimental sessions, at which they received one of two intranasal sprays (treatment A/B) and then took part in an fMRI scan, which involved a structural sequence, the gAAT, and a resting state sequence.
For the gAAT, five stimulus categories were presented: (1) deceased spouse, (2) living loved one [another attachment figure], (3) stranger, (4) generic grief-related scenes, and (5) neutral scenes. Photos provided by the participant were scanned and resized to a standard 750-pixel height and width, and framed in colored frames to match the stimuli used for non-idiographic categories (i.e., stranger, grief-related, neutral). Photos of the stranger were sex-matched to the participant’s partner.
During the task, participants were instructed to follow rules based on an irrelevant characteristic of the stimuli, namely, the color of the frame around the photo (e.g., “push the joystick when the frame is blue, pull when the frame is yellow”). The task was completed in two runs of approximately seven minutes each. In the second run, the instructions were reversed (e.g. “now pull the joystick for blue framed photos”). Order of instructions was counterbalanced across participants. Stimuli were presented via Inquisit 4 (2014), in a pseudorandomized order determined by genetic algorithm to optimize statistical power and psychological validity (Wager & Nichols, 2003). Each trial last 3000ms, with 2500 ms allowed for response. Intertrial interval was 500ms.
Participants completed the task using an MRI-compatible joystick. The stimuli were animated so that a joystick push would make the image smaller (as if they were pushing it away from them) and a joystick pull would make the image larger (as if they were bringing it nearer to them).
Variables
The primary dependent variable of interest for the present analysis is reaction time bias, computed as RTpush-RTpull. Negative bias scores (faster to push than pull) indicate relative avoidance bias, while positive scores (faster to pull than to push) indicate relative approach bias. Because the two runs were identical in every way except for the instruction provided, each trial had both a push RT and a pull RT (e.g., run1 trial1 was push, run2 trial1 was pull, so subtract RT on run2 trial 1 from run1 trial1 to get the difference - this is further elaborated in the data cleaning script below), unless they failed to reverse the instructions.
Main predictors of interest included trial number (in order to establish whether peoples’ response bias changed over the 7-minute run), stimulus (neutral, spouse, stranger, living loved one, death), and condition (treatment A/B - oxytocin or placebo [I’m still blinded]).
Participants also completed a number of self-report measures (state and trait) and provided biological samples for OXTR genotyping and cortisol assay, some of which were used as predictors.
Analyses (NHST)
Remember, the effect of time refers to trialnumber (this is NOT the time variable; time refers to time of day when task was run.)
The dependent variable, bias_t, is the measure of relative behavioral approach/avoidance bias, based on reaction time to push or pull a joystick in response to various images. We transformed it because it was not normally distributed, and use the transformed variable for all analyses.
Unconditional means model
The overall mean bias score (transformed) is about -0.002, with a standard deviation of 0.023.
ICC is about 1.6% of the variance. This means that 1.6% of the variance in bias is explained by people being different from one another, while the remainder is explained by…something else.
Add fixed and random intercept and slope of time
The average bias at trial number = 0 is -0.016 and gets increasingly approach-biased by a nonsignificant fraction of a percent with each unit of time. There is no significant effect of trial number (time) on RT bias score, t = 0.74, p = 0.457.
99.99% of variance in bias is due to the between-person variance, after controlling for time. This is consistent with the lack of a significant effect of time.
Adding stimulus as a fixed effect
There were 5 different types of stimuli, presented in a randomized order (consistent across participants/sessions) determined by genetic algorithm. Everyone saw the same stimuli in the same order, so it is a fixed effect. The different categories of images were: Death (generic), Deceased (personal photo), Other living loved one (personal photo), Stranger (sex-matched to spouse), and Neutral (objects). The factor stim was created such that Neutral was the reference level.
NO effect of time. There is a significant fixed effect of spouse, t = 3.55, p = .0004. There is also a non-significant effect of stranger, t = -1.73, p = .084. This indicates that people tended to show more approach bias to photos of their spouse (across conditions). The parameter estimate is 0.12 with an SE of 0.03. They also tended to show greater more avoidance bias to photos of the sex-matched stranger.
95% CIs don’t cross zero for spouse, but do for all other stimulus types. This supports the significant effect of spouse stimuli on reaction time bias.
The ICC is identical to that in model 2.
Comparing the model with the fixed effect of stimulus (m3) to the model without (m2), m3 is better (p < .0001) as evidenced by lower AIC, BIC, and lL values.
Take out fixed and random effect of time
Because there’s no significant effect of time, makes sense to drop it from the model. Now we’re basically doing regression but accounting for whatever interdependence exists. Note that these results will NOT match the results of the regressions, because the regressions are based on bias score medians as the DV, rather than means.
Fixed effects are still more or less the same - significant for spouse, marginal for stranger - although the parameter estimates and test statistics are slightly different.
As above, spouse is the only stimulus category whose 95 CIs don’t cross zero, supporting a significant effect of spouse on bias - in this case, viewing spouse images was associated with a relative approach bias.
Is m4 any better than m3?
Nope. Models do not significantly differ (p = 0.851), though the AIC and BIC values are a little smaller for m4 than m3. So let’s forget time and go with m4…
Adding condition as a fixed effect
Participants completed the task at two different sessions, where they were randomized to receive treatment A at one and treatment B at the other (order counterbalanced across participants). The two treatments are oxytocin vs. placebo.
The fixed effect of condition B is not significant, t = 0.15, p = 0.884.
The two models are not significantly different, but the one with the fixed effect of condition (m5) has slightly higher AIC/BIC. Not compelling enough to select that model over m4, though.
Adding the stimulus*condition interaction
We might expect that peoples’ tendency to approach or avoid might differ depending on which treatment they got, but only for certain stimuli.
Interesting. With the interaction in the model, there is a significant effect of death and stranger but not spouse. 95% CIs indicate that viewing the generic death and stranger stimuli are both associated with greater relative avoidance bias.
The model with the interaction in it is not significantly better than either the one with just the fixed effect of condition or with no effect of condition at all, so makes sense to go with the simplest model which only includes stimulus as a fixed effect.
Adding other plausible predictors
There are a few other variables that we might expect to impact approach/avoidance bias, namely grief severity, OXTR genotype, time since death, and attachment style (avoidant, anxious).
None of the added fixed effects are significant in the new model, and the AIC/BIC values are higher, so take ’em out.
Final model
A case of significant, but not meaningful: The R2 for the entire model is .02, indicating that 2% of the variance in bias is explained by the model. So this model is not very good at predicting approach/avoidance bias from the fixed effect of stimulus within person. Oh well.
Check final model assumptions
Checking normality of the residuals
The QQ plot looks pretty okay, albeit a few outliers. D113 is the person who most falls off the line, and does so at both ends. The residuals are also pretty evenly distributed across time, though many fall outside of the +/- 2 range.
Getting an error here for the random slopes Error in [.data.frame(ranef(m4), 72) : undefined columns selected. I think this may be because I don’t have any random slopes, since I took time out of the model. Indeed, when I go into m4, it doesn’t look like a second element for the slopes exists. Let’s test this…
My suspicion was correct: the error message above is because there are no random slopes for my final model.
Checking homoscedasticity
Note that again there are a lot of residuals falling outside of the +/-2 range.
Also, there’s something weird about the L2 plot. Not sure why it shows only 3 of 5 stimulus types.
Analyses (Bayesian)
Unconditional means model
First step is to look at the unconditional means model. We know from the exploratory stuff in previous sections that bias_t is normally distributed, so we can tell brms to use normal likelihood (Gaussian).
library(brms)
# b1N <- brm(bias_t ~ 1 + (1|ID), data=data, family=gaussian())
# saveRDS(b1N, "mlm_ada1_b1N.rds")
summary(b1N)
Family: gaussian
Links: mu = identity; sigma = identity
Formula: bias_t ~ 1 + (1 | ID)
Data: data (Number of observations: 9289)
Samples: 4 chains, each with iter = 2000; warmup = 1000; thin = 1;
total post-warmup samples = 4000
Group-Level Effects:
~ID (Number of levels: 35)
Estimate Est.Error l-95% CI u-95% CI Eff.Sample Rhat
sd(Intercept) 0.13 0.02 0.10 0.17 1184 1.00
Population-Level Effects:
Estimate Est.Error l-95% CI u-95% CI Eff.Sample Rhat
Intercept -0.00 0.02 -0.05 0.05 1712 1.00
Family Specific Parameters:
Estimate Est.Error l-95% CI u-95% CI Eff.Sample Rhat
sigma 0.99 0.01 0.98 1.01 4000 1.00
Samples were drawn using sampling(NUTS). For each parameter, Eff.Sample
is a crude measure of effective sample size, and Rhat is the potential
scale reduction factor on split chains (at convergence, Rhat = 1).
Rhat = 1 across the board, so that’s good.
plot(b1N) # check trace plots


I think these plots look okay? b_Intercept and SD_ID__Intercept plots are not exactly on top of each other, but do have a lot of overlap in their distributions.
pp_check(b1N, nsamples=20) # posterior predictive check using normal likelihood

pp_check(b1N, type="error_hist", nsamples=20, binwidth=1) # error histogram (histogram of prediction residuals)

The observed distribution is a very good fit to the predicted data (based on the normal distribution). The error histogram also shows that residuals are normally distributed around 0. See this page for some helpful documentation.
# check ICC to get sense of how variance is distributed between- and within- people
iSd <- as.numeric(summary(b1N)$random$ID[1,1]) # finds 1st row, 1st column of random effects inside the summary of b1N
iVar <- iSd^2
residSd <- as.numeric(summary(b1N)$spec_pars[[1,1]])
residVar <- residSd^2
icc1 <- iVar/(iVar+residVar)
icc1 # this is the unconditional growth model - just happens in this dataset that it's a nuisance model
[1] 0.01749842
For the unconditional growth model, the ICC is 0.017, meaning that only about 2% of the variance in bias is because people on average perform differently from each other on the task.
Time as fixed and random effect
## check whether we need time as a control variable
# b2 <- brm(bias_t ~ 1 + trialnum + (1 + trialnum|ID), data=data, family=skew_normal(), control=list(adapt_delta = .95))
# saveRDS(b2, "mlm_ada1_b2.rds")
summary(b2)
Family: skew_normal
Links: mu = identity; sigma = identity; alpha = identity
Formula: bias_t ~ 1 + trialnum + (1 + trialnum | ID)
Data: data (Number of observations: 9289)
Samples: 4 chains, each with iter = 2000; warmup = 1000; thin = 1;
total post-warmup samples = 4000
Group-Level Effects:
~ID (Number of levels: 35)
Estimate Est.Error l-95% CI u-95% CI Eff.Sample Rhat
sd(Intercept) 0.12 0.03 0.07 0.17 1651 1.00
sd(trialnum) 0.00 0.00 0.00 0.00 510 1.01
cor(Intercept,trialnum) 0.22 0.47 -0.75 0.95 2352 1.00
Population-Level Effects:
Estimate Est.Error l-95% CI u-95% CI Eff.Sample Rhat
Intercept -0.02 0.03 -0.07 0.04 1819 1.00
trialnum 0.00 0.00 -0.00 0.00 4000 1.00
Family Specific Parameters:
Estimate Est.Error l-95% CI u-95% CI Eff.Sample Rhat
sigma 0.99 0.01 0.98 1.01 4000 1.00
alpha -0.02 0.35 -0.64 0.60 4000 1.00
Samples were drawn using sampling(NUTS). For each parameter, Eff.Sample
is a crude measure of effective sample size, and Rhat is the potential
scale reduction factor on split chains (at convergence, Rhat = 1).
Note: Estimate = mean of posterior distribution for each of the parameters.
Group-level effects: When trialnum is 0, the standard deviation of the intercepts is 0.12. There is no random effect of time: the slopes are not at all variable (people are not different from each other). Zero is a credible value for the correlation, because the 95% CIs (credible intervals) cross zero.
Note that Rhat for sd(trialnum) in the random effects is 1.01. Per the brms documentation, this does not seem to indicate a convergence issue due to lack of error messages and the fact that the Rhat for the group-level effect in their example is 1.01: “If ‘Rhat’ is considerably greater than 1, the algorithm has not yet converged and it is necessary to run more iterations and / or set stronger priors.” However, it does indicate that I would probably want to run it again with a higher number of iterations or stronger priors.
Population-level effects: There is no fixed effect of time, either, given that the upper and lower CI limits are -0.0, 0.0.
plot(b2)



SD_ID__trialnum and cor_ID__Intercept__trialnum are totally skewed.
pp_check(b2, nsamples=20)

pp_check(b2, type="error_hist", nsamples=20, binwidth=1)

Again, the trace plot shows that the observed distribution is a very good fit to the predicted data based on the normal distribution. The error histogram also shows that residuals are normally distributed around 0.
SKIPPING THE CHUNK BELOW… I can’t get sjstats to install (issue with the specific TMB package version- freaking dependencies are the bane of my existence with all these different packages), and no point in running cross validation on a model that we’re going to drop anyway due to lack of both fixed/random effects of time.
# install.packages("sjstats")
# library(sjstats)
# equi_test(b2) # is there a fixed effect?
# waic(b1N, b2) # does WAIC think we should keep time? If so, given there is no fixed effect, this is due to the random effect accounting for variance.
# b1 <- add_ic(b1, ic="R2")
# b2 <- add_ic(b2, ic="R2")
# mean(b1$R2)
# mean(b2$R2) # does R2 thinks we should keep time?
# kfold(b1SN, b2, K=10) # use CV to check for over-fitting
# this takes too long, so reduce n(folds) or only perform for final model
Fixed effect of stimulus
# b3 <- brm(bias_t ~ 1 + stim + (1 |ID), data=data, family=skew_normal(), control=list(adapt_delta = .95))
# saveRDS(b3, "mlm_ada1_b3.rds")
summary(b3)
Family: skew_normal
Links: mu = identity; sigma = identity; alpha = identity
Formula: bias_t ~ 1 + stim + (1 | ID)
Data: data (Number of observations: 9289)
Samples: 4 chains, each with iter = 2000; warmup = 1000; thin = 1;
total post-warmup samples = 4000
Group-Level Effects:
~ID (Number of levels: 35)
Estimate Est.Error l-95% CI u-95% CI Eff.Sample Rhat
sd(Intercept) 0.13 0.02 0.10 0.17 981 1.00
Population-Level Effects:
Estimate Est.Error l-95% CI u-95% CI Eff.Sample Rhat
Intercept -0.01 0.03 -0.07 0.05 1156 1.00
stimdeath -0.05 0.03 -0.12 0.01 2955 1.00
stimliving 0.04 0.03 -0.02 0.11 2423 1.00
stimspouse 0.12 0.03 0.05 0.18 2704 1.00
stimstranger -0.06 0.03 -0.12 0.01 2560 1.00
Family Specific Parameters:
Estimate Est.Error l-95% CI u-95% CI Eff.Sample Rhat
sigma 0.99 0.01 0.98 1.00 4000 1.00
alpha -0.04 0.35 -0.65 0.58 3285 1.00
Samples were drawn using sampling(NUTS). For each parameter, Eff.Sample
is a crude measure of effective sample size, and Rhat is the potential
scale reduction factor on split chains (at convergence, Rhat = 1).
Similar to the results of the NHST analysis, there is a notable fixed effect of spouse (estimate = 0.12, est. error = 0.03, 95% CI[0.05,0.18]) on bias, such that people show more approach bias on trials where the stimulus is their spouse.
plot(b3)



marginal_effects(b3)

The high density intervals illustrate the positive effect of spouse on bias, which we also see in the marginal effects plot. In the analysis above, the 95% CI for death and stranger stimuli barely crossed zero, which corresponds with what we see in the marginal effects plot, where the upper limit of the error bar is very close to zero (or below). This also corresponds with the effects we saw in NHST, where there were significant or marginally significant effects of death and stranger in addition to spouse.
pp_check(b3, nsamples=20)

pp_check(b3, type="error_hist", nsamples=20, binwidth=1) # looks like we have improved the residuals; if we rerun this random sampling at least some of the time we no longer see the lower end outliers

# equi_test(b3) # TRY THIS CODE WHEN I GET SJTOOLS TO INSTALL (what does ROPE tells us about the size of effect?)
# waic(b2, b3) # TRY THIS CODE WHEN I GET SJTOOLS TO INSTALL
b2 <- add_ic(b2, ic="R2")
b3 <- add_ic(b3, ic="R2")
round(mean(b2$R2), digits=3)
[1] 0.017
round(mean(b3$R2), digits=3)
[1] 0.02
In both models, we have explained about 2 percent of the variance (1.7% vs. 2.0%).
Adding the stimulus*condition interaction (no time)
One of the original study hypotheses was that there would be an interaction (actually, a 3-way interaction with group, but not going to go there for ADA #1; also we may be underpowered for that) between stimulus and condition, such that people might display different response biases for different stimulus categories in condition A vs. B, or vice versa.
# b4 <- brm(bias_t ~ 1 + stim*cond + (1|ID), data=data, family=skew_normal(), control=list(adapt_delta = .95))
# saveRDS(b4, "mlm_ada1_b4.rds")
summary(b4)
Family: skew_normal
Links: mu = identity; sigma = identity; alpha = identity
Formula: bias_t ~ 1 + stim * cond + (1 | ID)
Data: data (Number of observations: 9289)
Samples: 4 chains, each with iter = 2000; warmup = 1000; thin = 1;
total post-warmup samples = 4000
Group-Level Effects:
~ID (Number of levels: 35)
Estimate Est.Error l-95% CI u-95% CI Eff.Sample Rhat
sd(Intercept) 0.13 0.02 0.10 0.17 1088 1.00
Population-Level Effects:
Estimate Est.Error l-95% CI u-95% CI Eff.Sample Rhat
Intercept 0.02 0.04 -0.06 0.10 1357 1.00
stimdeath -0.11 0.05 -0.20 -0.02 1815 1.00
stimliving 0.04 0.05 -0.05 0.13 1791 1.00
stimspouse 0.06 0.05 -0.03 0.16 2030 1.00
stimstranger -0.09 0.05 -0.18 -0.00 2068 1.00
condB -0.05 0.05 -0.15 0.04 1546 1.00
stimdeath:condB 0.11 0.07 -0.02 0.24 1760 1.00
stimliving:condB 0.01 0.06 -0.12 0.13 1959 1.00
stimspouse:condB 0.11 0.07 -0.02 0.24 2012 1.00
stimstranger:condB 0.07 0.06 -0.06 0.20 2009 1.00
Family Specific Parameters:
Estimate Est.Error l-95% CI u-95% CI Eff.Sample Rhat
sigma 0.99 0.01 0.98 1.00 4000 1.00
alpha -0.05 0.35 -0.64 0.58 3415 1.00
Samples were drawn using sampling(NUTS). For each parameter, Eff.Sample
is a crude measure of effective sample size, and Rhat is the potential
scale reduction factor on split chains (at convergence, Rhat = 1).
# b4 <- readRDS("mlm_ada1_b4.rds")
Got some warnings while running:
Rejecting initial value: Gradient evaluated at the initial value is not finite. Stan can't start sampling from this initial value. Rejecting initial value: Log probability evaluates to log(0), i.e. negative infinity. Stan can't start sampling from this initial value.
Apparently setting priors may help with this.
Here, we see notable effects of death and stranger stimuli, but not condition nor its interaction with stimulus type.
plot(b4)




marginal_effects(b4)



pp_check(b4, nsamples=20)

pp_check(b4, type="error_hist", nsamples=20, binwidth=1) # residuals seem about the same

# equi_test(b4)
Nothing terribly new here, except for the marginal effects. Looking at the plot for stim, the distributions for spouse and stranger and spouse and death don’t seem to overlap, and living loved one and death just barely overlap. Unsurprisingly, people tend to show more approach bias for spouse and living loved one photos compared to death and stranger photos. The third marginal effects plot illustrates the lack of condition or stimulus*condition interaction effects.
The variance accounted for shows a negligable increase with the stimulus*condition interaction in the model (goes from 0.020 to 0.022). The negligable effect is shown in the plot - note no real differences
Using priors
# First find out what default priors are.
get_prior(bias_t ~ 1 + stim*cond + (1|ID), data=data, family=skew_normal())
brms has set some weakly informative default priors on some of the special parameters. The default for the random coefficients is a half student-t with 3 degrees of freedom.
Let’s assume some weak prior knowledge from previous research. It would actually be somewhat difficult to get solid priors from previous research on this task, given that no previous work has used this specific variant in this population with this specific manipulation (i.e., oxytocin vs. placebo) so I’m making these numbers up.
Based on the error message here (The following priors do not correspond to any model parameter…), I clearly have no idea what I’m doing and need to go back to that lesson (get_prior was not helpful to me, but thanks anyway, brms).
---
title: "MLM ADA #1 v2"
author: "Saren Seeley"
date: "10-22-2018"
output:
  html_notebook:
    number_sections: no
    theme: paper
    toc: yes
    toc_float: yes
  html_document:
    df_print: paged
    toc: yes
---
## About
### Overview
This ADA uses data from the project “The effect of intranasal oxytocin on neural functioning in widow(er)s” (#1503744420, PI: Mary-Frances O'Connor, Co-PI: Brian Arizmendi). It was a double-blind randomized crossover study, designed to address conflicting accounts of motivated behavior in complicated grief using a novel approach-avoidance task variant (grief AAT; gAAT) that allows for comparison between participant-provided photos and nomothetic grief stimuli. Participants were 40 bereaved adults ages 55-80, who experienced the death of a spouse or long-term romantic partner between six months and three years prior to their participation. 

### Task
Participants attended two identical experimental sessions, at which they received one of two intranasal sprays (treatment A/B) and then took part in an fMRI scan, which involved a structural sequence, the gAAT, and a resting state sequence.

For the gAAT, five stimulus categories were presented: (1) deceased spouse, (2) living loved one [another attachment figure], (3) stranger, (4) generic grief-related scenes, and (5) neutral scenes. Photos provided by the participant were scanned and resized to a standard 750-pixel height and width, and framed in colored frames to match the stimuli used for non-idiographic categories (i.e., stranger, grief-related, neutral). Photos of the stranger were sex-matched to the participant's partner. 

During the task, participants were instructed to follow rules based on an irrelevant characteristic of the stimuli, namely, the color of the frame around the photo (e.g., _"push the joystick when the frame is blue, pull when the frame is yellow"_). The task was completed in two runs of approximately seven minutes each. In the second run, the instructions were reversed (e.g. _"now pull the joystick for blue framed photos"_). Order of instructions was counterbalanced across participants. Stimuli were presented via Inquisit 4 (2014), in a pseudorandomized order determined by genetic algorithm to optimize statistical power and psychological validity (Wager & Nichols, 2003). Each trial last 3000ms, with 2500 ms allowed for response. Intertrial interval was 500ms.

Participants completed the task using an MRI-compatible joystick. The stimuli were animated so that a joystick push would make the image smaller (as if they were pushing it away from them) and a joystick pull would make the image larger (as if they were bringing it nearer to them).

### Variables
The primary dependent variable of interest for the present analysis is **reaction time bias**, computed as RT<sub>push</sub>-RT<sub>pull</sub>. Negative bias scores (faster to push than pull) indicate relative avoidance bias, while positive scores (faster to pull than to push) indicate relative approach bias. Because the two runs were identical in every way except for the instruction provided, each trial had both a push RT and a pull RT (e.g., run1 trial1 was push, run2 trial1 was pull, so subtract RT on run2 trial 1 from run1 trial1 to get the difference - this is further elaborated in the data cleaning script below), unless they failed to reverse the instructions.

Main predictors of interest included **trial number** (in order to establish whether peoples' response bias changed over the 7-minute run), **stimulus** (neutral, spouse, stranger, living loved one, death), and **condition** (treatment A/B - oxytocin or placebo [I'm still blinded]). 

Participants also completed a number of self-report measures (state and trait) and provided biological samples for OXTR genotyping and cortisol assay, some of which were used as predictors.

## Equations
#### Level 1:
We first performed MLM to estimate the effect of trial number (i.e., the random effect of time) on response bias (push RT-pull RT) on a given trial. The level-1 model estimated, for each participant _j_ (_j_ = 1–35), a regression line that predicted each participant’s response bias in milliseconds on each trial _i_ from the trial number. This model was represented as follows: 
<blockquote><center><font size=4>
[bias<i><sub>i</sub></i>]<sub>j</sub> = <i>b</i><sub>0<i><sub>j</sub></i></sub> + <i>b</i><sub>1<i><sub>j</sub></i></sub> [trialnumber<i><sub>i</sub></i>] + <i>r<sub>ij</sub></i>
</font></center>
</blockquote>

where <i>b</i><sub>0<i><sub>j</sub></i></sub>, the intercept, is interpreted as participant _j_’s mean response bias for a given trial; <i>b</i><sub>1<i><sub>j</sub></i></sub>, the slope, represents the effect of trial number on response bias for each participant _j_ (positive coefficients represent the predictive magnitude of trial number on response bias for a given participant); and <i>r<sub>ij</sub></i> is the residual error term.

#### Level 2:
The level-2 models estimated the average effects for the entire sample. The models were represented as follows: 

<blockquote><center><font size=4>
<i>b</i><sub>0</sub><sub>j</sub> = γ<sub>00</sub> + μ<sub>0<sub>j</sub></sub>
<p>
<i>b</i><sub>1<sub><i>j</i></sub></sub> = γ<sub>10</sub> +  μ<sub>1<sub>j</sub></sub>
</font></center></blockquote>

where the intercept, γ<sub>00</sub>, is interpreted as the average response bias for the entire sample; γ<sub>10</sub> is the average effect of represents the effect of trial number on response bias for each participant _j_ (positive coefficients represent the predictive magnitude of trial number on response bias for the sample as a whole), and thus, the primary estimate of interest; and μ<sub>0<sub>j</sub></sub> and μ<sub>1<sub>j</sub></sub> are the residual error terms. 

Adding the fixed effect of stimulus type resulted in the following model:

<blockquote><center><font size=4>
<i>b</i><sub>0</sub><sub>j</sub> = γ<sub>00</sub> + γ<sub>01</sub>(stimulus<sub>i</sub>) + μ<sub>0<sub>j</sub></sub>
<p>
<i>b</i><sub>1<sub><i>j</i></sub></sub> = γ<sub>10</sub> + γ<sub>11</sub>(stimulus<sub>i</sub>) + μ<sub>1<sub>j</sub></sub></font></center></blockquote>


## Data cleaning
<i>Note: Went back to the raw data due to some inconsistencies in the `OT_ADAA_Omnibus.xlsx` file from BA that I found.</i>

<b>Pre-R steps</b>:

1. Removed all .iqdat files under 25KB or so in order to filter out pilot data and false starts.<br>
2. Removed non-study data (i.e., tests and OT undergrad files). <br>
3. Organized raw .iqdat files for each visit by run 1/run 2 (two separate folders) based on date/time stamp. <p>
<i>In Terminal, within each of the folders in turn...</i><p>
4. Changed file extensions to .tsv (tab-separated) from .iqdat: `find . -iname "*.iqdat" -exec bash -c 'mv "$0" "${0%\.iqdat}.tsv"' {} \;`<br>
5. Merged the files: `cat *.tsv > OT_gAAT_run-1.tsv` (or `OT_gAAT_run-2.tsv`)
<strike>5. Merged the files: `mlr --tsvlite cat *.tsv > OT_gAAT_run-1.tsv` (or `OT_gAAT_run-2.tsv`)
<ul>5a. `mlr --tsvlite` drops the header from all but the first file, so column names are not repeated throughout the dataset.</ul>
<ul>5b. This requires Miller, which can be installed via Homebrew on OSX (http://johnkerl.org/miller/doc/build.html)</strike> <i>tried to make this work but no luck</i></ul>

<i>Notes:<br>
<ul><li>D117: Had 3 visits due to scanner technical issues. Use "_b" and "_c" as visit 1 and visit 2 respectively.</li>
<li>D114: Has multiple files from visit 2. Use the last two.</li>
<li>D135: Use files from 6/06/16 and 6/17/18 (scanner issues on 06/13/16). Use "_b" and "_c" as visit 1 and visit 2 respectively.</li>
<li>D147: Visit 1 run 2 is partial data for some reason (I do have notes that she switched back to <i>PUSH-YELLOW</i> during run 2, but did not re-run because running way over time.)</li></ul></i>

## Creating the MLM dataset
### Import data
```{r}
library(tidyverse)
filter <- dplyr::filter
select <- dplyr::select

# read in the data (only importing a subset of columns)
raw_r1 <- data.frame(read_tsv("~/Desktop/OT-gAAT_MLM-data/Run1_tsv/OT_gAAT_run-1.tsv", col_types=cols_only(
  date = col_integer(),
  time = col_character(),
  subject = col_character(),
  blockcode = col_character(),
  blocknum = col_character(),
  trialcode = col_character(),
  values.trialcode = col_character(),
  values.stimulus = col_character(),
  values.initialresponse = col_character(),
  values.RT = col_number()
)))

raw_r2 <- data.frame(read_tsv("~/Desktop/OT-gAAT_MLM-data/Run2_tsv/OT_gAAT_run-2.tsv", col_types=cols_only(
  date = col_integer(),
  time = col_character(),
  subject = col_character(),
  blockcode = col_character(),
  blocknum = col_character(),
  trialcode = col_character(),
  values.trialcode = col_character(),
  values.stimulus = col_character(),
  values.initialresponse = col_character(),
  values.RT = col_number()
)))

# header line repeats when I used `cat` to merge, so get those out of there
raw_r1 <- raw_r1 %>% filter(!is.na(date))
raw_r2 <- raw_r2 %>% filter(!is.na(date))

unique(raw_r1$subject)
# D118 is showing up as "test" for some reason (checked against the .iqdat file), and D130_b is missing
# both of D130's and D142's visits were entered as "D130"/"D142", so change the subject ID for the visit on 052416 to D130_b and on 
raw_r1$subject[raw_r1$date == 020916] <- "D118"
raw_r1$subject[raw_r1$date == 052416] <- "D130_b"
raw_r1$subject[raw_r1$date == 111516] <- "D142_b"

# fixing some other stuff in a similar vein
raw_r1$subject[raw_r1$subject == "D101_2"] <- "D101_b"
raw_r1$subject[raw_r1$subject == "D102_B"] <- "D102_b"
raw_r1$subject[raw_r1$subject == "D117_b"] <- "D117"
raw_r1$subject[raw_r1$subject == "D117_c"] <- "D117_b"
raw_r1$subject[raw_r1$subject == "D135_c"] <- "D135_b"

unique(raw_r1$subject) # 78 unique - that is correct

unique(raw_r2$subject)
# some similar issues to fix
raw_r2$subject[raw_r2$subject == "D101_2Y"] <- "D101_b"
raw_r2$subject[raw_r2$subject == "D102_B"] <- "D102_b"
raw_r2$subject[raw_r2$subject == "D107_B_2"] <- "D107_b"
raw_r2$subject[raw_r2$subject == "D117_b"] <- "D117"
raw_r2$subject[raw_r2$subject == "D117_c"] <- "D117_b"
raw_r2$subject[raw_r2$subject == "D126_B"] <- "D126_b"
raw_r2$subject[raw_r2$subject == "D135_c"] <- "D135_b"
raw_r2$subject[raw_r2$date == 111516] <- "D142_b"


unique(raw_r2$subject) # 78 unique - that is correct

str(raw_r1)
str(raw_r2)
```

```{r}
# take out ITIs (inter-trial intervals)
unique(raw_r1$trialcode)
mlm_r1 <- filter(raw_r1, grepl("^A",trialcode))
unique(mlm_r1$trialcode)

unique(raw_r2$trialcode)
mlm_r2 <- filter(raw_r2, grepl("^A",trialcode))
unique(mlm_r2$trialcode)

# add a "trialnum" variable, grouped by ID
mlm_r1 <- mlm_r1 %>% group_by(subject) %>% mutate(trialnum = row_number())
View(mlm_r2)
mlm_r2 <- mlm_r2 %>% group_by(subject) %>% mutate(trialnum = row_number())
View(mlm_r2)

mlm_r1 %>% group_by(trialnum) %>% count(trialnum) # everyone has 144 trials, hooray!
mlm_r2 %>% group_by(trialnum) %>% count(trialnum) # everyone has 144 trials, one person has 177
mlm_r2 %>% group_by(subject) %>% count() # that person is D122_b
```
I do recall one scan session where Inquisit had some kind of hiccup and the task kept going beyond the end of the fMRI sequence so we had to force-quit it. D122 has complete data from the first 144 trials, so we can just drop the extra ones.
```{r}
mlm_r2 <- filter(mlm_r2, trialnum <=144) # remove trials 145-177 for D122_b
mlm_r2 %>% group_by(trialnum) %>% count(trialnum)
# now everyone has 144 trials

# add a column for "run"
mlm_r1$run <- "1"
mlm_r2$run <- "2"

# add a column for "visit"
mlm_r1$visit <- as.factor(ifelse(grepl("*_b", mlm_r1$subject), "2", "1"))
mlm_r2$visit <- as.factor(ifelse(grepl("*_b", mlm_r2$subject), "2", "1"))

mlm_r1 %>% group_by(subject) %>% count(visit) # all looks good
mlm_r2 %>% group_by(subject) %>% count(visit) # all looks good

# now remove the "_b" from visit 2 IDs
mlm_r1 <- mlm_r1 %>% ungroup() %>%
  mutate(subject = str_replace(subject, "_b", ""))
unique(mlm_r1$subject)

mlm_r2 <- mlm_r2 %>% ungroup() %>%
  mutate(subject = str_replace(subject, "_b", ""))
unique(mlm_r2$subject)

# merge data from the 2 runs 
mlm_bx <- bind_rows(mlm_r1, mlm_r2)

# split up by visit
mlm_v1 <- mlm_bx %>% filter(visit == "1")
mlm_v2 <- mlm_bx %>% filter(visit == "2")

mlm_v1 %>% group_by(trialnum) %>% count(trialnum) # everyone has 144 trials, hooray!
mlm_v2 %>% group_by(trialnum) %>% count(trialnum) # everyone has 144 trials, hooray!
```

```{r}
# add columns for "cond" at visit 1 and visit 2 [condition: A or B]
# load in the randomization data
randomize <- readRDS("~/Dropbox/GLASS Lab/OT Study/data/cleaned-data/randomization_clean.rds")
head(randomize)

mlm_v1 <- mlm_v1 %>%
  rename("ID" = subject)

mlm_v2 <- mlm_v2 %>%
  rename("ID" = subject)

# get the IDs for everyone who got treatment A at visit 1 
IDs_v1_txA <- randomize %>% filter(tx_v1 == "A")
View(IDs_v1_txA)

# ifelse statement: if ID = any of those listed in txA_list, make cond_v1 = "A", else make it "B"
mlm_v1 <- mlm_v1 %>%
  mutate(cond_v1 = as.factor(ifelse(ID == "D101" | ID == "D105" | ID == "D109" | ID == "D110" | ID == "D114" | ID == "D115" | ID == "D116" | ID == "D118" | ID == "D119" | ID == "D120" | ID == "D123" | ID == "D124" | ID == "D126" | ID == "D128" | ID == "D132" | ID == "D135" | ID == "D136" | ID == "D137" | ID == "D138" | ID == "D140" | ID == "D141" | ID == "D145" | ID == "D148" | ID == "D149" | ID == "D117", "A", "B")), cond_v2 = as.factor(ifelse(ID == "D101" | ID == "D105" | ID == "D109" | ID == "D110" | ID == "D114" | ID == "D115" | ID == "D116" | ID == "D118" | ID == "D119" | ID == "D120" | ID == "D123" | ID == "D124" | ID == "D126" | ID == "D128" | ID == "D132" | ID == "D135" | ID == "D136" | ID == "D137" | ID == "D138" | ID == "D140" | ID == "D141" | ID == "D145" | ID == "D148" | ID == "D149" | ID == "D117", "B", "A")))
head(mlm_v1)

# ifelse statement: if ID = any of those listed in txA_list, make cond_v2 = "B", else make it "A" (people who were A at visit 1 should be B for visit 2, and vice versa)
mlm_v2 <- mlm_v2 %>%
  mutate(cond_v2 = as.factor(ifelse(ID == "D101" | ID == "D105" | ID == "D109" | ID == "D110" | ID == "D114" | ID == "D115" | ID == "D116" | ID == "D118" | ID == "D119" | ID == "D120" | ID == "D123" | ID == "D124" | ID == "D126" | ID == "D128" | ID == "D132" | ID == "D135" | ID == "D136" | ID == "D137" | ID == "D138" | ID == "D140" | ID == "D141" | ID == "D145" | ID == "D148" | ID == "D149" | ID == "D117", "B", "A")), cond_v1 = as.factor(ifelse(ID == "D101" | ID == "D105" | ID == "D109" | ID == "D110" | ID == "D114" | ID == "D115" | ID == "D116" | ID == "D118" | ID == "D119" | ID == "D120" | ID == "D123" | ID == "D124" | ID == "D126" | ID == "D128" | ID == "D132" | ID == "D135" | ID == "D136" | ID == "D137" | ID == "D138" | ID == "D140" | ID == "D141" | ID == "D145" | ID == "D148" | ID == "D149" | ID == "D117", "A", "B")))

levels(mlm_v1$cond_v1)
levels(mlm_v2$cond_v2)

head(mlm_v1)
  
IDs_v1_txA$ID
mlm_v1 %>% group_by(cond_v1) %>% count(ID) # IDs and cond match txA_list
mlm_v2 %>% group_by(cond_v2) %>% count(ID) # IDs and cond match txA_list

mlm_bx <- bind_rows(mlm_v1,mlm_v2)
View(mlm_bx)

# save it
saveRDS(mlm_bx, "~/Dropbox/2018Fall/FSHD617C/mlm_bx.rds")
```

```{r}
# rename some variables
colnames(mlm_bx)
mlm_bx <- mlm_bx %>% rename("push_pull" = values.initialresponse,
                  "gAAT_RT" = values.RT)
head(mlm_bx)

# add a new variable for condition
mlm_bx1 <- mlm_bx %>% mutate(cond = as.factor(ifelse(cond_v1 == "A" & visit == "1", "A",
                              ifelse(cond_v2 == "A" & visit == "2", "A",
                                    ifelse(cond_v1 == "B" & visit == "1", "B",
                                           ifelse(cond_v2 == "B" & visit == "2", "B", NA))))))

View(mlm_bx1) # check that it coded the "cond" column correctly      

# add a new variable for stimulus category
# in values.stimulus:
# 1 = spouse, 2 = living loved one/WHOTO, 3 = stranger, 4 = nomothetic death-related, 5 = neutral images

sort(unique(mlm_bx1$values.stimulus))

mlm_bx2 <- mlm_bx1 %>%
  mutate(stim = as.factor(ifelse(values.stimulus == "1B_1.jpg"|values.stimulus == "1B_2.jpg"|values.stimulus == "1B_3.jpg"|values.stimulus == "1Y_1.jpg"|values.stimulus == "1Y_2.jpg"|values.stimulus == "1Y_3.jpg", "spouse", 
                       ifelse(values.stimulus == "2B_1.jpg"|values.stimulus == "2B_2.jpg"|values.stimulus == "2B_3.jpg"|values.stimulus == "2Y_1.jpg"|values.stimulus == "2Y_2.jpg"|values.stimulus == "2Y_3.jpg", "living",
                              ifelse(values.stimulus == "3B_1.jpg"|values.stimulus == "3B_2.jpg"|values.stimulus == "3B_3.jpg"|values.stimulus == "3Y_1.jpg"|values.stimulus == "3Y_2.jpg"|values.stimulus == "3Y_3.jpg", "stranger",
                                     ifelse(values.stimulus == "4B_1.jpg"|values.stimulus == "4B_2.jpg"|values.stimulus == "4B_3.jpg"|values.stimulus == "4Y_1.jpg"|values.stimulus == "4Y_2.jpg"|values.stimulus == "4Y_3.jpg", "death",
                                            ifelse(values.stimulus == "5B_1.jpg"|values.stimulus == "5B_2.jpg"|values.stimulus == "5B_3.jpg"|values.stimulus == "5Y_1.jpg"|values.stimulus == "5Y_2.jpg"|values.stimulus == "5Y_3.jpg", "neutral", NA)))))))

levels(mlm_bx2$stim)
mlm_bx2$stim <- relevel(mlm_bx2$stim, ref = "neutral") # make neutral the reference level
                       
# save it
saveRDS(mlm_bx2, "~/Dropbox/2018Fall/FSHD617C/mlm_bx2.rds")
```

### Bias scores
Separate out push minus pull, then subtract run1 from run2 to generate a score quantifying relative approach or avoidance bias.
The top and bottom 1% of reaction times (RTs) will be removed as part of conventional data cleaning for this task.
```{r}

mlm_r1.1 <- mlm_bx2 %>% filter(run == "1")
tail(mlm_r1.1) # does it line up?
mlm_r2.1 <- mlm_bx2 %>% filter(run == "2")
tail(mlm_r2.1) 

p99r1 <- quantile(mlm_r1.1$gAAT_RT, 0.99) 
p01r1 <- quantile(mlm_r1.1$gAAT_RT, 0.01) 
p99r1 # 99th percentile RTs >= 1713.48ms 
p01r1 # 1st percentile RTs <= 472ms

p99r2 <- quantile(mlm_r2.1$gAAT_RT, 0.99) 
p01r2 <- quantile(mlm_r2.1$gAAT_RT, 0.01) # 1st percentile RTs <= 463ms
p99r2 # 99th percentile RTs >= 1712.45ms 
p01r2 # 1st percentile RTs <= 463ms

mlm_r1.2 <- mlm_r1.1 %>%
  mutate(outlier_RT = ifelse(gAAT_RT <= p01r1 | gAAT_RT >= p99r1, 1, 0))
table(mlm_r1.2$outlier_RT) 
227/11005 # 227 trials will be dropped when we remove the top and bottom 1%, and 11005 will be retained.

mlm_r2.2 <- mlm_r2.1 %>%
  mutate(outlier_RT = ifelse(gAAT_RT <= p01r2 | gAAT_RT >= p99r2, 1, 0))
table(mlm_r2.2$outlier_RT) 
230/11002 # 230 trials will be dropped when we remove the top and bottom 1%, and 11002 will be retained.

mlm_r1r2 <- bind_cols(mlm_r1.2, mlm_r2.2) # 11232 trials total.
head(mlm_r1r2) # make sure that trials in the two runs line up post-merge
tail(mlm_r1r2) # make sure that trials in the two runs line up post-merge

# get rid of any trials with outliers in either run1 or run2 
# (need run1/run2 to be the same length so that the trials align between the runs)
mlm_r1r2.1 <- mlm_r1r2 %>% filter(outlier_RT == "0" & outlier_RT1 == "0") 
count(mlm_r1r2.1) # 10790 obs.
10790/11232 # 96% of trials retained = 4% of trials dropped = <5% trials dropped (in effect, <5% "missing data").

# calculate the bias metric
push_pull <- mlm_r1r2.1 %>% filter(push_pull == "PUSH" & push_pull1 == "PULL") %>% mutate(bias = (gAAT_RT-gAAT_RT1))
pull_push <- mlm_r1r2.1 %>% filter(push_pull == "PULL" & push_pull1 == "PUSH") %>% mutate(bias = (gAAT_RT1-gAAT_RT))

# merge push_pull and pull_push together
mlm_pp <- bind_rows(push_pull,pull_push)

View(mlm_pp)
```

### Transformations
```{r}
library(psych)
describe(mlm_pp$bias) # skewness is 0.1, kurtosis is 1.69
hist(mlm_pp$bias)
qqnorm(mlm_pp$bias)
ggplot(mlm_pp, aes(sample=bias))+stat_qq()+stat_qq_line() # not a good look at all...

# install.packages("bestNormalize")
# bestNormalize identifies the transformation that normalizes your data the best
library(bestNormalize)

orderNorm_bias <- bestNormalize(mlm_pp$bias) # the "orderNorm" transform gets the data the closest to normally distributed, compared to other methods
mlm_pp$bias_t <- orderNorm_bias$x.t # pull x.t from orderNorm_bias and assign it back into dataset
# x.t contains the transformed values of x, using the transform identified by bestNormalize as being superior

describe(mlm_pp$bias_t)
hist(mlm_pp$bias_t)
qqnorm(mlm_pp$bias_t)
ggplot(mlm_pp, aes(sample=bias_t))+stat_qq()+stat_qq_line() # so much better!

# back-transformation, just to show how to do it:
p <- predict(orderNorm_bias)
    x2 <- predict(orderNorm_bias, newdata = p, inverse = TRUE) # x2 is the back-transformed data
    all.equal(x2, mlm_pp$bias_t) 

mlm_bx3 <- mlm_pp
saveRDS(mlm_bx3, "~/Dropbox/2018Fall/FSHD617C/mlm_bx3.rds")
```
Because `bias` is quite non-normal, looking at the plots above, analyses will use the transformed bias score (`bias_t`) instead.

### Checking the data
```{r}
table <- mlm_bx3 %>% group_by(ID) %>% group_by(cond) %>% count(ID) 
table
# missing data...
# cond A low trial counts: D101 (3 trials), D141 (18 trials), D148 (19 trials)
# cond B low trial counts:  D147 (1 trial)

describe(table$n)
ggplot(table, aes(sample=n))+stat_qq()+stat_qq_line() # can see those 4 on the QQ plot

# where are the trials getting dropped?
mlm_r1r2 %>% group_by(ID) %>% group_by(cond) %>% count(ID) # everyone has 144 here
mlm_r1r2.1 %>% group_by(ID) %>% group_by(cond) %>% count(ID) 
# after getting rid of outliers, most people missing a few trials (D107 cond A/B and D113 cond A missing >20)

mlm_r1r2.1 %>% filter(ID=="D101" & cond=="A")
mlm_r1r2.1 %>% filter(ID=="D141" & cond=="A")
mlm_r1r2.1 %>% filter(ID=="D148" & cond=="A")
mlm_r1r2.1 %>% filter(ID=="D147" & cond=="B")
# so trials are getting dropped because they didn't effectively reverse the instructions or they reverted to the old instructions
# in essence, this removes error trials

# create a new variable to indicate the 4 participants with low trial counts (i.e., people who have a bias score on <14% of trials)
mlm_bx3 <- mlm_bx3 %>% mutate(low_n_trials = ifelse(ID == "D101" | ID == "D141" | ID == "D147" | ID == "D148", 1, 0))

# check whether the rows line up by looking for rows where var and var1 don't match
notlinedup <- mlm_bx3 %>% filter(stim != stim1)
notlinedup # 0 rows returned; all the rows match
notlinedup <- mlm_bx3 %>% filter(trialcode != trialcode1)
notlinedup # 0 rows returned; all the rows match
notlinedup <- mlm_bx3 %>% filter(trialnum != trialnum1)
notlinedup # 0 rows returned; all the rows match

head(mlm_bx3)
saveRDS(mlm_bx3, "~/Dropbox/2018Fall/FSHD617C/mlm_bx3.rds")
```

### Add time-invariant variables
Next step is to add demographics and other self-reported and genetic data variables that we might potentially want to include in the model but that do not change over time. Looking at the codebook for the master dataset, these include:

Demographics:

* `age`
* `age_yrs`
* `sex_m`
* `ethnicity_hisp`
* `race`
* `timesincedeath`

Genetic variables:

* `rs2254298`
* `rs2268498`
* `rs53576`
* `rs2254298_A.carrier`
* `rs2268498_C.carrier`
* `rs53576_A.carrier`

Self-report measures:

* `tot_icg` [grief]
* `tot_bdi` [depression]
* `tot_bgq` [brief grief questionnaire with impairment item]
* `tot_ecrrs_global_anx` [anxious attachment - global]
* `tot_ecrrs_global_avoid` [avoidant attachment - global]
* `tot_ecrrs_spouse_anx` [anxious attachment - spouse]
* `tot_ecrrs_spouse_avoid` [avoidant attachment - spouse]
* `tot_ysl` [yearning for the deceased]


```{r}
# load in master dataset
master <- readRDS("~/Dropbox/GLASS Lab/OT Study/data/cleaned-data/selfreports_oxtr_cortisol_random_AB_bx_clean.rds") 

# subset the selected variables
mlm_add <- subset(master, select=c(ID,age, age_yrs, sex_m, ethnicity_hisp, race, timesincedeath, rs2254298, rs2268498, rs53576, rs2254298_A.carrier, rs2268498_C.carrier, rs53576_A.carrier, tot_bdi, tot_bgq, tot_ecrrs_global_anx,tot_ecrrs_global_avoid, tot_ecrrs_spouse_anx, tot_ecrrs_spouse_avoid, tot_ysl, tot_icg))
head(mlm_add)

# merge into the behavioral dataset
mlm_bx4 <- left_join(mlm_bx3, mlm_add, by="ID")
head(mlm_bx4)
library(psych)

# save it
saveRDS(mlm_bx4, "~/Dropbox/2018Fall/FSHD617C/mlm_bx4.rds")
```


## Exploratory analysis and checking assumptions

First step is to overview the entire dataset using `summarytools`. 
```{r}
library(summarytools)

# overview the entire dataset
dfSummary(mlm_bx4) # this will look like TOTAL crap in the console (notebook output)
# use view(dfSummary(mlm_bx4)) from the console to see a much more readable and attractive output
```
None of the variables have any missing data, apart from `rs2254298` and `rs2254298_A.carrier`, and I'm aware from the data cleaning that I did for the master dataset that one participant's genotype for that SNP was unable to be identified.

### Descriptives
```{r}
library(psych)
library(stargazer)
library(summarytools)

# descriptives for the time-invariant variables are going to be wrong if we use mlm_bx4 (since observations are duplicated)
# so going back to the mlm_add object...
# note that n = 40, but n = 39 with RT data
# D149's behavioral data was dropped (ran outside of scanner), so drop his case

mlm_add <- filter(mlm_add, !grepl("D149",ID))
unique(mlm_add$ID) # it worked
nums <- select_if(mlm_add, is.numeric) # just get the numeric variables
view(dfSummary(nums)) 
stargazer(nums, type="text", title="Descriptives", digits=2, out="table1.txt", notes = "Note: 'stargazer' makes nicer looking tables, but doesn't provide skewness or kurtosis statistics, so we'll use 'psych' for that.") 

describe(nums)
```

### Frequencies
```{r}
# which variables are factors?
mlm_add %>% select_if(is.factor) %>% colnames()

# hmm, rs2268498 and rs53576 should also be factors, so fix that now
mlm_add <- mlm_add %>% mutate(rs2268498 = recode_factor(rs2268498, "C/T" = "C/T", "CC" = "CC", "TT" = "TT"), rs53576 = recode_factor(rs53576, "A/G" = "A/G", "AA" = "AA", "GG" = "GG"))
mlm_add %>% select_if(is.factor) %>% colnames()

# function to count frequencies for variables that are factors...this isn't working
# countAll <- function(basedata)
# {
#	facs <- sapply(basedata, is.factor)
#	facdata <- basedata[ ,facs]
#	for(i in 1:length(facdata))
#	{
#		count(facdata)
#	 }
# }
# countAll(mlm_bx4)

mlm_add %>% count(sex_m) # 28 females, 11 males
mlm_add %>% count(race) # 38 white, 1 another race not listed
mlm_add %>% count(ethnicity_hisp) # 37 non-Hispanic/Latino, 2 Hispanic/Latino
mlm_add %>% count(rs2254298) # 22 GG, 10 A/G, 1 NA/missing (0 AA)
mlm_add %>% count(rs2268498) # 17 C/T, 10 CC, 12 TT
mlm_add %>% count(rs53576) # 21 A/G, 18 GG (0 AA)
mlm_add %>% count(rs2254298_A.carrier) # yes = 10, no = 28, NA = 1
mlm_add %>% count(rs2268498_C.carrier) # yes = 27, no = 12
mlm_add %>% count(rs53576_A.carrier) # yes = 21, no = 18
```

### Checking distribution shapes and normality
Got a preview of this above when we used `summarytools`, but here's a closer look at some histograms and QQ plots.
```{r}
# a function to produce histograms for all numeric variables in a dataframe
histAll <- function(basedata)
{
	nums <- sapply((basedata), is.numeric)
	numdata <- basedata[ ,nums]
	par(mfrow=c(2,2))
	for(i in 1:length(numdata))
	{
		hist(numdata[,i], xlab=NULL, main=names(numdata[i]))
	}
}

histAll(as.data.frame(mlm_bx4))

qqAll <- function(basedata)
{
	nums <- sapply((basedata), is.numeric)
	numdata <- basedata[ ,nums]
	par(mfrow=c(2,2))
	for(i in 1:length(numdata))
	{
		qqnorm(numdata[,i], xlab=NULL, main=names(numdata[i]))
	}
}

qqAll(as.data.frame(mlm_bx4))

```
The most problematic of these appear to be anxious/avoidant attachment style for the spouse (`tot_ecrrs_spouse_anx` and `tot_ecrrs_spouse_avoid`), global anxious attachment style (`tot_ecrrs_global_anx`), depressive symptoms (`tot_bdi`), and time since spouse's death (`timesincedeath` - in days).

### Centering L2 predictors

```{r}
# pull out only the total score variables, age, timesincedeath
gcvars <- as.data.frame(mlm_bx4[, grep('^tot|age|timesincedeath|ID', names(mlm_bx4))])
gcvars1 <- gcvars[!duplicated(gcvars$ID), ]
stargazer(gcvars1, type="text", median = T, digits=3)

gmean_age <- mean(gcvars1$age)
gmean_age_yrs <- mean(gcvars1$age_yrs)
gmean_time <- mean(gcvars1$timesincedeath)
gmean_bdi <- mean(gcvars1$tot_bdi)
gmean_bgq <- mean(gcvars1$tot_bgq)
gmean_ecrrs_g_anx <- mean(gcvars1$tot_ecrrs_global_anx)
gmean_ecrrs_g_av <- mean(gcvars1$tot_ecrrs_global_avoid)
gmean_ecrrs_s_anx <- mean(gcvars1$tot_ecrrs_spouse_anx)
gmean_ecrrs_s_av <- mean(gcvars1$tot_ecrrs_spouse_avoid)
gmean_icg <- mean(gcvars1$tot_icg)
gmean_ysl <- mean(gcvars1$tot_ysl)

mlm_bx5 <- mlm_bx4 %>% mutate(gmean_age = age-gmean_age, gmean_age_yrs = age_yrs-gmean_age_yrs, gmean_timesincedeath = timesincedeath-gmean_time, gmean_bdi = tot_bdi-gmean_bdi, gmean_bgq = tot_bgq-gmean_bgq, gmean_ecrrs_global_anx = tot_ecrrs_global_anx-gmean_ecrrs_g_anx, gmean_ecrrs_global_avoid = tot_ecrrs_global_avoid-gmean_ecrrs_g_av, gmean_ecrrs_spouse_anx = tot_ecrrs_spouse_anx-gmean_ecrrs_s_anx, gmean_ecrrs_spouse_avoid = tot_ecrrs_spouse_avoid-gmean_ecrrs_s_av, gmean_ysl = tot_ysl-gmean_ysl, gmean_icg = tot_icg-gmean_icg)
stargazer(as.data.frame(mlm_bx5), type="text") # note that means of the group-mean centered variables in the long dataset are not zero - I think this is because some people had more trials than others

# test that hypothesis with the non-duplicated data
gcvars2 <- gcvars1 %>% mutate(gmean_age = age-gmean_age, gmean_age_yrs = age_yrs-gmean_age_yrs, gmean_timesincedeath = timesincedeath-gmean_time, gmean_bdi = tot_bdi-gmean_bdi, gmean_bgq = tot_bgq-gmean_bgq, gmean_ecrrs_global_anx = tot_ecrrs_global_anx-gmean_ecrrs_g_anx, gmean_ecrrs_global_avoid = tot_ecrrs_global_avoid-gmean_ecrrs_g_av, gmean_ecrrs_spouse_anx = tot_ecrrs_spouse_anx-gmean_ecrrs_s_anx, gmean_ecrrs_spouse_avoid = tot_ecrrs_spouse_avoid-gmean_ecrrs_s_av, gmean_ysl = tot_ysl-gmean_ysl, gmean_icg = tot_icg-gmean_icg)
stargazer(as.data.frame(gcvars2), type="text") # yes, now the means are zero when each person contributes only one observation

saveRDS(mlm_bx5, "~/Dropbox/2018Fall/FSHD617C/mlm_bx5.rds")
```

### Evaluate missing data 
```{r}
mlm_bx5 %>% 
  select_if(function(x) any(is.na(x))) %>% 
  summarise_all(funs(mean(is.na(.)))) -> mlm_bx5_NAs # or use sum() instead of mean() to find n missing values

mlm_bx5_NAs <- gather(mlm_bx5_NAs) # create long data that lists variables with any NAs, and % of data missing
mlm_bx5_NAs # this does basically the same thing as the propmiss function from HO8, but it only shows variables where there is missing data

## revised this for v2 to use VIM
# install.packages("VIM")
library(mice)
library(VIM)
aggr = aggr(mlm_bx5, col=mdc(1:2), numbers=TRUE, sortVars=TRUE, labels=names(mlm_bx5), cex.axis=.7, gap=3, ylab=c("Proportion of missingness","Missingness Pattern"))
```
One person is missing rs2254298 genotype (`rs2254298` and `rs2254298_A.carrier`), which we already knew about. No one else is missing data on any variable.

### Scale reliabilities
This dataset does not include the item-level data, but we can get that from the master dataset we loaded in earlier.
```{r}
library(psych)
mlm_scales <- filter(master, !grepl("D149",ID))

## BDI
mlm_tot_bdi <- subset(mlm_scales, select=c(bdi_1c:bdi_21c))  
psych::alpha(mlm_tot_bdi) # use psych::alpha so it doesn't conflict w/ggplot's alpha

## BGQ
mlm_tot_bgq <- subset(mlm_scales, select=c(bgq_1:bgq_5))
psych::alpha(mlm_tot_bgq)

## ICG
mlm_tot_icg <- subset(mlm_scales, select=c(icg_1:icg_19))
psych::alpha(mlm_tot_icg)

## YSL
mlm_tot_ysl <- subset(mlm_scales, select=c(ysl_1:ysl_21))
psych::alpha(mlm_tot_ysl)

## ECRRS-global anxiety
mlm_tot_ecrrs_global_anx <- subset(mlm_scales,  select=c(ecrrs_global_7c:ecrrs_global_9c))
psych::alpha(mlm_tot_ecrrs_global_anx)

## ECRRS-global avoid
mlm_tot_ecrrs_global_avoid <- subset(mlm_scales,  select=c(ecrrs_global_1r:ecrrs_global_4r, ecrrs_global_5c, ecrrs_global_6c))
psych::alpha(mlm_tot_ecrrs_global_avoid)

## ECRRS-spouse anxiety
mlm_tot_ecrrs_spouse_anx <- subset(mlm_scales,  select=c(ecrrs_spouse_7c:ecrrs_spouse_9c))
psych::alpha(mlm_tot_ecrrs_spouse_anx)

## ECRRS-spouse avoid
mlm_tot_ecrrs_spouse_avoid <- subset(mlm_scales,  select=c(ecrrs_spouse_1r:ecrrs_spouse_4r, ecrrs_spouse_5c, ecrrs_spouse_6c))
psych::alpha(mlm_tot_ecrrs_spouse_avoid)

```
All reliabilities look good or excellent (a >=.86) except for the Brief Grief Questionnaire, where a = .72. It's a short screening measure with only 7 items, so makes sense that its reliability would be lower.

### Drop the 4 people with low trial counts
Recall that these participants had bias scores on <14% of trials, likely due to failing to reverse the instructions on run 2.
```{r}
data <- filter(mlm_bx5, !grepl("D101|D141|D147|D148",ID))
unique(data$ID) # checking that it worked
```

## OLS estimates for exploratory purposes

```{r}
# getting regular regression estimates one person at a time for exploratory purposes:
lmEst <- by(data, data$ID, function(x) summary(lm(bias_t ~ trialnum, data=x)))
lmEst
```
Peoples' intercepts do look fairly different from each other...

### Stem plot for the intercepts based on the OLS estimates
```{r}
int <- by(data, data$ID, function(x) coefficients(lm(bias_t ~ trialnum, data=x))[[1]])
int

summary(int)
stem(int, scale=2)
hist(int)
```
### Stem plot for the slopes based on the OLS estimates
```{r}
slope <- by(data, data$ID, function(x) coefficients(lm(bias_t ~ trialnum, data=x))[[2]])
slope

summary(slope)
stem(slope, scale=2)
hist(slope)
```
Note that these are all extremely close to zero - probably because of the lack of time effect.

### R-square for OLS models
```{r}

rsq <- by(data, data$ID, function(x) summary(lm(bias_t ~ trialnum, data=x))$r.squared)
summary(rsq)
stem(rsq, scale=2)

cor(int, slope)
```
The correlation between intercepts and slopes is -0.475, so people who show less avoidance bias overall change less over the course of the task.
 
## Graphing multilevel data for exploratory purposes
```{r}
library(lattice)
xyplot(bias_t ~ trialnum | ID, data = data, as.table=T) # too many people/hard to see, try random subset

ids <- sample(unique(data$ID), 12)
ids
temp <- data[data$ID %in% ids, ]
xyplot(bias_t ~ trialnum | ID, data = temp, as.table=T) 

# add fit line(s)
xyplot(bias_t ~ trialnum | ID, data = temp, type=c("p","r")) # linear fit
xyplot(bias_t ~ trialnum | ID, data = temp, type=c("p","smooth")) # curvilinear fit

# graph of the raw data for each person
interaction.plot(data$trialnum, data$ID, data$bias_t)

# generate linear fitted values for each person and then plot all together in one plot
fit <- by(data, data$ID, function(x) fitted.values(lm(bias_t ~ trialnum, data=x)))
fit2 <- unlist(fit)
interaction.plot(data$trialnum, data$ID, fit2, xlab="trial number", ylab="RT bias (transformed)")
```
There does not seem to be either a linear or curvilinear effect of trial number (i.e., time) on RT bias in the task.


## Analyses (NHST)
<center><i>Remember, the effect of time refers to `trialnumber` (this is NOT the `time` variable; `time` refers to time of day when task was run.)</i></center>

The dependent variable, `bias_t`, is the measure of relative behavioral approach/avoidance bias, based on reaction time to push or pull a joystick in response to various images. We transformed it because it was not normally distributed, and use the transformed variable for all analyses.

### Unconditional means model
```{r}
library(nlme)

# unconditional means model
m1 <- lme(bias_t ~ 1, random = ~ 1 | ID, data=data, method="ML")
summary(m1)

# ICC
iVar <- as.numeric(VarCorr(m1)[1,1])
residVar <- as.numeric(VarCorr(m1)[2,1])
icc1 <- iVar/(iVar+residVar) 
icc1
```
The overall mean bias score (transformed) is about -0.002, with a standard deviation of 0.023.

ICC is about 1.6% of the variance. This means that 1.6% of the variance in bias is explained by people being different from one another, while the remainder is explained by...something else.

### Add fixed and random intercept and slope of time
```{r}
# model with fixed and random intercept and slope of time (intercepts implied)
m2 <- lme(bias_t ~ trialnum, data = data, random = ~ trialnum | ID, method="ML")
summary(m2) 
anova(m2)
```
The average bias at trial number = 0 is -0.016 and gets increasingly approach-biased by a nonsignificant fraction of a percent with each unit of time. There is no significant effect of trial number (time) on RT bias score, _t_ = 0.74, _p_ = 0.457. 

```{r}
# ICC
iVar <- as.numeric(VarCorr(m2)[1,1])
residVar <- as.numeric(VarCorr(m2)[2,1])
icc2 <- iVar/(iVar+residVar)
icc2
```
99.99% of variance in bias is due to the between-person variance, after controlling for time. This is consistent with the lack of a significant effect of time. 

### Adding stimulus as a fixed effect
There were 5 different types of stimuli, presented in a randomized order (consistent across participants/sessions) determined by genetic algorithm. Everyone saw the same stimuli in the same order, so it is a fixed effect. The different categories of images were: Death (generic), Deceased (personal photo), Other living loved one (personal photo), Stranger (sex-matched to spouse), and Neutral (objects). The factor `stim` was created such that Neutral was the reference level.

```{r}
# add the fixed effect of stim and compare its fit to m2 by inspecting the t and f tests
m3 <- lme(bias_t ~ trialnum + stim, data = data, random = ~ trialnum | ID, method="ML")
summary(m3)
```
NO effect of time. There is a significant fixed effect of spouse, _t_ = 3.55, _p_ = .0004. There is also a non-significant effect of stranger, _t_ = -1.73, _p_ = .084. This indicates that people tended to show more approach bias to photos of their spouse (across conditions). The parameter estimate is 0.12 with an _SE_ of 0.03. They also tended to show greater more avoidance bias to photos of the sex-matched stranger.

```{r}
intervals(m3, which="fixed")
```
95% CIs don't cross zero for spouse, but do for all other stimulus types. This supports the significant effect of spouse stimuli on reaction time bias.

```{r}
iVar <- as.numeric(VarCorr(m3)[1,1])
residVar <- as.numeric(VarCorr(m3)[2,1])
icc3 <- iVar/(iVar+residVar)
icc3 # Same ICC as m2
```
The ICC is identical to that in model 2. 

```{r}
anova(m2, m3) # compare this model with the first one
```
Comparing the model with the fixed effect of stimulus (m3) to the model without (m2), m3 is better (_p_ < .0001) as evidenced by lower AIC, BIC, and lL values.

### Take out fixed and random effect of time
Because there's no significant effect of time, makes sense to drop it from the model. Now we're basically doing regression but accounting for whatever interdependence exists. <font color="red">Note that these results will _NOT_ match the results of the regressions, because the regressions are based on bias score <u>medians</u> as the DV, rather than means.</font>

```{r}
# without fixed or random effect of time, only fixed effect of stimulus
m4 <- lme(bias_t ~ stim, data = data, random = ~ 1 | ID, method="ML")
summary(m4)
```
Fixed effects are still more or less the same - significant for spouse, marginal for stranger - although the parameter estimates and test statistics are slightly different.

```{r}
intervals(m4, which="fixed")
```
As above, spouse is the only stimulus category whose 95 CIs don't cross zero, supporting a significant effect of spouse on bias - in this case, viewing spouse images was associated with a relative approach bias.

Is m4 any better than m3?
```{r}
anova(m3,m4)
```
Nope. Models do not significantly differ (_p_ = 0.851), though the AIC and BIC values are a little smaller for m4 than m3. So let's forget time and go with m4...

### Adding condition as a fixed effect
Participants completed the task at two different sessions, where they were randomized to receive treatment A at one and treatment B at the other (order counterbalanced across participants). The two treatments are oxytocin vs. placebo. 
```{r}
# add the interaction of stim and check its fit by inspecting t and f tests. 
m5 <- lme(bias_t ~ stim + cond, data = data, random = ~ 1 | ID, method="ML")
summary(m5)
```
The fixed effect of condition B is not significant, _t_ = 0.15, _p_ = 0.884.

```{r}
anova(m4, m5)
```
The two models are not significantly different, but the one with the fixed effect of condition (m5) has slightly higher AIC/BIC. Not compelling enough to select that model over m4, though.

### Adding the stimulus*condition interaction

We might expect that peoples' tendency to approach or avoid might differ depending on which treatment they got, but only for certain stimuli.
```{r}
# add the interaction of stim and check its fit by inspecting t and f tests. 
m6 <- lme(bias_t ~ stim*cond, data = data, random = ~ 1 | ID, method="ML")
summary(m6)
intervals(m6)
```
Interesting. With the interaction in the model, there is a significant effect of death and stranger but not spouse. 95% CIs indicate that viewing the generic death and stranger stimuli are both associated with greater relative avoidance bias. 

```{r}
anova(m6,m4,m5)
```
The model with the interaction in it is not significantly better than either the one with just the fixed effect of condition or with no effect of condition at all, so makes sense to go with the simplest model which only includes stimulus as a fixed effect.

### Adding other plausible predictors
There are a few other variables that we might expect to impact approach/avoidance bias, namely grief severity, OXTR genotype, time since death, and attachment style (avoidant, anxious). 
```{r}
# used group mean centered scores for the additional variables for interpretability
m7 <- lme(bias_t ~ stim + rs53576_A.carrier + rs2268498_C.carrier + gmean_ecrrs_spouse_anx + gmean_ecrrs_spouse_avoid + gmean_age + gmean_icg + gmean_timesincedeath, data = data, random = ~ 1 | ID, method="ML")
summary(m7)
anova(m4,m7)
```
None of the added fixed effects are significant in the new model, and the AIC/BIC values are higher, so take 'em out. 

### Final model

```{r}
# variance explained by the entire model 
fitted <- fitted(m4)
data$fitted <- as.vector(fitted)
rsq <- lm(bias ~ fitted, data=data) 
summary(rsq)
```
A case of significant, but not meaningful: The R<sup>2</sup> for the entire model is .02, indicating that 2% of the variance in bias is explained by the model. So this model is not very good at predicting approach/avoidance bias from the fixed effect of stimulus within person. Oh well.

### Check final model assumptions

####Checking normality of the residuals
```{r}
# checking normality of residuals

# level 1:
qqnorm(m4, ~resid(.), id=0.00075)
plot(m4, resid(., type="p") ~ trialnum, id=0.0025)
```
The QQ plot looks pretty okay, albeit a few outliers. D113 is the person who most falls off the line, and does so at both ends. The residuals are also pretty evenly distributed across time, though many fall outside of the +/- 2 range.

```{r}
# the following bit wants IDs to be numeric, so take out the "D" from each ID
data <- data %>% mutate(numID = as.numeric(str_replace(ID, "D", "")))

# level 2:
qqnorm(m4, ~ranef(.), id=0.25)
ranInt <- ranef(m4)[1] # create object with the random intercept estimates
ranInt <- unlist(ranInt) 
names(ranInt) <- NULL 
ID <- unlist(subset(data, !duplicated(data$numID), select="numID")) # create an ID variable
plot(ranInt ~ ID)

# ranSlope <- ranef(m4)[2] # create an object with the random slopes
# ranSlope<- unlist(ranSlope) 
# names(ranSlope) <- NULL 
# plot(ranSlope ~ ID)

```
Getting an error here for the random slopes <font color="red">`Error in [.data.frame(ranef(m4), 72) : undefined columns selected`</font>. I think this may be because I don't have any random slopes, since I took time out of the model. Indeed, when I go into `m4`, it doesn't look like a second element for the slopes exists. Let's test this...

```{r}
# test my hypothesis, using m3 (where time was included as a random effect)
ranInt1 <- ranef(m3)[1] # create object with the random intercept estimates
ranInt1 <- unlist(ranInt1) 
names(ranInt1) <- NULL 
ID <- unlist(subset(data, !duplicated(data$numID), select="numID")) # create an ID variable
plot(ranInt1 ~ ID)

ranSlope1 <- ranef(m3)[2] # create an object with the random slopes
ranSlope1<- unlist(ranSlope1) 
names(ranSlope1) <- NULL 
plot(ranSlope1 ~ ID)

```
My suspicion was correct: the error message above is because there are no random slopes for my final model.

#### Checking homoscedasticity
```{r}
# checking homoscedasticity

# level 1:
data$resid <- as.numeric(residuals(m4)) # create an object with the Level-1 residuals
plot(data$resid ~ data$stim)

# level 2:
library(lattice)
stim <- unlist(subset(data, !duplicated(data$numID), select="stim"))
xyplot(ranInt ~ stim)
# xyplot(ranSlope ~ stim) - no random slopes so don't run this
```
Note that again there are a lot of residuals falling outside of the +/-2 range.

Also, there's something weird about the L2 plot. Not sure why it shows only 3 of 5 stimulus types.

## Analyses (Bayesian)

###Unconditional means model
First step is to look at the unconditional means model. We know from the exploratory stuff in previous sections that `bias_t` is normally distributed, so we can tell `brms` to use normal likelihood (Gaussian).

```{r}
library(brms)
setwd("/Users/sarenseeley/Dropbox/2018Fall/FSHD617C")
# b1N <- brm(bias_t ~ 1 + (1|ID), data=data, family=gaussian()) 
# saveRDS(b1N, "~/Dropbox/2018Fall/FSHD617C/mlm_ada1_b1N.rds")
b1N <- readRDS("mlm_ada1_b1N.rds")
summary(b1N)

```
Rhat = 1 across the board, so that's good.

```{r}
plot(b1N) # check trace plots
```
I think these plots look okay? b_Intercept and SD_ID__Intercept plots are not exactly on top of each other, but do have a lot of overlap in their distributions.

```{r}
pp_check(b1N, nsamples=20)  # posterior predictive check using normal likelihood
pp_check(b1N, type="error_hist", nsamples=20, binwidth=1) # error histogram (histogram of prediction residuals)
```
The observed distribution is a very good fit to the predicted data (based on the normal distribution).
The error histogram also shows that residuals are normally distributed around 0. See [this page](https://cran.r-project.org/web/packages/bayesplot/vignettes/graphical-ppcs.html) for some helpful documentation.

```{r}
# check ICC to get sense of how variance is distributed between- and within- people
iSd <- as.numeric(summary(b1N)$random$ID[1,1]) # finds 1st row, 1st column of random effects inside the summary of b1N
iVar <- iSd^2
residSd <- as.numeric(summary(b1N)$spec_pars[[1,1]])
residVar <- residSd^2
icc1 <- iVar/(iVar+residVar) 
icc1 # this is the unconditional growth model - just happens in this dataset that it's a nuisance model
```
For the unconditional growth model, the ICC is 0.017, meaning that only about 2% of the variance in bias is because people on average perform differently from each other on the task.

### Time as fixed and random effect
```{r}
## check whether we need time as a control variable
# b2 <- brm(bias_t ~ 1 + trialnum + (1 + trialnum|ID), data=data, family=skew_normal(), control=list(adapt_delta = .95))
# saveRDS(b2, "mlm_ada1_b2.rds")
b2 <- readRDS("mlm_ada1_b2.rds")
summary(b2) 
```
<i>Note: Estimate = mean of posterior distribution for each of the parameters.</i>

**Group-level effects**:
When trialnum is 0, the standard deviation of the intercepts is 0.12.
There is no random effect of time: the slopes are not at all variable (people are not different from each other).
Zero is a credible value for the correlation, because the 95% CIs (credible intervals) cross zero.

Note that Rhat for sd(trialnum) in the random effects is 1.01. Per the [brms documentation](https://github.com/paul-buerkner/brms), this does not seem to indicate a convergence issue due to lack of error messages and the fact that the Rhat for the group-level effect in their example is 1.01: <i>"If 'Rhat' is considerably greater than 1, the algorithm has not yet converged and it is necessary to run more iterations and / or set stronger priors.</i>" However, it does indicate that I would probably want to run it again with a higher number of iterations or stronger priors.

**Population-level effects**:
There is no fixed effect of time, either, given that the upper and lower CI limits are -0.0, 0.0.

```{r}
plot(b2)
```
SD_ID__trialnum and cor_ID__Intercept__trialnum are totally skewed.
```{r}
pp_check(b2, nsamples=20)
pp_check(b2, type="error_hist", nsamples=20, binwidth=1) 
```
Again, the trace plot shows that the observed distribution is a very good fit to the predicted data based on the normal distribution.
The error histogram also shows that residuals are normally distributed around 0. 

```{r}
# if you get the error message
## Error in (function (which = dev.cur())  : 
##   cannot shut down device 1 (the null device)

# run dev.set(dev.next()) a few times until it returns something like
## > dev.set(dev.next())
## quartz_off_screen 
##                4 
```

_SKIPPING THE CHUNK BELOW..._
I can't get sjstats to install (issue with the specific TMB package version- freaking dependencies are the bane of my existence with all these different packages), and no point in running cross validation on a model that we're going to drop anyway due to lack of both fixed/random effects of time.

```{r}
# install.packages("sjstats")
# library(sjstats)
# equi_test(b2) # is there a fixed effect?
# waic(b1N, b2) # does WAIC think we should keep time? If so, given there is no fixed effect, this is due to the random effect accounting for variance.

# b1 <- add_ic(b1, ic="R2")
# b2 <- add_ic(b2, ic="R2") 
# mean(b1$R2)
# mean(b2$R2) # does R2 thinks we should keep time? 

# kfold(b1SN, b2, K=10) # use CV to check for over-fitting
# this takes too long, so reduce n(folds) or only perform for final model
```

### Fixed effect of stimulus
```{r}
# b3 <- brm(bias_t ~ 1 + stim + (1 |ID), data=data, family=skew_normal(), control=list(adapt_delta = .95))
# saveRDS(b3, "mlm_ada1_b3.rds")
b3 <- readRDS("mlm_ada1_b3.rds")
summary(b3) 
```
Similar to the results of the NHST analysis, there is a notable fixed effect of spouse (estimate = 0.12, est. error = 0.03, 95% CI[0.05,0.18]) on bias, such that people show more approach bias on trials where the stimulus is their spouse.

```{r}
plot(b3)
marginal_effects(b3)
```
The high density intervals illustrate the positive effect of spouse on bias, which we also see in the marginal effects plot. In the analysis above, the 95% CI for death and stranger stimuli barely crossed zero, which corresponds with what we see in the marginal effects plot, where the upper limit of the error bar is very close to zero (or below). This also corresponds with the effects we saw in NHST, where there were significant or marginally significant effects of death and stranger in addition to spouse.

```{r}
pp_check(b3, nsamples=20)
pp_check(b3, type="error_hist", nsamples=20, binwidth=1) # looks like we have improved the residuals; if we rerun this random sampling at least some of the time we no longer see the lower end outliers
# equi_test(b3) # TRY THIS CODE WHEN I GET SJTOOLS TO INSTALL (what does ROPE tells us about the size of effect?)
# waic(b2, b3) # TRY THIS CODE WHEN I GET SJTOOLS TO INSTALL
```

```{r}
b2 <- add_ic(b2, ic="R2") 
b3 <- add_ic(b3, ic="R2") 
round(mean(b2$R2), digits=3)
round(mean(b3$R2), digits=3)
```
In both models, we have explained about 2 percent of the variance (1.7% vs. 2.0%).

### Adding the stimulus*condition interaction (no time)

One of the original study hypotheses was that there would be an interaction (actually, a 3-way interaction with group, but not going to go there for ADA #1; also we may be underpowered for that) between stimulus and condition, such that people might display different response biases for different stimulus categories in condition A vs. B, or vice versa.

```{r}
# b4 <- brm(bias_t ~ 1 + stim*cond + (1|ID), data=data, family=skew_normal(), control=list(adapt_delta = .95))
# saveRDS(b4, "mlm_ada1_b4.rds")
b4 <- readRDS("mlm_ada1_b4.rds")
summary(b4) 
```
Got some warnings while running:

`Rejecting initial value:`
  `Gradient evaluated at the initial value is not finite.`
  `Stan can't start sampling from this initial value.`
`Rejecting initial value:`
  `Log probability evaluates to log(0), i.e. negative infinity.`
  `Stan can't start sampling from this initial value.`
  
Apparently setting priors may help with this.

Here, we see notable effects of death and stranger stimuli, but not condition nor its interaction with stimulus type.
  
```{r}
plot(b4)
marginal_effects(b4) 

pp_check(b4, nsamples=20)
pp_check(b4, type="error_hist", nsamples=20, binwidth=1) # residuals seem about the same
# equi_test(b4) 
```
Nothing terribly new here, except for the marginal effects. Looking at the plot for `stim`, the distributions for spouse and stranger and spouse and death don't seem to overlap, and living loved one and death just barely overlap. Unsurprisingly, people tend to show more approach bias for spouse and living loved one photos compared to death and stranger photos. The third marginal effects plot illustrates the lack of condition or stimulus*condition interaction effects.
```{r}
b4 <- add_ic(b4, ic="R2") 
round(mean(b3$R2), digits=3)
round(mean(b4$R2), digits=3)
```
The variance accounted for shows a negligable increase with the stimulus*condition interaction in the model (goes from 0.020 to 0.022). The negligable effect is shown in the plot - note no real differences

### Using priors
```{r}
# First find out what default priors are. 
data <- filter(mlm_bx5, !grepl("D101|D141|D147|D148",ID))
get_prior(bias_t ~ 1 + stim*cond + (1|ID), data=data, family=skew_normal())
```
`brms` has set some weakly informative default priors on some of the special parameters. The default for the random coefficients is a half student-t with 3 degrees of freedom.

Let's assume some weak prior knowledge from previous research. It would actually be somewhat difficult to get solid priors from previous research on this task, given that no previous work has used this specific variant in this population with this specific manipulation (i.e., oxytocin vs. placebo) so I'm making these numbers up.
```{r}
# Let's assume prior studies have found the following slope estimates:
# stimulus ~ .03 (.01)
# condition ~ -.03 (.01)

# for weak, we use normals with means from prior work and SDs chosen to create a weak peak (I got these by playing with visualizations - they all are similar given the estimates from prior work - e.g., all effects were smaller than one with SDs circa .01-.15)

# visualizing to get appropriate weak and strong SDs
x <- seq(-1, 1, .1) # create sequence for x-axis 
plot(x, dnorm(x, mean= .20, sd=.7), type="l", ylim=c(0,1)) 

pWeak <- c(set_prior("normal(.03, .7)", class="b", coef="stim"),
			set_prior("normal(-.03, .7)", class="b", coef="cond"))

# b5 <- brm(bias_t ~ 1 + stim*cond + (1|ID), data=data, family=skew_normal(), prior = pWeak, control=list(adapt_delta = .95))
# saveRDS(b5, "mlm_ada1_b5.rds")
# b5 <- readRDS("mlm_ada1_b5.rds")
#summary(b5)
#plot(b5)
#marginal_effects(b5)
```
Based on the error message here (<font color="red">The following priors do not correspond to any model parameter...</font>), I clearly have no idea what I'm doing and need to go back to that lesson (`get_prior` was not helpful to me, but thanks anyway, brms).