Navona Calarco
PSYC2002
12/13/2019

Introduction

Neurocognitive and social cognitive deficits are pervasive in schizophrenia (SSD). Both deficits are often evident early in illness course, highly predictive of functional outcome, and largely unalleviated by existing pharmacological intervention. A great deal of research has described the unique and shared variance of neurocognition and social cognition deficits, as well as their respective, and mutual, relationship to other hallmark features in SSD. More recently, interest in biologically-driven detection and targeted intervention has driven a smaller body of work to investigate the neurobiology underlying neurocognition and social cognition deficits. To date, most imaging studies have tested the association of a single construct (i.e., either neurocognition or social cognition, often considered as unitary rather than dimensional) with whole-brain derived features. As a result, it is unknown if neurocognitive and social cognitive deficits arise from overlapping or distinct brain pathology.

Objective. In the present analysis, we apply a multivariate statistical technique, canonical correlation analysis (CCA), to illuminate the relationship between neurocognition and social cognition, in select white matter tracts believed to underlie social cognition, across a spectrum of SSD and healthy controls (HCs). Though CCA analysis does not lend itself to traditional hypothesis formulation or testing, we expect to find that white matter disintegrity in these tracts will predict social cognitive impairment, and also neurocognitive impairement, though perhaps only in some facets, and likely to a lesser degree.


Methods

Data were extracted from the collaborative multi-centre study, ‘Social Processes Initiative in Neurobiology of the Schizophrenia(s)’ (SPINS), designed to identify neural circuitry related to social cognitive impairment, in nearly 500 adults who either have SSD or no psychiatric diagnosis (HC). The study has an extensive multi-modal imaging protocol, as well as ‘deep phenotyping’ of neurocognitive and social cognitive ability, from multiple days of participant assessment.

DWI acquisition and preprocessing. The present study analyzed diffusion weighted imaging (DWI) scans, which are sensitive to white matter microstructure. Despite prospective efforts to harmonize DWI acquisitions across sites, analyses revealed that derived metrics showed a large, non-linear scanner effect (i.e., different scanner models produced signals that varied nonlinearly as a function of brain tissue), and we have therefore elected to analyze only data collected from three matched 3T Siemens PRISMA MRIs with 64-channel head-coils. All DWI scans were a 60-direction EPI dual spin echo sequence with the following identical parameters: b=1000, 5 b0s, TR/TE=8800/85ms, FOV=256mm, 128x128 matrix, and 2mm isotropic voxels. DWI data were denoised, skull-stripped, aligned, and motion- and distortion-corrected using FSL and MRtrix software. Data underwent automated and manual quality control after every preprocessing step.

White matter tract estimation and ROI selection. We used the Slicer whitematteranalysis computational pipeline to fit a tensor and perform deterministic whole brain fiber tracking. Streamlines were bundled via registration to the ORG atlas (Zhang 2018), and parcellated into 58 deep white matter tracts. From these tracts, we estimated fractional anisotropy (FA), which indicates the coherence with which water molecules diffuse along tissue, and is held to be a proxy of white matter integrity (values closer to 1 are indicative of healthy tissue; those closer to 0 indicate probable pathology). For this analysis, we elected to analyze data from the bilateral arcuate fasciculus (AF), uncinate fasciculus (UF), and inferior longitudinal fasciculus (ILF) (Figure 1). In past work, we have found FA values in the right AF, left UF, and right ILF to be highly related to social cognition (Voineskos, 2013; Wheeler, 2015; Behdinan, 2015). Here, we seek to replicate this finding, confirm its hemispheric sensitivity, and investigate if these regions are also implicated in neurocognition.

Neurocognitive and social cognitive assessment. Neurocognitive status was assessed via the MATRICS (Measurement and Treatment Research to Improve Cognition in Schizophrenia) Cognitive Consensus Battery (MCCB), which provides an evaluation of key cognitive domains relevant to SSD (Nuechterlein, 2008). In this analysis, we included factor scores for Processing speed, Attention & vigilance, Working memory, Verbal learning, Visual learning, and Problem solving. Social cognitive ability was captured by factor scores representing lower-level Simulation and higher-level Mentalizing (derived from performance on the Penn Emotion Recognition Test (ER40), the Reading the Mind in the Eyes Test (RMET), the Empathic Accuracy (EA) task, and the Awareness of Social Inference Test-Revised (TASIT); Oliver, 2018). Importantly, all neurocognitive and social cognitive assessments were administered experimentally, and produce objective performance-based outcomes. Moreover, though all assessments are sensitive to impairement in SSD, a wide distribution of performance is evident across the SSD-HC continuum (no floor or ceiling performance).

Canonical Correlation Analysis. CCA reveals multivariate patterns of linked dimensions between two sets of variables, often denoted as \(X\) and \(Y\). Our \(X\) set was comprised of the 6 brain variables (namely FA in bilateral AF, ILF, and UF). The \(Y\) set was comprised of 8 behavioural variables: 6 neurocognition variables, and 2 social cognition variables, described above. Though CCA is a correlational method that precludes causal inference, we can think of the \(X\) set as the ‘predictor’ set, and the \(Y\) set the ‘criterion’ set, given the theoretical assumption that brain structure underlies behaviour.


Results

Table 1. Participant characteristics (means and standard deviations).

Combined, n=91 HC, n=37 SSD, n=54
Demographics and clinical
Sex (F:M) 40:51 20:17 20:34
Age 33.71 (11.12) 32.51 (10.58) 34.54 (11.50)
Education (years) 14.71 (2.48) 16.11 (2.00) 13.76 (2.34)
WTAR 106.82 (11.85) 112.59 (10.87) 102.97 (11.02)
BPRS – – 31.17 (7.93)
SANS – – 24.83 (11.82)
Fractional anisotropy (FA) [\(X\) set]
AF (L) 0.40 (0.03) 0.41 (0.03) 0.40 (0.03)
AF (R) 0.38 (0.03) 0.39 (0.02) 0.38 (0.03)
UF (L) 0.30 (0.03) 0.30 (0.03) 0.29 (0.03)
UF (R) 0.30 (0.03) 0.31 (0.03) 0.30 (0.02)
ILF (L) 0.38 (0.03) 0.38 (0.03) 0.37 (0.02)
ILF (R) 0.38 (0.03) 0.39 (0.03) 0.37 (0.03)
Neurocognition [\(Y\) set]
Processing speed 44.43 (14.68) 53.81 (12.15) 38.00 (12.74)
Attention & vigilance 42.65 (14.30) 47.09 (15.11) 39.62 (13.01)
Working memory 43.86 (10.80) 47.59 (10.69) 41.30 (10.21)
Verbal learning 44.74 (11.90) 52.51 (11.45) 39.41 (8.97)
Visual learning 40.74 (13.25) 48.41 (11.10) 35.48 (12.05)
Problem solving 43.42 (11.69) 46.51 (10.70) 41.30 (11.95)
Social cognition [\(Y\) set]
Simulation -0.11 (0.84) 0.32 (0.56) -0.41 (0.87)
Mentalizing -0.08 (0.90) 0.50 (0.42) -0.48 (0.92)
Note:
The BPRS and SANS were only administered to participants with SSD; All neurocognition
scores are MATRICS MCCB factor scores and have been normed for age and sex (Nuechterlein
et al., 2008); Both social cognition scores are standardized factor scores that have
been adjusted for age and sex (Oliver et al., 2018).
HC = healthy control, SSD = schizophrenia spectrum disorder, WTAR = Wechsler Test of
Adult Reading, BPRS = Brief Psychiatric Rating Scale, SANS = Scale for the Assessment of
Negative Symptoms, AF = arcuate fasciculus, UF = uncinate fasciculus, ILF = inferior
longitudinal fasciculus, L = left, R = right.

Participants. Participant characteristics are presented in Table 1. Data from 91 participants (37 HC, 54 SSD) were available for analysis after excluding participants for excessive missing observations, defined as ≥2/6 FA values (n=4 participants), ≥2/6 neurocognition factor scores (n=3), either social cognition factor score (n=4), or more than ≥3 missing values across all variables (n=0). Missing data from participants with below threshold missingness (n=30) were interpolated as the average value across their diagnostic group (HC, SSD). No participants were excluded for in-scanner motion or poor DWI data quality. Note that the 14 variables across the 91 participants exceeded the ideal 10:1 observation-to-variable ratio guideline for CCA, but not so greatly that dimensionality reduction techniques (e.g., PCA) were thought warranted to avoid over-fitting. In general, our sample was gender-balanced, young adult, well educated, and the SSD participants were clinically stable.

Statistical preprocessing. All statistical tests were run with R version 3.5.3. We regressed age, sex, and motion from the \(X\) set, as all are known to confound FA estimates, and also because both the neurocognition and social cognition factor scores in the \(Y\) set had been normed for age and sex. For ease of interpretation, all variables in the \(X\) and \(Y\) sets were standardized via Z-scoring. All variables were found to be heteroscedastic, absent of outlying values, linearly related, and univariate and multivariate normal. We also examined raw correlations within and between the \(X\) and \(Y\) sets, to affirm our conceptual grouping. As expected, we found small-to-moderate positive correlations between brain variables (\(R_{xx}\) matrix mean r=.27, range=-.05-.58) and strong correlations between neurocognitive and social cognitive variables (\(R_{yy}\) matrix mean r=.49, range=.31-.85; see Supplementary Figure 1). We also found that several variables in the \(R_{xy}\) matrix showed some multicolinearity, as visualized by an adjusted \(R_{xy\omega}\) matrix (see Supplementary Figure 2).

Selection of canonical variates. The CCA analysis produced 6 canonical functions (analogous with the size of the smaller \(X\) set). First, we employed a permutation test (500 bootstraps) to evaluate the null hypothesis of no correlation between the \(X\) and \(Y\) sets. Wilk’s lambda confirmed correlation between the sets (p=.010, r=.559), with an overall effect size of 60% variance explained across the entire model. Similar values were seen when employing the Hotelling-Lawley Trace (p=.002, r=.629), the Pillai-Bartlett Trace (p=.004, r=.531), and Roy’s Largest Root (p=.026, r=.221), with all canonical correlations included in all models. We proceeded to evaluate the functions stepwise for significance, magnitude, and redundancy (Hair, Anderson, and Tatham, 1998). Using Wilk’s lambda, we found only the first function to be significant in isolation, \(\lambda\)=.397, F(48, 382.93)=1.65, p=.006, with a large magnitude, \(R_c\)=.571 (see Figure 2 for a visualization of the association between all participants’ \(X\) abd \(Y\) scores). The first function accounted for 45.1% of model variance (calculated as canonical roots, i.e., variance shared by the linear composites). The Stewart-Love redundancy index (i.e., variance in one set that can be explained by the other set) was 7.7% for the \(X\) set and 12.5% for the \(Y\) set. The low value for the \(X\) set is unproblematic, given the clear delineation between the predictor and criterion variables. The low value for the \(Y\) is of potentially more concern, but we hold it is not so low to warrant abandonning interpretation in the context of brain-behaviour research, and thus continue to interpret the first canonical function. Table 2 shows standardized canonical function coefficients (\(\beta\)), structure coeffients (\(r_s\)), and squared structure coefficients \(r_s^2\) (here analogous with communalities) for the first derived function. Canonical weights (\(\beta\)) are included mostly for illustrative purposes, but not interpreted, as they are unstable in the face of multicollinearity, which we have already demonstrated is a feature of the \(R_{XX}\) matrix in Supplementary Figure 2.

Table 2.

Function 1
Variable \(\beta\) \(r_s\) \(r_s^2\) (\(h^2\))
X
AF (L) -0.191 0.354 0.126
AF (R) 0.358 0.464 0.215
UF (L) -0.540 -0.192 0.037
UF (R) 0.632 0.3 0.090
ILF (L) -0.028 0.509 0.259
ILF (R) 0.748 0.832 0.693
Y
Processing speed 0.366 0.593 0.352
Attention & vigilance -0.060 0.342 0.117
Working memory 0.104 0.512 0.262
Verbal learning 0.020 0.547 0.299
Visual learning -0.303 0.489 0.240
Problem solving -0.193 0.272 0.074
Simulation 0.484 0.911 0.831
Mentalizing 0.529 0.943 0.890
Note:
\(r_s\) values > .450 are emphasized, following convention;
\(\beta\) = standardized canonical function coefficient;
\(r_s\) = structure coefficient;
\(r_s^2\) = squared structure coefficient, here also communality \(h^2\)

Validation. Lastly, we validated our model via iterative feature removal, i.e., we removed one of 14 features from the combined \(X\) and \(Y\) sets, ran the CCA, and compared the 12 derived canonical correlation coefficients (note that 5 \(R_c\) are derived for the 5 iterations in which a feature is removed from the smaller \(X\) set). This procedure leveraged the CCA property that each variable relates to all other variables in both sets: if the model is stable, canonical correlation estimates will remain similar; if unstable, estimate will vary widely. Values were similar across all iterations, suggesting that our model is stable (Supplementary Figure 3). We were unable to perform more rigorous validation techniques, such as split-half reliability, or cross-validation on unseen samples, due to limitated sample size / unavailability.

Exploratory analysis. Though increasing evidence suggests that neurocognition and social cognition are dimensional constructs on which HC and SSD fall along a continuum, as opposed to discrete classes, we were curious to verify that HC and SSD participants exhibited a complementary pattern of correlation within or between \(X\) and \(Y\) sets. Because our sample size is insufficient for CCA, we merely observed their respective \(R_{xx}\), \(R_{yy}\), and \(R_{xy}\) matrices (see Supplementary Figure 4). We found that correlation patterns appear to be similar across HC and SSD populations.


Discussion

Summary. We performed a CCA on a small \(X\) set of white matter tracts (bilateral AF, UF, ILF), and a \(Y\) set of neurocognitive and social cognitive variables, across 91 participants along the HC-SSD spectrum. CCA revealed one significant relationship between these sets, \(R_c\)=.571. Thus, it appears that the structural integrity of these white matter tracts is generally predictive of neurocognitive and social cognitive ability. In addition to evalulating the strength of correlation between \(X\) and \(Y\) sets, CCA allows for examination of which variables within each set contributed most to the correlation (i.e., CCA is highly interpretable). Commonly evaluation metrics are reported in Table 2.

Interpretation of structure coefficients. Structure coefficients (\(r_s\)) reveal the correlation between a given variable and the synthetic set to which it belongs, irrespective of the contribution of other variables. Examination of structure coefficients for the \(X\) set reveals that FA values in the right ILF contributed most highly (\(r_s\)=.832), followed by the left ILF (\(r_s\)=.509) and right AF (\(r_s\)=.464). In contrast, the left AF and right UF made modest contributions, and the left UF a negligible contribution, suggesting these tracts may not be very strongly related to the synthetic variable combining neurocognition and social cognition. Moreover, the left UF shows a negative structure coefficient, meaning that it is inversely related to the other brain variables. This is surprising, given that the left UF has, in prior lab work, been found to be highly and robustly associated with social cognition. This discrepancy may be the result of a larger sample, different characterization of social cognition, and/or different processing of DWI data; future lab work will have to examine this discrepancy more carefully.

Examination of the \(Y\) set shows some very high structure coefficients. Specifically, the social cognition variables Simulation (\(r_s\)=.911) and Mentalizing (\(r_s\)=.943) made substantive contributions. Additionally, four of six neurocognition variables made large contributions: Processing speed (\(r_s\)=.593), Working memory (\(r_s\)=.512), Verbal learning (\(r_s\)=.547), and Visual learning (\(r_s\)=.489). All of the \(Y\) set structure coefficients have the same sign, indicating that all variables are positively related. It is telling that the Attention & vigilance and Problem solving factors do not reveal high structure coefficients: this suggests that these factors are not strongly related to white matter integrity in the included tracts, and also provides evidence that neurocognition has multidimensional representation in the brain, as well as behaviour.

Interpretation of squared structure coefficients / communalities. We also report squared structure coefficients (\(r_s^2\)), which indicate the proportion of variance that a given variable linearly shares with the synthetic variable generated from that variable’s set. In the case of interpreting a single canonical function, the squared structure coefficients are analogous to canonical communality coefficients (\(h^2\)), which provide a summative measure of how useful a given variable was to the solution across all interpreted canonical functions. Here, we see that the right ILF was most of use to the \(X\) set, and the Simulation and Mentalizing variables were of high use to the \(Y\) set. This is very consistent with, and serves as a replication of, our past findings which indicate the strong relationship between right ILF disintegrity and social cognitive impairment.


Conclusion

We used the multivariate technique CCA to uncover links between integrity of white matter microstructure (operationalized as FA) and domains of neurocognition and social cognition. To the best of our knowledge, this is the first study to examine how tracts implicated in social cognition may also subserve select facets of neurocognition, in a large sample of SSD and HCs.

At least one aspect of CCA makes it particularly attractive to the task of elucidating brain and behaviour links in psychiatry: it holistically integrates observations from transdiagnostic groups (here, HC and SSD), and thus avoids the increasingly noted pitfalls of case-control design. Accumulating evidence suggests that the neurobiology underlying particular deficits in SSD is also evident in HC. Our results, though preliminary, are suggestive that this relationship holds for the association between white matter microstructure and neurocognitive and social cognitive deficits in both HC and SSD. Future research should further test this association transdiagnostically, for example, by including data from other disorders with noted neurocognitive and social cognitive deficits.

Despite these promising links, several limitations should be noted, both of the present study design and the CCA statistical method. In relation to the former, perhaps the greatest limitation is that our current sample size does not allow for the inclusion of a greater number of white matter tracts in the \(X\) set. Increasing our sample size will allow for simultaneous examination of tracts hypothesized to underlie neurocognition, for example, as well as the inclusion of additional data types, such as genomics, which are likely to capture differing sources of biological heterogeneity. Relatedly, obtaining an independent replication sample is essential for cross-validation, i.e., the ability to assess the model’s ability to generalize to unseen, unrelated samples. Pertaining to methods, it must be remembered that as a linear model, CCA assumes additive covariation patterns, which may prove a simplification of brain-behaviour relationships. Finally, CCA is ‘merely’ correlational; investigating the cause(s) of the reported correlations remains a challenging issue for future experimental work.

Nonetheless, multivariate analyses that illuminate the shared and distinct neural basis of disabling deficits, such as neurocognitive and social cognitive impairement, are sure to accelerate the design of novel, biologically-targeted treatments for SSD, and psychiatry more broadly.


Data availability. The data reported in this paper have been deposited in the RDoC database: R01-MH102324-01A1

Code availability. All analysis code is available here: https://github.com/navonacalarco/PSYC2002_finalAssignment


References

Behdinan, T., Foussias, G., Wheeler, A.L., Stefanik, L., Felsky, D., Remington, G., Rajji, T.K., Chakravarty, M.M., Voineskos, A.N. (2015). Neuroimaging predictors of functional outcomes in schizophrenia at baseline and 6-month follow up. Schizophrenia Research., 169(1), 69-75.

Hair, J.F Jr., Anderson, R.E., Tatham. R.L., Black, W.C. (1998). Chapter 8. Multivariate Data Analysis, 5th edition. Prentice Hall.

Nuechterlein, K.H., Green, M.F., Kern, R.S., Baade, L.E., et al. (2008). The MATRICS Consensus Cognitive Battery, Part 1: Test selection, reliability, and validity. American Journal of Psychiatry, 165(2), 203-213.

Oliver, L.D., Haltigan, J.D., Gold, J.M., Foussias, G., DeRosse, P., Buchanan, R.W., Malhotra, A.K., Voineskos, A.N. (2019). Lower- and higher-level social cognitive factors across individuals with Schizophrenia Spectrum Disorders and healthy controls: Relationship with neurocognition and functional outcome. Schizophrenia Bulletin, 45(3), 629-638.

Voineskos, A.N., Foussias, G., Lerch, J., Felsky, D., Remington, G., Rajji, T.K., Lobaugh, N., Pollock, B.G., Mulsant, B.H. (2013). Neuroimaging evidence for the deficit subtype of schizophrenia. JAMA Psychiatry, 70(5), 472-480.

Wheeler, A.L., Wessa, M., Szeszko, P.R., Foussias, G., Chakravarty, M.M., Lerch, J.P., DeRosse, P., Remington, G., Mulsant, B.H., Linke, J., Malhotra, A.K., Voineskos, A.N. (2015). Further Neuroimaging Evidence for the Deficit Subtype of Schizophrenia: A Cortical Connectomics Analysis. JAMA Psychiatry, 72(5), 446-455.

Zhang, F., Wu, Y., Norton, I., Rathi, Y., Makris, N., O’Donnell, L.J. (2018). An anatomically curated fiber clustering white matter atlas for consistent white matter tract parcellation across the lifespan. NeuroImage, 179, 429-447.


Supplementary Figures

Supplementary Figure 1. Correlation matrices for the \(X\) and \(Y\) sets. A. The \(R_{xx}\) matrix (brain variables) generally shows small-to-moderate positive correlations between variables. B. The \(R_{yy}\) matrix (neurocognition and social cognition variables) shows moderate-to-strong positive correlations between all variables.


Supplementary Figure 2. Cross-correlation, omega, and difference matrices. A. The The \(R_{xy}\) matrix shows cross-correlations between variables in the \(X\) and \(Y\) sets. We see some evidence of multicolinearity. B. The omega matrix shows the \(R_{xy}\) matrix adjusted for redundancy. The omega matrix is calculated as the product of the inverse of the Choleski factorization of the \(R_{xx}\) matrix, the \(R_{xy}\), and the inverse of the Choleski factorization of the \(R_{xy}\) matrix; C. The difference matrix shows the magnitude of difference between the original \(R_{xy}\) and difference matrices.



Supplementary Figure 3. Model validation. We validated our model using iterative feature removal, i.e., we removed one of 14 features from the combined \(X\) and \(Y\) sets, ran the CCA, and compared the derived canonical correlation coefficients. The plot shows the 12 comparions (black dots), as well the original model (red dots), across all 6 canonical functions. Note that the 6th canonical function has values only from 7 of the 12 iterations, as the \(X\) set had 5 features in 5 of the 12 iterations. We see that, though iteratively removing a variable does alter the canonical correlation coefficients, the change is not drastic, nor statistically significant.


Supplementary Figure 4. Raw correlation matrices within and between HC and SSD groups. We compared raw correlation matrices within and between HC and SSD groups, to gain a sense of if patterns between both groups were generally representative of both groups, or differentially driven by one. Our sample size was insufficient to run CCA; thus, we inferred on the basis of visual review that all matrices seem comparable between HC and SSD.