Genetic and Environmental Effects

dummy slide

Background

Before MEB and KTH

Wanted to study economics, finance, or computer science.

Realized that I enjoy statistics.

Particularly, computational statistics.

Got a Ph.d. from Copenhagen Business School.

Worked with default models which mostly are survival models.

Position

Postdoc supervised by Keith, Mark, and Hedvig Kjellström from KTH.

Work on variational approximations in biostatistics.

A way of approximating integrals when estimating models. Often very fast.

Has almost nothing to do with today’s topic!

Except also having to do with integral approximations.

Motivation

Overview

Mention previous work and the type of data.

Introduce the models.

Highlight limitations of previous estimation methods.

Example

Suppose that the above is family \(i\) for which we observe \(Y_{i1},\dots Y_{i10} \in \{0, 1\}\).

Circles are females and squares are males.

OCD Study

Genes are known to have an impact on the risk of obsessive-compulsive disorder (OCD).

Genetic and environmental influences on a maternal phenotype can affect the phenotype of the child, in turn.

Mahjani et al. (2020) look at direct genetic effects and maternal effects on OCD

controlling for sex and age of the mother.

ASD Study

Both rheumatoid arthritis (RA) and autism spectrum disorder (ASD) seem to be effected by genes and share risk factors.

Interesting to study the potential genetic link with RA in mothers and ASD in children.

Joint work with Evora Hailin Zhu, Benjamin Yip, and Sven.

Research Questions

Want to estimate unobserved genetic effects, environmental effects, paternal effects, etc. for binary outcomes.

Other outcomes have been studied in the department.

E.g. pre-eclamptic events, melanoma onset, schizophrenia and bipolar disorder, and preterm birth (Pawitan et al. 2004; Lindström et al. 2006; Svensson et al. 2009; Lichtenstein et al. 2009; Yip et al. 2018; Bai et al. 2019).

Liability-threshold Models

\[ Y_{ij} = \begin{cases} 1 & \vec x_{ij}^\top\vec\beta + \epsilon_{ij} > 0 \\ 0 & \text{otherwise} \end{cases} \]

where \(\epsilon_{ij}\) is standard normally distributed. We observe the \(Y_{ij}\)s which are zero or one.

It is a GLM with \(\Phi(P(Y_{ij} = 1)) = \vec x_{ij}^\top\vec\beta\) where \(\Phi\) is the standard normal CDF.

There is a latent score \(x_{ij}^\top\vec\beta + \epsilon_{ij}\) and we observe \(Y_{ij} = 1\) if this exceeds the threshold zero.

This choice allows us to use fast methods later.

Liability-threshold Models (continued)

The Kinship Matrix

Want to add additive genetic effects.

Use random effects with a correlation matrix given by the kinship matrix.

Entries are the probability that a randomly selected allele from a locus will be identical by descent between two individuals.

Example

Left the family. Right the kinship matrix.

Darker colors are further from zero.

Adding Additive Genetic Effects

\(g_{ij}\) is the additive genetic effect for individual \(j\) in family \(i\) and \(\mat K_i\) be the kinship matrix of family \(i\). Suppose that

\[(g_{i1},\dots,g_{i10})^\top \sim N^{(10)}(\vec 0, \sigma_g^2 \underbrace{2\mat K_i}_{\mat C_{ig}}).\]

Write the model as

\[ \begin{align*} Y_{ij} &= \begin{cases} 1 & \vec x_{ij}^\top\vec\beta + \epsilon_{ij} + g_{ij} > 0 \\ 0 & \text{otherwise} \end{cases}. \end{align*} \]

ACE Model

Wants to account for environmental effects also: the ACE model.

Additive genetic effect (A),
shared environmental factors (C), and
individual specific effects and measurement errors (E).

The environmental effects (C) are often based on strong assumptions.