Dr Niamh Cahill (she/her)
IGCP 725 Radiocarbon workshop (January 2022)
In general, when we model time dependent data we don’t usually have to worry about having errors in the time variable
But that is not the case when the time dependent data is a reconstruction (e.g., a sea level reconstruction)
In my research, usually on the analysis of sea-level reconstructions, I use the output from age-depth models (e.g., Bchron, Bacon) which provide age estimates with uncertainty for core sediment samples.
So, here I will explore the impact of this age uncertainty in the statistical modelling of time dependent data.
I use a Bayesian approach to statistical modelling and when it comes to the specifying the Bayesian model recipe, the ingredients are
Combining the priors and the data model gives us the posterior. The posterior tells us everything we need to know about what we are trying to estimate.
Expected Y = f(parameters,x)
Uncertain Data (Y) = Expected Y + error
Priors = Constraints
Expected Y = f(parameters,x), where f = linear
\[\text{Expected } Y = \alpha + \beta Age\]
Y = Expected Y + error, where error \(\sim\) Normal
\[Y = \alpha + \beta Age + error\]
\[error \sim Normal(0, \sigma_{Y})\]
Priors = Constraints
We might assume we don’t know much about \(\alpha\) and \(\beta\)
\(\sigma_{Y}\) must be positive
Instead of \[\text{Expected } Y = \alpha + \beta Age\] What we want is \[\text{Expected } Y = \alpha + \beta Age^{TRUE}\]
But we don’t know \(Age^{TRUE}\) 😞
We can again assume that
X = Expected X + error, where error \(\sim\) Normal
\[Age = Age^{TRUE} + error\]
\[error \sim Normal(0, \sigma_{Age})\]
Priors = Constraints
We need a prior for \(Age^{TRUE}\) with a plausible range.
This is called an Errors-in-Variables model.
We can extend the Errors-in-Variables idea to be used with more complex process models.
Expected Y = f(parameters,xtrue), where f = change point model
\(\text{Expected } Y = \alpha + \beta_1 (Age^{TRUE} - cp) \hspace{2em} \text{for x $<$ $cp$}\)
\(\text{Expected } Y = \alpha + \beta_2 (Age^{TRUE} - cp) \hspace{2em} \text{for x $\geq$ $cp$}\)
Expected Y = f(parameters,xtrue), where f = Gaussian process model
\(\text{Expected } Y \sim GP(0,K)\)
\[K_{i,j} = \sigma_g^2exp\bigg(-\rho^2(Age_i^{TRUE} -Age_j^{TRUE})^2\bigg)\]
Firstly, we’ll look at this web application to see the impact of accounting for vs not accounting for age errors in a simple linear regression model.
Then, if you are interested in taking this even further, you can download and use this R code to run the simple regression models and Gaussian process models on simulated data (and your own data).