We are given the joint normal distribution:
\[ \begin{pmatrix} Y_1 \\ Y_2 \end{pmatrix} = \begin{pmatrix} Y(\mathbf{s}_0) \\ \mathbf{y} \end{pmatrix} \sim N\left( \begin{pmatrix} \mu_1 \\ \mu_2 \end{pmatrix}, \begin{pmatrix} \Omega_{11} & \Omega_{12} \\ \Omega_{21} & \Omega_{22} \end{pmatrix} \right) \]
with:
We need to verify expressions (2.22), (2.23), and (2.24).
For a joint normal distribution, the conditional mean is:
\[ E[Y_1 | Y_2 = \mathbf{y}] = \mu_1 + \Omega_{12} \Omega_{22}^{-1} (\mathbf{y} - \mu_2) \]
Substituting the given values:
\[ E[Y(\mathbf{s}_0) | \mathbf{y}] = \mathbf{x}_0^T \boldsymbol{\beta} + \boldsymbol{\gamma}^T \boldsymbol{\Sigma}^{-1} (\mathbf{y} - \mathbf{X} \boldsymbol{\beta}) \]
Therefore:
\[ \boxed{E[Y(\mathbf{s}_0) | \mathbf{y}] = \mathbf{x}_0^T \boldsymbol{\beta} + \boldsymbol{\gamma}^T \boldsymbol{\Sigma}^{-1} (\mathbf{y} - \mathbf{X} \boldsymbol{\beta})} \]
This matches (2.22) exactly. ✓
For a joint normal distribution, the conditional variance is:
\[ \text{Var}[Y_1 | Y_2 = \mathbf{y}] = \Omega_{11} - \Omega_{12} \Omega_{22}^{-1} \Omega_{21} \]
Substituting the given values:
\[ \text{Var}[Y(\mathbf{s}_0) | \mathbf{y}] = \sigma^2 + \tau^2 - \boldsymbol{\gamma}^T \boldsymbol{\Sigma}^{-1} \boldsymbol{\gamma} \]
Therefore:
\[ \boxed{\text{Var}[Y(\mathbf{s}_0) | \mathbf{y}] = \sigma^2 + \tau^2 - \boldsymbol{\gamma}^T \boldsymbol{\Sigma}^{-1} \boldsymbol{\gamma}} \]
This matches (2.23) exactly. ✓
We want to express the conditional mean as a linear combination of \(\mathbf{y}\):
\[ E[Y(\mathbf{s}_0) | \mathbf{y}] = \lambda^T \mathbf{y} \]
The Generalized Least Squares (GLS) estimator of \(\boldsymbol{\beta}\) is:
\[ \hat{\boldsymbol{\beta}}_{\text{GLS}} = \left( \mathbf{X}^T \boldsymbol{\Sigma}^{-1} \mathbf{X} \right)^{-1} \mathbf{X}^T \boldsymbol{\Sigma}^{-1} \mathbf{y} \]
Starting from (2.22):
\[ E[Y(\mathbf{s}_0) | \mathbf{y}] = \mathbf{x}_0^T \boldsymbol{\beta} + \boldsymbol{\gamma}^T \boldsymbol{\Sigma}^{-1} (\mathbf{y} - \mathbf{X} \boldsymbol{\beta}) \]
Replace \(\boldsymbol{\beta}\) with \(\hat{\boldsymbol{\beta}}_{\text{GLS}}\):
\[ E[Y(\mathbf{s}_0) | \mathbf{y}] = \mathbf{x}_0^T \hat{\boldsymbol{\beta}}_{\text{GLS}} + \boldsymbol{\gamma}^T \boldsymbol{\Sigma}^{-1} \left( \mathbf{y} - \mathbf{X} \hat{\boldsymbol{\beta}}_{\text{GLS}} \right) \]
Expanding the second term:
\[ E[Y(\mathbf{s}_0) | \mathbf{y}] = \mathbf{x}_0^T \hat{\boldsymbol{\beta}}_{\text{GLS}} + \boldsymbol{\gamma}^T \boldsymbol{\Sigma}^{-1} \mathbf{y} - \boldsymbol{\gamma}^T \boldsymbol{\Sigma}^{-1} \mathbf{X} \hat{\boldsymbol{\beta}}_{\text{GLS}} \]
Grouping terms involving \(\hat{\boldsymbol{\beta}}_{\text{GLS}}\):
\[ E[Y(\mathbf{s}_0) | \mathbf{y}] = \boldsymbol{\gamma}^T \boldsymbol{\Sigma}^{-1} \mathbf{y} + \left( \mathbf{x}_0^T - \boldsymbol{\gamma}^T \boldsymbol{\Sigma}^{-1} \mathbf{X} \right) \hat{\boldsymbol{\beta}}_{\text{GLS}} \]
This gives us:
\[ \boxed{E[Y(\mathbf{s}_0) | \mathbf{y}] = \boldsymbol{\gamma}^T \boldsymbol{\Sigma}^{-1} \mathbf{y} + \left( \mathbf{x}_0^T - \boldsymbol{\gamma}^T \boldsymbol{\Sigma}^{-1} \mathbf{X} \right) \hat{\boldsymbol{\beta}}_{\text{GLS}}} \]
Note: The key step is factoring out \(\hat{\boldsymbol{\beta}}_{\text{GLS}}\) from \(\mathbf{x}_0^T \hat{\boldsymbol{\beta}}_{\text{GLS}} - \boldsymbol{\gamma}^T \boldsymbol{\Sigma}^{-1} \mathbf{X} \hat{\boldsymbol{\beta}}_{\text{GLS}}\) to get \((\mathbf{x}_0^T - \boldsymbol{\gamma}^T \boldsymbol{\Sigma}^{-1} \mathbf{X}) \hat{\boldsymbol{\beta}}_{\text{GLS}}\).
Substitute the GLS estimator into the expression:
\[ E[Y(\mathbf{s}_0) | \mathbf{y}] = \boldsymbol{\gamma}^T \boldsymbol{\Sigma}^{-1} \mathbf{y} + \left( \mathbf{x}_0^T - \boldsymbol{\gamma}^T \boldsymbol{\Sigma}^{-1} \mathbf{X} \right) \left( \mathbf{X}^T \boldsymbol{\Sigma}^{-1} \mathbf{X} \right)^{-1} \mathbf{X}^T \boldsymbol{\Sigma}^{-1} \mathbf{y} \]
Factor \(\mathbf{y}\) from both terms:
\[ E[Y(\mathbf{s}_0) | \mathbf{y}] = \left[ \boldsymbol{\gamma}^T \boldsymbol{\Sigma}^{-1} + \left( \mathbf{x}_0^T - \boldsymbol{\gamma}^T \boldsymbol{\Sigma}^{-1} \mathbf{X} \right) \left( \mathbf{X}^T \boldsymbol{\Sigma}^{-1} \mathbf{X} \right)^{-1} \mathbf{X}^T \boldsymbol{\Sigma}^{-1} \right] \mathbf{y} \]
Since \(E[Y(\mathbf{s}_0) | \mathbf{y}] = \lambda^T \mathbf{y}\), we have:
\[ \lambda^T = \boldsymbol{\gamma}^T \boldsymbol{\Sigma}^{-1} + \left( \mathbf{x}_0^T - \boldsymbol{\gamma}^T \boldsymbol{\Sigma}^{-1} \mathbf{X} \right) \left( \mathbf{X}^T \boldsymbol{\Sigma}^{-1} \mathbf{X} \right)^{-1} \mathbf{X}^T \boldsymbol{\Sigma}^{-1} \]
Taking the transpose of both sides:
\[ \lambda = \boldsymbol{\Sigma}^{-1} \boldsymbol{\gamma} + \boldsymbol{\Sigma}^{-1} \mathbf{X} \left( \mathbf{X}^T \boldsymbol{\Sigma}^{-1} \mathbf{X} \right)^{-1} \left( \mathbf{x}_0 - \mathbf{X}^T \boldsymbol{\Sigma}^{-1} \boldsymbol{\gamma} \right) \]
Therefore:
\[ \boxed{\lambda = \boldsymbol{\Sigma}^{-1} \boldsymbol{\gamma} + \boldsymbol{\Sigma}^{-1} \mathbf{X} \left( \mathbf{X}^T \boldsymbol{\Sigma}^{-1} \mathbf{X} \right)^{-1} \left( \mathbf{x}_0 - \mathbf{X}^T \boldsymbol{\Sigma}^{-1} \boldsymbol{\gamma} \right)} \]
This matches (2.24) exactly. ✓
| Expression | Result | Status |
|---|---|---|
| (2.22) | \(E[Y(\mathbf{s}_0) \| \mathbf{y}] = \mathbf{x}_0^T \boldsymbol{\beta} + \boldsymbol{\gamma}^T \boldsymbol{\Sigma}^{-1} (\mathbf{y} - \mathbf{X} \boldsymbol{\beta})\) | ✓ Verified |
| (2.23) | \(\text{Var}[Y(\mathbf{s}_0) \| \mathbf{y}] = \sigma^2 + \tau^2 - \boldsymbol{\gamma}^T \boldsymbol{\Sigma}^{-1} \boldsymbol{\gamma}\) | ✓ Verified |
| (2.24) | \(\lambda = \boldsymbol{\Sigma}^{-1} \boldsymbol{\gamma} + \boldsymbol{\Sigma}^{-1} \mathbf{X} (\mathbf{X}^T \boldsymbol{\Sigma}^{-1} \mathbf{X})^{-1} (\mathbf{x}_0 - \mathbf{X}^T \boldsymbol{\Sigma}^{-1} \boldsymbol{\gamma})\) | ✓ Verified |
Conditional Mean: The conditional mean of \(Y(\mathbf{s}_0)\) given observed data \(\mathbf{y}\) consists of:
Conditional Variance: The conditional variance is the unconditional variance minus the variance explained by the observed data.
Kriging Weights: The weight vector \(\lambda\) accounts for both:
These results form the foundation of universal kriging with measurement error.