Near Abbey Pond VT in Summer 2016:
November 8th, 2016
Near Abbey Pond VT in Summer 2016:
Important variable in forest ecology. Surrogate for age, shadow cast, tree height/size, basal area/biomass.
Random sample of 6 trees:
| species | x | y | dbh08 | dbh14 |
|---|---|---|---|---|
| Service Berry | -114.5 | 148.7 | 3.565 | 3.565 |
| Pignut Hickory | 216.9 | 129.6 | 35.014 | 36.065 |
| Black Cherry | -21.7 | 12.7 | 5.634 | 5.570 |
| Shagbark Hickory | -2.0 | 20.0 | 10.250 | 10.250 |
| Shagbark Hickory | 63.6 | 199.8 | 8.594 | 9.199 |
| Black Oak | 260.7 | 222.4 | 32.308 | 34.568 |
Convert DBH's to outcome variable: annual growth
| species | x | y | annual growth |
|---|---|---|---|
| Service Berry | -114.5 | 148.7 | 0.000 |
| Pignut Hickory | 216.9 | 129.6 | 0.175 |
| Black Cherry | -21.7 | 12.7 | -0.011 |
| Shagbark Hickory | -2.0 | 20.0 | 0.000 |
| Shagbark Hickory | 63.6 | 199.8 | 0.101 |
| Black Oak | 260.7 | 222.4 | 0.377 |
Color represents top 15 species (in terms of biomass):
Observed annual growth:
Can we model the annual growth of a tree while accounting for
This translates to a statistical model that needs to incorporate:
Effect of American Basswoods on growth of Witch Hazels via \(\lambda_{AB, WH}\). Aid or impede?
| American Basswood | Witch Hazel |
|---|---|
We display 4 (of 34) estimates of \(\mu_i\) in red: sample mean \(\overline{y}_i\).
At the root, model assumes:
\[ \mbox{observed annual growth} \sim \mbox{Normal}\left(\mu_i, \sigma^2\right) \]
where \(\mu_i\) incorporates all
Example: Say \(\mu_{AB}=0.20\), the observed growth of 1000 American Basswood trees might be:
Small sample size for certain species…
… but also for pairs of species. How can we estimate \(\lambda_{ij}\)?
To infer about \(\mu\) based on observations \(\vec{y}\):
Who cares? See blackboard
| binomial classification | family | species |
|---|---|---|
| Quercus alba | Fagaceae | White Oak |
| Quercus coccinea | Fagaceae | Scarlet Oak |
| Quercus ellipsoidalis | Fagaceae | Northern Pin Oak |
| Quercus rubra | Fagaceae | Red Oak |
| Quercus velutina | Fagaceae | Black Oak |
| Acer rubrum | Sapindaceae | Red Maple |
| Acer saccharum | Sapindaceae | Sugar Maple |
We display 4 (of 34) estimates \(\overline{y}_i\) of \(\mu_i\) in red.
In blue we mark the estimate of the hyperparameter \(\mu\):
The model for the observed annual growth of a specific tree, so far, incorporates:
For species \(i\), we model
\[ \mbox{observed annual growth} \sim \mbox{Normal}\left(\mu_i, \sigma^2\right) \]
We focus only on two species for now:
| species | count | median dbh | total biomass |
|---|---|---|---|
| Red Maple | 7584 | 5.09 | 55.01 |
| Red Oak | 160 | 31.08 | 17.12 |
Even with small \(n_i\), using Bayesian Hierarchical models can incorporate inherent contextual structure in the data to nevertheless obtain reliable estimates of all parameters.
There are in general two ways to derive a distribution:
| Analytically (Pen & Paper) | Via Simulation |
|---|---|
So say the unknown distribution \(\pi(\mu|\vec{y})\) is this. Without knowing the true shape of this distribution…
… we simulate random samples using MCMC and approximate all quantities.