9 December 2021
Arise from: - sampling methodology (e.g., RDD, landline/mobile mix; quotas from web panels) - weighting procedures and selection of weighting variables (raking; propensity score matching) - survey mode (live interviewer, IVR, web self-complete) - question wording and question ordering effects - response options (are minor parties or DK offered or volunteered?; are DKs pushed?) - field operations (time of day, day of week) - reporting conventions (DKs reported or not) - compounded in low or uncertain voter turnout environments
Measurement model: \(\color{cyan}{y_{p}} \sim N(\color{orange}{\xi_{t(p)}} + \color{orange}{\delta_{j(p)}} \, , \, \color{cyan}{V_p})\)
Dynamic model: \(\color{orange}{\xi_t} \sim N(\color{orange}{\xi_{t-1}}, \color{orange}{\omega^2})\), with the endpoint constraints from election results observed on \(\color{orange}{\xi_1}\) and \(\color{orange}{\xi_T}\).
Given published polls, \(\color{cyan}{\boldsymbol{Y}}\), sample sizes, field dates and identity of polling companies — and the model — we seek
R and C/C++jags via rjags (Plummer 2019), see Jackman (2009).Stan via RStan (Stan Development Team 2020)nimble (de Valpine et al. 2017)pomp (King et al. 2016)| n | |
|---|---|
| Essential | 108 |
| Ipsos | 18 |
| Morgan F2F | 12 |
| Newspoll | 61 |
| ReachTEL | 15 |
| YouGov | 12 |
latent \(\xi_t\) is a \(K\) vector, subject to restriction that \(\sum_{k=1}^K \xi_{tk} = 1 \forall t\).
2 approaches:
examples: multi-candidate elections (e.g., Iowa caucuses), with drop-out and drop-in.
multiple jurisdictions: e.g., each state in US presidential elections, \(M \approx 50\) filters running in parallel, with high dependencies in trajectories (voters in different states consuming same information, the “nationalisation” of politics and campaigns)