Recall the following problem. Let \(X\) represents the number of cars driving past a parking ramp in any given hour, and suppose that \(X\sim POI(\lambda)\). Given \(X=x\) cars driving past a ramp, let \(Y\) represent the number that decide to park in the ramp, with \(Y|X=x\sim BIN(x,p)\). We’ve shown analytically that unconditionally, \(Y\sim POI(\lambda p)\). Verify this result via a simulation study over a grid of \(\lambda \in \{10, 20, 30\}\) and \(p\in \{0.2, 0.4, 0.6\}\) using the framework presented in your notes, with 10,000 simulated outcomes per \((\lambda, p)\) combination. Create a faceted plot of the overlaid analytic and empirial CDFs as well as the p-p plot.
(ggplot(aes(x = Y), data = simstudy)+geom_step(aes(y = Fhat_Y, color ='Simulated CDF'))+geom_step(aes(y = F_Y, color ='Analytic CDF'))+labs(y =expression(P(Y<= y)), x ='y', color ='')+theme_classic(base_size =12)+facet_grid(lambda~p, labeller = label_both, scales ='free_x') )
(ggplot(aes(x = Fhat_Y, y = F_Y), data = simstudy)+geom_point()+labs(y =expression(P(Y<= y)), x =expression(hat(P)(Y <=y)))+geom_abline(intercept =0, slope =1)+theme_classic()+facet_grid(lambda~p, labeller = label_both) )
Question 2
In this problem we will study the relationship between \(\alpha\) and \(Var(Y)\) for fixed \(\mu\) for the beta distribution.
Recall that if \(Y\sim BETA(\alpha,\beta)\) and with \(\mu \equiv E(Y)\), \(\beta = \alpha \cdot \frac{1-\mu}{\mu}\).
Use this fact to simulate 10,000 realizations of \(Y\sim BETA(\alpha,\beta)\) for each combination of \(\alpha \in \{2,4,8,16\}\) and \(\mu\in\{0.1, 0.2, 0.3, 0.4, 0.5, 0.6, 0.7\}\).
Create a plot of the simulated variances as a function of \(\alpha\). Use lines as your geometry, and color-code each line by \(\mu\).
(ggplot(aes(x = alpha, y = sim_Var, color =factor(mu), group = mu), data = updated_simstudy)+geom_line()+theme_classic()+labs(x =expression(alpha), y =expression(Var(Y)), color ='')+scale_color_brewer(palette ="Dark2",labels =function(x) paste0("μ =", x)))
D)
Comment on how the combination of \(\alpha\) and \(\mu\) impact variance.
We see that for each \(\mu\), the variance decreases as \(\alpha\) increases. When we hold \(\alpha\) constant, we see that the variance increases as we increase \(\mu\). We also see that the vertical gaps between the different \(\mu\) levels shrinks as we increase \(\alpha\).