9.4/sqrt((507))
## [1] 0.4174687
436 randomly sampled American adults;
Average: $84.71 [95% confidence interval ($80.31,$89.11)]
(a) We are 95% confident that the average spending of these 436 American adults is between $80.31 and $89.11.
False. We can 100% confirm that the average spending of these 536 American adults is $84.71. The 95% confidence interval is for the whole population.
(b) This confidence interval is not valid since the distribution of spending in the sample is right skewed.
False. The skew is not strong enough to fail the sample mean.
(c) 95% of random samples have a sample mean between $80.31 and $89.11.
False. The range concluded is relied on this specific sample.
(d) We are 95% confident that the average spending of all American adults is between $80.31 and $89.11.
True. This conclusion is drawn directly by the definition of a confidence interval. (e) A 90% confidence interval would be narrower than the 95% confidence interval since we don’t need to be as sure about our estimate.
True. A more specific confidence interval leads to less accuracy.
(f) In order to decrease the margin of error of a 95% confidence interval to a third of what it is now, we would need to use a sample 3 times larger.
False. Margin of Error = \(z*SE\)=\(z*\frac{s}{\sqrt n}\). The sample size needs to be 9 times larger to achieve \(\frac{1}{3}\) of current margin of error.
(g) The margin of error is 4.4.
True. Margin of Error = sample mean - lower confidence interval = $84.71-$80.31 = 4.4.
\(SE_{\bar{x}}\)=\(\frac{s}{\sqrt n}\)
se <- 4.31/sqrt(36)
se
## [1] 0.7183333
\(Z\)=\(\frac{bar{x}-null value}{SE_{\bar{x}}}=\frac{30.69-32}{SE}\)
z <- (30.69-32)/se
pnorm(z)
## [1] 0.0341013
\(\because p-value\) = 0.034 < 0.10 = \(\alpha\)
\(\therefore\) we reject the null hypothesis.
(c) Interpret the p-value in context of the hypothesis test and the data.
The smaller the p-value, the stronger the data favor \(H_A\) over \(H_0\). A small p-value (usually < 0.05) corresponds to sufficient evidence to reject \(H_0\) in favor of \(H_A\).
(d) Calculate a 90% confidence interval for the average age at which gifted children first count to 10 successfully.
mean <- 30.69
lower <- mean - 1.645 * se
upper <- mean + 1.645 * se
c(lower,upper)
## [1] 29.50834 31.87166
(a) Perform a hypothesis test to evaluate if these data provide convincing evidence that the average IQ of mothers of gifted children is different than the average IQ for the population at large, which is 100. Use a significance level of 0.10.
\(H_0\): \(\mu\)=100
\(H_A\): $$100
\(SE_{\bar{x}}\)=\(\frac{s}{\sqrt n}\)
se <- 6.5/sqrt(36)
se
## [1] 1.083333
\(Z\)=\(\frac{bar{x}-null value}{SE_{\bar{x}}}=\frac{118.2-100}{SE}\)
mean <- 118.2
hy_mean <- 100
z <- (mean - hy_mean)/se
1 - pnorm(z)
## [1] 0
\(\because p-value\) = 0 < 0.10 = \(\alpha\)
\(\therefore\) we reject the null hypothesis.
(b) Calculate a 90% confidence interval for the average IQ of mothers of gifted children.
lower <- mean - 1.645 * se
upper <- mean + 1.645 * se
c(lower,upper)
## [1] 116.4179 119.9821
Define the term “sampling distribution” of the mean, and describe how the shape, center, and spread of the sampling distribution of the mean change as sample size increases.
The sampling distribution represents the distribution of the point estimates based on samples of a fixed size from a certain population.
With larger n, the sampling distribution of \(\bar{x}\) becomes more normal. As the sample size increases, the normal model for \(\bar{x}\) becomes more reasonable. We can also relax our condition on skew when the sample size is very large.
nearly normal with a mean of 9,000 hours and a standard deviation of 1,000 hours.
mean <- 9000
sd <- 1000
se <- sd
z <- (10500 - mean)/se
1 - pnorm(z)
## [1] 0.0668072
se <- sd/sqrt(15)
z <- (10500 - mean)/se
1 - pnorm(z)
## [1] 3.133452e-09
plot(seq(4000,14000,30),dnorm(seq(4000,14000,30),mean = mean,sd = sd))
(e) Could you estimate the probabilities from parts (a) and (c) if the lifespans of light bulbs had a skewed distribution?
A special case of the Central Limit Theorem ensures the distribution of sample means will be nearly normal, regardless of sample size, when the data come from a nearly normal distribution. If this condition is not met, no assumption should be made.
sample size = 50
p-value = 0.08
sample size should be 500 not 50
Will your p-value increase, decrease, or stay the same? Explain.
\(SE\bar{x}=\frac{sd}{\sqrt n}\)
As \(SE\) change from \(\frac{sd}{\sqrt{50}}\) to \(\frac{sd}{\sqrt{500}}\), SE decreases and therefore Z-score increases and p-value decreases.