Topic 5: Hypothesis Testing

These are the solutions for Computer Lab 6.

1 Carrying out a one-sample \(t\)-test in jamovi

1.1

1.2

Yield: Continuous
Density: Continuous
Locality: Nominal

1.3

1.4

Based on the histogram, we observe that the yield data is skewed to the right.

While this Normal Q-Q plot appears acceptable for theoretical quantile values between roughly \(-1\) and \(1\), for theoretical quantile values of greater magnitude the data diverges from the diagonal line (so the tails of the distribution do not match the tails of a Normal distribution).

The \(p\)-value computed by the Shapiro-Wilk test is \(0.008958\). As this is smaller than the significance level of \(\alpha = 0.05\), we reject the null hypothesis that the data follows a Normal distribution, and conclude, based on this test, that the yield data is non-normal.

However, we note that our sample size is \(n=84\), so despite assessing data with a non-normal underlying distribution, thanks to the Central Limit Theorem we can still conclude that the distribution of the sample mean is (approximately) normal.
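The Central Limit Theorem argument above can be illustrated with a small simulation. This is only a sketch: it does not use the onion data, and the exponential population, the seed and the number of replications are all assumptions chosen for illustration. It draws many samples of size \(n=84\) from a strongly right-skewed distribution and summarises the resulting sample means.

```python
import random
import statistics

random.seed(1)  # fixed seed so the sketch is reproducible

n = 84       # sample size used in the lab
reps = 5000  # number of simulated samples (arbitrary choice)

# Assumed population: exponential with mean 1, which is strongly right-skewed
# (population skewness 2). This stands in for "some non-normal distribution".
sample_means = [
    statistics.fmean([random.expovariate(1.0) for _ in range(n)])
    for _ in range(reps)
]

def skewness(values):
    """Third standardised central moment (0 for a symmetric distribution)."""
    m = statistics.fmean(values)
    sd = statistics.pstdev(values)
    return statistics.fmean([((v - m) / sd) ** 3 for v in values])

mean_of_means = statistics.fmean(sample_means)
sd_of_means = statistics.pstdev(sample_means)
skew_of_means = skewness(sample_means)

print(f"mean of sample means: {mean_of_means:.3f}")      # theory: 1
print(f"sd of sample means:   {sd_of_means:.3f}")        # theory: 1/sqrt(84)
print(f"skewness of means:    {skew_of_means:.3f}")      # theory: 2/sqrt(84)
```

Although the individual observations have skewness 2, the sample means should show much weaker skewness (in theory about \(2/\sqrt{84} \approx 0.22\)), which is why treating the sample mean as approximately normal is reasonable at this sample size.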

1.5

\(H_0: \mu = 115\) versus \(H_1: \mu \neq 115,\)

where:

\(\mu\) denotes the population average yield (in grams) of White Imperial Spanish onions.

1.6

The descriptives table provides a sample mean of \(\overline{x} = 119.7\) and a standard deviation of \(s = 53.052\).

Hence, as we already have \(n=84\), and \(\mu_0 = 115\), we have \[\begin{align*} t &= \dfrac{119.7-115}{53.052/\sqrt{84}} \approx 0.812. \end{align*}\]
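As a quick check of the hand calculation, the same arithmetic can be sketched in Python. The function name `one_sample_t` is hypothetical and is not part of jamovi or R; the inputs are the summary statistics quoted above.

```python
import math

def one_sample_t(xbar, mu0, s, n):
    """Return the one-sample t statistic and its degrees of freedom."""
    standard_error = s / math.sqrt(n)  # estimated standard error of the mean
    t = (xbar - mu0) / standard_error
    return t, n - 1

# Summary statistics for Yield, taken from the descriptives table.
t_yield, df_yield = one_sample_t(xbar=119.7, mu0=115, s=53.052, n=84)
print(f"t = {t_yield:.3f}, df = {df_yield}")  # t = 0.812, df = 83
```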

1.7

Compare your jamovi output to the following output from R:

## 
##  One Sample t-test
## 
## data:  wonions$Yield
## t = 0.81201, df = 83, p-value = 0.4191
## alternative hypothesis: true mean is not equal to 115
## 95 percent confidence interval:
##  108.1873 131.2132
## sample estimates:
## mean of x 
##  119.7002

1.8

The test statistic is \(0.812\), which is equal to the value calculated in 1.6 above.

1.9

The degrees of freedom are \(83\). For the one-sample \(t\)-test, this is found by computing \(n-1\).

1.10

The \(p\)-value is \(0.419\). This denotes the probability of observing a sample mean at least as far from 115 (in either direction) as the one we obtained (\(\overline{x} = 119.7\)), assuming the null hypothesis is true; that is, assuming the true mean is equal to 115.
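To see where a two-sided \(p\)-value of this size comes from, the sketch below integrates the Student's \(t\) density numerically using only the Python standard library. The helper names `t_pdf` and `t_cdf` are hypothetical, and Simpson's rule is just one convenient way to approximate the area; jamovi and R use their own routines.

```python
import math

def t_pdf(x, df):
    """Density of Student's t distribution with df degrees of freedom."""
    log_const = (math.lgamma((df + 1) / 2) - math.lgamma(df / 2)
                 - 0.5 * math.log(df * math.pi))
    return math.exp(log_const - (df + 1) / 2 * math.log1p(x * x / df))

def t_cdf(x, df, steps=2000):
    """P(T <= x): Simpson's rule on [0, |x|], then symmetry about 0.

    steps must be even for Simpson's rule.
    """
    a = abs(x)
    h = a / steps
    total = t_pdf(0.0, df) + t_pdf(a, df)
    for i in range(1, steps):
        total += (4 if i % 2 else 2) * t_pdf(i * h, df)
    area = total * h / 3
    return 0.5 + area if x >= 0 else 0.5 - area

t_obs, df = 0.81201, 83  # values reported in the R output
# Two-sided test: count both tails beyond |t_obs|.
p_two_sided = 2 * (1 - t_cdf(abs(t_obs), df))
print(f"two-sided p-value = {p_two_sided:.4f}")
```

The result should agree with the reported \(p\)-value to the precision shown in the output above.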

1.11

The \(95\%\) confidence interval for \(\mu\) is \((108.187, 131.213)\). Since we construct \((1-\alpha)\times 100\%\) confidence intervals, this tells us that our \(\alpha = 0.05\).
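For reference, the interval can be reproduced by hand from the summary statistics in 1.6. The critical value \(t_{0.975,\,83} \approx 1.989\) is not quoted in the output; it is taken here from the \(t\) distribution with 83 degrees of freedom, so treat this as a worked sketch: \[\begin{align*} \overline{x} \pm t_{0.975,\,83}\,\dfrac{s}{\sqrt{n}} &\approx 119.7 \pm 1.989 \times \dfrac{53.052}{\sqrt{84}} \\ &\approx 119.7 \pm 11.513 \\ &= (108.187,\ 131.213). \end{align*}\]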

1.12

Since our \(p\)-value \(> \alpha\) (i.e. \(0.419 > 0.05\)), we fail to reject the null hypothesis.

1.13

Our \(95\%\) confidence interval is \((108.187, 131.213)\). Since this interval contains \(\mu_0 = 115\), we cannot reject the null hypothesis. This decision matches our decision based on the \(p\)-value assessment.

1.14

The \(95\%\) confidence interval of \((108.187, 131.213)\) tells us that we are \(95\%\) confident that the true (population) average yield of White Imperial Spanish onions is between 108.187 and 131.213 grams per plant.

1.15

We have carried out a statistical analysis of the yield characteristics of White Imperial Spanish onions, to determine if the true average yield of these onions is different from \(115\) grams per plant. Our results suggest, with \(95\%\) confidence, that the true (population) average yield is between approximately 108 and 131 grams per plant. We do not have sufficient evidence to support the alternative hypothesis that the true population mean yield is different from 115 grams per plant. Therefore, we conclude that we do not have enough evidence to reject the original claim that the true (population) average yield of these onions is 115 grams per plant.

2 Carrying out a one-sample \(t\)-test in jamovi (`Density` variable)

2.1

2.2

Yield: Continuous
Density: Continuous
Locality: Nominal

2.3

2.4

Based on the histogram, we observe that the density data is skewed to the right.

The Normal Q-Q plot for the density data looks even worse than the one obtained for the yield data, and shows clear signs of non-normal behaviour.

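The \(p\)-value computed by the Shapiro-Wilk test is less than \(0.001\). As this is much smaller than the significance level of \(\alpha = 0.05\), we reject the null hypothesis that the data follows a Normal distribution, and conclude, based on this test, that the density data is non-normal. As with the yield data in 1.4, however, our sample size is \(n=84\), so thanks to the Central Limit Theorem we can still treat the distribution of the sample mean as (approximately) normal.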

2.5

\(H_0: \mu = 80\) versus \(H_1: \mu < 80,\)

where:

\(\mu\) denotes the population average planting density of White Imperial Spanish onions.

2.6

The descriptives table provides a sample mean of \(\overline{x} = 73.332\) and a standard deviation of \(s = 41.531\).

Hence, as we already have \(n=84\), and \(\mu_0 = 80\), we have \[\begin{align*} t &= \dfrac{73.332-80}{41.531/\sqrt{84}} \approx -1.472. \end{align*}\]

2.7

Compare your jamovi output to the following output from R:

## 
##  One Sample t-test
## 
## data:  wonions$Density
## t = -1.4714, df = 83, p-value = 0.07248
## alternative hypothesis: true mean is less than 80
## 95 percent confidence interval:
##     -Inf 80.8701
## sample estimates:
## mean of x 
##   73.3325

2.8

The test statistic is \(-1.471\), which is approximately equal to the value calculated in 2.6 above.

2.9

The degrees of freedom are \(83\). For the one-sample \(t\)-test, this is found by computing \(n-1\).

2.10

The \(p\)-value is \(0.072\). This denotes the probability of observing a sample mean at least as small as the one we obtained (\(\overline{x} = 73.332\)), assuming the null hypothesis is true; that is, assuming the true mean is equal to 80.
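Because the alternative hypothesis here is \(\mu < 80\), only the lower tail of the \(t\) distribution counts towards the \(p\)-value. The sketch below recomputes the test statistic from the summary statistics in 2.6 and then approximates the lower-tail area numerically. The helper names `t_pdf` and `t_cdf` are hypothetical, and Simpson's rule is an assumption of this sketch rather than the method used by jamovi or R.

```python
import math

def t_pdf(x, df):
    """Density of Student's t distribution with df degrees of freedom."""
    log_const = (math.lgamma((df + 1) / 2) - math.lgamma(df / 2)
                 - 0.5 * math.log(df * math.pi))
    return math.exp(log_const - (df + 1) / 2 * math.log1p(x * x / df))

def t_cdf(x, df, steps=2000):
    """P(T <= x): Simpson's rule on [0, |x|] (steps even), then symmetry."""
    a = abs(x)
    h = a / steps
    total = t_pdf(0.0, df) + t_pdf(a, df)
    for i in range(1, steps):
        total += (4 if i % 2 else 2) * t_pdf(i * h, df)
    area = total * h / 3
    return 0.5 + area if x >= 0 else 0.5 - area

# Summary statistics for Density, taken from the descriptives table.
xbar, mu0, s, n = 73.332, 80, 41.531, 84
t_obs = (xbar - mu0) / (s / math.sqrt(n))

# Lower-tailed test (H1: mu < 80): the p-value is P(T <= t_obs).
p_lower = t_cdf(t_obs, n - 1)
print(f"t = {t_obs:.3f}, one-sided p-value = {p_lower:.3f}")
```

Small differences from the software output in the later decimal places are expected, since the summary statistics used here are rounded.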

2.11

The \(95\%\) confidence interval for \(\mu\) is \((-\infty, 80.870)\). Since we construct \((1-\alpha)\times 100\%\) confidence intervals, this tells us that our \(\alpha = 0.05\).
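For a lower-tailed test the interval is one-sided, so only an upper bound is computed and all of \(\alpha = 0.05\) sits in one tail. As a worked sketch using the summary statistics in 2.6, and taking the critical value \(t_{0.95,\,83} \approx 1.6634\) from the \(t\) distribution with 83 degrees of freedom (it is not quoted in the output): \[\begin{align*} \overline{x} + t_{0.95,\,83}\,\dfrac{s}{\sqrt{n}} &\approx 73.332 + 1.6634 \times \dfrac{41.531}{\sqrt{84}} \\ &\approx 73.332 + 7.538 \\ &= 80.870. \end{align*}\]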

2.12

Since our \(p\)-value \(> \alpha\) (i.e. \(0.072 > 0.05\)), we fail to reject the null hypothesis. Note, however, that this is a “close-to-significant” result.

2.13

Our \(95\%\) confidence interval is \((-\infty, 80.870)\). Since this interval contains \(\mu_0 = 80\), we cannot reject the null hypothesis. This decision matches our decision based on the \(p\)-value assessment.

2.14

The \(95\%\) confidence interval of \((-\infty, 80.870)\) tells us that we are \(95\%\) confident that the true (population) average planting density of White Imperial Spanish onions is 80.870 plants per m\(^2\) or less.

2.15

We have carried out a statistical analysis of the planting density characteristics of White Imperial Spanish onions, to determine if the population average planting density of these onions is less than \(80\) plants per m\(^2\). Our results suggest, with \(95\%\) confidence, that the true (population) average density is approximately 81 plants per m\(^2\) or less. Since this range includes 80, we do not have sufficient evidence to support the alternative hypothesis that the true population mean planting density is less than \(80\) plants per m\(^2\). Therefore, we conclude that we do not have enough evidence to reject the original claim that the true (population) average planting density of these onions is \(80\) plants per m\(^2\).

3 Assessing Normal Q-Q plots

Note that only plots A and E show data generated from a Normal distribution.

Plot B, which plots data generated from a Poisson distribution, shows a clear violation of the normality assumption.

Plots C and F (both with data generated from a Student’s t distribution) are not too bad. The underlying distribution is symmetrical, but often the dots pull away at the extremities of the Q-Q plots, due to the fatter tails of the Student’s t distribution (compared to the tails of a Normal distribution).

Plot D is difficult, because it certainly looks as if it satisfies the normality assumption, despite some fluctuations at the extremities. However, the data for this plot are actually generated from a Weibull distribution!

Note that for both plots A and E, at the theoretical quantiles of higher magnitude there are some minor deviations from the diagonal line. However, in practice this is common, and to be expected (to a degree). As always, you should support your analysis with multiple checks (for example, a histogram and a Shapiro-Wilk test alongside the Q-Q plot), to ensure you have a robust understanding of the data.
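To make the idea of a Normal Q-Q plot concrete, the sketch below builds the plotted points by hand: the sorted data are paired with the quantiles a standard Normal distribution would give at the same positions. The helper names, the simulated samples and the plotting positions \((i - 0.5)/n\) are assumptions for illustration; jamovi may use a slightly different convention. The correlation between the two sets of quantiles is a rough numerical summary of how closely the points follow a straight line.

```python
import random
from statistics import NormalDist, fmean, pstdev

def qq_points(sample):
    """Return (theoretical, observed) quantiles for a Normal Q-Q plot."""
    ordered = sorted(sample)
    n = len(ordered)
    std_normal = NormalDist()  # standard Normal: mean 0, sd 1
    # Plotting positions (i - 0.5) / n: one common convention (assumed here).
    theoretical = [std_normal.inv_cdf((i - 0.5) / n) for i in range(1, n + 1)]
    return theoretical, ordered

def correlation(xs, ys):
    """Pearson correlation between two equal-length lists."""
    mx, my = fmean(xs), fmean(ys)
    cov = fmean([(x - mx) * (y - my) for x, y in zip(xs, ys)])
    return cov / (pstdev(xs) * pstdev(ys))

random.seed(2)  # fixed seed so the sketch is reproducible
normal_sample = [random.gauss(0, 1) for _ in range(200)]
skewed_sample = [random.expovariate(1.0) for _ in range(200)]  # right-skewed

for label, sample in [("Normal", normal_sample), ("Exponential", skewed_sample)]:
    theo, obs = qq_points(sample)
    print(f"{label}: Q-Q correlation = {correlation(theo, obs):.3f}")
```

A correlation very close to 1 corresponds to points hugging the diagonal line; for skewed data the value is typically lower, reflecting the curvature seen in the plot.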

If there were any parts you were unsure about, take a look back over the relevant sections of the Topic 5 material.

References

These notes have been prepared by Amanda Shaker and Rupert Kuveke. The copyright for the material in these notes resides with the authors named above, with the Department of Mathematical and Physical Sciences and with La Trobe University. Copyright in this work is vested in La Trobe University including all La Trobe University branding and naming. Unless otherwise stated, material within this work is licensed under a Creative Commons Attribution-Non Commercial-Non Derivatives License BY-NC-ND.

STM1001: Computer Lab 6 Solutions (jamovi)