Motivation to do two population hypothesis tests arise from the need to compare the population mean of one to the other.
Example: Is the average sales of agents that were trained using system X larger than those agents that were trained using system Y.
Example: Is the average heartrate of the group who have taken drug A lower than those who did not.
Example: Is the average salary of a Analytics Major more than the average salary of a Finance major.
You are interested in making a statement about whether one population’s average is more/less/different than the other.
The key is that there are two separate populations from which you observe 2 separate samples.
Ex: Is the average salary of a Analytics Major more than the average salary of a Finance major.
There is the population of Analytics Majors and the population of Finance Majors. You will observe 2 separate samples from these populations.
\[\begin{align} H_{0}:& \mu_{A} - \mu_{F} \le 0 \\ H_{a}:& \mu_{A} - \mu_{F} > 0 \end{align}\]
\[\begin{align} H_{0}:& \mu_{Nic} - \mu_{Hon} = 0 \\ H_{a}:& \mu_{Nic} - \mu_{Hon} \neq 0 \end{align}\]
\[\begin{align} H_{0}:& \mu_{Meat only} - \mu_{Balan.} \ge 0 \\ H_{a}:& \mu_{Meat only} - \mu_{Balan.} < 0 \end{align}\]
\[\begin{align} H_{0}:& \mu_{Before-After} \leq 0 \\ H_{a}:& \mu_{Before-After} > 0 \end{align}\]
-Note that the above is also equivalent to \[\begin{align} H_{0}:& \mu_{After-Before} \ge 0 \\ H_{a}:& \mu_{After-Before} < 0 \end{align}\]
Sometimes students are confused about whether they should choose paired samples t-test or independent samples t-test in a given problem.
If there are 2 separate groups from which you observe 2 samples, this leads to an independent samples t-test.
If the same set of individuals (not neccessarily human beings) are observed twice, this leads to paired samples t-test.
The principles we learned in single population hypothesis tests are still relevant and true.
-If population means are known there is no grounds for hypothesis testing.
\[\begin{align} H_{0}:& \mu_{NotSued} - \mu_{Sued} \leq 0 \\ H_{a}:& \mu_{NotSued} - \mu_{Sued} > 0 \end{align}\]
\(\mu\) represents the average price of the houses sold by the group specified in the subscript.
This is a directional hypothesis test. The p-values we will look at will be called 1 tail p-value.
Excel calculates the \(t_{computed}\).
If \(t_{computed}\) is positive as in this case, 1 tail p value is calculated as the area to the right of \(t_{computed}\) under the curve.
If \(t_{computed}\) is negative, 1 tail p value is calculated as the area to the left of \(t_{computed}\) under the curve.
This is where we set up the range of data which contains saple 1 and sample 2 as well as alpha and hypothesized mean differences
This output contains results from 2 hypothesis tests. The p values and t computed as well as t criticals are all here.
The output in Excel contains results from multiple hypothesis tests. You should be able to pick the relevant set.
The alternative hypothesis always calculates the p value in the tail of where \(\bar{Var}_{1}-\bar{Var}_{2}\) lands. In this notation \(\bar{Var}_{1}\) is associated with the sample mean of Variable 1 range of values you entered on Section 14 and of course \(\bar{Var}_{2}\) is associated with the sample mean of Variable 2 range of values you entered on Section 14.
If the sample mean differences are positive p value is to the right of \(t_{computed}\). If the sample mean differences are negative the p value is to the left of the \(t_{computed}\)
In our example the sample mean differences were positive so excel assume what the alternative hypothesis was for us when calculating the p value.
Var1 and Var2 consists of 469 and 49 observations respectively. These are the values you entered in Variable 1 and 2 ranges. We assume unequal variances since the sample sizes and sample variances are so different from each other.
What if we were researching whether the company NOT being sued has a lower average sale price compared to the company being sued? \[\begin{align} H_{0}:& \mu_{NotSued} - \mu_{Sued} \geq 0 \\ H_{a}:& \mu_{NotSued} - \mu_{Sued} < 0 \end{align}\]
There would be no reason to look at p values.
The sample mean differences is consistent with the null hypothesis. Therefore we can not reject the null hypothesis is all we need to say.
\[\begin{align} H_{0}:& \mu_{NotSued} - \mu_{Sued} = 0 \\ H_{a}:& \mu_{NotSued} - \mu_{Sued} \neq 0 \end{align}\]
\[\begin{align} H_{0}:& \mu_{After-Before} \leq 0 \\ H_{a}:& \mu_{After-Before} > 0 \end{align}\]
equivalent to
\[\begin{align} H_{0}:& \mu_{Before-After} \geq 0 \\ H_{a}:& \mu_{Before-After} < 0 \end{align}\]
Unlike the Independent Samples t-test pairs of values need to be matched in Paired Samples t-test on the excel columns. In this case Column B and C
Similar to Independent Samples t-test
Similar to Independent Samples t-test