Motivation to do two population hypothesis tests arise from the need to compare the population mean of one to the other.
Example: Is the average sales of agents that were trained using system X larger than those agents that were trained using system Y.
Example: Is the average heartrate of the group who have taken drug A lower than those who did not.
Example: Is the average salary of a Analytics Major more than the average salary of a Finance major.
You are interested in making a statement about whether one population’s average is more/less/different than the other.
The key is that there are two separate populations from which you observe 2 separate samples.
Ex: Is the average salary of a Analytics Major more than the average salary of a Finance major.
There is the population of Analytics Majors and the population of Finance Majors. You will observe 2 separate samples from these populations.
\[\begin{align} H_{0}:& \mu_{A} - \mu_{F} \le 0 \\ H_{a}:& \mu_{A} - \mu_{F} > 0 \end{align}\]
\[\begin{align} H_{0}:& \mu_{Nic} - \mu_{Hon} = 0 \\ H_{a}:& \mu_{Nic} - \mu_{Hon} \neq 0 \end{align}\]
\[\begin{align} H_{0}:& \mu_{Meat only} - \mu_{Balan.} \ge 0 \\ H_{a}:& \mu_{Meat only} - \mu_{Balan.} < 0 \end{align}\]
\[\begin{align} H_{0}:& \mu_{Before-After} \leq 0 \\ H_{a}:& \mu_{Before-After} > 0 \end{align}\]
-Note that the above is also equivalent to \[\begin{align} H_{0}:& \mu_{After-Before} \ge 0 \\ H_{a}:& \mu_{After-Before} < 0 \end{align}\]
Sometimes students are confused about whether they should choose paired samples t-test or independent samples t-test in a given problem.
If there are 2 separate groups from which you observe 2 samples, this leads to an independent samples t-test.
If the same set of individuals (not neccessarily human beings) are observed twice, this leads to paired samples t-test.
The principles we learned in single population hypothesis tests are still relevant and true.
-If population means are known there is no grounds for hypothesis testing.
\[\begin{align} H_{0}:& \mu_{NotSued} - \mu_{Sued} \leq 0 \\ H_{a}:& \mu_{NotSued} - \mu_{Sued} > 0 \end{align}\]
\(\mu\) represents the average price of the houses sold by the group specified in the subscript.
This is a directional hypothesis test. The p-values we will look at will be called 1 tail p-value.
Excel calculates the \(t_{computed}\).
If \(t_{computed}\) is positive as in this case, 1 tail p value is calculated as the area to the right of \(t_{computed}\) under the curve.
If \(t_{computed}\) is negative, 1 tail p value is calculated as the area to the left of \(t_{computed}\) under the curve.
This is where we set up the range of data which contains saple 1 and sample 2 as well as alpha and hypothesized mean differences
This output contains results from 2 hypothesis tests. The p values and t computed as well as t criticals are all here.
The output in Excel contains results from multiple hypothesis tests. You should be able to pick the relevant set.
The alternative hypothesis always tests \(\mu_{Var1}-\mu_{Var2}>0\) where \(Var_1\) is associated with the Variable 1 range of values you entered on Section 14 . In our example Var1 consists of 469 observations and Var2 which consists of the values you entered in Variable 2 range contains 49 observations.
We assumed unequal variances since the sample sizes are so different from each other. We will not discuss this at this point of the course but it can be important.
What if we were researching whether the company NOT being sued has a lower average sale price compared to the company being sued? \[\begin{align} H_{0}:& \mu_{NotSued} - \mu_{Sued} \geq 0 \\ H_{a}:& \mu_{NotSued} - \mu_{Sued} < 0 \end{align}\]
There would be no reason to look at p values.
The sample mean differences 94117.697(sample mean of sale prices from the company not being sued) and 87112.245 (sample mean of sale prices from the company being sued) is consistent with the null hypothesis. Therefore we can not reject the null hypothesis is all we need to say.
\[\begin{align} H_{0}:& \mu_{NotSued} - \mu_{Sued} = 0 \\ H_{a}:& \mu_{NotSued} - \mu_{Sued} \neq 0 \end{align}\]
\[\begin{align} H_{0}:& \mu_{After-Before} \leq 0 \\ H_{a}:& \mu_{After-Before} > 0 \end{align}\]
equivalent to
\[\begin{align} H_{0}:& \mu_{Before-After} \geq 0 \\ H_{a}:& \mu_{Before-After} < 0 \end{align}\]
Unlike the Independent Samples t-test pairs of values need to be matched in Paired Samples t-test on the excel columns. In this case Column B and C
Similar to Independent Samples t-test
Similar to Independent Samples t-test