December 14, 2015

WF ED 540 Final Exam - Task 1

Goal:

Using data consisting of observations on six variables for 52 tenure-track professors employed by a relatively low-enrollment college, our task was to analyze whether there is gender discrimination in University Professors' salaries.

1a. Box Plots (1 of 2)

i. For sl (academic year salary) by sx (sex, coded 0 for male and 1 for female)

1a. Box Plots (2 of 2)

ii. For sl (academic year salary) by dg (Highest degree, coded 0 for masters and 1 for doctoral degree)

1b. Scatterplots of Points (1 of 2)

With a Smooth Line Among Points - sl (academic year salary) by yd (Number of years since highest degree was earned)

1b. Scatterplots of Points (2 of 2)

With a Smooth Line Among Points - sl (academic year salary) by yr (Number of years in current rank)

1c. Scatterplot of points

Plotted with a linear model, and 95% confidence interval for the model - sl (academic year salary) by yd (Number of years since highest degree was earned).

1d. Scatterplot of points

sl (academic year salary) by yr (Number of years in current rank) grouped by rk (academic rank, coded 1 for assistant professor, 2 for associate professor, and 3 for full professor).

2. Linear Regression

A simple linear regression with sl as the dependent variable and sx, yr, dg, yd, and a recoded rk variable as independent variables.

2a. Is the entire set of independent variables related to the dependent variable?

2b. Is sl related to sx?

2c. The interval estimate and 95% confidence interval for sx

Here is the 95% confidence interval for sx:

Since we failed to reject the null hypothesis that there is no relationship between sl and sx, the interpretation of this confidence interval is meaningless.

3. A closer look at the relationship between sl and sx

Our next task was to compute and report a new regression equation with sl as the dependent variable and sx as the sole independent variable. Then compute a t-test of the difference in mean sl by sx. From there, I will describe whether and how the results about the relationship between sl and sx from the regression analysis and from the t-test are similar.

3 - Part 1. A new regression equation with sl as the dependent variable and sx as the sole independent variable.

3 - Part 2. Is sl related to sx?

3 - Part 3. Compute a t-test of the difference in mean sl by sx

## 
##  Two Sample t-test
## 
## data:  sl by sx
## t = 1.8474, df = 50, p-value = 0.0706
## alternative hypothesis: true difference in means is not equal to 0
## 95 percent confidence interval:
##  -291.257 6970.550
## sample estimates:
## mean in group 0 mean in group 1 
##        24696.79        21357.14

3 - Part 4. Are the results about the relationship between sl and sx from the regression analysis and from the t-test similar (and if so, how)?

The results of the regression equation with sl as the dependent variable and sx as the sole independent variable is the same as the t-test of the difference in mean sl by sx. Both reveal of p-value of 0.07 (rounded). Since my α = .05, then the p-value, 0.07 (rounded), is more than α. Therefore both test fail to reject the null hypothesis that there is no difference in academic year salary between the men and women. In other words, there does not seem to be a relationship between academic year salary and sex in this small sample of University faculty.