DATA 605 Discussion 11

The dataset contains observations about income and happiness taken from a sample of 500 people.

Linear Regression Analysis

Is there a linear relationship between income and happiness?

income_dataset_happiness_lm <- lm(happiness ~ income, data = income_dataset)
summary(income_dataset_happiness_lm)

## 
## Call:
## lm(formula = happiness ~ income, data = income_dataset)
## 
## Residuals:
##      Min       1Q   Median       3Q      Max 
## -2.02479 -0.48526  0.04078  0.45898  2.37805 
## 
## Coefficients:
##             Estimate Std. Error t value Pr(>|t|)    
## (Intercept)  0.20427    0.08884   2.299   0.0219 *  
## income       0.71383    0.01854  38.505   <2e-16 ***
## ---
## Signif. codes:  0 '***' 0.001 '**' 0.01 '*' 0.05 '.' 0.1 ' ' 1
## 
## Residual standard error: 0.7181 on 496 degrees of freedom
## Multiple R-squared:  0.7493, Adjusted R-squared:  0.7488 
## F-statistic:  1483 on 1 and 496 DF,  p-value: < 2.2e-16

Check if the residual means are close to zero. In this case they are as they hug the red lines in the graphs, which means our model is valid, and we can contune with our study.

par(mfrow = c(2,2))
plot(income_dataset_happiness_lm)

par(mfrow=c(1,1))

income_dataset_graph<-ggplot(income_dataset, aes(x=income, y=happiness))+
                     geom_point()
income_dataset_graph

income_dataset_graph <- income_dataset_graph + geom_smooth(method="lm", col="black")

income_dataset_graph

DATA 605 Discussion 11

Income vs Happiness

Stephen Haslett

11/03/2020

Does the dependent variable follow a normal distribution?

Is the relationship between the indepentent and dependant variable linear?

Linear Regression Analysis