Description of the data, data_quiz5
countrycontinentlifeExp life expectancy in yearpop total populationgdpPercap GDP per capita in U.S. dollarHint: The data is posted in Moodle. Look for data_quiz5.csv under the Data Files section.
QuizData <- read.csv("data_quiz5 (4).csv")
Hint: Use head() to display the first six rows.
head(QuizData)
## country continent lifeExp pop gdpPercap
## 1 Albania Europe 76.423 3600523 5937.030
## 2 Algeria Africa 72.301 33333216 6223.367
## 3 Argentina Americas 75.320 40301927 12779.380
## 4 Australia Oceania 81.235 20434176 34435.367
## 5 Austria Europe 79.829 8199783 36126.493
## 6 Bahrain Asia 75.635 708573 29796.048
Hint: Create a scatter plot to examine the relationship between GDP per capita (mapped to y-axis) and life expectancy (mapped to x-axis).
library(tidyverse)
ggplot(QuizData,
aes(x= lifeExp,
y=gdpPercap)) +
geom_point()
data(QuizData, package="mosaicData")
GdpPercap_1m <- lm(gdpPercap ~ lifeExp,
data = QuizData)
# View summary of model 1
summary(GdpPercap_1m)
##
## Call:
## lm(formula = gdpPercap ~ lifeExp, data = QuizData)
##
## Residuals:
## Min 1Q Median 3Q Max
## -17319.8 -4512.4 -63.2 3443.1 24014.4
##
## Coefficients:
## Estimate Std. Error t value Pr(>|t|)
## (Intercept) -215340.5 18057.2 -11.93 <2e-16 ***
## lifeExp 3075.6 237.6 12.94 <2e-16 ***
## ---
## Signif. codes: 0 '***' 0.001 '**' 0.01 '*' 0.05 '.' 0.1 ' ' 1
##
## Residual standard error: 7578 on 81 degrees of freedom
## Multiple R-squared: 0.6741, Adjusted R-squared: 0.6701
## F-statistic: 167.5 on 1 and 81 DF, p-value: < 2.2e-16
I would say that yes the cofficient of life expectancy will be statictally significant at 5%. the P value is under 5%, being 2.2e-16
Hint: Discuss both its sign and magnitude.
Every dollar increased in gdp the life ecpectancy is expected to rais by the P value (2.2e-16)
Hint: Make your argument using the relevant test results, such as p-value.
The new model is not as significant as the first model, this is because the pvalue is 0.91 which is higher than 5%. The adjusted r squared in the first modfel is higher making it also a better one to look at.