titanic.df <- read.csv("Titanic Data.csv")
View(titanic.df)
Use R to create a table showing the average age of the survivors and the average age of the people who died.
aggregate(titanic.df$Age,by=list(Survived = titanic.df$Survived),mean)
##   Survived        x
## 1        0 30.41530
## 2        1 28.42382
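The same kind of table can also be built with the formula interface of `aggregate`, which labels the result column `Age` rather than `x`. The sketch below runs on a small made-up data frame so that it works without the CSV file; `toy.df` and its values are hypothetical and are not the Titanic data.

```r
# Hypothetical toy data, NOT the Titanic data set
toy.df <- data.frame(
  Survived = c(0, 0, 0, 1, 1, 1),
  Age      = c(40, 30, 35, 20, 25, 30)
)

# Mean age per survival group via the formula interface
age.by.survival <- aggregate(Age ~ Survived, data = toy.df, FUN = mean)
print(age.by.survival)
```

To apply this to the real data, replace `toy.df` with `titanic.df`. Note that the formula interface drops rows with missing values by default, which the `by =` form does not do.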
Use R to run a t-test to test the following hypothesis: H2: The Titanic survivors were younger than the passengers who died.
t.test(Age ~ Survived, data = titanic.df)
##
## Welch Two Sample t-test
##
## data: Age by Survived
## t = 2.1816, df = 667.56, p-value = 0.02949
## alternative hypothesis: true difference in means is not equal to 0
## 95 percent confidence interval:
## 0.1990628 3.7838912
## sample estimates:
## mean in group 0 mean in group 1
##        30.41530        28.42382
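The p-value (0.02949) is less than 0.05, so we reject the null hypothesis that there is no difference in mean age between the passengers who survived and those who died.
The sample means (28.42 for survivors versus 30.42 for those who died) show that the difference lies in the direction H2 predicts: on average, the Titanic survivors were younger than the passengers who died.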
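H2 is a directional hypothesis, while `t.test` runs a two-sided test by default. A one-sided test matches the wording of H2 more closely. The sketch below shows how that could be requested with the `alternative` argument. It uses a small made-up data frame so that it runs without the CSV file; `toy.df` and its values are hypothetical and are not the Titanic data.

```r
# Hypothetical toy data, NOT the Titanic data set
toy.df <- data.frame(
  Survived = rep(c(0, 1), each = 5),
  Age      = c(42, 35, 38, 29, 31, 22, 27, 30, 25, 19)
)

# Group 0 (died) is the first group, so alternative = "greater" tests
# H1: mean age of group 0 > mean age of group 1,
# i.e. survivors are younger on average
one.sided <- t.test(Age ~ Survived, data = toy.df, alternative = "greater")

# Default two-sided test, for comparison
two.sided <- t.test(Age ~ Survived, data = toy.df)

print(one.sided$p.value)
print(two.sided$p.value)
```

When the observed difference lies in the hypothesized direction, the one-sided p-value is half the two-sided one. On the real data the call would be `t.test(Age ~ Survived, data = titanic.df, alternative = "greater")`.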