titanic <- read.csv("C:/Program Files/RStudio/files/Titanic Data.csv")
View(titanic)
1.The average age of the survivors and the average age of the people who died.
x <- by(titanic$Age, list(titanic$Survived), mean)
x
## : 0
## [1] 30.4153
## --------------------------------------------------------
## : 1
## [1] 28.42382
t.test(titanic$Age ~ titanic$Survived,var.equal=TRUE)
##
## Two Sample t-test
##
## data: titanic$Age by titanic$Survived
## t = 2.2302, df = 887, p-value = 0.02599
## alternative hypothesis: true difference in means is not equal to 0
## 95 percent confidence interval:
## 0.238890 3.744064
## sample estimates:
## mean in group 0 mean in group 1
## 30.41530 28.42382
p-value = 0.02599
As p-value<0.05, so we can say that the null hypothesis H2 is not true and is rejected. There is significant difference between the ages of people who died and the ones who survived.