Task 4a

setwd("C:/Users/Prabha Shankar/Desktop/Winter Internship/R file")
var1 <- read.csv(paste("Titanic Data.csv"))
var2 <- aggregate(var1$Age , by = list(var1$Survived) , FUN=mean)
var2
##   Group.1        x
## 1       0 30.41530
## 2       1 28.42382

Task 4c

Use R to Run t-test to test the following hypothesis :

H1 : The Titanic survivors were younger than the passengers who died .

Null Hypothesis : There is no significant difference between the age of survivors .

t.test(var1$Age~var1$Survived)
## 
##  Welch Two Sample t-test
## 
## data:  var1$Age by var1$Survived
## t = 2.1816, df = 667.56, p-value = 0.02949
## alternative hypothesis: true difference in means is not equal to 0
## 95 percent confidence interval:
##  0.1990628 3.7838912
## sample estimates:
## mean in group 0 mean in group 1 
##        30.41530        28.42382

Value of p is 0.02949 which is greater than 0.01 (p>0.01). Therfore null hypothesis is accepted and H1 is regected .