The dataset contains 12 variables in total. Of these, I consider 5 meaningful and suitable for simple linear regression and hypothesis testing:
- Survived
  - 0 = Didn’t survive
  - 1 = Survived
- Pclass
  - Passenger’s ticket class, which serves as a proxy for the passenger’s economic status
  - 1 = First class
  - 2 = Second class
  - 3 = Third class
- Sex
  - Either “male” or “female”
- Age
  - Passenger’s age
- Fare
  - Ticket price that the passenger paid
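
To make the coding of these five variables concrete, here is a minimal Python sketch that parses them into typed values. The sample rows are made up purely for illustration and are not real records from the dataset. The names `SAMPLE_CSV`, `PCLASS_LABELS`, `parse_row`, and the `IsFemale` dummy column are hypothetical helpers introduced here, and treating a blank `Age` as missing is an assumption about how gaps might appear in the file.

```python
import csv
import io

# Made-up rows for illustration only; NOT real passenger records.
SAMPLE_CSV = """Survived,Pclass,Sex,Age,Fare
1,1,female,30,80.0
0,3,male,25,8.0
0,2,male,,13.0
"""

# Human-readable labels for the ticket class codes described above.
PCLASS_LABELS = {1: "First class", 2: "Second class", 3: "Third class"}


def parse_row(row):
    """Convert one CSV row (all strings) into typed values."""
    return {
        "Survived": int(row["Survived"]),  # 0 = didn't survive, 1 = survived
        "Pclass": int(row["Pclass"]),      # 1, 2, or 3
        "Sex": row["Sex"],                 # "male" or "female"
        # Numeric dummy (hypothetical column) so Sex can enter a regression.
        "IsFemale": 1 if row["Sex"] == "female" else 0,
        # Assumption: a blank Age field means the value is missing.
        "Age": float(row["Age"]) if row["Age"] else None,
        "Fare": float(row["Fare"]),
    }


rows = [parse_row(r) for r in csv.DictReader(io.StringIO(SAMPLE_CSV))]

for r in rows:
    print(r["Survived"], PCLASS_LABELS[r["Pclass"]], r["Sex"], r["Age"], r["Fare"])
```

Keeping `Survived` and `Pclass` as integers and adding a 0/1 dummy for `Sex` is one simple way to get every variable into a numeric form that a regression or a hypothesis test can consume; other encodings would work equally well.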