February 21, 2017

Descripcion del Dominio de los Datos

## [1] "ID"
## [1] 1 2 3 4 5 6
## [1] "Supervivencia"
## [1] 0 1 1 1 0 0
## [1] "Clase"
## [1] 3 1 3 1 3 3

## [1] "Nombres"
## [1] Braund, Mr. Owen Harris                            
## [2] Cumings, Mrs. John Bradley (Florence Briggs Thayer)
## [3] Heikkinen, Miss. Laina                             
## [4] Futrelle, Mrs. Jacques Heath (Lily May Peel)       
## [5] Allen, Mr. William Henry                           
## [6] Moran, Mr. James                                   
## 891 Levels: Abbing, Mr. Anthony ... Zimmerman, Mr. Leo

## [1] "Genero"
## [1] male   female female female male   male  
## Levels: female male
## [1] "Edad"
## [1] 22 38 26 35 35 NA
## [1] "NĂºmero de Hermanos"
## [1] 1 1 0 1 0 0

## [1] "NĂºmero de Padres"
## [1] 0 0 0 0 0 0
## [1] "Boleto"
## [1] A/5 21171        PC 17599         STON/O2. 3101282 113803          
## [5] 373450           330877          
## 681 Levels: 110152 110413 110465 110564 110813 111240 111320 ... WE/P 5735

## [1] "Costo del Boleto"
## [1]  7.2500 71.2833  7.9250 53.1000  8.0500  8.4583
## [1] "Camarote"
## [1]      C85       C123          
## 148 Levels:  A10 A14 A16 A19 A20 A23 A24 A26 A31 A32 A34 A36 A5 A6 ... T
## [1] "A bordo?"
## [1] S C S S S Q
## Levels:  C Q S

Descripcion de Variables

##  [1] "PassengerId" "Survived"    "Pclass"      "Name"        "Sex"        
##  [6] "Age"         "SibSp"       "Parch"       "Ticket"      "Fare"       
## [11] "Cabin"       "Embarked"

Limpieza de datos

Resumen Estad?stico

##     Survived          Pclass          Sex           Age       
##  Min.   :0.0000   Min.   :1.000   female:314   Min.   : 0.42  
##  1st Qu.:0.0000   1st Qu.:2.000   male  :577   1st Qu.:20.12  
##  Median :0.0000   Median :3.000                Median :28.00  
##  Mean   :0.3838   Mean   :2.309                Mean   :29.70  
##  3rd Qu.:1.0000   3rd Qu.:3.000                3rd Qu.:38.00  
##  Max.   :1.0000   Max.   :3.000                Max.   :80.00  
##                                                NA's   :177    
##       Ticket         Fare                Cabin     Embarked
##  1601    :  7   Min.   :  0.00              :687    :  2   
##  347082  :  7   1st Qu.:  7.91   B96 B98    :  4   C:168   
##  CA. 2343:  7   Median : 14.45   C23 C25 C27:  4   Q: 77   
##  3101295 :  6   Mean   : 32.20   G6         :  4   S:644   
##  347088  :  6   3rd Qu.: 31.00   C22 C26    :  3           
##  CA 2144 :  6   Max.   :512.33   D          :  3           
##  (Other) :852                    (Other)    :186

Boxplot

Edad

Asimetria (Skewness)

Asimetria

La asimetr?a esta definida como γ1 = μ3∕μ3∕22

## [1] 0.3874744
## [1] 4.77121

Histogramas

Edad

Costo del Boleto

Supervivencia

Clase del Boleto

Cuartiles

Edad

## [1] "Cuartiles"
##     0%    25%    50%    75%   100% 
##  0.420 20.125 28.000 38.000 80.000
## [1] "Rango intercuartil: 17.875000"

Costo del Boleto

## [1] "Cuartiles"
##       0%      25%      50%      75%     100% 
##   0.0000   7.9104  14.4542  31.0000 512.3292
## [1] "Rango intercuartil: 23.089600"

CorrelaciĂ³n

Supervivencia y Edad

## [1] "Supervivencia y Edad"
## [1] "Coeficiente de CorrelaciĂ³n: -0.077221"
## [1] "Costo del Boleto y Edad"
## [1] "Coeficiente de CorrelaciĂ³n: 0.096067"
## [1] "Clase del Boleto y Edad"
## [1] "Coeficiente de CorrelaciĂ³n: -0.369226"
## [1] "Costo del Boleto y Clase del boleto"
## [1] "Coeficiente de CorrelaciĂ³n: -0.549500"

Scatterplots

Edad y Clase

Costo del Boleto y Supervivencia

Costo del Boleto y Edad