Pendahuluan

Bagian ini merupakan bentuk R Markdown tentang EDA dan Data Preprocessing.

Import Library

if(!require(tidyverse)) install.packages("tidyverse")
## Loading required package: tidyverse
## Warning: package 'tidyverse' was built under R version 3.6.3
## -- Attaching packages --------------------------------------- tidyverse 1.3.0 --
## v ggplot2 3.3.3     v purrr   0.3.4
## v tibble  3.1.0     v dplyr   1.0.5
## v tidyr   1.1.3     v stringr 1.4.0
## v readr   1.4.0     v forcats 0.5.1
## Warning: package 'ggplot2' was built under R version 3.6.3
## Warning: package 'tibble' was built under R version 3.6.3
## Warning: package 'tidyr' was built under R version 3.6.3
## Warning: package 'readr' was built under R version 3.6.3
## Warning: package 'purrr' was built under R version 3.6.3
## Warning: package 'dplyr' was built under R version 3.6.3
## Warning: package 'stringr' was built under R version 3.6.3
## Warning: package 'forcats' was built under R version 3.6.3
## -- Conflicts ------------------------------------------ tidyverse_conflicts() --
## x dplyr::filter() masks stats::filter()
## x dplyr::lag()    masks stats::lag()
if(!require(skimr)) install.packages("skimr")
## Loading required package: skimr
## Warning: package 'skimr' was built under R version 3.6.3
if(!require(DataExplorer)) install.packages("DataExplorer")
## Loading required package: DataExplorer
## Warning: package 'DataExplorer' was built under R version 3.6.3
if(!require(visdat)) install.packages("visdat")
## Loading required package: visdat
## Warning: package 'visdat' was built under R version 3.6.3
library(visdat)
library(tidyverse)
library(skimr)
library(DataExplorer)
library(dplyr)
library(ggplot2)
library(tidyr)
library(readr)
library(tibble)

Mengakses Dataset

df  <- read.csv("https://raw.githubusercontent.com/millaoktavia/Kelas-Mahir-Pejuang-Data-2.0/main/titanic_modify.csv",sep=';', stringsAsFactors = T)
df[0:5,]
##   PassengerId Survived Pclass
## 1           1        0      3
## 2           2        1      1
## 3           3        1      3
## 4           4        1      1
## 5           5        0      3
##                                                  Name    Sex Age SibSp Parch
## 1                             Braund, Mr. Owen Harris   male  22     1     0
## 2 Cumings, Mrs. John Bradley (Florence Briggs Thayer) female  38     1     0
## 3                              Heikkinen, Miss. Laina female  26     0     0
## 4        Futrelle, Mrs. Jacques Heath (Lily May Peel) female  35     1     0
## 5                            Allen, Mr. William Henry   male  35     0     0
##             Ticket    Fare Cabin Embarked
## 1        A/5 21171    7.25              S
## 2         PC 17599 712.833   C85        C
## 3 STON/O2. 3101282   7.925              S
## 4           113803    53.1  C123        S
## 5           373450    8.05              S

Tujuan Analisis Data

Melakukan klasifikasi penumpang yang selamat dan tidak selamat pada kasus tenggelamnya kapal Titanic.

Memahami dataset

Dataset Titanic dibuat untuk membuat machine learning untuk melakukan klasifikasi biner(Selamat atau Tidak Selamat. Variabel-variabel yang terdapat pada dataset ini adalah sebagai berikut:

  1. PassengerId = Nomor Id Penumpang
  2. Survived = Keterangan Selamat(0=Tidak, 1=Ya)
  3. Pclass = Kelas Tiket (1=Kelas 1, 2=Kelas 2, dst)
  4. Name = Nama Penumpang
  5. Sex = Jenis kelamin
  6. Age = Usia dalam tahun
  7. SibSp = Jumlah saudara kandung / pasangan di kapal Titanic
  8. Parch = Jumlah orang tua / anak di kapal Titanic
  9. Ticket = Nomor Tiket
  10. Fare = Harga Tiket
  11. Cabin = Nama Kabin
  12. Embarked = Pelabuhan Asal (C = Cherbourg, Q = Queenstown, S = Southampton)

##Eksploratory Data Analysis

Dimensi Data

dim(df)
## [1] 707  12

Artinya kita memiliki data dengan 12 kolom dan 707 baris

Variabel Pada Dataset

names(df)
##  [1] "PassengerId" "Survived"    "Pclass"      "Name"        "Sex"        
##  [6] "Age"         "SibSp"       "Parch"       "Ticket"      "Fare"       
## [11] "Cabin"       "Embarked"

Digunakan untuk menampilkan nama variable tiap dataset.

df$Fare=as.numeric(df$Fare)

Mengubah tipe data fare menjadi numerik.

str(df)
## 'data.frame':    707 obs. of  12 variables:
##  $ PassengerId: int  1 2 3 4 5 6 7 8 9 10 ...
##  $ Survived   : int  0 1 1 1 0 0 0 0 1 1 ...
##  $ Pclass     : int  3 1 3 1 3 3 1 3 3 2 ...
##  $ Name       : Factor w/ 707 levels "Abbott, Mrs. Stanton (Rosa Hunt)",..: 86 149 282 221 8 432 401 493 326 449 ...
##  $ Sex        : Factor w/ 2 levels "female","male": 2 1 1 1 2 2 2 2 1 1 ...
##  $ Age        : num  22 38 26 35 35 NA 54 2 27 14 ...
##  $ SibSp      : int  1 1 0 1 0 0 0 3 0 1 ...
##  $ Parch      : int  0 0 0 0 0 0 0 1 2 0 ...
##  $ Ticket     : Factor w/ 563 levels "110152","110413",..: 426 492 554 41 385 227 72 321 281 110 ...
##  $ Fare       : num  158 171 166 133 196 209 130 68 13 101 ...
##  $ Cabin      : Factor w/ 122 levels "","A10","A14",..: 1 70 1 49 1 1 109 1 1 1 ...
##  $ Embarked   : Factor w/ 4 levels "","C","Q","S": 4 2 4 4 4 3 4 4 4 2 ...

Kita dapat mengetahui tipe-tipe data masing-masing variabel dan nama-nama variabel dalam dataset.

Mengecek Missing Data

sapply(df, function(x) sum(is.na(x)))
## PassengerId    Survived      Pclass        Name         Sex         Age 
##           0           0           0           0           0         145 
##       SibSp       Parch      Ticket        Fare       Cabin    Embarked 
##           0           0           0           0           0           0

Berarti di data Age terdapat 145 data hilang

vis_miss(df)

Mengecek Outlier

plot_boxplot(df,by="Age")

Melihat korelasi data

plot_correlation(df)
## 3 features with more than 20 categories ignored!
## Name: 707 categories
## Ticket: 563 categories
## Cabin: 122 categories
## Warning: Removed 24 rows containing missing values (geom_text).

### Melihat statistik data

summary(df)
##   PassengerId       Survived          Pclass     
##  Min.   :  1.0   Min.   :0.0000   Min.   :1.000  
##  1st Qu.:177.5   1st Qu.:0.0000   1st Qu.:2.000  
##  Median :354.0   Median :0.0000   Median :3.000  
##  Mean   :354.0   Mean   :0.3876   Mean   :2.308  
##  3rd Qu.:530.5   3rd Qu.:1.0000   3rd Qu.:3.000  
##  Max.   :707.0   Max.   :1.0000   Max.   :3.000  
##                                                  
##                                              Name         Sex     
##  Abbott, Mrs. Stanton (Rosa Hunt)              :  1   female:254  
##  Abelson, Mr. Samuel                           :  1   male  :453  
##  Adahl, Mr. Mauritz Nils Martin                :  1               
##  Adams, Mr. John                               :  1               
##  Ahlin, Mrs. Johan (Johanna Persdotter Larsson):  1               
##  Albimona, Mr. Nassef Cassem                   :  1               
##  (Other)                                       :701               
##       Age            SibSp            Parch                 Ticket   
##  Min.   : 0.75   Min.   :0.0000   Min.   :0.0000   CA 2144     :  6  
##  1st Qu.:21.00   1st Qu.:0.0000   1st Qu.:0.0000   1601        :  5  
##  Median :28.00   Median :0.0000   Median :0.0000   3101295     :  5  
##  Mean   :30.03   Mean   :0.5304   Mean   :0.3833   347082      :  5  
##  3rd Qu.:39.00   3rd Qu.:1.0000   3rd Qu.:0.0000   347088      :  5  
##  Max.   :80.00   Max.   :8.0000   Max.   :6.0000   S.O.C. 14879:  5  
##  NA's   :145                                       (Other)     :676  
##       Fare               Cabin     Embarked
##  Min.   :  1.0              :548    :  1   
##  1st Qu.: 57.5   C23 C25 C27:  4   C:136   
##  Median :119.0   G6         :  4   Q: 64   
##  Mean   :115.8   C22 C26    :  3   S:506   
##  3rd Qu.:174.0   D          :  3           
##  Max.   :227.0   F2         :  3           
##                  (Other)    :142

Pairplot

plot_scatterplot(df, by="Survived")

##Data Preprocessing Data preprocessing ini digunakan guna menyiapkan data untuk diklasifikasi menggunakan metode SVM(Hanya Contoh).

Setelah mengetahui hasil EDA di atas maka untuk hasil yang baik diperlukan beberapa hal sebagai berikut:

  1. Menghapus kolom teks yang tidak diperlukan:Name dan Ticket
  2. Mengisi NaN pada kolom Cabin dengan salah satu nama kabin tertentu, mengisi NaN pada kolom *Age** dengan nilai rata-ratanya
  3. Melakukan pengkodean(Encoding) pada kolom Sex , Cabin, dan Embarked
  4. Menghapus outlier
  5. Karena nilai data ada yang sangat besar dan kecil maka perlu dilakukan normalisasi.

Setelah data diperbaiki dilakukan langkah sebagai berikut:

  1. Memilih variabel independent dan label
  2. Membagi data menjadi data training dan data testing

[Note] * Ini hanya contoh untuk melakukan pengisian data NaN(Kosong), nama kabin mungkin menjadi penting sehingga tidak segampang itu diganti

Seleksi Kolom

df_1=df[, -c(4,9)]
df_1
##     PassengerId Survived Pclass    Sex   Age SibSp Parch Fare           Cabin
## 1             1        0      3   male 22.00     1     0  158                
## 2             2        1      1 female 38.00     1     0  171             C85
## 3             3        1      3 female 26.00     0     0  166                
## 4             4        1      1 female 35.00     1     0  133            C123
## 5             5        0      3   male 35.00     0     0  196                
## 6             6        0      3   male    NA     0     0  209                
## 7             7        0      1   male 54.00     0     0  130             E46
## 8             8        0      3   male  2.00     3     1   68                
## 9             9        1      3 female 27.00     0     2   13                
## 10           10        1      2 female 14.00     1     0  101                
## 11           11        1      3 female  4.00     1     1   51              G6
## 12           12        1      1 female 58.00     0     0   84            C103
## 13           13        0      3   male 20.00     0     0  196                
## 14           14        0      3   male 39.00     1     5  104                
## 15           15        0      3 female 14.00     0     0  189                
## 16           16        1      2 female 55.00     0     0   49                
## 17           17        0      3   male  2.00     4     1   97                
## 18           18        1      2   male    NA     0     0   25                
## 19           19        0      3 female 31.00     1     0   54                
## 20           20        1      3 female    NA     0     0  157                
## 21           21        0      2   male 35.00     0     0   82                
## 22           22        1      2   male 34.00     0     0   25             D56
## 23           23        1      3 female 15.00     0     0  199                
## 24           24        1      1   male 28.00     0     0  114              A6
## 25           25        0      3 female  8.00     3     1   68                
## 26           26        1      3 female 38.00     1     5  105                
## 27           27        0      3   male    NA     0     0  157                
## 28           28        0      1   male 19.00     3     2   88     C23 C25 C27
## 29           29        1      3 female    NA     0     0  190                
## 30           30        0      3   male    NA     0     0  192                
## 31           31        0      1   male 40.00     0     0   93                
## 32           32        1      1 female    NA     1     0    4             B78
## 33           33        1      3 female    NA     0     0  162                
## 34           34        0      2   male 66.00     0     0    7                
## 35           35        0      1   male 28.00     1     0  204                
## 36           36        0      1   male 42.00     1     0  131                
## 37           37        1      3   male    NA     0     0  172                
## 38           38        0      3   male 21.00     0     0  196                
## 39           39        0      3 female 18.00     2     0   54                
## 40           40        1      3 female 14.00     1     0   14                
## 41           41        0      3 female 40.00     1     0  218                
## 42           42        0      2 female 27.00     1     0   67                
## 43           43        0      3   male    NA     0     0  192                
## 44           44        1      2 female  3.00     1     2  121                
## 45           45        1      3 female 19.00     0     0  190                
## 46           46        0      3   male    NA     0     0  196                
## 47           47        0      3   male    NA     1     0   40                
## 48           48        1      3 female    NA     0     0  162                
## 49           49        0      3   male    NA     2     0   70                
## 50           50        0      3 female 18.00     1     0   53                
## 51           51        0      3   male  7.00     4     1  119                
## 52           52        0      3   male 21.00     0     0  164                
## 53           53        1      1 female 49.00     1     0  180             D33
## 54           54        1      2 female 29.00     1     0   82                
## 55           55        0      1   male 65.00     0     1  146             B30
## 56           56        1      1   male    NA     0     0  114             C52
## 57           57        1      2 female 21.00     0     0    7                
## 58           58        0      3   male 28.50     0     0  172                
## 59           59        1      2 female  5.00     1     2   91                
## 60           60        0      3   male 11.00     5     2  123                
## 61           61        0      3   male 22.00     0     0  172                
## 62           62        1      1 female 38.00     0     0  198             B28
## 63           63        0      1   male 45.00     1     0  205             C83
## 64           64        0      3   male  4.00     3     2   92                
## 65           65        0      1   male    NA     0     0   93                
## 66           66        1      3   male    NA     1     1   47                
## 67           67        1      2 female 29.00     0     0    7             F33
## 68           68        0      3   male 19.00     0     0  202                
## 69           69        1      3 female 17.00     4     2  166                
## 70           70        0      3   male 26.00     2     0  212                
## 71           71        0      2   male 32.00     0     0    7                
## 72           72        0      3 female 16.00     5     2  123                
## 73           73        0      2   male 21.00     0     0  174                
## 74           74        0      3   male 26.00     1     0   35                
## 75           75        1      3   male 32.00     0     0  137                
## 76           76        0      3   male 25.00     0     0  160           F G73
## 77           77        0      3   male    NA     0     0  192                
## 78           78        0      3   male    NA     0     0  196                
## 79           79        1      2   male  0.83     0     2   96                
## 80           80        1      3 female 30.00     0     0   19                
## 81           81        0      3   male 22.00     0     0  215                
## 82           82        1      3   male 29.00     0     0  219                
## 83           83        1      3 female    NA     0     0  184                
## 84           84        0      1   male 28.00     0     0  124                
## 85           85        1      2 female 17.00     0     0    7                
## 86           86        1      3 female 33.00     3     0   43                
## 87           87        0      3   male 16.00     1     3  110                
## 88           88        0      3   male    NA     0     0  196                
## 89           89        1      1 female 23.00     3     2   88     C23 C25 C27
## 90           90        0      3   male 24.00     0     0  196                
## 91           91        0      3   male 29.00     0     0  196                
## 92           92        0      3   male 20.00     0     0  189                
## 93           93        0      1   male 46.00     1     0  144             E31
## 94           94        0      3   male 26.00     1     2   65                
## 95           95        0      3   male 59.00     0     0  158                
## 96           96        0      3   male    NA     0     0  196                
## 97           97        0      1   male 71.00     0     0  112              A5
## 98           98        1      1   male 23.00     0     1  148         D10 D12
## 99           99        1      2 female 34.00     0     1   75                
## 100         100        0      2   male 34.00     1     0   82                
## 101         101        0      3 female 28.00     0     0  192                
## 102         102        0      3   male    NA     0     0  192                
## 103         103        0      1   male 21.00     0     1  186             D26
## 104         104        0      3   male 33.00     0     0  211                
## 105         105        0      3   male 37.00     2     0  166                
## 106         106        0      3   male 28.00     0     0  192                
## 107         107        1      3 female 21.00     0     0  160                
## 108         108        1      3   male    NA     0     0  163                
## 109         109        0      3   male 38.00     0     0  192                
## 110         110        1      3 female    NA     1     0   78                
## 111         111        0      1   male 47.00     0     0  131            C110
## 112         112        0      3 female 14.50     1     0   35                
## 113         113        0      3   male 22.00     0     0  196                
## 114         114        0      3 female 20.00     1     0  220                
## 115         115        0      3 female 17.00     0     0   36                
## 116         116        0      3   male 21.00     0     0  166                
## 117         117        0      3   male 70.50     0     0  162                
## 118         118        0      2   male 29.00     1     0   67                
## 119         119        0      1   male 24.00     0     1   62         B58 B60
## 120         120        0      3 female  2.00     4     2  104                
## 121         121        0      2   male 21.00     2     0  174                
## 122         122        0      3   male    NA     0     0  196                
## 123         123        0      2   male 32.50     1     0  101                
## 124         124        1      2 female 32.50     0     0   25            E101
## 125         125        0      1   male 54.00     0     1  186             D26
## 126         126        1      3   male 12.00     1     0   14                
## 127         127        0      3   male    NA     0     0  162                
## 128         128        1      3   male 24.00     0     0  170                
## 129         129        1      3 female    NA     1     1   73           F E69
## 130         130        0      3   male 45.00     0     0  143                
## 131         131        0      3   male 33.00     0     0  192                
## 132         132        0      3   male 20.00     0     0  155                
## 133         133        0      3 female 47.00     1     0   34                
## 134         134        1      2 female 29.00     1     0   82                
## 135         135        0      2   male 25.00     0     0   25                
## 136         136        0      2   male 23.00     0     0   45                
## 137         137        1      1 female 19.00     0     2   86             D47
## 138         138        0      1   male 37.00     1     0  133            C123
## 139         139        0      3   male 16.00     0     0  223                
## 140         140        0      1   male 24.00     0     0  194             B86
## 141         141        0      3 female    NA     0     2   47                
## 142         142        1      3 female 22.00     0     0  162                
## 143         143        1      3 female 24.00     1     0   43                
## 144         144        0      3   male 19.00     0     0  142                
## 145         145        0      2   male 18.00     0     0   12                
## 146         146        0      2   male 19.00     1     1  115                
## 147         147        1      3   male 27.00     0     0  185                
## 148         148        0      3 female  9.00     2     2  110                
## 149         149        0      2   male 36.50     0     2   82              F2
## 150         150        0      2   male 42.00     0     0   25                
## 151         151        0      2   male 51.00     0     0   20                
## 152         152        1      1 female 22.00     1     0  151              C2
## 153         153        0      3   male 55.50     0     0  196                
## 154         154        0      3   male 40.50     0     2   34                
## 155         155        0      3   male    NA     0     0  173                
## 156         156        0      1   male 51.00     0     1  145                
## 157         157        1      3 female 16.00     0     0  182                
## 158         158        0      3   male 30.00     0     0  196                
## 159         159        0      3   male    NA     0     0  212                
## 160         160        0      3   male    NA     8     2  154                
## 161         161        0      3   male 44.00     0     1   50                
## 162         162        1      2 female 40.00     0     0   42                
## 163         163        0      3   male 26.00     0     0  163                
## 164         164        0      3   male 17.00     0     0  212                
## 165         165        0      3   male  1.00     4     1  119                
## 166         166        1      3   male  9.00     0     2   64                
## 167         167        1      1 female    NA     0     1  134             E33
## 168         168        0      3 female 45.00     1     4   92                
## 169         169        0      1   male    NA     0     0   79                
## 170         170        0      3   male 28.00     0     0  137                
## 171         171        0      1   male 61.00     0     0  109             B19
## 172         172        0      3   male  4.00     4     1   97                
## 173         173        1      3 female  1.00     1     1   13                
## 174         174        0      3   male 21.00     0     0  166                
## 175         175        0      1   male 56.00     0     0  102              A7
## 176         176        0      3   male 18.00     1     1  189                
## 177         177        0      3   male    NA     3     1   80                
## 178         178        0      1 female 50.00     0     0   95             C49
## 179         179        0      2   male 30.00     0     0   25                
## 180         180        0      3   male 36.00     0     0    1                
## 181         181        0      3 female    NA     8     2  154                
## 182         182        0      2   male    NA     0     0   38                
## 183         183        0      3   male  9.00     4     2  105                
## 184         184        1      2   male  1.00     2     1  117              F4
## 185         185        1      3 female  4.00     0     2   71                
## 186         186        0      1   male    NA     0     0  128             A32
## 187         187        1      3 female    NA     1     0   40                
## 188         188        1      1   male 45.00     0     0   84                
## 189         189        0      3   male 40.00     1     1   40                
## 190         190        0      3   male 36.00     0     0  192                
## 191         191        1      2 female 32.00     0     0   25                
## 192         192        0      2   male 19.00     0     0   25                
## 193         193        1      3 female 19.00     1     0  189                
## 194         194        1      2   male  3.00     1     1   82              F2
## 195         195        1      1 female 44.00     0     0   93              B4
## 196         196        1      1 female 58.00     0     0    4             B80
## 197         197        0      3   male    NA     0     0  162                
## 198         198        0      3   male 42.00     0     1  207                
## 199         199        1      3 female    NA     0     0  162                
## 200         200        0      2 female 24.00     0     0   25                
## 201         201        0      3   male 28.00     0     0  219                
## 202         202        0      3   male    NA     8     2  154                
## 203         203        0      3   male 34.00     0     0  149                
## 204         204        0      3   male 45.50     0     0  157                
## 205         205        1      3   male 18.00     0     0  196                
## 206         206        0      3 female  2.00     0     1    9              G6
## 207         207        0      3   male 32.00     1     0   43                
## 208         208        1      3   male 26.00     0     0   56                
## 209         209        1      3 female 16.00     0     0  162                
## 210         210        1      1   male 40.00     0     0  103             A31
## 211         211        0      3   male 24.00     0     0  155                
## 212         212        1      2 female 35.00     0     0   67                
## 213         213        0      3   male 22.00     0     0  158                
## 214         214        0      2   male 30.00     0     0   25                
## 215         215        0      3   male    NA     1     0  162                
## 216         216        1      1 female 31.00     1     0   15             D36
## 217         217        1      3 female 27.00     0     0  166                
## 218         218        0      2   male 42.00     1     0   90                
## 219         219        1      1 female 32.00     0     0  179             D15
## 220         220        0      2   male 30.00     0     0    7                
## 221         221        1      3   male 16.00     0     0  196                
## 222         222        0      2   male 27.00     0     0   25                
## 223         223        0      3   male 51.00     0     0  196                
## 224         224        0      3   male    NA     0     0  192                
## 225         225        1      1   male 38.00     1     0  221             C93
## 226         226        0      3   male 22.00     0     0  217                
## 227         227        1      2   male 19.00     0     0    7                
## 228         228        0      3   male 20.50     0     0  158                
## 229         229        0      2   male 18.00     0     0   25                
## 230         230        0      3 female    NA     3     1   80                
## 231         231        1      1 female 35.00     1     0  205             C83
## 232         232        0      3   male 29.00     0     0  163                
## 233         233        0      2   male 59.00     0     0   26                
## 234         234        1      3 female  5.00     4     2  105                
## 235         235        0      2   male 24.00     0     0    7                
## 236         236        0      3 female    NA     0     0  159                
## 237         237        0      2   male 44.00     1     0   82                
## 238         238        1      2 female  8.00     0     2   83                
## 239         239        0      2   male 19.00     0     0    7                
## 240         240        0      2   male 33.00     0     0   17                
## 241         241        0      3 female    NA     1     0   35                
## 242         242        1      3 female    NA     1     0   40                
## 243         243        0      2   male 29.00     0     0    7                
## 244         244        0      3   male 22.00     0     0  156                
## 245         245        0      3   male 30.00     0     0  157                
## 246         246        0      1   male 44.00     2     0  221             C78
## 247         247        0      3 female 25.00     0     0  163                
## 248         248        1      2 female 24.00     0     2   34                
## 249         249        1      1   male 37.00     1     1  132             D35
## 250         250        0      2   male 54.00     1     0   82                
## 251         251        0      3   male    NA     0     0  158                
## 252         252        0      3 female 29.00     1     1    9              G6
## 253         253        0      1   male 62.00     0     0   84             C87
## 254         254        0      3   male 30.00     1     0   50                
## 255         255        0      3 female 41.00     0     2   66                
## 256         256        1      3 female 29.00     0     2   47                
## 257         257        1      1 female    NA     0     0  194                
## 258         258        1      1 female 30.00     0     0  210             B77
## 259         259        1      1 female 35.00     0     0  127                
## 260         260        1      2 female 50.00     0     1   82                
## 261         261        0      3   male    NA     0     0  162                
## 262         262        1      3   male  3.00     4     2  105                
## 263         263        0      1   male 52.00     1     1  195             E67
## 264         264        0      1   male 40.00     0     0    1             B94
## 265         265        0      3 female    NA     0     0  162                
## 266         266        0      2   male 36.00     0     0    7                
## 267         267        0      3   male 16.00     4     1  119                
## 268         268        1      3   male 25.00     1     0  163                
## 269         269        1      1 female 58.00     0     1    5            C125
## 270         270        1      1 female 35.00     0     0    3             C99
## 271         271        0      1   male    NA     0     0  103                
## 272         272        1      3   male 25.00     0     0    1                
## 273         273        1      2 female 41.00     0     1   57                
## 274         274        0      1   male 37.00     0     1   98            C118
## 275         275        1      3 female    NA     0     0  162                
## 276         276        1      1 female 63.00     1     0  187              D7
## 277         277        0      3 female 45.00     0     0  162                
## 278         278        0      2   male    NA     0     0    1                
## 279         279        0      3   male  7.00     4     1   97                
## 280         280        1      3 female 35.00     1     1   63                
## 281         281        0      3   male 65.00     0     0  162                
## 282         282        0      3   male 28.00     0     0  189                
## 283         283        0      3   male 16.00     0     0  219                
## 284         284        1      3   male 19.00     0     0  196                
## 285         285        0      1   male    NA     0     0   82             A19
## 286         286        0      3   male 33.00     0     0  212                
## 287         287        1      3   male 30.00     0     0  219                
## 288         288        0      3   male 22.00     0     0  192                
## 289         289        1      2   male 42.00     0     0   25                
## 290         290        1      3 female 22.00     0     0  162                
## 291         291        1      1 female 26.00     0     0  191                
## 292         292        1      1 female 19.00     1     0  222             B49
## 293         293        0      2   male 36.00     0     0   22               D
## 294         294        0      3 female 24.00     0     0  197                
## 295         295        0      3   male 24.00     0     0  192                
## 296         296        0      1   male    NA     0     0   93                
## 297         297        0      3   male 23.50     0     0  172                
## 298         298        0      1 female  2.00     1     2   46         C22 C26
## 299         299        1      1   male    NA     0     0  100            C106
## 300         300        1      1 female 50.00     0     1   62         B58 B60
## 301         301        1      3 female    NA     0     0  162                
## 302         302        1      3   male    NA     2     0   76                
## 303         303        0      3   male 19.00     0     0    1                
## 304         304        1      2 female    NA     0     0   18            E101
## 305         305        0      3   male    NA     0     0  196                
## 306         306        1      1   male  0.92     1     2   46         C22 C26
## 307         307        1      1 female    NA     0     0    2                
## 308         308        1      1 female 17.00     1     0   11             C65
## 309         309        0      2   male 30.00     1     0   77                
## 310         310        1      1 female 30.00     0     0  138             E36
## 311         311        1      1 female 24.00     0     0  206             C54
## 312         312        1      1 female 18.00     2     2   85 B57 B59 B63 B66
## 313         313        0      2 female 26.00     1     1   82                
## 314         314        0      3   male 28.00     0     0  192                
## 315         315        0      2   male 43.00     1     1   83                
## 316         316        1      3 female 26.00     0     0  189                
## 317         317        1      2 female 24.00     1     0   82                
## 318         318        0      2   male 54.00     0     0   32                
## 319         319        1      1 female 31.00     0     2    6              C7
## 320         320        1      1 female 40.00     1     1   29             E34
## 321         321        0      3   male 22.00     0     0  158                
## 322         322        0      3   male 27.00     0     0  192                
## 323         323        1      2 female 30.00     0     0   18                
## 324         324        1      2 female 22.00     1     1   96                
## 325         325        0      3   male    NA     8     2  154                
## 326         326        1      1 female 36.00     0     0    3             C32
## 327         327        0      3   male 61.00     0     0  147                
## 328         328        1      2 female 36.00     0     0   25               D
## 329         329        1      3 female 31.00     1     1   64                
## 330         330        1      1 female 16.00     0     1  140             B18
## 331         331        1      3 female    NA     2     0   76                
## 332         332        0      1   male 45.50     0     0   94            C124
## 333         333        0      1   male 38.00     0     1    5             C91
## 334         334        0      3   male 16.00     2     0   54                
## 335         335        1      1 female    NA     1     0   27                
## 336         336        0      3   male    NA     0     0  192                
## 337         337        0      1   male 29.00     1     0  151              C2
## 338         338        1      1 female 41.00     0     0   29             E40
## 339         339        1      3   male 45.00     0     0  196                
## 340         340        0      1   male 45.00     0     0  114               T
## 341         341        1      2   male  2.00     1     1   82              F2
## 342         342        1      1 female 24.00     3     2   88     C23 C25 C27
## 343         343        0      2   male 28.00     0     0   25                
## 344         344        0      2   male 25.00     0     0   25                
## 345         345        0      2   male 36.00     0     0   25                
## 346         346        1      2 female 24.00     0     0   25             F33
## 347         347        1      2 female 40.00     0     0   25                
## 348         348        1      3 female    NA     1     0   50                
## 349         349        1      3   male  3.00     1     1   44                
## 350         350        0      3   male 42.00     0     0  212                
## 351         351        0      3   male 23.00     0     0  216                
## 352         352        0      1   male    NA     0     0  113            C128
## 353         353        0      3   male 15.00     1     1  172                
## 354         354        0      3   male 25.00     1     0   53                
## 355         355        0      3   male    NA     0     0  157                
## 356         356        0      3   male 28.00     0     0  219                
## 357         357        1      1 female 22.00     0     1  134             E33
## 358         358        0      2 female 38.00     0     0   25                
## 359         359        1      3 female    NA     0     0  190                
## 360         360        1      3 female    NA     0     0  190                
## 361         361        0      3   male 40.00     1     4   92                
## 362         362        0      2   male 29.00     1     0   93                
## 363         363        0      3 female 45.00     0     1   35                
## 364         364        0      3   male 35.00     0     0  155                
## 365         365        0      3   male    NA     1     0   40                
## 366         366        0      3   male 30.00     0     0  158                
## 367         367        1      1 female 60.00     1     0  177             D37
## 368         368        1      3 female    NA     0     0  172                
## 369         369        1      3 female    NA     0     0  162                
## 370         370        1      1 female 24.00     0     0  153             B35
## 371         371        1      1   male 25.00     1     0  136             E50
## 372         372        0      3   male 18.00     1     0  149                
## 373         373        0      3   male 19.00     0     0  196                
## 374         374        0      1   male 22.00     0     0    3                
## 375         375        0      3 female  3.00     3     1   68                
## 376         376        1      1 female    NA     1     0  204                
## 377         377        1      3 female 22.00     0     0  158                
## 378         378        0      1   male 27.00     0     2   69             C82
## 379         379        0      3   male 20.00     0     0  120                
## 380         380        0      3   male 19.00     0     0  163                
## 381         381        1      1 female 42.00     0     0   74                
## 382         382        1      3 female  1.00     0     2   48                
## 383         383        0      3   male 32.00     0     0  166                
## 384         384        1      1 female 35.00     1     0  131                
## 385         385        0      3   male    NA     0     0  192                
## 386         386        0      2   male 18.00     0     0  174                
## 387         387        0      3   male  1.00     5     2  123                
## 388         388        1      2 female 36.00     0     0   25                
## 389         389        0      3   male    NA     0     0  181                
## 390         390        1      2 female 17.00     0     0   16                
## 391         391        1      1   male 36.00     1     2   23         B96 B98
## 392         392        1      3   male 21.00     0     0  185                
## 393         393        0      3   male 28.00     2     0  166                
## 394         394        1      1 female 23.00     1     0   15             D36
## 395         395        1      3 female 24.00     0     2   51              G6
## 396         396        0      3   male 22.00     0     0  185                
## 397         397        0      3 female 31.00     0     0  189                
## 398         398        0      2   male 46.00     0     0   82                
## 399         399        0      2   male 23.00     0     0    7                
## 400         400        1      2 female 28.00     0     0   21                
## 401         401        1      3   male 39.00     0     0  166                
## 402         402        0      3   male 26.00     0     0  196                
## 403         403        0      3 female 21.00     1     0  220                
## 404         404        0      3   male 28.00     1     0   43                
## 405         405        0      3 female 20.00     0     0  212                
## 406         406        0      2   male 34.00     1     0   67                
## 407         407        0      3   male 51.00     0     0  162                
## 408         408        1      2   male  3.00     1     1   55                
## 409         409        0      3   male 21.00     0     0  163                
## 410         410        0      3 female    NA     3     1   80                
## 411         411        0      3   male    NA     0     0  192                
## 412         412        0      3   male    NA     0     0  152                
## 413         413        1      1 female 33.00     1     0  221             C78
## 414         414        0      2   male    NA     0     0    1                
## 415         415        1      3   male 44.00     0     0  166                
## 416         416        0      3 female    NA     0     0  196                
## 417         417        1      2 female 34.00     1     1  106                
## 418         418        1      2 female 18.00     0     2   25                
## 419         419        0      2   male 30.00     0     0   25                
## 420         420        0      3 female 10.00     0     2   78                
## 421         421        0      3   male    NA     0     0  192                
## 422         422        0      3   male 21.00     0     0  182                
## 423         423        0      3   male 29.00     0     0  165                
## 424         424        0      3 female 28.00     1     1   33                
## 425         425        0      3   male 18.00     1     1   66                
## 426         426        0      3   male    NA     0     0  158                
## 427         427        1      2 female 28.00     1     0   82                
## 428         428        1      2 female 19.00     0     0   82                
## 429         429        0      3   male    NA     0     0  162                
## 430         430        1      3   male 32.00     0     0  196             E10
## 431         431        1      1   male 28.00     0     0   84             C52
## 432         432        1      3 female    NA     1     0   50                
## 433         433        1      2 female 42.00     1     0   82                
## 434         434        0      3   male 17.00     0     0  156                
## 435         435        0      1   male 50.00     1     0  135             E44
## 436         436        1      1 female 14.00     1     2   23         B96 B98
## 437         437        0      3 female 21.00     2     2  110                
## 438         438        1      2 female 24.00     2     3   55                
## 439         439        0      1   male 64.00     1     4   88     C23 C25 C27
## 440         440        0      2   male 31.00     0     0    7                
## 441         441        1      2 female 45.00     1     1   83                
## 442         442        0      3   male 20.00     0     0  219                
## 443         443        0      3   male 25.00     1     0  163                
## 444         444        1      2 female 28.00     0     0   25                
## 445         445        1      3   male    NA     0     0  200                
## 446         446        1      1   male  4.00     0     2  203             A34
## 447         447        1      2 female 13.00     0     1   57                
## 448         448        1      1   male 34.00     0     0   84                
## 449         449        1      3 female  5.00     2     1   58                
## 450         450        1      1   male 52.00     0     0  100            C104
## 451         451        0      2   male 36.00     1     2   91                
## 452         452        0      3   male    NA     1     0   59                
## 453         453        0      1   male 30.00     0     0   91            C111
## 454         454        1      1   male 49.00     1     0  214             C92
## 455         455        0      3   male    NA     0     0  196                
## 456         456        1      3   male 29.00     0     0  192                
## 457         457        0      1   male 65.00     0     0   84             E38
## 458         458        1      1 female    NA     1     0  130             D21
## 459         459        1      2 female 50.00     0     0    7                
## 460         460        0      3   male    NA     0     0  162                
## 461         461        1      1   male 48.00     0     0   84             E12
## 462         462        0      3   male 34.00     0     0  196                
## 463         463        0      1   male 47.00     0     0  116             E63
## 464         464        0      2   male 48.00     0     0   25                
## 465         465        0      3   male    NA     0     0  196                
## 466         466        0      3   male 38.00     0     0  155                
## 467         467        0      2   male    NA     0     0    1                
## 468         468        0      1   male 56.00     0     0   84                
## 469         469        0      3   male    NA     0     0  161                
## 470         470        1      3 female  0.75     2     1   58                
## 471         471        0      3   male    NA     0     0  158                
## 472         472        0      3   male 38.00     0     0  212                
## 473         473        1      2 female 33.00     1     2   91                
## 474         474        1      2 female 23.00     0     0   30               D
## 475         475        0      3 female 22.00     0     0  226                
## 476         476        0      1   male    NA     0     0  131             A14
## 477         477        0      2   male 34.00     1     0   67                
## 478         478        0      3   male 29.00     1     0  167                
## 479         479        0      3   male 22.00     0     0  176                
## 480         480        1      3 female  2.00     0     1   24                
## 481         481        0      3   male  9.00     5     2  123                
## 482         482        0      2   male    NA     0     0    1                
## 483         483        0      3   male 50.00     0     0  196                
## 484         484        1      3 female 63.00     0     0  225                
## 485         485        1      1   male 25.00     1     0  222             B49
## 486         486        0      3 female    NA     3     1   80                
## 487         487        1      1 female 35.00     1     0  221             C93
## 488         488        0      1   male 58.00     0     0   98             B37
## 489         489        0      3   male 30.00     0     0  196                
## 490         490        1      3   male  9.00     1     1   44                
## 491         491        0      3   male    NA     1     0   59                
## 492         492        0      3   male 21.00     0     0  158                
## 493         493        0      1   male 55.00     0     0  100             C30
## 494         494        0      1   male 71.00     0     0  126                
## 495         495        0      3   male 21.00     0     0  196                
## 496         496        0      3   male    NA     0     0   36                
## 497         497        1      1 female 54.00     1     0  193             D20
## 498         498        0      3   male    NA     0     0   39                
## 499         499        0      1 female 25.00     1     2   46         C22 C26
## 500         500        0      3   male 24.00     0     0  185                
## 501         501        0      3   male 17.00     0     0  212                
## 502         502        0      3 female 21.00     0     0  162                
## 503         503        0      3 female    NA     0     0  178                
## 504         504        0      3 female 37.00     0     0  225                
## 505         505        1      1 female 16.00     0     0  210             B79
## 506         506        0      1   male 18.00     1     0   11             C65
## 507         507        1      2 female 33.00     0     2   82                
## 508         508        1      1   male    NA     0     0   84                
## 509         509        0      3   male 28.00     0     0   72                
## 510         510        1      3   male 26.00     0     0  137                
## 511         511        1      3   male 29.00     0     0  162                
## 512         512        0      3   male    NA     0     0  196                
## 513         513        1      1   male 36.00     0     0   87             E25
## 514         514        1      1 female 54.00     1     0  141                
## 515         515        0      3   male 24.00     0     0  175                
## 516         516        0      1   male 47.00     0     0  111             D46
## 517         517        1      2 female 34.00     0     0    7             F33
## 518         518        0      3   male    NA     0     0   78                
## 519         519        1      2 female 36.00     1     0   82                
## 520         520        0      3   male 32.00     0     0  192                
## 521         521        1      1 female 30.00     0     0  224             B73
## 522         522        0      3   male 22.00     0     0  192                
## 523         523        0      3   male    NA     0     0  157                
## 524         524        1      1 female 44.00     0     1  140             B18
## 525         525        0      3   male    NA     0     0  172                
## 526         526        0      3   male 40.50     0     0  162                
## 527         527        1      2 female 50.00     0     0    7                
## 528         528        0      1   male    NA     0     0   61             C95
## 529         529        0      3   male 39.00     0     0  166                
## 530         530        0      2   male 23.00     2     1   12                
## 531         531        1      2 female  2.00     1     1   82                
## 532         532        0      3   male    NA     0     0  172                
## 533         533        0      3   male 17.00     1     1  172                
## 534         534        1      3 female    NA     0     2   73                
## 535         535        0      3 female 30.00     0     0  212                
## 536         536        1      2 female  7.00     0     2   83                
## 537         537        0      1   male 45.00     0     0   84             B38
## 538         538        1      1 female 30.00     0     0   10                
## 539         539        0      3   male    NA     0     0   34                
## 540         540        1      1 female 22.00     0     2  125             B39
## 541         541        1      1 female 36.00     0     2  169             B22
## 542         542        0      3 female  9.00     4     2  104                
## 543         543        0      3 female 11.00     4     2  104                
## 544         544        1      2   male 32.00     1     0   82                
## 545         545        0      1   male 50.00     1     0   10             C86
## 546         546        0      1   male 64.00     0     0   82                
## 547         547        1      2 female 19.00     1     0   82                
## 548         548        1      2   male    NA     0     0   31                
## 549         549        0      3   male 33.00     1     1   64                
## 550         550        1      2   male  8.00     1     1  115                
## 551         551        1      1   male 17.00     0     2    2             C70
## 552         552        0      2   male 27.00     0     0   82                
## 553         553        0      3   male    NA     0     0  188                
## 554         554        1      3   male 22.00     0     0  157                
## 555         555        1      3 female 22.00     0     0  163                
## 556         556        0      1   male 62.00     0     0   84                
## 557         557        1      1 female 48.00     1     0  118             A16
## 558         558        0      1   male    NA     0     0   74                
## 559         559        1      1 female 39.00     1     1  195             E67
## 560         560        1      3 female 36.00     1     0   52                
## 561         561        0      3   male    NA     0     0  162                
## 562         562        0      3   male 40.00     0     0  192                
## 563         563        0      2   male 28.00     0     0   26                
## 564         564        0      3   male    NA     0     0  196                
## 565         565        0      3 female    NA     0     0  196                
## 566         566        0      3   male 24.00     2     0   78                
## 567         567        0      3   male 19.00     0     0  192                
## 568         568        0      3 female 29.00     0     4   68                
## 569         569        0      3   male    NA     0     0  172                
## 570         570        1      3   male 32.00     0     0  189                
## 571         571        1      2   male 62.00     0     0    7                
## 572         572        1      1 female 53.00     2     0  129            C101
## 573         573        1      1   male 36.00     0     0   89             E25
## 574         574        1      3 female    NA     0     0  162                
## 575         575        0      3   male 16.00     0     0  196                
## 576         576        0      3   male 19.00     0     0   34                
## 577         577        1      2 female 34.00     0     0   25                
## 578         578        1      1 female 39.00     1     0  135             E44
## 579         579        0      3 female    NA     1     0   36                
## 580         580        1      3   male 32.00     0     0  166                
## 581         581        1      2 female 25.00     1     1   99                
## 582         582        1      1 female 39.00     1     1    2             C68
## 583         583        0      2   male 54.00     0     0   82                
## 584         584        0      1   male 36.00     0     0  120             A10
## 585         585        0      3   male    NA     0     0  213                
## 586         586        1      1 female 18.00     0     2  195             E68
## 587         587        0      2   male 47.00     0     0   37                
## 588         588        1      1   male 60.00     1     1  194             B41
## 589         589        0      3   male 22.00     0     0  196                
## 590         590        0      3   male    NA     0     0  196                
## 591         591        0      3   male 35.00     0     0  156                
## 592         592        1      1 female 52.00     1     0  193             D20
## 593         593        0      3   male 47.00     0     0  158                
## 594         594        0      3 female    NA     0     2  162                
## 595         595        0      2   male 37.00     1     0   82                
## 596         596        0      3   male 36.00     1     1   78                
## 597         597        1      2 female    NA     0     0  108                
## 598         598        0      3   male 49.00     0     0    1                
## 599         599        0      3   male    NA     0     0  157                
## 600         600        1      1   male 49.00     1     0  138             A20
## 601         601        1      2 female 24.00     2     1   90                
## 602         602        0      3   male    NA     0     0  192                
## 603         603        0      1   male    NA     0     0  122                
## 604         604        0      3   male 44.00     0     0  196                
## 605         605        1      1   male 35.00     0     0   84                
## 606         606        0      3   male 36.00     1     0   41                
## 607         607        0      3   male 30.00     0     0  192                
## 608         608        1      1   male 27.00     0     0  100                
## 609         609        1      2 female 22.00     1     2  121                
## 610         610        1      1 female 40.00     0     0    5            C125
## 611         611        0      3 female 39.00     1     5  104                
## 612         612        0      3   male    NA     0     0  155                
## 613         613        1      3 female    NA     1     0   40                
## 614         614        0      3   male    NA     0     0  162                
## 615         615        0      3   male 35.00     0     0  196                
## 616         616        1      2 female 24.00     1     2  150                
## 617         617        0      3   male 34.00     1     1   33                
## 618         618        0      3 female 26.00     1     0   50                
## 619         619        1      2 female  4.00     2     1  117              F4
## 620         620        0      2   male 26.00     0     0    7                
## 621         621        0      3   male 27.00     1     0   35                
## 622         622        1      1   male 42.00     1     0  132             D19
## 623         623        1      3   male 20.00     1     1   48                
## 624         624        0      3   male 21.00     0     0  189                
## 625         625        0      3   male 21.00     0     0   50                
## 626         626        0      1   male 61.00     0     0  107             D50
## 627         627        0      2   male 57.00     0     0   18                
## 628         628        1      1 female 21.00     0     0  187              D9
## 629         629        0      3   male 26.00     0     0  192                
## 630         630        0      3   male    NA     0     0  182                
## 631         631        1      1   male 80.00     0     0   99             A23
## 632         632        0      3   male 51.00     0     0  168                
## 633         633        1      1   male 32.00     0     0  100             B50
## 634         634        0      1   male    NA     0     0    1                
## 635         635        0      3 female  9.00     3     2   92                
## 636         636        1      2 female 28.00     0     0   25                
## 637         637        0      3   male 32.00     0     0  166                
## 638         638        0      2   male 31.00     1     1   83                
## 639         639        0      3 female 41.00     0     5  119                
## 640         640        0      3   male    NA     1     0   50                
## 641         641        0      3   male 20.00     0     0  189                
## 642         642        1      1 female 24.00     0     0  153             B35
## 643         643        0      3 female  2.00     3     2   92                
## 644         644        1      3   male    NA     0     0  137                
## 645         645        1      3 female  0.75     2     1   58                
## 646         646        1      1   male 48.00     1     0  180             D33
## 647         647        0      3   male 19.00     0     0  192                
## 648         648        1      1   male 56.00     0     0  114             A26
## 649         649        0      3   male    NA     0     0  159                
## 650         650        1      3 female 23.00     0     0  159                
## 651         651        0      3   male    NA     0     0  192                
## 652         652        1      2 female 18.00     0     1   75                
## 653         653        0      3   male 21.00     0     0  208                
## 654         654        1      3 female    NA     0     0  188                
## 655         655        0      3 female 18.00     0     0  142                
## 656         656        0      2   male 24.00     2     0  174                
## 657         657        0      3   male    NA     0     0  192                
## 658         658        0      3 female 32.00     1     1   40                
## 659         659        0      2   male 23.00     0     0   25                
## 660         660        0      1   male 58.00     0     2   15             D48
## 661         661        1      1   male 50.00     2     0   27                
## 662         662        0      3   male 40.00     0     0  157                
## 663         663        0      1   male 47.00     0     0   81             E58
## 664         664        0      3   male 36.00     0     0  175                
## 665         665        1      3   male 20.00     1     0  166                
## 666         666        0      2   male 32.00     2     0  174                
## 667         667        0      2   male 25.00     0     0   25                
## 668         668        0      3   male    NA     0     0  163                
## 669         669        0      3   male 43.00     0     0  196                
## 670         670        1      1 female    NA     1     0  131            C126
## 671         671        1      2 female 40.00     1     1  117                
## 672         672        0      1   male 31.00     1     0  131             B71
## 673         673        0      2   male 70.00     0     0    7                
## 674         674        1      2   male 31.00     0     0   25                
## 675         675        0      2   male    NA     0     0    1                
## 676         676        0      3   male 18.00     0     0  163                
## 677         677        0      3   male 24.50     0     0  196                
## 678         678        1      3 female 18.00     0     0  227                
## 679         679        0      3 female 43.00     1     6  123                
## 680         680        1      1   male 36.00     0     1  127     B51 B53 B55
## 681         681        0      3 female    NA     0     0  201                
## 682         682        1      1   male 27.00     0     0  180             D49
## 683         683        0      3   male 20.00     0     0  216                
## 684         684        0      3   male 14.00     5     2  123                
## 685         685        0      2   male 60.00     1     1  117                
## 686         686        0      2   male 25.00     1     2  121                
## 687         687        0      3   male 14.00     4     1  119                
## 688         688        0      3   male 19.00     0     0    8                
## 689         689        0      3   male 18.00     0     0  185                
## 690         690        1      1 female 15.00     0     1   60              B5
## 691         691        1      1   male 31.00     1     0  139             B20
## 692         692        1      3 female  4.00     0     1   28                
## 693         693        1      3   male    NA     0     0  137                
## 694         694        0      3   male 25.00     0     0  157                
## 695         695        0      1   male 60.00     0     0   84                
## 696         696        0      2   male 52.00     0     0   26                
## 697         697        0      3   male 44.00     0     0  196                
## 698         698        1      3 female    NA     0     0  182                
## 699         699        0      1   male 49.00     1     1    2             C68
## 700         700        0      3   male 42.00     0     0  160           F G63
## 701         701        1      1 female 18.00     1     0   74         C62 C64
## 702         702        1      1   male 35.00     0     0   87             E24
## 703         703        0      3 female 18.00     0     1   35                
## 704         704        0      3   male 25.00     0     0  183                
## 705         705        0      3   male 26.00     1     0  189                
## 706         706        0      2   male 39.00     0     0   82                
## 707         707        1      2 female 45.00     0     0   26                
##     Embarked
## 1          S
## 2          C
## 3          S
## 4          S
## 5          S
## 6          Q
## 7          S
## 8          S
## 9          S
## 10         C
## 11         S
## 12         S
## 13         S
## 14         S
## 15         S
## 16         S
## 17         Q
## 18         S
## 19         S
## 20         C
## 21         S
## 22         S
## 23         Q
## 24         S
## 25         S
## 26         S
## 27         C
## 28         S
## 29         Q
## 30         S
## 31         C
## 32         C
## 33         Q
## 34         S
## 35         C
## 36         S
## 37         C
## 38         S
## 39         S
## 40         C
## 41         S
## 42         S
## 43         C
## 44         C
## 45         Q
## 46         S
## 47         Q
## 48         Q
## 49         C
## 50         S
## 51         S
## 52         S
## 53         C
## 54         S
## 55         C
## 56         S
## 57         S
## 58         C
## 59         S
## 60         S
## 61         C
## 62          
## 63         S
## 64         S
## 65         C
## 66         C
## 67         S
## 68         S
## 69         S
## 70         S
## 71         S
## 72         S
## 73         S
## 74         C
## 75         S
## 76         S
## 77         S
## 78         S
## 79         S
## 80         S
## 81         S
## 82         S
## 83         Q
## 84         S
## 85         S
## 86         S
## 87         S
## 88         S
## 89         S
## 90         S
## 91         S
## 92         S
## 93         S
## 94         S
## 95         S
## 96         S
## 97         C
## 98         C
## 99         S
## 100        S
## 101        S
## 102        S
## 103        S
## 104        S
## 105        S
## 106        S
## 107        S
## 108        S
## 109        S
## 110        Q
## 111        S
## 112        C
## 113        S
## 114        S
## 115        C
## 116        S
## 117        Q
## 118        S
## 119        C
## 120        S
## 121        S
## 122        S
## 123        C
## 124        S
## 125        S
## 126        C
## 127        Q
## 128        S
## 129        C
## 130        S
## 131        C
## 132        S
## 133        S
## 134        S
## 135        S
## 136        C
## 137        S
## 138        S
## 139        S
## 140        C
## 141        C
## 142        S
## 143        S
## 144        Q
## 145        S
## 146        S
## 147        S
## 148        S
## 149        S
## 150        S
## 151        S
## 152        S
## 153        S
## 154        S
## 155        S
## 156        C
## 157        Q
## 158        S
## 159        S
## 160        S
## 161        S
## 162        S
## 163        S
## 164        S
## 165        S
## 166        S
## 167        S
## 168        S
## 169        S
## 170        S
## 171        S
## 172        Q
## 173        S
## 174        S
## 175        C
## 176        S
## 177        S
## 178        C
## 179        S
## 180        S
## 181        S
## 182        C
## 183        S
## 184        S
## 185        S
## 186        S
## 187        Q
## 188        S
## 189        Q
## 190        S
## 191        S
## 192        S
## 193        S
## 194        S
## 195        C
## 196        C
## 197        Q
## 198        S
## 199        Q
## 200        S
## 201        S
## 202        S
## 203        S
## 204        C
## 205        S
## 206        S
## 207        S
## 208        C
## 209        Q
## 210        C
## 211        S
## 212        S
## 213        S
## 214        S
## 215        Q
## 216        C
## 217        S
## 218        S
## 219        C
## 220        S
## 221        S
## 222        S
## 223        S
## 224        S
## 225        S
## 226        S
## 227        S
## 228        S
## 229        S
## 230        S
## 231        S
## 232        S
## 233        S
## 234        S
## 235        S
## 236        S
## 237        S
## 238        S
## 239        S
## 240        S
## 241        C
## 242        Q
## 243        S
## 244        S
## 245        C
## 246        Q
## 247        S
## 248        S
## 249        S
## 250        S
## 251        S
## 252        S
## 253        S
## 254        S
## 255        S
## 256        C
## 257        C
## 258        S
## 259        C
## 260        S
## 261        Q
## 262        S
## 263        S
## 264        S
## 265        Q
## 266        S
## 267        S
## 268        S
## 269        S
## 270        S
## 271        S
## 272        S
## 273        S
## 274        C
## 275        Q
## 276        S
## 277        S
## 278        S
## 279        Q
## 280        S
## 281        Q
## 282        S
## 283        S
## 284        S
## 285        S
## 286        C
## 287        S
## 288        S
## 289        S
## 290        Q
## 291        S
## 292        C
## 293        C
## 294        S
## 295        S
## 296        C
## 297        C
## 298        S
## 299        S
## 300        C
## 301        Q
## 302        Q
## 303        S
## 304        Q
## 305        S
## 306        S
## 307        C
## 308        C
## 309        C
## 310        C
## 311        C
## 312        C
## 313        S
## 314        S
## 315        S
## 316        S
## 317        S
## 318        S
## 319        S
## 320        C
## 321        S
## 322        S
## 323        Q
## 324        S
## 325        S
## 326        C
## 327        S
## 328        S
## 329        S
## 330        C
## 331        Q
## 332        S
## 333        S
## 334        S
## 335        S
## 336        S
## 337        S
## 338        C
## 339        S
## 340        S
## 341        S
## 342        S
## 343        S
## 344        S
## 345        S
## 346        S
## 347        S
## 348        S
## 349        S
## 350        S
## 351        S
## 352        S
## 353        C
## 354        S
## 355        C
## 356        S
## 357        S
## 358        S
## 359        Q
## 360        Q
## 361        S
## 362        C
## 363        C
## 364        S
## 365        Q
## 366        S
## 367        C
## 368        C
## 369        Q
## 370        C
## 371        C
## 372        S
## 373        S
## 374        C
## 375        S
## 376        C
## 377        S
## 378        C
## 379        C
## 380        S
## 381        C
## 382        C
## 383        S
## 384        S
## 385        S
## 386        S
## 387        S
## 388        S
## 389        Q
## 390        C
## 391        S
## 392        S
## 393        S
## 394        C
## 395        S
## 396        S
## 397        S
## 398        S
## 399        S
## 400        S
## 401        S
## 402        S
## 403        S
## 404        S
## 405        S
## 406        S
## 407        S
## 408        S
## 409        S
## 410        S
## 411        S
## 412        Q
## 413        Q
## 414        S
## 415        S
## 416        S
## 417        S
## 418        S
## 419        S
## 420        S
## 421        C
## 422        Q
## 423        S
## 424        S
## 425        S
## 426        S
## 427        S
## 428        S
## 429        Q
## 430        S
## 431        S
## 432        S
## 433        S
## 434        S
## 435        S
## 436        S
## 437        S
## 438        S
## 439        S
## 440        S
## 441        S
## 442        S
## 443        S
## 444        S
## 445        S
## 446        S
## 447        S
## 448        S
## 449        C
## 450        S
## 451        S
## 452        S
## 453        C
## 454        C
## 455        S
## 456        C
## 457        S
## 458        S
## 459        S
## 460        Q
## 461        S
## 462        S
## 463        S
## 464        S
## 465        S
## 466        S
## 467        S
## 468        S
## 469        Q
## 470        C
## 471        S
## 472        S
## 473        S
## 474        C
## 475        S
## 476        S
## 477        S
## 478        S
## 479        S
## 480        S
## 481        S
## 482        S
## 483        S
## 484        S
## 485        C
## 486        S
## 487        S
## 488        C
## 489        S
## 490        S
## 491        S
## 492        S
## 493        S
## 494        C
## 495        S
## 496        C
## 497        C
## 498        S
## 499        S
## 500        S
## 501        S
## 502        Q
## 503        Q
## 504        S
## 505        S
## 506        C
## 507        S
## 508        S
## 509        S
## 510        S
## 511        Q
## 512        S
## 513        S
## 514        C
## 515        S
## 516        S
## 517        S
## 518        Q
## 519        S
## 520        S
## 521        S
## 522        S
## 523        C
## 524        C
## 525        C
## 526        Q
## 527        S
## 528        S
## 529        S
## 530        S
## 531        S
## 532        C
## 533        C
## 534        C
## 535        S
## 536        S
## 537        S
## 538        C
## 539        S
## 540        C
## 541        S
## 542        S
## 543        S
## 544        S
## 545        C
## 546        S
## 547        S
## 548        C
## 549        S
## 550        S
## 551        C
## 552        S
## 553        Q
## 554        C
## 555        S
## 556        S
## 557        C
## 558        C
## 559        S
## 560        S
## 561        Q
## 562        S
## 563        S
## 564        S
## 565        S
## 566        S
## 567        S
## 568        S
## 569        C
## 570        S
## 571        S
## 572        S
## 573        S
## 574        Q
## 575        S
## 576        S
## 577        S
## 578        S
## 579        C
## 580        S
## 581        S
## 582        C
## 583        S
## 584        C
## 585        C
## 586        S
## 587        S
## 588        C
## 589        S
## 590        S
## 591        S
## 592        C
## 593        S
## 594        Q
## 595        S
## 596        S
## 597        S
## 598        S
## 599        C
## 600        C
## 601        S
## 602        S
## 603        S
## 604        S
## 605        C
## 606        S
## 607        S
## 608        S
## 609        C
## 610        S
## 611        S
## 612        S
## 613        Q
## 614        Q
## 615        S
## 616        S
## 617        S
## 618        S
## 619        S
## 620        S
## 621        C
## 622        S
## 623        C
## 624        S
## 625        S
## 626        S
## 627        Q
## 628        S
## 629        S
## 630        Q
## 631        S
## 632        S
## 633        C
## 634        S
## 635        S
## 636        S
## 637        S
## 638        S
## 639        S
## 640        S
## 641        S
## 642        C
## 643        S
## 644        S
## 645        C
## 646        C
## 647        S
## 648        C
## 649        S
## 650        S
## 651        S
## 652        S
## 653        S
## 654        Q
## 655        Q
## 656        S
## 657        S
## 658        Q
## 659        S
## 660        C
## 661        S
## 662        C
## 663        S
## 664        S
## 665        S
## 666        S
## 667        S
## 668        S
## 669        S
## 670        S
## 671        S
## 672        S
## 673        S
## 674        S
## 675        S
## 676        S
## 677        S
## 678        S
## 679        S
## 680        C
## 681        Q
## 682        C
## 683        S
## 684        S
## 685        S
## 686        C
## 687        S
## 688        S
## 689        S
## 690        S
## 691        S
## 692        C
## 693        S
## 694        C
## 695        S
## 696        S
## 697        S
## 698        Q
## 699        C
## 700        S
## 701        C
## 702        S
## 703        C
## 704        Q
## 705        S
## 706        S
## 707        S

Mengisi NAN

df$Cabin = ifelse(is.na(df$Cabin),
                  ave(df$Cabin,FUN=function(x) mean(x, na.rm=TRUE)), df$Cabin)

#Di console ketik "view(df)" lalu enter

Normalisasi data

unit_length <- function(x) {
                            x / sqrt(sum(x^2))
                            }

unit_length_df <- as.data.frame(lapply(df, unit_length))
## Warning in Ops.factor(x, 2): '^' not meaningful for factors
## Warning in Ops.factor(x, sqrt(sum(x^2))): '/' not meaningful for factors
## Warning in Ops.factor(x, 2): '^' not meaningful for factors
## Warning in Ops.factor(x, sqrt(sum(x^2))): '/' not meaningful for factors
## Warning in Ops.factor(x, 2): '^' not meaningful for factors
## Warning in Ops.factor(x, sqrt(sum(x^2))): '/' not meaningful for factors
## Warning in Ops.factor(x, 2): '^' not meaningful for factors
## Warning in Ops.factor(x, sqrt(sum(x^2))): '/' not meaningful for factors
head(unit_length_df)
##    PassengerId   Survived     Pclass Name Sex Age      SibSp Parch Ticket
## 1 9.203883e-05 0.00000000 0.04596386   NA  NA  NA 0.03157545     0     NA
## 2 1.840777e-04 0.06041221 0.01532129   NA  NA  NA 0.03157545     0     NA
## 3 2.761165e-04 0.06041221 0.04596386   NA  NA  NA 0.00000000     0     NA
## 4 3.681553e-04 0.06041221 0.01532129   NA  NA  NA 0.03157545     0     NA
## 5 4.601941e-04 0.00000000 0.04596386   NA  NA  NA 0.00000000     0     NA
## 6 5.522330e-04 0.00000000 0.04596386   NA  NA  NA 0.00000000     0     NA
##         Fare       Cabin Embarked
## 1 0.04443788 0.001067588       NA
## 2 0.04809417 0.074731156       NA
## 3 0.04668790 0.001067588       NA
## 4 0.03740657 0.052311809       NA
## 5 0.05512548 0.001067588       NA
## 6 0.05878176 0.001067588       NA

Normalisasi Panjang Unit.Normalisasi Panjang Satuan mengubah x menjadi x ’dengan membagi setiap nilai vektor fitur dengan panjang vektor Euclidean.

##This is the end of Notebooks

Say: Alhamdulillah, panjang banget dah :)