Un conjunto de datos que clasifica 4601 correos electrónicos como spam o no spam. Además de esta etiqueta de clase, existen 57 variables que indican la frecuencia de ciertas palabras y caracteres en el correo electrónico.
Las primeras 48 variables contienen la frecuencia del nombre de la variable en el correo electrónico. Si el nombre de la variable comienza con num (p. ej., num650), indica la frecuencia del número correspondiente (p. ej., 650). Las variables 49-54 indican la frecuencia de los caracteres ‘;’, ‘(’, ‘[’, ‘!’, ‘$’ y ‘#’. Las variables 55-57 contienen la ejecución promedio, más larga y total. longitud de las letras mayúsculas La variable 58 indica el tipo de correo y es “no spam” o “spam”, es decir, correo electrónico comercial no solicitado.
library(kernlab)
Obtenemos el dataset:
data(spam)
Observemos el nombre de las columnas:
names(spam)
## [1] "make" "address" "all"
## [4] "num3d" "our" "over"
## [7] "remove" "internet" "order"
## [10] "mail" "receive" "will"
## [13] "people" "report" "addresses"
## [16] "free" "business" "email"
## [19] "you" "credit" "your"
## [22] "font" "num000" "money"
## [25] "hp" "hpl" "george"
## [28] "num650" "lab" "labs"
## [31] "telnet" "num857" "data"
## [34] "num415" "num85" "technology"
## [37] "num1999" "parts" "pm"
## [40] "direct" "cs" "meeting"
## [43] "original" "project" "re"
## [46] "edu" "table" "conference"
## [49] "charSemicolon" "charRoundbracket" "charSquarebracket"
## [52] "charExclamation" "charDollar" "charHash"
## [55] "capitalAve" "capitalLong" "capitalTotal"
## [58] "type"
Veamos los datos de las primeras filas:
head(spam)
## make address all num3d our over remove internet order mail receive will
## 1 0.00 0.64 0.64 0 0.32 0.00 0.00 0.00 0.00 0.00 0.00 0.64
## 2 0.21 0.28 0.50 0 0.14 0.28 0.21 0.07 0.00 0.94 0.21 0.79
## 3 0.06 0.00 0.71 0 1.23 0.19 0.19 0.12 0.64 0.25 0.38 0.45
## 4 0.00 0.00 0.00 0 0.63 0.00 0.31 0.63 0.31 0.63 0.31 0.31
## 5 0.00 0.00 0.00 0 0.63 0.00 0.31 0.63 0.31 0.63 0.31 0.31
## 6 0.00 0.00 0.00 0 1.85 0.00 0.00 1.85 0.00 0.00 0.00 0.00
## people report addresses free business email you credit your font num000
## 1 0.00 0.00 0.00 0.32 0.00 1.29 1.93 0.00 0.96 0 0.00
## 2 0.65 0.21 0.14 0.14 0.07 0.28 3.47 0.00 1.59 0 0.43
## 3 0.12 0.00 1.75 0.06 0.06 1.03 1.36 0.32 0.51 0 1.16
## 4 0.31 0.00 0.00 0.31 0.00 0.00 3.18 0.00 0.31 0 0.00
## 5 0.31 0.00 0.00 0.31 0.00 0.00 3.18 0.00 0.31 0 0.00
## 6 0.00 0.00 0.00 0.00 0.00 0.00 0.00 0.00 0.00 0 0.00
## money hp hpl george num650 lab labs telnet num857 data num415 num85
## 1 0.00 0 0 0 0 0 0 0 0 0 0 0
## 2 0.43 0 0 0 0 0 0 0 0 0 0 0
## 3 0.06 0 0 0 0 0 0 0 0 0 0 0
## 4 0.00 0 0 0 0 0 0 0 0 0 0 0
## 5 0.00 0 0 0 0 0 0 0 0 0 0 0
## 6 0.00 0 0 0 0 0 0 0 0 0 0 0
## technology num1999 parts pm direct cs meeting original project re edu
## 1 0 0.00 0 0 0.00 0 0 0.00 0 0.00 0.00
## 2 0 0.07 0 0 0.00 0 0 0.00 0 0.00 0.00
## 3 0 0.00 0 0 0.06 0 0 0.12 0 0.06 0.06
## 4 0 0.00 0 0 0.00 0 0 0.00 0 0.00 0.00
## 5 0 0.00 0 0 0.00 0 0 0.00 0 0.00 0.00
## 6 0 0.00 0 0 0.00 0 0 0.00 0 0.00 0.00
## table conference charSemicolon charRoundbracket charSquarebracket
## 1 0 0 0.00 0.000 0
## 2 0 0 0.00 0.132 0
## 3 0 0 0.01 0.143 0
## 4 0 0 0.00 0.137 0
## 5 0 0 0.00 0.135 0
## 6 0 0 0.00 0.223 0
## charExclamation charDollar charHash capitalAve capitalLong capitalTotal type
## 1 0.778 0.000 0.000 3.756 61 278 spam
## 2 0.372 0.180 0.048 5.114 101 1028 spam
## 3 0.276 0.184 0.010 9.821 485 2259 spam
## 4 0.137 0.000 0.000 3.537 40 191 spam
## 5 0.135 0.000 0.000 3.537 40 191 spam
## 6 0.000 0.000 0.000 3.000 15 54 spam
La estructura del dataset:
str(spam)
## 'data.frame': 4601 obs. of 58 variables:
## $ make : num 0 0.21 0.06 0 0 0 0 0 0.15 0.06 ...
## $ address : num 0.64 0.28 0 0 0 0 0 0 0 0.12 ...
## $ all : num 0.64 0.5 0.71 0 0 0 0 0 0.46 0.77 ...
## $ num3d : num 0 0 0 0 0 0 0 0 0 0 ...
## $ our : num 0.32 0.14 1.23 0.63 0.63 1.85 1.92 1.88 0.61 0.19 ...
## $ over : num 0 0.28 0.19 0 0 0 0 0 0 0.32 ...
## $ remove : num 0 0.21 0.19 0.31 0.31 0 0 0 0.3 0.38 ...
## $ internet : num 0 0.07 0.12 0.63 0.63 1.85 0 1.88 0 0 ...
## $ order : num 0 0 0.64 0.31 0.31 0 0 0 0.92 0.06 ...
## $ mail : num 0 0.94 0.25 0.63 0.63 0 0.64 0 0.76 0 ...
## $ receive : num 0 0.21 0.38 0.31 0.31 0 0.96 0 0.76 0 ...
## $ will : num 0.64 0.79 0.45 0.31 0.31 0 1.28 0 0.92 0.64 ...
## $ people : num 0 0.65 0.12 0.31 0.31 0 0 0 0 0.25 ...
## $ report : num 0 0.21 0 0 0 0 0 0 0 0 ...
## $ addresses : num 0 0.14 1.75 0 0 0 0 0 0 0.12 ...
## $ free : num 0.32 0.14 0.06 0.31 0.31 0 0.96 0 0 0 ...
## $ business : num 0 0.07 0.06 0 0 0 0 0 0 0 ...
## $ email : num 1.29 0.28 1.03 0 0 0 0.32 0 0.15 0.12 ...
## $ you : num 1.93 3.47 1.36 3.18 3.18 0 3.85 0 1.23 1.67 ...
## $ credit : num 0 0 0.32 0 0 0 0 0 3.53 0.06 ...
## $ your : num 0.96 1.59 0.51 0.31 0.31 0 0.64 0 2 0.71 ...
## $ font : num 0 0 0 0 0 0 0 0 0 0 ...
## $ num000 : num 0 0.43 1.16 0 0 0 0 0 0 0.19 ...
## $ money : num 0 0.43 0.06 0 0 0 0 0 0.15 0 ...
## $ hp : num 0 0 0 0 0 0 0 0 0 0 ...
## $ hpl : num 0 0 0 0 0 0 0 0 0 0 ...
## $ george : num 0 0 0 0 0 0 0 0 0 0 ...
## $ num650 : num 0 0 0 0 0 0 0 0 0 0 ...
## $ lab : num 0 0 0 0 0 0 0 0 0 0 ...
## $ labs : num 0 0 0 0 0 0 0 0 0 0 ...
## $ telnet : num 0 0 0 0 0 0 0 0 0 0 ...
## $ num857 : num 0 0 0 0 0 0 0 0 0 0 ...
## $ data : num 0 0 0 0 0 0 0 0 0.15 0 ...
## $ num415 : num 0 0 0 0 0 0 0 0 0 0 ...
## $ num85 : num 0 0 0 0 0 0 0 0 0 0 ...
## $ technology : num 0 0 0 0 0 0 0 0 0 0 ...
## $ num1999 : num 0 0.07 0 0 0 0 0 0 0 0 ...
## $ parts : num 0 0 0 0 0 0 0 0 0 0 ...
## $ pm : num 0 0 0 0 0 0 0 0 0 0 ...
## $ direct : num 0 0 0.06 0 0 0 0 0 0 0 ...
## $ cs : num 0 0 0 0 0 0 0 0 0 0 ...
## $ meeting : num 0 0 0 0 0 0 0 0 0 0 ...
## $ original : num 0 0 0.12 0 0 0 0 0 0.3 0 ...
## $ project : num 0 0 0 0 0 0 0 0 0 0.06 ...
## $ re : num 0 0 0.06 0 0 0 0 0 0 0 ...
## $ edu : num 0 0 0.06 0 0 0 0 0 0 0 ...
## $ table : num 0 0 0 0 0 0 0 0 0 0 ...
## $ conference : num 0 0 0 0 0 0 0 0 0 0 ...
## $ charSemicolon : num 0 0 0.01 0 0 0 0 0 0 0.04 ...
## $ charRoundbracket : num 0 0.132 0.143 0.137 0.135 0.223 0.054 0.206 0.271 0.03 ...
## $ charSquarebracket: num 0 0 0 0 0 0 0 0 0 0 ...
## $ charExclamation : num 0.778 0.372 0.276 0.137 0.135 0 0.164 0 0.181 0.244 ...
## $ charDollar : num 0 0.18 0.184 0 0 0 0.054 0 0.203 0.081 ...
## $ charHash : num 0 0.048 0.01 0 0 0 0 0 0.022 0 ...
## $ capitalAve : num 3.76 5.11 9.82 3.54 3.54 ...
## $ capitalLong : num 61 101 485 40 40 15 4 11 445 43 ...
## $ capitalTotal : num 278 1028 2259 191 191 ...
## $ type : Factor w/ 2 levels "nonspam","spam": 2 2 2 2 2 2 2 2 2 2 ...
Hechamos un vistaso a la variable de interes:
table(spam$type)
##
## nonspam spam
## 2788 1813
spam$type <- ifelse(spam$type== "spam", 1, 0)
table(spam$type)
##
## 0 1
## 2788 1813
set.seed(111)
split <- sort(sample(nrow(spam), nrow(spam)*0.7))
training <- spam[split,]
testing <- spam[-split,]
library(dplyr)
##
## Attaching package: 'dplyr'
## The following objects are masked from 'package:stats':
##
## filter, lag
## The following objects are masked from 'package:base':
##
## intersect, setdiff, setequal, union
library(MASS)
##
## Attaching package: 'MASS'
## The following object is masked from 'package:dplyr':
##
## select
base.model = glm(type ~ . , data=spam)
model = glm(type ~ . , data=spam) %>% stepAIC(trace =TRUE)
## Start: AIC=2808.67
## type ~ make + address + all + num3d + our + over + remove + internet +
## order + mail + receive + will + people + report + addresses +
## free + business + email + you + credit + your + font + num000 +
## money + hp + hpl + george + num650 + lab + labs + telnet +
## num857 + data + num415 + num85 + technology + num1999 + parts +
## pm + direct + cs + meeting + original + project + re + edu +
## table + conference + charSemicolon + charRoundbracket + charSquarebracket +
## charExclamation + charDollar + charHash + capitalAve + capitalLong +
## capitalTotal
##
## Df Deviance AIC
## - num857 1 483.45 2806.7
## - num415 1 483.46 2806.8
## - num650 1 483.46 2806.8
## - report 1 483.46 2806.8
## - cs 1 483.49 2807.0
## - lab 1 483.50 2807.1
## - people 1 483.51 2807.2
## - addresses 1 483.53 2807.4
## - telnet 1 483.61 2808.1
## - capitalAve 1 483.62 2808.3
## - charSquarebracket 1 483.64 2808.4
## - technology 1 483.65 2808.5
## <none> 483.45 2808.7
## - direct 1 483.69 2808.9
## - pm 1 483.76 2809.6
## - capitalLong 1 483.80 2809.9
## - mail 1 483.85 2810.5
## - receive 1 483.95 2811.4
## - parts 1 484.05 2812.4
## - charHash 1 484.06 2812.4
## - num85 1 484.15 2813.3
## - num1999 1 484.18 2813.6
## - original 1 484.22 2814.0
## - charRoundbracket 1 484.22 2814.0
## - make 1 484.39 2815.6
## - table 1 484.46 2816.2
## - address 1 484.53 2816.9
## - hpl 1 484.55 2817.1
## - labs 1 484.56 2817.2
## - conference 1 484.71 2818.6
## - num3d 1 484.71 2818.7
## - order 1 485.01 2821.4
## - all 1 485.08 2822.1
## - project 1 485.26 2823.8
## - business 1 485.47 2825.9
## - you 1 485.66 2827.6
## - will 1 485.82 2829.2
## - data 1 485.85 2829.4
## - meeting 1 485.96 2830.5
## - email 1 486.88 2839.2
## - credit 1 487.64 2846.3
## - hp 1 487.73 2847.2
## - charSemicolon 1 487.75 2847.4
## - over 1 487.88 2848.6
## - edu 1 488.01 2849.9
## - re 1 488.83 2857.6
## - internet 1 489.40 2862.9
## - money 1 490.18 2870.3
## - george 1 490.42 2872.5
## - capitalTotal 1 490.73 2875.4
## - font 1 490.92 2877.2
## - charDollar 1 495.73 2922.1
## - num000 1 496.30 2927.3
## - charExclamation 1 496.54 2929.6
## - our 1 496.62 2930.3
## - your 1 497.04 2934.2
## - free 1 499.97 2961.3
## - remove 1 511.87 3069.5
##
## Step: AIC=2806.67
## type ~ make + address + all + num3d + our + over + remove + internet +
## order + mail + receive + will + people + report + addresses +
## free + business + email + you + credit + your + font + num000 +
## money + hp + hpl + george + num650 + lab + labs + telnet +
## data + num415 + num85 + technology + num1999 + parts + pm +
## direct + cs + meeting + original + project + re + edu + table +
## conference + charSemicolon + charRoundbracket + charSquarebracket +
## charExclamation + charDollar + charHash + capitalAve + capitalLong +
## capitalTotal
##
## Df Deviance AIC
## - num650 1 483.46 2804.8
## - report 1 483.46 2804.8
## - cs 1 483.49 2805.0
## - lab 1 483.50 2805.1
## - people 1 483.51 2805.2
## - addresses 1 483.53 2805.4
## - telnet 1 483.61 2806.1
## - capitalAve 1 483.62 2806.3
## - charSquarebracket 1 483.64 2806.4
## - technology 1 483.65 2806.5
## <none> 483.45 2806.7
## - direct 1 483.69 2807.0
## - pm 1 483.76 2807.6
## - num415 1 483.77 2807.7
## - capitalLong 1 483.80 2807.9
## - mail 1 483.85 2808.5
## - receive 1 483.95 2809.4
## - parts 1 484.05 2810.4
## - charHash 1 484.06 2810.4
## - num85 1 484.15 2811.3
## - num1999 1 484.18 2811.6
## - original 1 484.22 2812.0
## - charRoundbracket 1 484.22 2812.0
## - make 1 484.39 2813.6
## - table 1 484.46 2814.2
## - address 1 484.53 2814.9
## - hpl 1 484.55 2815.1
## - labs 1 484.56 2815.2
## - conference 1 484.71 2816.6
## - num3d 1 484.71 2816.7
## - order 1 485.01 2819.4
## - all 1 485.08 2820.1
## - project 1 485.26 2821.8
## - business 1 485.47 2823.9
## - you 1 485.66 2825.6
## - will 1 485.82 2827.2
## - data 1 485.85 2827.4
## - meeting 1 485.96 2828.5
## - email 1 486.88 2837.2
## - credit 1 487.64 2844.4
## - hp 1 487.73 2845.2
## - charSemicolon 1 487.75 2845.4
## - over 1 487.88 2846.6
## - edu 1 488.01 2847.9
## - re 1 488.83 2855.6
## - internet 1 489.40 2860.9
## - money 1 490.19 2868.3
## - george 1 490.42 2870.5
## - capitalTotal 1 490.73 2873.4
## - font 1 490.92 2875.2
## - charDollar 1 495.74 2920.2
## - num000 1 496.31 2925.4
## - charExclamation 1 496.55 2927.6
## - our 1 496.62 2928.3
## - your 1 497.05 2932.2
## - free 1 499.97 2959.3
## - remove 1 511.88 3067.5
##
## Step: AIC=2804.77
## type ~ make + address + all + num3d + our + over + remove + internet +
## order + mail + receive + will + people + report + addresses +
## free + business + email + you + credit + your + font + num000 +
## money + hp + hpl + george + lab + labs + telnet + data +
## num415 + num85 + technology + num1999 + parts + pm + direct +
## cs + meeting + original + project + re + edu + table + conference +
## charSemicolon + charRoundbracket + charSquarebracket + charExclamation +
## charDollar + charHash + capitalAve + capitalLong + capitalTotal
##
## Df Deviance AIC
## - report 1 483.47 2802.9
## - cs 1 483.50 2803.1
## - lab 1 483.51 2803.2
## - people 1 483.52 2803.3
## - addresses 1 483.54 2803.5
## - telnet 1 483.61 2804.2
## - capitalAve 1 483.64 2804.4
## - charSquarebracket 1 483.65 2804.5
## <none> 483.46 2804.8
## - technology 1 483.68 2804.8
## - direct 1 483.71 2805.1
## - pm 1 483.77 2805.7
## - num415 1 483.78 2805.8
## - capitalLong 1 483.80 2806.0
## - mail 1 483.87 2806.7
## - receive 1 483.96 2807.5
## - charHash 1 484.06 2808.5
## - parts 1 484.07 2808.5
## - num85 1 484.17 2809.5
## - num1999 1 484.21 2809.8
## - original 1 484.23 2810.0
## - charRoundbracket 1 484.23 2810.0
## - make 1 484.40 2811.7
## - table 1 484.47 2812.4
## - address 1 484.54 2813.0
## - hpl 1 484.55 2813.1
## - labs 1 484.56 2813.2
## - conference 1 484.72 2814.7
## - num3d 1 484.72 2814.8
## - order 1 485.02 2817.5
## - all 1 485.09 2818.2
## - project 1 485.27 2819.9
## - business 1 485.48 2821.9
## - you 1 485.67 2823.7
## - will 1 485.84 2825.4
## - data 1 485.86 2825.5
## - meeting 1 485.98 2826.7
## - email 1 486.93 2835.6
## - credit 1 487.65 2842.5
## - hp 1 487.74 2843.3
## - charSemicolon 1 487.77 2843.6
## - over 1 487.89 2844.7
## - edu 1 488.02 2845.9
## - re 1 488.84 2853.7
## - internet 1 489.41 2859.0
## - money 1 490.19 2866.4
## - george 1 490.43 2868.6
## - capitalTotal 1 490.74 2871.5
## - font 1 490.94 2873.4
## - charDollar 1 495.75 2918.2
## - num000 1 496.32 2923.5
## - charExclamation 1 496.55 2925.7
## - our 1 496.63 2926.4
## - your 1 497.06 2930.3
## - free 1 499.98 2957.3
## - remove 1 511.88 3065.6
##
## Step: AIC=2802.88
## type ~ make + address + all + num3d + our + over + remove + internet +
## order + mail + receive + will + people + addresses + free +
## business + email + you + credit + your + font + num000 +
## money + hp + hpl + george + lab + labs + telnet + data +
## num415 + num85 + technology + num1999 + parts + pm + direct +
## cs + meeting + original + project + re + edu + table + conference +
## charSemicolon + charRoundbracket + charSquarebracket + charExclamation +
## charDollar + charHash + capitalAve + capitalLong + capitalTotal
##
## Df Deviance AIC
## - cs 1 483.51 2801.2
## - lab 1 483.52 2801.3
## - people 1 483.53 2801.4
## - addresses 1 483.55 2801.6
## - telnet 1 483.63 2802.3
## - capitalAve 1 483.65 2802.5
## - charSquarebracket 1 483.66 2802.6
## <none> 483.47 2802.9
## - technology 1 483.69 2803.0
## - direct 1 483.72 2803.2
## - pm 1 483.78 2803.8
## - num415 1 483.79 2803.9
## - capitalLong 1 483.81 2804.1
## - mail 1 483.88 2804.8
## - receive 1 483.98 2805.7
## - charHash 1 484.08 2806.6
## - parts 1 484.08 2806.6
## - num85 1 484.18 2807.6
## - num1999 1 484.22 2808.0
## - original 1 484.23 2808.1
## - charRoundbracket 1 484.24 2808.2
## - make 1 484.41 2809.8
## - table 1 484.48 2810.4
## - address 1 484.55 2811.1
## - hpl 1 484.57 2811.3
## - labs 1 484.58 2811.3
## - conference 1 484.73 2812.8
## - num3d 1 484.74 2812.9
## - order 1 485.04 2815.7
## - all 1 485.10 2816.3
## - project 1 485.28 2818.1
## - business 1 485.49 2820.0
## - you 1 485.68 2821.8
## - will 1 485.86 2823.5
## - data 1 485.88 2823.7
## - meeting 1 485.99 2824.8
## - email 1 486.93 2833.7
## - credit 1 487.67 2840.7
## - hp 1 487.76 2841.5
## - charSemicolon 1 487.79 2841.8
## - over 1 487.90 2842.8
## - edu 1 488.04 2844.1
## - re 1 488.85 2851.7
## - internet 1 489.42 2857.1
## - money 1 490.21 2864.5
## - george 1 490.45 2866.8
## - font 1 490.94 2871.4
## - capitalTotal 1 490.97 2871.7
## - charDollar 1 495.81 2916.8
## - num000 1 496.35 2921.8
## - charExclamation 1 496.55 2923.7
## - our 1 496.64 2924.5
## - your 1 497.09 2928.7
## - free 1 499.99 2955.4
## - remove 1 511.88 3063.6
##
## Step: AIC=2801.23
## type ~ make + address + all + num3d + our + over + remove + internet +
## order + mail + receive + will + people + addresses + free +
## business + email + you + credit + your + font + num000 +
## money + hp + hpl + george + lab + labs + telnet + data +
## num415 + num85 + technology + num1999 + parts + pm + direct +
## meeting + original + project + re + edu + table + conference +
## charSemicolon + charRoundbracket + charSquarebracket + charExclamation +
## charDollar + charHash + capitalAve + capitalLong + capitalTotal
##
## Df Deviance AIC
## - lab 1 483.56 2799.7
## - people 1 483.57 2799.8
## - addresses 1 483.58 2799.9
## - telnet 1 483.66 2800.6
## - capitalAve 1 483.68 2800.9
## - charSquarebracket 1 483.70 2801.0
## <none> 483.51 2801.2
## - technology 1 483.74 2801.4
## - direct 1 483.75 2801.5
## - pm 1 483.82 2802.1
## - num415 1 483.83 2802.3
## - capitalLong 1 483.85 2802.4
## - mail 1 483.92 2803.2
## - receive 1 484.02 2804.0
## - parts 1 484.11 2805.0
## - charHash 1 484.11 2805.0
## - num85 1 484.23 2806.0
## - num1999 1 484.27 2806.4
## - charRoundbracket 1 484.28 2806.5
## - original 1 484.30 2806.7
## - make 1 484.46 2808.2
## - table 1 484.51 2808.7
## - address 1 484.59 2809.4
## - hpl 1 484.60 2809.6
## - labs 1 484.64 2810.0
## - conference 1 484.77 2811.2
## - num3d 1 484.78 2811.2
## - order 1 485.08 2814.1
## - all 1 485.13 2814.6
## - project 1 485.32 2816.4
## - business 1 485.53 2818.4
## - you 1 485.74 2820.3
## - will 1 485.90 2821.9
## - data 1 485.91 2822.0
## - meeting 1 486.02 2823.0
## - email 1 486.97 2832.1
## - credit 1 487.71 2839.0
## - hp 1 487.78 2839.7
## - charSemicolon 1 487.88 2840.6
## - over 1 487.94 2841.2
## - re 1 488.88 2850.0
## - edu 1 488.95 2850.7
## - internet 1 489.44 2855.3
## - money 1 490.25 2862.9
## - george 1 490.48 2865.0
## - capitalTotal 1 491.01 2870.0
## - font 1 491.01 2870.0
## - charDollar 1 495.85 2915.2
## - num000 1 496.38 2920.1
## - charExclamation 1 496.59 2922.1
## - our 1 496.68 2922.9
## - your 1 497.14 2927.1
## - free 1 500.03 2953.8
## - remove 1 511.91 3061.8
##
## Step: AIC=2799.65
## type ~ make + address + all + num3d + our + over + remove + internet +
## order + mail + receive + will + people + addresses + free +
## business + email + you + credit + your + font + num000 +
## money + hp + hpl + george + labs + telnet + data + num415 +
## num85 + technology + num1999 + parts + pm + direct + meeting +
## original + project + re + edu + table + conference + charSemicolon +
## charRoundbracket + charSquarebracket + charExclamation +
## charDollar + charHash + capitalAve + capitalLong + capitalTotal
##
## Df Deviance AIC
## - people 1 483.61 2798.2
## - addresses 1 483.63 2798.3
## - telnet 1 483.71 2799.1
## - capitalAve 1 483.73 2799.3
## - charSquarebracket 1 483.74 2799.4
## <none> 483.56 2799.7
## - technology 1 483.77 2799.7
## - direct 1 483.78 2799.8
## - num415 1 483.84 2800.4
## - pm 1 483.85 2800.5
## - capitalLong 1 483.90 2800.9
## - mail 1 483.97 2801.6
## - receive 1 484.06 2802.5
## - parts 1 484.13 2803.1
## - charHash 1 484.16 2803.4
## - num85 1 484.27 2804.5
## - num1999 1 484.31 2804.8
## - charRoundbracket 1 484.33 2805.0
## - original 1 484.34 2805.1
## - make 1 484.49 2806.6
## - table 1 484.55 2807.1
## - address 1 484.63 2807.8
## - hpl 1 484.65 2808.0
## - labs 1 484.71 2808.6
## - conference 1 484.81 2809.6
## - num3d 1 484.82 2809.7
## - order 1 485.12 2812.5
## - all 1 485.20 2813.2
## - project 1 485.35 2814.7
## - business 1 485.59 2816.9
## - you 1 485.79 2818.8
## - data 1 485.96 2820.4
## - will 1 486.04 2821.2
## - email 1 487.04 2830.7
## - meeting 1 487.38 2833.9
## - credit 1 487.75 2837.4
## - hp 1 487.83 2838.1
## - charSemicolon 1 487.92 2839.0
## - over 1 487.99 2839.6
## - re 1 488.91 2848.3
## - edu 1 488.99 2849.1
## - internet 1 489.48 2853.7
## - money 1 490.29 2861.3
## - george 1 490.52 2863.4
## - font 1 491.05 2868.4
## - capitalTotal 1 491.06 2868.5
## - charDollar 1 495.92 2913.8
## - num000 1 496.42 2918.4
## - charExclamation 1 496.63 2920.4
## - our 1 496.72 2921.2
## - your 1 497.17 2925.4
## - free 1 500.07 2952.2
## - remove 1 511.96 3060.2
##
## Step: AIC=2798.17
## type ~ make + address + all + num3d + our + over + remove + internet +
## order + mail + receive + will + addresses + free + business +
## email + you + credit + your + font + num000 + money + hp +
## hpl + george + labs + telnet + data + num415 + num85 + technology +
## num1999 + parts + pm + direct + meeting + original + project +
## re + edu + table + conference + charSemicolon + charRoundbracket +
## charSquarebracket + charExclamation + charDollar + charHash +
## capitalAve + capitalLong + capitalTotal
##
## Df Deviance AIC
## - addresses 1 483.68 2796.9
## - telnet 1 483.76 2797.6
## - capitalAve 1 483.78 2797.8
## - charSquarebracket 1 483.80 2798.0
## <none> 483.61 2798.2
## - technology 1 483.82 2798.2
## - direct 1 483.84 2798.4
## - num415 1 483.90 2798.9
## - pm 1 483.90 2798.9
## - capitalLong 1 483.95 2799.4
## - mail 1 484.03 2800.2
## - receive 1 484.12 2801.1
## - parts 1 484.19 2801.7
## - charHash 1 484.21 2801.8
## - num85 1 484.34 2803.1
## - num1999 1 484.37 2803.4
## - original 1 484.39 2803.6
## - charRoundbracket 1 484.41 2803.8
## - make 1 484.54 2805.0
## - table 1 484.61 2805.7
## - address 1 484.70 2806.5
## - hpl 1 484.71 2806.6
## - labs 1 484.77 2807.2
## - num3d 1 484.86 2808.1
## - conference 1 484.87 2808.2
## - order 1 485.16 2810.9
## - all 1 485.26 2811.8
## - project 1 485.41 2813.3
## - business 1 485.66 2815.6
## - you 1 485.90 2817.9
## - data 1 486.04 2819.2
## - will 1 486.11 2819.9
## - email 1 487.13 2829.6
## - meeting 1 487.45 2832.6
## - credit 1 487.78 2835.6
## - hp 1 487.88 2836.7
## - charSemicolon 1 487.98 2837.6
## - over 1 488.07 2838.4
## - re 1 489.02 2847.3
## - edu 1 489.06 2847.7
## - internet 1 489.54 2852.2
## - money 1 490.39 2860.3
## - george 1 490.62 2862.4
## - font 1 491.07 2866.6
## - capitalTotal 1 491.22 2868.0
## - num000 1 496.50 2917.2
## - charDollar 1 496.56 2917.7
## - charExclamation 1 496.68 2918.9
## - our 1 496.79 2919.8
## - your 1 497.18 2923.5
## - free 1 500.09 2950.3
## - remove 1 511.96 3058.3
##
## Step: AIC=2796.87
## type ~ make + address + all + num3d + our + over + remove + internet +
## order + mail + receive + will + free + business + email +
## you + credit + your + font + num000 + money + hp + hpl +
## george + labs + telnet + data + num415 + num85 + technology +
## num1999 + parts + pm + direct + meeting + original + project +
## re + edu + table + conference + charSemicolon + charRoundbracket +
## charSquarebracket + charExclamation + charDollar + charHash +
## capitalAve + capitalLong + capitalTotal
##
## Df Deviance AIC
## - capitalAve 1 483.83 2796.3
## - telnet 1 483.85 2796.4
## - charSquarebracket 1 483.88 2796.7
## - technology 1 483.89 2796.8
## <none> 483.68 2796.9
## - direct 1 483.95 2797.4
## - num415 1 483.97 2797.6
## - pm 1 483.97 2797.6
## - capitalLong 1 484.09 2798.8
## - mail 1 484.15 2799.3
## - receive 1 484.20 2799.7
## - parts 1 484.27 2800.4
## - charHash 1 484.27 2800.4
## - num85 1 484.41 2801.8
## - original 1 484.43 2802.0
## - num1999 1 484.46 2802.2
## - charRoundbracket 1 484.52 2802.8
## - make 1 484.64 2803.9
## - table 1 484.69 2804.4
## - address 1 484.77 2805.2
## - hpl 1 484.78 2805.3
## - labs 1 484.86 2806.1
## - num3d 1 484.93 2806.8
## - conference 1 484.95 2806.9
## - all 1 485.37 2810.9
## - order 1 485.39 2811.1
## - project 1 485.48 2811.9
## - business 1 485.69 2813.9
## - you 1 485.95 2816.3
## - data 1 486.10 2817.8
## - will 1 486.18 2818.6
## - meeting 1 487.53 2831.3
## - email 1 487.62 2832.1
## - credit 1 487.82 2834.1
## - hp 1 487.96 2835.3
## - charSemicolon 1 488.06 2836.3
## - over 1 488.26 2838.2
## - re 1 489.09 2846.0
## - edu 1 489.11 2846.2
## - internet 1 489.64 2851.2
## - money 1 490.46 2858.9
## - george 1 490.68 2861.0
## - font 1 491.14 2865.3
## - capitalTotal 1 491.26 2866.4
## - charDollar 1 496.58 2916.0
## - charExclamation 1 496.72 2917.2
## - our 1 496.92 2919.1
## - your 1 497.22 2921.8
## - num000 1 498.60 2934.6
## - free 1 500.10 2948.5
## - remove 1 512.01 3056.8
##
## Step: AIC=2796.26
## type ~ make + address + all + num3d + our + over + remove + internet +
## order + mail + receive + will + free + business + email +
## you + credit + your + font + num000 + money + hp + hpl +
## george + labs + telnet + data + num415 + num85 + technology +
## num1999 + parts + pm + direct + meeting + original + project +
## re + edu + table + conference + charSemicolon + charRoundbracket +
## charSquarebracket + charExclamation + charDollar + charHash +
## capitalLong + capitalTotal
##
## Df Deviance AIC
## - telnet 1 483.99 2795.8
## - charSquarebracket 1 484.03 2796.1
## - technology 1 484.03 2796.2
## <none> 483.83 2796.3
## - direct 1 484.09 2796.7
## - pm 1 484.11 2796.9
## - num415 1 484.13 2797.2
## - mail 1 484.32 2798.9
## - receive 1 484.33 2799.0
## - charHash 1 484.41 2799.7
## - parts 1 484.42 2799.9
## - num85 1 484.56 2801.2
## - num1999 1 484.59 2801.4
## - original 1 484.59 2801.5
## - make 1 484.76 2803.1
## - capitalLong 1 484.77 2803.2
## - table 1 484.84 2803.8
## - charRoundbracket 1 484.87 2804.2
## - hpl 1 484.93 2804.7
## - address 1 484.93 2804.7
## - labs 1 485.01 2805.4
## - num3d 1 485.07 2806.1
## - conference 1 485.10 2806.3
## - order 1 485.56 2810.7
## - all 1 485.59 2811.0
## - project 1 485.63 2811.4
## - business 1 485.84 2813.3
## - you 1 486.03 2815.1
## - data 1 486.24 2817.1
## - will 1 486.37 2818.4
## - email 1 487.69 2830.8
## - meeting 1 487.70 2830.9
## - credit 1 487.99 2833.7
## - hp 1 488.09 2834.6
## - charSemicolon 1 488.24 2836.1
## - over 1 488.33 2836.9
## - re 1 489.25 2845.5
## - edu 1 489.26 2845.7
## - internet 1 489.77 2850.4
## - money 1 490.59 2858.1
## - george 1 490.89 2860.9
## - capitalTotal 1 491.27 2864.4
## - font 1 491.33 2865.0
## - charDollar 1 496.79 2915.8
## - charExclamation 1 496.90 2916.9
## - our 1 497.15 2919.2
## - your 1 497.34 2921.0
## - num000 1 498.62 2932.8
## - free 1 500.23 2947.7
## - remove 1 512.19 3056.3
##
## Step: AIC=2795.8
## type ~ make + address + all + num3d + our + over + remove + internet +
## order + mail + receive + will + free + business + email +
## you + credit + your + font + num000 + money + hp + hpl +
## george + labs + data + num415 + num85 + technology + num1999 +
## parts + pm + direct + meeting + original + project + re +
## edu + table + conference + charSemicolon + charRoundbracket +
## charSquarebracket + charExclamation + charDollar + charHash +
## capitalLong + capitalTotal
##
## Df Deviance AIC
## - technology 1 484.13 2795.1
## - charSquarebracket 1 484.19 2795.7
## - direct 1 484.19 2795.7
## <none> 483.99 2795.8
## - num415 1 484.22 2796.0
## - pm 1 484.28 2796.5
## - receive 1 484.48 2798.5
## - mail 1 484.49 2798.5
## - charHash 1 484.57 2799.3
## - parts 1 484.58 2799.4
## - num1999 1 484.75 2801.0
## - original 1 484.76 2801.1
## - num85 1 484.80 2801.4
## - make 1 484.92 2802.6
## - capitalLong 1 484.93 2802.7
## - table 1 484.99 2803.3
## - charRoundbracket 1 485.03 2803.7
## - address 1 485.09 2804.2
## - hpl 1 485.12 2804.5
## - num3d 1 485.24 2805.6
## - conference 1 485.24 2805.7
## - labs 1 485.29 2806.1
## - order 1 485.73 2810.3
## - all 1 485.74 2810.3
## - project 1 485.95 2812.4
## - business 1 486.02 2813.1
## - you 1 486.19 2814.6
## - data 1 486.40 2816.6
## - will 1 486.52 2817.8
## - email 1 487.77 2829.6
## - meeting 1 487.86 2830.4
## - credit 1 488.16 2833.2
## - hp 1 488.28 2834.3
## - charSemicolon 1 488.40 2835.5
## - over 1 488.51 2836.5
## - re 1 489.39 2844.8
## - edu 1 489.42 2845.1
## - internet 1 489.94 2850.0
## - money 1 490.75 2857.6
## - george 1 491.06 2860.5
## - capitalTotal 1 491.44 2864.0
## - font 1 491.47 2864.4
## - charDollar 1 496.96 2915.4
## - charExclamation 1 497.06 2916.4
## - our 1 497.36 2919.1
## - your 1 497.53 2920.8
## - num000 1 498.80 2932.5
## - free 1 500.40 2947.2
## - remove 1 512.38 3056.1
##
## Step: AIC=2795.14
## type ~ make + address + all + num3d + our + over + remove + internet +
## order + mail + receive + will + free + business + email +
## you + credit + your + font + num000 + money + hp + hpl +
## george + labs + data + num415 + num85 + num1999 + parts +
## pm + direct + meeting + original + project + re + edu + table +
## conference + charSemicolon + charRoundbracket + charSquarebracket +
## charExclamation + charDollar + charHash + capitalLong + capitalTotal
##
## Df Deviance AIC
## - charSquarebracket 1 484.34 2795.1
## <none> 484.13 2795.1
## - direct 1 484.38 2795.5
## - pm 1 484.40 2795.6
## - num415 1 484.51 2796.7
## - receive 1 484.63 2797.8
## - mail 1 484.63 2797.9
## - charHash 1 484.71 2798.6
## - parts 1 484.72 2798.8
## - num85 1 484.88 2800.2
## - num1999 1 484.89 2800.3
## - original 1 484.91 2800.6
## - capitalLong 1 485.06 2801.9
## - make 1 485.07 2802.1
## - table 1 485.14 2802.7
## - charRoundbracket 1 485.17 2802.9
## - hpl 1 485.24 2803.7
## - address 1 485.26 2803.8
## - labs 1 485.32 2804.4
## - conference 1 485.37 2804.9
## - num3d 1 485.38 2804.9
## - order 1 485.84 2809.4
## - all 1 485.89 2809.8
## - project 1 486.10 2811.8
## - business 1 486.19 2812.6
## - you 1 486.26 2813.3
## - data 1 486.55 2816.0
## - will 1 486.62 2816.7
## - email 1 488.05 2830.2
## - meeting 1 488.08 2830.5
## - credit 1 488.28 2832.4
## - hp 1 488.33 2832.9
## - charSemicolon 1 488.57 2835.1
## - over 1 488.64 2835.7
## - re 1 489.56 2844.4
## - edu 1 489.62 2845.0
## - internet 1 490.09 2849.4
## - money 1 490.89 2856.9
## - george 1 491.29 2860.6
## - capitalTotal 1 491.59 2863.5
## - font 1 491.61 2863.6
## - charDollar 1 497.08 2914.6
## - charExclamation 1 497.17 2915.4
## - our 1 497.47 2918.1
## - your 1 497.56 2919.0
## - num000 1 498.89 2931.3
## - free 1 500.48 2946.0
## - remove 1 512.48 3054.9
##
## Step: AIC=2795.07
## type ~ make + address + all + num3d + our + over + remove + internet +
## order + mail + receive + will + free + business + email +
## you + credit + your + font + num000 + money + hp + hpl +
## george + labs + data + num415 + num85 + num1999 + parts +
## pm + direct + meeting + original + project + re + edu + table +
## conference + charSemicolon + charRoundbracket + charExclamation +
## charDollar + charHash + capitalLong + capitalTotal
##
## Df Deviance AIC
## <none> 484.34 2795.1
## - direct 1 484.58 2795.4
## - pm 1 484.60 2795.6
## - num415 1 484.72 2796.7
## - mail 1 484.82 2797.7
## - receive 1 484.83 2797.8
## - charHash 1 484.92 2798.6
## - parts 1 484.92 2798.7
## - num85 1 485.10 2800.3
## - num1999 1 485.11 2800.4
## - original 1 485.20 2801.3
## - make 1 485.26 2801.9
## - capitalLong 1 485.27 2802.0
## - table 1 485.33 2802.5
## - charRoundbracket 1 485.37 2802.9
## - address 1 485.44 2803.6
## - hpl 1 485.48 2804.0
## - labs 1 485.50 2804.1
## - conference 1 485.56 2804.7
## - num3d 1 485.59 2804.9
## - order 1 486.02 2809.0
## - all 1 486.11 2809.9
## - project 1 486.27 2811.4
## - business 1 486.41 2812.7
## - you 1 486.51 2813.7
## - will 1 486.77 2816.1
## - data 1 486.92 2817.6
## - email 1 488.26 2830.2
## - meeting 1 488.29 2830.5
## - credit 1 488.48 2832.3
## - hp 1 488.53 2832.7
## - charSemicolon 1 488.77 2835.0
## - over 1 488.85 2835.7
## - re 1 489.76 2844.3
## - edu 1 489.80 2844.7
## - internet 1 490.33 2849.6
## - money 1 491.10 2856.8
## - george 1 491.42 2859.9
## - capitalTotal 1 491.77 2863.2
## - font 1 491.82 2863.6
## - charDollar 1 497.34 2914.9
## - charExclamation 1 497.40 2915.6
## - our 1 497.69 2918.2
## - your 1 497.78 2919.0
## - num000 1 499.02 2930.5
## - free 1 500.73 2946.2
## - remove 1 512.71 3055.1
Comparamos el AIC para ambos modelos:
base.model$aic
## [1] 2808.67
model$aic
## [1] 2795.074
Vemos un resumen del mejor modelo:
summary(model)
##
## Call:
## glm(formula = type ~ make + address + all + num3d + our + over +
## remove + internet + order + mail + receive + will + free +
## business + email + you + credit + your + font + num000 +
## money + hp + hpl + george + labs + data + num415 + num85 +
## num1999 + parts + pm + direct + meeting + original + project +
## re + edu + table + conference + charSemicolon + charRoundbracket +
## charExclamation + charDollar + charHash + capitalLong + capitalTotal,
## data = spam)
##
## Deviance Residuals:
## Min 1Q Median 3Q Max
## -2.26675 -0.21606 -0.05663 0.21404 0.94727
##
## Coefficients:
## Estimate Std. Error t value Pr(>|t|)
## (Intercept) 2.029e-01 1.123e-02 18.072 < 2e-16 ***
## make -4.938e-02 1.673e-02 -2.952 0.003175 **
## address -1.220e-02 3.783e-03 -3.225 0.001267 **
## all 4.087e-02 1.001e-02 4.085 4.49e-05 ***
## num3d 1.185e-02 3.459e-03 3.427 0.000615 ***
## our 8.470e-02 7.559e-03 11.204 < 2e-16 ***
## over 1.193e-01 1.832e-02 6.512 8.21e-11 ***
## remove 2.126e-01 1.301e-02 16.335 < 2e-16 ***
## internet 9.423e-02 1.256e-02 7.504 7.39e-14 ***
## order 7.416e-02 1.865e-02 3.976 7.13e-05 ***
## mail 1.649e-02 7.703e-03 2.141 0.032339 *
## receive 5.650e-02 2.620e-02 2.156 0.031121 *
## will -2.798e-02 5.849e-03 -4.784 1.77e-06 ***
## free 7.467e-02 6.015e-03 12.414 < 2e-16 ***
## business 5.222e-02 1.182e-02 4.416 1.03e-05 ***
## email 5.731e-02 9.433e-03 6.076 1.33e-09 ***
## you 1.390e-02 3.076e-03 4.520 6.35e-06 ***
## credit 6.131e-02 9.822e-03 6.243 4.70e-10 ***
## your 5.224e-02 4.647e-03 11.242 < 2e-16 ***
## font 4.474e-02 5.335e-03 8.386 < 2e-16 ***
## num000 1.774e-01 1.509e-02 11.752 < 2e-16 ***
## money 9.100e-02 1.141e-02 7.972 1.96e-15 ***
## hp -2.288e-02 3.644e-03 -6.280 3.71e-10 ***
## hpl -2.194e-02 6.678e-03 -3.286 0.001025 **
## george -1.228e-02 1.504e-03 -8.163 4.20e-16 ***
## labs -5.133e-02 1.552e-02 -3.307 0.000952 ***
## data -4.333e-02 8.791e-03 -4.929 8.56e-07 ***
## num415 5.777e-02 3.046e-02 1.897 0.057934 .
## num85 -3.083e-02 1.152e-02 -2.677 0.007448 **
## num1999 -3.410e-02 1.266e-02 -2.693 0.007100 **
## parts -5.257e-02 2.235e-02 -2.352 0.018702 *
## pm -1.838e-02 1.168e-02 -1.574 0.115614
## direct 4.009e-02 2.625e-02 1.527 0.126778
## meeting -3.996e-02 6.556e-03 -6.095 1.18e-09 ***
## original -6.636e-02 2.330e-02 -2.848 0.004419 **
## project -3.330e-02 7.801e-03 -4.269 2.01e-05 ***
## re -3.536e-02 4.949e-03 -7.144 1.05e-12 ***
## edu -3.900e-02 5.441e-03 -7.169 8.76e-13 ***
## table -1.940e-01 6.332e-02 -3.063 0.002203 **
## conference -5.756e-02 1.695e-02 -3.395 0.000691 ***
## charSemicolon -1.420e-01 2.199e-02 -6.459 1.16e-10 ***
## charRoundbracket -6.633e-02 2.123e-02 -3.124 0.001796 **
## charExclamation 6.793e-02 6.128e-03 11.084 < 2e-16 ***
## charDollar 2.364e-01 2.138e-02 11.056 < 2e-16 ***
## charHash 2.708e-02 1.160e-02 2.334 0.019628 *
## capitalLong 9.310e-05 3.139e-05 2.966 0.003028 **
## capitalTotal 7.929e-05 9.482e-06 8.362 < 2e-16 ***
## ---
## Signif. codes: 0 '***' 0.001 '**' 0.01 '*' 0.05 '.' 0.1 ' ' 1
##
## (Dispersion parameter for gaussian family taken to be 0.1063541)
##
## Null deviance: 1098.60 on 4600 degrees of freedom
## Residual deviance: 484.34 on 4554 degrees of freedom
## AIC: 2795.1
##
## Number of Fisher Scoring iterations: 2
probabilities <- predict(model,
newdata = testing,
type = "response")
predictions <- ifelse(probabilities > 0.5, "1", "0")
trn_tab <- table(predictions, testing$type)
library("caret")
## Loading required package: ggplot2
##
## Attaching package: 'ggplot2'
## The following object is masked from 'package:kernlab':
##
## alpha
## Loading required package: lattice
confusionMatrix(trn_tab, positive = "1")
## Confusion Matrix and Statistics
##
##
## predictions 0 1
## 0 795 102
## 1 25 459
##
## Accuracy : 0.908
## 95% CI : (0.8916, 0.9228)
## No Information Rate : 0.5938
## P-Value [Acc > NIR] : < 2.2e-16
##
## Kappa : 0.8051
##
## Mcnemar's Test P-Value : 1.542e-11
##
## Sensitivity : 0.8182
## Specificity : 0.9695
## Pos Pred Value : 0.9483
## Neg Pred Value : 0.8863
## Prevalence : 0.4062
## Detection Rate : 0.3324
## Detection Prevalence : 0.3505
## Balanced Accuracy : 0.8938
##
## 'Positive' Class : 1
##
exp(cbind(OR = coef(model), confint(model)))
## Waiting for profiling to be done...
## OR 2.5 % 97.5 %
## (Intercept) 1.2248987 1.1982441 1.2521461
## make 0.9518215 0.9211212 0.9835449
## address 0.9878716 0.9805735 0.9952239
## all 1.0417188 1.0214874 1.0623510
## num3d 1.0119240 1.0050877 1.0188067
## our 1.0883867 1.0723803 1.1046321
## over 1.1266943 1.0869612 1.1678798
## remove 1.2368450 1.2056982 1.2687963
## internet 1.0988106 1.0720978 1.1261889
## order 1.0769809 1.0383153 1.1170864
## mail 1.0166267 1.0013943 1.0320908
## receive 1.0581279 1.0051553 1.1138923
## will 0.9724054 0.9613214 0.9836171
## free 1.0775245 1.0648962 1.0903026
## business 1.0536100 1.0294729 1.0783130
## email 1.0589861 1.0395879 1.0787462
## you 1.0140010 1.0079055 1.0201334
## credit 1.0632294 1.0429581 1.0838946
## your 1.0536292 1.0440768 1.0632691
## font 1.0457515 1.0348746 1.0567428
## num000 1.1940566 1.1592545 1.2299035
## money 1.0952696 1.0710374 1.1200501
## hp 0.9773763 0.9704203 0.9843821
## hpl 0.9782971 0.9655762 0.9911855
## george 0.9877987 0.9848913 0.9907147
## labs 0.9499646 0.9214957 0.9793130
## data 0.9575948 0.9412366 0.9742373
## num415 1.0594753 0.9980740 1.1246539
## num85 0.9696375 0.9479964 0.9917726
## num1999 0.9664782 0.9427927 0.9907589
## parts 0.9487854 0.9081212 0.9912705
## pm 0.9817874 0.9595683 1.0045209
## direct 1.0409050 0.9887042 1.0958618
## meeting 0.9608281 0.9485610 0.9732539
## original 0.9357967 0.8940237 0.9795214
## project 0.9672463 0.9525693 0.9821495
## re 0.9652602 0.9559420 0.9746693
## edu 0.9617466 0.9515457 0.9720569
## table 0.8236867 0.7275493 0.9325276
## conference 0.9440678 0.9132163 0.9759615
## charSemicolon 0.8675872 0.8309900 0.9057962
## charRoundbracket 0.9358232 0.8976767 0.9755906
## charExclamation 1.0702860 1.0575078 1.0832186
## charDollar 1.2666443 1.2146626 1.3208506
## charHash 1.0274514 1.0043514 1.0510827
## capitalLong 1.0000931 1.0000316 1.0001546
## capitalTotal 1.0000793 1.0000607 1.0000979
N=4000
read.training = file("train-images.idx3-ubyte", "rb")
readBin(read.training,'integer',n=1,size=4,endian='big')
## [1] 2051
nrow = readBin(read.training,'integer',n=1,size=4,endian='big')
ncol = readBin(read.training,'integer',n=1,size=4,endian='big')
x = data.frame()
for(i in 1:N){m = readBin(read.training,'integer',n=28*28,size=1,signed=F);
x <- rbind(x, m)}
close(read.training)
read.labels = file("train-labels.idx1-ubyte", "rb")
readBin(read.labels,'integer',n=1,size=4,endian='big')
## [1] 2049
n = readBin(read.labels,'integer',n=1,size=4,endian='big')
y = readBin(read.labels,'integer',n=n,size=1,signed=F)
close(read.labels)
y=data.frame(sapply(as.numeric(y[1:N]), c))
df =x
df['Y'] <- y
head(df)
## X0L X0L.1 X0L.2 X28L X0L.3 X0L.4 X0L.5 X0L.6 X0L.7 X0L.8 X0L.9 X0L.10 X0L.11
## 1 0 0 0 28 0 0 0 0 0 0 0 0 0
## 2 0 0 0 0 0 0 0 0 0 0 0 0 0
## 3 0 0 0 0 0 0 0 0 0 0 0 0 0
## 4 0 0 0 0 0 0 0 0 0 0 0 0 0
## 5 0 0 0 0 0 0 0 0 0 0 0 0 0
## 6 0 0 0 0 0 0 0 0 0 0 0 0 0
## X0L.12 X0L.13 X0L.14 X0L.15 X0L.16 X0L.17 X0L.18 X0L.19 X0L.20 X0L.21 X0L.22
## 1 0 0 0 0 0 0 0 0 0 0 0
## 2 0 0 0 0 0 0 0 0 0 0 0
## 3 0 0 0 0 0 0 0 0 0 0 0
## 4 0 0 0 0 0 0 0 0 0 0 0
## 5 0 0 0 0 0 0 0 0 0 0 0
## 6 0 0 0 0 0 0 0 0 0 0 0
## X0L.23 X0L.24 X0L.25 X0L.26 X0L.27 X0L.28 X0L.29 X0L.30 X0L.31 X0L.32 X0L.33
## 1 0 0 0 0 0 0 0 0 0 0 0
## 2 0 0 0 0 0 0 0 0 0 0 0
## 3 0 0 0 0 0 0 0 0 0 0 0
## 4 0 0 0 0 0 0 0 0 0 0 0
## 5 0 0 0 0 0 0 0 0 0 0 0
## 6 0 0 0 0 0 0 0 0 0 0 0
## X0L.34 X0L.35 X0L.36 X0L.37 X0L.38 X0L.39 X0L.40 X0L.41 X0L.42 X0L.43 X0L.44
## 1 0 0 0 0 0 0 0 0 0 0 0
## 2 0 0 0 0 0 0 0 0 0 0 0
## 3 0 0 0 0 0 0 0 0 0 0 0
## 4 0 0 0 0 0 0 0 0 0 0 0
## 5 0 0 0 0 0 0 0 0 0 0 0
## 6 0 0 0 0 0 0 0 0 0 0 0
## X0L.45 X0L.46 X0L.47 X0L.48 X0L.49 X0L.50 X0L.51 X0L.52 X0L.53 X0L.54 X0L.55
## 1 0 0 0 0 0 0 0 0 0 0 0
## 2 0 0 0 0 0 0 0 0 0 0 0
## 3 0 0 0 0 0 0 0 0 0 0 0
## 4 0 0 0 0 0 0 0 0 0 0 0
## 5 0 0 0 0 0 0 0 0 0 0 0
## 6 0 0 0 0 0 0 0 0 0 0 0
## X0L.56 X0L.57 X0L.58 X0L.59 X0L.60 X0L.61 X0L.62 X0L.63 X0L.64 X0L.65 X0L.66
## 1 0 0 0 0 0 0 0 0 0 0 0
## 2 0 0 0 0 0 0 0 0 0 0 0
## 3 0 0 0 0 0 0 0 0 0 0 0
## 4 0 0 0 0 0 0 0 0 0 0 0
## 5 0 0 0 0 0 0 0 0 0 0 0
## 6 0 0 0 0 0 0 0 0 0 0 0
## X0L.67 X0L.68 X0L.69 X0L.70 X0L.71 X0L.72 X0L.73 X0L.74 X0L.75 X0L.76 X0L.77
## 1 0 0 0 0 0 0 0 0 0 0 0
## 2 0 0 0 0 0 0 0 0 0 0 0
## 3 0 0 0 0 0 0 0 0 0 0 0
## 4 0 0 0 0 0 0 0 0 0 0 0
## 5 0 0 0 0 0 0 0 0 0 0 0
## 6 0 0 0 0 0 0 0 0 0 0 0
## X0L.78 X0L.79 X0L.80 X0L.81 X0L.82 X0L.83 X0L.84 X0L.85 X0L.86 X0L.87 X0L.88
## 1 0 0 0 0 0 0 0 0 0 0 0
## 2 0 0 0 0 0 0 0 0 0 0 0
## 3 0 0 0 0 0 0 0 0 0 0 0
## 4 0 0 0 0 0 0 0 0 0 0 0
## 5 0 0 0 0 0 0 0 0 0 0 0
## 6 0 0 0 0 0 0 0 0 0 0 0
## X0L.89 X0L.90 X0L.91 X0L.92 X0L.93 X0L.94 X0L.95 X0L.96 X0L.97 X0L.98 X0L.99
## 1 0 0 0 0 0 0 0 0 0 0 0
## 2 0 0 0 0 0 0 0 0 0 0 0
## 3 0 0 0 0 0 0 0 0 0 0 0
## 4 0 0 0 0 0 0 0 0 0 0 0
## 5 0 0 0 0 0 0 0 0 0 0 0
## 6 0 0 0 0 0 0 0 0 0 0 0
## X0L.100 X0L.101 X0L.102 X0L.103 X0L.104 X0L.105 X0L.106 X0L.107 X0L.108
## 1 0 0 0 0 0 0 0 0 0
## 2 0 0 0 0 0 0 0 0 0
## 3 0 0 0 0 0 0 0 0 0
## 4 0 0 0 0 0 0 0 0 0
## 5 0 0 0 0 0 0 0 0 0
## 6 0 0 0 0 0 0 0 0 0
## X0L.109 X0L.110 X0L.111 X0L.112 X0L.113 X0L.114 X0L.115 X0L.116 X0L.117
## 1 0 0 0 0 0 0 0 0 0
## 2 0 0 0 0 0 0 0 0 0
## 3 0 0 0 0 0 0 0 0 0
## 4 0 0 0 0 0 0 0 0 0
## 5 0 0 0 0 0 0 0 0 0
## 6 0 0 0 0 0 0 0 0 0
## X0L.118 X0L.119 X0L.120 X0L.121 X0L.122 X0L.123 X0L.124 X0L.125 X0L.126
## 1 0 0 0 0 0 0 0 0 0
## 2 0 0 0 0 0 0 0 0 0
## 3 0 0 0 0 0 0 0 0 0
## 4 0 0 0 0 0 0 0 0 0
## 5 0 0 0 0 0 0 0 0 0
## 6 0 0 0 0 0 0 0 0 0
## X0L.127 X0L.128 X0L.129 X0L.130 X0L.131 X0L.132 X0L.133 X0L.134 X0L.135
## 1 0 0 0 0 0 0 0 0 0
## 2 0 0 0 51 159 253 159 50 0
## 3 0 0 0 0 0 0 0 0 0
## 4 0 0 0 0 0 0 0 0 0
## 5 0 0 0 0 0 0 0 0 0
## 6 0 0 0 0 0 0 0 0 0
## X0L.136 X0L.137 X0L.138 X0L.139 X0L.140 X0L.141 X0L.142 X0L.143 X0L.144
## 1 0 0 0 0 0 0 0 0 0
## 2 0 0 0 0 0 0 0 0 0
## 3 0 0 0 0 0 0 0 0 0
## 4 0 0 0 0 0 0 0 0 0
## 5 0 0 0 0 0 0 0 0 0
## 6 0 0 0 0 0 0 0 0 0
## X0L.145 X0L.146 X0L.147 X0L.148 X0L.149 X0L.150 X0L.151 X0L.152 X0L.153
## 1 0 0 0 0 0 0 0 0 0
## 2 0 0 0 0 0 0 0 0 0
## 3 0 0 0 0 0 0 0 0 0
## 4 0 0 0 0 0 0 0 0 0
## 5 0 0 0 0 0 0 0 0 0
## 6 0 0 0 0 0 0 0 0 0
## X0L.154 X3L X18L X18L.1 X18L.2 X126L X136L X175L X26L X166L X255L X247L X127L
## 1 0 3 18 18 18 126 136 175 26 166 255 247 127
## 2 0 0 0 48 238 252 252 252 237 0 0 0 0
## 3 0 0 0 0 0 0 0 0 0 67 232 39 0
## 4 0 0 0 0 0 0 0 124 253 255 63 0 0
## 5 0 0 0 0 0 0 0 0 0 0 0 0 0
## 6 0 0 0 0 13 25 100 122 7 0 0 0 0
## X0L.155 X0L.156 X0L.157 X0L.158 X0L.159 X0L.160 X0L.161 X0L.162 X0L.163
## 1 0 0 0 0 0 0 0 0 0
## 2 0 0 0 0 0 0 0 0 0
## 3 0 0 0 0 0 0 0 0 62
## 4 0 0 0 0 0 0 0 0 0
## 5 0 0 0 0 0 0 0 0 0
## 6 0 0 0 0 0 0 0 0 0
## X0L.164 X0L.165 X0L.166 X30L X36L X94L X154L X170L X253L X253L.1 X253L.2
## 1 0 0 0 30 36 94 154 170 253 253 253
## 2 0 0 0 0 0 0 0 0 54 227 253
## 3 81 0 0 0 0 0 0 0 0 0 0
## 4 0 0 0 0 0 0 0 0 0 0 0
## 5 0 0 0 0 0 0 0 0 0 0 0
## 6 0 0 0 0 0 0 0 0 33 151 208
## X253L.3 X253L.4 X225L X172L X253L.5 X242L X195L X64L X0L.167 X0L.168 X0L.169
## 1 253 253 225 172 253 242 195 64 0 0 0
## 2 252 239 233 252 57 6 0 0 0 0 0
## 3 0 0 0 0 120 180 39 0 0 0 0
## 4 0 96 244 251 253 62 0 0 0 0 0
## 5 0 0 0 0 0 0 0 0 0 0 0
## 6 252 252 252 146 0 0 0 0 0 0 0
## X0L.170 X0L.171 X0L.172 X0L.173 X0L.174 X0L.175 X0L.176 X0L.177 X49L X238L
## 1 0 0 0 0 0 0 0 0 49 238
## 2 0 0 0 0 0 0 0 0 0 0
## 3 0 0 0 0 0 126 163 0 0 0
## 4 0 0 0 0 0 0 0 0 0 0
## 5 0 0 0 0 0 0 0 0 0 0
## 6 0 0 0 0 0 0 0 0 0 0
## X253L.6 X253L.7 X253L.8 X253L.9 X253L.10 X253L.11 X253L.12 X253L.13 X251L
## 1 253 253 253 253 253 253 253 253 251
## 2 0 0 10 60 224 252 253 252 202
## 3 0 0 0 0 0 0 0 0 0
## 4 0 0 0 0 0 0 0 0 127
## 5 0 0 0 55 148 210 253 253 113
## 6 0 0 40 152 244 252 253 224 211
## X93L X82L X82L.1 X56L X39L X0L.178 X0L.179 X0L.180 X0L.181 X0L.182 X0L.183
## 1 93 82 82 56 39 0 0 0 0 0 0
## 2 84 252 253 122 0 0 0 0 0 0 0
## 3 0 2 153 210 40 0 0 0 0 0 0
## 4 251 251 253 62 0 0 0 0 0 0 0
## 5 87 148 55 0 0 0 0 0 0 0 0
## 6 252 232 40 0 0 0 0 0 0 0 0
## X0L.184 X0L.185 X0L.186 X0L.187 X0L.188 X0L.189 X18L.3 X219L X253L.14
## 1 0 0 0 0 0 0 18 219 253
## 2 0 0 0 0 0 0 0 0 0
## 3 0 0 0 220 163 0 0 0 0
## 4 0 0 0 0 0 0 0 0 0
## 5 0 0 0 0 0 0 0 0 0
## 6 0 0 0 0 0 0 0 0 15
## X253L.15 X253L.16 X253L.17 X253L.18 X198L X182L X247L.1 X241L X0L.190 X0L.191
## 1 253 253 253 253 198 182 247 241 0 0
## 2 0 163 252 252 252 253 252 252 96 189
## 3 0 0 0 0 0 0 0 0 0 27
## 4 0 0 0 0 0 0 68 236 251 211
## 5 0 87 232 252 253 189 210 252 252 253
## 6 152 239 252 252 252 216 31 37 252 252
## X0L.192 X0L.193 X0L.194 X0L.195 X0L.196 X0L.197 X0L.198 X0L.199 X0L.200
## 1 0 0 0 0 0 0 0 0 0
## 2 253 167 0 0 0 0 0 0 0
## 3 254 162 0 0 0 0 0 0 0
## 4 31 8 0 0 0 0 0 0 0
## 5 168 0 0 0 0 0 0 0 0
## 6 60 0 0 0 0 0 0 0 0
## X0L.201 X0L.202 X0L.203 X0L.204 X0L.205 X0L.206 X0L.207 X80L X156L X107L
## 1 0 0 0 0 0 0 0 80 156 107
## 2 0 0 0 0 0 0 0 0 0 51
## 3 0 0 0 222 163 0 0 0 0 0
## 4 0 0 0 0 0 0 0 0 0 0
## 5 0 0 0 0 0 0 0 0 4 57
## 6 0 0 0 0 0 0 0 0 96 252
## X253L.19 X253L.20 X205L X11L X0L.208 X43L X154L.1 X0L.209 X0L.210 X0L.211
## 1 253 253 205 11 0 43 154 0 0 0
## 2 238 253 253 190 114 253 228 47 79 255
## 3 0 0 0 0 0 0 0 0 183 254
## 4 0 0 0 0 60 228 251 251 94 0
## 5 242 252 190 65 5 12 182 252 253 116
## 6 252 252 252 217 29 0 37 252 252 60
## X0L.212 X0L.213 X0L.214 X0L.215 X0L.216 X0L.217 X0L.218 X0L.219 X0L.220
## 1 0 0 0 0 0 0 0 0 0
## 2 168 0 0 0 0 0 0 0 0
## 3 125 0 0 0 0 0 0 0 0
## 4 0 0 0 0 0 0 0 0 0
## 5 0 0 0 0 0 0 0 0 0
## 6 0 0 0 0 0 0 0 0 0
## X0L.221 X0L.222 X0L.223 X0L.224 X0L.225 X0L.226 X0L.227 X14L X1L X154L.2
## 1 0 0 0 0 0 0 0 14 1 154
## 2 0 0 0 0 0 0 0 48 238 252
## 3 0 46 245 163 0 0 0 0 0 0
## 4 0 0 0 0 0 0 0 0 0 0
## 5 0 0 0 0 0 0 0 96 252 252
## 6 0 0 0 0 0 0 0 181 252 252
## X253L.21 X90L X0L.228 X0L.229 X0L.230 X0L.231 X0L.232 X0L.233 X0L.234 X0L.235
## 1 253 90 0 0 0 0 0 0 0 0
## 2 252 179 12 75 121 21 0 0 253 243
## 3 0 0 0 0 0 0 0 198 254 56
## 4 0 0 0 155 253 253 189 0 0 0
## 5 183 14 0 0 92 252 252 225 21 0
## 6 220 167 30 0 0 77 252 252 60 0
## X0L.236 X0L.237 X0L.238 X0L.239 X0L.240 X0L.241 X0L.242 X0L.243 X0L.244
## 1 0 0 0 0 0 0 0 0 0
## 2 50 0 0 0 0 0 0 0 0
## 3 0 0 0 0 0 0 0 0 0
## 4 0 0 0 0 0 0 0 0 0
## 5 0 0 0 0 0 0 0 0 0
## 6 0 0 0 0 0 0 0 0 0
## X0L.245 X0L.246 X0L.247 X0L.248 X0L.249 X0L.250 X0L.251 X0L.252 X139L
## 1 0 0 0 0 0 0 0 0 139
## 2 0 0 0 0 0 38 165 253 233
## 3 120 254 163 0 0 0 0 0 0
## 4 0 0 0 0 0 0 0 0 0
## 5 0 0 0 0 0 132 253 252 146
## 6 0 0 0 0 0 0 26 128 58
## X253L.22 X190L X2L X0L.253 X0L.254 X0L.255 X0L.256 X0L.257 X0L.258 X0L.259
## 1 253 190 2 0 0 0 0 0 0 0
## 2 208 84 0 0 0 0 0 0 253 252
## 3 0 0 0 0 0 0 23 231 254 29
## 4 0 0 20 253 251 235 66 0 0 0
## 5 14 0 0 0 215 252 252 79 0 0
## 6 22 0 0 0 0 100 252 252 60 0
## X0L.260 X0L.261 X0L.262 X0L.263 X0L.264 X0L.265 X0L.266 X0L.267 X0L.268
## 1 0 0 0 0 0 0 0 0 0
## 2 165 0 0 0 0 0 0 0 0
## 3 0 0 0 0 0 0 0 0 0
## 4 0 0 0 0 0 0 0 0 0
## 5 0 0 0 0 0 0 0 0 0
## 6 0 0 0 0 0 0 0 0 0
## X0L.269 X0L.270 X0L.271 X0L.272 X0L.273 X0L.274 X0L.275 X0L.276 X11L.1
## 1 0 0 0 0 0 0 0 0 11
## 2 0 0 0 0 7 178 252 240 71
## 3 159 254 120 0 0 0 0 0 0
## 4 0 0 0 0 0 0 0 0 0
## 5 0 0 0 0 126 253 247 176 9
## 6 0 0 0 0 0 0 0 0 0
## X190L.1 X253L.23 X70L X0L.277 X0L.278 X0L.279 X0L.280 X0L.281 X0L.282 X0L.283
## 1 190 253 70 0 0 0 0 0 0 0
## 2 19 28 0 0 0 0 0 0 253 252
## 3 0 0 0 0 0 0 163 254 216 16
## 4 0 32 205 253 251 126 0 0 0 0
## 5 0 0 8 78 245 253 129 0 0 0
## 6 0 0 0 0 0 157 252 252 60 0
## X0L.284 X0L.285 X0L.286 X0L.287 X0L.288 X0L.289 X0L.290 X0L.291 X0L.292
## 1 0 0 0 0 0 0 0 0 0
## 2 195 0 0 0 0 0 0 0 0
## 3 0 0 0 0 0 0 0 0 0
## 4 0 0 0 0 0 0 0 0 0
## 5 0 0 0 0 0 0 0 0 0
## 6 0 0 0 0 0 0 0 0 0
## X0L.293 X0L.294 X0L.295 X0L.296 X0L.297 X0L.298 X0L.299 X0L.300 X0L.301 X35L
## 1 0 0 0 0 0 0 0 0 0 35
## 2 0 0 0 0 57 252 252 63 0 0
## 3 159 254 67 0 0 0 0 0 0 0
## 4 0 0 0 0 0 0 0 0 0 0
## 5 0 0 0 16 232 252 176 0 0 0
## 6 0 0 0 0 0 0 0 0 0 0
## X241L.1 X225L.1 X160L X108L X1L.1 X0L.302 X0L.303 X0L.304 X0L.305 X0L.306
## 1 241 225 160 108 1 0 0 0 0 0
## 2 0 0 0 0 0 0 0 253 252 195
## 3 0 0 14 86 178 248 254 91 0 0
## 4 104 251 253 184 15 0 0 0 0 0
## 5 36 201 252 252 169 11 0 0 0 0
## 6 110 121 122 121 202 252 194 3 0 0
## X0L.307 X0L.308 X0L.309 X0L.310 X0L.311 X0L.312 X0L.313 X0L.314 X0L.315
## 1 0 0 0 0 0 0 0 0 0
## 2 0 0 0 0 0 0 0 0 0
## 3 0 0 0 0 0 0 0 0 159
## 4 0 0 0 0 0 0 0 0 0
## 5 0 0 0 0 0 0 0 0 0
## 6 0 0 0 0 0 0 0 0 0
## X0L.316 X0L.317 X0L.318 X0L.319 X0L.320 X0L.321 X0L.322 X0L.323 X0L.324 X81L
## 1 0 0 0 0 0 0 0 0 0 81
## 2 0 0 0 198 253 190 0 0 0 0
## 3 254 85 0 0 0 47 49 116 144 150
## 4 0 0 0 0 0 0 0 0 80 240
## 5 0 0 22 252 252 30 22 119 197 241
## 6 0 0 0 0 0 0 10 53 179 253
## X240L X253L.24 X253L.25 X119L X25L X0L.325 X0L.326 X0L.327 X0L.328 X0L.329
## 1 240 253 253 119 25 0 0 0 0 0
## 2 0 0 0 0 0 0 255 253 196 0
## 3 241 243 234 179 241 252 40 0 0 0
## 4 251 193 23 0 0 0 0 0 0 0
## 5 253 252 251 77 0 0 0 0 0 0
## 6 253 255 253 253 228 35 0 0 0 0
## X0L.330 X0L.331 X0L.332 X0L.333 X0L.334 X0L.335 X0L.336 X0L.337 X0L.338
## 1 0 0 0 0 0 0 0 0 0
## 2 0 0 0 0 0 0 0 0 0
## 3 0 0 0 0 0 0 0 150 253
## 4 0 0 0 0 0 0 0 0 0
## 5 0 0 0 0 0 0 0 0 0
## 6 0 0 0 0 0 0 0 0 0
## X0L.339 X0L.340 X0L.341 X0L.342 X0L.343 X0L.344 X0L.345 X0L.346 X0L.347 X45L
## 1 0 0 0 0 0 0 0 0 0 45
## 2 0 76 246 252 112 0 0 0 0 0
## 3 237 207 207 207 253 254 250 240 198 143
## 4 0 0 0 0 0 0 32 253 253 253
## 5 0 16 231 252 253 252 252 252 226 227
## 6 0 0 0 5 54 227 252 243 228 170
## X186L X253L.26 X253L.27 X150L X27L X0L.348 X0L.349 X0L.350 X0L.351 X0L.352
## 1 186 253 253 150 27 0 0 0 0 0
## 2 0 0 0 0 0 253 252 148 0 0
## 3 91 28 5 233 250 0 0 0 0 0
## 4 159 0 0 0 0 0 0 0 0 0
## 5 252 231 0 0 0 0 0 0 0 0
## 6 242 252 252 231 117 6 0 0 0 0
## X0L.353 X0L.354 X0L.355 X0L.356 X0L.357 X0L.358 X0L.359 X0L.360 X0L.361
## 1 0 0 0 0 0 0 0 0 0
## 2 0 0 0 0 0 0 0 0 0
## 3 0 0 0 0 0 0 0 119 177
## 4 0 0 0 0 0 0 0 0 0
## 5 0 0 0 0 0 0 0 0 0
## 6 0 0 0 0 0 0 0 0 0
## X0L.362 X0L.363 X0L.364 X0L.365 X0L.366 X0L.367 X0L.368 X0L.369 X0L.370 X16L
## 1 0 0 0 0 0 0 0 0 0 16
## 2 85 252 230 25 0 0 0 0 0 0
## 3 177 177 177 177 98 56 0 0 0 0
## 4 0 0 0 0 0 151 251 251 251 39
## 5 0 55 235 253 217 138 42 24 192 252
## 6 0 6 78 252 252 125 59 0 18 208
## X93L.1 X252L X253L.28 X187L X0L.371 X0L.372 X0L.373 X0L.374 X0L.375 X0L.376
## 1 93 252 253 187 0 0 0 0 0 0
## 2 0 0 7 135 253 186 12 0 0 0
## 3 0 102 254 220 0 0 0 0 0 0
## 4 0 0 0 0 0 0 0 0 0 0
## 5 143 0 0 0 0 0 0 0 0 0
## 6 252 252 252 252 87 7 0 0 0 0
## X0L.377 X0L.378 X0L.379 X0L.380 X0L.381 X0L.382 X0L.383 X0L.384 X0L.385
## 1 0 0 0 0 0 0 0 0 0
## 2 0 0 0 0 0 0 0 0 85
## 3 0 0 0 0 0 0 0 0 0
## 4 0 0 0 0 0 0 0 0 0
## 5 0 0 0 0 0 0 0 0 0
## 6 0 0 0 0 0 0 0 0 5
## X0L.386 X0L.387 X0L.388 X0L.389 X0L.390 X0L.391 X0L.392 X0L.393 X0L.394
## 1 0 0 0 0 0 0 0 0 0
## 2 252 223 0 0 0 0 0 0 0
## 3 0 0 0 0 0 0 0 0 0
## 4 0 0 0 48 221 251 251 172 0
## 5 0 0 0 0 0 0 62 255 253
## 6 135 252 252 180 16 0 21 203 253
## X0L.395 X249L X253L.29 X249L.1 X64L.1 X0L.396 X0L.397 X0L.398 X0L.399 X0L.400
## 1 0 249 253 249 64 0 0 0 0 0
## 2 0 7 131 252 225 71 0 0 0 0
## 3 0 169 254 137 0 0 0 0 0 0
## 4 0 0 0 0 0 0 0 0 0 0
## 5 109 0 0 0 0 0 0 0 0 0
## 6 247 129 173 252 252 184 66 49 49 0
## X0L.401 X0L.402 X0L.403 X0L.404 X0L.405 X0L.406 X0L.407 X0L.408 X0L.409
## 1 0 0 0 0 0 0 0 0 0
## 2 0 0 0 0 0 0 0 0 85
## 3 0 0 0 0 0 0 0 0 0
## 4 0 0 0 0 0 0 0 0 0
## 5 0 0 0 0 0 0 0 0 0
## 6 0 0 0 0 0 0 0 3 136
## X0L.410 X0L.411 X0L.412 X0L.413 X0L.414 X0L.415 X0L.416 X46L X130L X183L
## 1 0 0 0 0 0 0 0 46 130 183
## 2 252 145 0 0 0 0 0 0 0 48
## 3 0 0 0 0 0 0 0 0 0 0
## 4 0 0 0 234 251 251 196 12 0 0
## 5 0 0 0 0 0 0 71 253 252 21
## 6 252 241 106 17 0 53 200 252 216 65
## X253L.30 X253L.31 X207L X2L.1 X0L.417 X0L.418 X0L.419 X0L.420 X0L.421 X0L.422
## 1 253 253 207 2 0 0 0 0 0 0
## 2 165 252 173 0 0 0 0 0 0 0
## 3 169 254 57 0 0 0 0 0 0 0
## 4 0 0 0 0 0 0 0 0 0 0
## 5 0 0 0 0 0 0 0 0 0 0
## 6 0 14 72 163 241 252 252 223 0 0
## X0L.423 X0L.424 X0L.425 X0L.426 X0L.427 X0L.428 X0L.429 X0L.430 X0L.431
## 1 0 0 0 0 0 0 0 0 0
## 2 0 0 0 0 0 0 0 86 253
## 3 0 0 0 0 0 0 0 0 0
## 4 0 0 0 0 0 0 0 0 0
## 5 0 0 0 0 0 0 0 0 0
## 6 0 0 0 0 0 0 105 252 242
## X0L.432 X0L.433 X0L.434 X0L.435 X39L.1 X148L X229L X253L.32 X253L.33 X253L.34
## 1 0 0 0 0 39 148 229 253 253 253
## 2 225 0 0 0 0 0 0 114 238 253
## 3 0 0 0 0 0 0 0 0 0 169
## 4 0 0 253 251 251 89 0 0 0 0
## 5 0 0 0 0 0 0 253 252 21 0
## 6 88 18 73 170 244 252 126 29 0 0
## X250L X182L.1 X0L.436 X0L.437 X0L.438 X0L.439 X0L.440 X0L.441 X0L.442 X0L.443
## 1 250 182 0 0 0 0 0 0 0 0
## 2 162 0 0 0 0 0 0 0 0 0
## 3 254 57 0 0 0 0 0 0 0 0
## 4 0 0 0 0 0 0 0 0 0 0
## 5 0 0 0 0 0 0 0 0 0 0
## 6 0 0 0 89 180 180 37 0 0 0
## X0L.444 X0L.445 X0L.446 X0L.447 X0L.448 X0L.449 X0L.450 X0L.451 X0L.452
## 1 0 0 0 0 0 0 0 0 0
## 2 0 0 0 0 0 0 85 252 249
## 3 0 0 0 0 0 0 0 0 0
## 4 0 0 0 0 0 0 0 0 0
## 5 0 0 0 0 0 0 0 0 0
## 6 0 0 0 0 0 231 252 245 205
## X0L.453 X24L X114L X221L X253L.35 X253L.36 X253L.37 X253L.38 X201L X78L
## 1 0 24 114 221 253 253 253 253 201 78
## 2 146 48 29 85 178 225 253 223 167 56
## 3 0 0 0 0 0 0 0 0 169 255
## 4 159 255 253 253 31 0 0 0 0 0
## 5 0 0 0 0 71 253 252 21 0 0
## 6 216 252 252 252 124 3 0 0 0 0
## X0L.454 X0L.455 X0L.456 X0L.457 X0L.458 X0L.459 X0L.460 X0L.461 X0L.462
## 1 0 0 0 0 0 0 0 0 0
## 2 0 0 0 0 0 0 0 0 0
## 3 94 0 0 0 0 0 0 0 0
## 4 0 0 0 0 0 0 0 0 0
## 5 0 0 0 0 0 0 0 0 0
## 6 0 0 0 0 0 0 0 0 0
## X0L.463 X0L.464 X0L.465 X0L.466 X0L.467 X0L.468 X0L.469 X0L.470 X23L X66L
## 1 0 0 0 0 0 0 0 0 23 66
## 2 0 0 0 0 0 0 85 252 252 252
## 3 0 0 0 0 0 0 0 0 0 0
## 4 0 0 0 0 0 0 0 0 48 228
## 5 0 0 0 0 0 0 0 0 0 0
## 6 0 0 0 0 0 207 252 252 252 252
## X213L X253L.39 X253L.40 X253L.41 X253L.42 X198L.1 X81L.1 X2L.2 X0L.471
## 1 213 253 253 253 253 198 81 2 0
## 2 229 215 252 252 252 196 130 0 0
## 3 0 0 0 0 0 0 0 169 254
## 4 253 247 140 8 0 0 0 0 0
## 5 0 0 0 106 253 252 21 0 0
## 6 178 116 36 4 0 0 0 0 0
## X0L.472 X0L.473 X0L.474 X0L.475 X0L.476 X0L.477 X0L.478 X0L.479 X0L.480
## 1 0 0 0 0 0 0 0 0 0
## 2 0 0 0 0 0 0 0 0 0
## 3 96 0 0 0 0 0 0 0 0
## 4 0 0 0 0 0 0 0 0 0
## 5 0 0 0 0 0 0 0 0 0
## 6 0 0 0 0 0 0 0 0 0
## X0L.481 X0L.482 X0L.483 X0L.484 X0L.485 X0L.486 X18L.4 X171L X219L.1 X253L.43
## 1 0 0 0 0 0 0 18 171 219 253
## 2 0 0 0 0 0 0 28 199 252 252
## 3 0 0 0 0 0 0 0 0 0 0
## 4 0 0 0 0 0 0 0 0 64 251
## 5 0 0 0 0 0 0 0 0 0 0
## 6 0 0 0 0 0 13 93 143 121 23
## X253L.44 X253L.45 X253L.46 X195L.1 X80L.1 X9L X0L.487 X0L.488 X0L.489 X0L.490
## 1 253 253 253 195 80 9 0 0 0 0
## 2 253 252 252 233 145 0 0 0 0 0
## 3 0 0 0 0 0 0 0 169 254 153
## 4 253 220 0 0 0 0 0 0 0 0
## 5 0 0 0 45 255 253 21 0 0 0
## 6 6 0 0 0 0 0 0 0 0 0
## X0L.491 X0L.492 X0L.493 X0L.494 X0L.495 X0L.496 X0L.497 X0L.498 X0L.499
## 1 0 0 0 0 0 0 0 0 0
## 2 0 0 0 0 0 0 0 0 0
## 3 0 0 0 0 0 0 0 0 0
## 4 0 0 0 0 0 0 0 0 0
## 5 0 0 0 0 0 0 0 0 0
## 6 0 0 0 0 0 0 0 0 0
## X0L.500 X0L.501 X0L.502 X55L X172L.1 X226L X253L.47 X253L.48 X253L.49
## 1 0 0 0 55 172 226 253 253 253
## 2 0 0 0 0 0 0 25 128 252
## 3 0 0 0 0 0 0 0 0 0
## 4 0 0 0 0 0 0 0 64 251
## 5 0 0 0 0 0 0 0 0 0
## 6 0 0 0 0 0 0 0 0 0
## X253L.50 X244L X133L X11L.2 X0L.503 X0L.504 X0L.505 X0L.506 X0L.507 X0L.508
## 1 253 244 133 11 0 0 0 0 0 0
## 2 253 252 141 37 0 0 0 0 0 0
## 3 0 0 0 0 0 0 0 169 255 153
## 4 253 220 0 0 0 0 0 0 0 0
## 5 0 0 0 0 218 252 56 0 0 0
## 6 0 0 0 0 0 0 0 0 0 0
## X0L.509 X0L.510 X0L.511 X0L.512 X0L.513 X0L.514 X0L.515 X0L.516 X0L.517
## 1 0 0 0 0 0 0 0 0 0
## 2 0 0 0 0 0 0 0 0 0
## 3 0 0 0 0 0 0 0 0 0
## 4 0 0 0 0 0 0 0 0 0
## 5 0 0 0 0 0 0 0 0 0
## 6 0 0 0 0 0 0 0 0 0
## X0L.518 X0L.519 X0L.520 X136L.1 X253L.51 X253L.52 X253L.53 X212L X135L X132L
## 1 0 0 0 136 253 253 253 212 135 132
## 2 0 0 0 0 0 0 0 0 0 0
## 3 0 0 0 0 0 0 0 0 0 0
## 4 0 0 0 0 0 0 0 24 193 253
## 5 0 0 0 0 0 0 0 0 0 0
## 6 0 0 0 0 0 0 0 0 0 0
## X16L.1 X0L.521 X0L.522 X0L.523 X0L.524 X0L.525 X0L.526 X0L.527 X0L.528
## 1 16 0 0 0 0 0 0 0 0
## 2 0 0 0 0 0 0 0 0 0
## 3 0 0 0 0 0 0 96 254 153
## 4 220 0 0 0 0 0 0 0 0
## 5 0 0 0 96 252 189 42 0 0
## 6 0 0 0 0 0 0 0 0 0
## X0L.529 X0L.530 X0L.531 X0L.532 X0L.533 X0L.534 X0L.535 X0L.536 X0L.537
## 1 0 0 0 0 0 0 0 0 0
## 2 0 0 0 0 0 0 0 0 0
## 3 0 0 0 0 0 0 0 0 0
## 4 0 0 0 0 0 0 0 0 0
## 5 0 0 0 0 0 0 0 0 0
## 6 0 0 0 0 0 0 0 0 0
## X0L.538 X0L.539 X0L.540 X0L.541 X0L.542 X0L.543 X0L.544 X0L.545 X0L.546
## 1 0 0 0 0 0 0 0 0 0
## 2 0 0 0 0 0 0 0 0 0
## 3 0 0 0 0 0 0 0 0 0
## 4 0 0 0 0 0 0 0 0 0
## 5 0 0 0 0 0 0 0 0 0
## 6 0 0 0 0 0 0 0 0 0
## X0L.547 X0L.548 X0L.549 X0L.550 X0L.551 X0L.552 X0L.553 X0L.554 X0L.555
## 1 0 0 0 0 0 0 0 0 0
## 2 0 0 0 0 0 0 0 0 0
## 3 0 0 0 0 0 0 0 0 0
## 4 0 0 0 0 0 0 0 0 0
## 5 0 0 0 0 14 184 252 170 11
## 6 0 0 0 0 0 0 0 0 0
## X0L.556 X0L.557 X0L.558 X0L.559 X0L.560 X0L.561 X0L.562 X0L.563 X0L.564
## 1 0 0 0 0 0 0 0 0 0
## 2 0 0 0 0 0 0 0 0 0
## 3 0 0 0 0 0 0 0 0 0
## 4 0 0 0 0 0 0 0 0 0
## 5 0 0 0 0 0 0 0 0 0
## 6 0 0 0 0 0 0 0 0 0
## X0L.565 X0L.566 X0L.567 X0L.568 X0L.569 X0L.570 X0L.571 X0L.572 X0L.573
## 1 0 0 0 0 0 0 0 0 0
## 2 0 0 0 0 0 0 0 0 0
## 3 0 0 0 0 0 0 0 0 0
## 4 0 0 0 0 0 0 0 0 0
## 5 0 0 0 0 0 0 0 0 0
## 6 0 0 0 0 0 0 0 0 0
## X0L.574 X0L.575 X0L.576 X0L.577 X0L.578 X0L.579 X0L.580 X0L.581 X0L.582
## 1 0 0 0 0 0 0 0 0 0
## 2 0 0 0 0 0 0 0 0 0
## 3 0 0 0 0 0 0 0 0 0
## 4 0 0 0 0 0 0 0 0 0
## 5 0 0 0 0 0 0 14 147 252
## 6 0 0 0 0 0 0 0 0 0
## X0L.583 X0L.584 X0L.585 X0L.586 X0L.587 X0L.588 X0L.589 X0L.590 X0L.591
## 1 0 0 0 0 0 0 0 0 0
## 2 0 0 0 0 0 0 0 0 0
## 3 0 0 0 0 0 0 0 0 0
## 4 0 0 0 0 0 0 0 0 0
## 5 42 0 0 0 0 0 0 0 0
## 6 0 0 0 0 0 0 0 0 0
## X0L.592 X0L.593 X0L.594 X0L.595 X0L.596 X0L.597 X0L.598 X0L.599 X0L.600
## 1 0 0 0 0 0 0 0 0 0
## 2 0 0 0 0 0 0 0 0 0
## 3 0 0 0 0 0 0 0 0 0
## 4 0 0 0 0 0 0 0 0 0
## 5 0 0 0 0 0 0 0 0 0
## 6 0 0 0 0 0 0 0 0 0
## X0L.601 X0L.602 X0L.603 X0L.604 X0L.605 X0L.606 X0L.607 X0L.608 X0L.609
## 1 0 0 0 0 0 0 0 0 0
## 2 0 0 0 0 0 0 0 0 0
## 3 0 0 0 0 0 0 0 0 0
## 4 0 0 0 0 0 0 0 0 0
## 5 0 0 0 0 0 0 0 0 0
## 6 0 0 0 0 0 0 0 0 0
## X0L.610 X0L.611 X0L.612 X0L.613 X0L.614 X0L.615 X0L.616 Y
## 1 0 0 0 0 0 0 0 5
## 2 0 0 0 0 0 0 0 0
## 3 0 0 0 0 0 0 0 4
## 4 0 0 0 0 0 0 0 1
## 5 0 0 0 0 0 0 0 9
## 6 0 0 0 0 0 0 0 2
DF=df[(df$Y==0) | (df$Y==1),]
set.seed(111)
split <- sort(sample(nrow(DF), nrow(DF)*0.7))
training <- DF[split,]
testing <- DF[-split,]
log_model <- glm(as.factor(Y) ~ .-Y,
family = binomial,
data = training)
## Warning in terms.formula(formula, data = data): 'varlist' has changed (from
## nvar=785) to new 786 after EncodeVars() -- should no longer happen!
## Warning: glm.fit: algorithm did not converge
## Warning: glm.fit: fitted probabilities numerically 0 or 1 occurred
summary(log_model)
##
## Call:
## glm(formula = as.factor(Y) ~ . - Y, family = binomial, data = training)
##
## Deviance Residuals:
## Min 1Q Median 3Q Max
## -3.795e-06 -2.409e-06 5.360e-07 2.409e-06 5.847e-06
##
## Coefficients: (333 not defined because of singularities)
## Estimate Std. Error z value Pr(>|z|)
## (Intercept) 1.042e+01 4.725e+05 0 1
## X0L NA NA NA NA
## X0L.1 NA NA NA NA
## X0L.2 NA NA NA NA
## X28L NA NA NA NA
## X0L.3 NA NA NA NA
## X0L.4 NA NA NA NA
## X0L.5 NA NA NA NA
## X0L.6 NA NA NA NA
## X0L.7 NA NA NA NA
## X0L.8 NA NA NA NA
## X0L.9 NA NA NA NA
## X0L.10 NA NA NA NA
## X0L.11 NA NA NA NA
## X0L.12 NA NA NA NA
## X0L.13 NA NA NA NA
## X0L.14 NA NA NA NA
## X0L.15 NA NA NA NA
## X0L.16 NA NA NA NA
## X0L.17 NA NA NA NA
## X0L.18 NA NA NA NA
## X0L.19 NA NA NA NA
## X0L.20 NA NA NA NA
## X0L.21 NA NA NA NA
## X0L.22 NA NA NA NA
## X0L.23 NA NA NA NA
## X0L.24 NA NA NA NA
## X0L.25 NA NA NA NA
## X0L.26 NA NA NA NA
## X0L.27 NA NA NA NA
## X0L.28 NA NA NA NA
## X0L.29 NA NA NA NA
## X0L.30 NA NA NA NA
## X0L.31 NA NA NA NA
## X0L.32 NA NA NA NA
## X0L.33 NA NA NA NA
## X0L.34 NA NA NA NA
## X0L.35 NA NA NA NA
## X0L.36 NA NA NA NA
## X0L.37 NA NA NA NA
## X0L.38 NA NA NA NA
## X0L.39 NA NA NA NA
## X0L.40 NA NA NA NA
## X0L.41 NA NA NA NA
## X0L.42 NA NA NA NA
## X0L.43 NA NA NA NA
## X0L.44 NA NA NA NA
## X0L.45 NA NA NA NA
## X0L.46 NA NA NA NA
## X0L.47 NA NA NA NA
## X0L.48 NA NA NA NA
## X0L.49 NA NA NA NA
## X0L.50 NA NA NA NA
## X0L.51 NA NA NA NA
## X0L.52 NA NA NA NA
## X0L.53 NA NA NA NA
## X0L.54 NA NA NA NA
## X0L.55 NA NA NA NA
## X0L.56 NA NA NA NA
## X0L.57 NA NA NA NA
## X0L.58 NA NA NA NA
## X0L.59 NA NA NA NA
## X0L.60 NA NA NA NA
## X0L.61 NA NA NA NA
## X0L.62 NA NA NA NA
## X0L.63 NA NA NA NA
## X0L.64 NA NA NA NA
## X0L.65 NA NA NA NA
## X0L.66 NA NA NA NA
## X0L.67 NA NA NA NA
## X0L.68 NA NA NA NA
## X0L.69 NA NA NA NA
## X0L.70 NA NA NA NA
## X0L.71 NA NA NA NA
## X0L.72 NA NA NA NA
## X0L.73 NA NA NA NA
## X0L.74 NA NA NA NA
## X0L.75 NA NA NA NA
## X0L.76 NA NA NA NA
## X0L.77 NA NA NA NA
## X0L.78 NA NA NA NA
## X0L.79 NA NA NA NA
## X0L.80 NA NA NA NA
## X0L.81 NA NA NA NA
## X0L.82 NA NA NA NA
## X0L.83 NA NA NA NA
## X0L.84 NA NA NA NA
## X0L.85 NA NA NA NA
## X0L.86 NA NA NA NA
## X0L.87 NA NA NA NA
## X0L.88 NA NA NA NA
## X0L.89 NA NA NA NA
## X0L.90 NA NA NA NA
## X0L.91 NA NA NA NA
## X0L.92 NA NA NA NA
## X0L.93 NA NA NA NA
## X0L.94 NA NA NA NA
## X0L.95 NA NA NA NA
## X0L.96 NA NA NA NA
## X0L.97 2.133e+00 1.250e+06 0 1
## X0L.98 -3.055e-01 1.312e+05 0 1
## X0L.99 -4.422e-02 3.310e+04 0 1
## X0L.100 1.080e-01 1.279e+04 0 1
## X0L.101 4.626e-03 1.489e+04 0 1
## X0L.102 -4.009e-02 1.933e+04 0 1
## X0L.103 7.602e-02 2.409e+04 0 1
## X0L.104 -6.215e-02 1.866e+04 0 1
## X0L.105 2.330e-02 5.064e+03 0 1
## X0L.106 7.154e-02 1.606e+04 0 1
## X0L.107 -4.395e+00 3.757e+05 0 1
## X0L.108 2.895e+01 2.421e+06 0 1
## X0L.109 NA NA NA NA
## X0L.110 NA NA NA NA
## X0L.111 NA NA NA NA
## X0L.112 NA NA NA NA
## X0L.113 NA NA NA NA
## X0L.114 NA NA NA NA
## X0L.115 NA NA NA NA
## X0L.116 NA NA NA NA
## X0L.117 NA NA NA NA
## X0L.118 NA NA NA NA
## X0L.119 -6.014e-02 3.831e+04 0 1
## X0L.120 NA NA NA NA
## X0L.121 NA NA NA NA
## X0L.122 -1.813e+01 6.361e+06 0 1
## X0L.123 6.773e-01 1.931e+05 0 1
## X0L.124 -6.981e-01 6.699e+04 0 1
## X0L.125 -2.717e-02 2.025e+04 0 1
## X0L.126 -8.553e-03 3.590e+03 0 1
## X0L.127 1.780e-02 2.316e+03 0 1
## X0L.128 -1.131e-02 1.338e+03 0 1
## X0L.129 1.893e-03 1.319e+03 0 1
## X0L.130 9.501e-03 1.088e+03 0 1
## X0L.131 -1.073e-02 1.139e+03 0 1
## X0L.132 8.928e-03 1.235e+03 0 1
## X0L.133 -8.319e-03 1.234e+03 0 1
## X0L.134 1.135e-02 1.586e+03 0 1
## X0L.135 1.478e-02 2.619e+03 0 1
## X0L.136 -4.240e-02 3.932e+03 0 1
## X0L.137 1.399e-02 5.879e+03 0 1
## X0L.138 6.968e-02 2.109e+04 0 1
## X0L.139 -2.116e+00 4.342e+05 0 1
## X0L.140 NA NA NA NA
## X0L.141 NA NA NA NA
## X0L.142 NA NA NA NA
## X0L.143 NA NA NA NA
## X0L.144 NA NA NA NA
## X0L.145 1.677e-03 1.015e+04 0 1
## X0L.146 NA NA NA NA
## X0L.147 NA NA NA NA
## X0L.148 NA NA NA NA
## X0L.149 -1.619e+01 6.168e+06 0 1
## X0L.150 1.101e-01 1.307e+04 0 1
## X0L.151 4.910e-01 1.022e+05 0 1
## X0L.152 2.015e-01 1.673e+04 0 1
## X0L.153 4.640e-02 1.036e+04 0 1
## X0L.154 1.574e-02 3.533e+03 0 1
## X3L -2.195e-03 2.935e+03 0 1
## X18L 3.291e-03 2.091e+03 0 1
## X18L.1 -8.286e-03 1.224e+03 0 1
## X18L.2 -8.039e-03 1.157e+03 0 1
## X126L 6.879e-03 1.152e+03 0 1
## X136L 7.377e-04 1.176e+03 0 1
## X175L 3.506e-03 1.378e+03 0 1
## X26L -6.077e-03 2.020e+03 0 1
## X166L -3.623e-03 2.415e+03 0 1
## X255L 4.805e-03 3.206e+03 0 1
## X247L 1.962e-02 4.870e+03 0 1
## X127L -1.919e-02 1.749e+04 0 1
## X0L.155 1.705e+00 4.354e+05 0 1
## X0L.156 NA NA NA NA
## X0L.157 -4.442e-02 1.979e+04 0 1
## X0L.158 NA NA NA NA
## X0L.159 NA NA NA NA
## X0L.160 NA NA NA NA
## X0L.161 NA NA NA NA
## X0L.162 NA NA NA NA
## X0L.163 NA NA NA NA
## X0L.164 NA NA NA NA
## X0L.165 1.605e+01 6.188e+06 0 1
## X0L.166 -9.428e-01 1.926e+05 0 1
## X30L -2.974e-01 2.530e+04 0 1
## X36L -1.064e-01 1.436e+04 0 1
## X94L -2.639e-02 6.506e+03 0 1
## X154L -2.396e-03 3.936e+03 0 1
## X170L -2.380e-02 3.025e+03 0 1
## X253L 2.355e-03 2.761e+03 0 1
## X253L.1 1.117e-03 1.700e+03 0 1
## X253L.2 2.041e-02 1.814e+03 0 1
## X253L.3 -6.821e-03 1.390e+03 0 1
## X253L.4 -1.412e-02 1.423e+03 0 1
## X225L 6.301e-03 1.804e+03 0 1
## X172L -5.062e-03 2.023e+03 0 1
## X253L.5 -1.804e-02 2.464e+03 0 1
## X242L 1.912e-02 3.712e+03 0 1
## X195L -1.686e-02 6.022e+03 0 1
## X64L 1.264e-02 1.338e+04 0 1
## X0L.167 3.053e-01 3.381e+04 0 1
## X0L.168 NA NA NA NA
## X0L.169 NA NA NA NA
## X0L.170 NA NA NA NA
## X0L.171 NA NA NA NA
## X0L.172 NA NA NA NA
## X0L.173 NA NA NA NA
## X0L.174 NA NA NA NA
## X0L.175 NA NA NA NA
## X0L.176 NA NA NA NA
## X0L.177 -5.095e-01 6.591e+04 0 1
## X49L 7.232e-02 5.370e+04 0 1
## X238L 4.155e-01 1.859e+04 0 1
## X253L.6 -2.701e-02 9.008e+03 0 1
## X253L.7 -1.946e-02 5.717e+03 0 1
## X253L.8 1.552e-02 4.950e+03 0 1
## X253L.9 4.158e-02 3.967e+03 0 1
## X253L.10 -4.054e-03 3.083e+03 0 1
## X253L.11 2.297e-03 1.911e+03 0 1
## X253L.12 -1.919e-02 2.177e+03 0 1
## X253L.13 9.311e-03 1.613e+03 0 1
## X251L 1.477e-02 1.672e+03 0 1
## X93L -4.677e-03 2.066e+03 0 1
## X82L 3.910e-03 2.103e+03 0 1
## X82L.1 1.135e-02 2.589e+03 0 1
## X56L -4.801e-02 3.898e+03 0 1
## X39L 8.368e-02 5.762e+03 0 1
## X0L.178 -9.045e-02 7.777e+03 0 1
## X0L.179 -7.978e-02 1.933e+04 0 1
## X0L.180 NA NA NA NA
## X0L.181 NA NA NA NA
## X0L.182 NA NA NA NA
## X0L.183 NA NA NA NA
## X0L.184 NA NA NA NA
## X0L.185 NA NA NA NA
## X0L.186 NA NA NA NA
## X0L.187 NA NA NA NA
## X0L.188 -1.599e-01 1.200e+06 0 1
## X0L.189 -1.114e-01 1.137e+05 0 1
## X18L.3 -2.060e-01 3.184e+04 0 1
## X219L -4.455e-02 9.223e+03 0 1
## X253L.14 2.617e-02 7.391e+03 0 1
## X253L.15 1.141e-02 4.769e+03 0 1
## X253L.16 -1.417e-02 4.176e+03 0 1
## X253L.17 -5.386e-02 3.063e+03 0 1
## X253L.18 1.804e-02 2.682e+03 0 1
## X198L -7.549e-03 1.933e+03 0 1
## X182L -3.726e-03 1.674e+03 0 1
## X247L.1 -6.575e-03 2.013e+03 0 1
## X241L -9.340e-03 1.859e+03 0 1
## X0L.190 -1.497e-02 1.692e+03 0 1
## X0L.191 1.515e-02 2.047e+03 0 1
## X0L.192 -1.289e-07 3.180e+03 0 1
## X0L.193 6.699e-02 4.302e+03 0 1
## X0L.194 -6.753e-02 6.206e+03 0 1
## X0L.195 4.875e-02 7.396e+03 0 1
## X0L.196 9.709e-02 1.408e+04 0 1
## X0L.197 NA NA NA NA
## X0L.198 NA NA NA NA
## X0L.199 NA NA NA NA
## X0L.200 NA NA NA NA
## X0L.201 NA NA NA NA
## X0L.202 NA NA NA NA
## X0L.203 NA NA NA NA
## X0L.204 NA NA NA NA
## X0L.205 2.389e-01 3.241e+05 0 1
## X0L.206 2.008e-01 3.975e+04 0 1
## X0L.207 -1.472e-01 1.399e+04 0 1
## X80L 3.067e-02 8.957e+03 0 1
## X156L 4.470e-02 6.477e+03 0 1
## X107L 7.612e-03 4.672e+03 0 1
## X253L.19 9.363e-03 3.199e+03 0 1
## X253L.20 2.711e-02 2.654e+03 0 1
## X205L -1.732e-02 1.960e+03 0 1
## X11L 6.860e-03 1.661e+03 0 1
## X0L.208 -6.008e-03 1.491e+03 0 1
## X43L 1.063e-03 2.138e+03 0 1
## X154L.1 1.677e-02 2.033e+03 0 1
## X0L.209 3.233e-02 2.112e+03 0 1
## X0L.210 7.061e-03 2.177e+03 0 1
## X0L.211 1.195e-02 5.309e+03 0 1
## X0L.212 -1.040e-01 6.268e+03 0 1
## X0L.213 4.511e-02 5.902e+03 0 1
## X0L.214 -4.970e-02 8.508e+03 0 1
## X0L.215 -6.544e-02 1.959e+04 0 1
## X0L.216 NA NA NA NA
## X0L.217 NA NA NA NA
## X0L.218 NA NA NA NA
## X0L.219 NA NA NA NA
## X0L.220 NA NA NA NA
## X0L.221 NA NA NA NA
## X0L.222 NA NA NA NA
## X0L.223 NA NA NA NA
## X0L.224 -3.564e-01 6.812e+04 0 1
## X0L.225 1.370e-01 1.342e+04 0 1
## X0L.226 -1.104e-02 1.452e+04 0 1
## X0L.227 -1.879e-02 6.810e+03 0 1
## X14L -2.414e-02 6.194e+03 0 1
## X1L -6.452e-03 4.607e+03 0 1
## X154L.2 -1.059e-02 3.309e+03 0 1
## X253L.21 -2.152e-02 2.823e+03 0 1
## X90L 6.167e-03 2.142e+03 0 1
## X0L.228 9.665e-03 1.691e+03 0 1
## X0L.229 6.639e-03 1.727e+03 0 1
## X0L.230 1.059e-03 2.057e+03 0 1
## X0L.231 -5.992e-03 2.117e+03 0 1
## X0L.232 -4.554e-02 2.065e+03 0 1
## X0L.233 -3.789e-02 3.351e+03 0 1
## X0L.234 -3.322e-02 5.436e+03 0 1
## X0L.235 3.617e-03 5.371e+03 0 1
## X0L.236 -1.033e-01 6.277e+03 0 1
## X0L.237 -3.101e-02 7.726e+03 0 1
## X0L.238 1.135e-02 1.883e+04 0 1
## X0L.239 -2.045e+00 5.733e+05 0 1
## X0L.240 NA NA NA NA
## X0L.241 NA NA NA NA
## X0L.242 NA NA NA NA
## X0L.243 NA NA NA NA
## X0L.244 NA NA NA NA
## X0L.245 NA NA NA NA
## X0L.246 2.977e-01 2.290e+05 0 1
## X0L.247 3.874e-01 2.488e+04 0 1
## X0L.248 -4.052e-02 1.646e+04 0 1
## X0L.249 -5.453e-02 9.644e+03 0 1
## X0L.250 9.714e-03 6.367e+03 0 1
## X0L.251 1.228e-02 5.274e+03 0 1
## X0L.252 4.725e-02 4.686e+03 0 1
## X139L 1.708e-02 3.726e+03 0 1
## X253L.22 3.353e-02 2.596e+03 0 1
## X190L -1.485e-03 1.955e+03 0 1
## X2L 5.495e-03 1.879e+03 0 1
## X0L.253 -1.932e-03 1.944e+03 0 1
## X0L.254 6.910e-03 1.717e+03 0 1
## X0L.255 -1.069e-02 1.890e+03 0 1
## X0L.256 4.195e-02 2.973e+03 0 1
## X0L.257 7.968e-03 3.679e+03 0 1
## X0L.258 6.905e-02 6.101e+03 0 1
## X0L.259 6.902e-02 7.190e+03 0 1
## X0L.260 1.466e-01 8.264e+03 0 1
## X0L.261 9.465e-02 8.822e+03 0 1
## X0L.262 4.247e-02 1.976e+04 0 1
## X0L.263 1.366e+00 3.909e+05 0 1
## X0L.264 -5.243e-01 1.670e+05 0 1
## X0L.265 NA NA NA NA
## X0L.266 NA NA NA NA
## X0L.267 NA NA NA NA
## X0L.268 NA NA NA NA
## X0L.269 NA NA NA NA
## X0L.270 -3.473e-01 5.806e+04 0 1
## X0L.271 -1.821e-01 1.823e+04 0 1
## X0L.272 4.020e-02 8.936e+03 0 1
## X0L.273 -4.921e-03 5.785e+03 0 1
## X0L.274 -1.433e-02 5.408e+03 0 1
## X0L.275 -2.647e-02 5.720e+03 0 1
## X0L.276 -1.437e-01 5.516e+03 0 1
## X11L.1 2.073e-03 3.840e+03 0 1
## X190L.1 -2.382e-02 2.430e+03 0 1
## X253L.23 7.563e-03 1.714e+03 0 1
## X70L 1.265e-02 1.999e+03 0 1
## X0L.277 -9.835e-03 2.499e+03 0 1
## X0L.278 -9.980e-03 1.785e+03 0 1
## X0L.279 3.179e-03 1.950e+03 0 1
## X0L.280 -7.564e-03 3.772e+03 0 1
## X0L.281 -1.143e-03 5.463e+03 0 1
## X0L.282 -7.475e-02 7.439e+03 0 1
## X0L.283 -9.786e-02 6.748e+03 0 1
## X0L.284 -4.278e-02 8.242e+03 0 1
## X0L.285 -1.060e-01 9.681e+03 0 1
## X0L.286 -7.771e-02 1.110e+04 0 1
## X0L.287 NA NA NA NA
## X0L.288 NA NA NA NA
## X0L.289 NA NA NA NA
## X0L.290 NA NA NA NA
## X0L.291 NA NA NA NA
## X0L.292 3.808e-01 4.354e+04 0 1
## X0L.293 NA NA NA NA
## X0L.294 9.189e-02 2.068e+04 0 1
## X0L.295 -3.187e-02 1.368e+04 0 1
## X0L.296 -5.947e-02 1.139e+04 0 1
## X0L.297 7.286e-02 5.520e+03 0 1
## X0L.298 -4.201e-02 6.728e+03 0 1
## X0L.299 7.554e-02 6.059e+03 0 1
## X0L.300 8.136e-02 5.354e+03 0 1
## X0L.301 7.773e-03 2.750e+03 0 1
## X35L -2.099e-05 1.883e+03 0 1
## X241L.1 -1.435e-02 1.234e+03 0 1
## X225L.1 -2.133e-02 3.697e+03 0 1
## X160L 1.188e-02 2.112e+03 0 1
## X108L 1.548e-02 1.805e+03 0 1
## X1L.1 6.882e-03 2.742e+03 0 1
## X0L.302 -1.328e-02 4.471e+03 0 1
## X0L.303 -7.457e-02 6.151e+03 0 1
## X0L.304 6.172e-02 7.435e+03 0 1
## X0L.305 3.583e-02 6.730e+03 0 1
## X0L.306 -2.632e-02 6.736e+03 0 1
## X0L.307 -7.703e-03 8.193e+03 0 1
## X0L.308 8.124e-02 1.376e+04 0 1
## X0L.309 NA NA NA NA
## X0L.310 NA NA NA NA
## X0L.311 NA NA NA NA
## X0L.312 NA NA NA NA
## X0L.313 NA NA NA NA
## X0L.314 NA NA NA NA
## X0L.315 NA NA NA NA
## X0L.316 -5.297e-02 2.083e+04 0 1
## X0L.317 1.268e-01 1.071e+04 0 1
## X0L.318 6.120e-02 9.574e+03 0 1
## X0L.319 -4.403e-02 6.071e+03 0 1
## X0L.320 9.570e-02 6.422e+03 0 1
## X0L.321 -7.001e-03 4.693e+03 0 1
## X0L.322 -4.917e-02 5.237e+03 0 1
## X0L.323 -2.492e-02 2.728e+03 0 1
## X0L.324 4.690e-03 1.716e+03 0 1
## X81L 2.303e-03 1.699e+03 0 1
## X240L 8.201e-02 4.688e+03 0 1
## X253L.24 -4.183e-03 1.530e+03 0 1
## X253L.25 -3.609e-03 1.612e+03 0 1
## X119L -8.522e-04 4.452e+03 0 1
## X25L 8.523e-03 5.264e+03 0 1
## X0L.325 6.671e-03 5.778e+03 0 1
## X0L.326 -4.413e-02 7.096e+03 0 1
## X0L.327 -6.476e-03 8.204e+03 0 1
## X0L.328 -1.502e-02 7.692e+03 0 1
## X0L.329 4.746e-02 9.310e+03 0 1
## X0L.330 -1.980e-01 1.363e+04 0 1
## X0L.331 -4.184e-01 6.788e+04 0 1
## X0L.332 NA NA NA NA
## X0L.333 NA NA NA NA
## X0L.334 NA NA NA NA
## X0L.335 NA NA NA NA
## X0L.336 NA NA NA NA
## X0L.337 8.424e+00 6.669e+05 0 1
## X0L.338 5.863e-02 1.271e+04 0 1
## X0L.339 -1.217e-01 9.306e+03 0 1
## X0L.340 6.760e-02 7.658e+03 0 1
## X0L.341 -3.601e-03 4.776e+03 0 1
## X0L.342 -1.204e-01 8.771e+03 0 1
## X0L.343 1.914e-02 6.646e+03 0 1
## X0L.344 6.427e-02 5.093e+03 0 1
## X0L.345 1.127e-02 2.324e+03 0 1
## X0L.346 5.221e-03 1.907e+03 0 1
## X0L.347 -2.066e-02 2.638e+03 0 1
## X45L -1.581e-02 4.048e+03 0 1
## X186L 3.277e-03 1.659e+03 0 1
## X253L.26 -1.881e-02 1.927e+03 0 1
## X253L.27 2.766e-02 5.233e+03 0 1
## X150L -6.257e-02 4.681e+03 0 1
## X27L 4.163e-02 5.356e+03 0 1
## X0L.348 7.516e-02 7.300e+03 0 1
## X0L.349 -6.190e-03 7.736e+03 0 1
## X0L.350 2.938e-02 8.132e+03 0 1
## X0L.351 5.416e-02 1.098e+04 0 1
## X0L.352 -9.355e-02 2.019e+04 0 1
## X0L.353 NA NA NA NA
## X0L.354 NA NA NA NA
## X0L.355 NA NA NA NA
## X0L.356 NA NA NA NA
## X0L.357 NA NA NA NA
## X0L.358 NA NA NA NA
## X0L.359 NA NA NA NA
## X0L.360 1.954e-02 7.453e+03 0 1
## X0L.361 -5.749e-02 5.556e+03 0 1
## X0L.362 -1.198e-01 7.873e+03 0 1
## X0L.363 1.579e-02 6.054e+03 0 1
## X0L.364 4.514e-02 6.171e+03 0 1
## X0L.365 6.994e-03 4.771e+03 0 1
## X0L.366 -3.487e-02 3.450e+03 0 1
## X0L.367 9.345e-03 2.241e+03 0 1
## X0L.368 1.152e-02 1.819e+03 0 1
## X0L.369 2.201e-02 3.002e+03 0 1
## X0L.370 6.810e-03 2.980e+03 0 1
## X16L 1.454e-03 1.604e+03 0 1
## X93L.1 2.669e-02 2.716e+03 0 1
## X252L -3.231e-02 6.203e+03 0 1
## X253L.28 7.042e-02 5.920e+03 0 1
## X187L -6.345e-03 5.381e+03 0 1
## X0L.371 -8.540e-02 6.721e+03 0 1
## X0L.372 -1.675e-02 5.586e+03 0 1
## X0L.373 -3.212e-02 6.547e+03 0 1
## X0L.374 -8.663e-02 1.131e+04 0 1
## X0L.375 5.266e-01 2.761e+04 0 1
## X0L.376 NA NA NA NA
## X0L.377 NA NA NA NA
## X0L.378 NA NA NA NA
## X0L.379 NA NA NA NA
## X0L.380 NA NA NA NA
## X0L.381 NA NA NA NA
## X0L.382 NA NA NA NA
## X0L.383 -7.019e-03 8.669e+03 0 1
## X0L.384 1.112e-01 7.355e+03 0 1
## X0L.385 4.205e-02 6.064e+03 0 1
## X0L.386 -2.812e-02 8.162e+03 0 1
## X0L.387 -1.736e-02 5.350e+03 0 1
## X0L.388 -3.297e-02 4.284e+03 0 1
## X0L.389 -1.104e-02 1.839e+03 0 1
## X0L.390 -1.235e-02 1.977e+03 0 1
## X0L.391 4.501e-04 1.830e+03 0 1
## X0L.392 -2.003e-02 2.600e+03 0 1
## X0L.393 -3.919e-03 1.821e+03 0 1
## X0L.394 -1.478e-02 2.021e+03 0 1
## X0L.395 -1.300e-02 3.369e+03 0 1
## X249L -2.539e-02 4.755e+03 0 1
## X253L.29 -1.036e-01 5.765e+03 0 1
## X249L.1 -1.733e-02 5.912e+03 0 1
## X64L.1 2.448e-02 5.143e+03 0 1
## X0L.396 2.078e-02 5.437e+03 0 1
## X0L.397 -1.898e-02 9.373e+03 0 1
## X0L.398 4.675e-02 1.287e+04 0 1
## X0L.399 -8.149e-01 3.816e+04 0 1
## X0L.400 NA NA NA NA
## X0L.401 1.020e-01 5.184e+04 0 1
## X0L.402 NA NA NA NA
## X0L.403 NA NA NA NA
## X0L.404 NA NA NA NA
## X0L.405 NA NA NA NA
## X0L.406 -2.483e-02 2.845e+04 0 1
## X0L.407 -7.825e-02 1.033e+04 0 1
## X0L.408 -1.337e-02 8.683e+03 0 1
## X0L.409 6.103e-02 8.907e+03 0 1
## X0L.410 -2.454e-03 8.768e+03 0 1
## X0L.411 -2.426e-03 3.481e+03 0 1
## X0L.412 3.456e-02 2.376e+03 0 1
## X0L.413 1.876e-02 1.661e+03 0 1
## X0L.414 -1.151e-02 2.013e+03 0 1
## X0L.415 -3.037e-03 1.980e+03 0 1
## X0L.416 1.514e-02 2.013e+03 0 1
## X46L 6.205e-03 1.611e+03 0 1
## X130L 2.151e-03 2.021e+03 0 1
## X183L 2.224e-02 2.969e+03 0 1
## X253L.30 2.637e-02 3.966e+03 0 1
## X253L.31 2.700e-02 5.693e+03 0 1
## X207L -5.539e-02 5.341e+03 0 1
## X2L.1 4.354e-03 5.660e+03 0 1
## X0L.417 -3.097e-02 7.727e+03 0 1
## X0L.418 -8.195e-02 1.172e+04 0 1
## X0L.419 1.390e-01 1.771e+04 0 1
## X0L.420 7.813e-01 6.476e+04 0 1
## X0L.421 -6.617e+00 1.144e+06 0 1
## X0L.422 NA NA NA NA
## X0L.423 -5.706e-03 8.958e+03 0 1
## X0L.424 NA NA NA NA
## X0L.425 NA NA NA NA
## X0L.426 NA NA NA NA
## X0L.427 NA NA NA NA
## X0L.428 -8.007e-03 7.684e+03 0 1
## X0L.429 -7.275e-03 6.438e+03 0 1
## X0L.430 -3.199e-02 7.804e+03 0 1
## X0L.431 -1.026e-02 7.858e+03 0 1
## X0L.432 1.524e-02 3.444e+03 0 1
## X0L.433 -2.952e-02 2.323e+03 0 1
## X0L.434 1.138e-02 1.900e+03 0 1
## X0L.435 -3.268e-03 1.867e+03 0 1
## X39L.1 8.318e-04 1.724e+03 0 1
## X148L 6.701e-03 1.666e+03 0 1
## X229L -1.027e-02 1.608e+03 0 1
## X253L.32 6.486e-03 1.741e+03 0 1
## X253L.33 5.992e-03 2.097e+03 0 1
## X253L.34 -1.814e-02 4.636e+03 0 1
## X250L 6.722e-02 4.697e+03 0 1
## X182L.1 -2.983e-03 4.307e+03 0 1
## X0L.436 1.298e-02 6.689e+03 0 1
## X0L.437 5.983e-02 8.645e+03 0 1
## X0L.438 -8.799e-03 1.702e+04 0 1
## X0L.439 -3.932e-01 2.345e+04 0 1
## X0L.440 -5.679e-01 6.052e+04 0 1
## X0L.441 2.695e+01 2.907e+06 0 1
## X0L.442 NA NA NA NA
## X0L.443 NA NA NA NA
## X0L.444 NA NA NA NA
## X0L.445 NA NA NA NA
## X0L.446 NA NA NA NA
## X0L.447 NA NA NA NA
## X0L.448 6.404e-02 8.254e+03 0 1
## X0L.449 -8.012e-04 6.109e+03 0 1
## X0L.450 -2.627e-02 7.057e+03 0 1
## X0L.451 3.484e-03 5.858e+03 0 1
## X0L.452 -8.178e-03 2.602e+03 0 1
## X0L.453 1.169e-02 2.153e+03 0 1
## X24L -3.077e-02 1.915e+03 0 1
## X114L 2.663e-02 1.844e+03 0 1
## X221L -9.017e-03 1.640e+03 0 1
## X253L.35 -6.942e-03 1.707e+03 0 1
## X253L.36 2.965e-02 1.534e+03 0 1
## X253L.37 -5.612e-03 1.715e+03 0 1
## X253L.38 -7.709e-03 2.041e+03 0 1
## X201L -1.772e-02 3.304e+03 0 1
## X78L 8.988e-03 4.093e+03 0 1
## X0L.454 2.336e-03 5.042e+03 0 1
## X0L.455 -4.479e-02 5.996e+03 0 1
## X0L.456 9.334e-02 1.749e+04 0 1
## X0L.457 5.332e-02 3.263e+04 0 1
## X0L.458 9.907e-01 7.371e+04 0 1
## X0L.459 -3.234e+00 3.561e+05 0 1
## X0L.460 NA NA NA NA
## X0L.461 NA NA NA NA
## X0L.462 NA NA NA NA
## X0L.463 NA NA NA NA
## X0L.464 NA NA NA NA
## X0L.465 NA NA NA NA
## X0L.466 NA NA NA NA
## X0L.467 5.399e-03 9.049e+03 0 1
## X0L.468 -4.834e-02 8.647e+03 0 1
## X0L.469 7.768e-02 7.204e+03 0 1
## X0L.470 -2.721e-03 4.343e+03 0 1
## X23L 4.637e-03 2.361e+03 0 1
## X66L -9.270e-03 2.592e+03 0 1
## X213L 2.725e-03 1.786e+03 0 1
## X253L.39 -1.393e-02 1.508e+03 0 1
## X253L.40 1.464e-02 1.432e+03 0 1
## X253L.41 2.151e-02 1.687e+03 0 1
## X253L.42 -7.007e-03 1.623e+03 0 1
## X198L.1 1.902e-03 1.924e+03 0 1
## X81L.1 -6.630e-03 2.198e+03 0 1
## X2L.2 -8.062e-03 3.810e+03 0 1
## X0L.471 4.204e-02 4.162e+03 0 1
## X0L.472 -4.563e-02 7.755e+03 0 1
## X0L.473 9.420e-02 1.371e+04 0 1
## X0L.474 -1.333e-01 2.041e+04 0 1
## X0L.475 5.171e-01 5.119e+04 0 1
## X0L.476 -2.006e+00 2.273e+05 0 1
## X0L.477 NA NA NA NA
## X0L.478 NA NA NA NA
## X0L.479 1.842e-02 4.609e+03 0 1
## X0L.480 NA NA NA NA
## X0L.481 NA NA NA NA
## X0L.482 NA NA NA NA
## X0L.483 NA NA NA NA
## X0L.484 NA NA NA NA
## X0L.485 6.973e-02 8.656e+03 0 1
## X0L.486 -1.341e-02 7.475e+03 0 1
## X18L.4 -3.023e-02 5.934e+03 0 1
## X171L 2.587e-03 2.668e+03 0 1
## X219L.1 -1.157e-02 2.202e+03 0 1
## X253L.43 1.928e-02 2.459e+03 0 1
## X253L.44 5.509e-03 1.522e+03 0 1
## X253L.45 -2.872e-03 1.382e+03 0 1
## X253L.46 -1.873e-02 1.516e+03 0 1
## X195L.1 -2.062e-02 1.430e+03 0 1
## X80L.1 -1.053e-02 1.485e+03 0 1
## X9L -5.681e-03 1.749e+03 0 1
## X0L.487 -5.038e-03 1.962e+03 0 1
## X0L.488 2.845e-02 3.179e+03 0 1
## X0L.489 -5.234e-02 6.929e+03 0 1
## X0L.490 -4.291e-02 1.512e+04 0 1
## X0L.491 -1.108e-01 1.414e+04 0 1
## X0L.492 -4.385e-01 3.418e+04 0 1
## X0L.493 -1.605e-01 3.703e+05 0 1
## X0L.494 NA NA NA NA
## X0L.495 NA NA NA NA
## X0L.496 NA NA NA NA
## X0L.497 NA NA NA NA
## X0L.498 NA NA NA NA
## X0L.499 NA NA NA NA
## X0L.500 NA NA NA NA
## X0L.501 NA NA NA NA
## X0L.502 NA NA NA NA
## X55L 4.868e-02 1.946e+04 0 1
## X172L.1 -7.703e-02 9.597e+03 0 1
## X226L 4.990e-02 4.159e+03 0 1
## X253L.47 -2.094e-02 2.588e+03 0 1
## X253L.48 1.277e-02 1.781e+03 0 1
## X253L.49 -9.132e-03 1.600e+03 0 1
## X253L.50 6.069e-03 1.178e+03 0 1
## X244L -4.002e-03 1.080e+03 0 1
## X133L 8.740e-03 1.163e+03 0 1
## X11L.2 -8.611e-04 1.022e+03 0 1
## X0L.503 1.062e-03 1.067e+03 0 1
## X0L.504 2.438e-03 1.507e+03 0 1
## X0L.505 4.807e-03 2.159e+03 0 1
## X0L.506 -9.688e-03 3.505e+03 0 1
## X0L.507 4.094e-02 1.396e+04 0 1
## X0L.508 -2.693e-03 1.856e+04 0 1
## X0L.509 6.189e-01 4.149e+04 0 1
## X0L.510 -8.038e-02 4.268e+05 0 1
## X0L.511 NA NA NA NA
## X0L.512 NA NA NA NA
## X0L.513 NA NA NA NA
## X0L.514 NA NA NA NA
## X0L.515 NA NA NA NA
## X0L.516 NA NA NA NA
## X0L.517 NA NA NA NA
## X0L.518 NA NA NA NA
## X0L.519 NA NA NA NA
## X0L.520 NA NA NA NA
## X136L.1 NA NA NA NA
## X253L.51 -3.264e+00 4.891e+05 0 1
## X253L.52 2.557e-01 3.941e+04 0 1
## X253L.53 -1.249e-02 4.914e+03 0 1
## X212L 1.417e-02 2.678e+03 0 1
## X135L -1.887e-03 1.867e+03 0 1
## X132L 1.093e-03 1.376e+03 0 1
## X16L.1 -6.438e-03 1.418e+03 0 1
## X0L.521 7.855e-03 1.407e+03 0 1
## X0L.522 -5.167e-03 1.508e+03 0 1
## X0L.523 8.212e-03 1.348e+03 0 1
## X0L.524 -1.271e-02 1.996e+03 0 1
## X0L.525 1.282e-02 2.600e+03 0 1
## X0L.526 -3.648e-02 5.098e+03 0 1
## X0L.527 4.865e-02 6.676e+03 0 1
## X0L.528 -6.685e-02 2.816e+04 0 1
## X0L.529 NA NA NA NA
## X0L.530 NA NA NA NA
## X0L.531 NA NA NA NA
## X0L.532 NA NA NA NA
## X0L.533 NA NA NA NA
## X0L.534 NA NA NA NA
## X0L.535 NA NA NA NA
## X0L.536 NA NA NA NA
## X0L.537 NA NA NA NA
## X0L.538 NA NA NA NA
## X0L.539 NA NA NA NA
## X0L.540 NA NA NA NA
## X0L.541 NA NA NA NA
## X0L.542 NA NA NA NA
## X0L.543 NA NA NA NA
## X0L.544 NA NA NA NA
## X0L.545 8.691e-01 1.242e+05 0 1
## X0L.546 -2.642e+00 2.730e+05 0 1
## X0L.547 2.800e+00 2.582e+05 0 1
## X0L.548 -1.124e+00 8.123e+04 0 1
## X0L.549 7.071e-02 2.928e+04 0 1
## X0L.550 -3.344e-02 1.075e+04 0 1
## X0L.551 1.763e-02 3.523e+03 0 1
## X0L.552 NA NA NA NA
## X0L.553 NA NA NA NA
## X0L.554 NA NA NA NA
## X0L.555 NA NA NA NA
## X0L.556 NA NA NA NA
## X0L.557 NA NA NA NA
## X0L.558 NA NA NA NA
## X0L.559 NA NA NA NA
## X0L.560 NA NA NA NA
## X0L.561 NA NA NA NA
## X0L.562 NA NA NA NA
## X0L.563 NA NA NA NA
## X0L.564 NA NA NA NA
## X0L.565 NA NA NA NA
## X0L.566 NA NA NA NA
## X0L.567 NA NA NA NA
## X0L.568 NA NA NA NA
## X0L.569 NA NA NA NA
## X0L.570 NA NA NA NA
## X0L.571 NA NA NA NA
## X0L.572 NA NA NA NA
## X0L.573 NA NA NA NA
## X0L.574 NA NA NA NA
## X0L.575 NA NA NA NA
## X0L.576 NA NA NA NA
## X0L.577 NA NA NA NA
## X0L.578 NA NA NA NA
## X0L.579 NA NA NA NA
## X0L.580 NA NA NA NA
## X0L.581 NA NA NA NA
## X0L.582 NA NA NA NA
## X0L.583 NA NA NA NA
## X0L.584 NA NA NA NA
## X0L.585 NA NA NA NA
## X0L.586 NA NA NA NA
## X0L.587 NA NA NA NA
## X0L.588 NA NA NA NA
## X0L.589 NA NA NA NA
## X0L.590 NA NA NA NA
## X0L.591 NA NA NA NA
## X0L.592 NA NA NA NA
## X0L.593 NA NA NA NA
## X0L.594 NA NA NA NA
## X0L.595 NA NA NA NA
## X0L.596 NA NA NA NA
## X0L.597 NA NA NA NA
## X0L.598 NA NA NA NA
## X0L.599 NA NA NA NA
## X0L.600 NA NA NA NA
## X0L.601 NA NA NA NA
## X0L.602 NA NA NA NA
## X0L.603 NA NA NA NA
## X0L.604 NA NA NA NA
## X0L.605 NA NA NA NA
## X0L.606 NA NA NA NA
## X0L.607 NA NA NA NA
## X0L.608 NA NA NA NA
## X0L.609 NA NA NA NA
## X0L.610 NA NA NA NA
## X0L.611 NA NA NA NA
## X0L.612 NA NA NA NA
## X0L.613 NA NA NA NA
## X0L.614 NA NA NA NA
## X0L.615 NA NA NA NA
## X0L.616 NA NA NA NA
##
## (Dispersion parameter for binomial family taken to be 1)
##
## Null deviance: 7.9483e+02 on 575 degrees of freedom
## Residual deviance: 3.4721e-09 on 124 degrees of freedom
## AIC: 904
##
## Number of Fisher Scoring iterations: 25
probabilities <- predict(log_model,
newdata = testing,
type = "response")
## Warning in predict.lm(object, newdata, se.fit, scale = 1, type = if (type == :
## prediction from a rank-deficient fit may be misleading
predictions <- ifelse(probabilities > 0.5, "1", "0")
trn_tab <- table(predictions, testing$Y)
library("caret")
confusionMatrix(trn_tab, positive = "1")
## Confusion Matrix and Statistics
##
##
## predictions 0 1
## 0 97 3
## 1 17 130
##
## Accuracy : 0.919
## 95% CI : (0.8777, 0.9498)
## No Information Rate : 0.5385
## P-Value [Acc > NIR] : < 2e-16
##
## Kappa : 0.8357
##
## Mcnemar's Test P-Value : 0.00365
##
## Sensitivity : 0.9774
## Specificity : 0.8509
## Pos Pred Value : 0.8844
## Neg Pred Value : 0.9700
## Prevalence : 0.5385
## Detection Rate : 0.5263
## Detection Prevalence : 0.5951
## Balanced Accuracy : 0.9142
##
## 'Positive' Class : 1
##
str(iris)
## 'data.frame': 150 obs. of 5 variables:
## $ Sepal.Length: num 5.1 4.9 4.7 4.6 5 5.4 4.6 5 4.4 4.9 ...
## $ Sepal.Width : num 3.5 3 3.2 3.1 3.6 3.9 3.4 3.4 2.9 3.1 ...
## $ Petal.Length: num 1.4 1.4 1.3 1.5 1.4 1.7 1.4 1.5 1.4 1.5 ...
## $ Petal.Width : num 0.2 0.2 0.2 0.2 0.2 0.4 0.3 0.2 0.2 0.1 ...
## $ Species : Factor w/ 3 levels "setosa","versicolor",..: 1 1 1 1 1 1 1 1 1 1 ...
library(stats4) #Load package stats
library(splines) #Load package splines
#To load package VGAM, need to load package stats4 and splines.
library(VGAM)
##
## Attaching package: 'VGAM'
## The following object is masked from 'package:caret':
##
## predictors
## The following object is masked from 'package:kernlab':
##
## nvar
fit.MLR <- vglm( Species ~ Sepal.Length + Sepal.Width + Petal.Length + Petal.Width, family=multinomial, iris)
## Warning in checkwz(wz, M = M, trace = trace, wzepsilon = control$wzepsilon):
## 2 diagonal elements of the working weights variable 'wz' have been replaced by
## 1.819e-12
## Warning in checkwz(wz, M = M, trace = trace, wzepsilon = control$wzepsilon):
## 13 diagonal elements of the working weights variable 'wz' have been replaced by
## 1.819e-12
## Warning in checkwz(wz, M = M, trace = trace, wzepsilon = control$wzepsilon):
## 22 diagonal elements of the working weights variable 'wz' have been replaced by
## 1.819e-12
## Warning in checkwz(wz, M = M, trace = trace, wzepsilon = control$wzepsilon):
## 34 diagonal elements of the working weights variable 'wz' have been replaced by
## 1.819e-12
## Warning in checkwz(wz, M = M, trace = trace, wzepsilon = control$wzepsilon):
## 39 diagonal elements of the working weights variable 'wz' have been replaced by
## 1.819e-12
## Warning in checkwz(wz, M = M, trace = trace, wzepsilon = control$wzepsilon):
## 41 diagonal elements of the working weights variable 'wz' have been replaced by
## 1.819e-12
## Warning in checkwz(wz, M = M, trace = trace, wzepsilon = control$wzepsilon):
## 47 diagonal elements of the working weights variable 'wz' have been replaced by
## 1.819e-12
## Warning in checkwz(wz, M = M, trace = trace, wzepsilon = control$wzepsilon):
## 50 diagonal elements of the working weights variable 'wz' have been replaced by
## 1.819e-12
## Warning in checkwz(wz, M = M, trace = trace, wzepsilon = control$wzepsilon):
## 54 diagonal elements of the working weights variable 'wz' have been replaced by
## 1.819e-12
## Warning in checkwz(wz, M = M, trace = trace, wzepsilon = control$wzepsilon):
## 59 diagonal elements of the working weights variable 'wz' have been replaced by
## 1.819e-12
## Warning in checkwz(wz, M = M, trace = trace, wzepsilon = control$wzepsilon):
## 63 diagonal elements of the working weights variable 'wz' have been replaced by
## 1.819e-12
## Warning in checkwz(wz, M = M, trace = trace, wzepsilon = control$wzepsilon):
## 78 diagonal elements of the working weights variable 'wz' have been replaced by
## 1.819e-12
## Warning in checkwz(wz, M = M, trace = trace, wzepsilon = control$wzepsilon):
## 91 diagonal elements of the working weights variable 'wz' have been replaced by
## 1.819e-12
## Warning in slot(family, "linkinv")(eta, extra = extra): fitted probabilities
## numerically 0 or 1 occurred
## Warning in tfun(mu = mu, y = y, w = w, res = FALSE, eta = eta, extra = extra):
## fitted values close to 0 or 1
## Warning in checkwz(wz, M = M, trace = trace, wzepsilon = control$wzepsilon):
## 96 diagonal elements of the working weights variable 'wz' have been replaced by
## 1.819e-12
## Warning in slot(family, "linkinv")(eta, extra = extra): fitted probabilities
## numerically 0 or 1 occurred
## Warning in tfun(mu = mu, y = y, w = w, res = FALSE, eta = eta, extra = extra):
## fitted values close to 0 or 1
## Warning in slot(family, "linkinv")(eta, extra = extra): fitted probabilities
## numerically 0 or 1 occurred
## Warning in tfun(mu = mu, y = y, w = w, res = FALSE, eta = eta, extra = extra):
## fitted values close to 0 or 1
## Warning in checkwz(wz, M = M, trace = trace, wzepsilon = control$wzepsilon):
## 97 diagonal elements of the working weights variable 'wz' have been replaced by
## 1.819e-12
## Warning in slot(family, "linkinv")(eta, extra = extra): fitted probabilities
## numerically 0 or 1 occurred
## Warning in tfun(mu = mu, y = y, w = w, res = FALSE, eta = eta, extra = extra):
## fitted values close to 0 or 1
## Warning in slot(family, "linkinv")(eta, extra = extra): fitted probabilities
## numerically 0 or 1 occurred
## Warning in tfun(mu = mu, y = y, w = w, res = FALSE, eta = eta, extra = extra):
## fitted values close to 0 or 1
## Warning in slot(family, "linkinv")(eta, extra = extra): fitted probabilities
## numerically 0 or 1 occurred
## Warning in tfun(mu = mu, y = y, w = w, res = FALSE, eta = eta, extra = extra):
## fitted values close to 0 or 1
## Warning in slot(family, "linkinv")(eta, extra = extra): fitted probabilities
## numerically 0 or 1 occurred
## Warning in tfun(mu = mu, y = y, w = w, res = FALSE, eta = eta, extra = extra):
## fitted values close to 0 or 1
## Warning in slot(family, "linkinv")(eta, extra = extra): fitted probabilities
## numerically 0 or 1 occurred
## Warning in tfun(mu = mu, y = y, w = w, res = FALSE, eta = eta, extra = extra):
## fitted values close to 0 or 1
## Warning in slot(family, "linkinv")(eta, extra = extra): fitted probabilities
## numerically 0 or 1 occurred
## Warning in tfun(mu = mu, y = y, w = w, res = FALSE, eta = eta, extra = extra):
## fitted values close to 0 or 1
## Warning in slot(family, "linkinv")(eta, extra = extra): fitted probabilities
## numerically 0 or 1 occurred
## Warning in tfun(mu = mu, y = y, w = w, res = FALSE, eta = eta, extra = extra):
## fitted values close to 0 or 1
## Warning in slot(family, "linkinv")(eta, extra = extra): fitted probabilities
## numerically 0 or 1 occurred
## Warning in tfun(mu = mu, y = y, w = w, res = FALSE, eta = eta, extra = extra):
## fitted values close to 0 or 1
## Warning in vglm.fitter(x = x, y = y, w = w, offset = offset, Xm2 = Xm2, : some
## quantities such as z, residuals, SEs may be inaccurate due to convergence at a
## half-step
summary(fit.MLR)
## Warning in temp1@family@linkinv(eta = temp1@predictors, extra = temp1@extra):
## fitted probabilities numerically 0 or 1 occurred
## Warning in temp1@family@linkinv(eta = temp1@predictors, extra = temp1@extra):
## fitted probabilities numerically 0 or 1 occurred
## Warning in temp1@family@linkinv(eta = temp1@predictors, extra = temp1@extra):
## fitted probabilities numerically 0 or 1 occurred
## Warning in temp1@family@linkinv(eta = temp1@predictors, extra = temp1@extra):
## fitted probabilities numerically 0 or 1 occurred
## Warning in temp1@family@linkinv(eta = temp1@predictors, extra = temp1@extra):
## fitted probabilities numerically 0 or 1 occurred
## Warning in temp1@family@linkinv(eta = temp1@predictors, extra = temp1@extra):
## fitted probabilities numerically 0 or 1 occurred
## Warning in temp1@family@linkinv(eta = temp1@predictors, extra = temp1@extra):
## fitted probabilities numerically 0 or 1 occurred
## Warning in temp1@family@linkinv(eta = temp1@predictors, extra = temp1@extra):
## fitted probabilities numerically 0 or 1 occurred
## Warning in temp1@family@linkinv(eta = temp1@predictors, extra = temp1@extra):
## fitted probabilities numerically 0 or 1 occurred
## Warning in temp1@family@linkinv(eta = temp1@predictors, extra = temp1@extra):
## fitted probabilities numerically 0 or 1 occurred
##
## Call:
## vglm(formula = Species ~ Sepal.Length + Sepal.Width + Petal.Length +
## Petal.Width, family = multinomial, data = iris)
##
## Coefficients:
## Estimate Std. Error z value Pr(>|z|)
## (Intercept):1 35.490 22666.953 NA NA
## (Intercept):2 42.638 25.708 1.659 0.0972 .
## Sepal.Length:1 9.495 6729.217 NA NA
## Sepal.Length:2 2.465 2.394 1.030 0.3032
## Sepal.Width:1 12.300 3143.611 NA NA
## Sepal.Width:2 6.681 4.480 1.491 0.1359
## Petal.Length:1 -22.975 4799.227 -0.005 0.9962
## Petal.Length:2 -9.429 4.737 NA NA
## Petal.Width:1 -33.843 7583.502 NA NA
## Petal.Width:2 -18.286 9.743 NA NA
## ---
## Signif. codes: 0 '***' 0.001 '**' 0.01 '*' 0.05 '.' 0.1 ' ' 1
##
## Names of linear predictors: log(mu[,1]/mu[,3]), log(mu[,2]/mu[,3])
##
## Residual deviance: 11.8985 on 290 degrees of freedom
##
## Log-likelihood: -5.9493 on 290 degrees of freedom
##
## Number of Fisher scoring iterations: 21
##
## Warning: Hauck-Donner effect detected in the following estimate(s):
## '(Intercept):1', 'Sepal.Length:1', 'Sepal.Width:1', 'Petal.Length:2', 'Petal.Width:1', 'Petal.Width:2'
##
##
## Reference group is level 3 of the response
probabilities.MLR <- predict(fit.MLR, iris[,1:4], type="response")
predictions <- apply(probabilities.MLR, 1, which.max)
predictions[which(predictions=="1")] <- levels(iris$Species)[1]
predictions[which(predictions=="2")] <- levels(iris$Species)[2]
predictions[which(predictions=="3")] <- levels(iris$Species)[3]
# Summarize accuracy
trn_tab <- table(iris$Species, predictions)
library("caret")
confusionMatrix(trn_tab, positive = "1")
## Confusion Matrix and Statistics
##
## predictions
## setosa versicolor virginica
## setosa 50 0 0
## versicolor 0 49 1
## virginica 0 1 49
##
## Overall Statistics
##
## Accuracy : 0.9867
## 95% CI : (0.9527, 0.9984)
## No Information Rate : 0.3333
## P-Value [Acc > NIR] : < 2.2e-16
##
## Kappa : 0.98
##
## Mcnemar's Test P-Value : NA
##
## Statistics by Class:
##
## Class: setosa Class: versicolor Class: virginica
## Sensitivity 1.0000 0.9800 0.9800
## Specificity 1.0000 0.9900 0.9900
## Pos Pred Value 1.0000 0.9800 0.9800
## Neg Pred Value 1.0000 0.9900 0.9900
## Prevalence 0.3333 0.3333 0.3333
## Detection Rate 0.3333 0.3267 0.3267
## Detection Prevalence 0.3333 0.3333 0.3333
## Balanced Accuracy 1.0000 0.9850 0.9850
exp(cbind(OR = coef(fit.MLR), confint(fit.MLR)))
## OR 2.5 % 97.5 %
## (Intercept):1 2.589511e+15 0.000000e+00 Inf
## (Intercept):2 3.291265e+18 4.314821e-04 2.510516e+40
## Sepal.Length:1 1.329174e+04 0.000000e+00 Inf
## Sepal.Length:2 1.176607e+01 1.077951e-01 1.284293e+03
## Sepal.Width:1 2.197940e+05 0.000000e+00 Inf
## Sepal.Width:2 7.970258e+02 1.225914e-01 5.181847e+06
## Petal.Length:1 1.051688e-10 0.000000e+00 Inf
## Petal.Length:2 8.032858e-05 7.456785e-09 8.653434e-01
## Petal.Width:1 2.005973e-15 0.000000e+00 Inf
## Petal.Width:2 1.144015e-08 5.827809e-17 2.245732e+00