
TeorĂa
La librerĂa Data Explorer es la mĂĄs conocida para en
anĂĄlisis exploratorio. Es muy simple de usar y muy poderosa, pues ofrece
como salida un informe con mucha informaciĂłn.
La funciĂłn para crear el informe es create_report, y para
ver cada grĂĄfica de forma individual, las funciones son:
- introduce()
- plot_intro()
- plot_boxpot()
- plot_missing()
- plot_histogram()
- plot_bar()
- plot_correlation()
Llamas paquetes y librerias
#install.packages("DataExplorer")
library(DataExplorer)
#install.packages("nycFlights13")
library(nycflights13)
Contexto
El paquete nycFlights13 contiene informaciĂłn de todos los
vuelos que partieron desde Nueva York (EWR, JKF y LGA) a destinos en los
Estados Unidos en el 2013. Fueron 336,776 vuelos en total.
La tabla de este paquete y sus relaciĂłnes son las siguientes:

Crear base de datos
flights <- flights
weather <- weather
planes <- planes
airports <- airports
airlines <- airlines
df <- merge(flights, airlines, by = "carrier")
df <- merge(df, planes, by ="tailnum")
#create_report(df)
introduce(df)
## rows columns discrete_columns continuous_columns all_missing_columns
## 1 284170 28 10 18 0
## total_missing_values complete_rows total_observations memory_usage
## 1 311768 920 7956760 50225296
plot_intro(df)

plot_boxplot(df, by = "carrier")
## Warning: Removed 23255 rows containing non-finite values (`stat_boxplot()`).

## Warning: Removed 288513 rows containing non-finite values (`stat_boxplot()`).

plot_missing(df)

plot_histogram(df)


plot_bar(df)
## 4 columns ignored with more than 50 categories.
## tailnum: 3322 categories
## dest: 104 categories
## time_hour: 6934 categories
## model: 127 categories

plot_correlation(df)
## 5 features with more than 20 categories ignored!
## tailnum: 3322 categories
## dest: 104 categories
## time_hour: 6934 categories
## manufacturer: 35 categories
## model: 127 categories
## Warning in cor(x = structure(list(year.x = c(2013L, 2013L, 2013L, 2013L, : the
## standard deviation is zero

LS0tCnRpdGxlOiAiRGF0YSBFeHBsb3JlciIKYXV0aG9yOiAiVmFsZXJpYSBDYW50w7ogLSBBMDE1NzA3NTgiCmRhdGU6ICIyMDI0LTAyLTI3IgpvdXRwdXQ6IAogIGh0bWxfZG9jdW1lbnQ6IAogICAgdG9jOiBUUlVFCiAgICB0b2NfZmxvYXQ6IFRSVUUKICAgIGNvZGVfZG93bmxvYWQ6IFRSVUUgCiAgICB0aGVtZTogZGFyawotLS0KCiFbXSgvVXNlcnMvdmFsZXJpYWNhbnR1bG9iby9Eb3dubG9hZHMvQVZJT04uZ2lmKQoKIyA8c3BhbiBzdHlsZT0iY29sb3I6IHllbGxvdzsiPlRlb3LDrWE8L3NwYW4+CkxhIGxpYnJlcsOtYSAqRGF0YSBFeHBsb3JlciogZXMgbGEgbcOhcyBjb25vY2lkYSBwYXJhIGVuIGFuw6FsaXNpcyBleHBsb3JhdG9yaW8uIEVzIG11eSBzaW1wbGUgZGUgdXNhciB5IG11eSBwb2Rlcm9zYSwgcHVlcyBvZnJlY2UgY29tbyBzYWxpZGEgdW4gaW5mb3JtZSBjb24gbXVjaGEgaW5mb3JtYWNpw7NuLiAgCgpMYSBmdW5jacOzbiBwYXJhIGNyZWFyIGVsIGluZm9ybWUgZXMgKmNyZWF0ZV9yZXBvcnQqLCB5IHBhcmEgdmVyIGNhZGEgZ3LDoWZpY2EgZGUgZm9ybWEgaW5kaXZpZHVhbCwgbGFzIGZ1bmNpb25lcyBzb246ICAKCiogKmludHJvZHVjZSgpKgoqICpwbG90X2ludHJvKCkqCiogKnBsb3RfYm94cG90KCkqCiogKnBsb3RfbWlzc2luZygpKgoqICpwbG90X2hpc3RvZ3JhbSgpKgoqICpwbG90X2JhcigpKgoqICpwbG90X2NvcnJlbGF0aW9uKCkqCgojIDxzcGFuIHN0eWxlPSJjb2xvcjogeWVsbG93OyI+TGxhbWFzIHBhcXVldGVzIHkgbGlicmVyaWFzPC9zcGFuPgpgYGB7cn0KI2luc3RhbGwucGFja2FnZXMoIkRhdGFFeHBsb3JlciIpCmxpYnJhcnkoRGF0YUV4cGxvcmVyKQoKI2luc3RhbGwucGFja2FnZXMoIm55Y0ZsaWdodHMxMyIpCmxpYnJhcnkobnljZmxpZ2h0czEzKQpgYGAKCiMgPHNwYW4gc3R5bGU9ImNvbG9yOiB5ZWxsb3c7Ij5Db250ZXh0bzwvc3Bhbj4KRWwgcGFxdWV0ZSAqbnljRmxpZ2h0czEzKiBjb250aWVuZSBpbmZvcm1hY2nDs24gZGUgdG9kb3MgbG9zIHZ1ZWxvcyBxdWUgcGFydGllcm9uIGRlc2RlIE51ZXZhIFlvcmsgKEVXUiwgSktGIHkgTEdBKSBhIGRlc3Rpbm9zIGVuIGxvcyBFc3RhZG9zIFVuaWRvcyBlbiBlbCAyMDEzLiBGdWVyb24gMzM2LDc3NiB2dWVsb3MgZW4gdG90YWwuICAKCkxhIHRhYmxhIGRlIGVzdGUgcGFxdWV0ZSB5IHN1cyByZWxhY2nDs25lcyBzb24gbGFzIHNpZ3VpZW50ZXM6IAoKIVtdKC9Vc2Vycy92YWxlcmlhY2FudHVsb2JvL0Rvd25sb2Fkcy9ueWMuUE5HKQoKIyA8c3BhbiBzdHlsZT0iY29sb3I6IHllbGxvdzsiPkNyZWFyIGJhc2UgZGUgZGF0b3M8L3NwYW4+CmBgYHtyfQpmbGlnaHRzIDwtIGZsaWdodHMKd2VhdGhlciA8LSB3ZWF0aGVyCnBsYW5lcyA8LSBwbGFuZXMKYWlycG9ydHMgPC0gYWlycG9ydHMKYWlybGluZXMgPC0gYWlybGluZXMKZGYgPC0gbWVyZ2UoZmxpZ2h0cywgYWlybGluZXMsIGJ5ID0gImNhcnJpZXIiKQpkZiA8LSBtZXJnZShkZiwgcGxhbmVzLCBieSA9InRhaWxudW0iKQoKYGBgCgpgYGB7cn0KI2NyZWF0ZV9yZXBvcnQoZGYpCmludHJvZHVjZShkZikKcGxvdF9pbnRybyhkZikKcGxvdF9ib3hwbG90KGRmLCBieSA9ICJjYXJyaWVyIikKcGxvdF9taXNzaW5nKGRmKQpwbG90X2hpc3RvZ3JhbShkZikKcGxvdF9iYXIoZGYpCnBsb3RfY29ycmVsYXRpb24oZGYpCmBgYAoK