Cargar datos y libreria (Titulo)

para iniciar vamos a utilizar las librerias ade4, Factoclass y Factominer. La base de datos es dechatlon con informacion de deportitas y su rendimiento en diversas competencias olimpicas.

require(ade4)
require(FactoClass)
require(FactoMineR)

data("decathlon")
decathlon2=decathlon[,1:10]
head(decathlon2)#,4 seria el numero de filas que quiero que me muestre.
##          100m Long.jump Shot.put High.jump  400m 110m.hurdle Discus Pole.vault
## SEBRLE  11.04      7.58    14.83      2.07 49.81       14.69  43.75       5.02
## CLAY    10.76      7.40    14.26      1.86 49.37       14.05  50.72       4.92
## KARPOV  11.02      7.30    14.77      2.04 48.37       14.09  48.95       4.92
## BERNARD 11.02      7.23    14.25      1.92 48.93       14.99  40.87       5.32
## YURKOV  11.34      7.09    15.19      2.10 50.42       15.31  46.26       4.72
## WARNERS 11.11      7.60    14.31      1.98 48.68       14.23  41.10       4.92
##         Javeline 1500m
## SEBRLE     63.19 291.7
## CLAY       60.15 301.5
## KARPOV     50.31 300.2
## BERNARD    62.77 280.1
## YURKOV     63.44 276.4
## WARNERS    51.77 278.1

Exploracion de Datos Univariado

Se realiza un resumen de indicadores descriptivos por cada variable y graficos de cajas.

## Resumen de Indicadores Descriptivos 
summary(decathlon2)# en la rueda, en el output esta la opcion de quitar el codigo 
##       100m         Long.jump       Shot.put       High.jump          400m      
##  Min.   :10.44   Min.   :6.61   Min.   :12.68   Min.   :1.850   Min.   :46.81  
##  1st Qu.:10.85   1st Qu.:7.03   1st Qu.:13.88   1st Qu.:1.920   1st Qu.:48.93  
##  Median :10.98   Median :7.30   Median :14.57   Median :1.950   Median :49.40  
##  Mean   :11.00   Mean   :7.26   Mean   :14.48   Mean   :1.977   Mean   :49.62  
##  3rd Qu.:11.14   3rd Qu.:7.48   3rd Qu.:14.97   3rd Qu.:2.040   3rd Qu.:50.30  
##  Max.   :11.64   Max.   :7.96   Max.   :16.36   Max.   :2.150   Max.   :53.20  
##   110m.hurdle        Discus        Pole.vault       Javeline    
##  Min.   :13.97   Min.   :37.92   Min.   :4.200   Min.   :50.31  
##  1st Qu.:14.21   1st Qu.:41.90   1st Qu.:4.500   1st Qu.:55.27  
##  Median :14.48   Median :44.41   Median :4.800   Median :58.36  
##  Mean   :14.61   Mean   :44.33   Mean   :4.762   Mean   :58.32  
##  3rd Qu.:14.98   3rd Qu.:46.07   3rd Qu.:4.920   3rd Qu.:60.89  
##  Max.   :15.67   Max.   :51.65   Max.   :5.400   Max.   :70.52  
##      1500m      
##  Min.   :262.1  
##  1st Qu.:271.0  
##  Median :278.1  
##  Mean   :279.0  
##  3rd Qu.:285.1  
##  Max.   :317.0
## Grafico de Cajas
require(ggplot2)
require(plotly)
## Loading required package: plotly
## 
## Attaching package: 'plotly'
## The following object is masked from 'package:ggplot2':
## 
##     last_plot
## The following object is masked from 'package:stats':
## 
##     filter
## The following object is masked from 'package:graphics':
## 
##     layout
g1=ggplot(data=decathlon,aes(y=Long.jump,x=Competition,fill=Competition))+geom_boxplot()+theme_bw()
ggplotly(g1)# Grafico dinamico
## Grafico de Dispercion
g2=ggplot(data=decathlon,aes(x=`400m`,y=Long.jump,fill=Competition))+geom_point()+geom_smooth(method = "lm")+theme_bw()
ggplotly(g2)
## `geom_smooth()` using formula 'y ~ x'