Manejo de matrices y representación gráfica

6.1 Cálculos de los valores a representar

Cargamos 'Rmisc' y transformamos en factor el tiempo y el tratamiento:

library(Rmisc)

datos2$Tratamiento<-factor(datos2$Tratamiento)#tratamiento como factor
datos2$Time<-factor(datos2$Time)#tiempo como factor
str(datos2)

## 'data.frame':    72 obs. of  5 variables:
##  $ Individuo        : int  1 2 3 4 5 6 7 8 9 10 ...
##  $ Tratamiento      : Factor w/ 3 levels "1","2","3": 1 1 1 1 2 2 2 2 3 3 ...
##  $ Time             : Factor w/ 6 levels "t1","t2","t3",..: 1 1 1 1 1 1 1 1 1 1 ...
##  $ Grado_de_atencion: num  9 8 9 7 8 6.5 9.2 10 8.5 7.5 ...
##  $ Grado_porcentaje : num  90 80 90 70 80 65 92 100 85 75 ...

Lo siguiente es calcular los valores medios y medidas de desviación para nuestros datos. Con 'measurevar' indicamos la variable que vamos a utilizar para los cálculos y con 'groupvars' los criterios de agrupación (tiempo y tratamiento):

nuevos <- summarySE(datos2, measurevar="Grado_porcentaje", groupvars=c("Time","Tratamiento")) 
nuevos

##    Time Tratamiento N Grado_porcentaje        sd        se        ci
## 1    t1           1 4           82.500  9.574271  4.787136 15.234802
## 2    t1           2 4           84.250 15.239751  7.619875 24.249844
## 3    t1           3 4           89.000 11.165423  5.582711 17.766679
## 4    t2           1 4           81.125  9.490126  4.745063 15.100909
## 5    t2           2 4           86.625 12.133803  6.066901 19.307588
## 6    t2           3 4           72.500 21.794495 10.897247 34.679905
## 7    t3           1 4           49.125 11.946931  5.973466 19.010234
## 8    t3           2 4           62.750 12.284814  6.142407 19.547881
## 9    t3           3 4           75.500 14.059398  7.029699 22.371639
## 10   t4           1 4           49.250  2.986079  1.493039  4.751518
## 11   t4           2 4           39.250  2.986079  1.493039  4.751518
## 12   t4           3 4           36.500  5.066228  2.533114  8.061499
## 13   t5           1 4           83.000 12.027746  6.013873 19.138827
## 14   t5           2 4           47.500 18.929694  9.464847 30.121368
## 15   t5           3 4            7.375  3.300884  1.650442  5.252443
## 16   t6           1 4           85.000  9.128709  4.564355 14.525814
## 17   t6           2 4           45.000 13.540064  6.770032 21.545263
## 18   t6           3 4           12.500  2.886751  1.443376  4.593466

str(nuevos)

## 'data.frame':    18 obs. of  7 variables:
##  $ Time            : Factor w/ 6 levels "t1","t2","t3",..: 1 1 1 2 2 2 3 3 3 4 ...
##  $ Tratamiento     : Factor w/ 3 levels "1","2","3": 1 2 3 1 2 3 1 2 3 1 ...
##  $ N               : num  4 4 4 4 4 4 4 4 4 4 ...
##  $ Grado_porcentaje: num  82.5 84.2 89 81.1 86.6 ...
##  $ sd              : num  9.57 15.24 11.17 9.49 12.13 ...
##  $ se              : num  4.79 7.62 5.58 4.75 6.07 ...
##  $ ci              : num  15.2 24.2 17.8 15.1 19.3 ...

plot_missing(nuevos)#sin valores perdidos

6.2 Figura de puntos

La forma más sencilla de representar los datos. El tiempo en el eje x y la capacidad de atención en el eje y:

library(ggplot2)

gr1<-ggplot(nuevos, aes(Time, Grado_porcentaje,group=Tratamiento,colour=Tratamiento)) +
  geom_point() +
  xlab("Tiempo") +#rótulos eje x e y
  ylab("Porcentaje de atención") 
gr1

Le añadimos líneas:

gr2<-gr1+geom_line()
gr2

6.3 Figura de puntos con media y errores

Utilizamos 'geom_errorbar' para incluir los errores que hemos calculado en los pasos anteriores:

gr3<-ggplot(nuevos, aes(x=Time, y=Grado_porcentaje,group=Tratamiento, colour=Tratamiento)) + 
  geom_errorbar(aes(ymin=Grado_porcentaje-se, ymax=Grado_porcentaje+se), width=.1) +
  geom_point()+
  xlab("Tiempo")+
  ylab("Grado de atención")
gr3

Añadimos una linea vertical para indicar el descanso que hemos hecho en mitad de la clase. Con 'scale_x_discrete' y 'drop=FALSE' consigo que el eje X que es categórico pase a continuo y la línea se pueda poner entre dos tiempos diferentes:

gr4<-gr3 +geom_vline(xintercept = 3.5)+
  scale_x_discrete(drop = FALSE)
gr4

6.4 Figuras con tratamientos separados

Con 'facet_wrap' en función del '~Tratamiento' separamos en tres figuras y con 'labeller' le damos nombre a nuestros tratamientos:

gr5<-gr3+facet_wrap(~Tratamiento,labeller=as_labeller(c("1" = "Clase Magistral bien hecha", "2" = "Innovación docente regulera",
                                                        "3" = "Innovación docente mal hecha")))
gr5+geom_line()

Quitamos la leyenda:

gr6<-gr5+geom_line()+theme(legend.position = "none")
gr6

Mejoramos la figura con puntos más grandes:

gr7<-gr6+geom_point(size=3)+geom_line(size=1)
gr7

Ponemos los descansos a las tres figuras:

gr8<-gr7+geom_vline(xintercept = 3.5,linetype = "dashed")+
  scale_x_discrete(drop = FALSE)
gr8

Mejoramos los nombres de los tratamientos y el tamaño de todas las fuentes de los ejes con 'strip.text':

gr9<-gr8+theme(strip.text = element_text(face = "bold", color = "black",
                                          hjust = 0.5, size = 8),
                strip.background = element_rect(fill = "grey"),text = element_text(size = 15))
gr9

Se puede cambiar el fondo de la figura:

gr9+theme_classic()+theme(legend.position = "none")

gr9+theme_light()+theme(legend.position = "none")

gr9+theme_dark()+theme(legend.position = "none")#etc.

Manejo de matrices y representación gráfica

junio 14, 2022

1 Resumen

2 Nuestros datos

3 Exploratorio básico

4 Varias variables en columnas a una única columna

5 Incluir una nueva variable

6 Representación gráfica

6.1 Cálculos de los valores a representar

6.2 Figura de puntos

6.3 Figura de puntos con media y errores

6.4 Figuras con tratamientos separados

7 De una variable pasamos a varias en columnas

8 Crear nuevas columnas

9 Conclusiones

10 CRÉDITOS