Exploratory analysis of the database
library(readxl)
library(summarytools)
BASE <- read_excel("Anoscopia 2023-2025.xlsx")
print(dfSummary(BASE), method = 'render')
Table 1
The variables age and biopsy collection are included. Notes on the text
cleaning:
Grado\s+II matches "Grado II" even when one or more spaces separate the words.
LSIL[ -]+II matches "LSIL-II", "LSIL - II" or "LSIL II".
(?!I) keeps the "Grado I" pattern from also matching "Grado II".
perl = TRUE enables that exclusion (negative lookahead).
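The matching rules above can be tried on a few strings with base R. This is a minimal sketch, not the report's own cleaning code: the vector textos and the result names are hypothetical, and the quantifiers (\\s+ and [ -]+) are an assumption about how the original patterns were written.

```r
# Hypothetical free-text values, made up for illustration (not from the data)
textos <- c("Grado I", "Grado II", "Grado   II",
            "LSIL-II", "LSIL II", "LSIL - II")

# "Grado I" only when the "I" is not followed by another "I".
# The negative lookahead (?!I) requires perl = TRUE.
es_grado1 <- grepl("Grado\\s+I(?!I)", textos, perl = TRUE)

# "Grado II" with one or more whitespace characters in between
es_grado2 <- grepl("Grado\\s+II", textos, perl = TRUE)

# "LSIL" and "II" separated by any run of spaces and/or hyphens
es_lsil2 <- grepl("LSIL[ -]+II", textos, perl = TRUE)

print(data.frame(textos, es_grado1, es_grado2, es_lsil2))
```

Without the lookahead, the "Grado I" pattern would also flag every "Grado II" entry, because "Grado I" is a prefix of "Grado II".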
Cross-tabulation of variables
ENDOSCOPIC DIAGNOSIS, HIV - BIOPSY RESULT
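# Keep only the records that have a biopsy result
BASE_sinNA <- subset(BASE, !is.na(Rbiopsia))
# Endoscopic diagnosis by biopsy result, with row percentages (prop = "r")
print(ctable(
  x = BASE_sinNA$`DIAGNÓSTICO ENDOSCOPICO`,
  y = BASE_sinNA$Rbiopsia,
  prop = "r",
  headings = FALSE), method = "render")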
| `DIAGNÓSTICO ENDOSCOPICO` | Rbiopsia: Alto Grado | Rbiopsia: Bajo Grado | Total |
|---|---|---|---|
| HSIL CUADRANTE ANTERIOR | 21 (58.3%) | 15 (41.7%) | 36 (100.0%) |
| HSIL LATERAL IZQUIERDA | 37 (51.4%) | 35 (48.6%) | 72 (100.0%) |
| HSIL POSTERIOR LSIL | 0 (0.0%) | 1 (100.0%) | 1 (100.0%) |
| LSIL | 595 (57.5%) | 439 (42.5%) | 1034 (100.0%) |
| <NA> | 0 (0.0%) | 1 (100.0%) | 1 (100.0%) |
| Total | 653 (57.1%) | 491 (42.9%) | 1144 (100.0%) |
Generated by summarytools 1.1.4 (R version 4.4.2)
2025-10-13
# HIV status by biopsy result, with chi-square test and odds ratio
print(ctable(
  x = BASE_sinNA$VIH,
  y = BASE_sinNA$Rbiopsia,
  prop = "r",
  chisq = TRUE,
  OR = TRUE,
  headings = FALSE), method = "render")
| VIH | Rbiopsia: Alto Grado | Rbiopsia: Bajo Grado | Total |
|---|---|---|---|
| SI | 501 (57.7%) | 368 (42.3%) | 869 (100.0%) |
| NO | 152 (55.3%) | 123 (44.7%) | 275 (100.0%) |
| Total | 653 (57.1%) | 491 (42.9%) | 1144 (100.0%) |

χ² = 0.3906, df = 1, p = .5320; O.R. (95% C.I.) = 1.10 (0.84 - 1.45)
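The statistics reported for the VIH by Rbiopsia table can be checked from the four cell counts alone, using base R only. This is a sketch under two stated assumptions, neither of which the report confirms: that the χ² is Pearson's test with Yates' continuity correction (the chisq.test default for 2x2 tables), and that the interval is a Wald interval on the log odds ratio. The names tabla, prueba, or, se_log_or and ic are illustrative.

```r
# Cell counts copied from the VIH x Rbiopsia table in the report
tabla <- matrix(c(501, 368,
                  152, 123),
                nrow = 2, byrow = TRUE,
                dimnames = list(VIH = c("SI", "NO"),
                                Rbiopsia = c("Alto Grado", "Bajo Grado")))

# Pearson chi-square; correct = TRUE applies Yates' continuity correction
prueba <- chisq.test(tabla, correct = TRUE)

# Odds ratio as the cross-product ratio of the 2x2 table
or <- (tabla[1, 1] * tabla[2, 2]) / (tabla[1, 2] * tabla[2, 1])

# Wald 95% confidence interval on the log odds ratio (assumed method)
se_log_or <- sqrt(sum(1 / tabla))
ic <- exp(log(or) + c(-1, 1) * qnorm(0.975) * se_log_or)

print(prueba)
print(round(c(OR = or, inferior = ic[1], superior = ic[2]), 2))
```

Under those assumptions the odds ratio and its interval round to 1.10 (0.84 - 1.45), in line with the reported values. An interval that spans 1 is consistent with the non-significant p-value.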