Learning how to create a data codebook

Load Packages

if (!require(haven)){
  install.packages("haven", dependencies = TRUE)
  require(haven)
}
Loading required package: haven
if (!require(tidyverse)){
  install.packages("tidyverse", dependencies = TRUE)
  require(tidyverse)
}
Loading required package: tidyverse
── Attaching core tidyverse packages ──────────────────────── tidyverse 2.0.0 ──
✔ dplyr     1.1.2     ✔ readr     2.1.4
✔ forcats   1.0.0     ✔ stringr   1.5.0
✔ ggplot2   3.4.2     ✔ tibble    3.2.1
✔ lubridate 1.9.2     ✔ tidyr     1.3.0
✔ purrr     1.0.1     
── Conflicts ────────────────────────────────────────── tidyverse_conflicts() ──
✖ dplyr::filter() masks stats::filter()
✖ dplyr::lag()    masks stats::lag()
ℹ Use the conflicted package (<http://conflicted.r-lib.org/>) to force all conflicts to become errors
if (!require(summarytools)){
  install.packages("summarytools", dependencies = TRUE)
  require(summarytools)
}
Loading required package: summarytools
Warning in fun(libname, pkgname): couldn't connect to display ":0"
system might not have X11 capabilities; in case of errors when using dfSummary(), set st_options(use.x11 = FALSE)

Attaching package: 'summarytools'
The following object is masked from 'package:tibble':

    view
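
The startup warning above names its own workaround for machines without an X11 display. A minimal sketch of applying it, assuming summarytools is already installed; it would be run once after loading the package and before calling dfSummary():

```r
library(summarytools)

# Option named in the startup warning; intended for systems
# that have no X11 display available
st_options(use.x11 = FALSE)
```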

Import Data

dataset <- read_sav("Harry Potter Data (1).sav")
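
read_sav() keeps SPSS metadata by returning columns as haven_labelled vectors, which is why the summary below reports that type. The sketch below shows one way to inspect those labels and turn a column into a factor. It builds a small stand-in vector rather than reading the real file, and the value labels "Heads" and "Tails" are assumptions made for illustration; the actual labels stored in the data set may differ.

```r
library(haven)

# Hypothetical stand-in for a labelled column; the value labels
# "Heads" and "Tails" are assumed, not taken from the real file
coin <- labelled(
  c(1, 2, 1, NA),
  labels = c(Heads = 1, Tails = 2),
  label = "Flip a coin. Is it heads or tails?"
)

# The variable label (question text) is stored as an attribute
coin_label <- attr(coin, "label")

# The value labels map each numeric code to its meaning
coin_values <- attr(coin, "labels")

# as_factor() replaces the numeric codes with their value labels
coin_factor <- as_factor(coin)
print(levels(coin_factor))
```

Converting selected columns with as_factor() before summarizing is one option if the codebook should show the response labels instead of the numeric codes.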

Codebook

# First, select the variables you'd like to summarize
exampleDF <- dataset %>%
  select(CoinFlip, FFM_4, Potter4)

# Then render the summary table as HTML with this command
print(dfSummary(exampleDF, graph.magnif = .75), method = 'render')
Warning in png(png_loc <- tempfile(fileext = ".png"), width = 150 *
graph.magnif, : unable to open connection to X11 display ''

Data Frame Summary

exampleDF

Dimensions: 122 x 3
Duplicates: 97
1. CoinFlip [haven_labelled, vctrs_vctr, double]
   Label: Flip a coin. Is it heads or tails?
   Stats / Values: Min: 1, Mean: 1.3, Max: 2
   Freqs (% of Valid): 1: 83 (70.3%), 2: 35 (29.7%)
   Valid: 118 (96.7%)
   Missing: 4 (3.3%)

2. FFM_4 [haven_labelled, vctrs_vctr, double]
   Label: I see Myself as Someone Who..... - Is depressed, blue
   Stats / Values: Mean (sd): 2.5 (1.3), min ≤ med ≤ max: 1 ≤ 2 ≤ 5, IQR (CV): 3 (0.5)
   Freqs (% of Valid): 1: 28 (28.6%), 2: 25 (25.5%), 3: 18 (18.4%), 4: 21 (21.4%), 5: 6 (6.1%)
   Valid: 98 (80.3%)
   Missing: 24 (19.7%)

3. Potter4 [haven_labelled, vctrs_vctr, double]
   Label: Black or White?
   Stats / Values: Min: 1, Mean: 1.3, Max: 2
   Freqs (% of Valid): 1: 71 (71.0%), 2: 29 (29.0%)
   Valid: 100 (82.0%)
   Missing: 22 (18.0%)

Generated by summarytools 1.0.1 (R version 4.3.1)
2023-07-07
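
To make the summary easier to read, the percentages can be reproduced by hand. The sketch below uses only the counts printed in the summary (122 rows, the valid counts per variable, and the FFM_4 frequencies). The Valid and Missing percentages use the total number of rows as the denominator, while the frequency percentages use the number of valid responses.

```r
# Counts copied from the summary above
n_total <- 122
n_valid <- c(CoinFlip = 118, FFM_4 = 98, Potter4 = 100)

# Valid and Missing are percentages of all rows
valid_pct <- round(100 * n_valid / n_total, 1)
missing_pct <- round(100 * (n_total - n_valid) / n_total, 1)

# Frequencies of the FFM_4 response codes 1 to 5
ffm4_freq <- c(`1` = 28, `2` = 25, `3` = 18, `4` = 21, `5` = 6)

# Freqs (% of Valid) are percentages of the valid responses only
ffm4_pct <- round(100 * ffm4_freq / sum(ffm4_freq), 1)

# The mean is the frequency-weighted average of the response codes
ffm4_mean <- sum(as.numeric(names(ffm4_freq)) * ffm4_freq) / sum(ffm4_freq)

print(valid_pct)
print(missing_pct)
print(ffm4_pct)
print(round(ffm4_mean, 1))
```

The rounded results match the Valid, Missing, Freqs, and Mean entries reported for these variables.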