Learning how to import data

Load Packages

Import and Export SPSS, STATA, and SAS files

if(!require(haven)){
  install.packages("haven", dependencies = TRUE)
  library(haven)}
Loading required package: haven

A collection of packages that makes it easy to tidy, clean, and work with data.

if(!require(tidyverse)){
  install.packages("tidyverse", dependencies = TRUE)
  library(tidyverse)}
Loading required package: tidyverse
── Attaching core tidyverse packages ──────────────────────── tidyverse 2.0.0 ──
✔ dplyr     1.1.2     ✔ readr     2.1.4
✔ forcats   1.0.0     ✔ stringr   1.5.0
✔ ggplot2   3.4.2     ✔ tibble    3.2.1
✔ lubridate 1.9.2     ✔ tidyr     1.3.0
✔ purrr     1.0.1     
── Conflicts ────────────────────────────────────────── tidyverse_conflicts() ──
✖ dplyr::filter() masks stats::filter()
✖ dplyr::lag()    masks stats::lag()
ℹ Use the conflicted package (<http://conflicted.r-lib.org/>) to force all conflicts to become errors

A package that allows us to read, write, and edit xlsx files.

if(!require(openxlsx)){
  install.packages("openxlsx", dependencies = TRUE)
  library(openxlsx)}
Loading required package: openxlsx

Import Data

dataset.xls <- read.xlsx ("Harry Potter Data.xlsx")
dataset.csv <- read_csv ("Harry Potter Data.csv")
Rows: 124 Columns: 90
── Column specification ────────────────────────────────────────────────────────
Delimiter: ","
chr (90): StartDate, EndDate, Status, IPAddress, Progress, Duration (in seco...

ℹ Use `spec()` to retrieve the full column specification for this data.
ℹ Specify the column types or set `show_col_types = FALSE` to quiet this message.
dataset.spss <- read_sav ("Harry Potter Data.sav")
dataset.spss.web <- read_sav ("https://osf.io/kd4ej/download")

Bonus Points

dataset.csv.web <- read_csv ("https://osf.io/wtghz/download")
Rows: 124 Columns: 90
── Column specification ────────────────────────────────────────────────────────
Delimiter: ","
chr (90): StartDate, EndDate, Status, IPAddress, Progress, Duration (in seco...

ℹ Use `spec()` to retrieve the full column specification for this data.
ℹ Specify the column types or set `show_col_types = FALSE` to quiet this message.
dataset.xlsx.web <- read.xlsx ("https://osf.io/7fz89/download")