Introduction

This document outlines the analyses from Study 1 in insert article title here (insert authors, 2020). The purpose of this study was to investigate Ward et al.’s (2017) second experiment by completing a direct replication. Therefore, the present study investigated how the mere prescence of one’s smartphone effects cogntiion. This was pre-registered through OSF and can be found at the following link: https://osf.io/5fq4r.

The main goal of the current study was to see if Ward et al.’s findings replicated (experiment two). Therefore, as in Ward et al., participants were randomly assigned to one of six possible conditions. These conditions were based on two independent variables: smartphone location and smartphone power. For smartphone location, a participant’s smartphone was in one of three locations: (1) on the participant’s desk (“on desk”), (2) in their pocket/bag (“pocket/bag”), or (3) outside the testing room (“outside”). For smartphone power, a participant’s smartphone was either (1) powered ON or (2) powered OFF, in their respective location. For all conditions, participants were instructed to keep their smartphones on silent (i.e., no virbations if any notifications are received during the study). Also, as per Ward et al., participants in the “on desk” location conditions were instructed to keep their devices facing up.

Each participant completed two main tasks (counter-balanced):

1. An Automated Operation Span (OSpan) Task (Unsworth et al., 2005), which required participants to retain letter strings (i.e. ranging from three to seven letters in length) in memory while solving some simple math problems (e.g. “(7/7) + 6 = ?”). This task was a behavioural measure of the attentional control component of working memory.

  • Scoring:
    • Performance, measured with the OSpan absolute score, was shown by how many trials a participant correctly recalled all the letters in a given block (75 blocks in total). For example, a participant who recalled three letters (in a block with three letters), five letters (in a block with five letters), and two letters (in a block with six letters) would have an OSpan absolute score of eight for those blocks (i.e. 3 + 5 + 0 = 8). Therefore, OSpan absolute score showed performance where higher scores represented better performance.

2. The Cue-Dependent Go/No-Go Task (Bezdjia et al., 2009), which required participants to respond to “go” targets (i.e. green rectangle) while withholding any response to “no-go” targets (i.e. blue rectangle). This task included a cue component, where vertical targets (i.e., rectangles) were more likely to be “go” targets (i.e. 80% “go” and 20% “no-go”) and horizontal targets were more likely to be “no-go” targets (i.e. 80% “no-go” and 20% “go”). Participants were not explicitly made aware of the cue component. This task was a behavioural measure of sustained attention.

  • Scoring:
    • Performance was measured with mean omission errors and RT. It should be noted that mean errors can be divided into total error, commission error, and omission error. Commission errors occurred when a participant responded to a target stimulus. Omission errors occurred when a participant failed to respond to a non-target stimulus. Total errors were the sum of commission and omission errors. The present study focused on mean omission errors. Therefore, for each participant, higher mean omission errors represented lower performance. Additionally, higher mean RT also showed lower performance (i.e. indicative of greater interference).

Finally, after completing both tasks, all participants completed three survey measures:

1. The Demographic Questionnaire, which was created for the present study and consisted of four items: age (i.e. in years), gender (i.e. male, female, other, or prefer not to say), program (e.g. psychology, engineering), and year of study (e.g. first, fourth). The purpose of these items was to give a brief description of the sample.

2. The Smartphone Usage Questionnaire (SUQ), which was created for the present study and consisted of modified items from Ward et al.’s survey measures (i.e. found in the “BRAIN DRAIN” WEB APPENDIX”). Some items were forced-choice and some were on a 7-point likert scale ranging from 1 (“Never”) to 7 (“Always”). There were 10 items in total and there were five types of items, which measured: (1) smartphone use frequency (three items; e.g. “On average, how many text messages do you send per day?”); (2) smartphone use without external stimulation (two items; e.g. “If I am waiting to meet a friend, I pass the time by using my smartphone.”), or during other activities (two items; e.g. “I use my smartphone while driving.”); (3) smartphone subjective value (one item; e.g. “How much money would it take for you to give up your phone for a full day?”); (4) smartphone notification type (one item; e.g. “Do you receive notifications (a sound or vibration) on your phone? Please indicate all that apply.”); and (5) phantom vibrations (one item; e.g. “Have you ever thought you heard your phone ring or thought you felt it vibrate, only to find out you were wrong?”). The purpose of the SUQ was to measure participants’ typical smartphone use.

3. Smartphone Attachment and Dependency Inventory (SAD), which was from Ward et al.’s (2017) study. The SAD used 13-items (i.e. 7-point likert scale) that asked participants to rate (i.e., from 1 = Strongly Disagree to 7 = Strongly Agree) how much they agreed with statements regarding their emotional attachment and dependency on their smartphones. The purpose of the SAD was to measure each participant’s reliance on their smartphone.

  • Scoring:
    • Smartphone reliance was calculated with a sum total for all items with a range of 13 to 91. Higher scores indicated a higher level of reliance with three levels: low reliance (13-39), moderate reliance (40-65), and high reliance (66+).

A total of 453 undergraduate students participated in this study. Of the total sample size, 44 participants were excluded due to either testing error (11; e.g. incomplete task data), or experimenter or external confounds (33; e.g. interruption during testing, distracting noise during testing). Therefore, 409 participants were included in the analyses for the location (on desk = 134; pocket/bag = 141; outside = 134) and power (ON = 213; OFF = 196) condition.

Data Analysis Plan

The present study completed a pre-registered data anallysis (pre-registered direct replication on Open Science Framework; OSF). Below is a summary of the data analysis plan (view full version https://osf.io/d5eja & https://osf.io/ptc7k/).

Hypotheses

  1. The location effect hypothesis predicted that those who were closest in proximity to their smartphone (i.e. those with their smartphones on their desk) would show lower performance on the OSpan task.

  2. The power effect hypothesis predicted that smartphone power (i.e. either ON or OFF) would not affect performance on both cognitive tasks.

  3. The moderation effect hypothesis predicted that smartphone attachment and dependency (i.e. SAD score) would moderate the location effect: those who reported higher smartphone attachment and dependency would perform worse on the OSpan task.

Note: All hypotheses were based on the findings from Ward et al. (2017)

Inclusion / Exclusion Criteria

There were two exclusion critera in the present study:

1. OSpan Task

  • Only data from participants who scored 85% accuracy on the math component or above was used.
    • Criteria helped control for any participants who did not follow instructions and neglected the math component of the task. Note: This was the same criteria as in Ward et al. (2017)

2. Cue-Dependent Go/No-Go Task

  • Only data from those who scored higher than chance performance was used (e.g. reponded to at least 50% of “Go” trials and withheld response to at least 50% of “No-Go” trials).
    • Criteria helped control for any participants who did not complete the task as instructed (e.g. guessing). Note: There was no inclusion/exclusion criteria for thsi task in Ward et al. (2017)

Analysis

Testing the Location and Power Hypotheses:

1. OSpan Task

  • The OSpan absolute score was used to measure OSpan performance.
  • A 3(smartphone location: desk, pocket/bag, outside) x 2(smartphone power: ON, OFF) between-subjects ANOVA was conducted (DV = OSpan absolute score).
    • Post-hocs completed if there was a significant ANOVA (\(\alpha = 0.05\)) to determine which location(s) and/or power status were significantly different.

2. Cue-Dependent Go/No-Go Task

  • An error (i.e. omission errors) and accuracy (i.e. RT) analysis was completed using the average omission errors and RT for each participant.

    (a) Error Analysis
    • A 3(Smartphone location: desk, pocket/bag, outside) x 2(Smartphone power: ON, OFF) x 2(Cue type: Go, No-Go) mixed ANOVA was completed (DV = average omission error(s)).
      • The between-subjects factors:smartphone location and power
      • The within-subjects factor: pre-target cue type
      • Post-hocs completed if there was a significant ANOVA (\(\alpha = 0.05\)) to determine which location(s), power status, and/or cue type(s) were significantly different.
    (b) Accuracy Analysis
    • A 3(smartphone location: desk, pocket/bag, outside) x 2(smartphone power: ON, OFF) between-subjects ANOVA was conducted (DV = average RT).
      • The between-subjects factors:smartphone location and power
      • The within-subjects factor: pre-target cue type
      • Post-hocs completed if there was a significant ANOVA (\(\alpha = 0.05\)) to determine which location(s) and/or power status were significantly different.

Testing the Moderation Hypothesis:

3. SAD

(a) Factor Analysis

  • A principal components factor analysis with a Varimax rotation was completed for participant’s SAD score.
    • This was done to confirm the two main factors found by Ward et al. (2017): smartphone dependence and emotional attachment.

(b) Generalized Linear Model

  • A univariate generalized linear model was used to examine if smartphone reliance (i.e. SAD score) was a moderator of the relationship between the experimental manipulation and task performance.

Note: These analysis were completed regardless of the results for the location and power hypotheses.

Analysis Prep

Load Required Libraries

The following packages were used to complete the analyses, visualizations, etc. in this document.

# Code has been commented out to reduce computation time, uncomment any to run the code

# library(devtools)
# install_github("vqv/ggbiplot")
library(ggbiplot)
library(welchADF)
library(schoRsch)
library(plyr)
library(dplyr)
library(lattice)
library(car)
library(knitr)
library(ggplot2)
library(ggpubr)
library(kableExtra)
library(magrittr)
library(ez)
library(Hmisc)
library(rstatix)
library(purrr)
library(Formula)
library(gridExtra)
library(psych)
library(survival)
library(tidyverse)
library(DescTools)
library(GPArotation)
library(apaTables)

Additional Functions

Rounding p-values

This rounding function was adapted from Emily Nielsen’s Rpubs. The function (“p_round(x)”) was created to assess and print p-values. If \(p \ge .005\), the function will display “$p = $” and the value rounded to two decimal places. If $ .0005 p <.005\(, the function will display “\)p = $” and the value rounded to three decimal places. If \(p < .0005\), the function will display “\(p < .001\)”.


p_round <- function(x){
  if(x > .005)
    {x1 = (paste("= ", gsub("0\\.","\\.", round(x, digits = 2)), sep = ''))
  }  
  else if(x == .005){x1 = (paste("=", gsub("0\\.","\\.", 0.01)))
  }
  else if(x > .0005 & x < .005)
    {x1 = (paste("= ", gsub("0\\.","\\.", round(x, digits = 3)), sep = ''))
  }  
  else if(x == .0005){x1 = (paste("=", gsub("0\\.","\\.", 0.001)))
  }
  else{x1 = (paste("<", gsub("0\\.","\\.", 0.001)))
  } 
  (x1)
}

Raw Data

There are five raw data files that were imported for the present study (see the Import Raw Data section).

Task Data

  1. OSpan Raw Data
    • spreadsheet, collected manually from each participant
  2. Go/No-Go Raw Data
    • psychopy excel files
    • Note: raw data was extracted from folder into one file (see the Import Raw Data section)

Survey Measures

  1. Demographic & SUQ Raw Data
    • csv files exported from Qualtrics
  2. SAD Raw Data
    • csv files exported from Qualtrics
  3. Testing Tracking Sheet
    • csv files, collected during testing

Import Raw Data

1. OSpan Raw Data

The OSpan Raw Data was collected using the Inquisit 5 software (LINK), which used the same automated OSpan task as in Ward et al. (2017). The raw data from the OSpan was imported as a dataframe, “ospan_raw”, and included the following: (click to see description)

  • Participant number [ospan_participant]
  • OSpan ID [ospan_ID]
    • A 9-digit ID given by the Iquisit 5 software
  • OSpan Absolute Score [ospan_absolute_score]
    • Performance measured by how many trials a participant correctly recalled all the letters in a given block (75 blocks in total). For example, a participant who recalled three letters (in a block with three letters), five letters (in a block with five letters), and two letters (in a block with six letters) would have an OSpan absolute score of eight for those blocks (i.e. 3 + 5 + 0 = 8).
  • Total Letters Recalled [ospan_total_corr]
    • Performance measured by how many letters were recalled in total, regardless of whether all letters were correct within a given block.
  • Total Math Errors [ospan_math_err_tot]
    • the sum of all math errors (i.e. speed and accuracy)
  • Speed Errors [ospan_math_err_speed]
    • the sum of the speed related math errors
      • where a participant exceeded their time limit to respond to the math component of the task
  • Accuracy Errors [ospan_math_err_accuracy]
    • the sum of the accuracy related math errors
      • where a participant answered incorrectly on the math component of the task
  • Math Percentage [ospan_math_percent_corr]
    • the percent of all math trials that were answered correctly
      • Participants were asked to keep their math percentage at 85% or above to ensure participants did not ignore the math component of the task to improve on the letter recall component
  • Math Percentage Criteria [ospan_math_criteria_85]
    • whether a participant met the math criteria (i.e. 85% or above; yes or no)
      coding information (click to expand)
      • 1 = yes
      • 0 = no * including any participant that did not complete the OSpan task

This code imported the OSpan raw data. (click to expand)

ospan_raw = utils::read.csv("raw_ospan.csv", header = TRUE)

# reorganize rows so that the data is ordered by the participant number (e.g. 1, 2, 3, 4...)
ospan_raw = dplyr::arrange(ospan_raw, ospan_participant)

2. Go/No-Go Raw Data

The Go/No-Go Raw Data was collected using Psychopy (version: XXXX). Each participant’s data was stored in a separate folder (default from psychopy) as an excel file. The following code re-organized the raw data into a single wide format of the data as a dataframe, “gng_data_extracted”, and two data frames that separate the raw data by cue type (i.e.“vertical” or “horizontal”), “gng_data_extracted_v” and “gng_data_extracted_h”. (click to see descriptions)

For the gng_data_extracted data frame:

  • Participant number [gng_participant]
  • Age [age]
  • First Language [lang]
    • self-reported whether their fist language was English (i.e. yes or no)

      coding information (click to expand)

      code 1 2
      response Yes No
  • Total trials completed [gng_trial_tot]
  • GnG Score [gng_score]
    • the sum of the correct answers
  • Total Responses [gng_resp_tot]
    • the sum of the responses for all trials
  • Total No Responses [gng_no_resp_tot]
    • the sum of the trials without a response
  • Total Errors [gng_err_tot]
    • the sum of all errors for all trials
  • Commission Errors [gng_err_com]
    • the sum of all comission errors, defined as:
      • where a reponse was made during a “no-go” trial (i.e. committing an erorr)
  • Omission Errors [gng_err_omi]
    • the sum of all omission errors, defined as:
      • where a reponse was NOT made during a “go” trial (i.e. failing to respond)
  • Average Reaction Time (RT) [gng_rt_av]

For the gng_data_extracted_v data frame:

  • Participant number [gng_participant]
  • Age [age_long]
  • First Language [lang_long]
    • self-reported whether their fist language was English (i.e. yes or no)

      coding information (click to expand)

      code 1 2
      response Yes No
  • Total trials completed [trial_tot_v]
  • GnG Score [score_v]
    • the sum of the correct answers
  • Total Responses [resp_tot_v]
    • the sum of the responses for all trials
  • Total No Responses [no_resp_tot_v]
    • the sum of the trials without a response
  • Total Errors [err_tot_v]
    • the sum of all errors for all trials
  • Commission Errors [err_com_v]
    • the sum of all comission errors, defined as:
      • where a reponse was made during a “no-go” trial (i.e. committing an erorr)
  • Omission Errors [err_omi_v]
    • the sum of all omission errors, defined as:
      • where a reponse was NOT made during a “go” trial (i.e. failing to respond)
  • Average Reaction Time (RT) [rt_av_v]
  • Response Exclusion Criteria [exclude_v]
    • This depicts whether a participant should be excluded from analysis based on the response exclusion criteria
      • To be included, participants should score higher than chance performance (i.e. responded to at least 50% of “Go” trials and withheld response to at least 50% of “No-Go” trials)

        coding information (click to expand)

        code 1 2
        response Yes (participant excluded) No (participant included)

For the gng_data_extracted_h data frame:

  • Participant number [gng_participant]
  • Age [age_long]
  • First Language [lang_long]
    • self-reported whether their fist language was English (i.e. yes or no)

      coding information (click to expand)

      code 1 2
      response Yes No
  • Total trials completed [trial_tot_h]
  • GnG Score [score_h]
    • the sum of the correct answers
  • Total Responses [resp_tot_h]
    • the sum of the responses for all trials
  • Total No Responses [no_resp_tot_h]
    • the sum of the trials without a response
  • Total Errors [err_tot_h]
    • the sum of all errors for all trials
  • Commission Errors [err_com_h]
    • the sum of all comission errors, defined as:
      • where a reponse was made during a “no-go” trial (i.e. committing an erorr)
  • Omission Errors [err_omi_h]
    • the sum of all omission errors, defined as:
      • where a reponse was NOT made during a “go” trial (i.e. failing to respond)
  • Average Reaction Time (RT) [rt_av_h]
  • Response Exclusion Criteria [exclude_h]
    • This depicts whether a participant should be excluded from analysis based on the response exclusion criteria
      • To be included, participants should score higher than chance performance (i.e. responded to at least 50% of “Go” trials and withheld response to at least 50% of “No-Go” trials)

        coding information (click to expand)

        code 1 2
        response Yes (participant excluded) No (participant included)

This code imported the Go/No-Go raw data without separating the data by cue type. (click to expand)


#---WIDE DATA FRAME---"gng_data_extracted"  

#--This code completes the following tasks:
# (1) Create a function that extracts any required information from all psychopy folders at once
# (2) Create a list of all the directory paths to each individual file 
# (3) Apply the function to the files 
# (4) Create a data frame of the result of the function on the files

#--The FINAL OUTPUT = a wide format of the raw GnG data including:
# participant, age, lang, trial_tot, score, resp_tot, no_resp_tot, err_tot, err_com, err_omi, rt_av

#--NOTES:

# Step (3) will give warnings if there is any missing data that the function was not specified how to handle
# example: Any participant that did not respond to an item or during all trials will have an "NA" response in the data
# example: Any participant who reported their age or first language in an irregular manner (e.g. "ninteen" rather than "19")
# Either example will result in a warning, since the function would have to force that output to be "NA" as well

# Individual participant files do not stay in chronological order 
# (they are re-ordered so that all ps that start with the digit "1" go first, then the "2"...)

#--(1) Create a function that extracts any required information from all psychopy folders at once

# create a function that extracts all the required information from each excel file
# details for what is extracted in this function is found within it, 
# to use it to extract any other &/ additional information, edit within the function itself
extract_gng = function(data) {
  temp_data = readxl::read_excel(data)
  
  # Participant number (participant)
  participant = as.numeric(temp_data[which(temp_data == "participant"),2])
  
  # Age (age)
  age = as.integer(temp_data[which(temp_data == "Age"),2])
  
  # Gender (gender)
  # will be populated from survey measures
  
  # First Language (lang)
  # this finds if the language reported was English: if yes = 1; no = 0
  lang = ifelse(stringr::str_detect(as.character(temp_data[which(temp_data == "First Language"),2]), stringr::fixed("english", ignore_case=TRUE)), 1, 0)
  
  # Location Condition (location; either: (1) desk, (2) pocket/bag, or (3) outside)
  # will be populated from survey measures
  
  # Power Condition (power; either: (1) ON,or (2) OFF)
  # will be populated from survey measures
  
  # Total trials completed (trial_tot)
  # count the "order" column, which shows the order in which the trials were showed (random for each participant)
  # note: must ignore the NA values
  trial_tot = sum((is.na(temp_data$cue) == "FALSE" & is.na(temp_data$order) == "FALSE"), na.rm = TRUE) #alternative: sum(complete.cases(temp_data$order))
  
  # GnG Score (score)
  # sum of the correct answers from the "key_resp_slash.corr_raw" column
  # note: must ignore the NA values
  score = sum(temp_data$key_resp_slash.corr_raw, na.rm = TRUE)
  
  # Total Responses (resp_tot)
  # sum of the responses from the "key_resp_slash.keys_raw" column
  # note: must ignore the NA values
  resp_tot = sum((is.na(temp_data$cue) == "FALSE" & is.na(temp_data$key_resp_slash.keys_raw) == "FALSE"), na.rm = TRUE) #alternative: sum(complete.cases(temp_data$key_resp_slash.keys_raw))
  
  # Total No Responses (no_resp_tot)
  # subtract total responses made from completed trials 
  # note: must ignore the NA values
  no_resp_tot = (trial_tot - resp_tot)
  
  # Total Errors (err_tot)
  # sum of the incorrect answers from the "key_resp_slash.corr_raw"
  err_tot = sum((temp_data$key_resp_slash.corr_raw == "0") == "TRUE", na.rm = TRUE) # alternative: sum(!temp_data$key_resp_slash.corr_raw, na.rm = TRUE)
  
  # Commission Errors (err_com)
  # sum any cases where a reponse was made & the answer was incorrect (i.e. committing an erorr)
  err_com = sum((temp_data$key_resp_slash.corr_raw == "0" & !is.na(temp_data$key_resp_slash.keys_raw)) == "TRUE", na.rm = TRUE)
  
  # Omission Errors (err_omi)
  # sum any cases where a reponse was NOT made & the answer was incorrect (i.e. failing to respond)
  err_omi = sum((temp_data$key_resp_slash.corr_raw == "0" & is.na(temp_data$key_resp_slash.keys_raw)) == "TRUE", na.rm = TRUE)
  
  # Average Reaction Time (RT) (rt_av)
  # calculate the average / mean RT for all responses
  # Note: the "no-go" trials required no response for a correct trial (for 1,000ms)
  rt_av = mean(temp_data$key_resp_slash.rt_raw, na.rm = TRUE)
  
  temp_dataframe = as.data.frame(c(participant, age, lang, trial_tot, 
                                   score, resp_tot, no_resp_tot, 
                                   err_tot, err_com, err_omi, rt_av))
}

#--(2) Create a list of all the directory paths to each individual file 

# Find the location of each excel (.xlsx) files
# the folders containing the files should be in 1 folder (e.g. "" in this case)
# this returns a list of each file's location
# Note: the pattern is set to ignore any hidden / temporary files by adding the start of each file name (e.g. "^NoGo")
files_psychopy = dir("/Users/anaruizpardo/RStudio/Smartphone_study/Study 1- Smartphone Replication/Dissertation (04:20:20)/Raw Data Organizing/gng_raw_folders", 
                     recursive = TRUE, full.names=TRUE, pattern="^NoGo(.*)xlsx$")

#--(3) Apply the function to the files 

# Apply the function to all excel files
result = sapply(files_psychopy, extract_gng)

#--(4) Create a data frame of the result of the function on the files

# take the lists of the extracted data and turn it into a data frame
gng_data_extracted = data.frame(Reduce(rbind, result)) # alternative method: x = data.frame(t(sapply(result,c)))
# rename the columns
colnames(gng_data_extracted) = c("gng_participant", "age", "lang", "gng_trial_tot", "gng_score", 
                                 "gng_resp_tot", "gng_no_resp_tot", "gng_err_tot", "gng_err_com", 
                                 "gng_err_omi", "gng_rt_av") 
# remove row names to avoid confusion
rownames(gng_data_extracted) = NULL

# reorganize rows so that the data is ordered by the participant number (e.g. 1, 2, 3, 4...)
gng_data_extracted = dplyr::arrange(gng_data_extracted, gng_participant)

This code imported the Go/No-Go raw data and separated the data by cue type (i.e. “vertical” or “horizontal”). (click to expand)


#--- for the LONG DATA FRAME - CUE TYPE (vertical vs. horizontal) --- "gng_data_extracted_v" & "gng_data_extracted_h" 
  
#-- This code differs from the above code in that it organizes the gng data based the CUE type (i.e. either a "vertical" (v) or "horizontal" (h)).
  # This means that the data is in a long format to test the within-subjects factor of cue type.

#--This code completes the following tasks:
# (1) Create a function that extracts any required information from all psychopy folders at once
# (2) Create a list of all the directory paths to each individual file 
# (3) Apply the function to the files 
# (4) Create a data frame of the result of the function on the files
# (5) Re-structure the data into 1 data frame in a long format for a repeated measures analysis

#--The FINAL OUTPUT = a wide format of the raw GnG data including:
# participant, age, lang, trial_tot, score, resp_tot, no_resp_tot, err_tot, err_com, err_omi, rt_av

#--NOTES:

# Step (3) will give warnings if there is any missing data that the function was not specified how to handle
# example: Any participant that did not respond to an item or during all trials will have an "NA" response in the data
# example: Any participant who reported their age or first language in an irregular manner (e.g. "ninteen" rather than "19")
# Either example will result in a warning, since the function would have to force that output to be "NA" as well

# Individual participant files do not stay in chronological order 
# (they are re-ordered so that all ps that start with the digit "1" go first, then the "2"...)

#--(1) Create a function that extracts any required information from all psychopy folders at once...
# for the "vertical" (v) (extract_gng_v) and "horizontal" (h) (extract_gng_h) trials

# create a function that extracts all the required information from each excel file
# details for what is extracted in this function is found within it, 
# to use it to extract any other &/ additional information, edit within the function itself
# NOTE: this function extracts 2 versions of each value: one for the "vertical" (v) trials, and one for the "horizontal" (h) trials

extract_gng_v = function(data) {
  temp_data = readxl::read_excel(data)
  
  # Participant number (participant_long)
  participant_long = as.numeric(temp_data[which(temp_data == "participant"),2])
  # only one is extracted since this will not change b/w CUE type
  
  # Age (age_long)
  age_long = as.integer(temp_data[which(temp_data == "Age"),2])
  # only one is extracted since this will not change b/w CUE type
  
  # Gender (gender)
  # will be populated from survey measures
  # only one is extracted since this will not change b/w CUE type
  
  # First Language (lang_long)
  # this finds if the language reported was English: if yes = 1; no = 0
  lang_long = ifelse(stringr::str_detect(as.character(temp_data[which(temp_data == "First Language"),2]), stringr::fixed("english", ignore_case=TRUE)), 1, 0)
  # only one is extracted since this will not change b/w CUE type
  
  # Location Condition (location; either: (1) desk, (2) pocket/bag, or (3) outside)
  # will be populated from survey measures
  # only one is extracted since this will not change b/w CUE type
  
  # Power Condition (power; either: (1) ON,or (2) OFF)
  # will be populated from survey measures
  # only one is extracted since this will not change b/w CUE type
  
  # Total trials completed for the "vertical" (v) (trial_tot_v)
  # count the "order" column, which shows the order in which the trials were showed (random for each participant)
  # note: must ignore the NA values
  trial_tot_v = sum((temp_data$cue == "v_b.png" & is.na(temp_data$order) == "FALSE"), na.rm = TRUE)
  
  # GnG Score for the "vertical" (v) (score_v)  
  # sum of the correct answers from the "key_resp_slash.corr_raw" column
  # note: must ignore the NA values
  score_v = sum(temp_data$cue == "v_b.png" & temp_data$key_resp_slash.corr_raw, na.rm = TRUE)  
  
  # Total Responses for the "vertical" (v) (resp_tot_v)
  # sum of the responses from the "key_resp_slash.keys_raw" column
  # note: must ignore the NA values
  resp_tot_v = sum((temp_data$cue == "v_b.png" & temp_data$key_resp_slash.keys_raw == "'slash'"), na.rm = TRUE)  
  
  # Total No Responses for the "vertical" (v) (no_resp_tot_v)  
  # subtract total responses made from completed trials 
  # note: must ignore the NA values
  no_resp_tot_v = (trial_tot_v - resp_tot_v)  
  
  # Total Errors for the "vertical" (v) (err_tot_v) 
  # sum of the incorrect answers from the "key_resp_slash.corr_raw"
  err_tot_v = sum((temp_data$cue == "v_b.png" & temp_data$key_resp_slash.corr_raw == "0"), na.rm = TRUE)  
  
  # Commission Errors for the "vertical" (v) (err_com_v) 
  # sum any cases where a reponse was made & the answer was incorrect (i.e. committing an error)
  err_com_v = sum(temp_data$cue == "v_b.png" & temp_data$key_resp_slash.corr_raw == "0" & !is.na(temp_data$key_resp_slash.keys_raw), na.rm = TRUE)  
  
  # Omission Errors for the "vertical" (v) (err_omi_v)  
  # sum any cases where a reponse was NOT made & the answer was incorrect (i.e. failing to respond)
  err_omi_v = sum(temp_data$cue == "v_b.png" & temp_data$key_resp_slash.corr_raw == "0" & is.na(temp_data$key_resp_slash.keys_raw), na.rm = TRUE)  

  # Average Reaction Time (RT) for the "vertical" (v) (rt_av_v)  
  # calculate the average / mean RT for all responses
  # Note: the "no-go" trials required no response for a correct trial (for 1,000ms)
  rt_av_v = mean(temp_data$key_resp_slash.rt_raw[which(temp_data$cue == "v_b.png")], na.rm = TRUE)  
  
  # Exclusion Criteria for the "vertical" (v) (exclude_v)
  # calculate whether participants scored higher than chance performance 
  # (i.e. responded to at least 50% of “Go” trials and withheld response to at least 50% of “No-Go” trials)
  # code: exclude = 1; include (do not exclude) = 2
  # NOTE: this calculation is meant to exclude the participant as a whole, therefore, 
  # it does not calculate it for the cue type specifically
    # first, get the total trials and response totals for the "go" and "no-go" trials
      # "go"
      trial_tot_go = sum((temp_data$corrAns == "slash" & is.na(temp_data$order) == "FALSE"), na.rm = TRUE)
      resp_tot_go = sum((temp_data$corrAns == "slash" & temp_data$key_resp_slash.keys_raw == "'slash'"), na.rm = TRUE)
      # "no-go"
      trial_tot_nogo = sum((temp_data$corrAns == "none" & is.na(temp_data$order) == "FALSE"), na.rm = TRUE)
      resp_tot_nogo = sum((temp_data$corrAns == "none" & temp_data$key_resp_slash.keys_raw == "'slash'"), na.rm = TRUE)
    # next, find the minimum response needed based on the 50% criteria
    min_resp = trial_tot_go * .50
    # find if the participant meets exclusion criteria for the "go" and "no-go" trials
    meets_min_go = resp_tot_go > (min_resp)
    meets_min_nogo = resp_tot_nogo < (min_resp)
    # find if the participant meets exclusion criteria (exclude = 1; include = 2)
    exclude_v = ifelse((meets_min_go == "TRUE" & meets_min_nogo == "TRUE") == "FALSE", 1, 2)
  
  temp_dataframe_v = as.data.frame(c(participant_long, age_long, lang_long, trial_tot_v, score_v, resp_tot_v, 
                                        no_resp_tot_v, err_tot_v, err_com_v, err_omi_v, rt_av_v, exclude_v))
}  

extract_gng_h = function(data) {
  temp_data = readxl::read_excel(data)
  
  # Participant number (participant_long)
  participant_long = as.numeric(temp_data[which(temp_data == "participant"),2])
  # only one is extracted since this will not change b/w CUE type
  
  # Age (age_long)
  age_long = as.integer(temp_data[which(temp_data == "Age"),2])
  # only one is extracted since this will not change b/w CUE type
  
  # Gender (gender)
  # will be populated from survey measures
  # only one is extracted since this will not change b/w CUE type
  
  # First Language (lang_long)
  # this finds if the language reported was English: if yes = 1; no = 0
  lang_long = ifelse(stringr::str_detect(as.character(temp_data[which(temp_data == "First Language"),2]), stringr::fixed("english", ignore_case=TRUE)), 1, 0)
  # only one is extracted since this will not change b/w CUE type
  
  # Location Condition (location; either: (1) desk, (2) pocket/bag, or (3) outside)
  # will be populated from survey measures
  # only one is extracted since this will not change b/w CUE type
  
  # Power Condition (power; either: (1) ON,or (2) OFF)
  # will be populated from survey measures
  # only one is extracted since this will not change b/w CUE type
  
  # Total trials completed for the "horizontal" (h) (trial_tot_h)
  # count the "order" column, which shows the order in which the trials were showed (random for each participant)
  # note: must ignore the NA values
  trial_tot_h = sum((temp_data$cue == "h_b.png" & is.na(temp_data$order) == "FALSE"), na.rm = TRUE)
  
  # GnG Score for the "horizontal" (h) (score_h)  
  # sum of the correct answers from the "key_resp_slash.corr_raw" column
  # note: must ignore the NA values
  score_h = sum(temp_data$cue == "h_b.png" & temp_data$key_resp_slash.corr_raw, na.rm = TRUE)  
  
  # Total Responses for the "horizontal" (h) (resp_tot_h)
  # sum of the responses from the "key_resp_slash.keys_raw" column
  # note: must ignore the NA values
  resp_tot_h = sum((temp_data$cue == "h_b.png" & temp_data$key_resp_slash.keys_raw == "'slash'"), na.rm = TRUE)  
  
  # Total No Responses for the "horizontal" (h) (no_resp_tot_h)  
  # subtract total responses made from completed trials 
  # note: must ignore the NA values
  no_resp_tot_h = (trial_tot_h - resp_tot_h)  
  
  # Total Errors for the "horizontal" (h) (err_tot_h) 
  # sum of the incorrect answers from the "key_resp_slash.corr_raw"
  err_tot_h = sum((temp_data$cue == "h_b.png" & temp_data$key_resp_slash.corr_raw == "0"), na.rm = TRUE)  
  
  # Commission Errors for the "horizontal" (h) (err_com_h) 
  # sum any cases where a reponse was made & the answer was incorrect (i.e. committing an error)
  err_com_h = sum(temp_data$cue == "h_b.png" & temp_data$key_resp_slash.corr_raw == "0" & !is.na(temp_data$key_resp_slash.keys_raw), na.rm = TRUE)  
  
  # Omission Errors for the "horizontal" (h) (err_omi_h)  
  # sum any cases where a reponse was NOT made & the answer was incorrect (i.e. failing to respond)
  err_omi_h = sum(temp_data$cue == "h_b.png" & temp_data$key_resp_slash.corr_raw == "0" & is.na(temp_data$key_resp_slash.keys_raw), na.rm = TRUE)  
  
  # Average Reaction Time (RT) for the "horizontal" (h) (rt_av_h)  
  # calculate the average / mean RT for all responses
  # Note: the "no-go" trials required no response for a correct trial (for 1,000ms)
  rt_av_h = mean(temp_data$key_resp_slash.rt_raw[which(temp_data$cue == "h_b.png")], na.rm = TRUE)  
  
  # Exclusion Criteria for the "horizontal" (h) (exclude_h)
  # calculate whether participants scored higher than chance performance 
  # (i.e. responded to at least 50% of “Go” trials and withheld response to at least 50% of “No-Go” trials)
  # code: exclude = 1; include (do not exclude) = 2
  # NOTE: this calculation is meant to exclude the participant as a whole, therefore, 
  # it does not calculate it for the cue type specifically
  # first, get the total trials and response totals for the "go" and "no-go" trials
  # "go"
  trial_tot_go = sum((temp_data$corrAns == "slash" & is.na(temp_data$order) == "FALSE"), na.rm = TRUE)
  resp_tot_go = sum((temp_data$corrAns == "slash" & temp_data$key_resp_slash.keys_raw == "'slash'"), na.rm = TRUE)
  # "no-go"
  trial_tot_nogo = sum((temp_data$corrAns == "none" & is.na(temp_data$order) == "FALSE"), na.rm = TRUE)
  resp_tot_nogo = sum((temp_data$corrAns == "none" & temp_data$key_resp_slash.keys_raw == "'slash'"), na.rm = TRUE)
  # next, find the minimum response needed based on the 50% criteria
  min_resp = trial_tot_go * .50
  # find if the participant meets exclusion criteria for the "go" and "no-go" trials
  meets_min_go = resp_tot_go > (min_resp)
  meets_min_nogo = resp_tot_nogo < (min_resp)
  # find if the participant meets exclusion criteria (exclude = 1; include = 2)
  exclude_h = ifelse((meets_min_go == "TRUE" & meets_min_nogo == "TRUE") == "FALSE", 1, 2)
  
  temp_dataframe_h = as.data.frame(c(participant_long, age_long, lang_long, trial_tot_h, score_h, resp_tot_h, 
                                     no_resp_tot_h, err_tot_h, err_com_h, err_omi_h, rt_av_h, exclude_h))
}  

#--(2) Create a list of all the directory paths to each individual file 

# Find the location of each excel (.xlsx) files
# the folders containing the files should be in 1 folder (e.g. "" in this case)
# this returns a list of each file's location
# Note: the pattern is set to ignore any hidden / temporary files by adding the start of each file name (e.g. "^NoGo")
# THIS IS THE SAME AS IN THE ABOVE CODE, SO IT IS COMMENTED OUT TO AVOID EXTRA COMPUTING POWER

# files_psychopy = dir("/Users/anaruizpardo/RStudio/Smartphone_study/Study 1- Smartphone Replication/Dissertation (04:20:20)/Raw Data Organizing/gng_raw_folders", 
#                     recursive = TRUE, full.names=TRUE, pattern="^NoGo(.*)xlsx$")

#--(3) Apply the function to the files 

# Apply the function to all excel files for the "vertical" (v) and "horizontal" (h) trials  
result_v = sapply(files_psychopy, extract_gng_v)
result_h = sapply(files_psychopy, extract_gng_h)

#--(4) Create a data frame of the result of the function on the files...
# for the "vertical" (gng_data_extracted_v) and "horizontal" (gng_data_extracted_h) trials  

# take the lists of the extracted data and turn it into a data frame for the "vertical" (h) trials
gng_data_extracted_v = data.frame(Reduce(rbind, result_v)) 
# rename the columns
colnames(gng_data_extracted_v) = c("participant_long", "age_long", "lang_long", "trial_tot_v", "score_v", 
                                         "resp_tot_v", "no_resp_tot_v", "err_tot_v", "err_com_v", 
                                         "err_omi_v", "rt_av_v", "exclude_v")

# take the lists of the extracted data and turn it into a data frame for the "horizontal" (h) trials
gng_data_extracted_h = data.frame(Reduce(rbind, result_h)) 
# rename the columns
colnames(gng_data_extracted_h) = c("participant_long", "age_long", "lang_long", "trial_tot_h", "score_h", 
                                           "resp_tot_h", "no_resp_tot_h", "err_tot_h", "err_com_h", 
                                           "err_omi_h", "rt_av_h", "exclude_h")
# remove row names to avoid confusion
rownames(gng_data_extracted_v) = NULL
rownames(gng_data_extracted_h) = NULL

# reorganize rows so that the data is ordered by the participant number (e.g. 1, 2, 3, 4...)
gng_data_extracted_v = dplyr::arrange(gng_data_extracted_v, participant_long)
gng_data_extracted_h = dplyr::arrange(gng_data_extracted_h, participant_long)

3. & 4. Demographic, SUQ, and SAD Raw Data

The Demographic & SUQ Raw Data and SAD Raw Data was collected using Qualtrics the Inquisit 5 software (LINK), which was the same as Ward et al. (2017). The raw data from the OSpan was imported as a dataframe, “ospan_raw”, and included the following: (click to see description)

General, Manipulation, and Demographic Information

General Information: (click to expand)

  • Start Date [date_start]
    • The date and time when a participant began their survey on Qualtrics (Y/M/D H:M)
  • End Date [date_end]
    • The date and time when a participant finished their survey on Qualtrics (Y/M/D H:M)
  • Duration (in seconds) [duration(s)]
    • The duration of the survey for the participant in seconds
  • Recorded Date [date_rec]
    • The date and time when a participant’s data was recorded on Qualtrics (Y/M/D H:M)
  • Participant number [sur_participant]

Manipulation Condition: (click to expand)

  • Smartphone Location [sur_location]
    • The smartphone location assigned to the participant (i.e. desk, pocket/bag, or outside)

      coding information (click to expand)

      code 1 2 3
      response Desk Pocket/Bag Outside
  • Smartphone Power [sur_power]
    • The smartphone power assigned to the participant (i.e. ON or OFF)

      coding information (click to expand)

      code 1 2
      response On Off
  • Smartphone Condition [sur_condition]
    • The smartphone condition (i.e. combination of location & power) assigned to the participant

      coding information (click to expand)

      code 1 2 3 4 5 6
      response ON & Desk ON & Pocket/Bag ON & Outside OFF & Desk OFF & Pocket/Bag OFF & Outside

Demographics: (click to expand)

  • Gender [gender]
    • Participant’s self-reported gender (i.e., male, female, other, or prefer not to say)

      coding information (click to expand)

      code 1 2 3 4
      response Male Female Other Prefer not to say
  • Year [year]
    • Participant’s self-reported year of study in their respective program (e.g. 1, 2, 3, 4)
  • Program [program]
    • Participant’s self-reported program of study (e.g., biology, BMOS, psychology)

      coding information (click to expand)

      1 = actuarial 11 = economics 21 = management 31 = psychology
      2 = anthropology science 22 = mathematics 32 = SASAH
      3 = arts 13 = english 23 = medical science 33 = science
      4 = arts & humanities 14 = film 24 = MIT 34 = social science
      5 = biology 15 = FIMS 25 = MOS 35 = sociology
      6 = BMOS 16 = geography 26 = music 36 = software
      7 = chemistry 17 = health science 27 = nutrition & dietetics 37 = visual arts
      8 = computer science 18 = integrated science 28 = philosophy
      9 = consumer behaviour 19 = kinesiology 29 = physics
      10 = criminology 20 = linguistics 30 = political science
The Smartphone Use Questionnaire (SUQ)

The purpose of the SUQ was to measure participants’ typical smartphone use. The SUQ was created for the present study and consisted of modified items from Ward and colleagues’ (2017) survey measures. (i.e. found in the “BRAIN DRAIN” WEB APPENDIX”). Some items were forced-choice and some were on a 7-point likert scale ranging from 1 (“Never”) to 7 (“Always”). There were 10 items in total and there were five types of items, which measured:

1. Smartphone Use Frequency: (SUQ_use_1 - SUQ_use_3)

Each of the following items used the following coding:

CODE 1 2 3 4
response 0-5 6-10 11-15 > 15
  • Text Messages (per day) [SUQ_use_1]
    • Answer to the following question: “On average, how many text messages do you send per day?
  • Social Media Messages (per day) [SUQ_use_2]
    • Answer to the following question: “On average, how many social media based messages do you send per day from your smartphone? (iMessage, Facebook Messenger, WhatsApp, WeChat, direct messages within social media platforms, etc.)
  • Social Media Posts (per day) [SUQ_use_3]
    • Answer to the following question: “On average, how many social media posts (e.g. written post, picture, article, etc.) do you send per day from your smartphone? (Facebook, Twitter, Instagram, etc.)

2. Smartphone Use With External Stimulation or During Other Activities: (SUQ_use_4 - SUQ_use_7)

Each of the following items used the following coding:

code 1 2 3 4 5 6 7
response Never About half
the time
Always
no text specified
  • Text Messages (per day) [SUQ_use_4]
    • Answer to the following question: “I look at my smartphone before I roll out of bed in the morning.
  • Social Media Messages (per day) [SUQ_use_5]
    • Answer to the following question: “If I am waiting to meet a friend, I pass the time by using my smartphone.
  • Social Media Posts (per day) [SUQ_use_6]
    • Answer to the following question: “I use my smartphone while driving.
  • Social Media Posts (per day) [SUQ_use_7]
    • Answer to the following question: “If my smartphone rings or vibrates in the middle of personal business, I look at it.

3. Smartphone Subjective Value: (SUQ_exp_1)

  • Subjective Value [SUQ_use_1]
    • Answer to the following question: “How much money would it take for you to give up your phone for a full day?

      coding information (click to expand)

      code 1 2 3 4
      response $0-$20 $21-$40 $41-$60 > $60

4. Smartphone Notification Type: (SUQ_exp_2 & SUQ_exp_3)

  • Notification Type [SUQ_use_2]
    • Answer to the following question: “Do you receive notifications (a sound or vibration) on your phone? Please indicate all that apply.

      coding information (click to expand)

      code 1 2 3 4 5 6 7
      response Email Facebook Twitter Instagram LinkedIn Snapchat Other (specify)
  • Specify Notification Type [SUQ_exp_3]
    • Answer to the following question: “Please Specify: Where you receive notifications (a sound or vibration) on your phone?

5. Phantom Vibrations: (SUQ_exp_4)

  • Phantom Vibrations [SUQ_use_4]
    • Answer to the following question: “Have you ever thought you heard your phone ring or thought you felt it vibrate, only to find out you were wrong?

      coding information (click to expand)

      code 1 2
      response Yes No
The Smartphone Attachment and Dependency Inventory (SAD; Ward et al., 2017)

The SAD asked participants to respond to 13 items (SAD_1-SAD_13) using a 7-point likert scale. (click to expand)

code 1 2 3 4 5 6 7
response Strongly
Disagree
Disagree Somewhat
Disagree
Neutral Somewhat
Agree
Agree Strongly
Agree
  • SAD_1: I would have trouble getting through a normal day without my smartphone.
  • SAD_2: It would be painful for me to give up my smartphone for a day.
  • SAD_3: I feel like I could not live without my smartphone.
  • SAD_4: If I forgot to bring my smartphone with me, I would feel anxious.
  • SAD_5: It drives me crazy when my smartphone runs out of battery.
  • SAD_6: I am upset and annoyed when I find I do not have reception on my smartphone.
  • SAD_7: I feel impatient when the Internet connection speed on my smartphone is slow.
  • SAD_8: I feel lonely when my smartphone does not ring or vibrate for several hours.
  • SAD_9: Using my smartphone relieves me of my stress.
  • SAD_20: I feel excited when I have a new message or notification.
  • SAD_11: Using my smartphone makes me feel happy.
  • SAD_12: I find it tough to focus whenever my smartphone is nearby.
  • SAD_13: I become less attentive to my surroundings when I’m using my smartphone.

This code imported the Demographic, SUQ, and SAD raw data. (click to expand)

survey_raw = utils::read.csv("raw_survey.csv", header = TRUE)

# reorganize rows so that the data is ordered by the participant number (e.g. 1, 2, 3, 4...)
survey_raw = dplyr::arrange(survey_raw, sur_participant)

5. Testing Tracking Sheet Raw Data

The Testing Tracking Sheet Raw Data was collected during testing by the experimenter to store relevant information on participant’s (1) General Information (e.g. testing start date), (2) Manipulation Condition (e.g. assigned location and power manipulation), and (3) Data Cleaning (e.g. exclusion critera from testing error or other non-task-specific criteria). The raw data from the tracking sheet was imported as a dataframe, “track_raw”, and included the following: (click to see description)

1. General Information:

  • Testing Date [track_date]
    • The date a participant completed the study (Y/M/D)
  • Testing Time [date_end]
    • The time a participant was scheduled to complete the study (24hr-time). Note: Many participants were tested at the same time, but in separate cubicle-type desks to avoid any group-testing dynamic
  • Participant number [track_participant]

2. Manupulation Condition:

  • Smartphone Location [track_location]
    • The smartphone location assigned to the participant (i.e. desk, pocket/bag, or outside)

      coding information (click to expand)

      code 1 2 3
      response Desk Pocket/Bag Outside
  • Smartphone Power [track_power]
    • The smartphone power assigned to the participant (i.e. ON or OFF)

      coding information (click to expand)

      code 1 2
      response On Off
  • Smartphone Condition [track_condition]
    • The smartphone condition (i.e. combination of location & power) assigned to the participant

      coding information (click to expand)

      code 1 2 3 4 5 6
      response ON & Desk ON & Pocket/Bag ON & Outside OFF & Desk OFF & Pocket/Bag OFF & Outside
  • Counter Balanced Order [balance]
    • This depicts which of the main tasks (i.e. either the OSpan task or the Cue-Dependent Go/No-Go Task) was completed first

      coding information (click to expand)

      code 1 2
      response OSpan Task completed first Cue-Dependent Go/No-Go Task completed first

3. Data Cleaning:

  • Exclude from Analysis [exclude]
    • This depicts whether a participant should be excluded from analysis due to testing error or other non-task-specific criteria (i.e. for reasons other than the task exclusion criteria in the data analysis plan).

      coding information (click to expand)

      code 1 2
      response Yes (participant excluded) No (participant included)
  • Reason for Exclusion from Analysis [reason]
    • This depicts the reason (summarized into 5 categories) a participant should be excluded from analysis due to testing error or other non-task-specific criteria (i.e. for reasons other than the task exclusion criteria in the data analysis plan). Note: this includes a category for “no exclusion issues”, where a participant was not excluded

      coding information (click to expand)

      code 1 2 3 4 5
      response no exclusion issues external distractor (noise) gng incomplete gng missing ospan missing
  • Possible Outlier [pos_out]
    • This depicts whether a participant is a possible outlier for exploratory analyses. Note: this was not applied to any analysis unless explicitly stated

      coding information (click to expand)

      code 1 2
      response Yes (does qualify as a possible outlier) No (does not qualify as a possible outlier)
  • Experimenter [experimenter]
    • This depicts which experimenter (3 total) collected the data for a participant. Note: this was collected for exploratory analyses and was not used unless explicitly stated

      coding information (click to expand)

      code 1 2 3
      response ARP SW LM

This code imported the study tracking sheet raw data. (click to expand)

track_raw = utils::read.csv("raw_tracking.csv", header = TRUE)

# reorganize rows so that the data is ordered by the participant number (e.g. 1, 2, 3, 4...)
track_raw = dplyr::arrange(track_raw, track_participant)

Construct Working Dataframes

In order to complete the present study’s analyses, eight data frames were created.

1. All Data

All the data from the present study in one data frame: “all_data” (click to see more)

  • This data included the following:
    • participant, track_date, track_time, location, power, condition, balance, exclude, reason, pos_out, experimenter, ospan_ID, ospan_absolute_score, ospan_total_corr, ospan_math_err_tot, ospan_math_err_speed, ospan_math_err_accuracy, ospan_math_percent_corr, ospan_math_criteria_85, age, lang, gng_trial_tot, gng_score, gng_resp_tot, gng_no_resp_tot, gng_err_tot, gng_err_com, gng_err_omi, gng_rt_av, date_start, date_end, duration.s., date_rec, gender, program, year, SUQ_use_1, SUQ_use_2, SUQ_use_3, SUQ_use_4, SUQ_use_5, SUQ_use_6, SUQ_use_7, SUQ_exp_1, SUQ_exp_2, SUQ_exp_3, SUQ_exp_4, SAD_1, SAD_2, SAD_3, SAD_4, SAD_5, SAD_6, SAD_7, SAD_8, SAD_9, SAD_10, SAD_11, SAD_12, SAD_13

This code organized the raw data into one data frame. (click to expand)


# Find the final participant list for the analyses  
  # get a list of participants that (1) completed the gng task and (2) are not excluded (i.e., inclusion list) based on the "exclude" column in the tracking sheet  
  # (1) First get the gng participant list  
  gng_par_list = gng_data_extracted$gng_participant  
  # (2) Next, get the inclusion list  
  include_par_list = data.frame(track_raw$track_participant[which(track_raw$exclude %in% 2)])  
    colnames(include_par_list) = c("par_list")
  # Get a list of all the participants who completed the study (from the testing tracking sheet)
  all_participants = as.data.frame(track_raw$track_participant)
    colnames(all_participants) = c("all_participants")
  
# find the participants to remove from the data
  # Filter each data list from the list of all participants  
  gng_remove = data.frame("all_participants" = filter(all_participants, !(all_participants %in% gng_par_list)))
  track_remove = data.frame("all_participants" = filter(all_participants, !(all_participants %in% include_par_list$par_list)))
  reason_remove = c(rep("gng", length(gng_remove$all_participants)), rep("track", length(track_remove$all_participants)))
  
  # Make a list of all the removed participants -- "all_remove"
  all_remove_notclean = data.frame(rbind(gng_remove, track_remove), reason_remove)
    # remove any duplicates from the list
    all_remove_nodup = all_remove_notclean[!duplicated(all_remove_notclean$all_participants), ]
    # reorganize rows so that the data is ordered by the participant number (e.g. 1, 2, 3, 4...)  
    all_remove = dplyr::arrange(all_remove_nodup, all_participants)  
    
# Filter raw data using the final participant list for: ospan_raw, gng_raw, survey_raw, and track_raw
  # This will now the the working data for the study (i.e. the "final" data)  
    # OSpan Data  
    ospan_final = dplyr::filter(ospan_raw, !ospan_participant %in% all_remove$all_participants)
    # Gng Data  
    gng_final = dplyr::filter(gng_data_extracted, !gng_participant %in% all_remove$all_participants)
    # Survey Data  
    survey_final = dplyr::filter(survey_raw, !sur_participant %in% all_remove$all_participants)
    # Tracking Data  
    track_final = dplyr::filter(track_raw, !track_participant %in% all_remove$all_participants)

# Combine all the final data into 1 data frame with the following:
  # Start by creating a data frame with the final participant list in the 1st column: "participant"
  all_par = as.data.frame(track_final$track_participant)
    colnames(all_par) = c("participant")

  # to use the merge function, change the respective "participant" column name in each data frame to "participant"
    # OSpan Data
    colnames(ospan_final)[which(names(ospan_final) == "ospan_participant")] = c("participant")
    # Gng Data
    colnames(gng_final)[which(names(gng_final) == "gng_participant")] = c("participant")
    # Survey Data
    colnames(survey_final)[which(names(survey_final) == "sur_participant")] = c("participant")
    # Tracking Data
    colnames(track_final)[which(names(track_final) == "track_participant")] = c("participant")

# Merge all 4 data frames into 1 data frame -- "all_data"
  all_data = Reduce(function(x, y) merge(x, y, by="participant"), list(track_final, ospan_final, gng_final, dplyr::select(survey_final, -c(sur_location, sur_power, sur_condition))))
    # rename the tracking sheet columns for the location, power, and condition 
    colnames(all_data)[which(names(all_data) == c("track_location", "track_power", "track_condition"))] = c("location", "power", "condition")

2. All Data - Go/No-Go separated by Cue Type (WIDE FORMAT)

All the data from the present study in one data frame, with the Go/No-Go data separated by target type (i.e. either “go” or “no-go”), presented in WIDE FORMAT: “all_data_cue” (click to see more)

  • This data included the following:
    • participant, track_date, track_time, location, power, condition, balance, exclude, reason, pos_out, experimenter, ospan_ID, ospan_absolute_score, ospan_total_corr, ospan_math_err_tot, ospan_math_err_speed, ospan_math_err_accuracy, ospan_math_percent_corr, ospan_math_criteria_85, age_long, lang_long, trial_tot_v, score_v, resp_tot_v, no_resp_tot_v, err_tot_v, err_com_v, err_omi_v, rt_av_v, exclude_v, trial_tot_h, score_h, resp_tot_h, no_resp_tot_h, err_tot_h, err_com_h, err_omi_h, rt_av_h, exclude_h, date_start, date_end, duration.s., date_rec, gender, program, year, SUQ_use_1, SUQ_use_2, SUQ_use_3, SUQ_use_4, SUQ_use_5, SUQ_use_6, SUQ_use_7, SUQ_exp_1, SUQ_exp_2, SUQ_exp_3, SUQ_exp_4, SAD_1, SAD_2, SAD_3, SAD_4, SAD_5, SAD_6, SAD_7, SAD_8, SAD_9, SAD_10, SAD_11, SAD_12, SAD_13

This code organized the raw data into one data frame. (click to expand)

# use the list of all the participants to remove -- "all_remove" -- to filter the raw gng long data
  # for the "go" targets
  gng_v_final = dplyr::filter(gng_data_extracted_v, !participant_long %in% all_remove$all_participants)
  # for the "no-go" targets
  gng_h_final = dplyr::filter(gng_data_extracted_h, !participant_long %in% all_remove$all_participants)

# Combine all the final data into 1 data frame with the following:
  # use the final participant list -- "all_par" -- as the 1st column in the data frames

  # to use the merge function, change the respective "participant" column name in each data frame to "participant"
    # this was already done for the other data frames (e.g. ospan_final, survey_final)
    # for the "vertical" targets
    colnames(gng_v_final)[which(names(gng_v_final) == "participant_long")] = c("participant")
    # for the "horizontal" targets
    colnames(gng_h_final)[which(names(gng_h_final) == "participant_long")] = c("participant")

# Merge all 4 data frames into 1 data frame -- "all_data_cue"
  all_data_cue = Reduce(function(x, y) merge(x, y, by="participant"), list(track_final, ospan_final, gng_v_final, dplyr::select(gng_h_final, -c(age_long, lang_long)), dplyr::select(survey_final, -c(sur_location, sur_power, sur_condition))))
    # rename the tracking sheet columns for the location, power, and condition 
    colnames(all_data_cue)[which(names(all_data_cue) == c("track_location", "track_power", "track_condition"))] = c("location", "power", "condition")
    

3. All Data - Go/No-Go separated by Cue Type (LONG FORMAT)

All the data from the present study in one data frame, with the Go/No-Go data separated by target type (i.e. either “go” or “no-go”), presented in LONG FORMAT: “all_data_cue_long” (click to see more)

  • This data included the following:
    • cue, participant, track_date, track_time, location, power, condition, balance, exclude, reason, pos_out, experimenter, ospan_ID, ospan_absolute_score, ospan_total_corr, ospan_math_err_tot, ospan_math_err_speed, ospan_math_err_accuracy, ospan_math_percent_corr, ospan_math_criteria_85, age_long, lang_long, gng_trial_tot_long, gng_score_long, gng_resp_tot_long, gng_no_resp_tot_long, gng_err_tot_long, gng_err_com_long, gng_err_omi_long, gng_rt_av_long, gng_exclude_long, date_start, date_end, duration.s., date_rec, gender, program, year, SUQ_use_1, SUQ_use_2, SUQ_use_3, SUQ_use_4, SUQ_use_5, SUQ_use_6, SUQ_use_7, SUQ_exp_1, SUQ_exp_2, SUQ_exp_3, SUQ_exp_4, SAD_1, SAD_2, SAD_3, SAD_4, SAD_5, SAD_6, SAD_7, SAD_8, SAD_9, SAD_10, SAD_11, SAD_12, SAD_13

This code organized the raw data into one data frame. (click to expand)

# Re-structure the data into 1 data frame in a long format for a repeated measures analysis

  # using the "final" data frames of all the data, make 2 data frames, one for the "vertical" (all_data_v) and one for the "horizontal" (all_data_h) targets
    # for the "vertical" targets -- all_data_v
    all_data_v = Reduce(function(x, y) merge(x, y, by="participant"), list(track_final, ospan_final, gng_v_final, dplyr::select(survey_final, -c(sur_location, sur_power, sur_condition))))
      # rename the tracking sheet columns for the location, power, and condition 
      colnames(all_data_v)[which(names(all_data_v) == c("track_location", "track_power", "track_condition"))] = c("location", "power", "condition")
    # for the "horizontal" targets -- all_data_h
    all_data_h = Reduce(function(x, y) merge(x, y, by="participant"), list(track_final, ospan_final, gng_h_final, dplyr::select(survey_final, -c(sur_location, sur_power, sur_condition))))
      # rename the tracking sheet columns for the location, power, and condition 
      colnames(all_data_h)[which(names(all_data_h) == c("track_location", "track_power", "track_condition"))] = c("location", "power", "condition")

  # rename target-specific columns for both data frames (to bind)
    # for the "go" trials
    names(all_data_v)[names(all_data_v) == "trial_tot_v"] = "gng_trial_tot_long" 
    names(all_data_v)[names(all_data_v) == "score_v"] = "gng_score_long" 
    names(all_data_v)[names(all_data_v) == "resp_tot_v"] = "gng_resp_tot_long" 
    names(all_data_v)[names(all_data_v) == "no_resp_tot_v"] = "gng_no_resp_tot_long" 
    names(all_data_v)[names(all_data_v) == "err_tot_v"] = "gng_err_tot_long" 
    names(all_data_v)[names(all_data_v) == "err_com_v"] = "gng_err_com_long" 
    names(all_data_v)[names(all_data_v) == "err_omi_v"] = "gng_err_omi_long" 
    names(all_data_v)[names(all_data_v) == "rt_av_v"] = "gng_rt_av_long" 
    names(all_data_v)[names(all_data_v) == "exclude_v"] = "gng_exclude_long"
    # for the "no-go" trials
    names(all_data_h)[names(all_data_h) == "trial_tot_h"] = "gng_trial_tot_long" 
    names(all_data_h)[names(all_data_h) == "score_h"] = "gng_score_long" 
    names(all_data_h)[names(all_data_h) == "resp_tot_h"] = "gng_resp_tot_long" 
    names(all_data_h)[names(all_data_h) == "no_resp_tot_h"] = "gng_no_resp_tot_long" 
    names(all_data_h)[names(all_data_h) == "err_tot_h"] = "gng_err_tot_long" 
    names(all_data_h)[names(all_data_h) == "err_com_h"] = "gng_err_com_long" 
    names(all_data_h)[names(all_data_h) == "err_omi_h"] = "gng_err_omi_long" 
    names(all_data_h)[names(all_data_h) == "rt_av_h"] = "gng_rt_av_long" 
    names(all_data_h)[names(all_data_h) == "exclude_h"] = "gng_exclude_long"
  
  # Create a dataframe depicting the target type: either "go" (coded as 1) or "no-go" (coded as 2)
  cue_type = as.data.frame(c(rep(1, length(all_data_v$participant)), rep(2, length(all_data_h$participant))))
    colnames(cue_type) = c("cue")  

  # Bind the go and no-go dataframes into 1 (in long format)
  gng_cue_bind = dplyr::bind_rows(all_data_v, all_data_h)
  
  # add the target type column depicting the target type: either "go" (coded as 1) or "no-go" (coded as 2)
  all_data_cue_long = dplyr::bind_cols(cue_type, gng_cue_bind)
    

4. OSpan Analysis

All the data related to the OSpan Analysis from the present study in one data frame: “ospan_analysis” (click to see more)

  • This data included the following:
    • participant, age, gender, lang, program, year, location, power, balance, ospan_absolute_score, ospan_math_percent_corr, ospan_math_criteria_85

This code organized the final data into the ospan-related data frame. (click to expand)


# make a new data frame for the ospan analysis
ospan_analysis = dplyr::select(all_data, participant, age, gender, lang, program, year, location, power, balance, ospan_absolute_score, ospan_math_percent_corr, ospan_math_criteria_85)

5. Go/No-Go: Error Analysis

All the data related to the Go/No-Go Error Analysis from the present study in one data frame: “gng_err_analysis”

Note: this data is in a long format for a repeated measures analysis (i.e. based on target type: either “go” or “no-go”). (click to see more)

  • This data included the following:
    • participant, age_long, gender, lang_long, program, year, location, power, balance, cue, gng_exclude_long, gng_trial_tot_long, gng_resp_tot_long, gng_no_resp_tot_long, gng_err_tot_long, gng_err_com_long, gng_err_omi_long

This code organized the final long data into a data frame for the Go/No-Go Error Analysis. (click to expand)

# make a new data frame for the gng error analysis
gng_err_analysis = dplyr::select(all_data_cue_long, participant, age_long, gender, lang_long, program, year, location, power, balance, cue, gng_exclude_long, gng_trial_tot_long, gng_resp_tot_long, gng_no_resp_tot_long, gng_err_tot_long, gng_err_com_long, gng_err_omi_long)

6. Go/No-Go: Accuracy Analysis

All the data related to the Go/No-Go Accuracy Analysis from the present study in one data frame: “gng_ac_analysis” (click to see more)

  • This data included the following:
    • participant, age, gender, lang, program, year, location, power, balance, gng_trial_tot, gng_resp_tot, gng_no_resp_tot, gng_rt_av

This code organized the final data into a data frame related to the Go/No-Go Accuracy Analysis. (click to expand)


# make a new data frame for the gng accuracy analysis
gng_ac_analysis = dplyr::select(all_data, participant, age, gender, lang, program, year, location, power, balance, gng_trial_tot, gng_resp_tot, gng_no_resp_tot, gng_rt_av) 

7. SAD: Factor Analysis

All the data related to the Factor Analysis from the present study in one data frame: “factor_analysis” (click to see more)

  • This data included the following:
    • participant, age, gender, lang, program, year, location, power, balance, SAD_1, SAD_2, SAD_3, SAD_4, SAD_5, SAD_6, SAD_7, SAD_8, SAD_9, SAD_10, SAD_11, SAD_12, SAD_13

This code organized the final data into the factor analysis-related data frame. (click to expand)


# make a new data frame for the factor analysis
factor_analysis = dplyr::select(all_data, participant, age, gender, lang, program, year, location, power, balance, SAD_1:SAD_13)

8. SAD: Generalized Linear Model

All the data related to the GLM Analysis from the present study in one data frame: “glm_analysis” (click to see more)

  • This data included the following:
    • participant, age, gender, lang, program, year, location, power, balance, SAD_1, SAD_2, SAD_3, SAD_4, SAD_5, SAD_6, SAD_7, SAD_8, SAD_9, SAD_10, SAD_11, SAD_12, SAD_13

This code organized the final data into the GLM-related data frame. (click to expand)


# make a new data frame for the glm analysis
glm_analysis = dplyr::select(all_data, participant, age, gender, lang, program, year, location, power, balance, SAD_1:SAD_13)

Descriptives: Smartphone Use

all_data_des = dplyr::select(all_data, participant, track_date, track_time, age, gender, lang, program, year)

# summary of age
summary(all_data_des$age)
   Min. 1st Qu.  Median    Mean 3rd Qu.    Max. 
  17.00   18.00   19.00   18.89   19.00   38.00 
# frequency of age
plyr::count(all_data_des$age)

# show frequency for gender
  # define gender as factor
  all_data_des$gender = factor(all_data_des$gender, 
                               levels = c(1, 2), 
                               labels = c("Male", "Female"))
  # show counts
  plyr::count(all_data_des$gender)

# show freq of year of study
plyr::count(all_data_des$year)

# show freq of program
  # first define the program as a factor
    all_data_des$program = factor(all_data_des$program, 
                  levels = c(3, 4, 13, 14, 28, 32, 37, 12, 17, 19, 15, 20, 24, 23, 26, 1, 2, 5, 7, 8, 11, 16, 18, 22, 27, 29, 33, 36, 6, 9, 10, 21, 25, 30, 31, 34, 35),
                  labels = c(rep("Arts & Humanities", 7), "Engineering", rep("Health Sciences", 2), rep("Information & Media Studies", 3), "Medicine & Dentistry", "Music", rep("Science", 13), rep("Social Science", 9)))
  # then show counts
  plyr::count(all_data_des$program)
NA
NA

Planned Analysis

The following analyses were completed based on a pre-registered direct replication on OSF (view registration: https://osf.io/5fq4r) of Ward et al.’s (2017) second experiment. The analyses tested the three main hypotheses in the present study: (1) the location effect hypothesis, (2) the power effect hypothesis, and (3) the moderation effect hypothesis.

Each analysis contains the following:

  1. Descriptive Statistics
  2. Analysis (including any relevant assumption tests)
  3. Visualizations

Testing the Location and Power Hypotheses:

OSpan Task

A 3(smartphone location: desk, pocket/bag, outside) x 2(smartphone power: ON, OFF) between-subjects ANOVA was conducted (DV = OSpan absolute score).

Prior to completing the analysis, planned exclusion critera was applied to the data. Exclusion criteria for this analysis helped to control for any participants who did not follow the instructions, specifically, that participant payed attention to both the math component and letter recall component of the task. Only data from those who scored 85% or higher on the math component of the task were included.

  • Exclude data from any partitipants with (1) missing data and/or (2) the math criteria (i.e. minimum 85% on math accuracy). There was a total of 389 participants in the OSpan analysis.
    • Missing Data: 3 participants were removed due to missing ospan data.
    • Math Criteria (min. 85%): 20 participants were removed because they did not meet the math criteria.
# filter any participants with missing OSpan data for the analysis  
    # for all ps, excluding any p with missing data in the OSpan Absolute Score ONLY
    ospan_analysis_noNA = ospan_analysis[!is.na(ospan_analysis$ospan_absolute_score), ] 
    # for all ps, excluding any that do not meet the math criteria (min 85% accuracy)
    ospan_analysis_final = ospan_analysis_noNA[ospan_analysis_noNA[,"ospan_math_criteria_85"] == 1,]

Descriptive Statistics

The following table depicts the descriptive statistics for the OSpan analysis: the mean (M), standard deviation (SD), minimum score (min.), maximum score (max.), and the sample size (n).


# create a data frame with the descriptives of all the data, grouped by the location & power conditions
  # this creates information for the 6 possible combined conditions
  ospan_des = psych::describeBy(ospan_analysis_final, list(ospan_analysis_final$location, ospan_analysis_final$power), mat = TRUE)  
  
# organize the data into a new data frame depicting the M, SD, min., max., and n.
  ospan_des_t = rbind((cbind(t(ospan_des["ospan_absolute_score1",c("mean", "sd", "min", "max", "n")]),
                             t(ospan_des["ospan_absolute_score2",c("mean", "sd", "min", "max", "n")]),
                             t(ospan_des["ospan_absolute_score3",c("mean", "sd", "min", "max", "n")]))), 
                      (cbind(t(ospan_des["ospan_absolute_score4",c("mean", "sd", "min", "max", "n")]),
                             t(ospan_des["ospan_absolute_score5",c("mean", "sd", "min", "max", "n")]),
                             t(ospan_des["ospan_absolute_score6",c("mean", "sd", "min", "max", "n")]))))
    colnames(ospan_des_t) = c("On Desk", "In Pocket/Bag", "Outside")
    rownames(ospan_des_t) = rep(c("M", "SD", "min.", "max.", "n"), 2)
    
# use kable to show the descriptives table
  kable(ospan_des_t, caption = "Descriptive Statistics: Absolute Score on the OSpan Task", digits = 4, align = 'c') %>%
  kable_styling(bootstrap_options = "striped", "hover", full_width = F) %>%
  add_header_above(c(" ", "Smartphone Location" = 3)) %>%
  group_rows(index = c("Smartphone Power ON" = 5, "Smartphone Power OFF" = 5))

Descriptive Statistics: Absolute Score on the OSpan Task
Smartphone Location
On Desk In Pocket/Bag Outside
Smartphone Power ON
M 41.4571 42.7353 38.6393
SD 16.2157 15.3259 18.1338
min. 6.0000 6.0000 4.0000
max. 75.0000 75.0000 75.0000
n 70.0000 68.0000 61.0000
Smartphone Power OFF
M 40.7458 40.7761 43.6250
SD 18.7650 17.5187 15.6788
min. 0.0000 3.0000 0.0000
max. 75.0000 75.0000 75.0000
n 59.0000 67.0000 64.0000

NA

Analysis

To assess if OSpan absolute score statistically differed, we conducted a 3x2 ANOVA with the between-subjects factors of smartphone location (i.e. on desk, in pocket/bag, or outside) and power (i.e. on or off). Since there are unequal sample sizes, we used Type III sum of squares.

Assumptions

The ANOVA made three assumptions. All assumptions were met (click to see details).

1. Independent Random Sampling

This assumption was met during testing.

2. Normality

This assumption was tested by visualizing the residuals, applying a Shapiro-Wilk test to the resdiuals, and observing the Skewness and Kurtosis of the data (extracted from the ANOVA output).


# get the required data from the ospan_analysis data: participant, location, power, and score
ospan_anova_data = ospan_analysis_final[, c("participant", "location", "power", "balance", "ospan_absolute_score")]
    colnames(ospan_anova_data) = c("participant", "location", "power", "balance", "score")
    rownames(ospan_anova_data) = NULL
    # reformat location,  power, and balance as factors  
    ospan_anova_data$location = factor(ospan_anova_data$location, 
                  levels = c(1, 2, 3),
                  labels = c("On Desk", "Pocket/Bag", "Outside"))
    ospan_anova_data$power = factor(ospan_anova_data$power, 
                  levels = c(1, 2),
                  labels = c("ON", "OFF"))
    ospan_anova_data$balance = factor(ospan_anova_data$balance, 
                  levels = c(1, 2),
                  labels = c("Ospan First", "Go/No-Go First"))

# run between-subjects ANOVA (IV: Smartphone Power & Location, Balance; DV: OSpan Abolute Score)
ospan_anova = ezANOVA(
  data = ospan_anova_data
  , dv = .(score)
  , wid = .(participant)
  , between = .(location, power, balance)
  , type = 3 # unequal sample sizes, therfore use type 3
  , detailed = TRUE
  , return_aov = TRUE
)

# calculate & extract the residuals from the ANOVA
ospan_res = data.frame("residuals" = proj(ospan_anova$aov)[,"Residuals"])   
  • It appears that the residuals lie closely to a straight line, which suggests normality (see visualization below).

qqnorm(ospan_res$residuals, main = "Q-Q Plot of the OSpan Residuals") 
qqline(ospan_res$residuals)

  • It appears that residuals tend to follow the shape of a normal distribution, with a slight skew to the right (see visualization below).

qplot(ospan_res$residuals, main = "Histogram of Ospan Residuals") + 
  theme_classic()

  • The Shapiro-Wilk test of normality was significant (based on \(\alpha = 0.05\)), W = 0.99, p = .004 (see code below).

ospan_shap = shapiro.test(ospan_res$residuals)
ospan_shap

    Shapiro-Wilk normality test

data:  ospan_res$residuals
W = 0.98848, p-value = 0.003638
  • The data’s Skewness and Kurtosis was examined and showed that both the Skewness (range: -0.42 to -0.01) and Kurtosis (range: -0.83 to 0.1) were within the acceptable range of \(\pm 2.0\) and \(\pm 9.0\), respectively (Schmider, Ziegler, Danay, Beyer, & Bühner, 2010) (see table below).
# use the "ospan_des" data frame from the descriptives

# organize the data into a new data frame depicting the Skew, kurtosis, and sample size (n).
  ospan_sk_t = rbind((cbind(t(ospan_des["ospan_absolute_score1", c("skew", "kurtosis", "n")]),
                             t(ospan_des["ospan_absolute_score2", c("skew", "kurtosis", "n")]),
                             t(ospan_des["ospan_absolute_score3", c("skew", "kurtosis", "n")]))), 
                      (cbind(t(ospan_des["ospan_absolute_score4", c("skew", "kurtosis", "n")]),
                             t(ospan_des["ospan_absolute_score5", c("skew", "kurtosis", "n")]),
                             t(ospan_des["ospan_absolute_score6", c("skew", "kurtosis", "n")]))))
    colnames(ospan_sk_t) = c("On Desk", "In Pocket/Bag", "Outside")
    rownames(ospan_sk_t) = rep(c("Skewness", "Kurtosis", "n"), 2)
    
# use kable to show the descriptives table
  kable(ospan_sk_t, caption = "Skewness and Kurtosis of the Absolute Score on the OSpan Task", digits = 4, align = 'c') %>%
  kable_styling(bootstrap_options = "striped", "hover", full_width = F) %>%
  add_header_above(c(" ", "Smartphone Location" = 3)) %>%
  group_rows(index = c("Smartphone Power ON" = 3, "Smartphone Power OFF" = 3))

Skewness and Kurtosis of the Absolute Score on the OSpan Task
Smartphone Location
On Desk In Pocket/Bag Outside
Smartphone Power ON
Skewness -0.2342 -0.4151 -0.0056
Kurtosis -0.7095 -0.2257 -0.8323
n 70.0000 68.0000 61.0000
Smartphone Power OFF
Skewness -0.3900 -0.2216 -0.4169
Kurtosis -0.8277 -0.7885 0.1035
n 59.0000 67.0000 64.0000

NA
  • Therefore, based on all the normality assumption checks, normality was assumed to be within the acceptable range for the ANOVA analysis. Additionally, it should be noted that ANOVA analyses can be robust to violations of normality (Gardner & Tremblay, 2007).

3. Homogeneity of Variance

This assumption was tested by conducting Levene’s Test for Homogeneity of Variance for each between-subjects variable (i.e. smartphone location and smartphone power).

  • The homogeneity assumption was met (based on \(\alpha = 0.05\)) for smartphone location, F(2, 386) = 0.31, p = = .74, smartphone power, F(1, 387) = 0.21, p = .65, and balance, F(1, 387) = 0, p = .99(see code below).

# conduct levene's
  # for location
  ospan_lev_loc = leveneTest(data = ospan_anova_data, score ~ as.factor(location), center = median)
  ospan_lev_loc
Levene's Test for Homogeneity of Variance (center = median)
       Df F value Pr(>F)
group   2  0.3054  0.737
      386               
  # for power
  ospan_lev_pow = leveneTest(data = ospan_anova_data, score ~ as.factor(power), center = median)
  ospan_lev_pow
Levene's Test for Homogeneity of Variance (center = median)
       Df F value Pr(>F)
group   1  0.2076 0.6489
      387               
  # for balance
  ospan_lev_bal = leveneTest(data = ospan_anova_data, score ~ as.factor(balance), center = median)
  ospan_lev_bal
Levene's Test for Homogeneity of Variance (center = median)
       Df F value Pr(>F)
group   1   3e-04 0.9858
      387               
Results

All results were based on \(\alpha = 0.05\). Below are the reported statistics for both main effects and the interactions, followed by a table depicting the ANOVA results.

Main Effect of Smartphone Location

There was no significant main effect of smartphone location on Ospan absolute score, F(2, 377) = 0.12, p = .89, \(\eta^{2}\) = .001.

Main Effect of Smartphone Power

There was no significant main effect of smartphone power on Ospan absolute score, F(1, 377) = 0.05, p = .83, \(\eta^{2}\) < .001.

Interaction of Balance

There was no significant main effect of balance (i.e. which task was completed first) on Ospan absolute score, F(1, 377) = 2.72, p = .1, \(\eta^{2}\) = .01.

Interaction of Smartphone Location & Smartphone Power

There was no significant interaction between smartphone location and smartphone power on Ospan absolute score, F(2, 377) = 1.41, p = .24, \(\eta^{2}\) = .01.

Interaction of Smartphone Location & Balance

There was no significant interaction between smartphone location and balance (i.e. which task was completed first) on Ospan absolute score, F(2, 377) = 2.18, p = .11, \(\eta^{2}\) = .01.

Interaction of Smartphone Power & Balance

There was no significant interaction between smartphone power and balance (i.e. which task was completed first) on Ospan absolute score, F(1, 377) = 1.14, p = .29, \(\eta^{2}\) = .003.

Interaction of Smartphone Location, Smartphone Power, & Balance

There was no significant interaction between smartphone location, smartphone power, and balance (i.e. which task was completed first) on Ospan absolute score, F(2, 377) = 2.64, p = .07, \(\eta^{2}\) = .01.

Post-hoc Tests

Since there was no significant main or interaction effects, no post-hocs were completed.

# show ANOVA results in kable table 
kable(ospan_anova$ANOVA, caption = "3x2 ANOVA on Ospan Absolute Score", digits = 4, align = 'c') %>%
  kable_styling(bootstrap_options = "striped", "hover", full_width = F) 

3x2 ANOVA on Ospan Absolute Score
Effect DFn DFd SSn SSd F p p<.05 ges
(Intercept) 1 377 645211.1638 106102.5 2292.5429 0.0000 * 0.8588
location 2 377 67.1897 106102.5 0.1194 0.8875 0.0006
power 1 377 12.6653 106102.5 0.0450 0.8321 0.0001
balance 1 377 764.5825 106102.5 2.7167 0.1001 0.0072
location:power 2 377 794.8131 106102.5 1.4121 0.2449 0.0074
location:balance 2 377 1229.6413 106102.5 2.1846 0.1139 0.0115
power:balance 1 377 321.3575 106102.5 1.1418 0.2859 0.0030
location:power:balance 2 377 1486.1928 106102.5 2.6403 0.0727 0.0138

NA

Visualizations

The following shows a visulization of the Ospan absolute score based on the two smartphone conditions: location and power. Note: Since the balance factor (i.e. which task was completed first) did not have a significant main effect or interaction effect on the data, this factor was not visualized in the plots

Cue-Dependent Go/No-Go Task

An error (i.e. omission errors) and accuracy (i.e. RT) analysis was completed using the omission errors and average RT for each participant.

Error Analysis

A 3(Smartphone location: desk, pocket/bag, outside) x 2(Smartphone power: ON, OFF) x 2(Cue type: Vertical, Horizontal) mixed ANOVA was completed (DV = omission error(s)). The between-subjects factors were smartphone location and power, and the within-subjects factor was pre-target cue type (i.e. “cue type”; either “vertical” or “horizontal”).

Prior to completing the analysis, planned exclusion critera was applied to the data. Exclusion criteria for this analysis helped to control for any participants who did not follow the instructions. Only data from those who scored higher than chance performance (i.e. responded to at least 50% of “Go” trials and withheld response to at least 50% of “No-Go” trials) were used for the final analysis.

  • Exclude data from any partitipants with (1) missing data and/or (2) the response criteria, where participants scored higher than chance performance (i.e. responded to at least 50% of “Go” trials and withheld response to at least 50% of “No-Go” trials). There was a total of 403 participants in the Go/No-Go Error analysis.
    • Missing Data: 0 participants were removed due to missing data.
    • Response Criteria: 6 participants were removed because they did not meet the response criteria.
# filter any participants with missing data or that do not meet the inclusionn criteria

  # for all ps, excluding any p with missing data in the Omission Error 
  gng_err_noNA = gng_err_analysis[!is.na(gng_err_analysis$gng_err_omi_long),]
  
  # for all ps, excluding any that scored below chance performance (i.e. respondd to at least 50% of the "go" trials and withheld response to at least 50% of the "no-go" trials) using the "exclude" column
  gng_err_final = gng_err_noNA[gng_err_noNA$gng_exclude_long == 2,]
Descriptive Statistics

The following table depicts the descriptive statistics for the Go/No-Go Error Analysis: the mean (M), standard deviation (SD), minimum score (min.), maximum score (max.), and the sample size (n).


# create a data frame with the descriptives of all the data, grouped by the location & power conditions
  # this creates information for the 6 possible combined conditions
  # for the "vertical" trials
  gng_err_des_v = describeBy(gng_err_final[gng_err_final$cue == 1,], 
                           list(gng_err_final[gng_err_final$cue == 1,]$location, 
                                gng_err_final[gng_err_final$cue == 1,]$power), mat = TRUE) 
  # for the "horizontal" trials
  gng_err_des_h = describeBy(gng_err_final[gng_err_final$cue == 2,], 
                           list(gng_err_final[gng_err_final$cue == 2,]$location, 
                                gng_err_final[gng_err_final$cue == 2,]$power), mat = TRUE) 
  
# organize the data into a new data frame depicting the M, SD, min., max., and n.
  # for the "vertical" trials
  gng_err_des_v_t = rbind((cbind(t(gng_err_des_v["gng_err_omi_long1",c("mean", "sd", "min", "max", "n")]),
                             t(gng_err_des_v["gng_err_omi_long2",c("mean", "sd", "min", "max", "n")]),
                             t(gng_err_des_v["gng_err_omi_long3",c("mean", "sd", "min", "max", "n")]))), 
                      (cbind(t(gng_err_des_v["gng_err_omi_long4",c("mean", "sd", "min", "max", "n")]),
                             t(gng_err_des_v["gng_err_omi_long5",c("mean", "sd", "min", "max", "n")]),
                             t(gng_err_des_v["gng_err_omi_long6",c("mean", "sd", "min", "max", "n")]))))
    colnames(gng_err_des_v_t) = c("On Desk", "In Pocket/Bag", "Outside")
    rownames(gng_err_des_v_t) = rep(c("M", "SD", "min.", "max.", "n"), 2)
  # for the "horizontal" trials
  gng_err_des_h_t = rbind((cbind(t(gng_err_des_h["gng_err_omi_long1",c("mean", "sd", "min", "max", "n")]),
                             t(gng_err_des_h["gng_err_omi_long2",c("mean", "sd", "min", "max", "n")]),
                             t(gng_err_des_h["gng_err_omi_long3",c("mean", "sd", "min", "max", "n")]))), 
                      (cbind(t(gng_err_des_h["gng_err_omi_long4",c("mean", "sd", "min", "max", "n")]),
                             t(gng_err_des_h["gng_err_omi_long5",c("mean", "sd", "min", "max", "n")]),
                             t(gng_err_des_h["gng_err_omi_long6",c("mean", "sd", "min", "max", "n")]))))
    colnames(gng_err_des_h_t) = c("On Desk", "In Pocket/Bag", "Outside")
    rownames(gng_err_des_h_t) = rep(c("M", "SD", "min.", "max.", "n"), 2)
    
# use kable to show the descriptives table
  gng_err_v_kt = 
    kable(gng_err_des_v_t, caption = "Descriptive Statistics: Omission Errors on the Go/No-Go Task for 'Vertical' Trials", digits = 4, align = 'c') %>%
    kable_styling(bootstrap_options = "striped", "hover", full_width = F) %>%
    add_header_above(c(" ", "Smartphone Location" = 3)) %>%
    group_rows(index = c("Smartphone Power ON" = 5, "Smartphone Power OFF" = 5))
  
  gng_err_h_kt = 
    kable(gng_err_des_h_t, caption = "Descriptive Statistics: Omission Errors on the Go/No-Go Task for 'Horizontal' Trials", digits = 4, align = 'c') %>%
    kable_styling(bootstrap_options = "striped", "hover", full_width = F) %>%
    add_header_above(c(" ", "Smartphone Location" = 3)) %>%
    group_rows(index = c("Smartphone Power ON" = 5, "Smartphone Power OFF" = 5))
  
  gng_err_v_kt

Descriptive Statistics: Omission Errors on the Go/No-Go Task for 'Vertical' Trials
Smartphone Location
On Desk In Pocket/Bag Outside
Smartphone Power ON
M 0.6849 0.4085 1.2273
SD 2.1140 0.8877 5.3430
min. 0.0000 0.0000 0.0000
max. 14.0000 4.0000 43.0000
n 73.0000 71.0000 66.0000
Smartphone Power OFF
M 0.9333 0.5522 0.3939
SD 3.0413 2.8830 1.0058
min. 0.0000 0.0000 0.0000
max. 20.0000 23.0000 6.0000
n 60.0000 67.0000 66.0000

  gng_err_h_kt

Descriptive Statistics: Omission Errors on the Go/No-Go Task for 'Horizontal' Trials
Smartphone Location
On Desk In Pocket/Bag Outside
Smartphone Power ON
M 0.1233 0.1268 0.4091
SD 0.3706 0.3753 1.3809
min. 0.0000 0.0000 0.0000
max. 2.0000 2.0000 10.0000
n 73.0000 71.0000 66.0000
Smartphone Power OFF
M 0.1167 0.2090 0.1667
SD 0.4903 0.9135 0.4501
min. 0.0000 0.0000 0.0000
max. 3.0000 7.0000 2.0000
n 60.0000 67.0000 66.0000

NA
Analysis

To assess if the omission errors statistically differed, we conducted a 3x2x2 mixed ANOVA with the between-subjects factors of smartphone location (i.e. on desk, in pocket/bag, or outside) and power (i.e. on or off), and the within-subjects factor of cue type (i.e. “vertical” or “horizontal”). Since there are unequal sample sizes, we used Type III sum of squares.

Assumptions

The mixed ANOVA made four assumptions. The normality assumption was not met. Therefore, a robust ANOVA was completed to assess if the omission errors statistically differed between the factors (i.e. smartphone location, smartphone power, and cue type) (click to see details).

1. Independent Random Sampling

This assumption was met during testing.

2. Normality

This assumption was tested by visualizing the residuals, applying a Shapiro-Wilk test to the resdiuals, and observing the Skewness and Kurtosis of the data (extracted from the ANOVA output).


# get the required data from the gng_err_final data: participant, location, power, and score
gng_err_anova_data = gng_err_final[,c("participant", "location", "power", "cue", "balance", "gng_err_omi_long")]
    colnames(gng_err_anova_data) = c("participant", "location", "power", "cue", "balance", "err_omi")
    rownames(gng_err_anova_data) = NULL
    gng_err_anova_data$location = as.factor(gng_err_anova_data$location)
    gng_err_anova_data$power = as.factor(gng_err_anova_data$power)
    gng_err_anova_data$cue = as.factor(gng_err_anova_data$cue)
    gng_err_anova_data$balance = as.factor(gng_err_anova_data$balance)

# run mixed ANOVA (IV b/w: Smartphone Power & Location; IV w/in: Target Type; DV: Omission Errors)
gng_err_anova = ezANOVA(
  data = gng_err_anova_data
  , dv = .(err_omi)
  , wid = .(participant)
  , within = .(cue)
  , between = .(location, power, balance)
  , type = 3 # unequal sample sizes, therefore use type 3
  , detailed = TRUE
  , return_aov = TRUE
)

# calculate & extract the residuals from the ANOVA
gng_err_res = data.frame("residuals" = proj(gng_err_anova$aov)[[3]][, "Residuals"])
  • It appears that the residuals lie closely to a straight line (i.e. only a couple lie away form the line), which suggests normality (see visualization below).

qqnorm(gng_err_res$residuals, main = "Q-Q Plot of the Go/No-Go Error Residuals") 
qqline(gng_err_res$residuals)

  • It appears that residuals are leptokurtic (i.e. there is a high peak near the center) (see visualization below).

qplot(gng_err_res$residuals, main = "Histogram of Go/No-Go Error Residuals", binwidth = .15) + 
  theme_classic()

  • The Shapiro-Wilk test of normality was significant (based on \(\alpha = 0.05\)), W = 0.42, p < .001 (see code below).

gng_err_shap = shapiro.test(gng_err_res$residuals)
gng_err_shap

    Shapiro-Wilk normality test

data:  gng_err_res$residuals
W = 0.42168, p-value < 2.2e-16
  • The data’s Skewness and Kurtosis was examined. For the vertical cue type, both the Skewness (range: 2.65 to 7.01) and Kurtosis (range: 7.36 to 51.13) were outside of the acceptable range of \(\pm 2.0\) and \(\pm 9.0\), respectively (Schmider, Ziegler, Danay, Beyer, & Bühner, 2010). For the horizontal cue type, both the Skewness (range: 1.73 to 7.54) and Kurtosis (range: 2.45 to 58.7) were outside of the acceptable range of \(\pm 2.0\) and \(\pm 9.0\), respectively (Schmider, Ziegler, Danay, Beyer, & Bühner, 2010) (see table below).
# create & show a table of the skewness & kurtosis for the gng data, separated by cue type (i.e. one for the vertical and horizontal trials)

#--FOR THE VERTICAL CUE TYPE
# use the "gng_err_des_v" data frame from the descriptives

# organize the data into a new data frame depicting the Skew, kurtosis, and sample size (n).
  gng_err_sk_v_t = rbind((cbind(t(gng_err_des_v["gng_err_tot_long1", c("skew", "kurtosis", "n")]),
                             t(gng_err_des_v["gng_err_tot_long2", c("skew", "kurtosis", "n")]),
                             t(gng_err_des_v["gng_err_tot_long3", c("skew", "kurtosis", "n")]))), 
                      (cbind(t(gng_err_des_v["gng_err_tot_long4", c("skew", "kurtosis", "n")]),
                             t(gng_err_des_v["gng_err_tot_long5", c("skew", "kurtosis", "n")]),
                             t(gng_err_des_v["gng_err_tot_long6", c("skew", "kurtosis", "n")]))))
    colnames(gng_err_sk_v_t) = c("On Desk", "In Pocket/Bag", "Outside")
    rownames(gng_err_sk_v_t) = rep(c("Skewness", "Kurtosis", "n"), 2)
    
# use kable to show the descriptives table
  gng_err_sk_v = kable(gng_err_sk_v_t, caption = "Skewness and Kurtosis of the Omission Errors on the Go/No-Go Task for 'Vertical' Trials", digits = 4, align = 'c') %>%
    kable_styling(bootstrap_options = "striped", "hover", full_width = F) %>%
    add_header_above(c(" ", "Smartphone Location" = 3)) %>%
    group_rows(index = c("Smartphone Power ON" = 3, "Smartphone Power OFF" = 3))
  
#--FOR THE HORIZONTAL CUE TYPE
# use the "gng_err_des_h" data frame from the descriptives

# organize the data into a new data frame depicting the Skew, kurtosis, and sample size (n).
  gng_err_sk_h_t = rbind((cbind(t(gng_err_des_h["gng_err_tot_long1", c("skew", "kurtosis", "n")]),
                             t(gng_err_des_h["gng_err_tot_long2", c("skew", "kurtosis", "n")]),
                             t(gng_err_des_h["gng_err_tot_long3", c("skew", "kurtosis", "n")]))), 
                      (cbind(t(gng_err_des_h["gng_err_tot_long4", c("skew", "kurtosis", "n")]),
                             t(gng_err_des_h["gng_err_tot_long5", c("skew", "kurtosis", "n")]),
                             t(gng_err_des_h["gng_err_tot_long6", c("skew", "kurtosis", "n")]))))
    colnames(gng_err_sk_h_t) = c("On Desk", "In Pocket/Bag", "Outside")
    rownames(gng_err_sk_h_t) = rep(c("Skewness", "Kurtosis", "n"), 2)
    
# use kable to show the descriptives table
  gng_err_sk_h = kable(gng_err_sk_h_t, caption = "Skewness and Kurtosis of the Omission Errors on the Go/No-Go Task for 'Horizonatal' Trials", digits = 4, align = 'c') %>%
    kable_styling(bootstrap_options = "striped", "hover", full_width = F) %>%
    add_header_above(c(" ", "Smartphone Location" = 3)) %>%
    group_rows(index = c("Smartphone Power ON" = 3, "Smartphone Power OFF" = 3))
  
# show the tables
  # for the Vertical Cue Type
  gng_err_sk_v

Skewness and Kurtosis of the Omission Errors on the Go/No-Go Task for 'Vertical' Trials
Smartphone Location
On Desk In Pocket/Bag Outside
Smartphone Power ON
Skewness 3.4795 2.6488 7.0078
Kurtosis 13.3863 7.3608 51.1329
n 73.0000 71.0000 66.0000
Smartphone Power OFF
Skewness 4.2401 6.9556 2.9178
Kurtosis 20.7613 50.5909 9.9703
n 60.0000 67.0000 66.0000

  # for the Horizonatal Cue Type
  gng_err_sk_h

Skewness and Kurtosis of the Omission Errors on the Go/No-Go Task for 'Horizonatal' Trials
Smartphone Location
On Desk In Pocket/Bag Outside
Smartphone Power ON
Skewness 7.5440 2.7262 3.9242
Kurtosis 58.7026 8.6904 20.5069
n 73.0000 71.0000 66.0000
Smartphone Power OFF
Skewness 2.1496 7.4708 1.7321
Kurtosis 4.8818 56.3296 2.4486
n 60.0000 67.0000 66.0000

NA
  • Therefore, based on all the normality assumption checks, normality was not assumed to be within the acceptable range for the ANOVA analysis.

3. Homogeneity of Variance

  • This assumption was tested by conducting Levene’s Test for Homogeneity of Variance for each between-subjects variable (i.e. smartphone location, smartphone power, and balance).

  • The homogeneity assumption was met (based on \(\alpha = 0.05\)) for both smartphone location, F(2, 803) = 0.77, p = .47, smartphone power, F(1, 804) = 0.41, p = .52, and balance, F(1, 804) = 1.21, p = .27.(see code below).


# conduct levene's
  # for location
  gng_err_lev_loc = leveneTest(data = gng_err_anova_data, err_omi ~ as.factor(location), center = median)
  gng_err_lev_loc
Levene's Test for Homogeneity of Variance (center = median)
       Df F value Pr(>F)
group   2  0.7651 0.4656
      803               
  # for power
  gng_err_lev_pow = leveneTest(data = gng_err_anova_data, err_omi ~ as.factor(power), center = median)
  gng_err_lev_pow
Levene's Test for Homogeneity of Variance (center = median)
       Df F value Pr(>F)
group   1  0.4097 0.5223
      804               
  # for balance
  gng_err_lev_bal = leveneTest(data = gng_err_anova_data, err_omi ~ as.factor(balance), center = median)
  gng_err_lev_bal
Levene's Test for Homogeneity of Variance (center = median)
       Df F value Pr(>F)
group   1  1.2119 0.2713
      804               

4. Sphericity

This assumption was met since sphericity is always met for two levels of a repeated measure factor and is, therefore, unnecessary to evaluate (Girden, 1992).

Results

Since the normality assumption was not met for the mixed ANOVA, a robust ANOVA was completed. This was done using the “welchADF” package (more details on this package: https://journal.r-project.org/archive/2017/RJ-2017-049/index.html)). The results from this analysis are shown below, followed by a table depicting the statistics.


# compute the main effects
gng_err_wel = welchADF.test(formula = err_omi ~ location + power + balance + cue, response = "err_omi", between.s = .(location, power, balance), within.s = .(cue), subject = "participant", contrast = c("omnibus", "all.pairwise"), data = gng_err_anova_data, effect.size = TRUE, trimming = TRUE)

# NOTE:to get the effect sizes, the package will allow you to get the pairwise (i.e. post-hoc) effect sizes 
  # the robust ANOVA above does not automatically get the effect sizes 
  # Note: it must be done for each factor separately with the following as a guide
# welchADF.test(formula = err_omi ~ location + power + balance + cue, response = "err_omi", between.s = .(location, power, balance), within.s = .(cue), subject = "participant", contrast = "all.pairwise", data = gng_err_anova_data, effect.size = TRUE, trimming = TRUE, effect = "cue")

#welchADF.test(formula = err_omi ~ location + power + cue, data = gng_err_anova_data, effect = "cue", contrast = "all.pairwise", trimming = TRUE, effect.size = TRUE)

# add all the interaction effects
gng_err_robust = update.default(gng_err_wel, .~location*power*balance*cue)

All results were base on \(\alpha = 0.05\). Below are the reported statistics for both main effects and the interaction, followed by a table depicting the ANOVA results.

Main Effect of Smartphone Location

There was no significant main effect of smartphone location on omission errors, WJ(2, 117.22) = 1.26, p = .29.

Main Effect of Smartphone Power

There was no significant main effect of smartphone power on omission errors, WJ(1, 193.68) = 0.91, p = .34.

Main Effect of Smartphone Balance

There was no significant main effect of balance (i.e. which task was completed first) on omission errors, WJ(1, 193.68) = 0.81, p = .37`.

Main Effect of Smartphone Cue Type

There was no significant main effect of cue type on omission errors, WJ(1, 193.68) = 3.3, p = .07`.

Interaction of Smartphone Location & Power

There was no significant interaction between smartphone location and power on omission errors, WJ(2, 117.22) = 0.37, p = .69`.

Interaction of Smartphone Location & Balance

There was no significant interaction between smartphone location and balance (i.e. which task was completed first) on omission errors, WJ(2, 117.22) = 0.87, p = .42`.

Interaction of Smartphone Power & Balance

There was no significant interaction between smartphone power and balance (i.e. which task was completed first) on omission errors, WJ(1, 193.68) = 0.59, p = .44`.

Interaction of Smartphone Location & Cue Type

There was no significant interaction between smartphone location and cue type on omission errors, WJ(2, 117.22) = 0.56, p = .57`.

Interaction of Smartphone Power & Cue Type

There was no significant interaction between smartphone power and cue type on omission errors, WJ(1, 193.68) = 0.94, p = .33`.

Interaction of Balance & Cue Type

There was no significant interaction between balance (i.e. which task was completed first) and cue type on omission errors, WJ(1, 193.68) = 0.19, p = .67`.

Interaction of Smartphone Location, Smartphone Power, & Balance

There was no significant interaction between smartphone location, smartphone power, and balance (i.e. which task was completed first) on omission errors, WJ(2, 117.22) = 0.5, p = .61`.

Interaction of Smartphone Location, Smartphone Power, & Cue Type

There was no significant interaction between smartphone location, smartphone power, and cue type on omission errors, WJ(2, 117.22) = 0.39, p = .68`.

Interaction of Smartphone Location, Balance, & Cue Type

There was no significant interaction between smartphone location, balance (i.e. which task was completed first), and cue type on omission errors, WJ(2, 117.22) = 0.43, p = .65`.

Interaction of Smartphone Power, Balance, & Cue Type

There was no significant interaction between smartphone power, balance (i.e. which task was completed first), and cue type on omission errors, WJ(1, 193.68) = 0.62, p = .43`.

Interaction of Smartphone Location, Smartphone Power, Balance, & Cue Type

There was no significant interaction between smartphone location, smartphone power, balance (i.e. which task was completed first), and cue type on omission errors, WJ(2, 117.22) = 0.52, p = .6`.

Post-hoc Tests

Since there was no significant main or interaction effects, no post-hocs were completed.


# create data frame from robust ANOVA results to display
  # get the Numerator DF (DFn)
  gng_err_robust_dfn = as.numeric(c(gng_err_robust$location$numeratorDF, gng_err_robust$power$numeratorDF, 
                                  gng_err_robust$balance$numeratorDF, gng_err_robust$cue$numeratorDF, 
                                  gng_err_robust$`location:power`$numeratorDF, gng_err_robust$`location:balance`$numeratorDF, 
                                  gng_err_robust$`power:balance`$numeratorDF, gng_err_robust$`location:cue`$numeratorDF, 
                                  gng_err_robust$`power:cue`$numeratorDF, gng_err_robust$`balance:cue`$numeratorDF, 
                                  gng_err_robust$`location:power:balance`$numeratorDF,
                                  gng_err_robust$`location:power:cue`$numeratorDF,
                                  gng_err_robust$`location:balance:cue`$numeratorDF, 
                                  gng_err_robust$`power:balance:cue`$numeratorDF, 
                                  gng_err_robust$`location:power:balance:cue`$numeratorDF))
  # get the Denominator DF (DFd)
  gng_err_robust_dfd = as.numeric(c(gng_err_robust$location$denominatorDF, gng_err_robust$power$denominatorDF, 
                                  gng_err_robust$balance$denominatorDF, gng_err_robust$cue$denominatorDF, 
                                  gng_err_robust$`location:power`$denominatorDF, gng_err_robust$`location:balance`$denominatorDF,
                                  gng_err_robust$`power:balance`$denominatorDF, gng_err_robust$`location:cue`$denominatorDF, 
                                  gng_err_robust$`power:cue`$denominatorDF, gng_err_robust$`balance:cue`$denominatorDF, 
                                  gng_err_robust$`location:power:balance`$denominatorDF,
                                  gng_err_robust$`location:power:cue`$denominatorDF,
                                  gng_err_robust$`location:balance:cue`$denominatorDF, 
                                  gng_err_robust$`power:balance:cue`$denominatorDF, 
                                  gng_err_robust$`location:power:balance:cue`$denominatorDF))
  # get the WJ statistic
  gng_err_robust_WJ = as.numeric(c(gng_err_robust$location$welch.T, gng_err_robust$power$welch.T, 
                                  gng_err_robust$balance$welch.T, gng_err_robust$cue$welch.T, 
                                  gng_err_robust$`location:power`$welch.T, gng_err_robust$`location:balance`$welch.T,
                                  gng_err_robust$`power:balance`$welch.T, gng_err_robust$`location:cue`$welch.T, 
                                  gng_err_robust$`power:cue`$welch.T, gng_err_robust$`balance:cue`$welch.T, 
                                  gng_err_robust$`location:power:balance`$welch.T,
                                  gng_err_robust$`location:power:cue`$welch.T,
                                  gng_err_robust$`location:balance:cue`$welch.T, 
                                  gng_err_robust$`power:balance:cue`$welch.T, 
                                  gng_err_robust$`location:power:balance:cue`$welch.T))
  # get the p value
  gng_err_robust_pval = as.numeric(c(gng_err_robust$location$pval, gng_err_robust$power$pval, 
                                  gng_err_robust$balance$pval, gng_err_robust$cue$pval, 
                                  gng_err_robust$`location:power`$pval, gng_err_robust$`location:balance`$pval,
                                  gng_err_robust$`power:balance`$pval, gng_err_robust$`location:cue`$pval, 
                                  gng_err_robust$`power:cue`$pval, gng_err_robust$`balance:cue`$pval, 
                                  gng_err_robust$`location:power:balance`$pval,
                                  gng_err_robust$`location:power:cue`$pval,
                                  gng_err_robust$`location:balance:cue`$pval, 
                                  gng_err_robust$`power:balance:cue`$pval, 
                                  gng_err_robust$`location:power:balance:cue`$pval))
  
  # bind the data into 1 data frame
  gng_err_robust_table = as.data.frame(cbind(gng_err_robust_dfn, gng_err_robust_dfd, gng_err_robust_WJ, gng_err_robust_pval))
    # add columns: (1) to denote the effect, and (2) to denote significance at p < .05 
    gng_err_robust_table = cbind(names(gng_err_robust)[1:15], 
                                 gng_err_robust_table, c(rep("", length(gng_err_robust_pval))))
    colnames(gng_err_robust_table) = c("Effect","DFn", "DFd", "WJ Stat", "p", "p < .05")
    rownames(gng_err_robust_table) = NULL
  
# show ANOVA results in kable table 
kable(gng_err_robust_table, caption = "3x2x2 Robust Mixed ANOVA on Omission Error in the Go/No-Go Task", digits = 4, align = 'c') %>%
  kable_styling(bootstrap_options = "striped", "hover", full_width = F) 

3x2x2 Robust Mixed ANOVA on Omission Error in the Go/No-Go Task
Effect DFn DFd WJ Stat p p < .05
location 2 117.2240 1.2612 0.2871
power 1 193.6793 0.9055 0.3425
balance 1 193.6793 0.8056 0.3705
cue 1 193.6793 3.2994 0.0709
location:power 2 117.2240 0.3709 0.6909
location:balance 2 117.2240 0.8711 0.4212
power:balance 1 193.6793 0.5929 0.4423
location:cue 2 117.2240 0.5581 0.5738
power:cue 1 193.6793 0.9445 0.3323
balance:cue 1 193.6793 0.1864 0.6664
location:power:balance 2 117.2240 0.5014 0.6070
location:power:cue 2 117.2240 0.3900 0.6779
location:balance:cue 2 117.2240 0.4301 0.6515
power:balance:cue 1 193.6793 0.6245 0.4304
location:power:balance:cue 2 117.2240 0.5207 0.5955

NA
Visualizations

The following shows a visulization of the Go/No-Go omission error data based on the three factors: smartphone location, smartphone power, and cue type.

To visualize the mixed design, two plots were created (i.e. one for each cue type). Note: Since the balance factor (i.e. which task was completed first) did not have a significant main effect or interaction effect on the data, this factor is not visualized in the plots

#--FOR THE VERTICAL CUE TYPE-- 

# create data frame with location conditions as characters rather than intergers 
  # First, get a subset of the data
  gng_err_bar_v_data = gng_err_final[gng_err_final$cue == 1, c("location", "power", "gng_err_omi_long")]
    colnames(gng_err_bar_v_data) = c("location", "power", "err_omi")
    rownames(gng_err_bar_v_data) = NULL 
  # reformat location & power as factors  
  gng_err_bar_v_data$location = factor(gng_err_bar_v_data$location, 
                  levels = c(1, 2, 3),
                  labels = c("On Desk", "Pocket/Bag", "Outside"))
  gng_err_bar_v_data$power = factor(gng_err_bar_v_data$power, 
                  levels = c(1, 2),
                  labels = c("ON", "OFF"))
       
# create plot
gng_err_bar_v = ggplot(gng_err_bar_v_data, aes(x = location, y = err_omi, fill = power), show.legend = FALSE) +
  geom_bar(stat = "summary", fun.y = "mean", position = "dodge") + 
    guides(shape = FALSE) + 
  scale_fill_manual(values = c("cadetblue3", "deepskyblue4")) + 
  labs(title = "Performance on the Go/No-Go Task: 'Vertical' Cue Type",
       subtitle = "Omission Errors",
       x = "Smartphone Location", 
       y = "Omission Error", 
       fill = "Smartphone Power",
       caption = paste0(strwrap("Figure X. Distributions of scores on the vertical cue type on the Cue-Dependent Go/No-Go Task, arranged by smartphone location (along the x-axis) and smartphone power (different bars). Omission errors occurred when a participant failed to respond to a non-target stimulus. Coloured dots represent individual data points. Black dots and error bars depict means and standard errors, respectively.", width = 80), collapse = "\n")) +
        # the "\n" here denote that you want a new line formed in the text
  scale_y_continuous(limits = c(min(gng_err_bar_v_data$err_omi), max(gng_err_bar_v_data$err_omi))) +
  #scale_colour_manual(values = cols) +
  geom_point(stat = "identity", aes(color = power), size = 1, 
             position = position_jitterdodge(jitter.width = .53), show.legend = FALSE) +
  scale_color_manual(values = c("deepskyblue4", "purple4")) + 
  geom_point(stat="summary", fun.y="mean", position = position_dodge(.9), size = 1.5, show.legend = FALSE) +
  geom_errorbar(data = gng_err_bar_v_data, stat = "summary", fun.ymin = function(x) mean(x) - sd(x)/sqrt(length(x)), 
                fun.ymax = function(x) mean(x) + sd(x)/sqrt(length(x)), size= .5, width= .25, position = position_dodge(.9)) +
  theme_classic() +
  theme(plot.title = element_text(color = "black", size = 14, face = "bold"), 
        plot.subtitle = element_text(color = "black", size = 13), 
        plot.caption = element_text(hjust = 0, size = 12, face = "italic"),
        text = element_text(size = 13))

#--FOR THE HORIZONTAL CUE TYPE-- 

# create data frame with location conditions as characters rather than intergers 
  # First, get a subset of the data
  gng_err_bar_h_data = gng_err_final[gng_err_final$cue == 2, c("location", "power", "gng_err_omi_long")]
    colnames(gng_err_bar_h_data) = c("location", "power", "err_omi")
    rownames(gng_err_bar_h_data) = NULL 
  # reformat location & power as factors  
  gng_err_bar_h_data$location = factor(gng_err_bar_h_data$location, 
                  levels = c(1, 2, 3),
                  labels = c("On Desk", "Pocket/Bag", "Outside"))
  gng_err_bar_h_data$power = factor(gng_err_bar_h_data$power, 
                  levels = c(1, 2),
                  labels = c("ON", "OFF"))
       
# create plot
gng_err_bar_h = ggplot(gng_err_bar_h_data, aes(x = location, y = err_omi, fill = power), show.legend = FALSE) +
  geom_bar(stat = "summary", fun.y = "mean", position = "dodge") + 
    guides(shape = FALSE) + 
  scale_fill_manual(values = c("cadetblue3", "deepskyblue4")) + 
  labs(title = "Performance on the Go/No-Go Task: 'Horizontal' Cue Type",
       subtitle = "Omission Errors",
       x = "Smartphone Location", 
       y = "Omission Error", 
       fill = "Smartphone Power",
       caption = paste0(strwrap("Figure X. Distributions of scores on the horizontal cue type on the Cue-Dependent Go/No-Go Task, arranged by smartphone location (along the x-axis) and smartphone power (different bars). Omission errors occurred when a participant failed to respond to a non-target stimulus. Coloured dots represent individual data points. Black dots and error bars depict means and standard errors, respectively.", width = 80), collapse = "\n")) +
        # the "\n" here denote that you want a new line formed in the text
  scale_y_continuous(limits = c(min(gng_err_bar_h_data$err_omi), max(gng_err_bar_h_data$err_omi))) +
  #scale_colour_manual(values = cols) +
  geom_point(stat = "identity", aes(color = power), size = 1, 
             position = position_jitterdodge(jitter.width = .53), show.legend = FALSE) +
  scale_color_manual(values = c("deepskyblue4", "purple4")) + 
  geom_point(stat="summary", fun.y="mean", position = position_dodge(.9), size = 1.5, show.legend = FALSE) +
  geom_errorbar(data = gng_err_bar_h_data, stat = "summary", fun.ymin = function(x) mean(x) - sd(x)/sqrt(length(x)), 
                fun.ymax = function(x) mean(x) + sd(x)/sqrt(length(x)), size= .5, width= .25, position = position_dodge(.9)) +
  theme_classic() +
  theme(plot.title = element_text(color = "black", size = 14, face = "bold"), 
        plot.subtitle = element_text(color = "black", size = 13), 
        plot.caption = element_text(hjust = 0, size = 12, face = "italic"),
        text = element_text(size = 13))

# Show plots
  # for the Vertical cue type
  gng_err_bar_v

  # for the Horizontal cue type
  gng_err_bar_h

NA
NA

Accuracy Analysis

A 3(smartphone location: desk, pocket/bag, outside) x 2(smartphone power: ON, OFF) between-subjects ANOVA was completed (DV = average RT). The between-subjects factors were smartphone location and power.

Prior to completing the analysis, planned exclusion critera was applied to the data. Exclusion criteria for this analysis helped to control for any participants who did not follow the instructions. Any participants who had a reaction time that was higher than two standard deviations (SD) from the mean reaction time were not included in the final analysis.

  • Exclude data from any partitipants with (1) missing data and/or (2) the reaction time SD criteria (i.e. within two standard devations from the mean RT). There was a total of 395 participants in the Go/No-Go Error analysis.
    • Missing Data: 3 participants were removed due to missing data (note: this also removed data from participants who did not response during the task).
    • Reaction Time SD Criteria: 11 participants were removed because they did not meet the raction time SD criteria. The number of participants removed with respect to their smartphone location and power conditions was as follows:
      • For the power “on” condition: 3 (on desk), 2 (pocket/bag), 1 (outside)
      • For the power “off” condition: 1 (on desk), 4 (pocket/bag), 0 (outside)

# filter any participants with missing data or that do not meet the inclusionn criteria

  # for all ps, excluding any p with missing data in the average reaction time column (gng_rt_av)
  # Note: this also removed participants who did not respond at all during the task
  gng_ac_noNA = gng_ac_analysis[!is.na(gng_ac_analysis$gng_rt_av),]
  
  # for all ps, excluding any that did not meet the reaction time deviation criteria (i.e. within two standard devations from the mean RT)
  # get the mean RT with all the data included, divided by their respective smartphone condition (i.e. either location or power)
    # get the descriptive of the data
    gng_ac_noNA_des = psych::describeBy(gng_ac_noNA, list(gng_ac_noNA$location, gng_ac_noNA$power), mat = TRUE)
    # find the inclusion criteria for each condition (M - 2*SD)
      # power = ON 
        # location = "on desk"
        gng_ac_criteria1 = gng_ac_noNA_des["gng_rt_av1","mean"] + (gng_ac_noNA_des["gng_rt_av1","sd"] * 2)
        # location = "pocket/bag"
        gng_ac_criteria2 = gng_ac_noNA_des["gng_rt_av2","mean"] + (gng_ac_noNA_des["gng_rt_av2","sd"] * 2)
        # location = "outside"
        gng_ac_criteria3 = gng_ac_noNA_des["gng_rt_av3","mean"] + (gng_ac_noNA_des["gng_rt_av3","sd"] * 2)
      # power = OFF 
        # location = "on desk"
        gng_ac_criteria4 = gng_ac_noNA_des["gng_rt_av4","mean"] + (gng_ac_noNA_des["gng_rt_av4","sd"] * 2)
        # location = "pocket/bag"
        gng_ac_criteria5 = gng_ac_noNA_des["gng_rt_av5","mean"] + (gng_ac_noNA_des["gng_rt_av5","sd"] * 2)
        # location = "outside"
        gng_ac_criteria6 = gng_ac_noNA_des["gng_rt_av6","mean"] + (gng_ac_noNA_des["gng_rt_av6","sd"] * 2)
    # find the participants to remove from the data 
      # power = ON 
        # location = "on desk"
        gng_ac_remove1 = (gng_ac_noNA %>% filter(location == 1 & power == 1 & gng_rt_av > gng_ac_criteria1))[,"participant"]
        # location = "pocket/bag"
        gng_ac_remove2 = (gng_ac_noNA %>% filter(location == 2 & power == 1 & gng_rt_av > gng_ac_criteria2))[,"participant"]
        # location = "outside"
        gng_ac_remove3 = (gng_ac_noNA %>% filter(location == 3 & power == 1 & gng_rt_av > gng_ac_criteria3))[,"participant"]
      # power = OFF 
        # location = "on desk"
        gng_ac_remove4 = (gng_ac_noNA %>% filter(location == 1 & power == 2 & gng_rt_av > gng_ac_criteria4))[,"participant"]
        # location = "pocket/bag"
        gng_ac_remove5 = (gng_ac_noNA %>% filter(location == 2 & power == 2 & gng_rt_av > gng_ac_criteria5))[,"participant"]
        # location = "outside"
        gng_ac_remove6 = (gng_ac_noNA %>% filter(location == 3 & power == 3 & gng_rt_av > gng_ac_criteria6))[,"participant"]
    # make a list of all the participant to remove from the data
      gng_ac_all_remove = dplyr::arrange(data.frame("remove" = as.numeric(c(gng_ac_remove1, gng_ac_remove2, gng_ac_remove3, gng_ac_remove4, gng_ac_remove5, gng_ac_remove6))), remove)
        # note: this code also re-orders the list from smallest to largest
  # Filter the data using the final remove list 
  gng_ac_final = dplyr::filter(gng_ac_noNA, !participant %in% gng_ac_all_remove$remove)
Descriptive Statistics

The following table depicts the descriptive statistics for the Go/No-Go Accuracy Analysis: the mean (M), standard deviation (SD), minimum score (min.), maximum score (max.), and the sample size (n).


# create a data frame with the descriptives of all the data, grouped by the location & power conditions
  # this creates information for the 6 possible combined conditions
  gng_ac_des = psych::describeBy(gng_ac_final, list(gng_ac_final$location, gng_ac_final$power), mat = TRUE)  
  
# organize the data into a new data frame depicting the M, SD, min., max., and n.
  gng_ac_des_t = rbind((cbind(t(gng_ac_des["gng_rt_av1",c("mean", "sd", "min", "max", "n")]),
                             t(gng_ac_des["gng_rt_av2",c("mean", "sd", "min", "max", "n")]),
                             t(gng_ac_des["gng_rt_av3",c("mean", "sd", "min", "max", "n")]))), 
                      (cbind(t(gng_ac_des["gng_rt_av4",c("mean", "sd", "min", "max", "n")]),
                             t(gng_ac_des["gng_rt_av5",c("mean", "sd", "min", "max", "n")]),
                             t(gng_ac_des["gng_rt_av6",c("mean", "sd", "min", "max", "n")]))))
    colnames(gng_ac_des_t) = c("On Desk", "In Pocket/Bag", "Outside")
    rownames(gng_ac_des_t) = rep(c("M", "SD", "min.", "max.", "n"), 2)
    
# use kable to show the descriptives table
  kable(gng_ac_des_t, caption = "Descriptive Statistics: Reaction Time (sec.) on the Go/No-Go Task", digits = 4, align = 'c') %>%
  kable_styling(bootstrap_options = "striped", "hover", full_width = F) %>%
  add_header_above(c(" ", "Smartphone Location" = 3)) %>%
  group_rows(index = c("Smartphone Power ON" = 5, "Smartphone Power OFF" = 5))

Descriptive Statistics: Reaction Time (sec.) on the Go/No-Go Task
Smartphone Location
On Desk In Pocket/Bag Outside
Smartphone Power ON
M 1.6553 1.6617 1.6608
SD 0.0405 0.0516 0.0412
min. 1.5783 1.5618 1.5600
max. 1.7555 1.7730 1.7449
n 70.0000 70.0000 65.0000
Smartphone Power OFF
M 1.6579 1.6506 1.6608
SD 0.0574 0.0511 0.0623
min. 1.5690 1.5604 1.5635
max. 1.8442 1.7749 1.8688
n 60.0000 64.0000 66.0000

NA
Analysis

To assess if the reaction time statistically differed, we conducted a 3x2 ANOVA with the between-subjects factors of smartphone location (i.e. on desk, in pocket/bag, or outside) and power (i.e. on or off). Since there are unequal sample sizes, we used Type III sum of squares.

Assumptions

The ANOVA made three assumptions. The independent random sampling and normality assumptions were met. The homogeneity of variance assumption was not met, therefore, a White correction was done for the relevant effect. (click to see details).

1. Independent Random Sampling

This assumption was met during testing.

2. Normality

This assumption was tested by visualizing the residuals, applying a Shapiro-Wilk test to the resdiuals, and observing the Skewness and Kurtosis of the data (extracted from the ANOVA output).


# get the required data from the ospan_analysis data: participant, location, power, and score
gng_ac_anova_data = gng_ac_final[, c("participant", "location", "power", "balance", "gng_rt_av")]
    colnames(gng_ac_anova_data) = c("participant", "location", "power", "balance", "rt_av")
    rownames(gng_ac_anova_data) = NULL
    # reformat location,  power, and balance as factors  
    gng_ac_anova_data$location = factor(gng_ac_anova_data$location, 
                  levels = c(1, 2, 3),
                  labels = c("On Desk", "Pocket/Bag", "Outside"))
    gng_ac_anova_data$power = factor(gng_ac_anova_data$power, 
                  levels = c(1, 2),
                  labels = c("ON", "OFF"))
    gng_ac_anova_data$balance = factor(gng_ac_anova_data$balance, 
                  levels = c(1, 2),
                  labels = c("Ospan First", "Go/No-Go First"))

# run between-subjects ANOVA (IV: Smartphone Power & Location, Balance; DV: OSpan Abolute Score)
gng_ac_anova = ezANOVA(
  data = gng_ac_anova_data
  , dv = .(rt_av)
  , wid = .(participant)
  , between = .(location, power, balance)
  , type = 3 # unequal sample sizes, therfore use type 3
  , detailed = TRUE
  , return_aov = TRUE
)

# calculate & extract the residuals from the ANOVA
gng_ac_res = data.frame("residuals" = proj(gng_ac_anova$aov)[,"Residuals"])   
  • It appears that the residuals lie closely to a straight line, which suggests normality (see visualization below).

qqnorm(gng_ac_res$residuals, main = "Q-Q Plot of the Go/No-Go Reaction Time Residuals") 
qqline(gng_ac_res$residuals)

  • It appears that residuals tend to follow the shape of a normal distribution, with a slight skew to the left and some data on the far right (see visualization below).

qplot(gng_ac_res$residuals, main = "Histogram of Go/No-Go Reaction Time Residuals") + 
  theme_classic()

  • The Shapiro-Wilk test of normality was significant (based on \(\alpha = 0.05\)), W = 0.98, p < .001 (see code below).

gng_ac_shap = shapiro.test(gng_ac_res$residuals)
gng_ac_shap

    Shapiro-Wilk normality test

data:  gng_ac_res$residuals
W = 0.97715, p-value = 6.914e-06
  • The data’s Skewness and Kurtosis was examined and showed that both the Skewness (range: -0.02 to 1.38) and Kurtosis (range: -0.88 to 1.83) were within the acceptable range of \(\pm 2.0\) and \(\pm 9.0\), respectively (Schmider, Ziegler, Danay, Beyer, & Bühner, 2010) (see table below).
# use the "gng_ac_des" data frame from the descriptives

# organize the data into a new data frame depicting the Skew, kurtosis, and sample size (n).
  gng_ac_sk_t = rbind((cbind(t(gng_ac_des["gng_rt_av1", c("skew", "kurtosis", "n")]),
                             t(gng_ac_des["gng_rt_av2", c("skew", "kurtosis", "n")]),
                             t(gng_ac_des["gng_rt_av3", c("skew", "kurtosis", "n")]))), 
                      (cbind(t(gng_ac_des["gng_rt_av4", c("skew", "kurtosis", "n")]),
                             t(gng_ac_des["gng_rt_av5", c("skew", "kurtosis", "n")]),
                             t(gng_ac_des["gng_rt_av6", c("skew", "kurtosis", "n")]))))
    colnames(gng_ac_sk_t) = c("On Desk", "In Pocket/Bag", "Outside")
    rownames(gng_ac_sk_t) = rep(c("Skewness", "Kurtosis", "n"), 2)
    
# use kable to show the descriptives table
  kable(gng_ac_sk_t, caption = "Skewness and Kurtosis of the Reaction Time on the Go/No-Go Task", digits = 4, align = 'c') %>%
  kable_styling(bootstrap_options = "striped", "hover", full_width = F) %>%
  add_header_above(c(" ", "Smartphone Location" = 3)) %>%
  group_rows(index = c("Smartphone Power ON" = 3, "Smartphone Power OFF" = 3))

Skewness and Kurtosis of the Reaction Time on the Go/No-Go Task
Smartphone Location
On Desk In Pocket/Bag Outside
Smartphone Power ON
Skewness 0.3923 0.2726 -0.0191
Kurtosis -0.4925 -0.8790 -0.4936
n 70.0000 70.0000 65.0000
Smartphone Power OFF
Skewness 1.3850 0.4283 0.7265
Kurtosis 1.8330 -0.5305 0.2687
n 60.0000 64.0000 66.0000

NA
  • Therefore, based on all the normality assumption checks, normality was assumed to be within the acceptable range for the ANOVA analysis. Additionally, it should be noted that ANOVA analyses can be robust to violations of normality (Gardner & Tremblay, 2007).

3. Homogeneity of Variance

This assumption was tested by conducting Levene’s Test for Homogeneity of Variance for each between-subjects variable (i.e. smartphone location and smartphone power).

  • The homogeneity assumption was met (based on \(\alpha = 0.05\)) for smartphone location, F(2, 392) = 1.12, p = .33 and balance, F(1, 393) = 2.33, p = .13. However, the homogeneity assumption was not met for smartphone power, F(1, 393) = 5.81, p = .02 (see code below).

# conduct levene's
  # for location
  gng_ac_lev_loc = leveneTest(data = gng_ac_anova_data, rt_av ~ as.factor(location), center = median)
  gng_ac_lev_loc
Levene's Test for Homogeneity of Variance (center = median)
       Df F value Pr(>F)
group   2   1.119 0.3276
      392               
  # for power
  gng_ac_lev_pow = leveneTest(data = gng_ac_anova_data, rt_av ~ as.factor(power), center = median)
  gng_ac_lev_pow
Levene's Test for Homogeneity of Variance (center = median)
       Df F value  Pr(>F)  
group   1  5.8139 0.01636 *
      393                  
---
Signif. codes:  0 ‘***’ 0.001 ‘**’ 0.01 ‘*’ 0.05 ‘.’ 0.1 ‘ ’ 1
  # for balance
  gng_ac_lev_bal = leveneTest(data = gng_ac_anova_data, rt_av ~ as.factor(balance), center = median)
  gng_ac_lev_bal
Levene's Test for Homogeneity of Variance (center = median)
       Df F value Pr(>F)
group   1  2.3327 0.1275
      393               
  • The homogeneity of variance assumption was not met for the smarphone power condition. Therefore, considering there are unequal sample sizes, a White-corrected F-test was conducted (i.e. if interested in assessing the main effect of power).
# run the corrected ANOVA (IV: Smartphone Power & Location, Balance; DV: OSpan Abolute Score)
# this will be used to report the main effect of smartphone power only 
gng_ac_anova_cor = ezANOVA(
  data = gng_ac_anova_data
  , dv = .(rt_av)
  , wid = .(participant)
  , between = .(location, power, balance)
  , type = 3 # unequal sample sizes, therfore use type 3
  , detailed = TRUE
  , return_aov = TRUE
  , white.adjust = TRUE
)

# Calculate effect size:
# this will be used to report the main effect of smartphone power only 
gng_ac_pow_es = (EtaSq(gng_ac_anova$aov, type = 3, anova = FALSE))[2]
Results

All results were based on \(\alpha = 0.05\). Below are the reported statistics for the main effects and the interactions, followed by a table depicting the ANOVA results.

Main Effect of Smartphone Location

There was no significant main effect of smartphone location on reaction time, F(2, 383) = 0.7, p = .5, \(\eta^{2}\) = .004.

Main Effect of Smartphone Power

There was no significant main effect of smartphone power on reaction time, F(1, 383) = 0.81, p = .37, \(\eta^{2}\) = .01.

Interaction of Balance

There was no significant main effect of balance (i.e. which task was completed first) on reaction time, F(1, 383) = 0.06, p = .8, \(\eta^{2}\) < .001.

Interaction of Smartphone Location & Smartphone Power

There was no significant interaction between smartphone location and smartphone power on reaction time, F(2, 383) = 0.33, p = .72, \(\eta^{2}\) = .002.

Interaction of Smartphone Location & Balance

There was no significant interaction between smartphone location and balance (i.e. which task was completed first) on reaction time, F(2, 383) = 2.07, p = .13, \(\eta^{2}\) = .01.

Interaction of Smartphone Power & Balance

There was no significant interaction between smartphone power and balance (i.e. which task was completed first) on reaction time, F(1, 383) = 1.46, p = .23, \(\eta^{2}\) = .004.

Interaction of Smartphone Location, Smartphone Power, & Balance

There was a significant interaction between smartphone location, smartphone power, and balance (i.e. which task was completed first) on reaction time, F(2, 383) = 5.49, p = .004, \(\eta^{2}\) = .03.

Post-hoc Tests

Since there was a significant three-way interaction (i.e smartphone location, smartphone power, and balance), further analyses were completed (see “Post-Hocs: Simple Two-Way Interactions” below).


# create a table with the anova results including the corrected values for the main effect of power (i.e. since it did not meet the homogeneity of variance assumption)
  # get the original anova data
  gng_ac_t = gng_ac_anova$ANOVA
    # since the corrected data does not have the SS, remove these columns from the original ANOVA table
    gng_ac_t = dplyr::select(gng_ac_t, -SSn, -SSd)
    # chance the data for the main effect of power to corrected numbers
    gng_ac_t[gng_ac_t$Effect == "power",] = c(gng_ac_anova_cor$ANOVA[3,], gng_ac_pow_es)
# show ANOVA results in kable table 
kable(gng_ac_t, caption = "3x2x2 ANOVA on Go/No-Go Reaction Time", digits = 4, align = 'c') %>%
  kable_styling(bootstrap_options = "striped", "hover", full_width = F) 

3x2x2 ANOVA on Go/No-Go Reaction Time
Effect DFn DFd F p p<.05 ges
(Intercept) 1 383 417915.0714 0.0000 * 0.9991
location 2 383 0.7043 0.4951 0.0037
power 1 383 0.8121 0.3681 0.0084
balance 1 383 0.0649 0.7991 0.0002
location:power 2 383 0.3301 0.7190 0.0017
location:balance 2 383 2.0740 0.1271 0.0107
power:balance 1 383 1.4648 0.2269 0.0038
location:power:balance 2 383 5.4934 0.0044 * 0.0279

NA
Post-Hocs: Simple Two-Way Interactions

A three-way interaction (i.e. between smartphone location, smartphone power, and balance) suggested that a two-way interaction operated differently depending on the levels of the third factor. In order to interpret this complex interaction, the data was broken down into a subset based on the levels of one of the factors (i.e. “simple two-way interactions”).

The main manipulation in the present study was smartphone location and power, additionally, the balance factor was designed to provide an experimental control for the counter-balanced task order. Therefore, the simple two-way interactions of the smartphone factors (i.e. location and power) were assessed at each level of balance (i.e. OSpan first and Go/No-Go first). Although this required two more analyses, this approach allowed us to disentangle the three-way interaction and therefore interpret where the differences were among the factors. Two new data sets were created for the simple two-way interactions: (1) for the OSpan First Data (gng_ac_ospan_data) and (2) for the Go/No-Go First Data (gng_ac_gng_data).


# get a subset of the gng_ac_anova_data
  # for the OSpan first balance group only
  gng_ac_ospan_data = gng_ac_anova_data[gng_ac_anova_data$balance == "Ospan First", ]

  # for the Go/No-Go first balance group only
  gng_ac_gng_data = gng_ac_anova_data[gng_ac_anova_data$balance == "Go/No-Go First", ]

In order to control for multiple comparisons, a Bonferroni adjustment was used on the p-values to determine significance. This adjustment was completed by dividing the current alpha level (\(\alpha = 0.05\)) by the number of simple two-way interactions being compuetd (i.e. 2), therefore, the the bonferroni-corrected alpha value for statistical significance was 0.025.

OSpan First: Descriptive Statistics

The first simple two-way interaction ANOVA was completed using the “OSpan First” subset of the Balance Factor (i.e. participants who completed the OSpan task first, then the Go/No-Go task). As in the main 3-way ANOVA, the between-subjects factors were smartphone location (i.e. desk, pocket/bag or outside) and power (i.e. ON or OFF), and the dependent variable was accuracy (i.e. average reaction time).

There was a total of 180 participants in the OSpan First analysis. The following table depicts the descriptive statistics: the mean (M), standard deviation (SD), minimum score (min.), maximum score (max.), and the sample size (n).


# create a data frame with the descriptives of all the data, grouped by the location & power conditions
  # this creates information for the 6 possible combined conditions
  gng_ac_ospan_des = psych::describeBy(gng_ac_ospan_data, list(gng_ac_ospan_data$location, gng_ac_ospan_data$power), mat = TRUE)  

# organize the data into a new data frame depicting the M, SD, min., max., and n.
  gng_ac_ospan_des_t = rbind((cbind(t(gng_ac_ospan_des["rt_av1", c("mean", "sd", "min", "max", "n")]),
                             t(gng_ac_ospan_des["rt_av2", c("mean", "sd", "min", "max", "n")]),
                             t(gng_ac_ospan_des["rt_av3", c("mean", "sd", "min", "max", "n")]))), 
                      (cbind(t(gng_ac_ospan_des["rt_av4", c("mean", "sd", "min", "max", "n")]),
                             t(gng_ac_ospan_des["rt_av5", c("mean", "sd", "min", "max", "n")]),
                             t(gng_ac_ospan_des["rt_av6", c("mean", "sd", "min", "max", "n")]))))
    colnames(gng_ac_ospan_des_t) = c("On Desk", "In Pocket/Bag", "Outside")
    rownames(gng_ac_ospan_des_t) = rep(c("M", "SD", "min.", "max.", "n"), 2)
  
# use kable to show the descriptives table
  kable(gng_ac_ospan_des_t, caption = "Ospan First Descriptive Statistics: Reaction Time (sec.) on the Go/No-Go Task", digits = 4, align = 'c') %>%
  kable_styling(bootstrap_options = "striped", "hover", full_width = F) %>%
  add_header_above(c(" ", "Smartphone Location" = 3)) %>%
  group_rows(index = c("Smartphone Power ON" = 5, "Smartphone Power OFF" = 5))

Ospan First Descriptive Statistics: Reaction Time (sec.) on the Go/No-Go Task
Smartphone Location
On Desk In Pocket/Bag Outside
Smartphone Power ON
M 1.6590 1.6547 1.6726
SD 0.0402 0.0539 0.0378
min. 1.5864 1.5802 1.6030
max. 1.7555 1.7730 1.7431
n 34.0000 31.0000 27.0000
Smartphone Power OFF
M 1.6343 1.6612 1.6585
SD 0.0242 0.0486 0.0595
min. 1.5906 1.5889 1.5723
max. 1.6867 1.7749 1.7820
n 24.0000 32.0000 32.0000

NA
OSpan First: Assumptions

The two-way interaction made three assumptions. The independent random sampling and normality assumptions were met. The homogeneity of variance assumption was not met, therefore, a White correction was done for the relevant effect. (click to see details).

1. Independent Random Sampling

This assumption was met during testing.

2. Normality

This assumption was tested by visualizing the residuals, applying a Shapiro-Wilk test to the resdiuals, and observing the Skewness and Kurtosis of the data (extracted from the ANOVA output).


# run between-subjects ANOVA (IV: Smartphone Power & Location; DV: OSpan Abolute Score)
gng_ac_ospan_anova = ezANOVA(
  data = gng_ac_ospan_data
  , dv = .(rt_av)
  , wid = .(participant)
  , between = .(location, power)
  , type = 3 # unequal sample sizes, therfore use type 3
  , detailed = TRUE
  , return_aov = TRUE
  )
  
# calculate & extract the residuals from the ANOVA
gng_ac_ospan_res = data.frame("residuals" = proj(gng_ac_ospan_anova$aov)[,"Residuals"])  
  • It appears that the residuals lie closely to a straight line, which suggests normality (see visualization below).

qqnorm(gng_ac_ospan_res$residuals, main = "Q-Q Plot of the Go/No-Go Reaction Time Residuals for the OSpan First Group") 
qqline(gng_ac_ospan_res$residuals)

  • It appears that residuals tend to follow the shape of a normal distribution (see visualization below).

qplot(gng_ac_ospan_res$residuals, main = "Histogram of Go/No-Go Reaction Time Residuals for the OSpan First Group") + 
  theme_classic()

  • The Shapiro-Wilk test of normality was significant (based on \(\alpha = 0.05\)), W = 0.98, p = .004 (see code below).

gng_ac_ospan_shap = shapiro.test(gng_ac_ospan_res$residuals)
gng_ac_ospan_shap

    Shapiro-Wilk normality test

data:  gng_ac_ospan_res$residuals
W = 0.97676, p-value = 0.004195
  • The data’s Skewness and Kurtosis was examined and showed that both the Skewness (range: 0.02 to 0.49) and Kurtosis (range: -1.17 to -0.46) were within the acceptable range of \(\pm 2.0\) and \(\pm 9.0\), respectively (Schmider, Ziegler, Danay, Beyer, & Bühner, 2010) (see table below).

# use the descriptives data -- gng_ac_ospan_des

# organize the data into a new data frame depicting the Skew, kurtosis, and sample size (n).
  gng_ac_ospan_sk_t = rbind((cbind(t(gng_ac_ospan_des["rt_av1", c("skew", "kurtosis", "n")]),
                             t(gng_ac_ospan_des["rt_av2", c("skew", "kurtosis", "n")]),
                             t(gng_ac_ospan_des["rt_av3", c("skew", "kurtosis", "n")]))), 
                      (cbind(t(gng_ac_ospan_des["rt_av4", c("skew", "kurtosis", "n")]),
                             t(gng_ac_ospan_des["rt_av5", c("skew", "kurtosis", "n")]),
                             t(gng_ac_ospan_des["rt_av6", c("skew", "kurtosis", "n")]))))
    colnames(gng_ac_ospan_sk_t) = c("On Desk", "In Pocket/Bag", "Outside")
    rownames(gng_ac_ospan_sk_t) = rep(c("Skewness", "Kurtosis", "n"), 2)
    
# use kable to show the descriptives table
  kable(gng_ac_ospan_sk_t, caption = "Skewness and Kurtosis of the Reaction Time on the Go/No-Go Task for the OSpan First Group", digits = 4, align = 'c') %>%
  kable_styling(bootstrap_options = "striped", "hover", full_width = F) %>%
  add_header_above(c(" ", "Smartphone Location" = 3)) %>%
  group_rows(index = c("Smartphone Power ON" = 3, "Smartphone Power OFF" = 3))

Skewness and Kurtosis of the Reaction Time on the Go/No-Go Task for the OSpan First Group
Smartphone Location
On Desk In Pocket/Bag Outside
Smartphone Power ON
Skewness 0.4543 0.4852 0.0235
Kurtosis -0.4591 -0.8623 -0.9030
n 34.0000 31.0000 27.0000
Smartphone Power OFF
Skewness 0.2582 0.4521 0.3469
Kurtosis -0.6946 -0.6692 -1.1674
n 24.0000 32.0000 32.0000

NA
  • Therefore, based on all the normality assumption checks, normality was assumed to be within the acceptable range for the analysis. Additionally, it should be noted that ANOVA analyses can be robust to violations of normality (Gardner & Tremblay, 2007).

3. Homogeneity of Variance

This assumption was tested by conducting Levene’s Test for Homogeneity of Variance for each between-subjects variable (i.e. smartphone location and smartphone power).

  • The homogeneity assumption was met (based on \(\alpha = 0.05\)) for smartphone power, F(1, 178) = 0.33, p = .57. However, the homogeneity assumption was not met for smartphone location, F(2, 177) = 4.42, p = .01 (see code below).

# conduct levene's
  # for location
  gng_ac_ospan_lev_loc = leveneTest(data = gng_ac_ospan_data, rt_av ~ as.factor(location), center = median)
  gng_ac_ospan_lev_loc
Levene's Test for Homogeneity of Variance (center = median)
       Df F value  Pr(>F)  
group   2  4.4174 0.01342 *
      177                  
---
Signif. codes:  0 ‘***’ 0.001 ‘**’ 0.01 ‘*’ 0.05 ‘.’ 0.1 ‘ ’ 1
  # for power
  gng_ac_ospan_lev_pow = leveneTest(data = gng_ac_ospan_data, rt_av ~ as.factor(power), center = median)
  gng_ac_ospan_lev_pow
Levene's Test for Homogeneity of Variance (center = median)
       Df F value Pr(>F)
group   1  0.3272  0.568
      178               
  • The homogeneity of variance assumption was not met for the smartphone location condition. Therefore, considering there were unequal sample sizes, a White-corrected F-test was conducted (i.e. if interested in assessing the main effect of location).
# run the corrected ANOVA (IV: Smartphone Power & Location; DV: Go/No-Go RT)
# this will be used to report the main effect of smartphone power only  
gng_ac_ospan_anova_cor = ezANOVA(
  data = gng_ac_ospan_data
  , dv = .(rt_av)
  , wid = .(participant)
  , between = .(location, power)
  , type = 3 # unequal sample sizes, therfore use type 3
  , detailed = TRUE
  , return_aov = TRUE
  , white.adjust = TRUE
)

# Calculate effect size:
# this will be used to report the main effect of smartphone power only 
gng_ac_ospan_loc_es = (EtaSq(gng_ac_ospan_anova_cor$aov, type = 3, anova = FALSE))[2]
OSpan First: Results

All results were based on \(\alpha_{adjusted} = 0.025\). Below are the reported statistics for the main and interaction effect of the simple two-way ANOVA, followed by a table depicting the ANOVA results.

Main Effect of Smartphone Location

There was no significant main effect of smartphone location on reaction time, F(2, 174) = 3.19, p = .04, \(\eta^{2}\) = .02.

Main Effect of Smartphone Power

There was no significant main effect of smartphone power on reaction time, F(1, 174) = 2.39, p = .12, \(\eta^{2}\) = .01.

Interaction of Smartphone Location & Smartphone Power

There was no significant interaction between smartphone location and smartphone power on reaction time, F(2, 174) = 1.76, p = .18, \(\eta^{2}\) = .02.

Post-hoc Tests

Since there was no significant main or interaction effects, no post-hocs were completed.


# create a table with the anova results including the corrected values for the main effect of power (i.e. since it did not meet the homogeneity of variance assumption)
  # get the original anova data
  gng_ac_ospan_t = gng_ac_ospan_anova$ANOVA
    # since the corrected data does not have the SS, remove these columns from the original ANOVA table
    gng_ac_ospan_t = dplyr::select(gng_ac_ospan_t, -SSn, -SSd)
    # change the data for the main effect of location to corrected numbers
    gng_ac_ospan_t[gng_ac_ospan_t$Effect == "location",] = c(gng_ac_ospan_anova_cor$ANOVA[2,], gng_ac_ospan_loc_es)
    # re-define the alpha value based on the bonferroni correction
      # rename the "p<.05" column appropriately -- "p<.025"
      colnames(gng_ac_ospan_t)[6] = "p<.025"
      # change values to fit the data
      gng_ac_ospan_t$`p<.025` = ifelse(gng_ac_ospan_t$p < .025, "*", "")
    
# show ANOVA results in kable table 
kable(gng_ac_ospan_t, caption = "OSpan First: Simple Two-Way Interaction on Reaction Time on the Go/No-Go Task", digits = 4, align = 'c') %>%
  kable_styling(bootstrap_options = "striped", "hover", full_width = F) 

OSpan First: Simple Two-Way Interaction on Reaction Time on the Go/No-Go Task
Effect DFn DFd F p p<.025 ges
(Intercept) 1 174 226497.4374 0.0000 * 0.9992
location 2 174 3.1905 0.0436 0.0218
power 1 174 2.3946 0.1236 0.0136
location:power 2 174 1.7570 0.1756 0.0198

NA
Go/No-Go First: Descriptive Statistics

The second simple two-way interaction ANOVA was completed using the “Go/No-Go First” subset of the Balance Factor (i.e. participants who completed the Go/No-Go task first, then the OSpan task). As in the main 3-way ANOVA, the between-subjects factors were smartphone location (i.e. desk, pocket/bag or outside) and power (i.e. ON or OFF), and the dependent variable was accuracy (i.e. average reaction time).

There was a total of 215 participants in the Go/No-Go First analysis. The following table depicts the descriptive statistics: the mean (M), standard deviation (SD), minimum score (min.), maximum score (max.), and the sample size (n).


# create a data frame with the descriptives of all the data, grouped by the location & power conditions
  # this creates information for the 6 possible combined conditions
  gng_ac_gng_des = psych::describeBy(gng_ac_gng_data, list(gng_ac_gng_data$location, gng_ac_gng_data$power), mat = TRUE)  

# organize the data into a new data frame depicting the M, SD, min., max., and n.
  gng_ac_gng_des_t = rbind((cbind(t(gng_ac_gng_des["rt_av1", c("mean", "sd", "min", "max", "n")]),
                             t(gng_ac_gng_des["rt_av2", c("mean", "sd", "min", "max", "n")]),
                             t(gng_ac_gng_des["rt_av3", c("mean", "sd", "min", "max", "n")]))), 
                      (cbind(t(gng_ac_gng_des["rt_av4", c("mean", "sd", "min", "max", "n")]),
                             t(gng_ac_gng_des["rt_av5", c("mean", "sd", "min", "max", "n")]),
                             t(gng_ac_gng_des["rt_av6", c("mean", "sd", "min", "max", "n")]))))
    colnames(gng_ac_gng_des_t) = c("On Desk", "In Pocket/Bag", "Outside")
    rownames(gng_ac_gng_des_t) = rep(c("M", "SD", "min.", "max.", "n"), 2)
  
# use kable to show the descriptives table
  kable(gng_ac_gng_des_t, caption = "Go/No-Go First Descriptive Statistics: Reaction Time (sec.) on the Go/No-Go Task", digits = 4, align = 'c') %>%
  kable_styling(bootstrap_options = "striped", "hover", full_width = F) %>%
  add_header_above(c(" ", "Smartphone Location" = 3)) %>%
  group_rows(index = c("Smartphone Power ON" = 5, "Smartphone Power OFF" = 5))

Go/No-Go First Descriptive Statistics: Reaction Time (sec.) on the Go/No-Go Task
Smartphone Location
On Desk In Pocket/Bag Outside
Smartphone Power ON
M 1.6519 1.6673 1.6525
SD 0.0411 0.0496 0.0419
min. 1.5783 1.5618 1.5600
max. 1.7472 1.7557 1.7449
n 36.0000 39.0000 38.0000
Smartphone Power OFF
M 1.6735 1.6399 1.6630
SD 0.0674 0.0522 0.0656
min. 1.5690 1.5604 1.5635
max. 1.8442 1.7667 1.8688
n 36.0000 32.0000 34.0000

NA
Go/No-Go First: Assumptions

The two-way interaction made three assumptions. The independent random sampling and normality assumptions were met. The homogeneity of variance assumption was not met, therefore, a White correction was done for the relevant effect. (click to see details).

1. Independent Random Sampling

This assumption was met during testing.

2. Normality

This assumption was tested by visualizing the residuals, applying a Shapiro-Wilk test to the resdiuals, and observing the Skewness and Kurtosis of the data (extracted from the ANOVA output).


# run between-subjects ANOVA (IV: Smartphone Power & Location; DV: OSpan Abolute Score)
gng_ac_gng_anova = ezANOVA(
  data = gng_ac_gng_data
  , dv = .(rt_av)
  , wid = .(participant)
  , between = .(location, power)
  , type = 3 # unequal sample sizes, therfore use type 3
  , detailed = TRUE
  , return_aov = TRUE
  )

# calculate & extract the residuals from the ANOVA
gng_ac_gng_res = data.frame("residuals" = proj(gng_ac_gng_anova$aov)[,"Residuals"])   
  • It appears that the residuals lie closely to a straight line, which suggests normality (see visualization below).

qqnorm(gng_ac_gng_res$residuals, main = "Q-Q Plot of the Go/No-Go Reaction Time Residuals for the Go/No-Go First Group") 
qqline(gng_ac_gng_res$residuals)

  • It appears that residuals tend to follow the shape of a normal distribution (see visualization below).

qplot(gng_ac_gng_res$residuals, main = "Histogram of Go/No-Go Reaction Time Residuals for the Go/No-Go First Group") + 
  theme_classic()

  • The Shapiro-Wilk test of normality was significant (based on \(\alpha = 0.05\)), W = 0.97, p < .001 (see code below).

gng_ac_gng_shap = shapiro.test(gng_ac_gng_res$residuals)
gng_ac_gng_shap

    Shapiro-Wilk normality test

data:  gng_ac_gng_res$residuals
W = 0.97149, p-value = 0.0002423
  • The data’s Skewness and Kurtosis was examined and showed that both the Skewness (range: 0.06 to 0.95) and Kurtosis (range: -0.88 to 0.88) were within the acceptable range of \(\pm 2.0\) and \(\pm 9.0\), respectively (Schmider, Ziegler, Danay, Beyer, & Bühner, 2010) (see table below).

# use the descriptives data -- gng_ac_gng_des

# organize the data into a new data frame depicting the Skew, kurtosis, and sample size (n).
  gng_ac_gng_sk_t = rbind((cbind(t(gng_ac_gng_des["rt_av1", c("skew", "kurtosis", "n")]),
                             t(gng_ac_gng_des["rt_av2", c("skew", "kurtosis", "n")]),
                             t(gng_ac_gng_des["rt_av3", c("skew", "kurtosis", "n")]))), 
                      (cbind(t(gng_ac_gng_des["rt_av4", c("skew", "kurtosis", "n")]),
                             t(gng_ac_gng_des["rt_av5", c("skew", "kurtosis", "n")]),
                             t(gng_ac_gng_des["rt_av6", c("skew", "kurtosis", "n")]))))
    colnames(gng_ac_gng_sk_t) = c("On Desk", "In Pocket/Bag", "Outside")
    rownames(gng_ac_gng_sk_t) = rep(c("Skewness", "Kurtosis", "n"), 2)
    
# use kable to show the descriptives table
  kable(gng_ac_gng_sk_t, caption = "Skewness and Kurtosis of the Reaction Time on the Go/No-Go Task for the Go/No-Go First Group", digits = 4, align = 'c') %>%
  kable_styling(bootstrap_options = "striped", "hover", full_width = F) %>%
  add_header_above(c(" ", "Smartphone Location" = 3)) %>%
  group_rows(index = c("Smartphone Power ON" = 3, "Smartphone Power OFF" = 3))

Skewness and Kurtosis of the Reaction Time on the Go/No-Go Task for the Go/No-Go First Group
Smartphone Location
On Desk In Pocket/Bag Outside
Smartphone Power ON
Skewness 0.3417 0.1117 0.0623
Kurtosis -0.6968 -0.8811 -0.4550
n 36.0000 39.0000 38.0000
Smartphone Power OFF
Skewness 0.8433 0.5198 0.9459
Kurtosis 0.0585 -0.5200 0.8837
n 36.0000 32.0000 34.0000

NA
  • Therefore, based on all the normality assumption checks, normality was assumed to be within the acceptable range for the analysis. Additionally, it should be noted that ANOVA analyses can be robust to violations of normality (Gardner & Tremblay, 2007).

3. Homogeneity of Variance

This assumption was tested by conducting Levene’s Test for Homogeneity of Variance for each between-subjects variable (i.e. smartphone location and smartphone power).

  • The homogeneity assumption was met (based on \(\alpha = 0.05\)) for smartphone location, F(2, 212) = 0.03, p = .97. However, the homogeneity assumption was not met for smartphone power, F(1, 213) = 6.81, p = .01 (see code below).

# conduct levene's
  # for location
  gng_ac_gng_lev_loc = leveneTest(data = gng_ac_gng_data, rt_av ~ as.factor(location), center = median)
  gng_ac_gng_lev_loc
Levene's Test for Homogeneity of Variance (center = median)
       Df F value Pr(>F)
group   2  0.0283 0.9721
      212               
  # for power
  gng_ac_gng_lev_pow = leveneTest(data = gng_ac_gng_data, rt_av ~ as.factor(power), center = median)
  gng_ac_gng_lev_pow
Levene's Test for Homogeneity of Variance (center = median)
       Df F value   Pr(>F)   
group   1   6.814 0.009687 **
      213                    
---
Signif. codes:  0 ‘***’ 0.001 ‘**’ 0.01 ‘*’ 0.05 ‘.’ 0.1 ‘ ’ 1
  • The homogeneity of variance assumption was not met for the smartphone power condition. Therefore, considering there were unequal sample sizes, a White-corrected F-test was conducted (i.e. if interested in assessing the main effect of power).
# run the corrected ANOVA (IV: Smartphone Power & Location ; DV: Go/No-Go RT)
# this will be used to report the main effect of smartphone power only  
gng_ac_gng_anova_cor = ezANOVA(
  data = gng_ac_gng_data
  , dv = .(rt_av)
  , wid = .(participant)
  , between = .(location, power)
  , type = 3 # unequal sample sizes, therfore use type 3
  , detailed = TRUE
  , return_aov = TRUE
  , white.adjust = TRUE
)

# Calculate effect size:
# this will be used to report the main effect of smartphone power only 
gng_ac_gng_loc_es = (EtaSq(gng_ac_gng_anova_cor$aov, type = 3, anova = FALSE))[2]
Go/No-Go First: Results

All results were based on \(\alpha = 0.025\). Below are the reported statistics for the main and interaction effect of the simple two-way ANOVA, followed by a table depicting the ANOVA results.

Main Effect of Smartphone Location

There was no significant main effect of smartphone location on reaction time, F(2, 209) = 0.51, p = .6, \(\eta^{2}\) = .005.

Main Effect of Smartphone Power

There was no significant main effect of smartphone power on reaction time, F(1, 209) = 0.05, p = .83, \(\eta^{2}\) = .01.

Interaction of Smartphone Location & Smartphone Power

There was a significant interaction between smartphone location and smartphone power on reaction time, F(2, 209) = 4.05, p = .02, \(\eta^{2}\) = .04.

Post-hoc Tests

Since there was a significant two-way interaction (i.e smartphone location and smartphone power), further analyses were completed (see “Simple Main Effects: Smartphone Location vs. Smartphone Power (for the Go/No-Go First Balance Group)” below).


# create a table with the anova results including the corrected values for the main effect of power (i.e. since it did not meet the homogeneity of variance assumption)
  # get the original anova data
  gng_ac_gng_t = gng_ac_gng_anova$ANOVA
    # since the corrected data does not have the SS, remove these columns from the original ANOVA table
    gng_ac_gng_t = dplyr::select(gng_ac_gng_t, -SSn, -SSd)
    # change the data for the main effect of location to corrected numbers
    gng_ac_gng_t[gng_ac_gng_t$Effect == "power",] = c(gng_ac_gng_anova_cor$ANOVA[3,], gng_ac_gng_loc_es)
    # re-define the alpha value based on the bonferroni correction
      # rename the "p<.05" column appropriately -- "p<.025"
      colnames(gng_ac_gng_t)[6] = "p<.025"
      # change values to fit the data
      gng_ac_gng_t$`p<.025` = ifelse(gng_ac_gng_t$p < 0.025, "*", "")
    
# show ANOVA results in kable table 
kable(gng_ac_gng_t, caption = "Go/No-Go First: Simple Two-Way Interaction on Reaction Time on the Go/No-Go Task", digits = 4, align = 'c') %>%
  kable_styling(bootstrap_options = "striped", "hover", full_width = F) 

Go/No-Go First: Simple Two-Way Interaction on Reaction Time on the Go/No-Go Task
Effect DFn DFd F p p<.025 ges
(Intercept) 1 209 204027.9611 0.0000 * 0.9990
location 2 209 0.5138 0.5990 0.0049
power 1 209 0.0475 0.8277 0.0134
location:power 2 209 4.0527 0.0188 * 0.0373

NA
Simple Main Effects: Smartphone Location vs. Smartphone Power (for the Go/No-Go First Balance Group)

The simple two-way interaction showed that, when the Go/No-Go Task was completed first, there was a significant interaction between the smartphone conditions. In order to interpret this effect, simple main effects were completed investigating the effect of smartphone location at each level of smartphone power. Therefore, a simple one-way ANOVA for smartphone location was completed for each smartphone power condition separately.

Two new data sets were created for the simple two-way interactions: (1) for the Power ON Data (gng_ac_gng_ON_data) and (2) for the Power OFF Data (gng_ac_gng_OFF_data).


# get a subset of the gng_ac_gng_data
  # for the ON power group only
  gng_ac_gng_ON_data = gng_ac_gng_data[gng_ac_gng_data$power == "ON", ]

  # for the OFF power group only
  gng_ac_gng_OFF_data = gng_ac_gng_data[gng_ac_gng_data$power == "OFF", ]

To correct the family-wise error rate, a Holm-Bonferroni adjustment was used when assessing the significance of main effects associated with these ANOVAs. The Holm-Bonferroni adjustment defined a corrected alpha level according to the following formula: \[\alpha_{corrected} = \frac{\alpha}{number of comparisons − rank of comparison + 1}\]

Powered ON: Descriptive Statistics

The first simple main effect ANOVA was completed using the “Powered ON” subset of the Power Factor (i.e. participants who completed the Go/No-Go task first with their smartphones powered ON). The between-subjects factor was smartphone location (i.e. desk, pocket/bag or outside) and the dependent variable was accuracy (i.e. average reaction time).

There was a total of 113 participants in the Powered ON analysis. The following table depicts the descriptive statistics: the mean (M), standard deviation (SD), minimum score (min.), maximum score (max.), and the sample size (n).


# create a data frame with the descriptives of all the data, grouped by the location & power conditions
  # this creates information for the 6 possible combined conditions
  gng_ac_gng_ON_des = psych::describeBy(gng_ac_gng_ON_data, gng_ac_gng_ON_data$location, mat = TRUE)  

# organize the data into a new data frame depicting the M, SD, min., max., and n.
  gng_ac_gng_ON_des_t = cbind(t(gng_ac_gng_ON_des["rt_av1", c("mean", "sd", "min", "max", "n")]),
                              t(gng_ac_gng_ON_des["rt_av2", c("mean", "sd", "min", "max", "n")]),
                              t(gng_ac_gng_ON_des["rt_av3", c("mean", "sd", "min", "max", "n")]))
    colnames(gng_ac_gng_ON_des_t) = c("On Desk", "In Pocket/Bag", "Outside")
    rownames(gng_ac_gng_ON_des_t) = c("M", "SD", "min.", "max.", "n")
  
# use kable to show the descriptives table
  kable(gng_ac_gng_ON_des_t, caption = "Powered ON Descriptive Statistics: Reaction Time (sec.) on the Go/No-Go Task", digits = 4, align = 'c') %>%
  kable_styling(bootstrap_options = "striped", "hover", full_width = F) %>%
  add_header_above(c(" ", "Smartphone Location" = 3))

Powered ON Descriptive Statistics: Reaction Time (sec.) on the Go/No-Go Task
Smartphone Location
On Desk In Pocket/Bag Outside
M 1.6519 1.6673 1.6525
SD 0.0411 0.0496 0.0419
min. 1.5783 1.5618 1.5600
max. 1.7472 1.7557 1.7449
n 36.0000 39.0000 38.0000

NA
Powered ON: Assumptions

The one-way simple main effect ANOVA made three assumptions. All assumptions were met. (click to see details).

1. Independent Random Sampling

This assumption was met during testing.

2. Normality

This assumption was tested by visualizing the residuals, applying a Shapiro-Wilk test to the resdiuals, and observing the Skewness and Kurtosis of the data (extracted from the ANOVA output).


# run between-subjects ANOVA (IV: Smartphone Power & Location; DV: OSpan Abolute Score)
gng_ac_gng_ON_anova = ezANOVA(
  data = gng_ac_gng_ON_data
  , dv = .(rt_av)
  , wid = .(participant)
  , between = .(location)
  , type = 3 # unequal sample sizes, therfore use type 3
  , detailed = TRUE
  , return_aov = TRUE
  )

# calculate & extract the residuals from the ANOVA
gng_ac_gng_ON_res = data.frame("residuals" = proj(gng_ac_gng_ON_anova$aov)[,"Residuals"])   
  • It appears that the residuals lie closely to a straight line, which suggests normality (see visualization below).

qqnorm(gng_ac_gng_ON_res$residuals, main = "Q-Q Plot of the Go/No-Go Reaction Time Residuals for the Powered ON Group") 
qqline(gng_ac_gng_ON_res$residuals)

  • It appears that residuals tend to follow the shape of a normal distribution (see visualization below).

qplot(gng_ac_gng_ON_res$residuals, main = "Histogram of Go/No-Go Reaction Time Residuals for the Powered ON Group") + 
  theme_classic()

  • The Shapiro-Wilk test of normality was not significant (based on \(\alpha = 0.05\)), W = 0.99, p = .33 (see code below).

gng_ac_gng_ON_shap = shapiro.test(gng_ac_gng_ON_res$residuals)
gng_ac_gng_ON_shap

    Shapiro-Wilk normality test

data:  gng_ac_gng_ON_res$residuals
W = 0.98671, p-value = 0.3314
  • The data’s Skewness and Kurtosis was examined and showed that both the Skewness (range: 0.06 to 0.34) and Kurtosis (range: -0.88 to -0.45) were within the acceptable range of \(\pm 2.0\) and \(\pm 9.0\), respectively (Schmider, Ziegler, Danay, Beyer, & Bühner, 2010) (see table below).

# use the descriptives data -- gng_ac_gng_ON_des

# organize the data into a new data frame depicting the Skew, kurtosis, and sample size (n).
  gng_ac_gng_ON_sk_t = cbind(t(gng_ac_gng_ON_des["rt_av1", c("skew", "kurtosis", "n")]),
                             t(gng_ac_gng_ON_des["rt_av2", c("skew", "kurtosis", "n")]),
                             t(gng_ac_gng_ON_des["rt_av3", c("skew", "kurtosis", "n")]))
    colnames(gng_ac_gng_ON_sk_t) = c("On Desk", "In Pocket/Bag", "Outside")
    rownames(gng_ac_gng_ON_sk_t) = c("Skewness", "Kurtosis", "n")
    
# use kable to show the descriptives table
  kable(gng_ac_gng_ON_sk_t, caption = "Skewness and Kurtosis of the Reaction Time on the Go/No-Go Task for the Powered ON Group", digits = 4, align = 'c') %>%
  kable_styling(bootstrap_options = "striped", "hover", full_width = F) %>%
  add_header_above(c(" ", "Smartphone Location" = 3))

Skewness and Kurtosis of the Reaction Time on the Go/No-Go Task for the Powered ON Group
Smartphone Location
On Desk In Pocket/Bag Outside
Skewness 0.3417 0.1117 0.0623
Kurtosis -0.6968 -0.8811 -0.4550
n 36.0000 39.0000 38.0000

NA
  • Therefore, based on all the normality assumption checks, normality was assumed to be within the acceptable range for the analysis. Additionally, it should be noted that ANOVA analyses can be robust to violations of normality (Gardner & Tremblay, 2007).

3. Homogeneity of Variance

This assumption was tested by conducting Levene’s Test for Homogeneity of Variance for the between-subjects variable.

  • The homogeneity assumption was met (based on \(\alpha = 0.05\)) for smartphone location, F(2, 110) = 0.95, p = .39 (see code below).

# conduct levene's
  # for location
  gng_ac_gng_ON_lev_loc = leveneTest(data = gng_ac_gng_ON_data, rt_av ~ as.factor(location), center = median)
  gng_ac_gng_ON_lev_loc
Levene's Test for Homogeneity of Variance (center = median)
       Df F value Pr(>F)
group   2  0.9466 0.3912
      110               
Powered OFF: Descriptive Statistics

The second simple main effect ANOVA was completed using the “Powered OFF” subset of the Power Factor (i.e. participants who completed the Go/No-Go task first with their smartphones powered OFF). The between-subjects factor was smartphone location (i.e. desk, pocket/bag or outside) and the dependent variable was accuracy (i.e. average reaction time).

There was a total of 113 participants in the Powered ON analysis. The following table depicts the descriptive statistics: the mean (M), standard deviation (SD), minimum score (min.), maximum score (max.), and the sample size (n).


# create a data frame with the descriptives of all the data, grouped by the location & power conditions
  # this creates information for the 6 possible combined conditions
  gng_ac_gng_OFF_des = psych::describeBy(gng_ac_gng_OFF_data, gng_ac_gng_OFF_data$location, mat = TRUE)  

# organize the data into a new data frame depicting the M, SD, min., max., and n.
  gng_ac_gng_OFF_des_t = cbind(t(gng_ac_gng_OFF_des["rt_av1", c("mean", "sd", "min", "max", "n")]),
                              t(gng_ac_gng_OFF_des["rt_av2", c("mean", "sd", "min", "max", "n")]),
                              t(gng_ac_gng_OFF_des["rt_av3", c("mean", "sd", "min", "max", "n")]))
    colnames(gng_ac_gng_OFF_des_t) = c("On Desk", "In Pocket/Bag", "Outside")
    rownames(gng_ac_gng_OFF_des_t) = c("M", "SD", "min.", "max.", "n")
  
# use kable to show the descriptives table
  kable(gng_ac_gng_OFF_des_t, caption = "Powered OFF Descriptive Statistics: Reaction Time (sec.) on the Go/No-Go Task", digits = 4, align = 'c') %>%
  kable_styling(bootstrap_options = "striped", "hover", full_width = F) %>%
  add_header_above(c(" ", "Smartphone Location" = 3))

Powered OFF Descriptive Statistics: Reaction Time (sec.) on the Go/No-Go Task
Smartphone Location
On Desk In Pocket/Bag Outside
M 1.6735 1.6399 1.6630
SD 0.0674 0.0522 0.0656
min. 1.5690 1.5604 1.5635
max. 1.8442 1.7667 1.8688
n 36.0000 32.0000 34.0000

NA
Powered ON: Assumptions

The one-way simple main effect ANOVA made three assumptions. All assumptions were met. (click to see details).

1. Independent Random Sampling

This assumption was met during testing.

2. Normality

This assumption was tested by visualizing the residuals, applying a Shapiro-Wilk test to the resdiuals, and observing the Skewness and Kurtosis of the data (extracted from the ANOVA output).


# run between-subjects ANOVA (IV: Smartphone Power & Location; DV: OSpan Abolute Score)
gng_ac_gng_OFF_anova = ezANOVA(
  data = gng_ac_gng_OFF_data
  , dv = .(rt_av)
  , wid = .(participant)
  , between = .(location)
  , type = 3 # unequal sample sizes, therfore use type 3
  , detailed = TRUE
  , return_aov = TRUE
  )

# calculate & extract the residuals from the ANOVA
gng_ac_gng_OFF_res = data.frame("residuals" = proj(gng_ac_gng_OFF_anova$aov)[,"Residuals"])   
  • It appears that the residuals lie closely to a straight line, which suggests normality (see visualization below).

qqnorm(gng_ac_gng_OFF_res$residuals, main = "Q-Q Plot of the Go/No-Go Reaction Time Residuals for the Powered OFF Group") 
qqline(gng_ac_gng_OFF_res$residuals)

  • It appears that residuals tend to follow the shape of a normal distribution (see visualization below).

qplot(gng_ac_gng_OFF_res$residuals, main = "Histogram of Go/No-Go Reaction Time Residuals for the Powered OFF Group") + 
  theme_classic()

  • The Shapiro-Wilk test of normality was significant (based on \(\alpha = 0.05\)), W = 0.95, p = .001 (see code below).

gng_ac_gng_OFF_shap = shapiro.test(gng_ac_gng_OFF_res$residuals)
gng_ac_gng_OFF_shap

    Shapiro-Wilk normality test

data:  gng_ac_gng_OFF_res$residuals
W = 0.94912, p-value = 0.0006317
  • The data’s Skewness and Kurtosis was examined and showed that both the Skewness (range: 0.52 to 0.95) and Kurtosis (range: -0.52 to 0.88) were within the acceptable range of \(\pm 2.0\) and \(\pm 9.0\), respectively (Schmider, Ziegler, Danay, Beyer, & Bühner, 2010) (see table below).

# use the descriptives data -- gng_ac_gng_OFF_des

# organize the data into a new data frame depicting the Skew, kurtosis, and sample size (n).
  gng_ac_gng_OFF_sk_t = cbind(t(gng_ac_gng_OFF_des["rt_av1", c("skew", "kurtosis", "n")]),
                             t(gng_ac_gng_OFF_des["rt_av2", c("skew", "kurtosis", "n")]),
                             t(gng_ac_gng_OFF_des["rt_av3", c("skew", "kurtosis", "n")]))
    colnames(gng_ac_gng_OFF_sk_t) = c("On Desk", "In Pocket/Bag", "Outside")
    rownames(gng_ac_gng_OFF_sk_t) = c("Skewness", "Kurtosis", "n")
    
# use kable to show the descriptives table
  kable(gng_ac_gng_OFF_sk_t, caption = "Skewness and Kurtosis of the Reaction Time on the Go/No-Go Task for the Powered OFF Group", digits = 4, align = 'c') %>%
  kable_styling(bootstrap_options = "striped", "hover", full_width = F) %>%
  add_header_above(c(" ", "Smartphone Location" = 3))

Skewness and Kurtosis of the Reaction Time on the Go/No-Go Task for the Powered OFF Group
Smartphone Location
On Desk In Pocket/Bag Outside
Skewness 0.8433 0.5198 0.9459
Kurtosis 0.0585 -0.5200 0.8837
n 36.0000 32.0000 34.0000

NA
  • Therefore, based on all the normality assumption checks, normality was assumed to be within the acceptable range for the analysis. Additionally, it should be noted that ANOVA analyses can be robust to violations of normality (Gardner & Tremblay, 2007).

3. Homogeneity of Variance

This assumption was tested by conducting Levene’s Test for Homogeneity of Variance for the between-subjects variable.

  • The homogeneity assumption was met (based on \(\alpha = 0.05\)) for smartphone location, F(2, 99) = 0.65, p = .52 (see code below).

# conduct levene's
  # for location
  gng_ac_gng_OFF_lev_loc = leveneTest(data = gng_ac_gng_OFF_data, rt_av ~ as.factor(location), center = median)
  gng_ac_gng_OFF_lev_loc
Levene's Test for Homogeneity of Variance (center = median)
      Df F value Pr(>F)
group  2   0.649 0.5248
      99               
Simple Main Effects: Results

All results were based on a Holm-Bonferroni adjustment applied to the original p-values. Below are the reported statistics for both analyses (i.e. powered ON and OFF), followed by a table depicting the ANOVA results.

Main Effect of Smartphone Location in the Powered ON Condition

There was no significant main effect of smartphone location on reaction time for those in the powered OFF condition, F(2, 110) = 1.47, p = .24, \(\eta^{2}\) = .03.

Main Effect of Smartphone Location in the Powered OFF Condition

There was no significant main effect of smartphone location on reaction time for those in the powered OFF condition, F(2, 99) = 2.54, p = .17, \(\eta^{2}\) = .05.

Post-hoc Tests

Since there was no significant main or interaction effects, no post-hocs were completed.


# create a table with the anova results including the adjusted p-values 
  # create a table with both power condition data
  gng_ac_gng_POW_t = rbind(gng_ac_gng_ON_anova$ANOVA[gng_ac_gng_ON_anova$ANOVA$Effect == "location",], 
                           gng_ac_gng_OFF_anova$ANOVA[gng_ac_gng_OFF_anova$ANOVA$Effect == "location",])
  # replace the p-values with the adjusted values (Holm-Bonferroni adjustment)
  gng_ac_gng_POW_t$p = p.adjust(c(gng_ac_gng_ON_anova$ANOVA$p[2], gng_ac_gng_OFF_anova$ANOVA$p[2]), method = "holm")
  # adjust the p<.05 column to reflect the new data (if there was any change)
  gng_ac_gng_POW_t$`p<.05` = ifelse(gng_ac_gng_POW_t$p < 0.05, "*", "")
  # replace the names in the effects column with "ON" & "OFF"
  gng_ac_gng_POW_t$Effect = c("ON", "OFF")
  # rename the columns
  colnames(gng_ac_gng_POW_t) = c("Smartphone Power Condition", "DFn", "DFd", "SSn", "SSd", "F", "p", "p<.05", "ges")
  # remove the row names
  rownames(gng_ac_gng_POW_t) = NULL

# show ANOVA results in kable table 
kable(gng_ac_gng_POW_t, caption = "Simple Main Effects Analyses: Reaction Time on the Go/No-Go Task for the Go/No-Go First Balance Group", digits = 4, align = 'c', escape = FALSE) %>%
  kable_styling(bootstrap_options = "striped", "hover", full_width = F) 

Simple Main Effects Analyses: Reaction Time on the Go/No-Go Task for the Go/No-Go First Balance Group
Smartphone Power Condition DFn DFd SSn SSd F p p<.05> ges
ON 2 110 0.0058 0.2178 1.4665 0.2352 0.0260
OFF 2 99 0.0198 0.3850 2.5411 0.1678 0.0488

NA
Visualizations

The following shows a visulization of Accuracy (i.e. reaction time) on the Cue-Dependent Go/No-go Task. There was a significant three-way interaction in the main ANOVA (i.e. between smartphone location, smartphone power, and balance). Additionally, the main factors of interest were the smartphone conditions (i.e. location and power). Therefore, two visualizations are shown depicting the smartphone conditions, one for each of the levels of the balance factor.

Testing the Moderation Hypothesis:

SAD

Factor Analysis

To examine the underlying factors measured by the Smartphone Attachment and Dependecy Inventory (SAD) from Ward et al. (2017), a principal components factor analysis with a Varimax rotation was completed on participant’s SAD data. The analysis was done exploratorily and was compared to Ward et al.’s findings, where they found two factors, smarphone dependence (i.e. six items; items #1-6; 31.02% of the variance) and emotional attachment (i.e. five items; items #8-11; 21.65% of the variance), which explained 52.67% of the total variance. It should be noted that two items (i.e. item #7 and 13) did not strongly load onto a factor and were excluded from subsequent analyses.

To perform the principal components factor analysis, we first examined the data for (1) missing data, (2) correlations between the items, and (3) the factorability of the data.

  1. Missing Data
  • From the descriptives table, it seems that there is no missing or incomplete data. This was confirmed by finding the complete cases, which were equal to the total sample size (i.e. 409). Therefore, no data was removed due to missing values for this analysis (see descriptives table below).

# get a data frame with the SAD items only for the factor analysis
fa_data = dplyr::select(factor_analysis, SAD_1:SAD_13)

# find the total complete cases to find any missing data from the data -- any participant with missing data must be removed
fa_complete = sum(complete.cases(fa_data))

# Show descriptives for the factor analysis data
fa_des = describe(fa_data)

# show descriptives in kable table 
kable(fa_des, caption = "Descriptives for Items in the SAD", digits = 4, align = 'c', escape = FALSE) %>%
  kable_styling(bootstrap_options = "striped", "hover", full_width = F) 

Descriptives for Items in the SAD
vars n mean sd median trimmed mad min max range skew kurtosis se
SAD_1 1 409 4.7262 1.7443 5 4.8389 1.4826 1 7 6 -0.5344 -0.7959 0.0862
SAD_2 2 409 4.0709 1.8543 5 4.1033 1.4826 1 7 6 -0.1517 -1.2732 0.0917
SAD_3 3 409 3.7897 1.9097 3 3.7508 2.9652 1 7 6 0.1153 -1.2714 0.0944
SAD_4 4 409 4.6259 1.7821 5 4.7234 1.4826 1 7 6 -0.4991 -0.8988 0.0881
SAD_5 5 409 4.5819 1.7598 5 4.6596 1.4826 1 7 6 -0.4048 -0.9655 0.0870
SAD_6 6 409 4.6039 1.6714 5 4.7021 1.4826 1 7 6 -0.6096 -0.6416 0.0826
SAD_7 7 409 5.7188 1.2154 6 5.9058 1.4826 1 7 6 -1.3640 2.3117 0.0601
SAD_8 8 409 3.3399 1.7944 3 3.2523 2.9652 1 7 6 0.3104 -1.1182 0.0887
SAD_9 9 409 4.0000 1.7864 4 4.0365 1.4826 1 7 6 -0.1570 -1.0538 0.0883
SAD_10 10 409 4.7115 1.4983 5 4.8419 1.4826 1 7 6 -0.7497 0.1181 0.0741
SAD_11 11 409 4.5966 1.4094 5 4.7264 1.4826 1 7 6 -0.6631 0.2059 0.0697
SAD_12 12 409 4.7042 1.7872 5 4.8207 1.4826 1 7 6 -0.5207 -0.7931 0.0884
SAD_13 13 409 5.2665 1.4002 6 5.4347 1.4826 1 7 6 -1.0832 1.0722 0.0692

NA
  1. Correlations between the Items
  • The correlations between the items were examined to determine if they were either not high enough (i.e., r < .30) or too high (i.e., r > .80).
    • The correlations for the 13 items in the SAD ranged from r(407) = .16 to r(407) = .72, p = .001.
    • A visual inspection of the correlations showed that items 12 and 13 might not be correlated enough with the rest of the items (i.e. 11 and 9 correlations below .30, respectively). There were no issues associated with too many high correations for any items.
    • The lower correlations for items 12 and 13 should be kept in mind when completing the factor analysis and these items may be removed.
  • To check for multicollinearity, the determinant of the correlation matrix was calculated. The determinant was greater than the necessary value of 0.00001 (i.e., 0.00452), so, there was no issues with multicollinearity (Field et al., 2012).
  • Therefore, all items in the SAD correlated reasonably with each other and without an excessively large correalation so no items were removed at this stage.

# get the pearson correlations between all items (and corresponding p-values)
fa_cor = rcorr(as.matrix(fa_data), type = "pearson") 

# make a table with the data to display it better
fa_cor_t = as.data.frame(fa_cor$r)
  # include only the lower triangle of the data
  fa_cor_t[upper.tri(fa_cor_t)] = NA
  
# find the determinant of the correlation matrix
fa_det = det(fa_cor$r)

# Print table using kable
  # make any NAs appear as blank within kable
  opts = options(knitr.kable.NA = "")
  # print table
  kable(fa_cor_t, caption = "Pearson Correlations between Items in the SAD", digits = 4, align = 'c', escape = FALSE) %>%
    kable_styling(bootstrap_options = "striped", "hover", full_width = F) %>%
    add_footnote("p < .001", notation = "alphabet")

Pearson Correlations between Items in the SAD
SAD_1 SAD_2 SAD_3 SAD_4 SAD_5 SAD_6 SAD_7 SAD_8 SAD_9 SAD_10 SAD_11 SAD_12 SAD_13
SAD_1 1.0000
SAD_2 0.7176 1.0000
SAD_3 0.5515 0.6064 1.0000
SAD_4 0.5528 0.5836 0.5537 1.0000
SAD_5 0.4481 0.4733 0.4522 0.5322 1.0000
SAD_6 0.4377 0.4235 0.3724 0.4891 0.5544 1.0000
SAD_7 0.3081 0.3395 0.3018 0.3757 0.4354 0.5025 1.0000
SAD_8 0.3712 0.3691 0.2841 0.4001 0.3866 0.4217 0.3035 1.0000
SAD_9 0.3445 0.4129 0.3291 0.3903 0.4117 0.3152 0.3714 0.4106 1.0000
SAD_10 0.3364 0.3647 0.2683 0.3248 0.3009 0.3986 0.3753 0.5416 0.4579 1.0000
SAD_11 0.3837 0.4564 0.3782 0.3604 0.3350 0.3482 0.2942 0.3422 0.5685 0.5692 1.0000
SAD_12 0.2483 0.2363 0.2460 0.2591 0.2372 0.2585 0.2855 0.2943 0.2180 0.2426 0.2318 1.0000
SAD_13 0.2186 0.1579 0.2007 0.2316 0.2592 0.3133 0.3235 0.2165 0.1685 0.2202 0.1639 0.4939 1
a p < .001

    # restore the original options so that "NA" shows up again
    options(opts)
  • Another way to visualize this is using a Scatter Plot Matrix
    • This shows: (1) a scatter plot bellow the lower diagonal, with a line of best fit (red); (2) a historgram and density plot of the item on the diagonal; and (3) the correlation values (pearson r) above the diagonal, with corresponding significance shown with asterisks. The x-axis in each scatter plot represents the column variable and the y-axis the row variable.
# show a SPLOM (Scatter Plot Matrix) of the Data 
pairs.panels(fa_data, pch = ".", density = TRUE, ellipses = FALSE, lm = TRUE, digits = 2, method = "pearson", jiggle = TRUE, factor = 1, hist.col = "cadetblue3", stars = TRUE, gap = .5, verOdd = TRUE, horOdd = TRUE, main = "Scatter Plot Matrix of Items in the SAD")

  1. Factorability of the Data
  • This assessed if there were meaningful latent factors to be found within the data, which was assessed in two ways: Barlett’s test of sphericity and the Kaiser-Meyer-Olkin measure of sampling adequacy.
    • Barlett’s test of sphericity:
      • This test was statistically significant, which suggested good factorability, \(\chi^{2}\)(78) = 2174.97, p < .001
    • Kaiser-Meyer-Olkin measure of sampling adequacy:
      • This measure showed an overall measure of sampling adequacy (MSA) and an MSA per item. The overall MSA was higher than both the minimum and preferred criterias (0.50 and 0.60, respectively; Kaiser, 1974, Hutcheson & Sofroniou, 1999), \(MSA_{overall}\) = 0.89. Each item’s MSA was also higher than the minimum and preferred criteria, \(MSA_{1-13}\) = 0.79 - 0.94.
  • Therefore, both factorability tests confirmed that the planned factor analysis was appropriate.
# run the Barlett Test
fa_bartest = cortest.bartlett(fa_data)
fa_bartest
$chisq
[1] 2174.971

$p.value
[1] 0

$df
[1] 78
# run the KMO 
fa_kmo = KMO(fa_data)
fa_kmo
Kaiser-Meyer-Olkin factor adequacy
Call: KMO(r = fa_data)
Overall MSA =  0.89
MSA for each item = 
 SAD_1  SAD_2  SAD_3  SAD_4  SAD_5  SAD_6  SAD_7  SAD_8  SAD_9 SAD_10 SAD_11 SAD_12 SAD_13 
  0.89   0.88   0.93   0.94   0.92   0.90   0.91   0.88   0.89   0.84   0.85   0.82   0.79 

Now that factorability was confirmed, we found the appropriate number of factors to extract. This was done using: (1) a parallel analysis scree plot, (2) eigen values (i.e., from the parallel analysis), and (3) the factor structure between the two solutions (i.e., using factor diagrams).

  1. Parralel Analysis Scree Plot.

This plotted the eigenvalues from the data and a parrallel analysis which generated random correlation matrices. The number of factors is then determined by comparing the resulting eigenvalues to the observed data, that is, observed eigenvalues that were higher than the corresponding random eigenvalues were more likely to form meaningful factors.

  • The parrallel analysis scree plot showed that there are 2 appropriate factors (i.e. using the principal components extraction method).
    • From closer inspection of the plot, the last factor was very close to the random eigenvalue cut-off line, therefore, the 2 and 3 factor solutions were compared to determine the best fit (i.e., the eigen values and factor structures were examined).

fa_par = fa.parallel(fa_data, fa = "pc", main = "Parallel Analysis Scree Plot for Items in the SAD")
Parallel analysis suggests that the number of factors =  NA  and the number of components =  2 

  1. Eigen Values

For a sample size greater than 250 (i.e., 409 complete cases) the average communalities (i.e., the proportion of common variance within the items) for the two factor (0.52) and three factor (0.62) solutions were wihtin the acceptable value (i.e., 0.60; Field et al., 2012). Therefore, Kaiser’s criterion (i.e., a meaningful factor has an eigen value greater than 1) was used for the eigen values. It should be noted that although the average communalities were not very high, the scree plot and factor diagrams were used to choose the appropriate number of factors.

  • The eigen values (unrotated solution) showed three meaningful factors (i.e., 5.5, 1.31, 1.19), which supported the three-factor solution as a better representation of the data.
# Eigen values
  # view eigen values from the parallel analysis scree plot
  round(fa_par$pc.values, 3)
 [1] 5.502 1.308 1.186 0.883 0.718 0.589 0.522 0.475 0.463 0.417 0.369 0.304 0.265
  1. Factor Diagrams

A factor diagram for the two- and three-factor solutions (rotated solution) was used to determine which solution represents the most interpretable solution. The diagrams show the factor loadings for each item on the given factor for which they load the highest on. The items were ordered (i.e., descending) based on these loadings.
Note: The factor diagrams are based on the rotated solutions to better distribute the factor loadings

  • The factor diagrams showed that the two factor solution resulted in most items loading onto only one factor and was not easily interpreted. The three factor solution showed a better distribution and was more interpretable.

Therefore, the three factor solution was chosen to better represent the data.

##-- VARIMAX ROTATION  
# run a factor analysis to compare the two- and three-factor solutions (no rotation)
  # 2 factor solution
  fa_two_v = principal(fa_data, nfactors = 2, rotate = "varimax", scores = T)
    # change name in loadings for clarity when visualizing the model
    colnames(fa_two_v$loadings) = c("Factor 1", "Factor 2")
  
  # 3 factor solution
  fa_three_v = principal(fa_data, nfactors = 3, rotate = "varimax", scores = T)
    # change name in loadings for clarity when visualizing the model
    colnames(fa_three_v$loadings) = c("Factor 1", "Factor 2", "Factor 3")

# Factor Diagram
  # 2 factor solution
  fa.diagram(fa_two_v, main = "Two Factor Solution for items in the SAD (Varimax Rotation)", sort = TRUE, adj = 1, digits = 2)


  # 3 factor solution
  fa.diagram(fa_three_v, main = "Three Factor Solution for Items in the SAD (Varimax Rotation)", sort = TRUE, adj = 1, digits = 2)

  
  
##-- OBLIMIN ROTATION  
# run a factor analysis to compare the two- and three-factor solutions (no rotation)
  # 2 factor solution
  fa_two_o = principal(fa_data, nfactors = 2, rotate = "oblimin", scores = T)
    # change name in loadings for clarity when visualizing the model
    colnames(fa_two_o$loadings) = c("Factor 1", "Factor 2")
  
  # 3 factor solution
  fa_three_o = principal(fa_data, nfactors = 3, rotate = "oblimin", scores = T)
    # change name in loadings for clarity when visualizing the model
    colnames(fa_three_o$loadings) = c("Factor 1", "Factor 2", "Factor 3")

# Factor Diagram
  # 2 factor solution
  fa.diagram(fa_two_o, main = "Two Factor Solution for items in the SAD (Oblimin Rotation)", sort = TRUE, adj = 1, digits = 2)


  # 3 factor solution
  fa.diagram(fa_three_o, main = "Three Factor Solution for Items in the SAD (Oblimin Rotation)", sort = TRUE, adj = 1, digits = 2)

Now that we determined the number of factors (i.e., three), we observed the communalities. Each item’s communality was the percentage of variance that was explained by the retained factors for each item. These ranged from 45% to 74%, with an average of 62% (see table for more details).

# get the communalities from the model
fa_com = as.data.frame(fa_three_v$communality)

# create matrix to present communalities in a table
fa_com_t = rbind((fa_com[,1])*100)
  # rename row and column names
  colnames(fa_com_t) = rownames(fa_com)
  rownames(fa_com_t) = NULL

# Print table using kable
  # print table
  kable(fa_com_t, caption = "Communalities for the Three-Factor Solution for Items in the SAD (%)", digits = 4, align = 'c', escape = FALSE) %>%
    kable_styling(bootstrap_options = "striped", "hover", full_width = F) %>%
    add_footnote("Communalities show the percentage of variance explained by each factor for each item on the SAD.", notation = "alphabet")
Communalities for the Three-Factor Solution for Items in the SAD (%)
SAD_1 SAD_2 SAD_3 SAD_4 SAD_5 SAD_6 SAD_7 SAD_8 SAD_9 SAD_10 SAD_11 SAD_12 SAD_13
67.7017 73.8798 63.8423 64.4335 53.2454 51.671 45.0262 50.2766 60.4257 72.1824 64.965 60.1458 71.8026
a Communalities show the percentage of variance explained by each factor for each item on the SAD.

Next, we looked at the percentage of variance accounted for by using the eigenvalues (compute the variance assosiated with the factors). This was done for both the unrotated and rotated (i.e., varimax) three-factor solution. As you can see in the table below, a total of 61.51% of variance was accounted for by the three extracted factors. This variance was mostly accounted for by factor one in the unrotated solution, but was more equally distributed after rotation (see table for more details).


# Run an unrotated three-factor solution
  fa_three = principal(fa_data, nfactors = 3, rotate = "none", scores = T) 
    # change name in loadings for clarity when visualizing the model
    colnames(fa_three$loadings) = c("Factor 1", "Factor 2", "Factor 3") 
    
  
# To find the variace accounted for, you can either (1) calculate it from the eigen values for each factor, or (2) grab the calculated values from the model 
  # Both are shown below, but one is commented out to avoid creating unessesary variables
  # (1) Calculated 
    # For the unrotated solution
    #var_per = 100*fa_three$values[1:3]/length(fa_three$values)
    
    # For the rotated solution
    #var_per_v = 100*(as.numeric(fa_three_v$Vaccounted[1,]))/length(fa_three_v$values)
      
  # (2) The information is found in the "Vaccounted" table from the output, where "SS loadings" (row 1) shows the eigen values and "Proportion Var" (row 2) shows the variance accounted for.
    # For the unrotated solution
    var_per = 100*as.numeric(fa_three$Vaccounted[2,])
    
    # For the rotated solution
    var_per_v = 100*as.numeric(fa_three_v$Vaccounted[2,])

# Create & Show a table with eigen values, percent var explained, and total percent var explained for unrotated & rotated three-factor solutions
    # unrotated solution
    var_per_t = as.data.frame(cbind(rbind(as.numeric(fa_three$Vaccounted[1,]), var_per), 
                                    rbind(as.numeric(fa_three_v$Vaccounted[1,]), var_per_v)))
    # name the columns & rows accordingly
    colnames(var_per_t) = c(rep(c("Factor 1", "Factor 2", "Factor 3"),2))
    rownames(var_per_t) = c("Eigen Values", "Variance Explained (%)")

  # use kable to show the table
  kable(var_per_t, caption = "Eigen Values and Percent of Variance Explained for the Unrotated and Rotated (varimax) for the Three-Factor Solution for Items in the SAD", digits = 4, align = 'c') %>%
  kable_styling(bootstrap_options = "striped", "hover", full_width = T) %>%
  add_header_above(c(" ", "Unrotated" = 3, "Varimax Rotation" = 3))

Eigen Values and Percent of Variance Explained for the Unrotated and Rotated (varimax) for the Three-Factor Solution for Items in the SAD
Unrotated
Varimax Rotation
Factor 1 Factor 2 Factor 3 Factor 1 Factor 2 Factor 3
Eigen Values 5.5022 1.3080 1.1857 3.4009 2.6434 1.9516
Variance Explained (%) 42.3250 10.0616 9.1210 26.1611 20.3339 15.0125

NA

Lastly, we looked at the factor loadings of the three-factor solution. Below you can see the rotated (i.e., varimax) and unrotated facter loadings. The loadings are shown with a cut-off of 0.40 and rounded to three decimal places. As we saw in tha factor diagrams, the unrotated factor loadings are difficult to interpret: all items loaded strongly onto factor 1 (i.e., loading was greater than 0.40). The roatated factor loadings were more easy to interpret, with only one item loading onto more than one factor (i.e., item six; see below for more details).


# Unrotated Factor Loadings 
print(fa_three$loadings, cutoff = 0.40, digits = 3)

Loadings:
       Factor 1 Factor 2 Factor 3
SAD_1   0.725                    
SAD_2   0.756                    
SAD_3   0.675                    
SAD_4   0.739                    
SAD_5   0.705                    
SAD_6   0.700                    
SAD_7   0.611                    
SAD_8   0.633                    
SAD_9   0.645            -0.432  
SAD_10  0.639            -0.541  
SAD_11  0.650            -0.470  
SAD_12  0.462    0.585           
SAD_13  0.423    0.669           

               Factor 1 Factor 2 Factor 3
SS loadings       5.502    1.308    1.186
Proportion Var    0.423    0.101    0.091
Cumulative Var    0.423    0.524    0.615
# Rotated (i.e., varimax) Factor Loadings 
print(fa_three_v$loadings, cutoff = 0.40, digits = 3)

Loadings:
       Factor 1 Factor 2 Factor 3
SAD_1  0.788                     
SAD_2  0.809                     
SAD_3  0.780                     
SAD_4  0.744                     
SAD_5  0.613                     
SAD_6  0.491             0.416   
SAD_7                    0.487   
SAD_8           0.611            
SAD_9           0.723            
SAD_10          0.819            
SAD_11          0.754            
SAD_12                   0.754   
SAD_13                   0.841   

               Factor 1 Factor 2 Factor 3
SS loadings       3.401    2.643    1.952
Proportion Var    0.262    0.203    0.150
Cumulative Var    0.262    0.465    0.615

In order to validate the SAD, we completed a reliability analysis (i.e., is the scale consistent) by looking at the split-half reliability using Cronbach’s alpha. This method is the most common measure of scale reliability (Field et al., 2012): the data for each participant is randomly split into two sets, which forms a “half” split for the scale. Then, a correlation is calculated between these sets to determine how similar they are: higher reliability would be seen with highly correlated sets. Since the way that the data is split can impact the correlations, Cronbach’s alpha represents an average of all the possible splits using the following equation:

\[\alpha = \frac{N^2 \overline{Cov}}{\sum s_{item}^2 + \sum Cov_{item}} \] To obtain the best reliability measure, we must divide our data into any subscales, which in this case, are our three factors from our factor analysis. Therefore, we split the data into three sets for the reliability analysis.


# split the data respective to the 3 factors from the FA to perform the reliability analysis
  # factor 1 (items: 1, 2, 3, 4, 5)
  f1_data = fa_data[,c(1, 2, 3, 4, 5)]
  # factor 2 (items: 8, 9, 10, 11)
  f2_data = fa_data[,c(8, 9, 10, 11)]
  # factor 3 (items: 7, 12, 13)
  f3_data = fa_data[,c(7, 12, 13)]

Once we have our subscales, we complete the reliability analysis on each subscale.


# compute reliability analysis for each subscale using "alpha()" function in "psych" package
  # factor 1
  f1_rel = psych::alpha(f1_data)
  # factor 2
  f2_rel = psych::alpha(f2_data)
  # factor 3
  f3_rel = psych::alpha(f3_data)

To view the analysis, we broke it down into three compoenents: (1) The alpha values and overall mean and standard deviation of the subscales, (2) The alpha values if an item was dropped (i.e., Alpha Drop Values), and (3) Item statistics.

  1. Alpha Values
  • The alpha values from the reliability analyses included Cronbach’s \(\alpha\) (i.e., based on the covariances), a standardized \(\alpha\) (i.e., based on the correlations), and Guttman’s Lambda 6. Additionally, the overall average correlation, mean, and standard deviation for the subscale is shown (see table below for more details).
  • As seen in the table, Cronbach’s \(\alpha\) was within the acceptable range (i.e., greater than .70; Kline, 1999) for Factor 1 (\(\alpha_C\) = 0.86) and Factor 2 (\(\alpha_C\) = 0.78), but not for Factor 3 (\(\alpha_C\) = 0.63). The overall correlations for Factor 1, r(407) = .55, and Factor 2, r(407) = .48, were moderate, whereas Factor 3’s correlation, r(407) = .37, was weak-moderate. These statistics were further explored with dropped items.
# show the alpha values from the 3 subscales
  # create data frame with the information from the 3 reliability analyses
  fa_rel_alphas = as.data.frame(rbind("Factor 1" = dplyr::select(f1_rel$total, -c("S/N", "ase", "median_r")), 
                                      "Factor 2" = dplyr::select(f2_rel$total, -c("S/N", "ase", "median_r")), 
                                      "Factor 3" = dplyr::select(f3_rel$total, -c("S/N", "ase", "median_r"))))
  # add an extra column for the "overall" or average alpha
  fa_rel_alphas = cbind("Overall alpha" = c(mean(as.numeric(dplyr::select(f1_rel$total, c("raw_alpha", "std.alpha")))), 
                                            mean(as.numeric(dplyr::select(f2_rel$total, c("raw_alpha", "std.alpha")))), 
                                            mean(as.numeric(dplyr::select(f3_rel$total, c("raw_alpha", "std.alpha"))))), 
                        fa_rel_alphas)

  # use kable to show the table
  kable(fa_rel_alphas, caption = "Reliability Analysis for Subscales in the SAD: Alpha Values", digits = 4, align = 'c', 
        col.names = c("Overall alpha", "Cronbach's alpha", "Standardized alpha", "Guttman's lambda 6", "Average r", "M", "SD")) %>%
  kable_styling(bootstrap_options = "striped", "hover", full_width = T)

Reliability Analysis for Subscales in the SAD: Alpha Values
Overall alpha Cronbach's alpha Standardized alpha Guttman's lambda 6 Average r M SD
Factor 1 0.8580 0.8579 0.8580 0.8383 0.5471 4.3589 1.4463
Factor 2 0.7839 0.7798 0.7880 0.7565 0.4817 4.1620 1.2658
Factor 3 0.6332 0.6308 0.6356 0.5519 0.3676 5.2298 1.1277

NA
  1. Alpha Drop Values
  • To further explore the reliability of each subscale, we looked at the alpha values after removing each item. This allowed us to determine whether a given item was reducing the reliability of the subscale. That is, if dropping an item increased \(\alpha\) or the correlation, the removing the item from the analysis should be considered. The data presented in the table below shows Cronbach’s \(\alpha\), standardized \(\alpha\), Guttman’s Lambda 6, and the average correlation.
  • As seen in the table below, dropping an item did not improve reliability for Factor 1 (range of \(\alpha_C\) = 0.81 - 0.85) or Factor 2 (range of \(\alpha_C\) = 0.69 - 0.76). However, removing item #7 did improve reliability for Factor 3 (\(\alpha_{C-drop7}\) = 0.65). With respect to the average correlation, dropping an item did not improve for Factor 2, r(407) = .44 - = .53. However, item #5 for Factor 1, r(407) = .59, and item #7 for Factor 3, r(407) = .49, did show a some improvement in average correlation. This was explored further with the items statistics.
# show the alpha drop values from the 3 subscales
  # create data frame with the information
  fa_rel_adrop = as.data.frame(rbind(dplyr::select(f1_rel$alpha.drop, c("raw_alpha", "std.alpha", "G6(smc)", "average_r")), 
                                     dplyr::select(f2_rel$alpha.drop, c("raw_alpha", "std.alpha", "G6(smc)", "average_r")), 
                                     dplyr::select(f3_rel$alpha.drop, c("raw_alpha", "std.alpha", "G6(smc)", "average_r"))))

  # use kable to show the table
  kable(fa_rel_adrop, caption = "Reliability Analysis for Subscales in the SAD: Alpha Drop Values", digits = 4, align = 'c', 
        col.names = c("Cronbach's alpha", "Standardized alpha", "Guttman's lambda 6", "Average r")) %>%
  kable_styling(bootstrap_options = "striped", "hover", full_width = T) %>%
  pack_rows(index = c("Factor 1" = ncol(f1_data), "Factor 2" = ncol(f2_data), "Factor 3" = ncol(f3_data)))

Reliability Analysis for Subscales in the SAD: Alpha Drop Values
Cronbach's alpha Standardized alpha Guttman's lambda 6 Average r
Factor 1
SAD_1 0.8207 0.8207 0.7796 0.5336
SAD_2 0.8091 0.8095 0.7648 0.5151
SAD_3 0.8312 0.8309 0.8018 0.5513
SAD_4 0.8254 0.8253 0.7939 0.5415
SAD_5 0.8536 0.8542 0.8218 0.5942
Factor 2
SAD_8 0.7646 0.7732 0.7003 0.5319
SAD_9 0.7291 0.7380 0.6750 0.4843
SAD_10 0.6938 0.7025 0.6278 0.4404
SAD_11 0.7202 0.7268 0.6450 0.4700
Factor 3
SAD_7 0.6482 0.6612 0.4939 0.4939
SAD_12 0.4852 0.4889 0.3235 0.3235
SAD_13 0.4196 0.4441 0.2855 0.2855

NA
  1. Item Statistics
  • The item statistics were explored to determine whether a certain item did not correlate with the overall subscale. The table below shows: r, which is the correlations between each item and the total score from the subscale (i.e., the item-total correlations), which includes the item itself; and Drop, which is the correlation of an item with the subscale after removing the item (i.e., the item-rest correlation or the corrected item-total correlation). Additionally, the sample size, mean, and standard deviation is shown.
  • No issues were found with the item statistics for Factor 1, r(407) = .57 - = .75; Factor 2, r(407) = .52 - = .66; and Factor 3, r(407) = .35 - = .52. That is, no items had a item-rest correlation (i.e., the drop column in the table below) below the cut-off (i.e., .30; Field et al., 2012).

# show the correlations for the 3 subscales
  # create data frame with the information
  fa_rel_item = as.data.frame(rbind(dplyr::select(f1_rel$item.stats, -c("std.r", "r.cor")), 
                                     dplyr::select(f2_rel$item.stats, -c("std.r", "r.cor")), 
                                     dplyr::select(f3_rel$item.stats, -c("std.r", "r.cor"))))

  # use kable to show the table
  kable(fa_rel_item, caption = "Reliability Analysis for Subscales in the SAD: Item Statistics", digits = 4, align = 'c', 
        col.names = c("N", "r", "Drop", "M", "SD")) %>%
  kable_styling(bootstrap_options = "striped", "hover", full_width = T) %>%
  pack_rows(index = c("Factor 1" = ncol(f1_data), "Factor 2" = ncol(f2_data), "Factor 3" = ncol(f3_data)))  

Reliability Analysis for Subscales in the SAD: Item Statistics
N r Drop M SD
Factor 1
SAD_1 409 0.8161 0.7053 4.7262 1.7443
SAD_2 409 0.8487 0.7458 4.0709 1.8543
SAD_3 409 0.7991 0.6648 3.7897 1.9097
SAD_4 409 0.8052 0.6858 4.6259 1.7821
SAD_5 409 0.7234 0.5709 4.5819 1.7598
Factor 2
SAD_8 409 0.7548 0.5210 3.3399 1.7944
SAD_9 409 0.7921 0.5841 4.0000 1.7864
SAD_10 409 0.8078 0.6557 4.7115 1.4983
SAD_11 409 0.7687 0.6083 4.5966 1.4094
Factor 3
SAD_7 409 0.6440 0.3488 5.7188 1.2154
SAD_12 409 0.8352 0.4874 4.7042 1.7872
SAD_13 409 0.7910 0.5248 5.2665 1.4002

NA

Overall, the reliability for Factor 1 and Factor 2 was adequate. However, Factor 3 showed mixed results: item #7 was not as stable as the other items. However, this item did fit the minimum criteria for reliability and was therefore left in the subscale. OR The item met the minimum criteria, but was removed to improve the overall factor analysis fit with the data.

Conclusions

A principal components factor analysis was completed on the 13 items in the SAD (Ward et al., 2017) with an orthogonal rotation (i.e., varimax rotation). Factorability of the data was confirmed using: (1) Bartlett’s test for sphericity, \(\chi^{2}\)(78) = 2174.97, p < .001, which confirmed that correlations between the items were sufficiently large; and (2) the Kaiser-Meyer-Olkin measure for sampling adequacy, \(MSA_{overall}\) = 0.89, \(MSA_{1-13}\) = 0.79 - 0.94, which confirmed that both the overall MSA and each item’s MSA were above their required criteria (0.60 and 0.77, respectively; Kaiser, 1974). The appropriate number of factors (i.e., three) was confirmed using (1) a parallel analysis scree plot, where the two- and three-factor solutions were chosen to compare for th ebest fit for the data; (2) the eigen values (e.g., 5.5022442, 1.3080016, 1.1857354, 0.8831051, for factors 1 to 4, respectively) for the unrotated solution, where three meaningful factors were found using Kaiser’s criterion (i.e., eigen values greater than one); and (3) the factor structure of the two- and three-factor solutions, where the three-factor solution showed a better distribution of the data and was more interpretable. Therefore, the three-factor solution was chosen for the final analysis and explained 61.51% of the variance. Additionally, an orthogonal (i.e., varimax) rotation was chosen for the final analysis. The items in the final factors suggested the following three factors: smartphone dependence, emotional attachment, and smartphone distractability. Smartphone dependence was relate to the degree of dependence on one’s smartphone, it consisted of 5 items and explained 26.16% of the variance (e.g., “I would have trouble getting through a normal day without my smartphone.”, and “I feel like I could not live without my smartphone.”). Emotional attachment was related to one’s smartphone use for emotional support, it consisted of 4 items and explained 20.33% of the variance (e.g., “I feel lonely when my smartphone does not ring or vibrate for several hours.”, and “Using my smartphone makes me feel happy.”). Smartphone distractability was related to one’s smartphone retaining one’s attention, it consisted of 3 items and explained 15.01% of the variance (e.g., “I find it tough to focus whenever my smartphone is nearby.”, and “I become less attentive to my surroundings when I’m using my smartphone.”). One item did not clearly load onto any one factor and was excluded from further analyses (i.e., item 6; Costello & Osborne, 2005). See table below for all items and factor loadings.


# create table with all items, factor loadings, eigen values, % of virance explained, and Cronbach's alpha 

# get factor loadings
  fa_load = as.data.frame(fa_three_v$Structure[1:13,])
  # rename columns
  colnames(fa_load) = c("Factor 1", "Factor 2", "Factor 3")

# make list of the items & other info
  fa_table_list = as.data.frame(c("I would have trouble getting through a normal day without my smartphone.",
                              "It would be painful for me to give up my smartphone for a day.",
                              "I feel like I could not live without my smartphone.",
                              "If I forgot to bring my smartphone with me, I would feel anxious.",
                              "It drives me crazy when my smartphone runs out of battery.",
                              "I am upset and annoyed when I find I do not have reception on my smartphone.",
                              "I feel impatient when the Internet connection speed on my smartphone is slow.",
                              "I feel lonely when my smartphone does not ring or vibrate for several hours.",
                              "Using my smartphone relieves me of my stress.",
                              "I feel excited when I have a new message or notification.",
                              "Using my smartphone makes me feel happy.",
                              "I find it tough to focus whenever my smartphone is nearby.",
                              "I become less attentive to my surroundings when I’m using my smartphone.", 
                              "Eigen Values", 
                              "Variance Explained (%)"))
  
# create the final table
  # bind the loadings and items
    fa_three_t = cbind(fa_table_list, rbind(fa_load, var_per_t[4:6]))
    
    # rename columns & rows
      colnames(fa_three_t) = c("Item", "Factor 1: Smartphone Dependence", "Factor 2: Emotional Attachment", "Factor 3: Smartphone Distractability")
      rownames(fa_three_t)[14:15] = c("", " ") 
      
# use kable to show the table
  kable(fa_three_t, caption = "Summary of exploratory varimax rotated factor analysis results for the 13-Items in the SAD (N = 409)", digits = 4, align = "lccc") %>%
  kable_styling(bootstrap_options = "striped", "hover", full_width = T) %>%
  pack_rows("", 14, 15)

Summary of exploratory varimax rotated factor analysis results for the 13-Items in the SAD (N = 409)
Item Factor 1: Smartphone Dependence Factor 2: Emotional Attachment Factor 3: Smartphone Distractability
SAD_1 I would have trouble getting through a normal day without my smartphone. 0.7883 0.2098 0.1075
SAD_2 It would be painful for me to give up my smartphone for a day. 0.8090 0.2884 0.0343
SAD_3 I feel like I could not live without my smartphone. 0.7804 0.1354 0.1054
SAD_4 If I forgot to bring my smartphone with me, I would feel anxious. 0.7440 0.2265 0.1987
SAD_5 It drives me crazy when my smartphone runs out of battery. 0.6126 0.2608 0.2987
SAD_6 I am upset and annoyed when I find I do not have reception on my smartphone. 0.4907 0.3211 0.4157
SAD_7 I feel impatient when the Internet connection speed on my smartphone is slow. 0.3062 0.3453 0.4872
SAD_8 I feel lonely when my smartphone does not ring or vibrate for several hours. 0.2434 0.6107 0.2656
SAD_9 Using my smartphone relieves me of my stress. 0.2739 0.7235 0.0762
SAD_10 I feel excited when I have a new message or notification. 0.1246 0.8194 0.1867
SAD_11 Using my smartphone makes me feel happy. 0.2836 0.7540 0.0271
SAD_12 I find it tough to focus whenever my smartphone is nearby. 0.1202 0.1339 0.7544
SAD_13 I become less attentive to my surroundings when I’m using my smartphone. 0.0900 0.0445 0.8414
Eigen Values 3.4009 2.6434 1.9516
Variance Explained (%) 26.1611 20.3339 15.0125

NA
NA

Look at a reliability analysis

Generalized Linear Model

  • A univariate generalized linear model was used to examine if smartphone reliance (i.e. SAD score) was a moderator of the relationship between the experimental manipulation and task performance.

Note: These analysis were completed regardless of the results for the location and power hypotheses.

---
title: "Reexamining the “Brain Drain” Effect: Does Ward et al. (2017) replicate?"
author: "A. C. Ruiz Pardo, J. P. Minda" 
date: "April 27, 2020"
output:
  html_notebook:
    toc: true
    toc_depth: '6'
    toc_float: true
    theme: lumen 
  html_document:
    df_print: paged
    toc: true
    toc_depth: '6'
    toc_float: true
    theme: lumen 
---
<style type="text/css">

body{ /* Normal  */
      font-size: 15px;}
td {  /* Table  */
  font-size: 14px;}
h1.title {
  font-size: 40px;}
h1 { /* Header 1 */
  font-size: 35px;
  color: DarkBlue;}
h2 { /* Header 2 */
  font-size: 33px;
  color: DarkBlue;}
h3 { /* Header 3 */
  font-size: 31px;
  color: DarkBlue;}
h4 { /* Header 4 */
  font-size: 29px;
  color: DarkBlue;}
h5 { /* Header 5 */
  font-size: 27px;
  color: DarkBlue;}
h6 { /* Header 6 */
  font-size: 25px;
  color: DarkBlue;}

</style>

<div style="text-align: justify"> 

# Introduction  

This document outlines the analyses from Study 1 in **insert article title here** (**insert authors**, 2020). The purpose of this study was to investigate Ward et al.'s (2017) second experiment by completing a direct replication. Therefore, the present study investigated how the mere prescence of one's smartphone effects cogntiion. This was pre-registered through OSF and can be found at the following link: <https://osf.io/5fq4r>.

The main goal of the current study was to see if  Ward et al.’s findings replicated (experiment two). Therefore, as in Ward et al., participants were randomly assigned to one of *six* possible conditions. These conditions were based on two independent variables: **smartphone location** and **smartphone power**. For smartphone location, a participant's smartphone was in one of three locations: (1) on the participant’s desk ("on desk"), (2) in their pocket/bag ("pocket/bag"), or (3) outside the testing room ("outside"). For smartphone power, a participant's smartphone was either (1) powered ON or (2) powered OFF, in their respective location. 
For all conditions, participants were instructed to keep their smartphones on silent (i.e., no virbations if any notifications are received during the study). Also, as per Ward et al., participants in the "on desk" location conditions were instructed to keep their devices facing up.

Each participant completed **two main tasks** (counter-balanced):

**1. An Automated Operation Span (OSpan) Task** (Unsworth et al., 2005), which required participants to retain letter strings (i.e. ranging from three to seven letters in length) in memory while solving some simple math problems (e.g. “(7/7) + 6 = ?”). This task was a behavioural measure of the attentional control component of working memory. 

- Scoring: 
    - Performance, measured with the OSpan absolute score, was shown by how many trials a participant correctly recalled all the letters in a given block (75 blocks in total). For example, a participant who recalled three letters (in a block with three letters), five letters (in a block with five letters), and two letters (in a block with six letters) would have an OSpan absolute score of eight for those blocks (i.e. 3 + 5 + 0 = 8). Therefore, OSpan absolute score showed performance where higher scores represented better performance.

**2. The Cue-Dependent Go/No-Go Task** (Bezdjia et al., 2009), which required participants to respond to "go" targets (i.e. green rectangle) while withholding any response to "no-go" targets (i.e. blue rectangle). This task included a cue component, where vertical targets (i.e., rectangles) were more likely to be "go" targets (i.e. 80% "go" and 20% "no-go") and horizontal targets were more likely to be "no-go" targets (i.e. 80% "no-go" and 20% "go"). Participants were not explicitly made aware of the cue component. This task was a behavioural measure of sustained attention.

- Scoring:
    - Performance was measured with mean omission errors and RT. It should be noted that mean errors can be divided into total error, commission error, and omission error. Commission errors occurred when a participant responded to a target stimulus. Omission errors occurred when a participant failed to respond to a non-target stimulus. Total errors were the sum of commission and omission errors. The present study focused on mean omission errors. Therefore, for each participant, higher mean omission errors represented lower performance. Additionally, higher mean RT also showed lower performance (i.e. indicative of greater interference).

Finally, after completing both tasks, all participants completed **three** survey measures:  

**1. The Demographic Questionnaire**, which was created for the present study and consisted of four items: age (i.e. in years), gender (i.e. male, female, other, or prefer not to say), program (e.g. psychology, engineering), and year of study (e.g. first, fourth). The purpose of these items was to give a brief description of the sample. 

**2. The Smartphone Usage Questionnaire (SUQ)**, which was created for the present study and consisted of modified items from Ward et al.’s survey measures (i.e. found in the “BRAIN DRAIN” WEB APPENDIX”). Some items were forced-choice and some were on a 7-point likert scale ranging from 1 (“Never”) to 7 (“Always”). There were 10 items in total and there were five types of items, which measured: (1) smartphone use frequency (three items; e.g. “On average, how many text messages do you send per day?”); (2) smartphone use without external stimulation (two items; e.g. “If I am waiting to meet a friend, I pass the time by using my smartphone.”), or during other activities (two items; e.g. “I use my smartphone while driving.”); (3) smartphone subjective value (one item; e.g. “How much money would it take for you to give up your phone for a full day?”); (4) smartphone notification type (one item; e.g. “Do you receive notifications (a sound or vibration) on your phone? Please indicate all that apply.”); and (5) phantom vibrations (one item; e.g. “Have you ever thought you heard your phone ring or thought you felt it vibrate, only to find out you were wrong?”). The purpose of the SUQ was to measure participants' typical smartphone use. 

**3. Smartphone Attachment and Dependency Inventory (SAD)**, which was from Ward et al.'s (2017) study. The SAD used 13-items (i.e. 7-point likert scale) that asked participants to rate (i.e., from 1 = *Strongly Disagree* to 7 = *Strongly Agree*) how much they agreed with statements regarding their emotional attachment and dependency on their smartphones. The purpose of the SAD was to measure each participant’s reliance on their smartphone.

- Scoring:
    - Smartphone reliance was calculated with a sum total for all items with a range of 13 to 91. Higher scores indicated a higher level of reliance with three levels: low reliance (13-39), moderate reliance (40-65), and high reliance (66+).

A total of **`r length(all_participants$all_participants)`** undergraduate students participated in this study. Of the total sample size, **`r length(all_remove$all_participants)`** participants were excluded due to either testing error (`r length(all_remove$all_participants[which(all_remove$reason_remove == "gng")])`; e.g. incomplete task data), or experimenter or external confounds (`r length(all_remove$all_participants[which(all_remove$reason_remove == "track")])`; e.g. interruption during testing, distracting noise during testing). Therefore, **`r length(all_data$participant)`** participants were included in the analyses for the location (on desk = `r length(which(all_data$location == 1))`; pocket/bag = `r length(which(all_data$location == 2))`; outside = `r length(which(all_data$location == 3))`) and power (ON = `r length(which(all_data$power == 1))`; OFF = `r length(which(all_data$power == 2))`) condition.

# Data Analysis Plan  
The present study completed a pre-registered data anallysis (pre-registered direct replication on Open Science Framework; OSF). Below is a summary of the data analysis plan (view full version <https://osf.io/d5eja> & <https://osf.io/ptc7k/>).  

## Hypotheses

1. The **location effect hypothesis** predicted that those who were closest in proximity to their smartphone (i.e. those with their smartphones on their desk) would show lower performance on the OSpan task.  

2. The **power effect hypothesis** predicted that smartphone power (i.e. either ON or OFF) would not affect performance on both cognitive tasks.  
  
3. The **moderation effect hypothesis** predicted that smartphone attachment and dependency (i.e. SAD score) would moderate the location effect: those who reported higher smartphone attachment and dependency would perform worse on the OSpan task.  

*Note: All hypotheses were based on the findings from Ward et al. (2017)*

## Inclusion / Exclusion Criteria

There were **two** exclusion critera in the present study:  

**1. OSpan Task** 

  - Only data from participants who scored 85% accuracy on the math component or above was used.
    - Criteria helped control for any participants who did not follow instructions and neglected the math component of the task.
  *Note: This was the same criteria as in Ward et al. (2017)*  
    
**2. Cue-Dependent Go/No-Go Task**  

  - Only data from those who scored higher than chance performance was used (e.g. reponded to at least 50% of "Go" trials and withheld response to at least 50% of "No-Go" trials).
    - Criteria helped control for any participants who did not complete the task as instructed (e.g. guessing).
  *Note: There was no inclusion/exclusion criteria for thsi task in Ward et al. (2017)*  

## Analysis

### Testing the Location and Power Hypotheses:  

**1. OSpan Task**  

- The OSpan absolute score was used to measure OSpan performance.
- A **3(smartphone location: desk, pocket/bag, outside) x 2(smartphone power: ON, OFF)** between-subjects ANOVA was conducted **(DV = OSpan absolute score)**.  
  - Post-hocs completed if there was a significant ANOVA ($\alpha = 0.05$) to determine which location(s) and/or power status were significantly different.
  
**2. Cue-Dependent Go/No-Go Task**  

- An error (i.e. omission errors) and accuracy (i.e. RT) analysis was completed using the average omission errors and RT for each participant.  

  **(a) Error Analysis**  
    - A **3(Smartphone location: desk, pocket/bag, outside) x 2(Smartphone power: ON, OFF) x 2(Cue type: Go, No-Go) mixed ANOVA** was completed **(DV = average omission error(s))**.  
      - The between-subjects factors:smartphone location and power  
      - The within-subjects factor: pre-target cue type  
      - Post-hocs completed if there was a significant ANOVA ($\alpha = 0.05$) to determine which location(s), power status, and/or cue type(s) were significantly different.  
    
  **(b) Accuracy Analysis**  
    - A **3(smartphone location: desk, pocket/bag, outside) x 2(smartphone power: ON, OFF)** between-subjects ANOVA was conducted **(DV = average RT)**.  
      - The between-subjects factors:smartphone location and power  
      - The within-subjects factor: pre-target cue type  
      - Post-hocs completed if there was a significant ANOVA ($\alpha = 0.05$) to determine which location(s) and/or power status were significantly different.  

### Testing the Moderation Hypothesis:  

**3. SAD **  

**(a) Factor Analysis**  

  - A principal components factor analysis with a Varimax rotation was completed for participant's SAD score.
    - This was done to confirm the two main factors found by Ward et al. (2017): smartphone dependence and emotional attachment.  

**(b) Generalized Linear Model**  

  - A univariate generalized linear model was used to examine if smartphone reliance (i.e. SAD score) was a moderator of the relationship between the experimental manipulation and task performance. 

*Note: These analysis were completed regardless of the results for the location and power hypotheses.*  

# Analysis Prep  
## Load Required Libraries

The following packages were used to complete the analyses, visualizations, etc. in this document. 

```{r message=FALSE, warning=FALSE}
# Code has been commented out to reduce computation time, uncomment any to run the code

# library(devtools)
# install_github("vqv/ggbiplot")
library(ggbiplot)
library(welchADF)
library(schoRsch)
library(plyr)
library(dplyr)
library(lattice)
library(car)
library(knitr)
library(ggplot2)
library(ggpubr)
library(kableExtra)
library(magrittr)
library(ez)
library(Hmisc)
library(rstatix)
library(purrr)
library(Formula)
library(gridExtra)
library(psych)
library(survival)
library(tidyverse)
library(DescTools)
library(GPArotation)
library(apaTables)


```

## Additional Functions

### Rounding p-values

This rounding function was adapted from [Emily Nielsen's Rpubs](https://rpubs.com/egnielsen). The function ("**p_round(x)**") was created to assess and print p-values. If $p \ge .005$, the function will display “$p = $” and the value rounded to two decimal places. If $ .0005 \le p <.005$, the function will display “$p = $” and the value rounded to three decimal places. If $p < .0005$, the function will display “$p < .001$”.

```{r}

p_round <- function(x){
  if(x > .005)
    {x1 = (paste("= ", gsub("0\\.","\\.", round(x, digits = 2)), sep = ''))
  }  
  else if(x == .005){x1 = (paste("=", gsub("0\\.","\\.", 0.01)))
  }
  else if(x > .0005 & x < .005)
    {x1 = (paste("= ", gsub("0\\.","\\.", round(x, digits = 3)), sep = ''))
  }  
  else if(x == .0005){x1 = (paste("=", gsub("0\\.","\\.", 0.001)))
  }
  else{x1 = (paste("<", gsub("0\\.","\\.", 0.001)))
  } 
  (x1)
}

```

## Raw Data  
There are **five** raw data files that were imported for the present study (see the *Import Raw Data* section).  

**Task Data**  

1. OSpan Raw Data  
    - spreadsheet, collected manually from each participant
2. Go/No-Go Raw Data  
    - psychopy excel files 
    - *Note:* raw data was extracted from folder into one file (see the *Import Raw Data* section)  

**Survey Measures**  

3. Demographic & SUQ Raw Data  
    - csv files exported from Qualtrics
4. SAD Raw Data  
    - csv files exported from Qualtrics
5. Testing Tracking Sheet   
    - csv files, collected during testing  

### Import Raw Data  
#### 1. OSpan Raw Data  
<details>

<summary> The **OSpan Raw Data** was collected using the Inquisit 5 software (**LINK**), which used the same automated OSpan task as in Ward et al. (2017). The raw data from the OSpan was imported as a dataframe, "**ospan_raw**", and included the following: *(click to see description)* </summary>  
 
- **Participant number [*ospan_participant*]**  
- **OSpan ID [*ospan_ID*]**  
  - A 9-digit ID given by the Iquisit 5 software
- **OSpan Absolute Score [*ospan_absolute_score*]**  
  - Performance measured by how many trials a participant correctly recalled all the letters in a given block (75 blocks in total). For example, a participant who recalled three letters (in a block with three letters), five letters (in a block with five letters), and two letters (in a block with six letters) would have an OSpan absolute score of eight for those blocks (i.e. 3 + 5 + 0 = 8).  
- **Total Letters Recalled [*ospan_total_corr*]**  
  - Performance measured by how many letters were recalled in total, regardless of whether all letters were correct within a given block.    
- **Total Math Errors [*ospan_math_err_tot*]**  
  - the sum of all math errors (i.e. speed and accuracy)   
- **Speed Errors [*ospan_math_err_speed*]**  
  - the sum of the speed related math errors  
    - where a participant exceeded their time limit to respond to the math component of the task  
- **Accuracy Errors [*ospan_math_err_accuracy*]**  
  - the sum of the accuracy related math errors  
    - where a participant answered incorrectly on the math component of the task  
- **Math Percentage [*ospan_math_percent_corr*]**  
  - the percent of all math trials that were answered correctly  
    - Participants were asked to keep their math percentage at 85% or above to ensure participants did not ignore the math component of the task to improve on the letter recall component
- **Math Percentage Criteria [*ospan_math_criteria_85*]**  
  - whether a participant met the math criteria (i.e. 85% or above; yes or no)  
  <details>  
  <summary> coding information *(click to expand)* </summary>  
    - 1 = yes
    - 0 = no * *including any participant that did not complete the OSpan task*  

</details>  
</details> 

<details>  
<summary> This code imported the OSpan raw data. *(click to expand)* </summary>  

```{r}
ospan_raw = utils::read.csv("raw_ospan.csv", header = TRUE)

# reorganize rows so that the data is ordered by the participant number (e.g. 1, 2, 3, 4...)
ospan_raw = dplyr::arrange(ospan_raw, ospan_participant)
```

</details>  


#### 2. Go/No-Go Raw Data  
<details>  

<summary> The **Go/No-Go Raw Data** was collected using Psychopy (version: **XXXX**). Each participant's data was stored in a separate folder (default from psychopy) as an excel file. The following code re-organized the raw data into a single wide format of the data as a dataframe, "**gng_data_extracted**", and two data frames that separate the raw data by cue type (i.e."vertical" or "horizontal"), "**gng_data_extracted_v**" and "**gng_data_extracted_h**". *(click to see descriptions)* </summary>  

For the **gng_data_extracted** data frame:  

- **Participant number [*gng_participant*]**  
- **Age [*age*]**  
- **First Language [*lang*]**  
  - self-reported whether their fist language was English (i.e. yes or no)  
  <details>  
  <summary> coding information *(click to expand)* </summary>  

    |       code |  1  |  2 |
    |-----------:|:---:|:--:|
    | *response* | Yes | No |

- **Total trials completed [*gng_trial_tot*]**  
- **GnG Score [*gng_score*]**  
  - the sum of the correct answers  
- **Total Responses [*gng_resp_tot*]**
  - the sum of the responses for all trials
- **Total No Responses [*gng_no_resp_tot*]**  
  - the sum of the trials without a response  
- **Total Errors [*gng_err_tot*]**  
  - the sum of all errors for all trials  
- **Commission Errors [*gng_err_com*]**  
  - the sum of all comission errors, defined as:  
    - where a reponse was made during a "no-go" trial (i.e. committing an erorr)  
- **Omission Errors [*gng_err_omi*]**  
  - the sum of all omission errors, defined as:  
    - where a reponse was NOT made during a "go" trial (i.e. failing to respond)  
- **Average Reaction Time (RT) [*gng_rt_av*]**  

For the **gng_data_extracted_v** data frame:  

- **Participant number [*gng_participant*]**  
- **Age [*age_long*]**  
- **First Language [*lang_long*]**  
  - self-reported whether their fist language was English (i.e. yes or no)  
  <details>  
  <summary> coding information *(click to expand)* </summary>  

    |       code |  1  |  2 |
    |-----------:|:---:|:--:|
    | *response* | Yes | No |

- **Total trials completed [*trial_tot_v*]**  
- **GnG Score [*score_v*]**  
  - the sum of the correct answers  
- **Total Responses [*resp_tot_v*]**
  - the sum of the responses for all trials
- **Total No Responses [*no_resp_tot_v*]**  
  - the sum of the trials without a response  
- **Total Errors [*err_tot_v*]**  
  - the sum of all errors for all trials  
- **Commission Errors [*err_com_v*]**  
  - the sum of all comission errors, defined as:  
    - where a reponse was made during a "no-go" trial (i.e. committing an erorr)  
- **Omission Errors [*err_omi_v*]**  
  - the sum of all omission errors, defined as:  
    - where a reponse was NOT made during a "go" trial (i.e. failing to respond)  
- **Average Reaction Time (RT) [*rt_av_v*]**  
- **Response Exclusion Criteria [*exclude_v*]**  
  - This depicts whether a participant should be excluded from analysis based on the response exclusion criteria
    - To be included, participants should score higher than chance performance (i.e. responded to at least 50% of “Go” trials and withheld response to at least 50% of “No-Go” trials)
    <details>  
    <summary> coding information *(click to expand)* </summary>  

      |       code |                1               |               2               |
      |-----------:|:------------------------------:|:-----------------------------:|
      | *response* | Yes (participant **excluded**) | No (participant **included**) |  

For the **gng_data_extracted_h** data frame:  

- **Participant number [*gng_participant*]**  
- **Age [*age_long*]**  
- **First Language [*lang_long*]**  
  - self-reported whether their fist language was English (i.e. yes or no)  
  <details>  
  <summary> coding information *(click to expand)* </summary>  

    |       code |  1  |  2 |
    |-----------:|:---:|:--:|
    | *response* | Yes | No |  

- **Total trials completed [*trial_tot_h*]**  
- **GnG Score [*score_h*]**  
  - the sum of the correct answers  
- **Total Responses [*resp_tot_h*]**
  - the sum of the responses for all trials
- **Total No Responses [*no_resp_tot_h*]**  
  - the sum of the trials without a response  
- **Total Errors [*err_tot_h*]**  
  - the sum of all errors for all trials  
- **Commission Errors [*err_com_h*]**  
  - the sum of all comission errors, defined as:  
    - where a reponse was made during a "no-go" trial (i.e. committing an erorr)  
- **Omission Errors [*err_omi_h*]**  
  - the sum of all omission errors, defined as:  
    - where a reponse was NOT made during a "go" trial (i.e. failing to respond)  
- **Average Reaction Time (RT) [*rt_av_h*]**  
- **Response Exclusion Criteria [*exclude_h*]**  
  - This depicts whether a participant should be excluded from analysis based on the response exclusion criteria
    - To be included, participants should score higher than chance performance (i.e. responded to at least 50% of “Go” trials and withheld response to at least 50% of “No-Go” trials)
    <details>  
    <summary> coding information *(click to expand)* </summary>  

      |       code |                1               |               2               |
      |-----------:|:------------------------------:|:-----------------------------:|
      | *response* | Yes (participant **excluded**) | No (participant **included**) |  

</details> 
</details> 

<details>
<summary> This code imported the Go/No-Go raw data without separating the data by cue type. *(click to expand)* </summary>

```{r gng_organize, warning=FALSE}

#---WIDE DATA FRAME---"gng_data_extracted"  

#--This code completes the following tasks:
# (1) Create a function that extracts any required information from all psychopy folders at once
# (2) Create a list of all the directory paths to each individual file 
# (3) Apply the function to the files 
# (4) Create a data frame of the result of the function on the files

#--The FINAL OUTPUT = a wide format of the raw GnG data including:
# participant, age, lang, trial_tot, score, resp_tot, no_resp_tot, err_tot, err_com, err_omi, rt_av

#--NOTES:

# Step (3) will give warnings if there is any missing data that the function was not specified how to handle
# example: Any participant that did not respond to an item or during all trials will have an "NA" response in the data
# example: Any participant who reported their age or first language in an irregular manner (e.g. "ninteen" rather than "19")
# Either example will result in a warning, since the function would have to force that output to be "NA" as well

# Individual participant files do not stay in chronological order 
# (they are re-ordered so that all ps that start with the digit "1" go first, then the "2"...)

#--(1) Create a function that extracts any required information from all psychopy folders at once

# create a function that extracts all the required information from each excel file
# details for what is extracted in this function is found within it, 
# to use it to extract any other &/ additional information, edit within the function itself
extract_gng = function(data) {
  temp_data = readxl::read_excel(data)
  
  # Participant number (participant)
  participant = as.numeric(temp_data[which(temp_data == "participant"),2])
  
  # Age (age)
  age = as.integer(temp_data[which(temp_data == "Age"),2])
  
  # Gender (gender)
  # will be populated from survey measures
  
  # First Language (lang)
  # this finds if the language reported was English: if yes = 1; no = 0
  lang = ifelse(stringr::str_detect(as.character(temp_data[which(temp_data == "First Language"),2]), stringr::fixed("english", ignore_case=TRUE)), 1, 0)
  
  # Location Condition (location; either: (1) desk, (2) pocket/bag, or (3) outside)
  # will be populated from survey measures
  
  # Power Condition (power; either: (1) ON,or (2) OFF)
  # will be populated from survey measures
  
  # Total trials completed (trial_tot)
  # count the "order" column, which shows the order in which the trials were showed (random for each participant)
  # note: must ignore the NA values
  trial_tot = sum((is.na(temp_data$cue) == "FALSE" & is.na(temp_data$order) == "FALSE"), na.rm = TRUE) #alternative: sum(complete.cases(temp_data$order))
  
  # GnG Score (score)
  # sum of the correct answers from the "key_resp_slash.corr_raw" column
  # note: must ignore the NA values
  score = sum(temp_data$key_resp_slash.corr_raw, na.rm = TRUE)
  
  # Total Responses (resp_tot)
  # sum of the responses from the "key_resp_slash.keys_raw" column
  # note: must ignore the NA values
  resp_tot = sum((is.na(temp_data$cue) == "FALSE" & is.na(temp_data$key_resp_slash.keys_raw) == "FALSE"), na.rm = TRUE) #alternative: sum(complete.cases(temp_data$key_resp_slash.keys_raw))
  
  # Total No Responses (no_resp_tot)
  # subtract total responses made from completed trials 
  # note: must ignore the NA values
  no_resp_tot = (trial_tot - resp_tot)
  
  # Total Errors (err_tot)
  # sum of the incorrect answers from the "key_resp_slash.corr_raw"
  err_tot = sum((temp_data$key_resp_slash.corr_raw == "0") == "TRUE", na.rm = TRUE) # alternative: sum(!temp_data$key_resp_slash.corr_raw, na.rm = TRUE)
  
  # Commission Errors (err_com)
  # sum any cases where a reponse was made & the answer was incorrect (i.e. committing an erorr)
  err_com = sum((temp_data$key_resp_slash.corr_raw == "0" & !is.na(temp_data$key_resp_slash.keys_raw)) == "TRUE", na.rm = TRUE)
  
  # Omission Errors (err_omi)
  # sum any cases where a reponse was NOT made & the answer was incorrect (i.e. failing to respond)
  err_omi = sum((temp_data$key_resp_slash.corr_raw == "0" & is.na(temp_data$key_resp_slash.keys_raw)) == "TRUE", na.rm = TRUE)
  
  # Average Reaction Time (RT) (rt_av)
  # calculate the average / mean RT for all responses
  # Note: the "no-go" trials required no response for a correct trial (for 1,000ms)
  rt_av = mean(temp_data$key_resp_slash.rt_raw, na.rm = TRUE)
  
  temp_dataframe = as.data.frame(c(participant, age, lang, trial_tot, 
                                   score, resp_tot, no_resp_tot, 
                                   err_tot, err_com, err_omi, rt_av))
}

#--(2) Create a list of all the directory paths to each individual file 

# Find the location of each excel (.xlsx) files
# the folders containing the files should be in 1 folder (e.g. "" in this case)
# this returns a list of each file's location
# Note: the pattern is set to ignore any hidden / temporary files by adding the start of each file name (e.g. "^NoGo")
files_psychopy = dir("/Users/anaruizpardo/RStudio/Smartphone_study/Study 1- Smartphone Replication/Dissertation (04:20:20)/Raw Data Organizing/gng_raw_folders", 
                     recursive = TRUE, full.names=TRUE, pattern="^NoGo(.*)xlsx$")

#--(3) Apply the function to the files 

# Apply the function to all excel files
result = sapply(files_psychopy, extract_gng)

#--(4) Create a data frame of the result of the function on the files

# take the lists of the extracted data and turn it into a data frame
gng_data_extracted = data.frame(Reduce(rbind, result)) # alternative method: x = data.frame(t(sapply(result,c)))
# rename the columns
colnames(gng_data_extracted) = c("gng_participant", "age", "lang", "gng_trial_tot", "gng_score", 
                                 "gng_resp_tot", "gng_no_resp_tot", "gng_err_tot", "gng_err_com", 
                                 "gng_err_omi", "gng_rt_av") 
# remove row names to avoid confusion
rownames(gng_data_extracted) = NULL

# reorganize rows so that the data is ordered by the participant number (e.g. 1, 2, 3, 4...)
gng_data_extracted = dplyr::arrange(gng_data_extracted, gng_participant)

```

</details>  

<details>
<summary> This code imported the Go/No-Go raw data and separated the data by cue type (i.e. "vertical" or "horizontal"). *(click to expand)* </summary>

```{r gng_organize_target, warning=FALSE}

#--- for the LONG DATA FRAME - CUE TYPE (vertical vs. horizontal) --- "gng_data_extracted_v" & "gng_data_extracted_h" 
  
#-- This code differs from the above code in that it organizes the gng data based the CUE type (i.e. either a "vertical" (v) or "horizontal" (h)).
  # This means that the data is in a long format to test the within-subjects factor of cue type.

#--This code completes the following tasks:
# (1) Create a function that extracts any required information from all psychopy folders at once
# (2) Create a list of all the directory paths to each individual file 
# (3) Apply the function to the files 
# (4) Create a data frame of the result of the function on the files
# (5) Re-structure the data into 1 data frame in a long format for a repeated measures analysis

#--The FINAL OUTPUT = a wide format of the raw GnG data including:
# participant, age, lang, trial_tot, score, resp_tot, no_resp_tot, err_tot, err_com, err_omi, rt_av

#--NOTES:

# Step (3) will give warnings if there is any missing data that the function was not specified how to handle
# example: Any participant that did not respond to an item or during all trials will have an "NA" response in the data
# example: Any participant who reported their age or first language in an irregular manner (e.g. "ninteen" rather than "19")
# Either example will result in a warning, since the function would have to force that output to be "NA" as well

# Individual participant files do not stay in chronological order 
# (they are re-ordered so that all ps that start with the digit "1" go first, then the "2"...)

#--(1) Create a function that extracts any required information from all psychopy folders at once...
# for the "vertical" (v) (extract_gng_v) and "horizontal" (h) (extract_gng_h) trials

# create a function that extracts all the required information from each excel file
# details for what is extracted in this function is found within it, 
# to use it to extract any other &/ additional information, edit within the function itself
# NOTE: this function extracts 2 versions of each value: one for the "vertical" (v) trials, and one for the "horizontal" (h) trials

extract_gng_v = function(data) {
  temp_data = readxl::read_excel(data)
  
  # Participant number (participant_long)
  participant_long = as.numeric(temp_data[which(temp_data == "participant"),2])
  # only one is extracted since this will not change b/w CUE type
  
  # Age (age_long)
  age_long = as.integer(temp_data[which(temp_data == "Age"),2])
  # only one is extracted since this will not change b/w CUE type
  
  # Gender (gender)
  # will be populated from survey measures
  # only one is extracted since this will not change b/w CUE type
  
  # First Language (lang_long)
  # this finds if the language reported was English: if yes = 1; no = 0
  lang_long = ifelse(stringr::str_detect(as.character(temp_data[which(temp_data == "First Language"),2]), stringr::fixed("english", ignore_case=TRUE)), 1, 0)
  # only one is extracted since this will not change b/w CUE type
  
  # Location Condition (location; either: (1) desk, (2) pocket/bag, or (3) outside)
  # will be populated from survey measures
  # only one is extracted since this will not change b/w CUE type
  
  # Power Condition (power; either: (1) ON,or (2) OFF)
  # will be populated from survey measures
  # only one is extracted since this will not change b/w CUE type
  
  # Total trials completed for the "vertical" (v) (trial_tot_v)
  # count the "order" column, which shows the order in which the trials were showed (random for each participant)
  # note: must ignore the NA values
  trial_tot_v = sum((temp_data$cue == "v_b.png" & is.na(temp_data$order) == "FALSE"), na.rm = TRUE)
  
  # GnG Score for the "vertical" (v) (score_v)  
  # sum of the correct answers from the "key_resp_slash.corr_raw" column
  # note: must ignore the NA values
  score_v = sum(temp_data$cue == "v_b.png" & temp_data$key_resp_slash.corr_raw, na.rm = TRUE)  
  
  # Total Responses for the "vertical" (v) (resp_tot_v)
  # sum of the responses from the "key_resp_slash.keys_raw" column
  # note: must ignore the NA values
  resp_tot_v = sum((temp_data$cue == "v_b.png" & temp_data$key_resp_slash.keys_raw == "'slash'"), na.rm = TRUE)  
  
  # Total No Responses for the "vertical" (v) (no_resp_tot_v)  
  # subtract total responses made from completed trials 
  # note: must ignore the NA values
  no_resp_tot_v = (trial_tot_v - resp_tot_v)  
  
  # Total Errors for the "vertical" (v) (err_tot_v) 
  # sum of the incorrect answers from the "key_resp_slash.corr_raw"
  err_tot_v = sum((temp_data$cue == "v_b.png" & temp_data$key_resp_slash.corr_raw == "0"), na.rm = TRUE)  
  
  # Commission Errors for the "vertical" (v) (err_com_v) 
  # sum any cases where a reponse was made & the answer was incorrect (i.e. committing an error)
  err_com_v = sum(temp_data$cue == "v_b.png" & temp_data$key_resp_slash.corr_raw == "0" & !is.na(temp_data$key_resp_slash.keys_raw), na.rm = TRUE)  
  
  # Omission Errors for the "vertical" (v) (err_omi_v)  
  # sum any cases where a reponse was NOT made & the answer was incorrect (i.e. failing to respond)
  err_omi_v = sum(temp_data$cue == "v_b.png" & temp_data$key_resp_slash.corr_raw == "0" & is.na(temp_data$key_resp_slash.keys_raw), na.rm = TRUE)  

  # Average Reaction Time (RT) for the "vertical" (v) (rt_av_v)  
  # calculate the average / mean RT for all responses
  # Note: the "no-go" trials required no response for a correct trial (for 1,000ms)
  rt_av_v = mean(temp_data$key_resp_slash.rt_raw[which(temp_data$cue == "v_b.png")], na.rm = TRUE)  
  
  # Exclusion Criteria for the "vertical" (v) (exclude_v)
  # calculate whether participants scored higher than chance performance 
  # (i.e. responded to at least 50% of “Go” trials and withheld response to at least 50% of “No-Go” trials)
  # code: exclude = 1; include (do not exclude) = 2
  # NOTE: this calculation is meant to exclude the participant as a whole, therefore, 
  # it does not calculate it for the cue type specifically
    # first, get the total trials and response totals for the "go" and "no-go" trials
      # "go"
      trial_tot_go = sum((temp_data$corrAns == "slash" & is.na(temp_data$order) == "FALSE"), na.rm = TRUE)
      resp_tot_go = sum((temp_data$corrAns == "slash" & temp_data$key_resp_slash.keys_raw == "'slash'"), na.rm = TRUE)
      # "no-go"
      trial_tot_nogo = sum((temp_data$corrAns == "none" & is.na(temp_data$order) == "FALSE"), na.rm = TRUE)
      resp_tot_nogo = sum((temp_data$corrAns == "none" & temp_data$key_resp_slash.keys_raw == "'slash'"), na.rm = TRUE)
    # next, find the minimum response needed based on the 50% criteria
    min_resp = trial_tot_go * .50
    # find if the participant meets exclusion criteria for the "go" and "no-go" trials
    meets_min_go = resp_tot_go > (min_resp)
    meets_min_nogo = resp_tot_nogo < (min_resp)
    # find if the participant meets exclusion criteria (exclude = 1; include = 2)
    exclude_v = ifelse((meets_min_go == "TRUE" & meets_min_nogo == "TRUE") == "FALSE", 1, 2)
  
  temp_dataframe_v = as.data.frame(c(participant_long, age_long, lang_long, trial_tot_v, score_v, resp_tot_v, 
                                        no_resp_tot_v, err_tot_v, err_com_v, err_omi_v, rt_av_v, exclude_v))
}  

extract_gng_h = function(data) {
  temp_data = readxl::read_excel(data)
  
  # Participant number (participant_long)
  participant_long = as.numeric(temp_data[which(temp_data == "participant"),2])
  # only one is extracted since this will not change b/w CUE type
  
  # Age (age_long)
  age_long = as.integer(temp_data[which(temp_data == "Age"),2])
  # only one is extracted since this will not change b/w CUE type
  
  # Gender (gender)
  # will be populated from survey measures
  # only one is extracted since this will not change b/w CUE type
  
  # First Language (lang_long)
  # this finds if the language reported was English: if yes = 1; no = 0
  lang_long = ifelse(stringr::str_detect(as.character(temp_data[which(temp_data == "First Language"),2]), stringr::fixed("english", ignore_case=TRUE)), 1, 0)
  # only one is extracted since this will not change b/w CUE type
  
  # Location Condition (location; either: (1) desk, (2) pocket/bag, or (3) outside)
  # will be populated from survey measures
  # only one is extracted since this will not change b/w CUE type
  
  # Power Condition (power; either: (1) ON,or (2) OFF)
  # will be populated from survey measures
  # only one is extracted since this will not change b/w CUE type
  
  # Total trials completed for the "horizontal" (h) (trial_tot_h)
  # count the "order" column, which shows the order in which the trials were showed (random for each participant)
  # note: must ignore the NA values
  trial_tot_h = sum((temp_data$cue == "h_b.png" & is.na(temp_data$order) == "FALSE"), na.rm = TRUE)
  
  # GnG Score for the "horizontal" (h) (score_h)  
  # sum of the correct answers from the "key_resp_slash.corr_raw" column
  # note: must ignore the NA values
  score_h = sum(temp_data$cue == "h_b.png" & temp_data$key_resp_slash.corr_raw, na.rm = TRUE)  
  
  # Total Responses for the "horizontal" (h) (resp_tot_h)
  # sum of the responses from the "key_resp_slash.keys_raw" column
  # note: must ignore the NA values
  resp_tot_h = sum((temp_data$cue == "h_b.png" & temp_data$key_resp_slash.keys_raw == "'slash'"), na.rm = TRUE)  
  
  # Total No Responses for the "horizontal" (h) (no_resp_tot_h)  
  # subtract total responses made from completed trials 
  # note: must ignore the NA values
  no_resp_tot_h = (trial_tot_h - resp_tot_h)  
  
  # Total Errors for the "horizontal" (h) (err_tot_h) 
  # sum of the incorrect answers from the "key_resp_slash.corr_raw"
  err_tot_h = sum((temp_data$cue == "h_b.png" & temp_data$key_resp_slash.corr_raw == "0"), na.rm = TRUE)  
  
  # Commission Errors for the "horizontal" (h) (err_com_h) 
  # sum any cases where a reponse was made & the answer was incorrect (i.e. committing an error)
  err_com_h = sum(temp_data$cue == "h_b.png" & temp_data$key_resp_slash.corr_raw == "0" & !is.na(temp_data$key_resp_slash.keys_raw), na.rm = TRUE)  
  
  # Omission Errors for the "horizontal" (h) (err_omi_h)  
  # sum any cases where a reponse was NOT made & the answer was incorrect (i.e. failing to respond)
  err_omi_h = sum(temp_data$cue == "h_b.png" & temp_data$key_resp_slash.corr_raw == "0" & is.na(temp_data$key_resp_slash.keys_raw), na.rm = TRUE)  
  
  # Average Reaction Time (RT) for the "horizontal" (h) (rt_av_h)  
  # calculate the average / mean RT for all responses
  # Note: the "no-go" trials required no response for a correct trial (for 1,000ms)
  rt_av_h = mean(temp_data$key_resp_slash.rt_raw[which(temp_data$cue == "h_b.png")], na.rm = TRUE)  
  
  # Exclusion Criteria for the "horizontal" (h) (exclude_h)
  # calculate whether participants scored higher than chance performance 
  # (i.e. responded to at least 50% of “Go” trials and withheld response to at least 50% of “No-Go” trials)
  # code: exclude = 1; include (do not exclude) = 2
  # NOTE: this calculation is meant to exclude the participant as a whole, therefore, 
  # it does not calculate it for the cue type specifically
  # first, get the total trials and response totals for the "go" and "no-go" trials
  # "go"
  trial_tot_go = sum((temp_data$corrAns == "slash" & is.na(temp_data$order) == "FALSE"), na.rm = TRUE)
  resp_tot_go = sum((temp_data$corrAns == "slash" & temp_data$key_resp_slash.keys_raw == "'slash'"), na.rm = TRUE)
  # "no-go"
  trial_tot_nogo = sum((temp_data$corrAns == "none" & is.na(temp_data$order) == "FALSE"), na.rm = TRUE)
  resp_tot_nogo = sum((temp_data$corrAns == "none" & temp_data$key_resp_slash.keys_raw == "'slash'"), na.rm = TRUE)
  # next, find the minimum response needed based on the 50% criteria
  min_resp = trial_tot_go * .50
  # find if the participant meets exclusion criteria for the "go" and "no-go" trials
  meets_min_go = resp_tot_go > (min_resp)
  meets_min_nogo = resp_tot_nogo < (min_resp)
  # find if the participant meets exclusion criteria (exclude = 1; include = 2)
  exclude_h = ifelse((meets_min_go == "TRUE" & meets_min_nogo == "TRUE") == "FALSE", 1, 2)
  
  temp_dataframe_h = as.data.frame(c(participant_long, age_long, lang_long, trial_tot_h, score_h, resp_tot_h, 
                                     no_resp_tot_h, err_tot_h, err_com_h, err_omi_h, rt_av_h, exclude_h))
}  

#--(2) Create a list of all the directory paths to each individual file 

# Find the location of each excel (.xlsx) files
# the folders containing the files should be in 1 folder (e.g. "" in this case)
# this returns a list of each file's location
# Note: the pattern is set to ignore any hidden / temporary files by adding the start of each file name (e.g. "^NoGo")
# THIS IS THE SAME AS IN THE ABOVE CODE, SO IT IS COMMENTED OUT TO AVOID EXTRA COMPUTING POWER

# files_psychopy = dir("/Users/anaruizpardo/RStudio/Smartphone_study/Study 1- Smartphone Replication/Dissertation (04:20:20)/Raw Data Organizing/gng_raw_folders", 
#                     recursive = TRUE, full.names=TRUE, pattern="^NoGo(.*)xlsx$")

#--(3) Apply the function to the files 

# Apply the function to all excel files for the "vertical" (v) and "horizontal" (h) trials  
result_v = sapply(files_psychopy, extract_gng_v)
result_h = sapply(files_psychopy, extract_gng_h)

#--(4) Create a data frame of the result of the function on the files...
# for the "vertical" (gng_data_extracted_v) and "horizontal" (gng_data_extracted_h) trials  

# take the lists of the extracted data and turn it into a data frame for the "vertical" (h) trials
gng_data_extracted_v = data.frame(Reduce(rbind, result_v)) 
# rename the columns
colnames(gng_data_extracted_v) = c("participant_long", "age_long", "lang_long", "trial_tot_v", "score_v", 
                                         "resp_tot_v", "no_resp_tot_v", "err_tot_v", "err_com_v", 
                                         "err_omi_v", "rt_av_v", "exclude_v")

# take the lists of the extracted data and turn it into a data frame for the "horizontal" (h) trials
gng_data_extracted_h = data.frame(Reduce(rbind, result_h)) 
# rename the columns
colnames(gng_data_extracted_h) = c("participant_long", "age_long", "lang_long", "trial_tot_h", "score_h", 
                                           "resp_tot_h", "no_resp_tot_h", "err_tot_h", "err_com_h", 
                                           "err_omi_h", "rt_av_h", "exclude_h")
# remove row names to avoid confusion
rownames(gng_data_extracted_v) = NULL
rownames(gng_data_extracted_h) = NULL

# reorganize rows so that the data is ordered by the participant number (e.g. 1, 2, 3, 4...)
gng_data_extracted_v = dplyr::arrange(gng_data_extracted_v, participant_long)
gng_data_extracted_h = dplyr::arrange(gng_data_extracted_h, participant_long)

```

</details>  

#### 3. & 4. Demographic, SUQ, and SAD Raw Data  

The **Demographic & SUQ Raw Data and SAD Raw Data** was collected using Qualtrics the Inquisit 5 software (**LINK**), which was the same as Ward et al. (2017). The raw data from the OSpan was imported as a dataframe, "**ospan_raw**", and included the following: *(click to see description)*    

<details>  
<summary> <span style="color:DarkBlue; font-size:18px;">General, Manipulation, and Demographic Information</span> </summary>  
  
<details>  
<summary> **General Information:** *(click to expand)* </summary>  

- **Start Date [*date_start*]**  
  - The date and time when a participant began their survey on Qualtrics (Y/M/D H:M)  
- **End Date [*date_end*]**  
  - The date and time when a participant finished their survey on Qualtrics (Y/M/D H:M)  
- **Duration (in seconds) [*duration(s)*]**  
  - The duration of the survey for the participant in seconds
- **Recorded Date [*date_rec*]**  
  - The date and time when a participant's data was recorded on Qualtrics (Y/M/D H:M)  
- **Participant number [*sur_participant*]**  
</details>

<details>  
<summary> **Manipulation Condition:** *(click to expand)* </summary>  

- **Smartphone Location [*sur_location*]**  
  - The smartphone location assigned to the participant (i.e. desk, pocket/bag, or outside)  
  <details>  
  <summary> coding information *(click to expand)* </summary>  

    |       code |   1  |      2     |    3    |
    |-----------:|:----:|:----------:|:-------:|
    | *response* | Desk | Pocket/Bag | Outside |

- **Smartphone Power [*sur_power*]**  
  - The smartphone power assigned to the participant (i.e. ON or OFF)  
  <details>  
  <summary> coding information *(click to expand)* </summary>  

    |       code |  1 |  2  |
    |-----------:|:--:|:---:|
    | *response* | On | Off |

- **Smartphone Condition [*sur_condition*]**  
  - The smartphone condition (i.e. combination of location & power) assigned to the participant  
  <details>  
  <summary> coding information *(click to expand)* </summary>  
    
    |       code |     1     |        2        |       3      |      4     |         5        |       6       |
    |-----------:|:---------:|:---------------:|:------------:|:----------:|:----------------:|:-------------:|
    | *response* | ON & Desk | ON & Pocket/Bag | ON & Outside | OFF & Desk | OFF & Pocket/Bag | OFF & Outside |

</details>  


<details>  
<summary> **Demographics:** *(click to expand)* </summary>  

- **Gender [*gender*]**  
  - Participant's self-reported gender (i.e., male, female, other, or prefer not to say)  
  <details>  
  <summary> coding information *(click to expand)* </summary>  

    |       code |   1  |    2   |   3   |         4         |
    |-----------:|:----:|:------:|:-----:|:-----------------:|
    | *response* | Male | Female | Other | Prefer not to say |

- **Year [*year*]**  
  - Participant's self-reported year of study in their respective program (e.g. 1, 2, 3, 4)  
- **Program [*program*]**  
  - Participant's self-reported program of study (e.g., biology, BMOS, psychology)  
    <details>  
    <summary> coding information *(click to expand)* </summary>  
  
      |                        |                         |                            |                     |
      |------------------------|-------------------------|----------------------------|---------------------|
      | 1 = actuarial          | 11 = economics          | 21 = management            | 31 = psychology     |
      | 2 = anthropology       | science                 | 22 = mathematics           | 32 = SASAH          |
      | 3 = arts               | 13 = english            | 23 = medical science       | 33 = science        |
      | 4 = arts & humanities  | 14 = film               | 24 = MIT                   | 34 = social science |
      | 5 = biology            | 15 = FIMS               | 25 = MOS                   | 35 = sociology      |
      | 6 = BMOS               | 16 = geography          | 26 = music                 | 36 = software       |
      | 7 = chemistry          | 17 = health science     | 27 = nutrition & dietetics | 37 = visual arts    |
      | 8 = computer science   | 18 = integrated science | 28 = philosophy            |                     |
      | 9 = consumer behaviour | 19 = kinesiology        | 29 = physics               |                     |
      | 10 = criminology       | 20 = linguistics        | 30 = political science     |                     |  
</details>  
</details> 
</details> 
   

##### The Smartphone Use Questionnaire (SUQ)  
The purpose of the SUQ was to measure participants' typical smartphone use. The SUQ was created for the present study and consisted of modified items from Ward and colleagues’ (2017) survey measures. (i.e. found in the “BRAIN DRAIN” WEB APPENDIX”). Some items were forced-choice and some were on a 7-point likert scale ranging from 1 (“Never”) to 7 (“Always”). There were 10 items in total and there were **five types of items**, which measured: 

<details>  
<summary> **1. Smartphone Use Frequency: (*SUQ_use_1 - SUQ_use_3*)** </summary> 

Each of the following items used the following coding:

|     CODE |  1  |   2  |   3   |  4  |
|---------:|:---:|:----:|:-----:|:---:|
| *response* | 0-5 | 6-10 | 11-15 | > 15 |

  - **Text Messages (per day) [*SUQ_use_1*]**  
    - Answer to the following question: "*On average, how many text messages do you send per day?*"  
  - **Social Media Messages (per day) [*SUQ_use_2*]**  
    - Answer to the following question: "*On average, how many social media based messages do you send per day from your smartphone? (iMessage, Facebook Messenger, WhatsApp, WeChat, direct messages within social media platforms, etc.)*"  
  - **Social Media Posts (per day) [*SUQ_use_3*]**  
    - Answer to the following question: "*On average, how many social media posts (e.g. written post, picture, article, etc.) do you send per day from your smartphone? (Facebook, Twitter, Instagram, etc.)*"  
</details> 


<details>  
<summary> **2. Smartphone Use With External Stimulation or During Other Activities: (*SUQ_use_4 - SUQ_use_7*)** </summary>  

Each of the following items used the following coding:  

|                   code |   1   |  2 |  3 |            4            | 5  | 6  | 7      |
|-----------------------:|:-----:|:--:|:--:|:-----------------------:|----|----|--------|
|             *response* | Never | -- | -- | About half <br>the time | -- | -- | Always |
| -- *no text specified* |       |    |    |                         |    |    |        |

  - **Text Messages (per day) [*SUQ_use_4*]**  
    - Answer to the following question: "*I look at my smartphone before I roll out of bed in the morning.*"  
  - **Social Media Messages (per day) [*SUQ_use_5*]**  
    - Answer to the following question: "*If I am waiting to meet a friend, I pass the time by using my smartphone.*"  
  - **Social Media Posts (per day) [*SUQ_use_6*]**  
    - Answer to the following question: "*I use my smartphone while driving.*"  
  - **Social Media Posts (per day) [*SUQ_use_7*]**  
    - Answer to the following question: "*If my smartphone rings or vibrates in the middle of personal business, I look at it.*"  
</details> 


<details>  
<summary> **3. Smartphone Subjective Value: (*SUQ_exp_1*)** </summary>  

  - **Subjective Value [*SUQ_use_1*]**  
    - Answer to the following question: "*How much money would it take for you to give up your phone for a full day?*"  
    <details>  
    <summary> coding information *(click to expand)* </summary>  

      |       code |    1   |    2    |    3    |   4  |
      |-----------:|:------:|:-------:|:-------:|:----:|
      | *response* | $0-$20 | $21-$40 | $41-$60 | > $60 |

</details>  
</details>  


<details>  
<summary> **4. Smartphone Notification Type: (*SUQ_exp_2 & SUQ_exp_3*)** </summary>  

  - **Notification Type [*SUQ_use_2*]**  
    - Answer to the following question: "*Do you receive notifications (a sound or vibration) on your phone? Please indicate all that apply.*"  
    <details>  
    <summary> coding information *(click to expand)* </summary>  
      
      |       code |   1   |     2    |    3    |     4     |     5    |     6    |        7        |
      |-----------:|:-----:|:--------:|:-------:|:---------:|:--------:|:--------:|:---------------:|
      | *response* | Email | Facebook | Twitter | Instagram | LinkedIn | Snapchat | Other (specify) |
      
  - **Specify Notification Type [*SUQ_exp_3*]**
    - Answer to the following question: "*Please Specify: Where you receive notifications (a sound or vibration) on your phone?*"
</details>  
</details>  


<details>  
<summary> **5. Phantom Vibrations: (*SUQ_exp_4*)** </summary>  

  - **Phantom Vibrations [*SUQ_use_4*]**  
    - Answer to the following question: "*Have you ever thought you heard your phone ring or thought you felt it vibrate, only to find out you were wrong?*"  
    <details>  
    <summary> coding information *(click to expand)* </summary>  

      |       code |  1  |  2 |
      |-----------:|:---:|:--:|
      | *response* | Yes | No |

</details>  
</details>  

##### The Smartphone Attachment and Dependency Inventory (SAD; Ward et al., 2017)  

<details>  
<summary> The SAD asked participants to respond to 13 items **(*SAD_1-SAD_13*)** using a 7-point likert scale. *(click to expand)* </summary>  

|       code |           1          |     2    |           3          |    4    |         5         |   6   |         7         |
|-----------:|:--------------------:|:--------:|:--------------------:|:-------:|:-----------------:|:-----:|:-----------------:|
| *response* | Strongly<br>Disagree | Disagree | Somewhat<br>Disagree | Neutral | Somewhat<br>Agree | Agree | Strongly<br>Agree |

- **SAD_1:** I would have trouble getting through a normal day without my smartphone.  
- **SAD_2:** It would be painful for me to give up my smartphone for a day.  
- **SAD_3:** I feel like I could not live without my smartphone.  
- **SAD_4:** If I forgot to bring my smartphone with me, I would feel anxious.  
- **SAD_5:** It drives me crazy when my smartphone runs out of battery.  
- **SAD_6:** I am upset and annoyed when I find I do not have reception on my smartphone.  
- **SAD_7:** I feel impatient when the Internet connection speed on my smartphone is slow.  
- **SAD_8:** I feel lonely when my smartphone does not ring or vibrate for several hours.  
- **SAD_9:** Using my smartphone relieves me of my stress.  
- **SAD_20:** I feel excited when I have a new message or notification.  
- **SAD_11:** Using my smartphone makes me feel happy.  
- **SAD_12:** I find it tough to focus whenever my smartphone is nearby.  
- **SAD_13:** I become less attentive to my surroundings when I’m using my smartphone.  

</details> 

</details>
  
<details>
<summary> This code imported the Demographic, SUQ, and SAD raw data. *(click to expand)* </summary>  


```{r}
survey_raw = utils::read.csv("raw_survey.csv", header = TRUE)

# reorganize rows so that the data is ordered by the participant number (e.g. 1, 2, 3, 4...)
survey_raw = dplyr::arrange(survey_raw, sur_participant)
```

  </details>  
</details>  

#### 5. Testing Tracking Sheet Raw Data  

<details>  

<summary> The **Testing Tracking Sheet Raw Data** was collected during testing *by the experimenter* to store relevant information on participant's (1) General Information (e.g. testing start date), (2) Manipulation Condition (e.g. assigned location and power manipulation), and (3) Data Cleaning (e.g. exclusion critera from testing error or other non-task-specific criteria). The raw data from the tracking sheet was imported as a dataframe, "**track_raw**", and included the following: *(click to see description)* </summary>  

<details>  
<summary> **1. General Information:** </summary>  

- **Testing Date [*track_date*]**  
  - The date a participant completed the study (Y/M/D)  
- **Testing Time [*date_end*]**  
  - The time a participant was scheduled to complete the study (24hr-time). *Note: Many participants were tested at the same time, but in separate cubicle-type desks to avoid any group-testing dynamic*  
- **Participant number [*track_participant*]**  
</details>  


<details>  
<summary> **2. Manupulation Condition:** </summary>  

- **Smartphone Location [*track_location*]**  
  - The smartphone location assigned to the participant (i.e. desk, pocket/bag, or outside)  
  <details>  
  <summary> coding information *(click to expand)* </summary>  

    |       code |   1  |      2     |    3    |
    |-----------:|:----:|:----------:|:-------:|
    | *response* | Desk | Pocket/Bag | Outside |

- **Smartphone Power [*track_power*]**  
  - The smartphone power assigned to the participant (i.e. ON or OFF)  
  <details>  
  <summary> coding information *(click to expand)* </summary>  

    |       code |  1 |  2  |
    |-----------:|:--:|:---:|
    | *response* | On | Off |

- **Smartphone Condition [*track_condition*]**  
  - The smartphone condition (i.e. combination of location & power) assigned to the participant  
  <details>  
  <summary> coding information *(click to expand)* </summary>  
    
    |       code |     1     |        2        |       3      |      4     |         5        |       6       |
    |-----------:|:---------:|:---------------:|:------------:|:----------:|:----------------:|:-------------:|
    | *response* | ON & Desk | ON & Pocket/Bag | ON & Outside | OFF & Desk | OFF & Pocket/Bag | OFF & Outside |

- **Counter Balanced Order [*balance*]**  
  - This depicts which of the main tasks (i.e. either the OSpan task or the Cue-Dependent Go/No-Go Task) was completed first  
  <details>  
  <summary> coding information *(click to expand)* </summary>  

    |       code |              1             |                      2                      |
    |-----------:|:--------------------------:|:-------------------------------------------:|
    | *response* | OSpan Task completed first | Cue-Dependent Go/No-Go Task completed first |
    
</details>  

<details>  
<summary> **3. Data Cleaning:** </summary>  

  - **Exclude from Analysis [*exclude*]**  
    - This depicts whether a participant should be excluded from analysis due to testing error or other non-task-specific criteria (i.e. for reasons other than the task exclusion criteria in the data analysis plan). 
    <details>  
    <summary> coding information *(click to expand)* </summary>  

      |       code |                1               |               2               |
      |-----------:|:------------------------------:|:-----------------------------:|
      | *response* | Yes (participant **excluded**) | No (participant **included**) |

  - **Reason for Exclusion from Analysis [*reason*]**  
    - This depicts the reason (summarized into 5 categories) a participant should be excluded from analysis due to testing error or other non-task-specific criteria (i.e. for reasons other than the task exclusion criteria in the data analysis plan). *Note: this includes a category for "no exclusion issues", where a participant was not excluded*  
    <details>  
    <summary> coding information *(click to expand)* </summary>  

      |       code |          1          |              2              |        3       |      4      |       5       |
      |-----------:|:-------------------:|:---------------------------:|:--------------:|:-----------:|:-------------:|
      | *response* | no exclusion issues | external distractor (noise) | gng incomplete | gng missing | ospan missing |
      
  - **Possible Outlier [*pos_out*]**  
    - This depicts whether a participant is a possible outlier for exploratory analyses. *Note: this was not applied to any analysis unless explicitly stated*
    <details>  
    <summary> coding information *(click to expand)* </summary>  

      |       code |                     1                    |                      2                      |
      |-----------:|:----------------------------------------:|:-------------------------------------------:|
      | *response* | Yes (does qualify as a possible outlier) | No (does not qualify as a possible outlier) |
      
  - **Experimenter [*experimenter*]**  
    - This depicts which experimenter (3 total) collected the data for a participant. *Note: this was collected for exploratory analyses and was not used unless explicitly stated*
    <details>  
    <summary> coding information *(click to expand)* </summary>  

      |       code |  1  |  2 |  3 |
      |-----------:|:---:|:--:|:--:|
      | *response* | ARP | SW | LM |

</details>  
</details>  

<details>
<summary> This code imported the study tracking sheet raw data. *(click to expand)* </summary>  

```{r}
track_raw = utils::read.csv("raw_tracking.csv", header = TRUE)

# reorganize rows so that the data is ordered by the participant number (e.g. 1, 2, 3, 4...)
track_raw = dplyr::arrange(track_raw, track_participant)
```

  </details> 
</details>   

## Construct Working Dataframes  

In order to complete the present study's analyses, eight data frames were created.  

### 1. All Data  

<details>  

<summary> All the data from the present study in one data frame: **"all_data"** *(click to see more)* </summary>  

- This data included the following:   
  - `r colnames(all_data)`  

<details>
<summary> This code organized the raw data into one data frame. *(click to expand)* </summary>

```{r}

# Find the final participant list for the analyses  
  # get a list of participants that (1) completed the gng task and (2) are not excluded (i.e., inclusion list) based on the "exclude" column in the tracking sheet  
  # (1) First get the gng participant list  
  gng_par_list = gng_data_extracted$gng_participant  
  # (2) Next, get the inclusion list  
  include_par_list = data.frame(track_raw$track_participant[which(track_raw$exclude %in% 2)])  
    colnames(include_par_list) = c("par_list")
  # Get a list of all the participants who completed the study (from the testing tracking sheet)
  all_participants = as.data.frame(track_raw$track_participant)
    colnames(all_participants) = c("all_participants")
  
# find the participants to remove from the data
  # Filter each data list from the list of all participants  
  gng_remove = data.frame("all_participants" = filter(all_participants, !(all_participants %in% gng_par_list)))
  track_remove = data.frame("all_participants" = filter(all_participants, !(all_participants %in% include_par_list$par_list)))
  reason_remove = c(rep("gng", length(gng_remove$all_participants)), rep("track", length(track_remove$all_participants)))
  
  # Make a list of all the removed participants -- "all_remove"
  all_remove_notclean = data.frame(rbind(gng_remove, track_remove), reason_remove)
    # remove any duplicates from the list
    all_remove_nodup = all_remove_notclean[!duplicated(all_remove_notclean$all_participants), ]
    # reorganize rows so that the data is ordered by the participant number (e.g. 1, 2, 3, 4...)  
    all_remove = dplyr::arrange(all_remove_nodup, all_participants)  
    
# Filter raw data using the final participant list for: ospan_raw, gng_raw, survey_raw, and track_raw
  # This will now the the working data for the study (i.e. the "final" data)  
    # OSpan Data  
    ospan_final = dplyr::filter(ospan_raw, !ospan_participant %in% all_remove$all_participants)
    # Gng Data  
    gng_final = dplyr::filter(gng_data_extracted, !gng_participant %in% all_remove$all_participants)
    # Survey Data  
    survey_final = dplyr::filter(survey_raw, !sur_participant %in% all_remove$all_participants)
    # Tracking Data  
    track_final = dplyr::filter(track_raw, !track_participant %in% all_remove$all_participants)

# Combine all the final data into 1 data frame with the following:
  # Start by creating a data frame with the final participant list in the 1st column: "participant"
  all_par = as.data.frame(track_final$track_participant)
    colnames(all_par) = c("participant")

  # to use the merge function, change the respective "participant" column name in each data frame to "participant"
    # OSpan Data
    colnames(ospan_final)[which(names(ospan_final) == "ospan_participant")] = c("participant")
    # Gng Data
    colnames(gng_final)[which(names(gng_final) == "gng_participant")] = c("participant")
    # Survey Data
    colnames(survey_final)[which(names(survey_final) == "sur_participant")] = c("participant")
    # Tracking Data
    colnames(track_final)[which(names(track_final) == "track_participant")] = c("participant")

# Merge all 4 data frames into 1 data frame -- "all_data"
  all_data = Reduce(function(x, y) merge(x, y, by="participant"), list(track_final, ospan_final, gng_final, dplyr::select(survey_final, -c(sur_location, sur_power, sur_condition))))
    # rename the tracking sheet columns for the location, power, and condition 
    colnames(all_data)[which(names(all_data) == c("track_location", "track_power", "track_condition"))] = c("location", "power", "condition")

```
  
</details>

</details>
  

### 2. All Data - Go/No-Go separated by Cue Type (WIDE FORMAT)

<details>  

<summary> All the data from the present study in one data frame, with the Go/No-Go data separated by target type (i.e. either "go" or "no-go"), presented in WIDE FORMAT: **"all_data_cue"** *(click to see more)* </summary>  

- This data included the following:   
  - `r colnames(all_data_cue)`  

<details>
<summary> This code organized the raw data into one data frame. *(click to expand)* </summary>
```{r warning=FALSE}
# use the list of all the participants to remove -- "all_remove" -- to filter the raw gng long data
  # for the "go" targets
  gng_v_final = dplyr::filter(gng_data_extracted_v, !participant_long %in% all_remove$all_participants)
  # for the "no-go" targets
  gng_h_final = dplyr::filter(gng_data_extracted_h, !participant_long %in% all_remove$all_participants)

# Combine all the final data into 1 data frame with the following:
  # use the final participant list -- "all_par" -- as the 1st column in the data frames

  # to use the merge function, change the respective "participant" column name in each data frame to "participant"
    # this was already done for the other data frames (e.g. ospan_final, survey_final)
    # for the "vertical" targets
    colnames(gng_v_final)[which(names(gng_v_final) == "participant_long")] = c("participant")
    # for the "horizontal" targets
    colnames(gng_h_final)[which(names(gng_h_final) == "participant_long")] = c("participant")

# Merge all 4 data frames into 1 data frame -- "all_data_cue"
  all_data_cue = Reduce(function(x, y) merge(x, y, by="participant"), list(track_final, ospan_final, gng_v_final, dplyr::select(gng_h_final, -c(age_long, lang_long)), dplyr::select(survey_final, -c(sur_location, sur_power, sur_condition))))
    # rename the tracking sheet columns for the location, power, and condition 
    colnames(all_data_cue)[which(names(all_data_cue) == c("track_location", "track_power", "track_condition"))] = c("location", "power", "condition")
    
```

</details>  
</details>  
  
### 3. All Data - Go/No-Go separated by Cue Type (LONG FORMAT)

<details>  

<summary> All the data from the present study in one data frame, with the Go/No-Go data separated by target type (i.e. either "go" or "no-go"), presented in LONG FORMAT: **"all_data_cue_long"** *(click to see more)* </summary>  

- This data included the following:   
  - `r colnames(all_data_cue_long)`  

<details>
<summary> This code organized the raw data into one data frame. *(click to expand)* </summary>
```{r message=FALSE, warning=FALSE}
# Re-structure the data into 1 data frame in a long format for a repeated measures analysis

  # using the "final" data frames of all the data, make 2 data frames, one for the "vertical" (all_data_v) and one for the "horizontal" (all_data_h) targets
    # for the "vertical" targets -- all_data_v
    all_data_v = Reduce(function(x, y) merge(x, y, by="participant"), list(track_final, ospan_final, gng_v_final, dplyr::select(survey_final, -c(sur_location, sur_power, sur_condition))))
      # rename the tracking sheet columns for the location, power, and condition 
      colnames(all_data_v)[which(names(all_data_v) == c("track_location", "track_power", "track_condition"))] = c("location", "power", "condition")
    # for the "horizontal" targets -- all_data_h
    all_data_h = Reduce(function(x, y) merge(x, y, by="participant"), list(track_final, ospan_final, gng_h_final, dplyr::select(survey_final, -c(sur_location, sur_power, sur_condition))))
      # rename the tracking sheet columns for the location, power, and condition 
      colnames(all_data_h)[which(names(all_data_h) == c("track_location", "track_power", "track_condition"))] = c("location", "power", "condition")

  # rename target-specific columns for both data frames (to bind)
    # for the "go" trials
    names(all_data_v)[names(all_data_v) == "trial_tot_v"] = "gng_trial_tot_long" 
    names(all_data_v)[names(all_data_v) == "score_v"] = "gng_score_long" 
    names(all_data_v)[names(all_data_v) == "resp_tot_v"] = "gng_resp_tot_long" 
    names(all_data_v)[names(all_data_v) == "no_resp_tot_v"] = "gng_no_resp_tot_long" 
    names(all_data_v)[names(all_data_v) == "err_tot_v"] = "gng_err_tot_long" 
    names(all_data_v)[names(all_data_v) == "err_com_v"] = "gng_err_com_long" 
    names(all_data_v)[names(all_data_v) == "err_omi_v"] = "gng_err_omi_long" 
    names(all_data_v)[names(all_data_v) == "rt_av_v"] = "gng_rt_av_long" 
    names(all_data_v)[names(all_data_v) == "exclude_v"] = "gng_exclude_long"
    # for the "no-go" trials
    names(all_data_h)[names(all_data_h) == "trial_tot_h"] = "gng_trial_tot_long" 
    names(all_data_h)[names(all_data_h) == "score_h"] = "gng_score_long" 
    names(all_data_h)[names(all_data_h) == "resp_tot_h"] = "gng_resp_tot_long" 
    names(all_data_h)[names(all_data_h) == "no_resp_tot_h"] = "gng_no_resp_tot_long" 
    names(all_data_h)[names(all_data_h) == "err_tot_h"] = "gng_err_tot_long" 
    names(all_data_h)[names(all_data_h) == "err_com_h"] = "gng_err_com_long" 
    names(all_data_h)[names(all_data_h) == "err_omi_h"] = "gng_err_omi_long" 
    names(all_data_h)[names(all_data_h) == "rt_av_h"] = "gng_rt_av_long" 
    names(all_data_h)[names(all_data_h) == "exclude_h"] = "gng_exclude_long"
  
  # Create a dataframe depicting the target type: either "go" (coded as 1) or "no-go" (coded as 2)
  cue_type = as.data.frame(c(rep(1, length(all_data_v$participant)), rep(2, length(all_data_h$participant))))
    colnames(cue_type) = c("cue")  

  # Bind the go and no-go dataframes into 1 (in long format)
  gng_cue_bind = dplyr::bind_rows(all_data_v, all_data_h)
  
  # add the target type column depicting the target type: either "go" (coded as 1) or "no-go" (coded as 2)
  all_data_cue_long = dplyr::bind_cols(cue_type, gng_cue_bind)
    
```

</details>  
</details>  
  
    
### 4. OSpan Analysis

<details>  

<summary> All the data related to the OSpan Analysis from the present study in one data frame: **"ospan_analysis"** *(click to see more)* </summary>  

- This data included the following:   
  - `r colnames(ospan_analysis)`  

<details>  
<summary> This code organized the final data into the ospan-related data frame. *(click to expand)* </summary>  
  
```{r}

# make a new data frame for the ospan analysis
ospan_analysis = dplyr::select(all_data, participant, age, gender, lang, program, year, location, power, balance, ospan_absolute_score, ospan_math_percent_corr, ospan_math_criteria_85)

```
  
</details>  

</details>  

### 5. Go/No-Go: Error Analysis  

<details>  

<summary> All the data related to the Go/No-Go Error Analysis from the present study in one data frame: **"gng_err_analysis"** 

Note: this data is in a long format for a repeated measures analysis (i.e. based on target type: either "go" or "no-go"). *(click to see more)* </summary>  

- This data included the following:   
  - `r colnames(gng_err_analysis)`  

<details>  
<summary> This code organized the final long data into a data frame for the Go/No-Go Error Analysis. *(click to expand)* </summary>  
  
```{r}
# make a new data frame for the gng error analysis
gng_err_analysis = dplyr::select(all_data_cue_long, participant, age_long, gender, lang_long, program, year, location, power, balance, cue, gng_exclude_long, gng_trial_tot_long, gng_resp_tot_long, gng_no_resp_tot_long, gng_err_tot_long, gng_err_com_long, gng_err_omi_long)

```
  
</details>  

</details>  

### 6. Go/No-Go: Accuracy Analysis  

<details>  

<summary> All the data related to the Go/No-Go Accuracy Analysis from the present study in one data frame: **"gng_ac_analysis"** *(click to see more)* </summary>  

- This data included the following:   
  - `r colnames(gng_ac_analysis)`  

<details>  
<summary> This code organized the final data into a data frame related to the Go/No-Go Accuracy Analysis. *(click to expand)* </summary>  
  
```{r}

# make a new data frame for the gng accuracy analysis
gng_ac_analysis = dplyr::select(all_data, participant, age, gender, lang, program, year, location, power, balance, gng_trial_tot, gng_resp_tot, gng_no_resp_tot, gng_rt_av) 

```
  
</details>  

</details>  
  
### 7. SAD: Factor Analysis  

<details>  

<summary> All the data related to the Factor Analysis from the present study in one data frame: **"factor_analysis"** *(click to see more)* </summary>  

- This data included the following:   
  - `r colnames(factor_analysis)`  

<details>  
<summary> This code organized the final data into the factor analysis-related data frame. *(click to expand)* </summary>  
  
```{r}

# make a new data frame for the factor analysis
factor_analysis = dplyr::select(all_data, participant, age, gender, lang, program, year, location, power, balance, SAD_1:SAD_13)

```
  
</details>  

</details>  
  
### 8. SAD: Generalized Linear Model  

<details>  

<summary> All the data related to the GLM Analysis from the present study in one data frame: **"glm_analysis"** *(click to see more)* </summary>  

- This data included the following:   
  - `r colnames(glm_analysis)`  

<details>  
<summary> This code organized the final data into the GLM-related data frame. *(click to expand)* </summary>  
  
```{r}

# make a new data frame for the glm analysis
glm_analysis = dplyr::select(all_data, participant, age, gender, lang, program, year, location, power, balance, SAD_1:SAD_13)

```
  
</details>  

</details>  

# Descriptives: Smartphone Use

```{r}
all_data_des = dplyr::select(all_data, participant, track_date, track_time, age, gender, lang, program, year)

# summary of age
summary(all_data_des$age)
# frequency of age
plyr::count(all_data_des$age)

# show frequency for gender
  # define gender as factor
  all_data_des$gender = factor(all_data_des$gender, 
                               levels = c(1, 2), 
                               labels = c("Male", "Female"))
  # show counts
  plyr::count(all_data_des$gender)

# show freq of year of study
plyr::count(all_data_des$year)

# show freq of program
  # first define the program as a factor
    all_data_des$program = factor(all_data_des$program, 
                  levels = c(3, 4, 13, 14, 28, 32, 37, 12, 17, 19, 15, 20, 24, 23, 26, 1, 2, 5, 7, 8, 11, 16, 18, 22, 27, 29, 33, 36, 6, 9, 10, 21, 25, 30, 31, 34, 35),
                  labels = c(rep("Arts & Humanities", 7), "Engineering", rep("Health Sciences", 2), rep("Information & Media Studies", 3), "Medicine & Dentistry", "Music", rep("Science", 13), rep("Social Science", 9)))
  # then show counts
  plyr::count(all_data_des$program)


```


# Planned Analysis

The following analyses were completed based on a pre-registered direct replication on OSF (view registration: <https://osf.io/5fq4r>) of Ward et al.'s (2017) second experiment. The analyses tested the three main hypotheses in the present study: (1) the location effect hypothesis, (2) the power effect hypothesis, and (3) the moderation effect hypothesis.  

Each analysis contains the following:  

1. Descriptive Statistics
2. Analysis *(including any relevant assumption tests)*
3. Visualizations

## Testing the Location and Power Hypotheses:  

### OSpan Task  

A **3(smartphone location: desk, pocket/bag, outside) x 2(smartphone power: ON, OFF)** between-subjects ANOVA was conducted *(DV = OSpan absolute score)*.   

Prior to completing the analysis, planned exclusion critera was applied to the data. Exclusion criteria for this analysis helped to control for any participants who did not follow the instructions, specifically, that participant payed attention to both the math component and letter recall component of the task. Only data from those who scored 85% or higher on the math component of the task were included.  

- Exclude data from any partitipants with (1) missing data and/or (2) the math criteria (i.e. minimum 85% on math accuracy). There was a total of **`r length(ospan_analysis_final$participant)`** participants in the OSpan analysis. 
  - Missing Data: `r length(which(is.na(ospan_analysis$ospan_absolute_score) == TRUE))` participants were removed due to missing ospan data.
  - Math Criteria (min. 85%): `r length(which(ospan_analysis$ospan_math_criteria_85 == 0))` participants were removed because they did not meet the math criteria.

```{r}
# filter any participants with missing OSpan data for the analysis  
    # for all ps, excluding any p with missing data in the OSpan Absolute Score ONLY
    ospan_analysis_noNA = ospan_analysis[!is.na(ospan_analysis$ospan_absolute_score), ] 
    # for all ps, excluding any that do not meet the math criteria (min 85% accuracy)
    ospan_analysis_final = ospan_analysis_noNA[ospan_analysis_noNA[,"ospan_math_criteria_85"] == 1,]
```

#### Descriptive Statistics 

The following table depicts the descriptive statistics for the OSpan analysis: the mean (*M*), standard deviation (*SD*), minimum score (*min.*), maximum score (*max.*), and the sample size (*n*).  

```{r}

# create a data frame with the descriptives of all the data, grouped by the location & power conditions
  # this creates information for the 6 possible combined conditions
  ospan_des = psych::describeBy(ospan_analysis_final, list(ospan_analysis_final$location, ospan_analysis_final$power), mat = TRUE)  
  
# organize the data into a new data frame depicting the M, SD, min., max., and n.
  ospan_des_t = rbind((cbind(t(ospan_des["ospan_absolute_score1",c("mean", "sd", "min", "max", "n")]),
                             t(ospan_des["ospan_absolute_score2",c("mean", "sd", "min", "max", "n")]),
                             t(ospan_des["ospan_absolute_score3",c("mean", "sd", "min", "max", "n")]))), 
                      (cbind(t(ospan_des["ospan_absolute_score4",c("mean", "sd", "min", "max", "n")]),
                             t(ospan_des["ospan_absolute_score5",c("mean", "sd", "min", "max", "n")]),
                             t(ospan_des["ospan_absolute_score6",c("mean", "sd", "min", "max", "n")]))))
    colnames(ospan_des_t) = c("On Desk", "In Pocket/Bag", "Outside")
    rownames(ospan_des_t) = rep(c("M", "SD", "min.", "max.", "n"), 2)
    
# use kable to show the descriptives table
  kable(ospan_des_t, caption = "Descriptive Statistics: Absolute Score on the OSpan Task", digits = 4, align = 'c') %>%
  kable_styling(bootstrap_options = "striped", "hover", full_width = F) %>%
  add_header_above(c(" ", "Smartphone Location" = 3)) %>%
  group_rows(index = c("Smartphone Power ON" = 5, "Smartphone Power OFF" = 5))

```

#### Analysis 

To assess if OSpan absolute score statistically differed, we conducted a 3x2 ANOVA with the between-subjects factors of smartphone location (i.e. on desk, in pocket/bag, or outside) and power (i.e. on or off). Since there are unequal sample sizes, we used Type III sum of squares.

##### Assumptions  

<details>  
<summary> The ANOVA made three assumptions. All assumptions were met *(click to see details)*. </summary>  

<span style="color:DarkBlue; font-size:18px;">1. Independent Random Sampling</span>  

This assumption was met during testing.  
    
<span style="color:DarkBlue; font-size:18px;">2. Normality</span>  

This assumption was tested by visualizing the residuals, applying a Shapiro-Wilk test to the resdiuals, and observing the Skewness and Kurtosis of the data *(extracted from the ANOVA output).*  
    
```{r echo=TRUE, message=FALSE, warning=FALSE}

# get the required data from the ospan_analysis data: participant, location, power, and score
ospan_anova_data = ospan_analysis_final[, c("participant", "location", "power", "balance", "ospan_absolute_score")]
    colnames(ospan_anova_data) = c("participant", "location", "power", "balance", "score")
    rownames(ospan_anova_data) = NULL
    # reformat location,  power, and balance as factors  
    ospan_anova_data$location = factor(ospan_anova_data$location, 
                  levels = c(1, 2, 3),
                  labels = c("On Desk", "Pocket/Bag", "Outside"))
    ospan_anova_data$power = factor(ospan_anova_data$power, 
                  levels = c(1, 2),
                  labels = c("ON", "OFF"))
    ospan_anova_data$balance = factor(ospan_anova_data$balance, 
                  levels = c(1, 2),
                  labels = c("Ospan First", "Go/No-Go First"))

# run between-subjects ANOVA (IV: Smartphone Power & Location, Balance; DV: OSpan Abolute Score)
ospan_anova = ezANOVA(
  data = ospan_anova_data
  , dv = .(score)
  , wid = .(participant)
  , between = .(location, power, balance)
  , type = 3 # unequal sample sizes, therfore use type 3
  , detailed = TRUE
  , return_aov = TRUE
)

# calculate & extract the residuals from the ANOVA
ospan_res = data.frame("residuals" = proj(ospan_anova$aov)[,"Residuals"])   

```
    
- It appears that the residuals lie closely to a straight line, which suggests normality *(see visualization below)*.
    
```{r}

qqnorm(ospan_res$residuals, main = "Q-Q Plot of the OSpan Residuals") 
qqline(ospan_res$residuals)

```
    
- It appears that residuals tend to follow the shape of a normal distribution, with a slight skew to the right *(see visualization below)*.
    
```{r message=FALSE, warning=FALSE}

qplot(ospan_res$residuals, main = "Histogram of Ospan Residuals") + 
  theme_classic()

```
   
- The Shapiro-Wilk test of normality was significant (based on $\alpha = 0.05$), *W* = `r round(as.numeric(ospan_shap$statistic["W"]), 2)`, *p* `r p_round(ospan_shap$p.value)` *(see code below)*.  
   
```{r}

ospan_shap = shapiro.test(ospan_res$residuals)
ospan_shap

```

- The data's Skewness and Kurtosis was examined and showed that both the Skewness (range: `r round(min(as.numeric(ospan_sk_t[c(1,4),])), 2)` to `r round(max(as.numeric(ospan_sk_t[c(1,4),])), 2)`) and Kurtosis (range: `r round(min(as.numeric(ospan_sk_t[c(2,5),])), 2)` to `r round(max(as.numeric(ospan_sk_t[c(2,5),])), 2)`) were within the acceptable range of $\pm 2.0$ and $\pm 9.0$, respectively (Schmider, Ziegler, Danay, Beyer, & Bühner, 2010) *(see table below)*.  

```{r}
# use the "ospan_des" data frame from the descriptives

# organize the data into a new data frame depicting the Skew, kurtosis, and sample size (n).
  ospan_sk_t = rbind((cbind(t(ospan_des["ospan_absolute_score1", c("skew", "kurtosis", "n")]),
                             t(ospan_des["ospan_absolute_score2", c("skew", "kurtosis", "n")]),
                             t(ospan_des["ospan_absolute_score3", c("skew", "kurtosis", "n")]))), 
                      (cbind(t(ospan_des["ospan_absolute_score4", c("skew", "kurtosis", "n")]),
                             t(ospan_des["ospan_absolute_score5", c("skew", "kurtosis", "n")]),
                             t(ospan_des["ospan_absolute_score6", c("skew", "kurtosis", "n")]))))
    colnames(ospan_sk_t) = c("On Desk", "In Pocket/Bag", "Outside")
    rownames(ospan_sk_t) = rep(c("Skewness", "Kurtosis", "n"), 2)
    
# use kable to show the descriptives table
  kable(ospan_sk_t, caption = "Skewness and Kurtosis of the Absolute Score on the OSpan Task", digits = 4, align = 'c') %>%
  kable_styling(bootstrap_options = "striped", "hover", full_width = F) %>%
  add_header_above(c(" ", "Smartphone Location" = 3)) %>%
  group_rows(index = c("Smartphone Power ON" = 3, "Smartphone Power OFF" = 3))

```

- Therefore, based on all the normality assumption checks, normality was assumed to be within the acceptable range for the ANOVA analysis. Additionally, it should be noted that ANOVA analyses can be robust to violations of normality (Gardner & Tremblay, 2007).

<span style="color:DarkBlue; font-size:18px;">3. Homogeneity of Variance</span> 

This assumption was tested by conducting Levene's Test for Homogeneity of Variance for each between-subjects variable (i.e. smartphone location and smartphone power).  

- The homogeneity assumption was met (based on $\alpha = 0.05$) for smartphone location, *F*(`r ospan_lev_loc$Df[1]`, `r ospan_lev_loc$Df[2]`) = `r round(ospan_lev_loc[2][1,1], 2)`, *p* = `r p_round(ospan_lev_loc[3][1,1])`, smartphone power, *F*(`r ospan_lev_pow$Df[1]`, `r ospan_lev_pow$Df[2]`) = `r round(ospan_lev_pow[2][1,1], 2)`, *p* `r p_round(ospan_lev_pow[3][1,1])`, and balance, *F*(`r ospan_lev_bal$Df[1]`, `r ospan_lev_bal$Df[2]`) = `r round(ospan_lev_bal[2][1,1], 2)`, *p* `r p_round(ospan_lev_bal[3][1,1])`*(see code below)*.

```{r}

# conduct levene's
  # for location
  ospan_lev_loc = leveneTest(data = ospan_anova_data, score ~ as.factor(location), center = median)
  ospan_lev_loc
  # for power
  ospan_lev_pow = leveneTest(data = ospan_anova_data, score ~ as.factor(power), center = median)
  ospan_lev_pow
  # for balance
  ospan_lev_bal = leveneTest(data = ospan_anova_data, score ~ as.factor(balance), center = median)
  ospan_lev_bal

```
</details>  

##### Results

All results were based on $\alpha = 0.05$. Below are the reported statistics for both main effects and the interactions, followed by a table depicting the ANOVA results.  

**Main Effect of Smartphone Location**  

There was no significant main effect of smartphone location on Ospan absolute score, *F*(`r ospan_anova$ANOVA[2,2]`, `r ospan_anova$ANOVA[2,3]`) = `r round(ospan_anova$ANOVA[2,6], 2)`, *p* `r p_round(ospan_anova$ANOVA[2,7])`, $\eta^{2}$ `r p_round(ospan_anova$ANOVA[2,9])`.

**Main Effect of Smartphone Power**  

There was no significant main effect of smartphone power on Ospan absolute score, *F*(`r ospan_anova$ANOVA[3,2]`, `r ospan_anova$ANOVA[3,3]`) = `r round(ospan_anova$ANOVA[3,6], 2)`, *p* `r p_round(ospan_anova$ANOVA[3,7])`, $\eta^{2}$ `r p_round(ospan_anova$ANOVA[3,9])`.

**Interaction of Balance**  

There was no significant main effect of balance (i.e. which task was completed first) on Ospan absolute score, *F*(`r ospan_anova$ANOVA[4,2]`, `r ospan_anova$ANOVA[4,3]`) = `r round(ospan_anova$ANOVA[4,6], 2)`, *p* `r p_round(ospan_anova$ANOVA[4,7])`, $\eta^{2}$ `r p_round(ospan_anova$ANOVA[4,9])`.

**Interaction of Smartphone Location & Smartphone Power**  

There was no significant interaction between smartphone location and smartphone power on Ospan absolute score, *F*(`r ospan_anova$ANOVA[5,2]`, `r ospan_anova$ANOVA[5,3]`) = `r round(ospan_anova$ANOVA[5,6], 2)`, *p* `r p_round(ospan_anova$ANOVA[5,7])`, $\eta^{2}$ `r p_round(ospan_anova$ANOVA[5,9])`.

**Interaction of Smartphone Location & Balance**  

There was no significant interaction between smartphone location and balance (i.e. which task was completed first) on Ospan absolute score, *F*(`r ospan_anova$ANOVA[6,2]`, `r ospan_anova$ANOVA[6,3]`) = `r round(ospan_anova$ANOVA[6,6], 2)`, *p* `r p_round(ospan_anova$ANOVA[6,7])`, $\eta^{2}$ `r p_round(ospan_anova$ANOVA[6,9])`.

**Interaction of Smartphone Power & Balance**  

There was no significant interaction between smartphone power and balance (i.e. which task was completed first) on Ospan absolute score, *F*(`r ospan_anova$ANOVA[7,2]`, `r ospan_anova$ANOVA[7,3]`) = `r round(ospan_anova$ANOVA[7,6], 2)`, *p* `r p_round(ospan_anova$ANOVA[7,7])`, $\eta^{2}$ `r p_round(ospan_anova$ANOVA[7,9])`.

**Interaction of Smartphone Location, Smartphone Power, & Balance**  

There was no significant interaction between smartphone location, smartphone power, and balance (i.e. which task was completed first) on Ospan absolute score, *F*(`r ospan_anova$ANOVA[8,2]`, `r ospan_anova$ANOVA[8,3]`) = `r round(ospan_anova$ANOVA[8,6], 2)`, *p* `r p_round(ospan_anova$ANOVA[8,7])`, $\eta^{2}$ `r p_round(ospan_anova$ANOVA[8,9])`.

**Post-hoc Tests**  

Since there was no significant main or interaction effects, no post-hocs were completed.

```{r}
# show ANOVA results in kable table 
kable(ospan_anova$ANOVA, caption = "3x2 ANOVA on Ospan Absolute Score", digits = 4, align = 'c') %>%
  kable_styling(bootstrap_options = "striped", "hover", full_width = F) 

```


#### Visualizations

The following shows a visulization of the Ospan absolute score based on the two smartphone conditions: location and power. *Note: Since the balance factor (i.e. which task was completed first) did not have a significant main effect or interaction effect on the data, this factor was not visualized in the plots*

```{r echo=FALSE, message=FALSE, warning=FALSE}

# create data frame with location conditions as characters rather than intergers 
  # First, get a subset of the descriptive data
  ospan_bar = ospan_analysis_final[, c("location", "power", "ospan_absolute_score")]
    colnames(ospan_bar) = c("location", "power", "score")
    rownames(ospan_bar) = NULL 
    
  ospan_bar$location = factor(ospan_bar$location, 
                  levels = c(1, 2, 3),
                  labels = c("On Desk", "Pocket/Bag", "Outside"))
  ospan_bar$power = factor(ospan_bar$power, 
                  levels = c(1, 2),
                  labels = c("ON", "OFF"))

#main_WMW_plot = prelim_hon_DT_data1
#main_WMW_plot[,2] = recode(main_WMW_plot$Condition, '1' = "On Desk", '3' = "Outside")

       
# plot bar graph - GOOD
ggplot(ospan_bar, aes(x = location, y = score, fill = power), show.legend = FALSE) +
  geom_bar(stat = "summary", fun.y = "mean", position = "dodge") + 
    guides(shape = FALSE) + 
  scale_fill_manual(values = c("cadetblue3", "deepskyblue4")) + 
  labs(title = "Performance on the OSpan Task",
       subtitle = "Average Absolute OSpan Score",
       x = "Smartphone Location", 
       y = "Score", 
       fill = "Smartphone Power",
       caption = "Note: Score is the number of correct letters recalled based \n on if all letters in a given block were recalled correctly.") +
        # the "\n" here denote that you want a new line formed in the text
  scale_y_continuous(limits = c(min(ospan_bar$score), max(ospan_bar$score)), breaks = ) +
  #scale_colour_manual(values = cols) +
  geom_point(stat = "identity", aes(color = power), size = 1, 
             position = position_jitterdodge(jitter.width = .53), show.legend = FALSE) +
  scale_color_manual(values = c("deepskyblue4", "purple4")) + 
  geom_point(stat="summary", fun.y="mean", position = position_dodge(.9), size = 1.5, show.legend = FALSE) +
  geom_errorbar(data = ospan_bar, stat = "summary", fun.ymin = function(x) mean(x) - sd(x)/sqrt(length(x)), 
                fun.ymax = function(x) mean(x) + sd(x)/sqrt(length(x)), size= .5, width= .25, position = position_dodge(.9)) +
  theme_classic() +
  theme(plot.title = element_text(color = "black", size = 14, face = "bold"), 
        plot.subtitle = element_text(color = "black", size = 13), 
        plot.caption = element_text(hjust = 0, size = 12, face = "italic"),
        text = element_text(size = 13))

```

### Cue-Dependent Go/No-Go Task  

An error (i.e. omission errors) and accuracy (i.e. RT) analysis was completed using the omission errors and average RT for each participant.  

#### Error Analysis  

A **3(Smartphone location: desk, pocket/bag, outside) x 2(Smartphone power: ON, OFF) x 2(Cue type: Vertical, Horizontal)** mixed ANOVA was completed *(DV = omission error(s))*. The between-subjects factors were smartphone location and power, and the within-subjects factor was pre-target cue type (i.e. "cue type"; either "vertical" or "horizontal").  

Prior to completing the analysis, planned exclusion critera was applied to the data. Exclusion criteria for this analysis helped to control for any participants who did not follow the instructions. Only data from those who scored higher than chance performance (i.e. responded to at least 50% of “Go” trials and withheld response to at least 50% of “No-Go” trials) were used for the final analysis.  

- Exclude data from any partitipants with (1) missing data and/or (2) the response criteria, where participants scored higher than chance performance (i.e. responded to at least 50% of “Go” trials and withheld response to at least 50% of “No-Go” trials). There was a total of **`r length(gng_err_final$participant[gng_err_final$cue == 1])`** participants in the Go/No-Go Error analysis. 
  - Missing Data: `r abs(length(gng_err_noNA$participant[gng_err_noNA$cue == 1]) - length(gng_err_analysis$participant[gng_err_analysis$cue == 1]))` participants were removed due to missing data.
  - Response Criteria: `r abs(length(gng_err_noNA$participant[gng_err_noNA$cue == 1]) - length(gng_err_final$participant[gng_err_final$cue == 1]))` participants were removed because they did not meet the response criteria.

```{r}
# filter any participants with missing data or that do not meet the inclusionn criteria

  # for all ps, excluding any p with missing data in the Omission Error 
  gng_err_noNA = gng_err_analysis[!is.na(gng_err_analysis$gng_err_omi_long),]
  
  # for all ps, excluding any that scored below chance performance (i.e. respondd to at least 50% of the "go" trials and withheld response to at least 50% of the "no-go" trials) using the "exclude" column
  gng_err_final = gng_err_noNA[gng_err_noNA$gng_exclude_long == 2,]

```

##### Descriptive Statistics 

The following table depicts the descriptive statistics for the Go/No-Go Error Analysis: the mean (*M*), standard deviation (*SD*), minimum score (*min.*), maximum score (*max.*), and the sample size (*n*).  

```{r}

# create a data frame with the descriptives of all the data, grouped by the location & power conditions
  # this creates information for the 6 possible combined conditions
  # for the "vertical" trials
  gng_err_des_v = describeBy(gng_err_final[gng_err_final$cue == 1,], 
                           list(gng_err_final[gng_err_final$cue == 1,]$location, 
                                gng_err_final[gng_err_final$cue == 1,]$power), mat = TRUE) 
  # for the "horizontal" trials
  gng_err_des_h = describeBy(gng_err_final[gng_err_final$cue == 2,], 
                           list(gng_err_final[gng_err_final$cue == 2,]$location, 
                                gng_err_final[gng_err_final$cue == 2,]$power), mat = TRUE) 
  
# organize the data into a new data frame depicting the M, SD, min., max., and n.
  # for the "vertical" trials
  gng_err_des_v_t = rbind((cbind(t(gng_err_des_v["gng_err_omi_long1",c("mean", "sd", "min", "max", "n")]),
                             t(gng_err_des_v["gng_err_omi_long2",c("mean", "sd", "min", "max", "n")]),
                             t(gng_err_des_v["gng_err_omi_long3",c("mean", "sd", "min", "max", "n")]))), 
                      (cbind(t(gng_err_des_v["gng_err_omi_long4",c("mean", "sd", "min", "max", "n")]),
                             t(gng_err_des_v["gng_err_omi_long5",c("mean", "sd", "min", "max", "n")]),
                             t(gng_err_des_v["gng_err_omi_long6",c("mean", "sd", "min", "max", "n")]))))
    colnames(gng_err_des_v_t) = c("On Desk", "In Pocket/Bag", "Outside")
    rownames(gng_err_des_v_t) = rep(c("M", "SD", "min.", "max.", "n"), 2)
  # for the "horizontal" trials
  gng_err_des_h_t = rbind((cbind(t(gng_err_des_h["gng_err_omi_long1",c("mean", "sd", "min", "max", "n")]),
                             t(gng_err_des_h["gng_err_omi_long2",c("mean", "sd", "min", "max", "n")]),
                             t(gng_err_des_h["gng_err_omi_long3",c("mean", "sd", "min", "max", "n")]))), 
                      (cbind(t(gng_err_des_h["gng_err_omi_long4",c("mean", "sd", "min", "max", "n")]),
                             t(gng_err_des_h["gng_err_omi_long5",c("mean", "sd", "min", "max", "n")]),
                             t(gng_err_des_h["gng_err_omi_long6",c("mean", "sd", "min", "max", "n")]))))
    colnames(gng_err_des_h_t) = c("On Desk", "In Pocket/Bag", "Outside")
    rownames(gng_err_des_h_t) = rep(c("M", "SD", "min.", "max.", "n"), 2)
    
# use kable to show the descriptives table
  gng_err_v_kt = 
    kable(gng_err_des_v_t, caption = "Descriptive Statistics: Omission Errors on the Go/No-Go Task for 'Vertical' Trials", digits = 4, align = 'c') %>%
    kable_styling(bootstrap_options = "striped", "hover", full_width = F) %>%
    add_header_above(c(" ", "Smartphone Location" = 3)) %>%
    group_rows(index = c("Smartphone Power ON" = 5, "Smartphone Power OFF" = 5))
  
  gng_err_h_kt = 
    kable(gng_err_des_h_t, caption = "Descriptive Statistics: Omission Errors on the Go/No-Go Task for 'Horizontal' Trials", digits = 4, align = 'c') %>%
    kable_styling(bootstrap_options = "striped", "hover", full_width = F) %>%
    add_header_above(c(" ", "Smartphone Location" = 3)) %>%
    group_rows(index = c("Smartphone Power ON" = 5, "Smartphone Power OFF" = 5))
  
  gng_err_v_kt
  gng_err_h_kt

```

##### Analysis 

To assess if the omission errors statistically differed, we conducted a 3x2x2 mixed ANOVA with the between-subjects factors of smartphone location (i.e. on desk, in pocket/bag, or outside) and power (i.e. on or off), and the within-subjects factor of cue type (i.e. "vertical" or "horizontal"). Since there are unequal sample sizes, we used Type III sum of squares.

###### Assumptions  

<details>  
<summary> The mixed ANOVA made four assumptions. The normality assumption was not met. Therefore, a robust ANOVA was completed to assess if the omission errors statistically differed between the factors (i.e. smartphone location, smartphone power, and cue type) *(click to see details)*. </summary>  

<span style="color:DarkBlue; font-size:18px;">1. Independent Random Sampling</span>  

This assumption was met during testing. 
    
<span style="color:DarkBlue; font-size:18px;">2. Normality</span>  

This assumption was tested by visualizing the residuals, applying a Shapiro-Wilk test to the resdiuals, and observing the Skewness and Kurtosis of the data *(extracted from the ANOVA output).*  

```{r echo=TRUE, message=FALSE, warning=FALSE}

# get the required data from the gng_err_final data: participant, location, power, and score
gng_err_anova_data = gng_err_final[,c("participant", "location", "power", "cue", "balance", "gng_err_omi_long")]
    colnames(gng_err_anova_data) = c("participant", "location", "power", "cue", "balance", "err_omi")
    rownames(gng_err_anova_data) = NULL
    gng_err_anova_data$location = as.factor(gng_err_anova_data$location)
    gng_err_anova_data$power = as.factor(gng_err_anova_data$power)
    gng_err_anova_data$cue = as.factor(gng_err_anova_data$cue)
    gng_err_anova_data$balance = as.factor(gng_err_anova_data$balance)

# run mixed ANOVA (IV b/w: Smartphone Power & Location; IV w/in: Target Type; DV: Omission Errors)
gng_err_anova = ezANOVA(
  data = gng_err_anova_data
  , dv = .(err_omi)
  , wid = .(participant)
  , within = .(cue)
  , between = .(location, power, balance)
  , type = 3 # unequal sample sizes, therefore use type 3
  , detailed = TRUE
  , return_aov = TRUE
)

# calculate & extract the residuals from the ANOVA
gng_err_res = data.frame("residuals" = proj(gng_err_anova$aov)[[3]][, "Residuals"])

```
    
- It appears that the residuals lie closely to a straight line (i.e. only a couple lie away form the line), which suggests normality *(see visualization below)*.
    
```{r}

qqnorm(gng_err_res$residuals, main = "Q-Q Plot of the Go/No-Go Error Residuals") 
qqline(gng_err_res$residuals)

```
    
- It appears that residuals are leptokurtic (i.e. there is a high peak near the center) *(see visualization below)*.
    
```{r message=FALSE, warning=FALSE}

qplot(gng_err_res$residuals, main = "Histogram of Go/No-Go Error Residuals", binwidth = .15) + 
  theme_classic()

```
   
- The Shapiro-Wilk test of normality was significant (based on $\alpha = 0.05$), *W* = `r round(as.numeric(gng_err_shap$statistic["W"]), 2)`, *p* `r p_round(gng_err_shap$p.value)` *(see code below)*. 
   
```{r}

gng_err_shap = shapiro.test(gng_err_res$residuals)
gng_err_shap

```

- The data's Skewness and Kurtosis was examined. For the vertical cue type, both the Skewness (range: `r round(min(as.numeric(gng_err_sk_v_t[c(1,4),])), 2)` to `r round(max(as.numeric(gng_err_sk_v_t[c(1,4),])), 2)`) and Kurtosis (range: `r round(min(as.numeric(gng_err_sk_v_t[c(2,5),])), 2)` to `r round(max(as.numeric(gng_err_sk_v_t[c(2,5),])), 2)`) were outside of the acceptable range of $\pm 2.0$ and $\pm 9.0$, respectively (Schmider, Ziegler, Danay, Beyer, & Bühner, 2010). For the horizontal cue type, both the Skewness (range: `r round(min(as.numeric(gng_err_sk_h_t[c(1,4),])), 2)` to `r round(max(as.numeric(gng_err_sk_h_t[c(1,4),])), 2)`) and Kurtosis (range: `r round(min(as.numeric(gng_err_sk_h_t[c(2,5),])), 2)` to `r round(max(as.numeric(gng_err_sk_h_t[c(2,5),])), 2)`) were outside of the acceptable range of $\pm 2.0$ and $\pm 9.0$, respectively (Schmider, Ziegler, Danay, Beyer, & Bühner, 2010) *(see table below)*.  

```{r}
# create & show a table of the skewness & kurtosis for the gng data, separated by cue type (i.e. one for the vertical and horizontal trials)

#--FOR THE VERTICAL CUE TYPE
# use the "gng_err_des_v" data frame from the descriptives

# organize the data into a new data frame depicting the Skew, kurtosis, and sample size (n).
  gng_err_sk_v_t = rbind((cbind(t(gng_err_des_v["gng_err_tot_long1", c("skew", "kurtosis", "n")]),
                             t(gng_err_des_v["gng_err_tot_long2", c("skew", "kurtosis", "n")]),
                             t(gng_err_des_v["gng_err_tot_long3", c("skew", "kurtosis", "n")]))), 
                      (cbind(t(gng_err_des_v["gng_err_tot_long4", c("skew", "kurtosis", "n")]),
                             t(gng_err_des_v["gng_err_tot_long5", c("skew", "kurtosis", "n")]),
                             t(gng_err_des_v["gng_err_tot_long6", c("skew", "kurtosis", "n")]))))
    colnames(gng_err_sk_v_t) = c("On Desk", "In Pocket/Bag", "Outside")
    rownames(gng_err_sk_v_t) = rep(c("Skewness", "Kurtosis", "n"), 2)
    
# use kable to show the descriptives table
  gng_err_sk_v = kable(gng_err_sk_v_t, caption = "Skewness and Kurtosis of the Omission Errors on the Go/No-Go Task for 'Vertical' Trials", digits = 4, align = 'c') %>%
    kable_styling(bootstrap_options = "striped", "hover", full_width = F) %>%
    add_header_above(c(" ", "Smartphone Location" = 3)) %>%
    group_rows(index = c("Smartphone Power ON" = 3, "Smartphone Power OFF" = 3))
  
#--FOR THE HORIZONTAL CUE TYPE
# use the "gng_err_des_h" data frame from the descriptives

# organize the data into a new data frame depicting the Skew, kurtosis, and sample size (n).
  gng_err_sk_h_t = rbind((cbind(t(gng_err_des_h["gng_err_tot_long1", c("skew", "kurtosis", "n")]),
                             t(gng_err_des_h["gng_err_tot_long2", c("skew", "kurtosis", "n")]),
                             t(gng_err_des_h["gng_err_tot_long3", c("skew", "kurtosis", "n")]))), 
                      (cbind(t(gng_err_des_h["gng_err_tot_long4", c("skew", "kurtosis", "n")]),
                             t(gng_err_des_h["gng_err_tot_long5", c("skew", "kurtosis", "n")]),
                             t(gng_err_des_h["gng_err_tot_long6", c("skew", "kurtosis", "n")]))))
    colnames(gng_err_sk_h_t) = c("On Desk", "In Pocket/Bag", "Outside")
    rownames(gng_err_sk_h_t) = rep(c("Skewness", "Kurtosis", "n"), 2)
    
# use kable to show the descriptives table
  gng_err_sk_h = kable(gng_err_sk_h_t, caption = "Skewness and Kurtosis of the Omission Errors on the Go/No-Go Task for 'Horizonatal' Trials", digits = 4, align = 'c') %>%
    kable_styling(bootstrap_options = "striped", "hover", full_width = F) %>%
    add_header_above(c(" ", "Smartphone Location" = 3)) %>%
    group_rows(index = c("Smartphone Power ON" = 3, "Smartphone Power OFF" = 3))
  
# show the tables
  # for the Vertical Cue Type
  gng_err_sk_v
  # for the Horizonatal Cue Type
  gng_err_sk_h

```

- Therefore, based on all the normality assumption checks, normality was not assumed to be within the acceptable range for the ANOVA analysis. 

<span style="color:DarkBlue; font-size:18px;">3. Homogeneity of Variance</span>  

  - This assumption was tested by conducting Levene's Test for Homogeneity of Variance for each between-subjects variable (i.e. smartphone location, smartphone power, and balance).

  - The homogeneity assumption was met (based on $\alpha = 0.05$) for both smartphone location, *F*(`r gng_err_lev_loc$Df[1]`, `r gng_err_lev_loc$Df[2]`) = `r round(gng_err_lev_loc[2][1,1], 2)`, *p* `r p_round(gng_err_lev_loc[3][1,1])`, smartphone power, *F*(`r gng_err_lev_pow$Df[1]`, `r gng_err_lev_pow$Df[2]`) = `r round(gng_err_lev_pow[2][1,1], 2)`, *p* `r p_round(gng_err_lev_pow[3][1,1])`, and balance, *F*(`r gng_err_lev_bal$Df[1]`, `r gng_err_lev_bal$Df[2]`) = `r round(gng_err_lev_bal[2][1,1], 2)`, *p* `r p_round(gng_err_lev_bal[3][1,1])`.*(see code below)*.

```{r}

# conduct levene's
  # for location
  gng_err_lev_loc = leveneTest(data = gng_err_anova_data, err_omi ~ as.factor(location), center = median)
  gng_err_lev_loc
  # for power
  gng_err_lev_pow = leveneTest(data = gng_err_anova_data, err_omi ~ as.factor(power), center = median)
  gng_err_lev_pow
  # for balance
  gng_err_lev_bal = leveneTest(data = gng_err_anova_data, err_omi ~ as.factor(balance), center = median)
  gng_err_lev_bal

```

<span style="color:DarkBlue; font-size:18px;">4. Sphericity</span>   

This assumption was met since sphericity is always met for two levels of a repeated measure factor and is, therefore, unnecessary to evaluate (Girden, 1992). 

</details>  


###### Results

Since the normality assumption was not met for the mixed ANOVA, a robust ANOVA was completed. This was done using the "welchADF" package (more details on this package: <https://journal.r-project.org/archive/2017/RJ-2017-049/index.html)>). The results from this analysis are shown below, followed by a table depicting the statistics.   

```{r warning=FALSE}

# compute the main effects
gng_err_wel = welchADF.test(formula = err_omi ~ location + power + balance + cue, response = "err_omi", between.s = .(location, power, balance), within.s = .(cue), subject = "participant", contrast = c("omnibus", "all.pairwise"), data = gng_err_anova_data, effect.size = TRUE, trimming = TRUE)

# NOTE:to get the effect sizes, the package will allow you to get the pairwise (i.e. post-hoc) effect sizes 
  # the robust ANOVA above does not automatically get the effect sizes 
  # Note: it must be done for each factor separately with the following as a guide
# welchADF.test(formula = err_omi ~ location + power + balance + cue, response = "err_omi", between.s = .(location, power, balance), within.s = .(cue), subject = "participant", contrast = "all.pairwise", data = gng_err_anova_data, effect.size = TRUE, trimming = TRUE, effect = "cue")

#welchADF.test(formula = err_omi ~ location + power + cue, data = gng_err_anova_data, effect = "cue", contrast = "all.pairwise", trimming = TRUE, effect.size = TRUE)

# add all the interaction effects
gng_err_robust = update.default(gng_err_wel, .~location*power*balance*cue)

```

All results were base on $\alpha = 0.05$. Below are the reported statistics for both main effects and the interaction, followed by a table depicting the ANOVA results.  

**Main Effect of Smartphone Location**  

There was no significant main effect of smartphone location on omission errors, *WJ*(`r gng_err_robust_table$DFn[1]`, `r round(gng_err_robust_table$DFd[1], 2)`) = `r round(gng_err_robust_table[1,4], 2)`, *p* `r p_round(gng_err_robust_table$p[1])`.

**Main Effect of Smartphone Power**  

There was no significant main effect of smartphone power on omission errors, *WJ*(`r gng_err_robust_table$DFn[2]`, `r round(gng_err_robust_table$DFd[2], 2)`) = `r round(gng_err_robust_table[2,4], 2)`, *p* `r p_round(gng_err_robust_table$p[2])`.  

**Main Effect of Smartphone Balance**  

There was no significant main effect of balance (i.e. which task was completed first) on omission errors, *WJ*(`r gng_err_robust_table$DFn[3]`, `r round(gng_err_robust_table$DFd[3], 2)`) = `r round(gng_err_robust_table[3,4], 2)`, *p* `r p_round(gng_err_robust_table$p[3])``.

**Main Effect of Smartphone Cue Type**  

There was no significant main effect of cue type on omission errors, *WJ*(`r gng_err_robust_table$DFn[4]`, `r round(gng_err_robust_table$DFd[4], 2)`) = `r round(gng_err_robust_table[4,4], 2)`, *p* `r p_round(gng_err_robust_table$p[4])``.  

**Interaction of Smartphone Location & Power**  

There was no significant interaction between smartphone location and power on omission errors, *WJ*(`r gng_err_robust_table$DFn[5]`, `r round(gng_err_robust_table$DFd[5], 2)`) = `r round(gng_err_robust_table[5,4], 2)`, *p* `r p_round(gng_err_robust_table$p[5])``.

**Interaction of Smartphone Location & Balance**  

There was no significant interaction between smartphone location and balance (i.e. which task was completed first) on omission errors, *WJ*(`r gng_err_robust_table$DFn[6]`, `r round(gng_err_robust_table$DFd[6], 2)`) = `r round(gng_err_robust_table[6,4], 2)`, *p* `r p_round(gng_err_robust_table$p[6])``.

**Interaction of Smartphone Power & Balance**  

There was no significant interaction between smartphone power and balance (i.e. which task was completed first) on omission errors, *WJ*(`r gng_err_robust_table$DFn[7]`, `r round(gng_err_robust_table$DFd[7], 2)`) = `r round(gng_err_robust_table[7,4], 2)`, *p* `r p_round(gng_err_robust_table$p[7])``. 

**Interaction of Smartphone Location & Cue Type**  

There was no significant interaction between smartphone location and cue type on omission errors, *WJ*(`r gng_err_robust_table$DFn[8]`, `r round(gng_err_robust_table$DFd[8], 2)`) = `r round(gng_err_robust_table[8,4], 2)`, *p* `r p_round(gng_err_robust_table$p[8])``. 

**Interaction of Smartphone Power & Cue Type**  

There was no significant interaction between smartphone power and cue type on omission errors, *WJ*(`r gng_err_robust_table$DFn[9]`, `r round(gng_err_robust_table$DFd[9], 2)`) = `r round(gng_err_robust_table[9,4], 2)`, *p* `r p_round(gng_err_robust_table$p[9])``. 

**Interaction of Balance & Cue Type**  

There was no significant interaction between balance (i.e. which task was completed first) and cue type on omission errors, *WJ*(`r gng_err_robust_table$DFn[10]`, `r round(gng_err_robust_table$DFd[10], 2)`) = `r round(gng_err_robust_table[10,4], 2)`, *p* `r p_round(gng_err_robust_table$p[10])``. 

**Interaction of Smartphone Location, Smartphone Power, & Balance**  

There was no significant interaction between smartphone location, smartphone power, and balance (i.e. which task was completed first) on omission errors, *WJ*(`r gng_err_robust_table$DFn[11]`, `r round(gng_err_robust_table$DFd[11], 2)`) = `r round(gng_err_robust_table[11,4], 2)`, *p* `r p_round(gng_err_robust_table$p[11])``. 

**Interaction of Smartphone Location, Smartphone Power, & Cue Type**  

There was no significant interaction between smartphone location, smartphone power, and cue type on omission errors, *WJ*(`r gng_err_robust_table$DFn[12]`, `r round(gng_err_robust_table$DFd[12], 2)`) = `r round(gng_err_robust_table[12,4], 2)`, *p* `r p_round(gng_err_robust_table$p[12])``. 

**Interaction of Smartphone Location, Balance, & Cue Type**  

There was no significant interaction between smartphone location, balance (i.e. which task was completed first), and cue type on omission errors, *WJ*(`r gng_err_robust_table$DFn[13]`, `r round(gng_err_robust_table$DFd[13], 2)`) = `r round(gng_err_robust_table[13,4], 2)`, *p* `r p_round(gng_err_robust_table$p[13])``. 

**Interaction of Smartphone Power, Balance, & Cue Type**  

There was no significant interaction between smartphone power, balance (i.e. which task was completed first), and cue type on omission errors, *WJ*(`r gng_err_robust_table$DFn[14]`, `r round(gng_err_robust_table$DFd[14], 2)`) = `r round(gng_err_robust_table[14,4], 2)`, *p* `r p_round(gng_err_robust_table$p[14])``. 

**Interaction of Smartphone Location, Smartphone Power, Balance, & Cue Type**  

There was no significant interaction between smartphone location, smartphone power, balance (i.e. which task was completed first), and cue type on omission errors, *WJ*(`r gng_err_robust_table$DFn[15]`, `r round(gng_err_robust_table$DFd[15], 2)`) = `r round(gng_err_robust_table[15,4], 2)`, *p* `r p_round(gng_err_robust_table$p[15])``. 

**Post-hoc Tests**  

Since there was no significant main or interaction effects, no post-hocs were completed.

```{r}

# create data frame from robust ANOVA results to display
  # get the Numerator DF (DFn)
  gng_err_robust_dfn = as.numeric(c(gng_err_robust$location$numeratorDF, gng_err_robust$power$numeratorDF, 
                                  gng_err_robust$balance$numeratorDF, gng_err_robust$cue$numeratorDF, 
                                  gng_err_robust$`location:power`$numeratorDF, gng_err_robust$`location:balance`$numeratorDF, 
                                  gng_err_robust$`power:balance`$numeratorDF, gng_err_robust$`location:cue`$numeratorDF, 
                                  gng_err_robust$`power:cue`$numeratorDF, gng_err_robust$`balance:cue`$numeratorDF, 
                                  gng_err_robust$`location:power:balance`$numeratorDF,
                                  gng_err_robust$`location:power:cue`$numeratorDF,
                                  gng_err_robust$`location:balance:cue`$numeratorDF, 
                                  gng_err_robust$`power:balance:cue`$numeratorDF, 
                                  gng_err_robust$`location:power:balance:cue`$numeratorDF))
  # get the Denominator DF (DFd)
  gng_err_robust_dfd = as.numeric(c(gng_err_robust$location$denominatorDF, gng_err_robust$power$denominatorDF, 
                                  gng_err_robust$balance$denominatorDF, gng_err_robust$cue$denominatorDF, 
                                  gng_err_robust$`location:power`$denominatorDF, gng_err_robust$`location:balance`$denominatorDF,
                                  gng_err_robust$`power:balance`$denominatorDF, gng_err_robust$`location:cue`$denominatorDF, 
                                  gng_err_robust$`power:cue`$denominatorDF, gng_err_robust$`balance:cue`$denominatorDF, 
                                  gng_err_robust$`location:power:balance`$denominatorDF,
                                  gng_err_robust$`location:power:cue`$denominatorDF,
                                  gng_err_robust$`location:balance:cue`$denominatorDF, 
                                  gng_err_robust$`power:balance:cue`$denominatorDF, 
                                  gng_err_robust$`location:power:balance:cue`$denominatorDF))
  # get the WJ statistic
  gng_err_robust_WJ = as.numeric(c(gng_err_robust$location$welch.T, gng_err_robust$power$welch.T, 
                                  gng_err_robust$balance$welch.T, gng_err_robust$cue$welch.T, 
                                  gng_err_robust$`location:power`$welch.T, gng_err_robust$`location:balance`$welch.T,
                                  gng_err_robust$`power:balance`$welch.T, gng_err_robust$`location:cue`$welch.T, 
                                  gng_err_robust$`power:cue`$welch.T, gng_err_robust$`balance:cue`$welch.T, 
                                  gng_err_robust$`location:power:balance`$welch.T,
                                  gng_err_robust$`location:power:cue`$welch.T,
                                  gng_err_robust$`location:balance:cue`$welch.T, 
                                  gng_err_robust$`power:balance:cue`$welch.T, 
                                  gng_err_robust$`location:power:balance:cue`$welch.T))
  # get the p value
  gng_err_robust_pval = as.numeric(c(gng_err_robust$location$pval, gng_err_robust$power$pval, 
                                  gng_err_robust$balance$pval, gng_err_robust$cue$pval, 
                                  gng_err_robust$`location:power`$pval, gng_err_robust$`location:balance`$pval,
                                  gng_err_robust$`power:balance`$pval, gng_err_robust$`location:cue`$pval, 
                                  gng_err_robust$`power:cue`$pval, gng_err_robust$`balance:cue`$pval, 
                                  gng_err_robust$`location:power:balance`$pval,
                                  gng_err_robust$`location:power:cue`$pval,
                                  gng_err_robust$`location:balance:cue`$pval, 
                                  gng_err_robust$`power:balance:cue`$pval, 
                                  gng_err_robust$`location:power:balance:cue`$pval))
  
  # bind the data into 1 data frame
  gng_err_robust_table = as.data.frame(cbind(gng_err_robust_dfn, gng_err_robust_dfd, gng_err_robust_WJ, gng_err_robust_pval))
    # add columns: (1) to denote the effect, and (2) to denote significance at p < .05 
    gng_err_robust_table = cbind(names(gng_err_robust)[1:15], 
                                 gng_err_robust_table, c(rep("", length(gng_err_robust_pval))))
    colnames(gng_err_robust_table) = c("Effect","DFn", "DFd", "WJ Stat", "p", "p < .05")
    rownames(gng_err_robust_table) = NULL
  
# show ANOVA results in kable table 
kable(gng_err_robust_table, caption = "3x2x2 Robust Mixed ANOVA on Omission Error in the Go/No-Go Task", digits = 4, align = 'c') %>%
  kable_styling(bootstrap_options = "striped", "hover", full_width = F) 

```

##### Visualizations

The following shows a visulization of the Go/No-Go omission error data based on the three factors: smartphone location, smartphone power, and cue type.

To visualize the mixed design, two plots were created (i.e. one for each cue type). *Note: Since the balance factor (i.e. which task was completed first) did not have a significant main effect or interaction effect on the data, this factor is not visualized in the plots*

```{r message=FALSE, warning=FALSE}
#--FOR THE VERTICAL CUE TYPE-- 

# create data frame with location conditions as characters rather than intergers 
  # First, get a subset of the data
  gng_err_bar_v_data = gng_err_final[gng_err_final$cue == 1, c("location", "power", "gng_err_omi_long")]
    colnames(gng_err_bar_v_data) = c("location", "power", "err_omi")
    rownames(gng_err_bar_v_data) = NULL 
  # reformat location & power as factors  
  gng_err_bar_v_data$location = factor(gng_err_bar_v_data$location, 
                  levels = c(1, 2, 3),
                  labels = c("On Desk", "Pocket/Bag", "Outside"))
  gng_err_bar_v_data$power = factor(gng_err_bar_v_data$power, 
                  levels = c(1, 2),
                  labels = c("ON", "OFF"))
       
# create plot
gng_err_bar_v = ggplot(gng_err_bar_v_data, aes(x = location, y = err_omi, fill = power), show.legend = FALSE) +
  geom_bar(stat = "summary", fun.y = "mean", position = "dodge") + 
    guides(shape = FALSE) + 
  scale_fill_manual(values = c("cadetblue3", "deepskyblue4")) + 
  labs(title = "Performance on the Go/No-Go Task: 'Vertical' Cue Type",
       subtitle = "Omission Errors",
       x = "Smartphone Location", 
       y = "Omission Error", 
       fill = "Smartphone Power",
       caption = paste0(strwrap("Figure X. Distributions of scores on the vertical cue type on the Cue-Dependent Go/No-Go Task, arranged by smartphone location (along the x-axis) and smartphone power (different bars). Omission errors occurred when a participant failed to respond to a non-target stimulus. Coloured dots represent individual data points. Black dots and error bars depict means and standard errors, respectively.", width = 80), collapse = "\n")) +
        # the "\n" here denote that you want a new line formed in the text
  scale_y_continuous(limits = c(min(gng_err_bar_v_data$err_omi), max(gng_err_bar_v_data$err_omi))) +
  #scale_colour_manual(values = cols) +
  geom_point(stat = "identity", aes(color = power), size = 1, 
             position = position_jitterdodge(jitter.width = .53), show.legend = FALSE) +
  scale_color_manual(values = c("deepskyblue4", "purple4")) + 
  geom_point(stat="summary", fun.y="mean", position = position_dodge(.9), size = 1.5, show.legend = FALSE) +
  geom_errorbar(data = gng_err_bar_v_data, stat = "summary", fun.ymin = function(x) mean(x) - sd(x)/sqrt(length(x)), 
                fun.ymax = function(x) mean(x) + sd(x)/sqrt(length(x)), size= .5, width= .25, position = position_dodge(.9)) +
  theme_classic() +
  theme(plot.title = element_text(color = "black", size = 14, face = "bold"), 
        plot.subtitle = element_text(color = "black", size = 13), 
        plot.caption = element_text(hjust = 0, size = 12, face = "italic"),
        text = element_text(size = 13))

#--FOR THE HORIZONTAL CUE TYPE-- 

# create data frame with location conditions as characters rather than intergers 
  # First, get a subset of the data
  gng_err_bar_h_data = gng_err_final[gng_err_final$cue == 2, c("location", "power", "gng_err_omi_long")]
    colnames(gng_err_bar_h_data) = c("location", "power", "err_omi")
    rownames(gng_err_bar_h_data) = NULL 
  # reformat location & power as factors  
  gng_err_bar_h_data$location = factor(gng_err_bar_h_data$location, 
                  levels = c(1, 2, 3),
                  labels = c("On Desk", "Pocket/Bag", "Outside"))
  gng_err_bar_h_data$power = factor(gng_err_bar_h_data$power, 
                  levels = c(1, 2),
                  labels = c("ON", "OFF"))
       
# create plot
gng_err_bar_h = ggplot(gng_err_bar_h_data, aes(x = location, y = err_omi, fill = power), show.legend = FALSE) +
  geom_bar(stat = "summary", fun.y = "mean", position = "dodge") + 
    guides(shape = FALSE) + 
  scale_fill_manual(values = c("cadetblue3", "deepskyblue4")) + 
  labs(title = "Performance on the Go/No-Go Task: 'Horizontal' Cue Type",
       subtitle = "Omission Errors",
       x = "Smartphone Location", 
       y = "Omission Error", 
       fill = "Smartphone Power",
       caption = paste0(strwrap("Figure X. Distributions of scores on the horizontal cue type on the Cue-Dependent Go/No-Go Task, arranged by smartphone location (along the x-axis) and smartphone power (different bars). Omission errors occurred when a participant failed to respond to a non-target stimulus. Coloured dots represent individual data points. Black dots and error bars depict means and standard errors, respectively.", width = 80), collapse = "\n")) +
        # the "\n" here denote that you want a new line formed in the text
  scale_y_continuous(limits = c(min(gng_err_bar_h_data$err_omi), max(gng_err_bar_h_data$err_omi))) +
  #scale_colour_manual(values = cols) +
  geom_point(stat = "identity", aes(color = power), size = 1, 
             position = position_jitterdodge(jitter.width = .53), show.legend = FALSE) +
  scale_color_manual(values = c("deepskyblue4", "purple4")) + 
  geom_point(stat="summary", fun.y="mean", position = position_dodge(.9), size = 1.5, show.legend = FALSE) +
  geom_errorbar(data = gng_err_bar_h_data, stat = "summary", fun.ymin = function(x) mean(x) - sd(x)/sqrt(length(x)), 
                fun.ymax = function(x) mean(x) + sd(x)/sqrt(length(x)), size= .5, width= .25, position = position_dodge(.9)) +
  theme_classic() +
  theme(plot.title = element_text(color = "black", size = 14, face = "bold"), 
        plot.subtitle = element_text(color = "black", size = 13), 
        plot.caption = element_text(hjust = 0, size = 12, face = "italic"),
        text = element_text(size = 13))

# Show plots
  # for the Vertical cue type
  gng_err_bar_v
  # for the Horizontal cue type
  gng_err_bar_h


```

#### Accuracy Analysis  

A **3(smartphone location: desk, pocket/bag, outside) x 2(smartphone power: ON, OFF)** between-subjects ANOVA was completed *(DV = average RT)*. The between-subjects factors were smartphone location and power.  

Prior to completing the analysis, planned exclusion critera was applied to the data. Exclusion criteria for this analysis helped to control for any participants who did not follow the instructions. Any participants who had a reaction time that was higher than two standard deviations (SD) from the mean reaction time were not included in the final analysis.  

- Exclude data from any partitipants with (1) missing data and/or (2) the reaction time SD criteria (i.e. within two standard devations from the mean RT). There was a total of **`r length(gng_ac_final$participant)`** participants in the Go/No-Go Error analysis. 
  - Missing Data: `r abs(nrow(gng_ac_analysis) - nrow(gng_ac_noNA))` participants were removed due to missing data (*note: this also removed data from participants who did not response during the task*).
  - Reaction Time SD Criteria: `r nrow(gng_ac_all_remove)` participants were removed because they did not meet the raction time SD criteria. The number of participants removed with respect to their smartphone location and power conditions was as follows: 
    - For the power "on" condition: `r length(gng_ac_remove1)` (on desk), `r length(gng_ac_remove2)` (pocket/bag), `r length(gng_ac_remove3)` (outside)
    - For the power "off" condition: `r length(gng_ac_remove4)` (on desk), `r length(gng_ac_remove5)` (pocket/bag), `r length(gng_ac_remove6)` (outside)


```{r}

# filter any participants with missing data or that do not meet the inclusionn criteria

  # for all ps, excluding any p with missing data in the average reaction time column (gng_rt_av)
  # Note: this also removed participants who did not respond at all during the task
  gng_ac_noNA = gng_ac_analysis[!is.na(gng_ac_analysis$gng_rt_av),]
  
  # for all ps, excluding any that did not meet the reaction time deviation criteria (i.e. within two standard devations from the mean RT)
  # get the mean RT with all the data included, divided by their respective smartphone condition (i.e. either location or power)
    # get the descriptive of the data
    gng_ac_noNA_des = psych::describeBy(gng_ac_noNA, list(gng_ac_noNA$location, gng_ac_noNA$power), mat = TRUE)
    # find the inclusion criteria for each condition (M - 2*SD)
      # power = ON 
        # location = "on desk"
        gng_ac_criteria1 = gng_ac_noNA_des["gng_rt_av1","mean"] + (gng_ac_noNA_des["gng_rt_av1","sd"] * 2)
        # location = "pocket/bag"
        gng_ac_criteria2 = gng_ac_noNA_des["gng_rt_av2","mean"] + (gng_ac_noNA_des["gng_rt_av2","sd"] * 2)
        # location = "outside"
        gng_ac_criteria3 = gng_ac_noNA_des["gng_rt_av3","mean"] + (gng_ac_noNA_des["gng_rt_av3","sd"] * 2)
      # power = OFF 
        # location = "on desk"
        gng_ac_criteria4 = gng_ac_noNA_des["gng_rt_av4","mean"] + (gng_ac_noNA_des["gng_rt_av4","sd"] * 2)
        # location = "pocket/bag"
        gng_ac_criteria5 = gng_ac_noNA_des["gng_rt_av5","mean"] + (gng_ac_noNA_des["gng_rt_av5","sd"] * 2)
        # location = "outside"
        gng_ac_criteria6 = gng_ac_noNA_des["gng_rt_av6","mean"] + (gng_ac_noNA_des["gng_rt_av6","sd"] * 2)
    # find the participants to remove from the data 
      # power = ON 
        # location = "on desk"
        gng_ac_remove1 = (gng_ac_noNA %>% filter(location == 1 & power == 1 & gng_rt_av > gng_ac_criteria1))[,"participant"]
        # location = "pocket/bag"
        gng_ac_remove2 = (gng_ac_noNA %>% filter(location == 2 & power == 1 & gng_rt_av > gng_ac_criteria2))[,"participant"]
        # location = "outside"
        gng_ac_remove3 = (gng_ac_noNA %>% filter(location == 3 & power == 1 & gng_rt_av > gng_ac_criteria3))[,"participant"]
      # power = OFF 
        # location = "on desk"
        gng_ac_remove4 = (gng_ac_noNA %>% filter(location == 1 & power == 2 & gng_rt_av > gng_ac_criteria4))[,"participant"]
        # location = "pocket/bag"
        gng_ac_remove5 = (gng_ac_noNA %>% filter(location == 2 & power == 2 & gng_rt_av > gng_ac_criteria5))[,"participant"]
        # location = "outside"
        gng_ac_remove6 = (gng_ac_noNA %>% filter(location == 3 & power == 3 & gng_rt_av > gng_ac_criteria6))[,"participant"]
    # make a list of all the participant to remove from the data
      gng_ac_all_remove = dplyr::arrange(data.frame("remove" = as.numeric(c(gng_ac_remove1, gng_ac_remove2, gng_ac_remove3, gng_ac_remove4, gng_ac_remove5, gng_ac_remove6))), remove)
        # note: this code also re-orders the list from smallest to largest
  # Filter the data using the final remove list 
  gng_ac_final = dplyr::filter(gng_ac_noNA, !participant %in% gng_ac_all_remove$remove)

```

##### Descriptive Statistics 

The following table depicts the descriptive statistics for the Go/No-Go Accuracy Analysis: the mean (*M*), standard deviation (*SD*), minimum score (*min.*), maximum score (*max.*), and the sample size (*n*).  

```{r}

# create a data frame with the descriptives of all the data, grouped by the location & power conditions
  # this creates information for the 6 possible combined conditions
  gng_ac_des = psych::describeBy(gng_ac_final, list(gng_ac_final$location, gng_ac_final$power), mat = TRUE)  
  
# organize the data into a new data frame depicting the M, SD, min., max., and n.
  gng_ac_des_t = rbind((cbind(t(gng_ac_des["gng_rt_av1",c("mean", "sd", "min", "max", "n")]),
                             t(gng_ac_des["gng_rt_av2",c("mean", "sd", "min", "max", "n")]),
                             t(gng_ac_des["gng_rt_av3",c("mean", "sd", "min", "max", "n")]))), 
                      (cbind(t(gng_ac_des["gng_rt_av4",c("mean", "sd", "min", "max", "n")]),
                             t(gng_ac_des["gng_rt_av5",c("mean", "sd", "min", "max", "n")]),
                             t(gng_ac_des["gng_rt_av6",c("mean", "sd", "min", "max", "n")]))))
    colnames(gng_ac_des_t) = c("On Desk", "In Pocket/Bag", "Outside")
    rownames(gng_ac_des_t) = rep(c("M", "SD", "min.", "max.", "n"), 2)
    
# use kable to show the descriptives table
  kable(gng_ac_des_t, caption = "Descriptive Statistics: Reaction Time (sec.) on the Go/No-Go Task", digits = 4, align = 'c') %>%
  kable_styling(bootstrap_options = "striped", "hover", full_width = F) %>%
  add_header_above(c(" ", "Smartphone Location" = 3)) %>%
  group_rows(index = c("Smartphone Power ON" = 5, "Smartphone Power OFF" = 5))

```

##### Analysis 

To assess if the reaction time statistically differed, we conducted a 3x2 ANOVA with the between-subjects factors of smartphone location (i.e. on desk, in pocket/bag, or outside) and power (i.e. on or off). Since there are unequal sample sizes, we used Type III sum of squares.

###### Assumptions  

<details>  
<summary> The ANOVA made three assumptions. The independent random sampling and normality assumptions were met. The homogeneity of variance assumption was not met, therefore, a White correction was done for the relevant effect. *(click to see details)*. </summary>  

<span style="color:DarkBlue; font-size:18px;">1. Independent Random Sampling</span>  

This assumption was met during testing.  
    
<span style="color:DarkBlue; font-size:18px;">2. Normality</span>  

This assumption was tested by visualizing the residuals, applying a Shapiro-Wilk test to the resdiuals, and observing the Skewness and Kurtosis of the data *(extracted from the ANOVA output).*  
    
```{r echo=TRUE, message=FALSE, warning=FALSE}

# get the required data from the ospan_analysis data: participant, location, power, and score
gng_ac_anova_data = gng_ac_final[, c("participant", "location", "power", "balance", "gng_rt_av")]
    colnames(gng_ac_anova_data) = c("participant", "location", "power", "balance", "rt_av")
    rownames(gng_ac_anova_data) = NULL
    # reformat location,  power, and balance as factors  
    gng_ac_anova_data$location = factor(gng_ac_anova_data$location, 
                  levels = c(1, 2, 3),
                  labels = c("On Desk", "Pocket/Bag", "Outside"))
    gng_ac_anova_data$power = factor(gng_ac_anova_data$power, 
                  levels = c(1, 2),
                  labels = c("ON", "OFF"))
    gng_ac_anova_data$balance = factor(gng_ac_anova_data$balance, 
                  levels = c(1, 2),
                  labels = c("Ospan First", "Go/No-Go First"))

# run between-subjects ANOVA (IV: Smartphone Power & Location, Balance; DV: OSpan Abolute Score)
gng_ac_anova = ezANOVA(
  data = gng_ac_anova_data
  , dv = .(rt_av)
  , wid = .(participant)
  , between = .(location, power, balance)
  , type = 3 # unequal sample sizes, therfore use type 3
  , detailed = TRUE
  , return_aov = TRUE
)

# calculate & extract the residuals from the ANOVA
gng_ac_res = data.frame("residuals" = proj(gng_ac_anova$aov)[,"Residuals"])   

```
    
- It appears that the residuals lie closely to a straight line, which suggests normality *(see visualization below)*.
    
```{r}

qqnorm(gng_ac_res$residuals, main = "Q-Q Plot of the Go/No-Go Reaction Time Residuals") 
qqline(gng_ac_res$residuals)

```
    
- It appears that residuals tend to follow the shape of a normal distribution, with a slight skew to the left and some data on the far right *(see visualization below)*.
    
```{r message=FALSE, warning=FALSE}

qplot(gng_ac_res$residuals, main = "Histogram of Go/No-Go Reaction Time Residuals") + 
  theme_classic()

```
   
- The Shapiro-Wilk test of normality was significant (based on $\alpha = 0.05$), *W* = `r round(as.numeric(gng_ac_shap$statistic["W"]), 2)`, *p* `r p_round(gng_ac_shap$p.value)` *(see code below)*.  
   
```{r}

gng_ac_shap = shapiro.test(gng_ac_res$residuals)
gng_ac_shap

```

- The data's Skewness and Kurtosis was examined and showed that both the Skewness (range: `r round(min(as.numeric(gng_ac_sk_t[c(1,4),])), 2)` to `r round(max(as.numeric(gng_ac_sk_t[c(1,4),])), 2)`) and Kurtosis (range: `r round(min(as.numeric(gng_ac_sk_t[c(2,5),])), 2)` to `r round(max(as.numeric(gng_ac_sk_t[c(2,5),])), 2)`) were within the acceptable range of $\pm 2.0$ and $\pm 9.0$, respectively (Schmider, Ziegler, Danay, Beyer, & Bühner, 2010) *(see table below)*.  

```{r}
# use the "gng_ac_des" data frame from the descriptives

# organize the data into a new data frame depicting the Skew, kurtosis, and sample size (n).
  gng_ac_sk_t = rbind((cbind(t(gng_ac_des["gng_rt_av1", c("skew", "kurtosis", "n")]),
                             t(gng_ac_des["gng_rt_av2", c("skew", "kurtosis", "n")]),
                             t(gng_ac_des["gng_rt_av3", c("skew", "kurtosis", "n")]))), 
                      (cbind(t(gng_ac_des["gng_rt_av4", c("skew", "kurtosis", "n")]),
                             t(gng_ac_des["gng_rt_av5", c("skew", "kurtosis", "n")]),
                             t(gng_ac_des["gng_rt_av6", c("skew", "kurtosis", "n")]))))
    colnames(gng_ac_sk_t) = c("On Desk", "In Pocket/Bag", "Outside")
    rownames(gng_ac_sk_t) = rep(c("Skewness", "Kurtosis", "n"), 2)
    
# use kable to show the descriptives table
  kable(gng_ac_sk_t, caption = "Skewness and Kurtosis of the Reaction Time on the Go/No-Go Task", digits = 4, align = 'c') %>%
  kable_styling(bootstrap_options = "striped", "hover", full_width = F) %>%
  add_header_above(c(" ", "Smartphone Location" = 3)) %>%
  group_rows(index = c("Smartphone Power ON" = 3, "Smartphone Power OFF" = 3))

```

- Therefore, based on all the normality assumption checks, normality was assumed to be within the acceptable range for the ANOVA analysis. Additionally, it should be noted that ANOVA analyses can be robust to violations of normality (Gardner & Tremblay, 2007).

<span style="color:DarkBlue; font-size:18px;">3. Homogeneity of Variance</span> 

This assumption was tested by conducting Levene's Test for Homogeneity of Variance for each between-subjects variable (i.e. smartphone location and smartphone power).  

- The homogeneity assumption was met (based on $\alpha = 0.05$) for smartphone location, *F*(`r gng_ac_lev_loc$Df[1]`, `r gng_ac_lev_loc$Df[2]`) = `r round(gng_ac_lev_loc[2][1,1], 2)`, *p* `r p_round(gng_ac_lev_loc[3][1,1])` and balance, *F*(`r gng_ac_lev_bal$Df[1]`, `r gng_ac_lev_bal$Df[2]`) = `r round(gng_ac_lev_bal[2][1,1], 2)`, *p* `r p_round(gng_ac_lev_bal[3][1,1])`. However, the homogeneity assumption was not met for smartphone power, *F*(`r gng_ac_lev_pow$Df[1]`, `r gng_ac_lev_pow$Df[2]`) = `r round(gng_ac_lev_pow[2][1,1], 2)`, *p* `r p_round(gng_ac_lev_pow[3][1,1])` *(see code below)*.

```{r}

# conduct levene's
  # for location
  gng_ac_lev_loc = leveneTest(data = gng_ac_anova_data, rt_av ~ as.factor(location), center = median)
  gng_ac_lev_loc
  # for power
  gng_ac_lev_pow = leveneTest(data = gng_ac_anova_data, rt_av ~ as.factor(power), center = median)
  gng_ac_lev_pow
  # for balance
  gng_ac_lev_bal = leveneTest(data = gng_ac_anova_data, rt_av ~ as.factor(balance), center = median)
  gng_ac_lev_bal

```

- The homogeneity of variance assumption was not met for the smarphone power condition. Therefore, considering there are unequal sample sizes, a White-corrected F-test was conducted (i.e. if interested in assessing the main effect of power).

```{r message=FALSE, warning=FALSE}
# run the corrected ANOVA (IV: Smartphone Power & Location, Balance; DV: OSpan Abolute Score)
# this will be used to report the main effect of smartphone power only 
gng_ac_anova_cor = ezANOVA(
  data = gng_ac_anova_data
  , dv = .(rt_av)
  , wid = .(participant)
  , between = .(location, power, balance)
  , type = 3 # unequal sample sizes, therfore use type 3
  , detailed = TRUE
  , return_aov = TRUE
  , white.adjust = TRUE
)

# Calculate effect size:
# this will be used to report the main effect of smartphone power only 
gng_ac_pow_es = (EtaSq(gng_ac_anova$aov, type = 3, anova = FALSE))[2]

```

</details>  


###### Results

All results were based on $\alpha = 0.05$. Below are the reported statistics for the main effects and the interactions, followed by a table depicting the ANOVA results.  

**Main Effect of Smartphone Location**  

There was no significant main effect of smartphone location on reaction time, *F*(`r gng_ac_t[2,2]`, `r gng_ac_t[2,3]`) = `r round(gng_ac_t[2,4], 2)`, *p* `r p_round(gng_ac_t[2,5])`, $\eta^{2}$ `r p_round(gng_ac_t[2,7])`.

**Main Effect of Smartphone Power**  

There was no significant main effect of smartphone power on reaction time, *F*(`r gng_ac_t[3,2]`, `r gng_ac_t[3,3]`) = `r round(gng_ac_t[3,4], 2)`, *p* `r p_round(gng_ac_t[3,5])`, $\eta^{2}$ `r p_round(gng_ac_t[3,7])`.

**Interaction of Balance**  

There was no significant main effect of balance (i.e. which task was completed first) on reaction time, *F*(`r gng_ac_t[4,2]`, `r gng_ac_t[4,3]`) = `r round(gng_ac_t[4,4], 2)`, *p* `r p_round(gng_ac_t[4,5])`, $\eta^{2}$ `r p_round(gng_ac_t[4,7])`.

**Interaction of Smartphone Location & Smartphone Power**  

There was no significant interaction between smartphone location and smartphone power on reaction time, *F*(`r gng_ac_t[5,2]`, `r gng_ac_t[5,3]`) = `r round(gng_ac_t[5,4], 2)`, *p* `r p_round(gng_ac_t[5,5])`, $\eta^{2}$ `r p_round(gng_ac_t[5,7])`.

**Interaction of Smartphone Location & Balance**  

There was no significant interaction between smartphone location and balance (i.e. which task was completed first) on reaction time, *F*(`r gng_ac_t[6,2]`, `r gng_ac_t[6,3]`) = `r round(gng_ac_t[6,4], 2)`, *p* `r p_round(gng_ac_t[6,5])`, $\eta^{2}$ `r p_round(gng_ac_t[6,7])`.

**Interaction of Smartphone Power & Balance**  

There was no significant interaction between smartphone power and balance (i.e. which task was completed first) on reaction time, *F*(`r gng_ac_t[7,2]`, `r gng_ac_t[7,3]`) = `r round(gng_ac_t[7,4], 2)`, *p* `r p_round(gng_ac_t[7,5])`, $\eta^{2}$ `r p_round(gng_ac_t[7,7])`.

**Interaction of Smartphone Location, Smartphone Power, & Balance**  

There was a significant interaction between smartphone location, smartphone power, and balance (i.e. which task was completed first) on reaction time, *F*(`r gng_ac_t[8,2]`, `r gng_ac_t[8,3]`) = `r round(gng_ac_t[8,4], 2)`, *p* `r p_round(gng_ac_t[8,5])`, $\eta^{2}$ `r p_round(gng_ac_t[8,7])`.

**Post-hoc Tests**  

Since there was a significant three-way interaction (i.e smartphone location, smartphone power, and balance), further analyses were completed *(see "Post-Hocs: Simple Two-Way Interactions" below)*.   

```{r}

# create a table with the anova results including the corrected values for the main effect of power (i.e. since it did not meet the homogeneity of variance assumption)
  # get the original anova data
  gng_ac_t = gng_ac_anova$ANOVA
    # since the corrected data does not have the SS, remove these columns from the original ANOVA table
    gng_ac_t = dplyr::select(gng_ac_t, -SSn, -SSd)
    # chance the data for the main effect of power to corrected numbers
    gng_ac_t[gng_ac_t$Effect == "power",] = c(gng_ac_anova_cor$ANOVA[3,], gng_ac_pow_es)
# show ANOVA results in kable table 
kable(gng_ac_t, caption = "3x2x2 ANOVA on Go/No-Go Reaction Time", digits = 4, align = 'c') %>%
  kable_styling(bootstrap_options = "striped", "hover", full_width = F) 

```


##### Post-Hocs: Simple Two-Way Interactions  

A three-way interaction (i.e. between smartphone location, smartphone power, and balance) suggested that a two-way interaction operated differently depending on the levels of the third factor. In order to interpret this complex interaction, the data was broken down into a subset based on the levels of one of the factors (i.e. "simple two-way interactions"). 

The main manipulation in the present study was smartphone location and power, additionally, the balance factor was designed to provide an experimental control for the counter-balanced task order. Therefore, the simple two-way interactions of the smartphone factors (i.e. location and power) were assessed at each level of balance (i.e. OSpan first and Go/No-Go first). Although this required two more analyses, this approach allowed us to disentangle the three-way interaction and therefore interpret where the differences were among the factors. Two new data sets were created for the simple two-way interactions: (1) for the OSpan First Data (*gng_ac_ospan_data*) and (2) for the Go/No-Go First Data (*gng_ac_gng_data*).

```{r}

# get a subset of the gng_ac_anova_data
  # for the OSpan first balance group only
  gng_ac_ospan_data = gng_ac_anova_data[gng_ac_anova_data$balance == "Ospan First", ]

  # for the Go/No-Go first balance group only
  gng_ac_gng_data = gng_ac_anova_data[gng_ac_anova_data$balance == "Go/No-Go First", ]

```


In order to control for multiple comparisons, a Bonferroni adjustment was used on the p-values to determine significance. This adjustment was completed by dividing the current alpha level ($\alpha = 0.05$) by the number of simple two-way interactions being compuetd (i.e. 2), therefore, the the bonferroni-corrected alpha value for statistical significance was `r 0.05/2`.


###### OSpan First: Descriptive Statistics 

The first simple two-way interaction ANOVA was completed using the "OSpan First" subset of the Balance Factor (i.e. participants who completed the OSpan task first, then the Go/No-Go task). As in the main 3-way ANOVA, the between-subjects factors were smartphone location (i.e. desk, pocket/bag or outside) and power (i.e. ON or OFF), and the dependent variable was accuracy (i.e. average reaction time).

There was a total of `r length(gng_ac_ospan_data$participant)` participants in the OSpan First analysis. The following table depicts the descriptive statistics: the mean (*M*), standard deviation (*SD*), minimum score (*min.*), maximum score (*max.*), and the sample size (*n*).  

```{r}

# create a data frame with the descriptives of all the data, grouped by the location & power conditions
  # this creates information for the 6 possible combined conditions
  gng_ac_ospan_des = psych::describeBy(gng_ac_ospan_data, list(gng_ac_ospan_data$location, gng_ac_ospan_data$power), mat = TRUE)  

# organize the data into a new data frame depicting the M, SD, min., max., and n.
  gng_ac_ospan_des_t = rbind((cbind(t(gng_ac_ospan_des["rt_av1", c("mean", "sd", "min", "max", "n")]),
                             t(gng_ac_ospan_des["rt_av2", c("mean", "sd", "min", "max", "n")]),
                             t(gng_ac_ospan_des["rt_av3", c("mean", "sd", "min", "max", "n")]))), 
                      (cbind(t(gng_ac_ospan_des["rt_av4", c("mean", "sd", "min", "max", "n")]),
                             t(gng_ac_ospan_des["rt_av5", c("mean", "sd", "min", "max", "n")]),
                             t(gng_ac_ospan_des["rt_av6", c("mean", "sd", "min", "max", "n")]))))
    colnames(gng_ac_ospan_des_t) = c("On Desk", "In Pocket/Bag", "Outside")
    rownames(gng_ac_ospan_des_t) = rep(c("M", "SD", "min.", "max.", "n"), 2)
  
# use kable to show the descriptives table
  kable(gng_ac_ospan_des_t, caption = "Ospan First Descriptive Statistics: Reaction Time (sec.) on the Go/No-Go Task", digits = 4, align = 'c') %>%
  kable_styling(bootstrap_options = "striped", "hover", full_width = F) %>%
  add_header_above(c(" ", "Smartphone Location" = 3)) %>%
  group_rows(index = c("Smartphone Power ON" = 5, "Smartphone Power OFF" = 5))

```


###### OSpan First: Assumptions  

<details>  
<summary> The two-way interaction made three assumptions. The independent random sampling and normality assumptions were met. The homogeneity of variance assumption was not met, therefore, a White correction was done for the relevant effect. *(click to see details)*. </summary>  

<span style="color:DarkBlue; font-size:18px;">1. Independent Random Sampling</span>  

This assumption was met during testing.  
    
<span style="color:DarkBlue; font-size:18px;">2. Normality</span>  

This assumption was tested by visualizing the residuals, applying a Shapiro-Wilk test to the resdiuals, and observing the Skewness and Kurtosis of the data *(extracted from the ANOVA output).*  
    
```{r echo=TRUE, message=FALSE, warning=FALSE}

# run between-subjects ANOVA (IV: Smartphone Power & Location; DV: OSpan Abolute Score)
gng_ac_ospan_anova = ezANOVA(
  data = gng_ac_ospan_data
  , dv = .(rt_av)
  , wid = .(participant)
  , between = .(location, power)
  , type = 3 # unequal sample sizes, therfore use type 3
  , detailed = TRUE
  , return_aov = TRUE
  )
  
# calculate & extract the residuals from the ANOVA
gng_ac_ospan_res = data.frame("residuals" = proj(gng_ac_ospan_anova$aov)[,"Residuals"])  

```
    
- It appears that the residuals lie closely to a straight line, which suggests normality *(see visualization below)*.
    
```{r}

qqnorm(gng_ac_ospan_res$residuals, main = "Q-Q Plot of the Go/No-Go Reaction Time Residuals for the OSpan First Group") 
qqline(gng_ac_ospan_res$residuals)

```
    
- It appears that residuals tend to follow the shape of a normal distribution *(see visualization below)*.
    
```{r message=FALSE, warning=FALSE}

qplot(gng_ac_ospan_res$residuals, main = "Histogram of Go/No-Go Reaction Time Residuals for the OSpan First Group") + 
  theme_classic()

```
   
- The Shapiro-Wilk test of normality was significant (based on $\alpha = 0.05$), *W* = `r round(as.numeric(gng_ac_ospan_shap$statistic["W"]), 2)`, *p* `r p_round(gng_ac_ospan_shap$p.value)` *(see code below)*.  
   
```{r}

gng_ac_ospan_shap = shapiro.test(gng_ac_ospan_res$residuals)
gng_ac_ospan_shap

```

- The data's Skewness and Kurtosis was examined and showed that both the Skewness (range: `r round(min(as.numeric(gng_ac_ospan_sk_t[c(1,4),])), 2)` to `r round(max(as.numeric(gng_ac_ospan_sk_t[c(1,4),])), 2)`) and Kurtosis (range: `r round(min(as.numeric(gng_ac_ospan_sk_t[c(2,5),])), 2)` to `r round(max(as.numeric(gng_ac_ospan_sk_t[c(2,5),])), 2)`) were within the acceptable range of $\pm 2.0$ and $\pm 9.0$, respectively (Schmider, Ziegler, Danay, Beyer, & Bühner, 2010) *(see table below)*.  

```{r}

# use the descriptives data -- gng_ac_ospan_des

# organize the data into a new data frame depicting the Skew, kurtosis, and sample size (n).
  gng_ac_ospan_sk_t = rbind((cbind(t(gng_ac_ospan_des["rt_av1", c("skew", "kurtosis", "n")]),
                             t(gng_ac_ospan_des["rt_av2", c("skew", "kurtosis", "n")]),
                             t(gng_ac_ospan_des["rt_av3", c("skew", "kurtosis", "n")]))), 
                      (cbind(t(gng_ac_ospan_des["rt_av4", c("skew", "kurtosis", "n")]),
                             t(gng_ac_ospan_des["rt_av5", c("skew", "kurtosis", "n")]),
                             t(gng_ac_ospan_des["rt_av6", c("skew", "kurtosis", "n")]))))
    colnames(gng_ac_ospan_sk_t) = c("On Desk", "In Pocket/Bag", "Outside")
    rownames(gng_ac_ospan_sk_t) = rep(c("Skewness", "Kurtosis", "n"), 2)
    
# use kable to show the descriptives table
  kable(gng_ac_ospan_sk_t, caption = "Skewness and Kurtosis of the Reaction Time on the Go/No-Go Task for the OSpan First Group", digits = 4, align = 'c') %>%
  kable_styling(bootstrap_options = "striped", "hover", full_width = F) %>%
  add_header_above(c(" ", "Smartphone Location" = 3)) %>%
  group_rows(index = c("Smartphone Power ON" = 3, "Smartphone Power OFF" = 3))

```

- Therefore, based on all the normality assumption checks, normality was assumed to be within the acceptable range for the analysis. Additionally, it should be noted that ANOVA analyses can be robust to violations of normality (Gardner & Tremblay, 2007).

<span style="color:DarkBlue; font-size:18px;">3. Homogeneity of Variance</span> 

This assumption was tested by conducting Levene's Test for Homogeneity of Variance for each between-subjects variable (i.e. smartphone location and smartphone power).  

- The homogeneity assumption was met (based on $\alpha = 0.05$) for smartphone power, *F*(`r gng_ac_ospan_lev_pow$Df[1]`, `r gng_ac_ospan_lev_pow$Df[2]`) = `r round(gng_ac_ospan_lev_pow[2][1,1], 2)`, *p* `r p_round(gng_ac_ospan_lev_pow[3][1,1])`. However, the homogeneity assumption was not met for smartphone location, *F*(`r gng_ac_ospan_lev_loc$Df[1]`, `r gng_ac_ospan_lev_loc$Df[2]`) = `r round(gng_ac_ospan_lev_loc[2][1,1], 2)`, *p* `r p_round(gng_ac_ospan_lev_loc[3][1,1])` *(see code below)*.

```{r}

# conduct levene's
  # for location
  gng_ac_ospan_lev_loc = leveneTest(data = gng_ac_ospan_data, rt_av ~ as.factor(location), center = median)
  gng_ac_ospan_lev_loc
  # for power
  gng_ac_ospan_lev_pow = leveneTest(data = gng_ac_ospan_data, rt_av ~ as.factor(power), center = median)
  gng_ac_ospan_lev_pow

```

- The homogeneity of variance assumption was not met for the smartphone location condition. Therefore, considering there were unequal sample sizes, a White-corrected F-test was conducted (i.e. if interested in assessing the main effect of location).

```{r message=FALSE, warning=FALSE}
# run the corrected ANOVA (IV: Smartphone Power & Location; DV: Go/No-Go RT)
# this will be used to report the main effect of smartphone power only  
gng_ac_ospan_anova_cor = ezANOVA(
  data = gng_ac_ospan_data
  , dv = .(rt_av)
  , wid = .(participant)
  , between = .(location, power)
  , type = 3 # unequal sample sizes, therfore use type 3
  , detailed = TRUE
  , return_aov = TRUE
  , white.adjust = TRUE
)

# Calculate effect size:
# this will be used to report the main effect of smartphone power only 
gng_ac_ospan_loc_es = (EtaSq(gng_ac_ospan_anova_cor$aov, type = 3, anova = FALSE))[2]

```

</details>  

###### OSpan First: Results  

All results were based on $\alpha_{adjusted} = 0.025$. Below are the reported statistics for the main and interaction effect of the simple two-way ANOVA, followed by a table depicting the ANOVA results.

**Main Effect of Smartphone Location**  

There was no significant main effect of smartphone location on reaction time, *F*(`r gng_ac_ospan_t[2,2]`, `r gng_ac_ospan_t[2,3]`) = `r round(gng_ac_ospan_t[2,4], 2)`, *p* `r p_round(gng_ac_ospan_t[2,5])`, $\eta^{2}$ `r p_round(gng_ac_ospan_t[2,7])`.

**Main Effect of Smartphone Power**  

There was no significant main effect of smartphone power on reaction time, *F*(`r gng_ac_ospan_t[3,2]`, `r gng_ac_ospan_t[3,3]`) = `r round(gng_ac_ospan_t[3,4], 2)`, *p* `r p_round(gng_ac_ospan_t[3,5])`, $\eta^{2}$ `r p_round(gng_ac_ospan_t[3,7])`.

**Interaction of Smartphone Location & Smartphone Power**  

There was no significant interaction between smartphone location and smartphone power on reaction time, *F*(`r gng_ac_ospan_t[4,2]`, `r gng_ac_ospan_t[4,3]`) = `r round(gng_ac_ospan_t[4,4], 2)`, *p* `r p_round(gng_ac_ospan_t[4,5])`, $\eta^{2}$ `r p_round(gng_ac_ospan_t[4,7])`.

**Post-hoc Tests**  

Since there was no significant main or interaction effects, no post-hocs were completed.


```{r}

# create a table with the anova results including the corrected values for the main effect of power (i.e. since it did not meet the homogeneity of variance assumption)
  # get the original anova data
  gng_ac_ospan_t = gng_ac_ospan_anova$ANOVA
    # since the corrected data does not have the SS, remove these columns from the original ANOVA table
    gng_ac_ospan_t = dplyr::select(gng_ac_ospan_t, -SSn, -SSd)
    # change the data for the main effect of location to corrected numbers
    gng_ac_ospan_t[gng_ac_ospan_t$Effect == "location",] = c(gng_ac_ospan_anova_cor$ANOVA[2,], gng_ac_ospan_loc_es)
    # re-define the alpha value based on the bonferroni correction
      # rename the "p<.05" column appropriately -- "p<.025"
      colnames(gng_ac_ospan_t)[6] = "p<.025"
      # change values to fit the data
      gng_ac_ospan_t$`p<.025` = ifelse(gng_ac_ospan_t$p < .025, "*", "")
    
# show ANOVA results in kable table 
kable(gng_ac_ospan_t, caption = "OSpan First: Simple Two-Way Interaction on Reaction Time on the Go/No-Go Task", digits = 4, align = 'c') %>%
  kable_styling(bootstrap_options = "striped", "hover", full_width = F) 

```


###### Go/No-Go First: Descriptive Statistics  

The second simple two-way interaction ANOVA was completed using the "Go/No-Go First" subset of the Balance Factor (i.e. participants who completed the Go/No-Go task first, then the OSpan task). As in the main 3-way ANOVA, the between-subjects factors were smartphone location (i.e. desk, pocket/bag or outside) and power (i.e. ON or OFF), and the dependent variable was accuracy (i.e. average reaction time).

There was a total of `r length(gng_ac_gng_data$participant)` participants in the Go/No-Go First analysis. The following table depicts the descriptive statistics: the mean (*M*), standard deviation (*SD*), minimum score (*min.*), maximum score (*max.*), and the sample size (*n*).  

```{r}

# create a data frame with the descriptives of all the data, grouped by the location & power conditions
  # this creates information for the 6 possible combined conditions
  gng_ac_gng_des = psych::describeBy(gng_ac_gng_data, list(gng_ac_gng_data$location, gng_ac_gng_data$power), mat = TRUE)  

# organize the data into a new data frame depicting the M, SD, min., max., and n.
  gng_ac_gng_des_t = rbind((cbind(t(gng_ac_gng_des["rt_av1", c("mean", "sd", "min", "max", "n")]),
                             t(gng_ac_gng_des["rt_av2", c("mean", "sd", "min", "max", "n")]),
                             t(gng_ac_gng_des["rt_av3", c("mean", "sd", "min", "max", "n")]))), 
                      (cbind(t(gng_ac_gng_des["rt_av4", c("mean", "sd", "min", "max", "n")]),
                             t(gng_ac_gng_des["rt_av5", c("mean", "sd", "min", "max", "n")]),
                             t(gng_ac_gng_des["rt_av6", c("mean", "sd", "min", "max", "n")]))))
    colnames(gng_ac_gng_des_t) = c("On Desk", "In Pocket/Bag", "Outside")
    rownames(gng_ac_gng_des_t) = rep(c("M", "SD", "min.", "max.", "n"), 2)
  
# use kable to show the descriptives table
  kable(gng_ac_gng_des_t, caption = "Go/No-Go First Descriptive Statistics: Reaction Time (sec.) on the Go/No-Go Task", digits = 4, align = 'c') %>%
  kable_styling(bootstrap_options = "striped", "hover", full_width = F) %>%
  add_header_above(c(" ", "Smartphone Location" = 3)) %>%
  group_rows(index = c("Smartphone Power ON" = 5, "Smartphone Power OFF" = 5))

```


###### Go/No-Go First: Assumptions  

<details>  
<summary> The two-way interaction made three assumptions. The independent random sampling and normality assumptions were met. The homogeneity of variance assumption was not met, therefore, a White correction was done for the relevant effect. *(click to see details)*. </summary>  

<span style="color:DarkBlue; font-size:18px;">1. Independent Random Sampling</span>  

This assumption was met during testing.  
    
<span style="color:DarkBlue; font-size:18px;">2. Normality</span>  

This assumption was tested by visualizing the residuals, applying a Shapiro-Wilk test to the resdiuals, and observing the Skewness and Kurtosis of the data *(extracted from the ANOVA output).*  
    
```{r echo=TRUE, message=FALSE, warning=FALSE}

# run between-subjects ANOVA (IV: Smartphone Power & Location; DV: OSpan Abolute Score)
gng_ac_gng_anova = ezANOVA(
  data = gng_ac_gng_data
  , dv = .(rt_av)
  , wid = .(participant)
  , between = .(location, power)
  , type = 3 # unequal sample sizes, therfore use type 3
  , detailed = TRUE
  , return_aov = TRUE
  )

# calculate & extract the residuals from the ANOVA
gng_ac_gng_res = data.frame("residuals" = proj(gng_ac_gng_anova$aov)[,"Residuals"])   

```
    
- It appears that the residuals lie closely to a straight line, which suggests normality *(see visualization below)*.
    
```{r}

qqnorm(gng_ac_gng_res$residuals, main = "Q-Q Plot of the Go/No-Go Reaction Time Residuals for the Go/No-Go First Group") 
qqline(gng_ac_gng_res$residuals)

```
    
- It appears that residuals tend to follow the shape of a normal distribution *(see visualization below)*.
    
```{r message=FALSE, warning=FALSE}

qplot(gng_ac_gng_res$residuals, main = "Histogram of Go/No-Go Reaction Time Residuals for the Go/No-Go First Group") + 
  theme_classic()

```
   
- The Shapiro-Wilk test of normality was significant (based on $\alpha = 0.05$), *W* = `r round(as.numeric(gng_ac_gng_shap$statistic["W"]), 2)`, *p* `r p_round(gng_ac_gng_shap$p.value)` *(see code below)*.  
   
```{r}

gng_ac_gng_shap = shapiro.test(gng_ac_gng_res$residuals)
gng_ac_gng_shap

```

- The data's Skewness and Kurtosis was examined and showed that both the Skewness (range: `r round(min(as.numeric(gng_ac_gng_sk_t[c(1,4),])), 2)` to `r round(max(as.numeric(gng_ac_gng_sk_t[c(1,4),])), 2)`) and Kurtosis (range: `r round(min(as.numeric(gng_ac_gng_sk_t[c(2,5),])), 2)` to `r round(max(as.numeric(gng_ac_gng_sk_t[c(2,5),])), 2)`) were within the acceptable range of $\pm 2.0$ and $\pm 9.0$, respectively (Schmider, Ziegler, Danay, Beyer, & Bühner, 2010) *(see table below)*.  

```{r}

# use the descriptives data -- gng_ac_gng_des

# organize the data into a new data frame depicting the Skew, kurtosis, and sample size (n).
  gng_ac_gng_sk_t = rbind((cbind(t(gng_ac_gng_des["rt_av1", c("skew", "kurtosis", "n")]),
                             t(gng_ac_gng_des["rt_av2", c("skew", "kurtosis", "n")]),
                             t(gng_ac_gng_des["rt_av3", c("skew", "kurtosis", "n")]))), 
                      (cbind(t(gng_ac_gng_des["rt_av4", c("skew", "kurtosis", "n")]),
                             t(gng_ac_gng_des["rt_av5", c("skew", "kurtosis", "n")]),
                             t(gng_ac_gng_des["rt_av6", c("skew", "kurtosis", "n")]))))
    colnames(gng_ac_gng_sk_t) = c("On Desk", "In Pocket/Bag", "Outside")
    rownames(gng_ac_gng_sk_t) = rep(c("Skewness", "Kurtosis", "n"), 2)
    
# use kable to show the descriptives table
  kable(gng_ac_gng_sk_t, caption = "Skewness and Kurtosis of the Reaction Time on the Go/No-Go Task for the Go/No-Go First Group", digits = 4, align = 'c') %>%
  kable_styling(bootstrap_options = "striped", "hover", full_width = F) %>%
  add_header_above(c(" ", "Smartphone Location" = 3)) %>%
  group_rows(index = c("Smartphone Power ON" = 3, "Smartphone Power OFF" = 3))

```

- Therefore, based on all the normality assumption checks, normality was assumed to be within the acceptable range for the analysis. Additionally, it should be noted that ANOVA analyses can be robust to violations of normality (Gardner & Tremblay, 2007).

<span style="color:DarkBlue; font-size:18px;">3. Homogeneity of Variance</span> 

This assumption was tested by conducting Levene's Test for Homogeneity of Variance for each between-subjects variable (i.e. smartphone location and smartphone power).  

- The homogeneity assumption was met (based on $\alpha = 0.05$) for smartphone location, *F*(`r gng_ac_gng_lev_loc$Df[1]`, `r gng_ac_gng_lev_loc$Df[2]`) = `r round(gng_ac_gng_lev_loc[2][1,1], 2)`, *p* `r p_round(gng_ac_gng_lev_loc[3][1,1])`. However, the homogeneity assumption was not met for smartphone power, *F*(`r gng_ac_gng_lev_pow$Df[1]`, `r gng_ac_gng_lev_pow$Df[2]`) = `r round(gng_ac_gng_lev_pow[2][1,1], 2)`, *p* `r p_round(gng_ac_gng_lev_pow[3][1,1])` *(see code below)*.

```{r}

# conduct levene's
  # for location
  gng_ac_gng_lev_loc = leveneTest(data = gng_ac_gng_data, rt_av ~ as.factor(location), center = median)
  gng_ac_gng_lev_loc
  # for power
  gng_ac_gng_lev_pow = leveneTest(data = gng_ac_gng_data, rt_av ~ as.factor(power), center = median)
  gng_ac_gng_lev_pow

```

- The homogeneity of variance assumption was not met for the smartphone power condition. Therefore, considering there were unequal sample sizes, a White-corrected F-test was conducted (i.e. if interested in assessing the main effect of power).

```{r message=FALSE, warning=FALSE}
# run the corrected ANOVA (IV: Smartphone Power & Location ; DV: Go/No-Go RT)
# this will be used to report the main effect of smartphone power only  
gng_ac_gng_anova_cor = ezANOVA(
  data = gng_ac_gng_data
  , dv = .(rt_av)
  , wid = .(participant)
  , between = .(location, power)
  , type = 3 # unequal sample sizes, therfore use type 3
  , detailed = TRUE
  , return_aov = TRUE
  , white.adjust = TRUE
)

# Calculate effect size:
# this will be used to report the main effect of smartphone power only 
gng_ac_gng_loc_es = (EtaSq(gng_ac_gng_anova_cor$aov, type = 3, anova = FALSE))[2]

```

</details>  

###### Go/No-Go First: Results  

All results were based on $\alpha = 0.025$. Below are the reported statistics for the main and interaction effect of the simple two-way ANOVA, followed by a table depicting the ANOVA results.

**Main Effect of Smartphone Location**  

There was no significant main effect of smartphone location on reaction time, *F*(`r gng_ac_gng_t[2,2]`, `r gng_ac_gng_t[2,3]`) = `r round(gng_ac_gng_t[2,4], 2)`, *p* `r p_round(gng_ac_gng_t[2,5])`, $\eta^{2}$ `r p_round(gng_ac_gng_t[2,7])`.

**Main Effect of Smartphone Power**  

There was no significant main effect of smartphone power on reaction time, *F*(`r gng_ac_gng_t[3,2]`, `r gng_ac_gng_t[3,3]`) = `r round(gng_ac_gng_t[3,4], 2)`, *p* `r p_round(gng_ac_gng_t[3,5])`, $\eta^{2}$ `r p_round(gng_ac_gng_t[3,7])`.

**Interaction of Smartphone Location & Smartphone Power**  

There was a significant interaction between smartphone location and smartphone power on reaction time, *F*(`r gng_ac_gng_t[4,2]`, `r gng_ac_gng_t[4,3]`) = `r round(gng_ac_gng_t[4,4], 2)`, *p* `r p_round(gng_ac_gng_t[4,5])`, $\eta^{2}$ `r p_round(gng_ac_gng_t[4,7])`.

**Post-hoc Tests**  

Since there was a significant two-way interaction (i.e smartphone location and smartphone power), further analyses were completed *(see “Simple Main Effects: Smartphone Location vs. Smartphone Power (for the Go/No-Go First Balance Group)” below)*.


```{r}

# create a table with the anova results including the corrected values for the main effect of power (i.e. since it did not meet the homogeneity of variance assumption)
  # get the original anova data
  gng_ac_gng_t = gng_ac_gng_anova$ANOVA
    # since the corrected data does not have the SS, remove these columns from the original ANOVA table
    gng_ac_gng_t = dplyr::select(gng_ac_gng_t, -SSn, -SSd)
    # change the data for the main effect of location to corrected numbers
    gng_ac_gng_t[gng_ac_gng_t$Effect == "power",] = c(gng_ac_gng_anova_cor$ANOVA[3,], gng_ac_gng_loc_es)
    # re-define the alpha value based on the bonferroni correction
      # rename the "p<.05" column appropriately -- "p<.025"
      colnames(gng_ac_gng_t)[6] = "p<.025"
      # change values to fit the data
      gng_ac_gng_t$`p<.025` = ifelse(gng_ac_gng_t$p < 0.025, "*", "")
    
# show ANOVA results in kable table 
kable(gng_ac_gng_t, caption = "Go/No-Go First: Simple Two-Way Interaction on Reaction Time on the Go/No-Go Task", digits = 4, align = 'c') %>%
  kable_styling(bootstrap_options = "striped", "hover", full_width = F) 

```


##### Simple Main Effects: Smartphone Location vs. Smartphone Power (for the Go/No-Go First Balance Group)  
The simple two-way interaction showed that, when the Go/No-Go Task was completed first, there was a significant interaction between the smartphone conditions. In order to interpret this effect, simple main effects were completed investigating the effect of smartphone location at each level of smartphone power. Therefore, a simple one-way ANOVA for smartphone location was completed for each smartphone power condition separately. 

Two new data sets were created for the simple two-way interactions: (1) for the Power ON Data (*gng_ac_gng_ON_data*) and (2) for the Power OFF Data (*gng_ac_gng_OFF_data*).  


```{r}

# get a subset of the gng_ac_gng_data
  # for the ON power group only
  gng_ac_gng_ON_data = gng_ac_gng_data[gng_ac_gng_data$power == "ON", ]

  # for the OFF power group only
  gng_ac_gng_OFF_data = gng_ac_gng_data[gng_ac_gng_data$power == "OFF", ]

```


To correct the family-wise error rate, a Holm-Bonferroni adjustment was used when assessing the significance of main effects associated with these ANOVAs. The Holm-Bonferroni adjustment defined a corrected alpha level according to the following formula: 
$$\alpha_{corrected} = \frac{\alpha}{number of comparisons − rank of comparison + 1}$$

###### Powered ON: Descriptive Statistics 

The first simple main effect ANOVA was completed using the "Powered ON" subset of the Power Factor (i.e. participants who completed the Go/No-Go task first with their smartphones powered ON). The between-subjects factor was smartphone location (i.e. desk, pocket/bag or outside) and the dependent variable was accuracy (i.e. average reaction time).

There was a total of `r length(gng_ac_gng_ON_data$participant)` participants in the Powered ON analysis. The following table depicts the descriptive statistics: the mean (*M*), standard deviation (*SD*), minimum score (*min.*), maximum score (*max.*), and the sample size (*n*).  

```{r}

# create a data frame with the descriptives of all the data, grouped by the location & power conditions
  # this creates information for the 6 possible combined conditions
  gng_ac_gng_ON_des = psych::describeBy(gng_ac_gng_ON_data, gng_ac_gng_ON_data$location, mat = TRUE)  

# organize the data into a new data frame depicting the M, SD, min., max., and n.
  gng_ac_gng_ON_des_t = cbind(t(gng_ac_gng_ON_des["rt_av1", c("mean", "sd", "min", "max", "n")]),
                              t(gng_ac_gng_ON_des["rt_av2", c("mean", "sd", "min", "max", "n")]),
                              t(gng_ac_gng_ON_des["rt_av3", c("mean", "sd", "min", "max", "n")]))
    colnames(gng_ac_gng_ON_des_t) = c("On Desk", "In Pocket/Bag", "Outside")
    rownames(gng_ac_gng_ON_des_t) = c("M", "SD", "min.", "max.", "n")
  
# use kable to show the descriptives table
  kable(gng_ac_gng_ON_des_t, caption = "Powered ON Descriptive Statistics: Reaction Time (sec.) on the Go/No-Go Task", digits = 4, align = 'c') %>%
  kable_styling(bootstrap_options = "striped", "hover", full_width = F) %>%
  add_header_above(c(" ", "Smartphone Location" = 3))

```

###### Powered ON: Assumptions  

<details>  
<summary> The one-way simple main effect ANOVA made three assumptions. All assumptions were met. *(click to see details)*. </summary>  

<span style="color:DarkBlue; font-size:18px;">1. Independent Random Sampling</span>  

This assumption was met during testing.  
    
<span style="color:DarkBlue; font-size:18px;">2. Normality</span>  

This assumption was tested by visualizing the residuals, applying a Shapiro-Wilk test to the resdiuals, and observing the Skewness and Kurtosis of the data *(extracted from the ANOVA output).*  
    
```{r echo=TRUE, message=FALSE, warning=FALSE}

# run between-subjects ANOVA (IV: Smartphone Power & Location; DV: OSpan Abolute Score)
gng_ac_gng_ON_anova = ezANOVA(
  data = gng_ac_gng_ON_data
  , dv = .(rt_av)
  , wid = .(participant)
  , between = .(location)
  , type = 3 # unequal sample sizes, therfore use type 3
  , detailed = TRUE
  , return_aov = TRUE
  )

# calculate & extract the residuals from the ANOVA
gng_ac_gng_ON_res = data.frame("residuals" = proj(gng_ac_gng_ON_anova$aov)[,"Residuals"])   

```
    
- It appears that the residuals lie closely to a straight line, which suggests normality *(see visualization below)*.
    
```{r}

qqnorm(gng_ac_gng_ON_res$residuals, main = "Q-Q Plot of the Go/No-Go Reaction Time Residuals for the Powered ON Group") 
qqline(gng_ac_gng_ON_res$residuals)

```
    
- It appears that residuals tend to follow the shape of a normal distribution *(see visualization below)*.
    
```{r message=FALSE, warning=FALSE}

qplot(gng_ac_gng_ON_res$residuals, main = "Histogram of Go/No-Go Reaction Time Residuals for the Powered ON Group") + 
  theme_classic()

```
   
- The Shapiro-Wilk test of normality was not significant (based on $\alpha = 0.05$), *W* = `r round(as.numeric(gng_ac_gng_ON_shap$statistic["W"]), 2)`, *p* `r p_round(gng_ac_gng_ON_shap$p.value)` *(see code below)*.  
   
```{r}

gng_ac_gng_ON_shap = shapiro.test(gng_ac_gng_ON_res$residuals)
gng_ac_gng_ON_shap

```

- The data's Skewness and Kurtosis was examined and showed that both the Skewness (range: `r round(min(gng_ac_gng_ON_sk_t[1,]), 2)` to `r round(max(gng_ac_gng_ON_sk_t[1,]), 2)`) and Kurtosis (range: `r round(min(gng_ac_gng_ON_sk_t[2,]), 2)` to `r round(max(gng_ac_gng_ON_sk_t[2,]), 2)`) were within the acceptable range of $\pm 2.0$ and $\pm 9.0$, respectively (Schmider, Ziegler, Danay, Beyer, & Bühner, 2010) *(see table below)*.  

```{r}

# use the descriptives data -- gng_ac_gng_ON_des

# organize the data into a new data frame depicting the Skew, kurtosis, and sample size (n).
  gng_ac_gng_ON_sk_t = cbind(t(gng_ac_gng_ON_des["rt_av1", c("skew", "kurtosis", "n")]),
                             t(gng_ac_gng_ON_des["rt_av2", c("skew", "kurtosis", "n")]),
                             t(gng_ac_gng_ON_des["rt_av3", c("skew", "kurtosis", "n")]))
    colnames(gng_ac_gng_ON_sk_t) = c("On Desk", "In Pocket/Bag", "Outside")
    rownames(gng_ac_gng_ON_sk_t) = c("Skewness", "Kurtosis", "n")
    
# use kable to show the descriptives table
  kable(gng_ac_gng_ON_sk_t, caption = "Skewness and Kurtosis of the Reaction Time on the Go/No-Go Task for the Powered ON Group", digits = 4, align = 'c') %>%
  kable_styling(bootstrap_options = "striped", "hover", full_width = F) %>%
  add_header_above(c(" ", "Smartphone Location" = 3))

```

- Therefore, based on all the normality assumption checks, normality was assumed to be within the acceptable range for the analysis. Additionally, it should be noted that ANOVA analyses can be robust to violations of normality (Gardner & Tremblay, 2007).

<span style="color:DarkBlue; font-size:18px;">3. Homogeneity of Variance</span> 

This assumption was tested by conducting Levene's Test for Homogeneity of Variance for the between-subjects variable.  

- The homogeneity assumption was met (based on $\alpha = 0.05$) for smartphone location, *F*(`r gng_ac_gng_ON_lev_loc$Df[1]`, `r gng_ac_gng_ON_lev_loc$Df[2]`) = `r round(gng_ac_gng_ON_lev_loc[2][1,1], 2)`, *p* `r p_round(gng_ac_gng_ON_lev_loc[3][1,1])` *(see code below)*.

```{r}

# conduct levene's
  # for location
  gng_ac_gng_ON_lev_loc = leveneTest(data = gng_ac_gng_ON_data, rt_av ~ as.factor(location), center = median)
  gng_ac_gng_ON_lev_loc

```

</details>  

###### Powered OFF: Descriptive Statistics 

The second simple main effect ANOVA was completed using the "Powered OFF" subset of the Power Factor (i.e. participants who completed the Go/No-Go task first with their smartphones powered OFF). The between-subjects factor was smartphone location (i.e. desk, pocket/bag or outside) and the dependent variable was accuracy (i.e. average reaction time).

There was a total of `r length(gng_ac_gng_ON_data$participant)` participants in the Powered ON analysis. The following table depicts the descriptive statistics: the mean (*M*), standard deviation (*SD*), minimum score (*min.*), maximum score (*max.*), and the sample size (*n*).  

```{r}

# create a data frame with the descriptives of all the data, grouped by the location & power conditions
  # this creates information for the 6 possible combined conditions
  gng_ac_gng_OFF_des = psych::describeBy(gng_ac_gng_OFF_data, gng_ac_gng_OFF_data$location, mat = TRUE)  

# organize the data into a new data frame depicting the M, SD, min., max., and n.
  gng_ac_gng_OFF_des_t = cbind(t(gng_ac_gng_OFF_des["rt_av1", c("mean", "sd", "min", "max", "n")]),
                              t(gng_ac_gng_OFF_des["rt_av2", c("mean", "sd", "min", "max", "n")]),
                              t(gng_ac_gng_OFF_des["rt_av3", c("mean", "sd", "min", "max", "n")]))
    colnames(gng_ac_gng_OFF_des_t) = c("On Desk", "In Pocket/Bag", "Outside")
    rownames(gng_ac_gng_OFF_des_t) = c("M", "SD", "min.", "max.", "n")
  
# use kable to show the descriptives table
  kable(gng_ac_gng_OFF_des_t, caption = "Powered OFF Descriptive Statistics: Reaction Time (sec.) on the Go/No-Go Task", digits = 4, align = 'c') %>%
  kable_styling(bootstrap_options = "striped", "hover", full_width = F) %>%
  add_header_above(c(" ", "Smartphone Location" = 3))

```

###### Powered ON: Assumptions  

<details>  
<summary> The one-way simple main effect ANOVA made three assumptions. All assumptions were met. *(click to see details)*. </summary>  

<span style="color:DarkBlue; font-size:18px;">1. Independent Random Sampling</span>  

This assumption was met during testing.  
    
<span style="color:DarkBlue; font-size:18px;">2. Normality</span>  

This assumption was tested by visualizing the residuals, applying a Shapiro-Wilk test to the resdiuals, and observing the Skewness and Kurtosis of the data *(extracted from the ANOVA output).*  
    
```{r echo=TRUE, message=FALSE, warning=FALSE}

# run between-subjects ANOVA (IV: Smartphone Power & Location; DV: OSpan Abolute Score)
gng_ac_gng_OFF_anova = ezANOVA(
  data = gng_ac_gng_OFF_data
  , dv = .(rt_av)
  , wid = .(participant)
  , between = .(location)
  , type = 3 # unequal sample sizes, therfore use type 3
  , detailed = TRUE
  , return_aov = TRUE
  )

# calculate & extract the residuals from the ANOVA
gng_ac_gng_OFF_res = data.frame("residuals" = proj(gng_ac_gng_OFF_anova$aov)[,"Residuals"])   

```
    
- It appears that the residuals lie closely to a straight line, which suggests normality *(see visualization below)*.
    
```{r}

qqnorm(gng_ac_gng_OFF_res$residuals, main = "Q-Q Plot of the Go/No-Go Reaction Time Residuals for the Powered OFF Group") 
qqline(gng_ac_gng_OFF_res$residuals)

```
    
- It appears that residuals tend to follow the shape of a normal distribution *(see visualization below)*.
    
```{r message=FALSE, warning=FALSE}

qplot(gng_ac_gng_OFF_res$residuals, main = "Histogram of Go/No-Go Reaction Time Residuals for the Powered OFF Group") + 
  theme_classic()

```
   
- The Shapiro-Wilk test of normality was significant (based on $\alpha = 0.05$), *W* = `r round(as.numeric(gng_ac_gng_OFF_shap$statistic["W"]), 2)`, *p* `r p_round(gng_ac_gng_OFF_shap$p.value)` *(see code below)*.  
   
```{r}

gng_ac_gng_OFF_shap = shapiro.test(gng_ac_gng_OFF_res$residuals)
gng_ac_gng_OFF_shap

```

- The data's Skewness and Kurtosis was examined and showed that both the Skewness (range: `r round(min(gng_ac_gng_OFF_sk_t[1,]), 2)` to `r round(max(gng_ac_gng_OFF_sk_t[1,]), 2)`) and Kurtosis (range: `r round(min(gng_ac_gng_OFF_sk_t[2,]), 2)` to `r round(max(gng_ac_gng_OFF_sk_t[2,]), 2)`) were within the acceptable range of $\pm 2.0$ and $\pm 9.0$, respectively (Schmider, Ziegler, Danay, Beyer, & Bühner, 2010) *(see table below)*.  

```{r}

# use the descriptives data -- gng_ac_gng_OFF_des

# organize the data into a new data frame depicting the Skew, kurtosis, and sample size (n).
  gng_ac_gng_OFF_sk_t = cbind(t(gng_ac_gng_OFF_des["rt_av1", c("skew", "kurtosis", "n")]),
                             t(gng_ac_gng_OFF_des["rt_av2", c("skew", "kurtosis", "n")]),
                             t(gng_ac_gng_OFF_des["rt_av3", c("skew", "kurtosis", "n")]))
    colnames(gng_ac_gng_OFF_sk_t) = c("On Desk", "In Pocket/Bag", "Outside")
    rownames(gng_ac_gng_OFF_sk_t) = c("Skewness", "Kurtosis", "n")
    
# use kable to show the descriptives table
  kable(gng_ac_gng_OFF_sk_t, caption = "Skewness and Kurtosis of the Reaction Time on the Go/No-Go Task for the Powered OFF Group", digits = 4, align = 'c') %>%
  kable_styling(bootstrap_options = "striped", "hover", full_width = F) %>%
  add_header_above(c(" ", "Smartphone Location" = 3))

```

- Therefore, based on all the normality assumption checks, normality was assumed to be within the acceptable range for the analysis. Additionally, it should be noted that ANOVA analyses can be robust to violations of normality (Gardner & Tremblay, 2007).

<span style="color:DarkBlue; font-size:18px;">3. Homogeneity of Variance</span> 

This assumption was tested by conducting Levene's Test for Homogeneity of Variance for the between-subjects variable.  

- The homogeneity assumption was met (based on $\alpha = 0.05$) for smartphone location, *F*(`r gng_ac_gng_OFF_lev_loc$Df[1]`, `r gng_ac_gng_OFF_lev_loc$Df[2]`) = `r round(gng_ac_gng_OFF_lev_loc[2][1,1], 2)`, *p* `r p_round(gng_ac_gng_OFF_lev_loc[3][1,1])` *(see code below)*.

```{r}

# conduct levene's
  # for location
  gng_ac_gng_OFF_lev_loc = leveneTest(data = gng_ac_gng_OFF_data, rt_av ~ as.factor(location), center = median)
  gng_ac_gng_OFF_lev_loc

```

</details>  

###### Simple Main Effects: Results  

All results were based on a Holm-Bonferroni adjustment applied to the original p-values. Below are the reported statistics for both analyses (i.e. powered ON and OFF), followed by a table depicting the ANOVA results.

**Main Effect of Smartphone Location in the Powered ON Condition**

There was no significant main effect of smartphone location on reaction time for those in the powered OFF condition, *F*(`r gng_ac_gng_POW_t[1,2]`, `r gng_ac_gng_POW_t[1,3]`) = `r round(gng_ac_gng_POW_t[1,6], 2)`, *p* `r p_round(gng_ac_gng_POW_t[1,7])`, $\eta^{2}$ `r p_round(gng_ac_gng_POW_t[1,9])`.

**Main Effect of Smartphone Location in the Powered OFF Condition**

There was no significant main effect of smartphone location on reaction time for those in the powered OFF condition, *F*(`r gng_ac_gng_POW_t[2,2]`, `r gng_ac_gng_POW_t[2,3]`) = `r round(gng_ac_gng_POW_t[2,6], 2)`, *p* `r p_round(gng_ac_gng_POW_t[2,7])`, $\eta^{2}$ `r p_round(gng_ac_gng_POW_t[2,9])`.

**Post-hoc Tests**  

Since there was no significant main or interaction effects, no post-hocs were completed.


```{r}

# create a table with the anova results including the adjusted p-values 
  # create a table with both power condition data
  gng_ac_gng_POW_t = rbind(gng_ac_gng_ON_anova$ANOVA[gng_ac_gng_ON_anova$ANOVA$Effect == "location",], 
                           gng_ac_gng_OFF_anova$ANOVA[gng_ac_gng_OFF_anova$ANOVA$Effect == "location",])
  # replace the p-values with the adjusted values (Holm-Bonferroni adjustment)
  gng_ac_gng_POW_t$p = p.adjust(c(gng_ac_gng_ON_anova$ANOVA$p[2], gng_ac_gng_OFF_anova$ANOVA$p[2]), method = "holm")
  # adjust the p<.05 column to reflect the new data (if there was any change)
  gng_ac_gng_POW_t$`p<.05` = ifelse(gng_ac_gng_POW_t$p < 0.05, "*", "")
  # replace the names in the effects column with "ON" & "OFF"
  gng_ac_gng_POW_t$Effect = c("ON", "OFF")
  # rename the columns
  colnames(gng_ac_gng_POW_t) = c("Smartphone Power Condition", "DFn", "DFd", "SSn", "SSd", "F", "p", "p<.05", "ges")
  # remove the row names
  rownames(gng_ac_gng_POW_t) = NULL

# show ANOVA results in kable table 
kable(gng_ac_gng_POW_t, caption = "Simple Main Effects Analyses: Reaction Time on the Go/No-Go Task for the Go/No-Go First Balance Group", digits = 4, align = 'c', escape = FALSE) %>%
  kable_styling(bootstrap_options = "striped", "hover", full_width = F) 

```


##### Visualizations

The following shows a visulization of Accuracy (i.e. reaction time) on the Cue-Dependent Go/No-go Task. There was a significant three-way interaction in the main ANOVA (i.e. between smartphone location, smartphone power, and balance). Additionally, the main factors of interest were the smartphone conditions (i.e. location and power). Therefore, two visualizations are shown depicting the smartphone conditions, one for each of the levels of the balance factor.  

```{r echo=FALSE, message=FALSE, warning=FALSE}

#--For the "OSpan First" Balance Condition
# use the data subsets from the post-hoc analyses (i.e. the simple two-way interactions) -- gng_ac_ospan_data  
gng_ac_bar_ospan_data = select(gng_ac_ospan_data, -participant, -balance)
  # remove row names
  rownames(gng_ac_bar_ospan_data) = NULL  
       
# plot bar graph - GOOD
gng_ac_bar_ospan = 
  ggplot(gng_ac_bar_ospan_data, aes(x = location, y = rt_av, fill = power), show.legend = FALSE) +
  geom_bar(stat = "summary", fun.y = "mean", position = "dodge") + 
    guides(shape = FALSE) + 
  scale_fill_manual(values = c("cadetblue3", "deepskyblue4")) + 
  labs(title = "Performance on the Go/No-Go Task \nfor the OSpan First Balance Condition",
       subtitle = "Accuracy (average reaction time)",
       x = "Smartphone Location", 
       y = "Average Reaction Time (sec.)", 
       fill = "Smartphone Power",
       caption = "Note: ...") +
        # the "\n" here denote that you want a new line formed in the text
  geom_point(stat = "identity", aes(color = power), size = 1, 
             position = position_jitterdodge(jitter.width = .53), show.legend = FALSE) +
  scale_color_manual(values = c("deepskyblue4", "purple4")) + 
  geom_point(stat="summary", fun.y="mean", position = position_dodge(.9), size = 1.5, show.legend = FALSE) +
  geom_errorbar(data = gng_ac_bar_ospan_data, stat = "summary", fun.ymin = function(x) mean(x) - sd(x)/sqrt(length(x)), 
                fun.ymax = function(x) mean(x) + sd(x)/sqrt(length(x)), size= .5, width= .25, position = position_dodge(.9)) +
  theme_classic() +
  theme(plot.title = element_text(color = "black", size = 14, face = "bold"), 
        plot.subtitle = element_text(color = "black", size = 13), 
        plot.caption = element_text(hjust = 0, size = 12, face = "italic"),
        text = element_text(size = 13)) #+
  #scale_y_continuous(limits = c(min(gng_ac_bar_ospan_data$rt_av), max(gng_ac_bar_ospan_data$rt_av)))

#--For the "Go/No-Go First" Balance Condition
# use the data subsets from the post-hoc analyses (i.e. the simple two-way interactions) -- gng_ac_gng_data  
gng_ac_bar_gng_data = select(gng_ac_gng_data, -participant, -balance)
  # remove row names
  rownames(gng_ac_bar_gng_data) = NULL  
       
# plot bar graph - GOOD
gng_ac_bar_gng = 
  ggplot(gng_ac_bar_gng_data, aes(x = location, y = rt_av, fill = power), show.legend = FALSE) +
  geom_bar(stat = "summary", fun.y = "mean", position = "dodge") + 
    guides(shape = FALSE) + 
  scale_fill_manual(values = c("cadetblue3", "deepskyblue4")) + 
  labs(title = "Performance on the Go/No-Go Task \nfor the Go/No-Go First Balance Condition",
       subtitle = "Accuracy (average reaction time)",
       x = "Smartphone Location", 
       y = "Average Reaction Time (sec.)", 
       fill = "Smartphone Power",
       caption = "Note: ...") +
        # the "\n" here denote that you want a new line formed in the text
  geom_point(stat = "identity", aes(color = power), size = 1, 
             position = position_jitterdodge(jitter.width = .53), show.legend = FALSE) +
  scale_color_manual(values = c("deepskyblue4", "purple4")) + 
  geom_point(stat="summary", fun.y="mean", position = position_dodge(.9), size = 1.5, show.legend = FALSE) +
  geom_errorbar(data = gng_ac_bar_gng_data, stat = "summary", fun.ymin = function(x) mean(x) - sd(x)/sqrt(length(x)), 
                fun.ymax = function(x) mean(x) + sd(x)/sqrt(length(x)), size= .5, width= .25, position = position_dodge(.9)) +
  theme_classic() +
  theme(plot.title = element_text(color = "black", size = 14, face = "bold"), 
        plot.subtitle = element_text(color = "black", size = 13), 
        plot.caption = element_text(hjust = 0, size = 12, face = "italic"),
        text = element_text(size = 13)) #+
  #scale_y_continuous(limits = c(min(gng_ac_bar_gng_data$rt_av), max(gng_ac_bar_gng_data$rt_av)))

# show the figures
  # for the OSpan First Balance Condition
  gng_ac_bar_ospan
  # for the Go/No-Go First Balance Condition
  gng_ac_bar_gng

```

## Testing the Moderation Hypothesis:  

### SAD  

#### Factor Analysis  

To examine the underlying factors measured by the Smartphone Attachment and Dependecy Inventory (SAD) from Ward et al. (2017), a principal components factor analysis with a Varimax rotation was completed on participant's SAD data. The analysis was done exploratorily and was compared to Ward et al.'s findings, where they found two factors, smarphone dependence (i.e. six items; items #1-6; 31.02% of the variance) and emotional attachment (i.e. five items; items #8-11; 21.65% of the variance), which explained 52.67% of the total variance. It should be noted that two items (i.e. item #7 and 13) did not strongly load onto a factor and were excluded from subsequent analyses.

To perform the principal components factor analysis, we first examined the data for (1) missing data, (2) correlations between the items, and (3) the factorability of the data.

1. Missing Data

  - From the descriptives table, it seems that there is no missing or incomplete data. This was confirmed by finding the complete cases, which were equal to the total sample size (i.e. `r fa_complete`). Therefore, no data was removed due to missing values for this analysis *(see descriptives table below)*.
    
```{r}

# get a data frame with the SAD items only for the factor analysis
fa_data = dplyr::select(factor_analysis, SAD_1:SAD_13)

# find the total complete cases to find any missing data from the data -- any participant with missing data must be removed
fa_complete = sum(complete.cases(fa_data))

# Show descriptives for the factor analysis data
fa_des = describe(fa_data)

# show descriptives in kable table 
kable(fa_des, caption = "Descriptives for Items in the SAD", digits = 4, align = 'c', escape = FALSE) %>%
  kable_styling(bootstrap_options = "striped", "hover", full_width = F) 

```


2. Correlations between the Items

  - The correlations between the items were examined to determine if they were either not high enough (i.e., r < .30) or too high (i.e., r > .80). 
    - The correlations for the 13 items in the SAD ranged from *r*(`r fa_cor$n[1,1] - 2`) `r p_round(min(fa_cor$r[which(fa_cor$r < 1)]))` to *r*(`r fa_cor$n[1,1] - 2`) `r p_round(max(fa_cor$r[which(fa_cor$r < 1)]))`, *p* `r p_round(max(fa_cor$P[which(!is.na(fa_cor$P))]))`.
    - A visual inspection of the correlations showed that items 12 and 13 might not be correlated enough with the rest of the items (i.e. `r length(which(fa_cor$r[,12] < .30))` and `r length(which(fa_cor$r[,13] < .30))` correlations below .30, respectively). There were no issues associated with too many high correations for any items. 
    - The lower correlations for items 12 and 13 should be kept in mind when completing the factor analysis and these items may be removed.  
  - To check for multicollinearity, the determinant of the correlation matrix was calculated. The determinant was greater than the necessary value of 0.00001 (i.e., `r round(fa_det, 5)`), so, there was no issues with multicollinearity (Field et al., 2012).
  - Therefore, all items in the SAD correlated reasonably with each other and without an excessively large correalation so no items were removed at this stage.

```{r}

# get the pearson correlations between all items (and corresponding p-values)
fa_cor = rcorr(as.matrix(fa_data), type = "pearson") 

# make a table with the data to display it better
fa_cor_t = as.data.frame(fa_cor$r)
  # include only the lower triangle of the data
  fa_cor_t[upper.tri(fa_cor_t)] = NA
  
# find the determinant of the correlation matrix
fa_det = det(fa_cor$r)

# Print table using kable
  # make any NAs appear as blank within kable
  opts = options(knitr.kable.NA = "")
  # print table
  kable(fa_cor_t, caption = "Pearson Correlations between Items in the SAD", digits = 4, align = 'c', escape = FALSE) %>%
    kable_styling(bootstrap_options = "striped", "hover", full_width = F) %>%
    add_footnote("p < .001", notation = "alphabet")
    # restore the original options so that "NA" shows up again
    options(opts)

```
 
  - Another way to visualize this is using a Scatter Plot Matrix
    - This shows: (1) a scatter plot bellow the lower diagonal, with a line of best fit (red); (2) a historgram and density plot of the item on the diagonal; and (3) the correlation values (pearson r) above the diagonal, with corresponding significance shown with asterisks. The x-axis in each scatter plot represents the column variable and the y-axis the row variable. 

```{r}
# show a SPLOM (Scatter Plot Matrix) of the Data 
pairs.panels(fa_data, pch = ".", density = TRUE, ellipses = FALSE, lm = TRUE, digits = 2, method = "pearson", jiggle = TRUE, factor = 1, hist.col = "cadetblue3", stars = TRUE, gap = .5, verOdd = TRUE, horOdd = TRUE, main = "Scatter Plot Matrix of Items in the SAD")

``` 

3. Factorability of the Data

  - This assessed if there were meaningful latent factors to be found within the data, which was assessed in two ways: Barlett's test of sphericity and the Kaiser-Meyer-Olkin measure of sampling adequacy.
    - Barlett's test of sphericity:
      - This test was statistically significant, which suggested good factorability, $\chi^{2}$(`r fa_bartest$df`) = `r round(fa_bartest$chisq, 2)`, *p* `r p_round(fa_bartest$p.value)`
    - Kaiser-Meyer-Olkin measure of sampling adequacy:
      - This measure showed an overall measure of sampling adequacy (MSA) and an MSA per item. The overall MSA was higher than both the minimum and preferred criterias (0.50 and 0.60, respectively; Kaiser, 1974, Hutcheson & Sofroniou, 1999), $MSA_{overall}$ = `r round(fa_kmo$MSA, 2)`. Each item's MSA was also higher than the minimum and preferred criteria, $MSA_{1-13}$ = `r min(round(fa_kmo$MSAi, 2))` - `r max(round(fa_kmo$MSAi, 2))`.
  - Therefore, both factorability tests confirmed that the planned factor analysis was appropriate. 
    
```{r message=FALSE, warning=FALSE}
# run the Barlett Test
fa_bartest = cortest.bartlett(fa_data)
fa_bartest

# run the KMO 
fa_kmo = KMO(fa_data)
fa_kmo

```

Now that factorability was confirmed, we found the appropriate number of factors to extract. This was done using: (1) a parallel analysis scree plot, (2) eigen values (i.e., from the parallel analysis), and (3) the factor structure between the two solutions (i.e., using factor diagrams).

1. Parralel Analysis Scree Plot. 

This plotted the eigenvalues from the data and a parrallel analysis which generated random correlation matrices. The number of factors is then determined by comparing the resulting eigenvalues to the observed data, that is, observed eigenvalues that were higher than the corresponding random eigenvalues were more likely to form meaningful factors.

- The parrallel analysis scree plot showed that there are `r fa_par$ncomp` appropriate factors (i.e. using the principal components extraction method).
  - From closer inspection of the plot, the last factor was very close to the random eigenvalue cut-off line, therefore, the 2 and 3 factor solutions were compared to determine the best fit (i.e., the eigen values and factor structures were examined). 

```{r}

fa_par = fa.parallel(fa_data, fa = "pc", main = "Parallel Analysis Scree Plot for Items in the SAD")

```

2. Eigen Values

For a sample size greater than 250 (i.e., `r fa_complete` complete cases) the average communalities (i.e., the proportion of common variance within the items) for the two factor (`r round(mean(fa_two_v$communality), 2)`) and three factor (`r round(mean(fa_three_v$communality), 2)`) solutions were wihtin the acceptable value (i.e., 0.60; Field et al., 2012). Therefore, Kaiser's criterion (i.e., a meaningful factor has an eigen value greater than 1) was used for the eigen values. It should be noted that although the average communalities were not very high, the scree plot and factor diagrams were used to choose the appropriate number of factors.  

- The eigen values (unrotated solution) showed three meaningful factors (i.e., `r round(fa_par$pc.values[1:3], 2)`), which *supported the three-factor solution* as a better representation of the data.  

```{r}
# Eigen values
  # view eigen values from the parallel analysis scree plot
  round(fa_par$pc.values, 3)
```

3. Factor Diagrams

A factor diagram for the two- and three-factor solutions (rotated solution) was used to determine which solution represents the most interpretable solution. The diagrams show the factor loadings for each item on the given factor for which they load the highest on. The items were ordered (i.e., descending) based on these loadings.  
  *Note: The factor diagrams are based on the rotated solutions to better distribute the factor loadings*
  
- The factor diagrams showed that the two factor solution resulted in most items loading onto only one factor and was not easily interpreted. The three factor solution showed a better distribution and was more interpretable.   

Therefore, the **three factor solution** was chosen to better represent the data. 

```{r}
##-- VARIMAX ROTATION  
# run a factor analysis to compare the two- and three-factor solutions (no rotation)
  # 2 factor solution
  fa_two_v = principal(fa_data, nfactors = 2, rotate = "varimax", scores = T)
    # change name in loadings for clarity when visualizing the model
    colnames(fa_two_v$loadings) = c("Factor 1", "Factor 2")
  
  # 3 factor solution
  fa_three_v = principal(fa_data, nfactors = 3, rotate = "varimax", scores = T)
    # change name in loadings for clarity when visualizing the model
    colnames(fa_three_v$loadings) = c("Factor 1", "Factor 2", "Factor 3")

# Factor Diagram
  # 2 factor solution
  fa.diagram(fa_two_v, main = "Two Factor Solution for items in the SAD (Varimax Rotation)", sort = TRUE, adj = 1, digits = 2)

  # 3 factor solution
  fa.diagram(fa_three_v, main = "Three Factor Solution for Items in the SAD (Varimax Rotation)", sort = TRUE, adj = 1, digits = 2)
  
  
##-- OBLIMIN ROTATION  
# run a factor analysis to compare the two- and three-factor solutions (no rotation)
  # 2 factor solution
  fa_two_o = principal(fa_data, nfactors = 2, rotate = "oblimin", scores = T)
    # change name in loadings for clarity when visualizing the model
    colnames(fa_two_o$loadings) = c("Factor 1", "Factor 2")
  
  # 3 factor solution
  fa_three_o = principal(fa_data, nfactors = 3, rotate = "oblimin", scores = T)
    # change name in loadings for clarity when visualizing the model
    colnames(fa_three_o$loadings) = c("Factor 1", "Factor 2", "Factor 3")

# Factor Diagram
  # 2 factor solution
  fa.diagram(fa_two_o, main = "Two Factor Solution for items in the SAD (Oblimin Rotation)", sort = TRUE, adj = 1, digits = 2)

  # 3 factor solution
  fa.diagram(fa_three_o, main = "Three Factor Solution for Items in the SAD (Oblimin Rotation)", sort = TRUE, adj = 1, digits = 2)
```

Now that we determined the number of factors (i.e., three), we observed the **communalities.** Each item's communality was the percentage of variance that was explained by the retained factors for each item. These ranged from `r percent(min(fa_com))` to `r percent(max(fa_com))`, with an average of `r percent(mean(fa_com[,1]))` (see table for more details). 

```{r}
# get the communalities from the model
fa_com = as.data.frame(fa_three_v$communality)

# create matrix to present communalities in a table
fa_com_t = rbind((fa_com[,1])*100)
  # rename row and column names
  colnames(fa_com_t) = rownames(fa_com)
  rownames(fa_com_t) = NULL

# Print table using kable
  # print table
  kable(fa_com_t, caption = "Communalities for the Three-Factor Solution for Items in the SAD (%)", digits = 4, align = 'c', escape = FALSE) %>%
    kable_styling(bootstrap_options = "striped", "hover", full_width = F) %>%
    add_footnote("Communalities show the percentage of variance explained by each factor for each item on the SAD.", notation = "alphabet")
```

Next, we looked at the **percentage of variance accounted for** by using the eigenvalues (compute the variance assosiated with the factors). This was done for both the unrotated and rotated (i.e., varimax) three-factor solution. As you can see in the table below, a total of `r round(sum(var_per),2)`% of variance was accounted for by the three extracted factors. This variance was mostly accounted for by factor one in the unrotated solution, but was more equally distributed after rotation (see table for more details).

```{r}

# Run an unrotated three-factor solution
  fa_three = principal(fa_data, nfactors = 3, rotate = "none", scores = T) 
    # change name in loadings for clarity when visualizing the model
    colnames(fa_three$loadings) = c("Factor 1", "Factor 2", "Factor 3") 
    
  
# To find the variace accounted for, you can either (1) calculate it from the eigen values for each factor, or (2) grab the calculated values from the model 
  # Both are shown below, but one is commented out to avoid creating unessesary variables
  # (1) Calculated 
    # For the unrotated solution
    #var_per = 100*fa_three$values[1:3]/length(fa_three$values)
    
    # For the rotated solution
    #var_per_v = 100*(as.numeric(fa_three_v$Vaccounted[1,]))/length(fa_three_v$values)
      
  # (2) The information is found in the "Vaccounted" table from the output, where "SS loadings" (row 1) shows the eigen values and "Proportion Var" (row 2) shows the variance accounted for.
    # For the unrotated solution
    var_per = 100*as.numeric(fa_three$Vaccounted[2,])
    
    # For the rotated solution
    var_per_v = 100*as.numeric(fa_three_v$Vaccounted[2,])

# Create & Show a table with eigen values, percent var explained, and total percent var explained for unrotated & rotated three-factor solutions
    # unrotated solution
    var_per_t = as.data.frame(cbind(rbind(as.numeric(fa_three$Vaccounted[1,]), var_per), 
                                    rbind(as.numeric(fa_three_v$Vaccounted[1,]), var_per_v)))
    # name the columns & rows accordingly
    colnames(var_per_t) = c(rep(c("Factor 1", "Factor 2", "Factor 3"),2))
    rownames(var_per_t) = c("Eigen Values", "Variance Explained (%)")

  # use kable to show the table
  kable(var_per_t, caption = "Eigen Values and Percent of Variance Explained for the Unrotated and Rotated (varimax) for the Three-Factor Solution for Items in the SAD", digits = 4, align = 'c') %>%
  kable_styling(bootstrap_options = "striped", "hover", full_width = T) %>%
  add_header_above(c(" ", "Unrotated" = 3, "Varimax Rotation" = 3))

```

Lastly, we looked at the **factor loadings** of the three-factor solution. Below you can see the rotated (i.e., varimax) and unrotated facter loadings. The loadings are shown with a cut-off of 0.40 and rounded to three decimal places. As we saw in tha factor diagrams, the unrotated factor loadings are difficult to interpret: all items loaded strongly onto factor 1 (i.e., loading was greater than 0.40). The roatated factor loadings were more easy to interpret, with only one item loading onto more than one factor (i.e., item six; see below for more details).

```{r}

# Unrotated Factor Loadings 
print(fa_three$loadings, cutoff = 0.40, digits = 3)

# Rotated (i.e., varimax) Factor Loadings 
print(fa_three_v$loadings, cutoff = 0.40, digits = 3)

```

In order to validate the SAD, we completed a reliability analysis (i.e., is the scale consistent) by looking at the split-half reliability using Cronbach's alpha. This method is the most common measure of scale reliability (Field et al., 2012): the data for each participant is randomly split into two sets, which forms a "half" split for the scale. Then, a correlation is calculated between these sets to determine how similar they are: higher reliability would be seen with highly correlated sets. Since the way that the data is split can impact the correlations, Cronbach's alpha represents an average of all the possible splits using the following equation:

$$\alpha =  \frac{N^2 \overline{Cov}}{\sum s_{item}^2 + \sum Cov_{item}} $$
To obtain the best reliability measure, we must divide our data into any subscales, which in this case, are our three factors from our factor analysis. Therefore, we split the data into three sets for the reliability analysis.

```{r}

# split the data respective to the 3 factors from the FA to perform the reliability analysis
  # factor 1 (items: 1, 2, 3, 4, 5)
  f1_data = fa_data[,c(1, 2, 3, 4, 5)]
  # factor 2 (items: 8, 9, 10, 11)
  f2_data = fa_data[,c(8, 9, 10, 11)]
  # factor 3 (items: 7, 12, 13)
  f3_data = fa_data[,c(7, 12, 13)]

```

Once we have our subscales, we complete the reliability analysis on each subscale.

```{r}

# compute reliability analysis for each subscale using "alpha()" function in "psych" package
  # factor 1
  f1_rel = psych::alpha(f1_data)
  # factor 2
  f2_rel = psych::alpha(f2_data)
  # factor 3
  f3_rel = psych::alpha(f3_data)

```

To view the analysis, we broke it down into three compoenents: (1) The alpha values and overall mean and standard deviation of the subscales, (2) The alpha values if an item was dropped (i.e., Alpha Drop Values), and (3) Item statistics. 

(1) Alpha Values

- The alpha values from the reliability analyses included Cronbach's $\alpha$ (i.e., based on the covariances), a standardized $\alpha$ (i.e., based on the correlations), and Guttman's Lambda 6. Additionally, the overall average correlation, mean, and standard deviation for the subscale is shown (see table below for more details).
- As seen in the table, Cronbach's $\alpha$ was within the acceptable range (i.e., greater than .70; Kline, 1999) for Factor 1 ($\alpha_C$ = `r round(fa_rel_alphas$raw_alpha[1], 2)`) and Factor 2 ($\alpha_C$ = `r round(fa_rel_alphas$raw_alpha[2], 2)`), but not for Factor 3 ($\alpha_C$ = `r round(fa_rel_alphas$raw_alpha[3], 2)`). The overall correlations for Factor 1, *r*(`r f1_rel$item.stats[1, "n"] - 2`) `r p_round(fa_rel_alphas$average_r[1])`, and Factor 2, *r*(`r f2_rel$item.stats[1, "n"] - 2`) `r p_round(fa_rel_alphas$average_r[2])`, were moderate, whereas Factor 3's correlation, *r*(`r f3_rel$item.stats[1, "n"] - 2`) `r p_round(fa_rel_alphas$average_r[3])`, was weak-moderate. These statistics were further explored with dropped items. 

```{r}
# show the alpha values from the 3 subscales
  # create data frame with the information from the 3 reliability analyses
  fa_rel_alphas = as.data.frame(rbind("Factor 1" = dplyr::select(f1_rel$total, -c("S/N", "ase", "median_r")), 
                                      "Factor 2" = dplyr::select(f2_rel$total, -c("S/N", "ase", "median_r")), 
                                      "Factor 3" = dplyr::select(f3_rel$total, -c("S/N", "ase", "median_r"))))
  # add an extra column for the "overall" or average alpha
  fa_rel_alphas = cbind("Overall alpha" = c(mean(as.numeric(dplyr::select(f1_rel$total, c("raw_alpha", "std.alpha")))), 
                                            mean(as.numeric(dplyr::select(f2_rel$total, c("raw_alpha", "std.alpha")))), 
                                            mean(as.numeric(dplyr::select(f3_rel$total, c("raw_alpha", "std.alpha"))))), 
                        fa_rel_alphas)

  # use kable to show the table
  kable(fa_rel_alphas, caption = "Reliability Analysis for Subscales in the SAD: Alpha Values", digits = 4, align = 'c', 
        col.names = c("Overall alpha", "Cronbach's alpha", "Standardized alpha", "Guttman's lambda 6", "Average r", "M", "SD")) %>%
  kable_styling(bootstrap_options = "striped", "hover", full_width = T)
  
```

(2) Alpha Drop Values 

- To further explore the reliability of each subscale, we looked at the alpha values after removing each item. This allowed us to determine whether a given item was reducing the reliability of the subscale. That is, if dropping an item increased $\alpha$ or the correlation, the removing the item from the analysis should be considered. The data presented in the table below shows Cronbach's $\alpha$, standardized $\alpha$, Guttman's Lambda 6, and the average correlation.
- As seen in the table below, dropping an item did not improve reliability for Factor 1 (range of $\alpha_C$ = `r round(min(fa_rel_adrop$raw_alpha[1:5]), 2)` - `r round(max(fa_rel_adrop$raw_alpha[1:5]), 2)`) or Factor 2 (range of $\alpha_C$ = `r round(min(fa_rel_adrop$raw_alpha[6:9]), 2)` - `r round(max(fa_rel_adrop$raw_alpha[6:9]), 2)`). However, removing item #7 did improve reliability for Factor 3 ($\alpha_{C-drop7}$ = `r round(fa_rel_adrop$raw_alpha[10], 2)`). With respect to the average correlation, dropping an item did not improve for Factor 2, *r*(`r f2_rel$item.stats[1, "n"] - 2`) `r p_round(min(fa_rel_adrop$average_r[6:9]))` - `r p_round(max(fa_rel_adrop$average_r[6:9]))`. However, item #5 for Factor 1, *r*(`r f1_rel$item.stats[1, "n"] - 2`) `r p_round(min(fa_rel_adrop$average_r[5]))`, and item #7 for Factor 3, *r*(`r f3_rel$item.stats[1, "n"] - 2`) `r p_round(min(fa_rel_adrop$average_r[10]))`, did show a some improvement in average correlation. This was explored further with the items statistics.

```{r}
# show the alpha drop values from the 3 subscales
  # create data frame with the information
  fa_rel_adrop = as.data.frame(rbind(dplyr::select(f1_rel$alpha.drop, c("raw_alpha", "std.alpha", "G6(smc)", "average_r")), 
                                     dplyr::select(f2_rel$alpha.drop, c("raw_alpha", "std.alpha", "G6(smc)", "average_r")), 
                                     dplyr::select(f3_rel$alpha.drop, c("raw_alpha", "std.alpha", "G6(smc)", "average_r"))))

  # use kable to show the table
  kable(fa_rel_adrop, caption = "Reliability Analysis for Subscales in the SAD: Alpha Drop Values", digits = 4, align = 'c', 
        col.names = c("Cronbach's alpha", "Standardized alpha", "Guttman's lambda 6", "Average r")) %>%
  kable_styling(bootstrap_options = "striped", "hover", full_width = T) %>%
  pack_rows(index = c("Factor 1" = ncol(f1_data), "Factor 2" = ncol(f2_data), "Factor 3" = ncol(f3_data)))

```

(3) Item Statistics

- The item statistics were explored to determine whether a certain item did not correlate with the overall subscale. The table below shows: *r*, which is the correlations between each item and the total score from the subscale (i.e., the item-total correlations), which includes the item itself; and *Drop*, which is the correlation of an item with the subscale after removing the item (i.e., the item-rest correlation or the corrected item-total correlation). Additionally, the sample size, mean, and standard deviation is shown. 
- No issues were found with the item statistics for Factor 1, *r*(`r f1_rel$item.stats[1, "n"] - 2`) `r p_round(min(fa_rel_item$r.drop[1:5]))` - `r p_round(max(fa_rel_item$r.drop[1:5]))`; Factor 2, *r*(`r f2_rel$item.stats[1, "n"] - 2`) `r p_round(min(fa_rel_item$r.drop[6:9]))` - `r p_round(max(fa_rel_item$r.drop[6:9]))`; and Factor 3, *r*(`r f3_rel$item.stats[1, "n"] - 2`) `r p_round(min(fa_rel_item$r.drop[10:12]))` - `r p_round(max(fa_rel_item$r.drop[10:12]))`. That is, no items had a item-rest correlation (i.e., the drop column in the table below) below the cut-off (i.e., .30; Field et al., 2012). 

```{r}

# show the correlations for the 3 subscales
  # create data frame with the information
  fa_rel_item = as.data.frame(rbind(dplyr::select(f1_rel$item.stats, -c("std.r", "r.cor")), 
                                     dplyr::select(f2_rel$item.stats, -c("std.r", "r.cor")), 
                                     dplyr::select(f3_rel$item.stats, -c("std.r", "r.cor"))))

  # use kable to show the table
  kable(fa_rel_item, caption = "Reliability Analysis for Subscales in the SAD: Item Statistics", digits = 4, align = 'c', 
        col.names = c("N", "r", "Drop", "M", "SD")) %>%
  kable_styling(bootstrap_options = "striped", "hover", full_width = T) %>%
  pack_rows(index = c("Factor 1" = ncol(f1_data), "Factor 2" = ncol(f2_data), "Factor 3" = ncol(f3_data)))  

```

Overall, the reliability for Factor 1 and Factor 2 was adequate. However, Factor 3 showed mixed results: item #7 was not as stable as the other items. **However, this item did fit the minimum criteria for reliability and was therefore left in the subscale. OR The item met the minimum criteria, but was removed to improve the overall factor analysis fit with the data.**

###### Conclusions

A principal components factor analysis was completed on the 13 items in the SAD (Ward et al., 2017) with an orthogonal rotation (i.e., varimax rotation). Factorability of the data was confirmed using: (1) Bartlett's test for sphericity, $\chi^{2}$(`r fa_bartest$df`) = `r round(fa_bartest$chisq, 2)`, *p* `r p_round(fa_bartest$p.value)`, which confirmed that correlations between the items were sufficiently large; and (2) the Kaiser-Meyer-Olkin measure for sampling adequacy, $MSA_{overall}$ = `r round(fa_kmo$MSA, 2)`, $MSA_{1-13}$ = `r min(round(fa_kmo$MSAi, 2))` - `r max(round(fa_kmo$MSAi, 2))`, which confirmed that both the overall MSA and each item's MSA were above their required criteria (0.60 and 0.77, respectively; Kaiser, 1974). The appropriate number of factors (i.e., three) was confirmed using (1) a parallel analysis scree plot, where the two- and three-factor solutions were chosen to compare for th ebest fit for the data; (2) the eigen values (e.g., `r fa_three$values[1:4]`, for factors 1 to 4, respectively) for the unrotated solution, where three meaningful factors were found using Kaiser's criterion (i.e., eigen values greater than one); and (3) the factor structure of the two- and three-factor solutions, where the three-factor solution showed a better distribution of the data and was more interpretable. Therefore, the three-factor solution was chosen for the final analysis and explained `r round(sum(var_per), 2)`% of the variance. Additionally, an orthogonal (i.e., varimax) rotation was chosen for the final analysis. The items in the final factors suggested the following three factors: smartphone dependence, emotional attachment, and smartphone distractability. Smartphone dependence was relate to the degree of dependence on one's smartphone, it consisted of 5 items and explained `r round(var_per_v[1], 2)`% of the variance (e.g., "I would have trouble getting through a normal day without my smartphone.", and "I feel like I could not live without my smartphone."). Emotional attachment was related to one's smartphone use for emotional support, it consisted of 4 items and explained `r round(var_per_v[2], 2)`% of the variance (e.g., "I feel lonely when my smartphone does not ring or vibrate for several hours.", and "Using my smartphone makes me feel happy."). Smartphone distractability was related to one's smartphone retaining one's attention, it consisted of 3 items and explained `r round(var_per_v[3], 2)`% of the variance (e.g., "I find it tough to focus whenever my smartphone is nearby.", and "I become less attentive to my surroundings when I’m using my smartphone."). One item did not clearly load onto any one factor and was excluded from further analyses (i.e., item 6; Costello & Osborne, 2005). See table below for all items and factor loadings.

```{r}

# create table with all items, factor loadings, eigen values, % of virance explained, and Cronbach's alpha 

# get factor loadings
  fa_load = as.data.frame(fa_three_v$Structure[1:13,])
  # rename columns
  colnames(fa_load) = c("Factor 1", "Factor 2", "Factor 3")

# make list of the items & other info
  fa_table_list = as.data.frame(c("I would have trouble getting through a normal day without my smartphone.",
                              "It would be painful for me to give up my smartphone for a day.",
                              "I feel like I could not live without my smartphone.",
                              "If I forgot to bring my smartphone with me, I would feel anxious.",
                              "It drives me crazy when my smartphone runs out of battery.",
                              "I am upset and annoyed when I find I do not have reception on my smartphone.",
                              "I feel impatient when the Internet connection speed on my smartphone is slow.",
                              "I feel lonely when my smartphone does not ring or vibrate for several hours.",
                              "Using my smartphone relieves me of my stress.",
                              "I feel excited when I have a new message or notification.",
                              "Using my smartphone makes me feel happy.",
                              "I find it tough to focus whenever my smartphone is nearby.",
                              "I become less attentive to my surroundings when I’m using my smartphone.", 
                              "Eigen Values", 
                              "Variance Explained (%)"))
  
# create the final table
  # bind the loadings and items
    fa_three_t = cbind(fa_table_list, rbind(fa_load, var_per_t[4:6]))
    
    # rename columns & rows
      colnames(fa_three_t) = c("Item", "Factor 1: Smartphone Dependence", "Factor 2: Emotional Attachment", "Factor 3: Smartphone Distractability")
      rownames(fa_three_t)[14:15] = c("", " ") 
      
# use kable to show the table
  kable(fa_three_t, caption = "Summary of exploratory varimax rotated factor analysis results for the 13-Items in the SAD (N = 409)", digits = 4, align = "lccc") %>%
  kable_styling(bootstrap_options = "striped", "hover", full_width = T) %>%
  pack_rows("", 14, 15)


```




Look at a **reliability analysis**... 



#### Generalized Linear Model  

  - A univariate generalized linear model was used to examine if smartphone reliance (i.e. SAD score) was a moderator of the relationship between the experimental manipulation and task performance. 

*Note: These analysis were completed regardless of the results for the location and power hypotheses.*  






 



