Introduction

This document outlines the analyses from the paper Reexamining the “Brain Drain” Effect: Does Ward et al. (2017) replicate? (Ruiz Pardo & Minda, Oct 2021 preprint, https://psyarxiv.com/5y34g/). The purpose of this study was to investigate Ward et al.’s (2017) second experiment by completing a direct replication. Therefore, the present study investigated how the mere presence of one’s smartphone effects cognition. This project can be found on OSF (https://osf.io/ubys7/) and was pre-registered through OSF (https://osf.io/5fq4r) .

The main goal of the current study was to see if Ward et al.’s findings replicated (experiment two). Therefore, as in Ward et al., participants were randomly assigned to one of six possible conditions. These conditions were based on two independent variables: smartphone location and smartphone power. For smartphone location, a participant’s smartphone was in one of three locations: (1) on the participant’s desk (“on desk”), (2) in their pocket/bag (“pocket/bag”), or (3) outside the testing room (“outside”). For smartphone power, a participant’s smartphone was either (1) powered ON or (2) powered OFF, in their respective location. For all conditions, participants were instructed to keep their smartphones on silent (i.e., no vibrations if any notifications are received during the study). Also, as per Ward et al., participants in the “on desk” location conditions were instructed to keep their devices facing up.

Each participant completed two main tasks (counter-balanced):

1. An Automated Operation Span (OSpan) Task (Unsworth et al., 2005), which required participants to retain letter strings (i.e., ranging from three to seven letters in length) in memory while solving some simple math problems (e.g., “(7/7) + 6 = ?”). This task was a behavioural measure of the attentional control component of working memory.

  • Scoring:

    • Performance, measured with the OSpan absolute score, was shown by how many trials a participant correctly recalled all the letters in a given block (75 blocks in total). For example, a participant who recalled three letters (in a block with three letters), five letters (in a block with five letters), and two letters (in a block with six letters) would have an OSpan absolute score of eight for those blocks (i.e., 3 + 5 + 0 = 8). Therefore, OSpan absolute score showed performance where higher scores represented better performance.

2. The Cue-Dependent Go/No-Go Task (Bezdjia et al., 2009), which required participants to respond to “go” targets (i.e., green rectangle) while withholding any response to “no-go” targets (i.e. blue rectangle). This task included a cue component, where vertical targets (i.e., rectangles) were more likely to be “go” targets (i.e., 80% “go” and 20% “no-go”) and horizontal targets were more likely to be “no-go” targets (i.e., 80% “no-go” and 20% “go”). Participants were not explicitly made aware of the cue component. This task was a behavioural measure of sustained attention.

  • Scoring:

    • Performance was measured with mean omission errors and RT. It should be noted that mean errors can be divided into total error, commission error, and omission error. Commission errors occurred when a participant responded to a target stimulus. Omission errors occurred when a participant failed to respond to a non-target stimulus. Total errors were the sum of commission and omission errors. The present study focused on mean omission errors. Therefore, for each participant, higher mean omission errors represented lower performance. Additionally, higher mean RT also showed lower performance (i.e., indicative of greater interference).

Finally, after completing both tasks, all participants completed three survey measures:

1. The Demographic Questionnaire, which was created for the present study and consisted of four items: age (i.e., in years), gender (i.e., male, female, other, or prefer not to say), program (e.g., psychology, engineering), and year of study (e.g., first, fourth). The purpose of these items was to give a brief description of the sample.

2. The Smartphone Usage Questionnaire (SUQ), which was created for the present study and consisted of modified items from Ward et al.’s survey measures (i.e., found in the “BRAIN DRAIN” WEB APPENDIX“). Some items were forced-choice and some were on a 7-point likert scale ranging from 1 (”Never“) to 7 (”Always“). There were 10 items in total and there were five types of items, which measured: (1) smartphone use frequency (three items; e.g.,”On average, how many text messages do you send per day?“); (2) smartphone use without external stimulation (two items; e.g.,”If I am waiting to meet a friend, I pass the time by using my smartphone.“), or during other activities (two items; e.g.,”I use my smartphone while driving.“); (3) smartphone subjective value (one item; e.g.,”How much money would it take for you to give up your phone for a full day?“); (4) smartphone notification type (one item; e.g.,”Do you receive notifications (a sound or vibration) on your phone? Please indicate all that apply.“); and (5) phantom vibrations (one item; e.g.,”Have you ever thought you heard your phone ring or thought you felt it vibrate, only to find out you were wrong?"). The purpose of the SUQ was to measure participants’ typical smartphone use.

3. Smartphone Attachment and Dependency Inventory (SAD), which was from Ward et al.’s (2017) study. The SAD used 13-items (i.e., 7-point likert scale) that asked participants to rate (i.e., from 1 = Strongly Disagree to 7 = Strongly Agree) how much they agreed with statements regarding their emotional attachment and dependency on their smartphones. The purpose of the SAD was to measure each participant’s reliance on their smartphone.

  • Scoring:

    • Smartphone reliance was calculated with a sum total for all items with a range of 13 to 91. Higher scores indicated a higher level of reliance with three levels: low reliance (13-39), moderate reliance (40-65), and high reliance (66+).

A total of 453 undergraduate students participated in this study. Of the total sample size, 44 participants were excluded due to either testing error (11; e.g., incomplete task data), or experimenter or external confounds (33; e.g., interruption during testing, distracting noise during testing). Additionally, 26 participants were removed during the data cleaning phase, which removed participants who met an exclusion criteria (OSpan math criteria: 20; Go/No-Go response criteria, horizontal/no-go cue: 6; Go/No-Go response criteria, vertical/go cue: 6), were identified as having outlier data (OSpan: 0; Go/No-Go Error Analysis: 0; Go/No-Go Accuracy Analysis: 0), and had incomplete or missing data (OSpan: 3; Go/No-Go Error Analysis: 0; Go/No-Go Accuracy Analysis: 3). Overall, 70 participants were removed from the analysis. Note: a participant may have been removed due to multiple criteria.

Therefore, 383 participants were included in the analyses for the location (on desk = 128; pocket/bag = 132; outside = 123) and power (ON = 196; OFF = 187) condition.

Data Analysis Plan

The present study completed a pre-registered data analysis (pre-registered direct replication on Open Science Framework; OSF). Below is a summary of the data analysis plan (view full version at https://osf.io/5fq4r or https://osf.io/ubys7/).

Hypotheses

  1. The location effect hypothesis predicted that those who were closest in proximity to their smartphone (i.e., those with their smartphones on their desk) would show lower performance on the OSpan task.

  2. The power effect hypothesis predicted that smartphone power (i.e., either ON or OFF) would not affect performance on both cognitive tasks.

  3. The moderation effect hypothesis predicted that smartphone attachment and dependency (i.e., SAD score) would moderate the location effect: those who reported higher smartphone attachment and dependency would perform worse on the OSpan task.

Note: All hypotheses were based on the findings from Ward et al. (2017)

Inclusion / Exclusion Criteria

There were two exclusion criteria in the present study:

1. OSpan Task

  • Only data from participants who scored 85% accuracy on the math component or above was used.

    • Criteria helped control for any participants who did not follow instructions and neglected the math component of the task. Note: This was the same criteria as in Ward et al. (2017)

2. Cue-Dependent Go/No-Go Task

  • Only data from those who scored higher than chance performance was used (e.g., responded to at least 50% of “Go” trials and withheld response to at least 50% of “No-Go” trials).

    • Criteria helped control for any participants who did not complete the task as instructed (e.g., guessing). Note: There was no inclusion/exclusion criteria for thsi task in Ward et al. (2017)

Analysis

Testing the Location and Power Hypotheses:

1. OSpan Task

  • The OSpan absolute score was used to measure OSpan performance.

  • A 3(smartphone location: desk, pocket/bag, outside) x 2(smartphone power: ON, OFF) between-subjects ANOVA was conducted (DV = OSpan absolute score).

    • Post-hocs completed if there was a significant ANOVA (\(\alpha = 0.05\)) to determine which location(s) and/or power status were significantly different.

2. Cue-Dependent Go/No-Go Task

  • An error (i.e., omission errors) and accuracy (i.e., RT) analysis was completed using the average omission errors and RT for each participant.

    (a) Error Analysis

    • A 3(Smartphone location: desk, pocket/bag, outside) x 2(Smartphone power: ON, OFF) x 2(Cue type: Go, No-Go) mixed ANOVA was completed (DV = average omission error(s)).

      • The between-subjects factors:smartphone location and power
      • The within-subjects factor: pre-target cue type
      • Post-hocs completed if there was a significant ANOVA (\(\alpha = 0.05\)) to determine which location(s), power status, and/or cue type(s) were significantly different.

    (b) Accuracy Analysis

    • A 3(smartphone location: desk, pocket/bag, outside) x 2(smartphone power: ON, OFF) between-subjects ANOVA was conducted (DV = average RT).

      • The between-subjects factors:smartphone location and power
      • The within-subjects factor: pre-target cue type
      • Post-hocs completed if there was a significant ANOVA (\(\alpha = 0.05\)) to determine which location(s) and/or power status were significantly different.

Testing the Moderation Hypothesis:

3. SAD

(a) Factor Analysis

  • A principal components factor analysis with a Varimax rotation was completed for participant’s SAD score.

    • This was done to confirm the two main factors found by Ward et al. (2017): smartphone dependence and emotional attachment.

(b) Generalized Linear Model

  • A univariate generalized linear model was used to examine if smartphone reliance (i.e., SAD score) was a moderator of the relationship between the experimental manipulation and task performance.

Note: These analysis were completed regardless of the results for the location and power hypotheses.

References

  • Bezdjian, S., Baker, L. A., Lozano, D. I., & Raine, A. (2009). Assessing inattention and impulsivity in children during the Go/NoGo task. The British Journal of Developmental Psychology, 27(2), 365–383. https://doi.org/10.1348/026151008X314919
  • Peirce, J. W. (2007). PsychoPy—Psychophysics software in Python. Journal of Neuroscience Methods, 162(1), 8–13. https://doi.org/10.1016/j.jneumeth.2006.11.017
  • Ruiz Pardo, A. C., & Minda, J. P. (2021). Reexamining the “brain drain” effect: Does Ward et al. (2017) replicate? In PsyArXiv. Western University. https://doi.org/10.31234/osf.io/5y34g
  • Unsworth, N., Heitz, R. P., Schrock, J. C., & Engle, R. W. (2005). An automated version of the operation span task. Behavior Research Methods, 37(3), 498–505. https://www.ncbi.nlm.nih.gov/pubmed/16405146
  • Ward, A. F., Duke, K., Gneezy, A., & Bos, M. W. (2017). Brain drain: The mere presence of one’s own smartphone reduces available cognitive capacity. Journal of the Association for Consumer Research, 2, 140–154. https://doi.org/10.1086/691462

Analysis Prep

Load Required Libraries

The following packages were used to complete the analyses, visualizations, data wrangling, etc. in this document.

Additional Functions

Rounding p-values

This rounding function was adapted from Dr. Emily Nielsen’s Rpubs. The function (“p_round(x)”) was created to assess and print p-values. If \(p \ge .005\), the function will display “$p = $” and the value rounded to two decimal places. If $ .0005 p <.005\(, the function will display “\)p = $" and the value rounded to three decimal places. If \(p < .0005\), the function will display “\(p < .001\)”. (click to expand)


p_round <- function(x){
  if(x > .005)
    {x1 = (paste("= ", gsub("0\\.","\\.", round(x, digits = 2)), sep = ''))
  }  
  else if(x == .005){x1 = (paste("=", gsub("0\\.","\\.", 0.01)))
  }
  else if(x > .0005 & x < .005)
    {x1 = (paste("= ", gsub("0\\.","\\.", round(x, digits = 3)), sep = ''))
  }  
  else if(x == .0005){x1 = (paste("=", gsub("0\\.","\\.", 0.001)))
  }
  else{x1 = (paste("<", gsub("0\\.","\\.", 0.001)))
  } 
  (x1)
}

Flatten Correlation Output

# ++++++++++++++++++++++++++++
# flattenCorrMatrix
# ++++++++++++++++++++++++++++
# cormat : matrix of the correlation coefficients
# pmat : matrix of the correlation p-values
flattenCorrMatrix <- function(cormat, pmat) {
  ut <- upper.tri(cormat)
  data.frame(
    row = rownames(cormat)[row(cormat)[ut]],
    column = rownames(cormat)[col(cormat)[ut]],
    cor  =(cormat)[ut],
    p = pmat[ut]
    )
}

Raw Data

There are five raw data files that were imported for the present study (see the Import Raw Data section).

Task Data

  1. OSpan Raw Data

    • spreadsheet, collected manually from each participant
  2. Go/No-Go Raw Data

    • psychopy excel files
    • Note: raw data was extracted from folder into one file (see the Import Raw Data section)

Survey Measures

  1. Demographic & SUQ Raw Data

    • csv files exported from Qualtrics
  2. SAD Raw Data

    • csv files exported from Qualtrics
  3. Testing Tracking Sheet

    • csv files, collected during testing

Import Raw Data

1. OSpan Raw Data

The OSpan Raw Data was collected using the Inquisit 5 software, which used the same automated OSpan task as in Ward et al. (2017). The raw data from the OSpan was imported as a data frame, “ospan_raw”, and included the following: (click to see description)

  • Participant number [ospan_participant]

  • OSpan ID [ospan_ID]

    • A 9-digit ID given by the Iquisit 5 software
  • OSpan Absolute Score [ospan_absolute_score]

    • Performance measured by how many trials a participant correctly recalled all the letters in a given block (75 blocks in total). For example, a participant who recalled three letters (in a block with three letters), five letters (in a block with five letters), and two letters (in a block with six letters) would have an OSpan absolute score of eight for those blocks (i.e., 3 + 5 + 0 = 8).
  • Total Letters Recalled [ospan_total_corr]

    • Performance measured by how many letters were recalled in total, regardless of whether all letters were correct within a given block.
  • Total Math Errors [ospan_math_err_tot]

    • the sum of all math errors (i.e., speed and accuracy)
  • Speed Errors [ospan_math_err_speed]

    • the sum of the speed related math errors

      • where a participant exceeded their time limit to respond to the math component of the task
  • Accuracy Errors [ospan_math_err_accuracy]

    • the sum of the accuracy related math errors

      • where a participant answered incorrectly on the math component of the task
  • Math Percentage [ospan_math_percent_corr]

    • the percent of all math trials that were answered correctly

      • Participants were asked to keep their math percentage at 85% or above to ensure participants did not ignore the math component of the task to improve on the letter recall component
  • Math Percentage Criteria [ospan_math_criteria_85]

    • whether a participant met the math criteria (i.e., 85% or above; yes or no)

      coding information (click to expand)

      • 1 = yes
      • 0 = no * including any participant that did not complete the OSpan task

This code imported the OSpan raw data. (click to expand)

ospan_raw = utils::read.csv("raw_ospan.csv", header = TRUE)

# reorganize rows so that the data is ordered by the participant number (e.g. 1, 2, 3, 4...)
ospan_raw = dplyr::arrange(ospan_raw, ospan_participant)

ospan_raw

2. Go/No-Go Raw Data

The Go/No-Go Raw Data was collected using PsychoPy2 Experiment Builder (v1.85.4). Each participant’s data was stored in a separate folder (default from psychopy) as an excel file. The following code re-organized the raw data into a single wide format of the data as a data frame, “gng_data_extracted”, and two data frames that separate the raw data by cue type (i.e.“vertical” or “horizontal”), “gng_data_extracted_v” and “gng_data_extracted_h”. (click to see descriptions)

For the gng_data_extracted data frame:

  • Participant number [gng_participant]

  • Age [age]

  • First Language [lang]

    • self-reported whether their fist language was English (i.e., yes or no)

      coding information (click to expand)

      code 1 2
      response Yes No
  • Total trials completed [gng_trial_tot]

  • GnG Score [gng_score]

    • the sum of the correct answers
  • Total Responses [gng_resp_tot]

    • the sum of the responses for all trials
  • Total No Responses [gng_no_resp_tot]

    • the sum of the trials without a response
  • Total Errors [gng_err_tot]

    • the sum of all errors for all trials
  • Commission Errors [gng_err_com]

    • the sum of all commission errors, defined as:

      • where a response was made during a “no-go” trial (i.e., committing an error)
  • Omission Errors [gng_err_omi]

    • the sum of all omission errors, defined as:

      • where a response was NOT made during a “go” trial (i.e., failing to respond)
  • Average Reaction Time (RT) [gng_rt_av]

For the gng_data_extracted_v data frame:

  • Participant number [gng_participant]

  • Age [age_long]

  • First Language [lang_long]

    • self-reported whether their fist language was English (i.e., yes or no)

      coding information (click to expand)

      code 1 2
      response Yes No
  • Total trials completed [trial_tot_v]

  • GnG Score [score_v]

    • the sum of the correct answers
  • Total Responses [resp_tot_v]

    • the sum of the responses for all trials
  • Total No Responses [no_resp_tot_v]

    • the sum of the trials without a response
  • Total Errors [err_tot_v]

    • the sum of all errors for all trials
  • Commission Errors [err_com_v]

    • the sum of all commission errors, defined as:

      • where a response was made during a “no-go” trial (i.e., committing an error)
  • Omission Errors [err_omi_v]

    • the sum of all omission errors, defined as:

      • where a response was NOT made during a “go” trial (i.e., failing to respond)
  • Average Reaction Time (RT) [rt_av_v]

  • Response Exclusion Criteria [exclude_v]

    • This depicts whether a participant should be excluded from analysis based on the response exclusion criteria

      • To be included, participants should score higher than chance performance (i.e., responded to at least 50% of “Go” trials and withheld response to at least 50% of “No-Go” trials)

        coding information (click to expand)

        code 1 2
        response Yes (participant excluded) No (participant included)

For the gng_data_extracted_h data frame:

  • Participant number [gng_participant]

  • Age [age_long]

  • First Language [lang_long]

    • self-reported whether their fist language was English (i.e., yes or no)

      coding information (click to expand)

      code 1 2
      response Yes No
  • Total trials completed [trial_tot_h]

  • GnG Score [score_h]

    • the sum of the correct answers
  • Total Responses [resp_tot_h]

    • the sum of the responses for all trials
  • Total No Responses [no_resp_tot_h]

    • the sum of the trials without a response
  • Total Errors [err_tot_h]

    • the sum of all errors for all trials
  • Commission Errors [err_com_h]

    • the sum of all commission errors, defined as:

      • where a response was made during a “no-go” trial (i.e., committing an error)
  • Omission Errors [err_omi_h]

    • the sum of all omission errors, defined as:

      • where a response was NOT made during a “go” trial (i.e., failing to respond)
  • Average Reaction Time (RT) [rt_av_h]

  • Response Exclusion Criteria [exclude_h]

    • This depicts whether a participant should be excluded from analysis based on the response exclusion criteria

      • To be included, participants should score higher than chance performance (i.e., responded to at least 50% of “Go” trials and withheld response to at least 50% of “No-Go” trials)

        coding information (click to expand)

        code 1 2
        response Yes (participant excluded) No (participant included)

This code imported the Go/No-Go raw data without separating the data by cue type. (click to expand)


#---WIDE DATA FRAME---"gng_data_extracted"  

#--This code completes the following tasks:
# (1) Create a function that extracts any required information from all psychopy folders at once
# (2) Create a list of all the directory paths to each individual file 
# (3) Apply the function to the files 
# (4) Create a data frame of the result of the function on the files

#--The FINAL OUTPUT = a wide format of the raw GnG data including:
# participant, age, lang, trial_tot, score, resp_tot, no_resp_tot, err_tot, err_com, err_omi, rt_av

#--NOTES:

# Step (3) will give warnings if there is any missing data that the function was not specified how to handle
# example: Any participant that did not respond to an item or during all trials will have an "NA" response in the data
# example: Any participant who reported their age or first language in an irregular manner (e.g. "ninteen" rather than "19")
# Either example will result in a warning, since the function would have to force that output to be "NA" as well

# Individual participant files do not stay in chronological order 
# (they are re-ordered so that all ps that start with the digit "1" go first, then the "2"...)

#--(1) Create a function that extracts any required information from all psychopy folders at once

# create a function that extracts all the required information from each excel file
# details for what is extracted in this function is found within it, 
# to use it to extract any other &/ additional information, edit within the function itself
extract_gng = function(data) {
  temp_data = readxl::read_excel(data)
  
  # Participant number (participant)
  participant = as.numeric(temp_data[which(temp_data == "participant"),2])
  
  # Age (age)
  age = as.integer(temp_data[which(temp_data == "Age"),2])
  
  # Gender (gender)
  # will be populated from survey measures
  
  # First Language (lang)
  # this finds if the language reported was English: if yes = 1; no = 0
  lang = ifelse(stringr::str_detect(as.character(temp_data[which(temp_data == "First Language"),2]), stringr::fixed("english", ignore_case=TRUE)), 1, 0)
  
  # Location Condition (location; either: (1) desk, (2) pocket/bag, or (3) outside)
  # will be populated from survey measures
  
  # Power Condition (power; either: (1) ON,or (2) OFF)
  # will be populated from survey measures
  
  # Total trials completed (trial_tot)
  # count the "order" column, which shows the order in which the trials were showed (random for each participant)
  # note: must ignore the NA values
  trial_tot = sum((is.na(temp_data$cue) == "FALSE" & is.na(temp_data$order) == "FALSE"), na.rm = TRUE) #alternative: sum(complete.cases(temp_data$order))
  
  # GnG Score (score)
  # sum of the correct answers from the "key_resp_slash.corr_raw" column
  # note: must ignore the NA values
  score = sum(temp_data$key_resp_slash.corr_raw, na.rm = TRUE)
  
  # Total Responses (resp_tot)
  # sum of the responses from the "key_resp_slash.keys_raw" column
  # note: must ignore the NA values
  resp_tot = sum((is.na(temp_data$cue) == "FALSE" & is.na(temp_data$key_resp_slash.keys_raw) == "FALSE"), na.rm = TRUE) #alternative: sum(complete.cases(temp_data$key_resp_slash.keys_raw))
  
  # Total No Responses (no_resp_tot)
  # subtract total responses made from completed trials 
  # note: must ignore the NA values
  no_resp_tot = (trial_tot - resp_tot)
  
  # Total Errors (err_tot)
  # sum of the incorrect answers from the "key_resp_slash.corr_raw"
  err_tot = sum((temp_data$key_resp_slash.corr_raw == "0") == "TRUE", na.rm = TRUE) # alternative: sum(!temp_data$key_resp_slash.corr_raw, na.rm = TRUE)
  
  # Commission Errors (err_com)
  # sum any cases where a reponse was made & the answer was incorrect (i.e. committing an erorr)
  err_com = sum((temp_data$key_resp_slash.corr_raw == "0" & !is.na(temp_data$key_resp_slash.keys_raw)) == "TRUE", na.rm = TRUE)
  
  # Omission Errors (err_omi)
  # sum any cases where a reponse was NOT made & the answer was incorrect (i.e. failing to respond)
  err_omi = sum((temp_data$key_resp_slash.corr_raw == "0" & is.na(temp_data$key_resp_slash.keys_raw)) == "TRUE", na.rm = TRUE)
  
  # Average Reaction Time (RT) (rt_av)
  # calculate the average / mean RT for all responses
  # Note: the "no-go" trials required no response for a correct trial (for 1,000ms)
  rt_av = mean(temp_data$key_resp_slash.rt_raw, na.rm = TRUE)
  
  temp_dataframe = as.data.frame(c(participant, age, lang, trial_tot, 
                                   score, resp_tot, no_resp_tot, 
                                   err_tot, err_com, err_omi, rt_av))
}

#--(2) Create a list of all the directory paths to each individual file 

# Find the location of each excel (.xlsx) files
# the folders containing the files should be in 1 folder (e.g. "" in this case)
# this returns a list of each file's location
# Note: the pattern is set to ignore any hidden / temporary files by adding the start of each file name (e.g. "^NoGo")
files_psychopy = dir("/Users/anaceciliaruizpardo/RStudio/Smartphone_study/Study 1- Smartphone Replication/Dissertation (04:20:20)/Raw Data Organizing/gng_raw_folders", 
                     recursive = TRUE, full.names=TRUE, pattern="^NoGo(.*)xlsx$")

#--(3) Apply the function to the files 

# Apply the function to all excel files
result = sapply(files_psychopy, extract_gng)

#--(4) Create a data frame of the result of the function on the files

# take the lists of the extracted data and turn it into a data frame
gng_data_extracted = data.frame(Reduce(rbind, result)) # alternative method: x = data.frame(t(sapply(result,c)))
# rename the columns
colnames(gng_data_extracted) = c("gng_participant", "age", "lang", "gng_trial_tot", "gng_score", 
                                 "gng_resp_tot", "gng_no_resp_tot", "gng_err_tot", "gng_err_com", 
                                 "gng_err_omi", "gng_rt_av") 
# remove row names to avoid confusion
rownames(gng_data_extracted) = NULL

# reorganize rows so that the data is ordered by the participant number (e.g. 1, 2, 3, 4...)
gng_data_extracted = dplyr::arrange(gng_data_extracted, gng_participant)

gng_data_extracted

This code imported the Go/No-Go raw data and separated the data by cue type (i.e. “vertical” or “horizontal”). (click to expand)


#--- for the LONG DATA FRAME - CUE TYPE (vertical vs. horizontal) --- "gng_data_extracted_v" & "gng_data_extracted_h" 
  
#-- This code differs from the above code in that it organizes the gng data based the CUE type (i.e. either a "vertical" (v) or "horizontal" (h)).
  # This means that the data is in a long format to test the within-subjects factor of cue type.

#--This code completes the following tasks:
# (1) Create a function that extracts any required information from all psychopy folders at once
# (2) Create a list of all the directory paths to each individual file 
# (3) Apply the function to the files 
# (4) Create a data frame of the result of the function on the files
# (5) Re-structure the data into 1 data frame in a long format for a repeated measures analysis

#--The FINAL OUTPUT = a wide format of the raw GnG data including:
# participant, age, lang, trial_tot, score, resp_tot, no_resp_tot, err_tot, err_com, err_omi, rt_av

#--NOTES:

# Step (3) will give warnings if there is any missing data that the function was not specified how to handle
# example: Any participant that did not respond to an item or during all trials will have an "NA" response in the data
# example: Any participant who reported their age or first language in an irregular manner (e.g. "ninteen" rather than "19")
# Either example will result in a warning, since the function would have to force that output to be "NA" as well

# Individual participant files do not stay in chronological order 
# (they are re-ordered so that all ps that start with the digit "1" go first, then the "2"...)

#--(1) Create a function that extracts any required information from all psychopy folders at once...
# for the "vertical" (v) (extract_gng_v) and "horizontal" (h) (extract_gng_h) trials

# create a function that extracts all the required information from each excel file
# details for what is extracted in this function is found within it, 
# to use it to extract any other &/ additional information, edit within the function itself
# NOTE: this function extracts 2 versions of each value: one for the "vertical" (v) trials, and one for the "horizontal" (h) trials

extract_gng_v = function(data) {
  temp_data = readxl::read_excel(data)
  
  # Participant number (participant_long)
  participant_long = as.numeric(temp_data[which(temp_data == "participant"),2])
  # only one is extracted since this will not change b/w CUE type
  
  # Age (age_long)
  age_long = as.integer(temp_data[which(temp_data == "Age"),2])
  # only one is extracted since this will not change b/w CUE type
  
  # Gender (gender)
  # will be populated from survey measures
  # only one is extracted since this will not change b/w CUE type
  
  # First Language (lang_long)
  # this finds if the language reported was English: if yes = 1; no = 0
  lang_long = ifelse(stringr::str_detect(as.character(temp_data[which(temp_data == "First Language"),2]), stringr::fixed("english", ignore_case=TRUE)), 1, 0)
  # only one is extracted since this will not change b/w CUE type
  
  # Location Condition (location; either: (1) desk, (2) pocket/bag, or (3) outside)
  # will be populated from survey measures
  # only one is extracted since this will not change b/w CUE type
  
  # Power Condition (power; either: (1) ON,or (2) OFF)
  # will be populated from survey measures
  # only one is extracted since this will not change b/w CUE type
  
  # Total trials completed for the "vertical" (v) (trial_tot_v)
  # count the "order" column, which shows the order in which the trials were showed (random for each participant)
  # note: must ignore the NA values
  trial_tot_v = sum((temp_data$cue == "v_b.png" & is.na(temp_data$order) == "FALSE"), na.rm = TRUE)
  
  # GnG Score for the "vertical" (v) (score_v)  
  # sum of the correct answers from the "key_resp_slash.corr_raw" column
  # note: must ignore the NA values
  score_v = sum(temp_data$cue == "v_b.png" & temp_data$key_resp_slash.corr_raw, na.rm = TRUE)  
  
  # Total Responses for the "vertical" (v) (resp_tot_v)
  # sum of the responses from the "key_resp_slash.keys_raw" column
  # note: must ignore the NA values
  resp_tot_v = sum((temp_data$cue == "v_b.png" & temp_data$key_resp_slash.keys_raw == "'slash'"), na.rm = TRUE)  
  
  # Total No Responses for the "vertical" (v) (no_resp_tot_v)  
  # subtract total responses made from completed trials 
  # note: must ignore the NA values
  no_resp_tot_v = (trial_tot_v - resp_tot_v)  
  
  # Total Errors for the "vertical" (v) (err_tot_v) 
  # sum of the incorrect answers from the "key_resp_slash.corr_raw"
  err_tot_v = sum((temp_data$cue == "v_b.png" & temp_data$key_resp_slash.corr_raw == "0"), na.rm = TRUE)  
  
  # Commission Errors for the "vertical" (v) (err_com_v) 
  # sum any cases where a reponse was made & the answer was incorrect (i.e. committing an error)
  err_com_v = sum(temp_data$cue == "v_b.png" & temp_data$key_resp_slash.corr_raw == "0" & !is.na(temp_data$key_resp_slash.keys_raw), na.rm = TRUE)  
  
  # Omission Errors for the "vertical" (v) (err_omi_v)  
  # sum any cases where a reponse was NOT made & the answer was incorrect (i.e. failing to respond)
  err_omi_v = sum(temp_data$cue == "v_b.png" & temp_data$key_resp_slash.corr_raw == "0" & is.na(temp_data$key_resp_slash.keys_raw), na.rm = TRUE)  

  # Average Reaction Time (RT) for the "vertical" (v) (rt_av_v)  
  # calculate the average / mean RT for all responses
  # Note: the "no-go" trials required no response for a correct trial (for 1,000ms)
  rt_av_v = mean(temp_data$key_resp_slash.rt_raw[which(temp_data$cue == "v_b.png")], na.rm = TRUE)  
  
  # Exclusion Criteria for the "vertical" (v) (exclude_v)
  # calculate whether participants scored higher than chance performance 
  # (i.e. responded to at least 50% of “Go” trials and withheld response to at least 50% of “No-Go” trials)
  # code: exclude = 1; include (do not exclude) = 2
  # NOTE: this calculation is meant to exclude the participant as a whole, therefore, 
  # it does not calculate it for the cue type specifically
    # first, get the total trials and response totals for the "go" and "no-go" trials
      # "go"
      trial_tot_go = sum((temp_data$corrAns == "slash" & is.na(temp_data$order) == "FALSE"), na.rm = TRUE)
      resp_tot_go = sum((temp_data$corrAns == "slash" & temp_data$key_resp_slash.keys_raw == "'slash'"), na.rm = TRUE)
      # "no-go"
      trial_tot_nogo = sum((temp_data$corrAns == "none" & is.na(temp_data$order) == "FALSE"), na.rm = TRUE)
      resp_tot_nogo = sum((temp_data$corrAns == "none" & temp_data$key_resp_slash.keys_raw == "'slash'"), na.rm = TRUE)
    # next, find the minimum response needed based on the 50% criteria
    min_resp = trial_tot_go * .50
    # find if the participant meets exclusion criteria for the "go" and "no-go" trials
    meets_min_go = resp_tot_go > (min_resp)
    meets_min_nogo = resp_tot_nogo < (min_resp)
    # find if the participant meets exclusion criteria (exclude = 1; include = 2)
    exclude_v = ifelse((meets_min_go == "TRUE" & meets_min_nogo == "TRUE") == "FALSE", 1, 2)
  
  temp_dataframe_v = as.data.frame(c(participant_long, age_long, lang_long, trial_tot_v, score_v, resp_tot_v, 
                                        no_resp_tot_v, err_tot_v, err_com_v, err_omi_v, rt_av_v, exclude_v))
}  

extract_gng_h = function(data) {
  temp_data = readxl::read_excel(data)
  
  # Participant number (participant_long)
  participant_long = as.numeric(temp_data[which(temp_data == "participant"),2])
  # only one is extracted since this will not change b/w CUE type
  
  # Age (age_long)
  age_long = as.integer(temp_data[which(temp_data == "Age"),2])
  # only one is extracted since this will not change b/w CUE type
  
  # Gender (gender)
  # will be populated from survey measures
  # only one is extracted since this will not change b/w CUE type
  
  # First Language (lang_long)
  # this finds if the language reported was English: if yes = 1; no = 0
  lang_long = ifelse(stringr::str_detect(as.character(temp_data[which(temp_data == "First Language"),2]), stringr::fixed("english", ignore_case=TRUE)), 1, 0)
  # only one is extracted since this will not change b/w CUE type
  
  # Location Condition (location; either: (1) desk, (2) pocket/bag, or (3) outside)
  # will be populated from survey measures
  # only one is extracted since this will not change b/w CUE type
  
  # Power Condition (power; either: (1) ON,or (2) OFF)
  # will be populated from survey measures
  # only one is extracted since this will not change b/w CUE type
  
  # Total trials completed for the "horizontal" (h) (trial_tot_h)
  # count the "order" column, which shows the order in which the trials were showed (random for each participant)
  # note: must ignore the NA values
  trial_tot_h = sum((temp_data$cue == "h_b.png" & is.na(temp_data$order) == "FALSE"), na.rm = TRUE)
  
  # GnG Score for the "horizontal" (h) (score_h)  
  # sum of the correct answers from the "key_resp_slash.corr_raw" column
  # note: must ignore the NA values
  score_h = sum(temp_data$cue == "h_b.png" & temp_data$key_resp_slash.corr_raw, na.rm = TRUE)  
  
  # Total Responses for the "horizontal" (h) (resp_tot_h)
  # sum of the responses from the "key_resp_slash.keys_raw" column
  # note: must ignore the NA values
  resp_tot_h = sum((temp_data$cue == "h_b.png" & temp_data$key_resp_slash.keys_raw == "'slash'"), na.rm = TRUE)  
  
  # Total No Responses for the "horizontal" (h) (no_resp_tot_h)  
  # subtract total responses made from completed trials 
  # note: must ignore the NA values
  no_resp_tot_h = (trial_tot_h - resp_tot_h)  
  
  # Total Errors for the "horizontal" (h) (err_tot_h) 
  # sum of the incorrect answers from the "key_resp_slash.corr_raw"
  err_tot_h = sum((temp_data$cue == "h_b.png" & temp_data$key_resp_slash.corr_raw == "0"), na.rm = TRUE)  
  
  # Commission Errors for the "horizontal" (h) (err_com_h) 
  # sum any cases where a reponse was made & the answer was incorrect (i.e. committing an error)
  err_com_h = sum(temp_data$cue == "h_b.png" & temp_data$key_resp_slash.corr_raw == "0" & !is.na(temp_data$key_resp_slash.keys_raw), na.rm = TRUE)  
  
  # Omission Errors for the "horizontal" (h) (err_omi_h)  
  # sum any cases where a reponse was NOT made & the answer was incorrect (i.e. failing to respond)
  err_omi_h = sum(temp_data$cue == "h_b.png" & temp_data$key_resp_slash.corr_raw == "0" & is.na(temp_data$key_resp_slash.keys_raw), na.rm = TRUE)  
  
  # Average Reaction Time (RT) for the "horizontal" (h) (rt_av_h)  
  # calculate the average / mean RT for all responses
  # Note: the "no-go" trials required no response for a correct trial (for 1,000ms)
  rt_av_h = mean(temp_data$key_resp_slash.rt_raw[which(temp_data$cue == "h_b.png")], na.rm = TRUE)  
  
  # Exclusion Criteria for the "horizontal" (h) (exclude_h)
  # calculate whether participants scored higher than chance performance 
  # (i.e. responded to at least 50% of “Go” trials and withheld response to at least 50% of “No-Go” trials)
  # code: exclude = 1; include (do not exclude) = 2
  # NOTE: this calculation is meant to exclude the participant as a whole, therefore, 
  # it does not calculate it for the cue type specifically
  # first, get the total trials and response totals for the "go" and "no-go" trials
  # "go"
  trial_tot_go = sum((temp_data$corrAns == "slash" & is.na(temp_data$order) == "FALSE"), na.rm = TRUE)
  resp_tot_go = sum((temp_data$corrAns == "slash" & temp_data$key_resp_slash.keys_raw == "'slash'"), na.rm = TRUE)
  # "no-go"
  trial_tot_nogo = sum((temp_data$corrAns == "none" & is.na(temp_data$order) == "FALSE"), na.rm = TRUE)
  resp_tot_nogo = sum((temp_data$corrAns == "none" & temp_data$key_resp_slash.keys_raw == "'slash'"), na.rm = TRUE)
  # next, find the minimum response needed based on the 50% criteria
  min_resp = trial_tot_go * .50
  # find if the participant meets exclusion criteria for the "go" and "no-go" trials
  meets_min_go = resp_tot_go > (min_resp)
  meets_min_nogo = resp_tot_nogo < (min_resp)
  # find if the participant meets exclusion criteria (exclude = 1; include = 2)
  exclude_h = ifelse((meets_min_go == "TRUE" & meets_min_nogo == "TRUE") == "FALSE", 1, 2)
  
  temp_dataframe_h = as.data.frame(c(participant_long, age_long, lang_long, trial_tot_h, score_h, resp_tot_h, 
                                     no_resp_tot_h, err_tot_h, err_com_h, err_omi_h, rt_av_h, exclude_h))
}  

#--(2) Create a list of all the directory paths to each individual file 

# Find the location of each excel (.xlsx) files
# the folders containing the files should be in 1 folder (e.g. "" in this case)
# this returns a list of each file's location
# Note: the pattern is set to ignore any hidden / temporary files by adding the start of each file name (e.g. "^NoGo")
# THIS IS THE SAME AS IN THE ABOVE CODE, SO IT IS COMMENTED OUT TO AVOID EXTRA COMPUTING POWER

# files_psychopy = dir("/Users/anaceciliaruizpardo/RStudio/Smartphone_study/Study 1- Smartphone Replication/Dissertation (04:20:20)/Raw Data Organizing/gng_raw_folders", 
#                     recursive = TRUE, full.names=TRUE, pattern="^NoGo(.*)xlsx$")

#--(3) Apply the function to the files 

# Apply the function to all excel files for the "vertical" (v) and "horizontal" (h) trials  
result_v = sapply(files_psychopy, extract_gng_v)
result_h = sapply(files_psychopy, extract_gng_h)

#--(4) Create a data frame of the result of the function on the files...
# for the "vertical" (gng_data_extracted_v) and "horizontal" (gng_data_extracted_h) trials  

# take the lists of the extracted data and turn it into a data frame for the "vertical" (h) trials
gng_data_extracted_v = data.frame(Reduce(rbind, result_v)) 
# rename the columns
colnames(gng_data_extracted_v) = c("participant_long", "age_long", "lang_long", "trial_tot_v", "score_v", 
                                         "resp_tot_v", "no_resp_tot_v", "err_tot_v", "err_com_v", 
                                         "err_omi_v", "rt_av_v", "exclude_v")

# take the lists of the extracted data and turn it into a data frame for the "horizontal" (h) trials
gng_data_extracted_h = data.frame(Reduce(rbind, result_h)) 
# rename the columns
colnames(gng_data_extracted_h) = c("participant_long", "age_long", "lang_long", "trial_tot_h", "score_h", 
                                           "resp_tot_h", "no_resp_tot_h", "err_tot_h", "err_com_h", 
                                           "err_omi_h", "rt_av_h", "exclude_h")
# remove row names to avoid confusion
rownames(gng_data_extracted_v) = NULL
rownames(gng_data_extracted_h) = NULL

# reorganize rows so that the data is ordered by the participant number (e.g. 1, 2, 3, 4...)
gng_data_extracted_v = dplyr::arrange(gng_data_extracted_v, participant_long)
gng_data_extracted_h = dplyr::arrange(gng_data_extracted_h, participant_long)

3. & 4. Demographic, SUQ, and SAD Raw Data

The Demographic & SUQ Raw Data and SAD Raw Data was collected using Qualtrics. The raw survey data was imported as a data frame, “survey_raw”, and included the following: (click to see description)

General, Manipulation, and Demographic Information

General Information: (click to expand)

  • Start Date [date_start]

    • The date and time when a participant began their survey on Qualtrics (Y/M/D H:M)
  • End Date [date_end]

    • The date and time when a participant finished their survey on Qualtrics (Y/M/D H:M)
  • Duration (in seconds) [duration(s)]

    • The duration of the survey for the participant in seconds
  • Recorded Date [date_rec]

    • The date and time when a participant’s data was recorded on Qualtrics (Y/M/D H:M)
  • Participant number [sur_participant]

Manipulation Condition: (click to expand)

  • Smartphone Location [sur_location]

    • The smartphone location assigned to the participant (i.e., desk, pocket/bag, or outside)

      coding information (click to expand)

      code 1 2 3
      response Desk Pocket/Bag Outside
  • Smartphone Power [sur_power]

    • The smartphone power assigned to the participant (i.e., ON or OFF)

      coding information (click to expand)

      code 1 2
      response On Off
  • Smartphone Condition [sur_condition]

    • The smartphone condition (i.e., combination of location & power) assigned to the participant

      coding information (click to expand)

      code 1 2 3 4 5 6
      response ON & Desk ON & Pocket/Bag ON & Outside OFF & Desk OFF & Pocket/Bag OFF & Outside

Demographics: (click to expand)

  • Gender [gender]

    • Participant’s self-reported gender (i.e., male, female, other, or prefer not to say)

      coding information (click to expand)

      code 1 2 3 4
      response Male Female Other Prefer not to say
  • Year [year]

    • Participant’s self-reported year of study in their respective program (e.g., 1, 2, 3, 4)
  • Program [program]

    • Participant’s self-reported program of study (e.g., biology, BMOS, psychology)

      coding information (click to expand)

      1 = actuarial 11 = economics 21 = management 31 = psychology
      2 = anthropology science 22 = mathematics 32 = SASAH
      3 = arts 13 = english 23 = medical science 33 = science
      4 = arts & humanities 14 = film 24 = MIT 34 = social science
      5 = biology 15 = FIMS 25 = MOS 35 = sociology
      6 = BMOS 16 = geography 26 = music 36 = software
      7 = chemistry 17 = health science 27 = nutrition & dietetics 37 = visual arts
      8 = computer science 18 = integrated science 28 = philosophy
      9 = consumer behaviour 19 = kinesiology 29 = physics
      10 = criminology 20 = linguistics 30 = political science
The Smartphone Use Questionnaire (SUQ)

The purpose of the SUQ was to measure participants’ typical smartphone use. The SUQ was created for the present study and consisted of modified items from Ward and colleagues’ (2017) survey measures. (i.e. found in the “BRAIN DRAIN” WEB APPENDIX“). Some items were forced-choice and some were on a 7-point likert scale ranging from 1 (”Never“) to 7 (”Always"). There were 10 items in total and there were five types of items, which measured:

1. Smartphone Use Frequency: (SUQ_use_1 - SUQ_use_3)

Each of the following items used the following coding:

CODE 1 2 3 4
response 0-5 6-10 11-15 > 15
  • Text Messages (per day) [SUQ_use_1]

    • Answer to the following question: “On average, how many text messages do you send per day?
  • Social Media Messages (per day) [SUQ_use_2]

    • Answer to the following question: “On average, how many social media based messages do you send per day from your smartphone? (iMessage, Facebook Messenger, WhatsApp, WeChat, direct messages within social media platforms, etc.)
  • Social Media Posts (per day) [SUQ_use_3]

    • Answer to the following question: “On average, how many social media posts (e.g. written post, picture, article, etc.) do you send per day from your smartphone? (Facebook, Twitter, Instagram, etc.)

2. Smartphone Use With External Stimulation or During Other Activities: (SUQ_use_4 - SUQ_use_7)

Each of the following items used the following coding:

code 1 2 3 4 5 6 7
response Never About half
the time
Always
no text specified
  • Tendency to turn to one’s smartphone in the absence of external stimulation [SUQ_use_4 & SUQ_use_5]

    • SUQ_use_4: Answer to the following question: “I look at my smartphone before I roll out of bed in the morning.
    • SUQ_use_5: Answer to the following question: “If I am waiting to meet a friend, I pass the time by using my smartphone.
  • Tendency to turn to one’s smartphone in the midst of other activities [SUQ_use_6SUQ_use_7]

    • SUQ_use_6: Answer to the following question: “I use my smartphone while driving.
    • SUQ_use_7: Answer to the following question: “If my smartphone rings or vibrates in the middle of personal business, I look at it.

3. Smartphone Subjective Value: (SUQ_exp_1)

  • Subjective Value [SUQ_use_1]

    • Answer to the following question: “How much money would it take for you to give up your phone for a full day?

      coding information (click to expand)

      code 1 2 3 4
      response $0-$20 $21-$40 $41-$60 > $60

4. Smartphone Notification Type: (SUQ_exp_2 & SUQ_exp_3)

  • Notification Type [SUQ_use_2]

    • Answer to the following question: “Do you receive notifications (a sound or vibration) on your phone? Please indicate all that apply.

      coding information (click to expand)

      code 1 2 3 4 5 6 7
      response Email Facebook Twitter Instagram LinkedIn Snapchat Other (specify)
  • Specify Notification Type [SUQ_exp_3]

    • Answer to the following question: “Please Specify: Where you receive notifications (a sound or vibration) on your phone?

5. Phantom Vibrations: (SUQ_exp_4)

  • Phantom Vibrations [SUQ_use_4]

    • Answer to the following question: “Have you ever thought you heard your phone ring or thought you felt it vibrate, only to find out you were wrong?

      coding information (click to expand)

      code 1 2
      response Yes No
The Smartphone Attachment and Dependency Inventory (SAD; Ward et al., 2017)

The SAD asked participants to respond to 13 items (SAD_1-SAD_13) using a 7-point likert scale. (click to expand)

code 1 2 3 4 5 6 7
response Strongly
Disagree
Disagree Somewhat
Disagree
Neutral Somewhat
Agree
Agree Strongly
Agree
  • SAD_1: I would have trouble getting through a normal day without my smartphone.
  • SAD_2: It would be painful for me to give up my smartphone for a day.
  • SAD_3: I feel like I could not live without my smartphone.
  • SAD_4: If I forgot to bring my smartphone with me, I would feel anxious.
  • SAD_5: It drives me crazy when my smartphone runs out of battery.
  • SAD_6: I am upset and annoyed when I find I do not have reception on my smartphone.
  • SAD_7: I feel impatient when the Internet connection speed on my smartphone is slow.
  • SAD_8: I feel lonely when my smartphone does not ring or vibrate for several hours.
  • SAD_9: Using my smartphone relieves me of my stress.
  • SAD_10: I feel excited when I have a new message or notification.
  • SAD_11: Using my smartphone makes me feel happy.
  • SAD_12: I find it tough to focus whenever my smartphone is nearby.
  • SAD_13: I become less attentive to my surroundings when I’m using my smartphone.

This code imported the Demographic, SUQ, and SAD raw data. (click to expand)

survey_raw = utils::read.csv("raw_survey.csv", header = TRUE)

# reorganize rows so that the data is ordered by the participant number (e.g. 1, 2, 3, 4...)
survey_raw = dplyr::arrange(survey_raw, sur_participant)

survey_raw

5. Testing Tracking Sheet Raw Data

The Testing Tracking Sheet Raw Data was collected during testing by the experimenter to store relevant information on participant’s (1) General Information (e.g. testing start date), (2) Manipulation Condition (e.g. assigned location and power manipulation), and (3) Data Cleaning (e.g. exclusion critera from testing error or other non-task-specific criteria). The raw data from the tracking sheet was imported as a dataframe, “track_raw”, and included the following: (click to see description)

1. General Information:

  • Testing Date [track_date]

    • The date a participant completed the study (Y/M/D)
  • Testing Time [date_end]

    • The time a participant was scheduled to complete the study (24hr-time). Note: Many participants were tested at the same time, but in separate cubicle-type desks to avoid any group-testing dynamic
  • Participant number [track_participant]

2. Manupulation Condition:

  • Smartphone Location [track_location]

    • The smartphone location assigned to the participant (i.e., desk, pocket/bag, or outside)

      coding information (click to expand)

      code 1 2 3
      response Desk Pocket/Bag Outside
  • Smartphone Power [track_power]

    • The smartphone power assigned to the participant (i.e., ON or OFF)

      coding information (click to expand)

      code 1 2
      response On Off
  • Smartphone Condition [track_condition]

    • The smartphone condition (i.e., combination of location & power) assigned to the participant

      coding information (click to expand)

      code 1 2 3 4 5 6
      response ON & Desk ON & Pocket/Bag ON & Outside OFF & Desk OFF & Pocket/Bag OFF & Outside
  • Counter Balanced Order [balance]

    • This depicts which of the main tasks (i.e., either the OSpan task or the Cue-Dependent Go/No-Go Task) was completed first

      coding information (click to expand)

      code 1 2
      response OSpan Task completed first Cue-Dependent Go/No-Go Task completed first

3. Data Cleaning:

  • Exclude from Analysis [exclude]

    • This depicts whether a participant should be excluded from analysis due to testing error or other non-task-specific criteria (i.e., for reasons other than the task exclusion criteria in the data analysis plan).

      coding information (click to expand)

      code 1 2
      response Yes (participant excluded) No (participant included)
  • Reason for Exclusion from Analysis [reason]

    • This depicts the reason (summarized into 5 categories) a participant should be excluded from analysis due to testing error or other non-task-specific criteria (i.e., for reasons other than the task exclusion criteria in the data analysis plan). Note: this includes a category for “no exclusion issues”, where a participant was not excluded

      coding information (click to expand)

      code 1 2 3 4 5
      response no exclusion issues external distractor (noise) gng incomplete gng missing ospan missing
  • Possible Outlier [pos_out]

    • This depicts whether a participant is a possible outlier for exploratory analyses. Note: this was not applied to any analysis unless explicitly stated

      coding information (click to expand)

      code 1 2
      response Yes (does qualify as a possible outlier) No (does not qualify as a possible outlier)
  • Experimenter [experimenter]

    • This depicts which experimenter (3 total) collected the data for a participant. Note: this was collected for exploratory analyses and was not used unless explicitly stated

      coding information (click to expand)

      code 1 2 3
      response ARP SW LM

This code imported the study tracking sheet raw data. (click to expand)

track_raw = utils::read.csv("raw_tracking.csv", header = TRUE)

# reorganize rows so that the data is ordered by the participant number (e.g. 1, 2, 3, 4...)
track_raw = dplyr::arrange(track_raw, track_participant)

track_raw

Construct Working Dataframes

In order to complete the present study’s analyses, eight data frames were created:

  1. All Data
  2. All Data - Go/No-Go separated by Cue Type (WIDE FORMAT)
  3. All Data - Go/No-Go separated by Cue Type (Long FORMAT)
  4. OSpan Analysis
  5. Go/No-Go: Error Analysis
  6. Go/No-Go: Accuracy Analysis
  7. SAD: Factor Analysis
  8. Moderation Analysis

Data Cleaning was completed for (1)-(3) based on: (a) Exclusion criteria, (b) Outlier data, and (c) Incomplete/Missing data:

  1. Exclusion Criteria

    • OSpan Data: Math Criteria Planned exclusion criteria was applied to the data. Exclusion criteria for this analysis helped to control for any participants who did not follow the instructions, specifically, that participant payed attention to both the math component and letter recall component of the task. Only data from those who scored 85% or higher on the math component of the task were included.

    • Go/No-Go Data: Chance Performance Criteria Planned exclusion criteria was applied to the data. Exclusion criteria for this analysis helped to control for any participants who did not follow the instructions. Only data from those who scored higher than chance performance (i.e., responded to at least 50% of “Go” trials and withheld response to at least 50% of “No-Go” trials) were used for the final analysis.

    • Go/No-Go Data: Standard Deviation Criteria Planned exclusion criteria was applied to the data. Exclusion criteria for this analysis helped to control for any participants who did not follow the instructions. Only data from those who scored higher than chance performance (i.e., responded to at least 50% of “Go” trials and withheld response to at least 50% of “No-Go” trials) were used for the final analysis.

  2. Outlier Data

    • OSpan & Go/No-Go Data: For all task data, any extreme outliers were removed. The outliers were identified and removed using the standard deviation (SD) method. Participants who’s score was greater than 2SDs from the mean were excluded.
  3. Incomplete/Missing Data

    • For all task data, any participants with incomplete or missing data were removed.

1. All Data

All the data from the present study in one data frame: “all_data” (click to see more)

  • This data included the following:

    • participant, track_date, track_time, location, power, condition, balance, exclude, reason, pos_out, experimenter, ospan_ID, ospan_absolute_score, ospan_total_corr, ospan_math_err_tot, ospan_math_err_speed, ospan_math_err_accuracy, ospan_math_percent_corr, ospan_math_criteria_85, age, lang, gng_trial_tot, gng_score, gng_resp_tot, gng_no_resp_tot, gng_err_tot, gng_err_com, gng_err_omi, gng_rt_av, date_start, date_end, duration.s., date_rec, gender, program, year, SUQ_use_1, SUQ_use_2, SUQ_use_3, SUQ_use_4, SUQ_use_5, SUQ_use_6, SUQ_use_7, SUQ_exp_1, SUQ_exp_2, SUQ_exp_3, SUQ_exp_4, SAD_1, SAD_2, SAD_3, SAD_4, SAD_5, SAD_6, SAD_7, SAD_8, SAD_9, SAD_10, SAD_11, SAD_12, SAD_13, SAD_score, SAD_dep_score, SAD_ea_score, SAD_access_score, SAD_dist_score, SUQ_abext_score, SUQ_duract_score

This code organized the raw data into one data frame. (click to expand)


# Find the final participant list for the analyses  
  # get a list of participants that (1) completed the gng task and (2) are not excluded (i.e., inclusion list) based on the "exclude" column in the tracking sheet  
  # (1) First get the gng participant list  
  gng_par_list = gng_data_extracted$gng_participant  
  # (2) Next, get the inclusion list  
  include_par_list = data.frame(track_raw$track_participant[which(track_raw$exclude %in% 2)])  
    colnames(include_par_list) = c("par_list")
  # Get a list of all the participants who completed the study (from the testing tracking sheet)
  all_participants = as.data.frame(track_raw$track_participant)
    colnames(all_participants) = c("all_participants")
  
# find the participants to remove from the data
  # Filter each data list from the list of all participants  
  gng_remove = data.frame("all_participants" = filter(all_participants, !(all_participants %in% gng_par_list)))
  track_remove = data.frame("all_participants" = filter(all_participants, !(all_participants %in% include_par_list$par_list)))
  reason_remove = c(rep("gng", length(gng_remove$all_participants)), rep("track", length(track_remove$all_participants)))
  
  # Make a list of all the removed participants -- "all_remove"
  all_remove_notclean = data.frame(rbind(gng_remove, track_remove), reason_remove)
    # remove any duplicates from the list
    all_remove_nodup = all_remove_notclean[!duplicated(all_remove_notclean$all_participants), ]
    # reorganize rows so that the data is ordered by the participant number (e.g. 1, 2, 3, 4...)  
    all_remove = dplyr::arrange(all_remove_nodup, all_participants)  
    
# Filter raw data using the final participant list for: ospan_raw, gng_raw, survey_raw, and track_raw
  # This will now be the working data for the study (i.e. the "final" data)  
    # OSpan Data  
    ospan_final = dplyr::filter(ospan_raw, !ospan_participant %in% all_remove$all_participants)
    # Gng Data  
    gng_final = dplyr::filter(gng_data_extracted, !gng_participant %in% all_remove$all_participants)
    # Survey Data  
    survey_final = dplyr::filter(survey_raw, !sur_participant %in% all_remove$all_participants)
    # Tracking Data  
    track_final = dplyr::filter(track_raw, !track_participant %in% all_remove$all_participants)

# Combine all the final data into 1 data frame with the following:
  # Start by creating a data frame with the final participant list in the 1st column: "participant"
  all_par = as.data.frame(track_final$track_participant)
    colnames(all_par) = c("participant")

  # to use the merge function, change the respective "participant" column name in each data frame to "participant"
    # OSpan Data
    colnames(ospan_final)[which(names(ospan_final) == "ospan_participant")] = c("participant")
    # Gng Data
    colnames(gng_final)[which(names(gng_final) == "gng_participant")] = c("participant")
    # Survey Data
    colnames(survey_final)[which(names(survey_final) == "sur_participant")] = c("participant")
    # Tracking Data
    colnames(track_final)[which(names(track_final) == "track_participant")] = c("participant")

# Merge all 4 data frames into 1 data frame -- "all_data_temp" 
  # this will be refined with data cleaning later
  all_data_temp <-  Reduce(function(x, y) merge(x, y, by="participant"), list(track_final, ospan_final, gng_final, dplyr::select(survey_final, -c(sur_location, sur_power, sur_condition))))
    # rename the tracking sheet columns for the location, power, and condition 
    colnames(all_data_temp)[which(names(all_data_temp) == c("track_location", "track_power", "track_condition"))] = c("location", "power", "condition")
    
# Add score for SAD items
  #SAD score over all 13 items
    all_data_temp <- 
      all_data_temp %>% rowwise() %>% dplyr::mutate(SAD_score = sum(c_across(SAD_1:SAD_13)))
    # all_data_temp %>% 
    # # for each participant
    # rowwise() %>% 
    # # score is calculated by adding up all 13 items for a total sum score for each p
    # mutate(SAD_score = sum(SAD_1, SAD_2, SAD_3, SAD_4, SAD_5, SAD_6, SAD_7, SAD_8, SAD_9, SAD_10, SAD_11, SAD_12, SAD_13))
  
  # Smartphone Dependence subscale (items: 1, 2, 3) -- based on FA 
    all_data_temp <- all_data_temp %>% rowwise() %>% dplyr::mutate(SAD_dep_score = sum(c_across(SAD_1:SAD_3)))

  # Emotional Attachment subscale (items: 8, 9, 10, 11) -- based on FA 
    all_data_temp <- all_data_temp %>% rowwise() %>% dplyr::mutate(SAD_ea_score = sum(c_across(SAD_8:SAD_11)))
  
  # Smartphone Accessibility subscale (items: 5, 6, 7) -- based on FA 
    all_data_temp <- all_data_temp %>% rowwise() %>% dplyr::mutate(SAD_access_score = sum(c_across(SAD_5:SAD_7)))
    
  # Smartphone Distractibility subscale (items: 12, 13) -- based on FA 
    all_data_temp <- all_data_temp %>% rowwise() %>% dplyr::mutate(SAD_dist_score = sum(c_across(SAD_12:SAD_13)))
    
# Add score for SUQ_use_4-5
all_data_temp <- all_data_temp %>% rowwise() %>% dplyr::mutate(SUQ_abext_score = sum(SUQ_use_4, SUQ_use_5)/2)
  all_data_temp %>% 
  # for each participant
  rowwise() %>% 
  # score is calculated as average between 2 items -- measures ps tendency to turn to smartphone in absence of external stimulation
  mutate(SUQ_abext_score = (sum(SUQ_use_4, SUQ_use_5))/2)

# Add score for SUQ_use_6-7
all_data_temp <- all_data_temp %>% rowwise() %>% dplyr::mutate(SUQ_duract_score = sum(SUQ_use_6, SUQ_use_7)/2)
  all_data_temp %>% 
  # for each participant  
  rowwise() %>% 
  # score is calculated as average between 2 items -- measures ps tendency to turn to smartphone during other activities
  mutate(SUQ_duract_score = (sum(SUQ_use_6, SUQ_use_7))/2)
  
all_data_temp

Data Cleaning: Exclusion Criteria, Outliers, and Missing Data

The final sample size (N = 383) for this data frame excluded 32 participants who met the exclusion criteria (OSpan math criteria: 20; Go/No-Go response criteria, horizontal/no-go cue: 6; Go/No-Go response criteria, vertical/go cue: 6). A total of 0 participants were identified as outliers (OSpan: 0; Go/No-Go Error Analysis: 0; Go/No-Go Accuracy Analysis: 0) and removed. A total of 6 participants were removed due to incomplete or missing data (OSpan: 3; Go/No-Go Error Analysis: 0; Go/No-Go Accuracy Analysis: 3). This created the “all_data” data frame (click to see more).

This code cleaned all_data based on (1) exclusion criteria, (2) outliers, and (3) incomplete/missing data. (click to see code)

# get list of ps to remove for various reasons...
  
# Exclusion criteria
  # For OSPAN
  # ps scored <85% on math
  dc_ospan_ec <- 
    all_data_temp %>% 
    # apply filter to math criteria
    filter(ospan_math_criteria_85 == 0) %>% 
    # remove all rows except participant ID
    select(participant)

  # For GNG -- based on raw data separated by cue type (i.e., either vertical or horizontal)
  # Vertical cue type ("go")
  dc_gng_v_ec <- 
    gng_data_extracted_v %>% 
    # apply filter to response criteria
    filter(exclude_v == 1) %>% 
    # remove all rows except participant ID
    select(participant_long) %>% 
    # rename column as "participant"
    dplyr::rename(participant = participant_long)
  # Vertical cue type ("no-go")
  dc_gng_h_ec <- 
    gng_data_extracted_h %>% 
    # apply filter to response criteria
    filter(exclude_h == 1) %>% 
    # remove all rows except participant ID
    select(participant_long) %>% 
    # rename column as "participant"
    dplyr::rename(participant = participant_long)

# Outliers
  # For OSPAN
  dc_ospan_out <- 
    all_data_temp %>% 
    # remove ps with score > 2SD from mean
    filter(ospan_absolute_score > (mean(ospan_absolute_score, na.rm = T) + (2 * sd(ospan_absolute_score, na.rm = T))) |
           ospan_absolute_score < (mean(ospan_absolute_score, na.rm = T) - (2 * sd(ospan_absolute_score, na.rm = T)))
           ) %>% 
    # remove all rows except participant ID
    select(participant)
  
  # For GNG error (omission)
  dc_gng_err_out <- 
    all_data_temp %>% 
    # remove ps with score > 2SD from mean
    filter(gng_err_omi > (mean(gng_err_omi, na.rm = T) + (2 * sd(gng_err_omi, na.rm = T))) |
           gng_err_omi < (mean(gng_err_omi, na.rm = T) - (2 * sd(gng_err_omi, na.rm = T)))
           ) %>% 
    # remove all rows except participant ID
    select(participant)
  
  # For GNG accuracy (average RT)
  dc_gng_ac_out <- 
    all_data_temp %>% 
    # remove ps with score > 2SD from mean
    filter(gng_rt_av > (mean(gng_rt_av, na.rm = T) + (2 * sd(gng_rt_av, na.rm = T))) |
           gng_rt_av < (mean(gng_rt_av, na.rm = T) - (2 * sd(gng_rt_av, na.rm = T)))
           ) %>% 
    # remove all rows except participant ID
    select(participant)
  
# Missing/incomplete data
  # For OSPAN
  dc_ospan_miss <- all_data_temp %>% filter(is.na(ospan_absolute_score)) %>% select(participant)
  # For GNG error (omission)
  dc_gng_err_miss <- all_data_temp %>% filter(is.na(gng_err_omi)) %>% select(participant)
  # For GNG RT (average)
  dc_gng_ac_miss <- all_data_temp %>% filter(is.na(gng_rt_av)) %>% select(participant)
    
  
# create list of ps to remove with reason for removal -- all_remove_dc
all_remove_dc <- data.frame("participant" = bind_rows(dc_ospan_ec, dc_ospan_out, dc_ospan_miss, 
                                                      dc_gng_v_ec, dc_gng_h_ec, 
                                                      dc_gng_err_out, dc_gng_err_miss, 
                                                      dc_gng_ac_out, dc_gng_ac_miss), 
                            "reason" = c(rep("ospan_ec", nrow(dc_ospan_ec)), rep("ospan_out", nrow(dc_ospan_out)), 
                                         rep("ospan_miss", nrow(dc_ospan_miss)), rep("gng_v_ec", nrow(dc_gng_v_ec)),
                                         rep("gng_h_ec", nrow(dc_gng_h_ec)),rep("gng_err_out", nrow(dc_gng_err_out)), 
                                         rep("gng_err_miss", nrow(dc_gng_err_miss)), rep("gng_ac_out", nrow(dc_gng_ac_out)), 
                                         rep("gng_ac_miss", nrow(dc_gng_ac_miss)) 
                                         )
                            )
# create list of ps to remove without duplicates to use to filter final data
all_remove_dc_list <- 
  all_remove_dc %>% 
  # remove reason column
  select(participant) %>% 
  # remove any duplicated participant IDs
  distinct()
# clean all_data using all_remove_dc_list
all_data <- 
  all_data_temp %>% 
  filter(!participant %in% all_remove_dc_list$participant)

all_data

2. All Data - Go/No-Go separated by Cue Type (WIDE FORMAT)

All the data from the present study in one data frame, with the Go/No-Go data separated by target type (i.e., either “go” or “no-go”), presented in WIDE FORMAT: “all_data_cue” (click to see more)

  • This data included the following:

    • participant, track_date, track_time, location, power, condition, balance, exclude, reason, pos_out, experimenter, ospan_ID, ospan_absolute_score, ospan_total_corr, ospan_math_err_tot, ospan_math_err_speed, ospan_math_err_accuracy, ospan_math_percent_corr, ospan_math_criteria_85, age_long, lang_long, trial_tot_v, score_v, resp_tot_v, no_resp_tot_v, err_tot_v, err_com_v, err_omi_v, rt_av_v, exclude_v, trial_tot_h, score_h, resp_tot_h, no_resp_tot_h, err_tot_h, err_com_h, err_omi_h, rt_av_h, exclude_h, date_start, date_end, duration.s., date_rec, gender, program, year, SUQ_use_1, SUQ_use_2, SUQ_use_3, SUQ_use_4, SUQ_use_5, SUQ_use_6, SUQ_use_7, SUQ_exp_1, SUQ_exp_2, SUQ_exp_3, SUQ_exp_4, SAD_1, SAD_2, SAD_3, SAD_4, SAD_5, SAD_6, SAD_7, SAD_8, SAD_9, SAD_10, SAD_11, SAD_12, SAD_13, SAD_score, SAD_dep_score, SAD_ea_score, SAD_access_score, SAD_dist_score, SUQ_abext_score, SUQ_duract_score

This code organized the raw data into one data frame. (click to expand)

# use the list of all the participants to remove -- "all_remove" & "all_remove_dc_list" -- to filter the raw gng long data
  # for the "go" targets
  gng_v_final <- 
  gng_data_extracted_v %>% 
  # remove using use all_remove
  filter(!participant_long %in% all_remove$all_participants) %>% 
  # remove using use all_remove_dc_list
  filter(!participant_long %in% all_remove_dc_list$participant)
  
  # for the "no-go" targets
  gng_h_final <- 
  gng_data_extracted_h %>% 
  # remove using use all_remove
  filter(!participant_long %in% all_remove$all_participants) %>% 
  # remove using use all_remove_dc_list
  filter(!participant_long %in% all_remove_dc_list$participant)

# Combine all the final data into 1 data frame with the following:
  # use the final participant list -- "all_par" -- as the 1st column in the data frames

  # to use the merge function, change the respective "participant" column name in each data frame to "participant"
    # this was already done for the other data frames (e.g. ospan_final, survey_final)
    # for the "vertical" targets
    colnames(gng_v_final)[which(names(gng_v_final) == "participant_long")] = c("participant")
    # for the "horizontal" targets
    colnames(gng_h_final)[which(names(gng_h_final) == "participant_long")] = c("participant")

# Merge all 4 data frames into 1 data frame -- "all_data_cue"
  all_data_cue = Reduce(function(x, y) merge(x, y, by="participant"), list(track_final, ospan_final, gng_v_final, dplyr::select(gng_h_final, -c(age_long, lang_long)), dplyr::select(survey_final, -c(sur_location, sur_power, sur_condition))))
    # rename the tracking sheet columns for the location, power, and condition 
    colnames(all_data_cue)[which(names(all_data_cue) == c("track_location", "track_power", "track_condition"))] = c("location", "power", "condition")

# Add score for SAD items
  #SAD score over all 13 items
    all_data_cue <- 
    all_data_cue %>% 
    # for each participant
    rowwise() %>% 
    # score is calculated by adding up all 13 items for a total sum score for each p
    mutate(SAD_score = sum(SAD_1, SAD_2, SAD_3, SAD_4, SAD_5, SAD_6, SAD_7, SAD_8, SAD_9, SAD_10, SAD_11, SAD_12, SAD_13))
  
  # Smartphone Dependence subscale (items: 1, 2, 3, 4, 5) -- based on FA 
    all_data_cue <- 
    all_data_cue %>% 
    # for each participant
    rowwise() %>% 
    # score is calculated by adding up items 1-5 for a total sum score for each p
    mutate(SAD_dep_score = sum(SAD_1, SAD_2, SAD_3))
    
  # Emotional Attachment subscale (items: 8, 9, 10, 11) -- based on FA 
    all_data_cue <- 
    all_data_cue %>% 
    # for each participant
    rowwise() %>% 
    # score is calculated by adding up items 8-11 for a total sum score for each p
    mutate(SAD_ea_score = sum(SAD_8, SAD_9, SAD_10, SAD_11))
    
  # Smartphone Accessability subscale (items: 5, 6, 7) -- based on FA 
    all_data_cue <- 
    all_data_cue %>% 
    # for each participant
    rowwise() %>% 
    # score is calculated by adding up all 13 items for a total sum score for each p
    mutate(SAD_access_score = sum(SAD_5, SAD_6, SAD_7))
    
  # Smartphone Distractibility subscale (items: 7, 12, 13) -- based on FA 
    all_data_cue <- 
    all_data_cue %>% 
    # for each participant
    rowwise() %>% 
    # score is calculated by adding up all 13 items for a total sum score for each p
    mutate(SAD_dist_score = sum(SAD_12, SAD_13))

# Add score for SUQ_use_4-5
all_data_cue <- 
  all_data_cue %>% 
  # for each participant
  rowwise() %>% 
  # score is calculated as average between 2 items -- measures ps tendency to turn to smartphone in absence of external stimulation
  mutate(SUQ_abext_score = (sum(SUQ_use_4, SUQ_use_5))/2)

# Add score for SUQ_use_6-7
all_data_cue <- 
  all_data_cue %>% 
  # for each participant  
  rowwise() %>% 
  # score is calculated as average between 2 items -- measures ps tendency to turn to smartphone during other activities
  mutate(SUQ_duract_score = (sum(SUQ_use_6, SUQ_use_7))/2)

all_data_cue
NA

3. All Data - Go/No-Go separated by Cue Type (LONG FORMAT)

All the data from the present study in one data frame, with the Go/No-Go data separated by target type (i.e., either “go” or “no-go”), presented in LONG FORMAT: “all_data_cue_long” (click to see more)

  • This data included the following:

    • cue, participant, track_date, track_time, location, power, condition, balance, exclude, reason, pos_out, experimenter, ospan_ID, ospan_absolute_score, ospan_total_corr, ospan_math_err_tot, ospan_math_err_speed, ospan_math_err_accuracy, ospan_math_percent_corr, ospan_math_criteria_85, age_long, lang_long, gng_trial_tot_long, gng_score_long, gng_resp_tot_long, gng_no_resp_tot_long, gng_err_tot_long, gng_err_com_long, gng_err_omi_long, gng_rt_av_long, gng_exclude_long, date_start, date_end, duration.s., date_rec, gender, program, year, SUQ_use_1, SUQ_use_2, SUQ_use_3, SUQ_use_4, SUQ_use_5, SUQ_use_6, SUQ_use_7, SUQ_exp_1, SUQ_exp_2, SUQ_exp_3, SUQ_exp_4, SAD_1, SAD_2, SAD_3, SAD_4, SAD_5, SAD_6, SAD_7, SAD_8, SAD_9, SAD_10, SAD_11, SAD_12, SAD_13

This code organized the raw data into one data frame. (click to expand)

# Re-structure the data into 1 data frame in a long format for a repeated measures analysis

  # using the "final" data frames of all the data, make 2 data frames, one for the "vertical" (all_data_v) and one for the "horizontal" (all_data_h) targets
    # for the "vertical" targets -- all_data_v
    all_data_v = Reduce(function(x, y) merge(x, y, by="participant"), list(track_final, ospan_final, gng_v_final, dplyr::select(survey_final, -c(sur_location, sur_power, sur_condition))))
      # rename the tracking sheet columns for the location, power, and condition 
      colnames(all_data_v)[which(names(all_data_v) == c("track_location", "track_power", "track_condition"))] = c("location", "power", "condition")
    # for the "horizontal" targets -- all_data_h
    all_data_h = Reduce(function(x, y) merge(x, y, by="participant"), list(track_final, ospan_final, gng_h_final, dplyr::select(survey_final, -c(sur_location, sur_power, sur_condition))))
      # rename the tracking sheet columns for the location, power, and condition 
      colnames(all_data_h)[which(names(all_data_h) == c("track_location", "track_power", "track_condition"))] = c("location", "power", "condition")

  # rename target-specific columns for both data frames (to bind)
    # for the "go" trials
    names(all_data_v)[names(all_data_v) == "trial_tot_v"] = "gng_trial_tot_long" 
    names(all_data_v)[names(all_data_v) == "score_v"] = "gng_score_long" 
    names(all_data_v)[names(all_data_v) == "resp_tot_v"] = "gng_resp_tot_long" 
    names(all_data_v)[names(all_data_v) == "no_resp_tot_v"] = "gng_no_resp_tot_long" 
    names(all_data_v)[names(all_data_v) == "err_tot_v"] = "gng_err_tot_long" 
    names(all_data_v)[names(all_data_v) == "err_com_v"] = "gng_err_com_long" 
    names(all_data_v)[names(all_data_v) == "err_omi_v"] = "gng_err_omi_long" 
    names(all_data_v)[names(all_data_v) == "rt_av_v"] = "gng_rt_av_long" 
    names(all_data_v)[names(all_data_v) == "exclude_v"] = "gng_exclude_long"
    # for the "no-go" trials
    names(all_data_h)[names(all_data_h) == "trial_tot_h"] = "gng_trial_tot_long" 
    names(all_data_h)[names(all_data_h) == "score_h"] = "gng_score_long" 
    names(all_data_h)[names(all_data_h) == "resp_tot_h"] = "gng_resp_tot_long" 
    names(all_data_h)[names(all_data_h) == "no_resp_tot_h"] = "gng_no_resp_tot_long" 
    names(all_data_h)[names(all_data_h) == "err_tot_h"] = "gng_err_tot_long" 
    names(all_data_h)[names(all_data_h) == "err_com_h"] = "gng_err_com_long" 
    names(all_data_h)[names(all_data_h) == "err_omi_h"] = "gng_err_omi_long" 
    names(all_data_h)[names(all_data_h) == "rt_av_h"] = "gng_rt_av_long" 
    names(all_data_h)[names(all_data_h) == "exclude_h"] = "gng_exclude_long"
  
  # Create a data frame depicting the target type: either "go" (coded as 1) or "no-go" (coded as 2)
  cue_type = as.data.frame(c(rep(1, length(all_data_v$participant)), rep(2, length(all_data_h$participant))))
    colnames(cue_type) = c("cue")  

  # Bind the go and no-go data frames into 1 (in long format)
  gng_cue_bind = dplyr::bind_rows(all_data_v, all_data_h)
  
  # add the target type column depicting the target type: either "go" (coded as 1) or "no-go" (coded as 2)
  all_data_cue_long = dplyr::bind_cols(cue_type, gng_cue_bind)
  
  all_data_cue_long

4. OSpan Analysis

All the data related to the OSpan Analysis from the present study in one data frame: “ospan_analysis” (click to see more)

  • This data included the following:

    • participant, age, gender, lang, program, year, location, power, balance, ospan_absolute_score, ospan_math_percent_corr, ospan_math_criteria_85

This code organized the final data into the ospan-related data frame. (click to expand)


# make a new data frame for the ospan analysis
ospan_analysis = dplyr::select(all_data, participant, age, gender, lang, program, year, location, power, balance, ospan_absolute_score, ospan_math_percent_corr, ospan_math_criteria_85)


ospan_analysis
NA

5. Go/No-Go: Error Analysis

All the data related to the Go/No-Go Error Analysis from the present study in one data frame: “gng_err_analysis”

Note: this data is in a long format for a repeated measures analysis (i.e., based on target type: either “go” or “no-go”). (click to see more)

  • This data included the following:

    • participant, age_long, gender, lang_long, program, year, location, power, condition, balance, cue, gng_exclude_long, gng_trial_tot_long, gng_resp_tot_long, gng_no_resp_tot_long, gng_err_tot_long, gng_err_com_long, gng_err_omi_long

This code organized the final long data into a data frame for the Go/No-Go Error Analysis. (click to expand)

# make a new data frame for the gng error analysis
gng_err_analysis = dplyr::select(all_data_cue_long, participant, age_long, gender, lang_long, program, year, location, power, condition, balance, cue, gng_exclude_long, gng_trial_tot_long, gng_resp_tot_long, gng_no_resp_tot_long, gng_err_tot_long, gng_err_com_long, gng_err_omi_long)

gng_err_analysis
NA

6. Go/No-Go: Accuracy Analysis

All the data related to the Go/No-Go Accuracy Analysis from the present study in one data frame: “gng_ac_analysis” (click to see more)

  • This data included the following:

    • participant, age, gender, lang, program, year, location, power, balance, gng_trial_tot, gng_resp_tot, gng_no_resp_tot, gng_rt_av

This code organized the final data into a data frame related to the Go/No-Go Accuracy Analysis. (click to expand)


# make a new data frame for the gng accuracy analysis
gng_ac_analysis = dplyr::select(all_data, participant, age, gender, lang, program, year, location, power, balance, gng_trial_tot, gng_resp_tot, gng_no_resp_tot, gng_rt_av) 

gng_ac_analysis

7. SAD: Factor Analysis

All the data related to the Factor Analysis from the present study in one data frame: “factor_analysis” (click to see more)

  • This data included the following:

    • participant, age, gender, lang, program, year, location, power, balance, SAD_1, SAD_2, SAD_3, SAD_4, SAD_5, SAD_6, SAD_7, SAD_8, SAD_9, SAD_10, SAD_11, SAD_12, SAD_13, SAD_score

This code organized the final data into the factor analysis-related data frame. (click to expand)


# make a new data frame for the factor analysis
factor_analysis = dplyr::select(all_data, participant, age, gender, lang, program, year, location, power, balance, SAD_1:SAD_13, SAD_score)

factor_analysis
NA

8. Moderation Analysis

All the data related to the Moderation Analysis from the present study in one data frame: “mod_analysis” (click to see more)

  • This data included the following:

    • participant, ospan_absolute_score, age, gender, lang, program, year, location, power, balance, SAD_score, SAD_dep_score, SAD_ea_score, SAD_access_score, SAD_dist_score, SUQ_abext_score, SUQ_duract_score, SUQ_use_1, SUQ_use_2, SUQ_use_3, SUQ_exp_1, SUQ_exp_2, SUQ_exp_4, location_desk, location_out

This code organized the final data into the moderation-related data frame. (click to expand)


# make a new data frame for the glm analysis
mod_analysis = dplyr::select(all_data, participant, ospan_absolute_score, age, gender, lang, program, year, location, power, balance, SAD_score, SAD_dep_score, SAD_ea_score, SAD_access_score, SAD_dist_score, SUQ_abext_score, SUQ_duract_score, SUQ_use_1, SUQ_use_2, SUQ_use_3, SUQ_exp_1, SUQ_exp_2, SUQ_exp_4)

mod_analysis
NA

Descriptive Statistics

The following figures and tables depict the descriptive statistics for the present study’s data. This was done across all conditions (Toltal) and divided by condition: smartphone location (i.e., on desk, in pocket/bag, or outside) and power (i.e., on or off). For any continuous variables, the table(s) depict the mean (M), standard deviation (SD), minimum score (min.), maximum score (max.), and the sample size (n). For any nominal variables, the table(s) depict the variable, levels, and frequency for each level. This was done for the following:

  1. Demographics Data

    • Continuous variables (i.e., age)
    • Nominal variables (i.e., Sample Size, Balance, Gender, Language, Program, and Year)
  2. Task Data

    • OSpan Data (i.e., absolute score)
    • Go/No-Go Data (i.e., omission errors, average reaction time)
  3. Questionnaire Data

    • SUQ Data (e.g., typical smartphone usage)
    • SAD Data (i.e., SAD score)

This code organized the descriptive statistic data sets (click to see code)

This code organized the final data into the descriptive statistics for the Demographic Data (click to see code).

# make new data frame for descriptive stats
all_data_demo_des <- 
  all_data %>% 
  
  # subset data for descriptive variables only
  select(location, power, condition, balance, age, gender, lang, program, year) %>% 
  
  # set some variables as factors
  mutate(location = factor(location, levels = c(1:3), labels = c("Desk", "Pocket/Bag", "Outside"))) %>% 
  mutate(power = factor(power, levels = c(1:2), labels = c("On", "Off"))) %>% 
  mutate(condition = factor(condition, levels = c(1:6), labels = c("On & Desk", "On & Pocket/Bag", "On & Outside", "Off & Desk", "Off & Pocket/Bag", "Off & Outside"))) %>% 
  mutate(balance = factor(balance, levels = c(1:2), labels = c("Ospan Frist", "GnG First"))) %>% 
  mutate(gender = factor(gender, levels = c(1:2), labels = c("Male", "Female"))) %>% 
  mutate(lang = factor(lang, levels = c(0, 1), labels = c("Other", "English"))) %>% 
  mutate(program = factor(program, levels = c(3, 4, 13, 14, 28, 32, 37, 12, 17, 19, 15, 20, 24, 23, 26, 1, 2, 5, 7, 8, 11, 16, 18, 22, 27, 29, 33, 36, 6, 9, 10, 21, 25, 30, 31, 34, 35),
                          labels = c(rep("Arts & Humanities", 7), "Engineering", rep("Health Sciences", 2), rep("Information & Media Studies", 3), 
                                     "Medicine & Dentistry", "Music", rep("Science", 13), rep("Social Science", 9)))) %>% 
  mutate(year = factor(year, levels = c(1:4), labels = c("First Year", "Second Year", "Third Year", "Fourth Year")))

all_data_demo_des

This code organized the final data into the descriptive statistics for the Task Data (click to see code).

# make new data frame for descriptive stats
all_data_task_des <- 
  all_data %>% 
  # subset data for descriptive variables only
  select(location, power, condition, ospan_absolute_score, ospan_total_corr, ospan_math_percent_corr, gng_score, gng_err_omi, gng_rt_av)

all_data_task_des

This code organized the final data into the descriptive statistics for the Questionnaire Data (click to see code).

# make new data frame for descriptive stats
all_data_ques_des <- 
  all_data %>% 
  
  # subset data for descriptive variables only
  select(location, power, condition, SUQ_use_1:SUQ_use_3, SUQ_exp_1, SUQ_exp_2, SUQ_exp_4, 
         SAD_score, SAD_dep_score, SAD_ea_score, SAD_access_score, SAD_dist_score, SUQ_abext_score, SUQ_duract_score) %>% 
  
  # set some variables as factors
  mutate(SUQ_use_1 = factor(SUQ_use_1, levels = c(1:4), labels = c("0-5", "6-10", "11-15", ">15"))) %>% 
  mutate(SUQ_use_2 = factor(SUQ_use_2, levels = c(1:4), labels = c("0-5", "6-10", "11-15", ">15"))) %>% 
  mutate(SUQ_use_3 = factor(SUQ_use_3, levels = c(1:4), labels = c("0-5", "6-10", "11-15", ">15"))) %>% 
  mutate(SUQ_exp_1 = factor(SUQ_exp_1, levels = c(1:4), labels = c("$0-$20", "$21-$40", "$41-$60", ">$60")))

all_data_ques_des

Demographic Data

The following table depicts the continuous variable(s) (i.e., age) from all participants in the present study (click to see code).

Demographic Information by Condition and Overall: Age
ON
OFF
On Desk In Pocket/Bag Outside On Desk In Pocket/Bag Outside TOTAL
Age (years)
1 M 18.80 18.66 19.10 18.66 19.08 18.95 18.87
2 SD 0.91 0.88 2.67 1.05 1.33 1.05 1.43
3 min. 18.00 18.00 18.00 17.00 18.00 18.00 17.00
4 max. 21.00 21.00 38.00 22.00 23.00 22.00 38.00
5 n 70.00 67.00 59.00 58.00 65.00 64.00 383.00
Note:
Continuous descriptive statistics shown. 'TOTAL' column shows statistics across all conditions.

The following table depicts the nominal variables (i.e., Sample Size, Balance, Gender, Language, Program, and Year) from all participants in the present study (click to see code).

Demographic Data by Condition and Overall: Sample Size, Balance, Gender, Language, Program, and Year
ON
OFF
On Desk In Pocket/Bag Outside On Desk In Pocket/Bag Outside TOTAL
Sample Size
1 n 70 67 59 58 65 64 383
Balance
2 Ospan Frist 35 28 26 23 34 31 177
3 GnG First 35 39 33 35 31 33 206
Gender
4 Male 32 32 30 34 32 25 185
5 Female 38 35 29 24 33 39 198
First Language
6 Other 15 12 9 7 10 11 64
7 English 55 55 50 51 55 53 319
Year
8 First Year 46 48 41 42 44 42 263
9 Second Year 15 10 10 7 10 14 66
10 Third Year 2 5 3 6 5 7 28
11 Fourth Year 4 1 3 1 6 1 16
12 No Selection 3 3 2 2 0 0 10
Program
13 Arts & Humanities 2 2 0 2 2 1 9
14 Engineering 1 0 0 0 0 0 1
15 Health Sciences 10 8 14 6 10 10 58
16 Information & Media Studies 3 2 1 1 3 3 13
17 Medicine & Dentistry 14 8 10 14 12 12 70
18 Music 0 1 2 1 1 0 5
19 Science 18 17 13 12 11 18 89
20 Social Science 19 28 18 20 26 19 130
21 No Selection 3 1 1 2 0 1 8
Note:
Nominal descriptive statistics shown as frequencies. Row 1 depicts sample size across all variables. 'TOTAL' column depicts frequencies across all conditions.

Task Data

The following table depicts the OSpan and Go/No-Go variable(s) (i.e., OSpan Absolute Score, OSpan Total Correct, OSpan Math Percentage Correct, Go/No-Go Score, Go/No-Go Omission Errors, Go/No-Go Average Reaction Time) from all participants in the present study (click to see code).

Task Data by Condition and Overall: OSpan Absolute Score, OSpan Total Correct, OSpan Math Percentage Correct, Go/No-Go Score, Go/No-Go Omission Errors, Go/No-Go Average Reaction Time
ON
OFF
On Desk In Pocket/Bag Outside On Desk In Pocket/Bag Outside TOTAL
OSpan Task
Absolute Score
1 M 41.46 42.76 39.05 40.76 41.25 43.62 41.54
2 SD 16.22 15.44 18.25 18.93 17.33 15.68 16.90
3 min. 6.00 6.00 4.00 0.00 3.00 0.00 0.00
4 max. 75.00 75.00 75.00 75.00 75.00 75.00 75.00
5 n 70.00 67.00 59.00 58.00 65.00 64.00 383.00
Total Score
6 M 57.81 59.01 55.93 56.64 57.52 59.36 57.77
7 SD 12.72 11.71 14.27 14.88 12.34 12.91 13.09
8 min. 11.00 11.00 17.00 7.00 23.00 2.00 2.00
9 max. 75.00 75.00 75.00 75.00 75.00 75.00 75.00
10 n 70.00 67.00 59.00 58.00 65.00 64.00 383.00
Math Correct (%)
11 M 95.03 95.74 94.89 94.99 95.65 95.46 95.30
12 SD 3.42 3.06 3.22 3.46 3.17 3.32 3.27
13 min. 86.67 88.00 85.33 86.67 86.67 85.33 85.33
14 max. 100.00 100.00 100.00 100.00 100.00 100.00 100.00
15 n 70.00 67.00 59.00 58.00 65.00 64.00 383.00
Cue-Dependent Go/No-Go Task
Total Score
16 M 247.70 248.49 248.36 247.71 247.52 248.86 248.10
17 SD 6.81 2.37 2.10 4.43 10.35 1.62 5.62
18 min. 197.00 235.00 242.00 222.00 167.00 244.00 167.00
19 max. 250.00 250.00 250.00 250.00 250.00 250.00 250.00
20 n 70.00 67.00 59.00 58.00 65.00 64.00 383.00
Omission Errors
21 M 0.81 0.45 0.76 1.09 0.78 0.45 0.72
22 SD 2.38 1.06 1.52 3.55 3.80 0.96 2.46
23 min. 0.00 0.00 0.00 0.00 0.00 0.00 0.00
24 max. 15.00 5.00 8.00 23.00 30.00 5.00 30.00
25 n 70.00 67.00 59.00 58.00 65.00 64.00 383.00
Omission Errors for 'Go' Trials
26 M 0.70 0.36 0.51 0.97 0.57 0.31 0.56
27 SD 2.16 0.85 1.06 3.09 2.93 0.73 2.03
28 min. 0.00 0.00 0.00 0.00 0.00 0.00 0.00
29 max. 14.00 4.00 5.00 20.00 23.00 4.00 23.00
30 n 70.00 67.00 59.00 58.00 65.00 64.00 383.00
Omission Errors for 'No-Go' Trials
31 M 0.11 0.09 0.25 0.12 0.22 0.14 0.15
32 SD 0.36 0.29 0.68 0.50 0.93 0.39 0.56
33 min. 0.00 0.00 0.00 0.00 0.00 0.00 0.00
34 max. 2.00 1.00 3.00 3.00 7.00 2.00 7.00
35 n 70.00 67.00 59.00 58.00 65.00 64.00 383.00
Average Reaction Time
36 M 1.66 1.66 1.66 1.67 1.66 1.66 1.66
37 SD 0.05 0.06 0.04 0.09 0.06 0.06 0.06
38 min. 1.58 1.56 1.56 1.57 1.56 1.56 1.56
39 max. 1.88 1.80 1.77 2.21 1.84 1.87 2.21
40 n 70.00 67.00 59.00 58.00 65.00 64.00 383.00
Note:
Continuous descriptive statistics shown for Task data (i.e., OSpan Task and Cue-Dependent Go/No-Go Task). 'TOTAL' column shows statistics across all conditions.

Questionnaire Data

The following table depicts the SUQ and SAD variable(s) for both continuous (i.e., SUQ score for smartphone use with absence of other stimulation, SUQ score for smartphone use during other actions, SAD score) and nominal (i.e., Average text messages per day, Average social media based messages sent per day, Average social media based posts per day, Smartphone subjective value, Types of notifications, Phantom vibrations) variables from all participants in the present study (click to see code).

The continuous data includes:

  • SUQ score for smartphone use with absence of other stimulation

  • SUQ score for smartphone use during other actions

  • SAD score (i.e., attachment and dependency to one’s smartphone):

    • \(SAD_{overall}\): SAD score for all 13 items

    • \(SAD_{Dep}\): SAD score for the Smartphone Dependence subscale (items: 1, 2, 3, 4, 5)

    • \(SAD_{EA}\): SAD score for the Emotional Attachment subscale (items: 8, 9, 10, 11)

    • \(SAD_{Dist}\): SAD score for the Smartphone Distractibility subscale (items: 7, 12, 13)

      This code organized the continuous data (click to see code). </summary)

Questionnaire Data by Condition and Overall: Continuous Variables
ON
OFF
On Desk In Pocket/Bag Outside On Desk In Pocket/Bag Outside TOTAL
Smartphone Usage Questionnaire (SUQ)
Tendency to use one's smartphone in the abscence of other stimulation
1 M 6.12 6.28 6.21 6.19 6.32 6.36 6.25
2 SD 1.14 1.06 1.04 1.04 0.92 0.97 1.03
3 min. 1.50 2.00 2.50 4.00 3.50 3.50 1.50
4 max. 7.00 7.00 7.00 7.00 7.00 7.00 7.00
5 n 70.00 67.00 59.00 58.00 65.00 64.00 383.00
Tendency to use one's smartphone in the during other actions
6 M 2.71 2.93 2.51 2.84 2.78 2.58 2.73
7 SD 1.24 1.38 1.06 1.18 1.24 1.16 1.22
8 min. 1.00 1.00 1.00 1.00 1.00 1.00 1.00
9 max. 6.00 7.00 6.50 6.00 7.00 7.00 7.00
10 n 70.00 67.00 59.00 58.00 65.00 64.00 383.00
Smartphone Attachment and Dependency Inventory (SAD)
Overall: Items #1-13
11 M 57.67 58.94 58.08 59.86 57.32 62.23 58.99
12 SD 15.32 13.58 14.60 13.15 12.39 12.46 13.66
13 min. 19.00 27.00 22.00 25.00 19.00 32.00 19.00
14 max. 89.00 87.00 86.00 87.00 79.00 87.00 89.00
15 n 70.00 67.00 59.00 58.00 65.00 64.00 383.00
Smartphone Dependence: Items #1-3
16 M 11.73 12.51 12.42 13.31 12.20 13.77 12.63
17 SD 5.57 4.61 5.07 4.39 4.38 3.98 4.73
18 min. 3.00 3.00 3.00 3.00 3.00 5.00 3.00
19 max. 21.00 21.00 21.00 21.00 20.00 21.00 21.00
20 n 70.00 67.00 59.00 58.00 65.00 64.00 383.00
Emotional Attachment: Items #8-11
21 M 16.80 16.30 16.39 16.55 16.43 17.88 16.73
22 SD 5.05 5.32 5.07 4.58 4.73 4.63 4.91
23 min. 4.00 4.00 4.00 5.00 4.00 6.00 4.00
24 max. 28.00 28.00 27.00 26.00 26.00 28.00 28.00
25 n 70.00 67.00 59.00 58.00 65.00 64.00 383.00
Accessibility: Items #5, 6, 7
26 M 14.67 15.01 14.98 15.31 14.60 15.55 9.98
27 SD 4.01 3.43 3.84 3.95 3.68 3.23 2.70
28 min. 3.00 6.00 3.00 6.00 4.00 8.00 2.00
29 max. 21.00 21.00 21.00 21.00 21.00 21.00 14.00
30 n 70.00 67.00 59.00 58.00 65.00 64.00 383.00
Smartphone Distractability: Items #12, 13
31 M 10.10 10.36 9.66 10.07 9.75 9.88 6.25
32 SD 2.84 2.49 2.78 2.89 2.43 2.82 1.03
33 min. 2.00 2.00 4.00 2.00 3.00 2.00 1.50
34 max. 14.00 14.00 14.00 14.00 14.00 14.00 7.00
35 n 70.00 67.00 59.00 58.00 65.00 64.00 383.00
Note:
Continuous descriptive statistics shown for Questionnaire data (i.e., SUQ score for smartphone use with absence of other stimulation, SUQ score for smartphone use during other actions, SAD score). 'TOTAL' column shows statistics across all conditions.


The nominal data includes:

  • Average text messages per day
  • Average social media based messages sent per day
  • Average social media based posts per day
  • Smartphone subjective value
  • Types of notifications
  • Phantom vibrations

This code organized the nominal data (click to see code). </summary)

Questionnaire Data by Condition and Overall: Nominal Variables
ON
OFF
On Desk In Pocket/Bag Outside On Desk In Pocket/Bag Outside TOTAL
Sample Size
1 n 70 67 59 58 65 64 383
Smartphone Usage Questionnaire (SUQ)
Average Text Messages per Day
2 0-5 18 11 10 10 10 9 68
3 6-10 5 11 9 6 6 8 45
4 11-15 8 3 5 5 10 8 39
5 >15 39 42 35 37 39 39 231
Average Social Media Based Messages Sent per Day
6 0-5 12 7 9 10 12 12 62
7 6-10 12 6 9 4 7 3 41
8 11-15 5 8 5 5 7 4 34
9 >15 41 46 36 39 39 45 246
Average Social Media Based Posts per Day
10 0-5 60 58 45 44 52 51 310
11 6-10 5 3 1 6 8 6 29
12 11-15 2 1 5 5 1 3 17
13 >15 3 5 8 3 4 4 27
Smartphone Subjective Value
14 $0-$20 32 24 25 12 23 24 140
15 $21-$40 13 13 12 22 21 12 93
16 $41-$60 8 11 11 12 16 9 67
17 >$60 17 19 11 12 5 19 83
Types of Notifications*
18 Email 47 45 47 46 39 53 277
19 Facebook 42 41 38 36 36 36 229
20 Twitter 13 22 14 14 11 22 96
21 Instagram 44 46 39 33 37 43 242
22 LinkedIn 5 2 4 5 1 7 24
23 Snapchat 53 51 47 43 49 50 293
24 Other 23 16 10 11 17 18 95
Phantom Vibrations
25 Experienced 62 54 52 53 54 56 331
26 Not Experienced 8 13 7 5 11 8 52
Note:
Continuous descriptive statistics shown for Questionnaire data (i.e., Sample size, Average text messages per day, Average social media based messages sent per day, Average social media based posts per day, Smartphone subjective value, Types of notifications, Phantom vibrations). 'TOTAL' column shows statistics across all conditions.
* Each participant could choose multiple types of notifications.

Planned Analyses

The following analyses were completed based on a pre-registered direct replication on OSF (view registration: https://osf.io/ubys7/?view_only=5db76bba381a4dc4aac130f7cd3a0de5) of Ward et al.’s (2017) second experiment. The analyses tested the three main hypotheses in the present study: (1) the location effect hypothesis, (2) the power effect hypothesis, and (3) the moderation effect hypothesis.

Each analysis contains the following:

  1. Analysis (including any relevant assumption tests)
  2. Visualizations

Note: Descriptive statistics are shown in the Descriptive Statistics section

1. & 2. Testing the Location and Power Hypotheses:

OSpan Task

A 3(smartphone location: desk, pocket/bag, outside) x 2(smartphone power: ON, OFF) between-subjects ANOVA was conducted (DV = OSpan absolute score).

Prior to completing the analysis, planned exclusion criteria was applied to the data. Exclusion criteria for this analysis helped to control for any participants who did not follow the instructions. Specifically, only participants who payed attention to both the math component and letter recall component of the task were included. Only data from those who scored 85% or higher on the math component of the task were included. View the Data Cleaning section under the data Construct Working Dataframes heading above. There was a total of 383 participants in the OSpan analysis.

Analysis

To assess if OSpan absolute score statistically differed, we conducted a 3x2 ANOVA with the between-subjects factors of smartphone location (i.e., on desk, in pocket/bag, or outside) and power (i.e., on or off). Since there are unequal sample sizes, we used Type III sum of squares.

Assumptions

The ANOVA made three assumptions: independent random sampling, normality, and homogeneity of variance. All assumptions were met (click to see details).

1. Independent Random Sampling

This assumption was met during testing.

2. Normality

This assumption was tested by visualizing the residuals, applying a Shapiro-Wilk test to the residuals, and observing the Skewness and Kurtosis of the data (extracted from the ANOVA output; click to see code).


# get the required data from the ospan_analysis data: participant, location, power, and score
ospan_anova_data = ospan_analysis[, c("participant", "location", "power", "ospan_absolute_score")]
    colnames(ospan_anova_data) = c("participant", "location", "power", "score")
    rownames(ospan_anova_data) = NULL
    # reformat location,  power, and balance as factors  
    ospan_anova_data$location = factor(ospan_anova_data$location, 
                  levels = c(1, 2, 3),
                  labels = c("On Desk", "Pocket/Bag", "Outside"))
    ospan_anova_data$power = factor(ospan_anova_data$power, 
                  levels = c(1, 2),
                  labels = c("ON", "OFF"))

# run between-subjects ANOVA (IV: Smartphone Power & Location, Balance; DV: OSpan Abolute Score)
ospan_anova = ezANOVA(
  data = ospan_anova_data
  , dv = .(score)
  , wid = .(participant)
  , between = .(location, power)
  , type = 3 # unequal sample sizes, therfore use type 3
  , detailed = TRUE
  , return_aov = TRUE
)
Warning: Converting "participant" to factor for ANOVA.
Warning: Data is unbalanced (unequal N per group). Make sure you specified a well-considered value for the type argument to ezANOVA().
Coefficient covariances computed by hccm()
# calculate & extract the residuals from the ANOVA
ospan_res = data.frame("residuals" = proj(ospan_anova$aov)[,"Residuals"])   
  • It appears that the residuals lie closely to a straight line, which suggests normality (see visualization below).

qqnorm(ospan_res$residuals, main = "Q-Q Plot of the OSpan Residuals") 
qqline(ospan_res$residuals)

  • It appears that residuals tend to follow the shape of a normal distribution, with a slight skew to the right (see visualization below).

qplot(ospan_res$residuals, main = "Histogram of Ospan Residuals", binwidth = 5) + 
  theme_classic()

  • The Shapiro-Wilk test of normality was significant (based on \(\alpha = 0.05\)), W = 0.99, p = .001 (see code below).

ospan_shap = shapiro.test(ospan_res$residuals)
ospan_shap

    Shapiro-Wilk normality test

data:  ospan_res$residuals
W = 0.98594, p-value = 0.0008969
  • The Skewness and Kurtosis were examined and showed that both the Skewness (range: -0.42 to -0.05) and Kurtosis (range: -0.86 to 0.1) were within the acceptable range of \(\pm 2.0\) and \(\pm 9.0\), respectively (Schmider, Ziegler, Danay, Beyer, & Bühner, 2010) (see table below; click to see code).

# use the "ospan_des" data frame from the descriptive stats

# organize the data into a new data frame depicting the Skew, kurtosis, and sample size (n).
  ospan_sk_t = rbind((cbind(t(des_task_cont["ospan_absolute_score1", c("skew", "kurtosis", "n")]),
                             t(des_task_cont["ospan_absolute_score2", c("skew", "kurtosis", "n")]),
                             t(des_task_cont["ospan_absolute_score3", c("skew", "kurtosis", "n")]))), 
                      (cbind(t(des_task_cont["ospan_absolute_score4", c("skew", "kurtosis", "n")]),
                             t(des_task_cont["ospan_absolute_score5", c("skew", "kurtosis", "n")]),
                             t(des_task_cont["ospan_absolute_score6", c("skew", "kurtosis", "n")]))))
    colnames(ospan_sk_t) = c("On Desk", "In Pocket/Bag", "Outside")
    rownames(ospan_sk_t) = rep(c("Skewness", "Kurtosis", "n"), 2)
# use kable to show the descriptive table
  kable(ospan_sk_t, caption = "Skewness and Kurtosis of the Absolute Score on the OSpan Task", digits = 2, align = 'c') %>%
  add_header_above(c(" ", "Smartphone Location" = 3), bold = T) %>%
  pack_rows("Smartphone Power ON", 1, 3) %>% 
  pack_rows("Smartphone Power OFF", 4, 6) %>%
  row_spec(0, bold = T) %>% 
  kable_styling(bootstrap_options = "striped", "hover", full_width = F) %>%
  kable_classic()
Skewness and Kurtosis of the Absolute Score on the OSpan Task
Smartphone Location
On Desk In Pocket/Bag Outside
Smartphone Power ON
Skewness -0.23 -0.42 -0.05
Kurtosis -0.71 -0.26 -0.84
n 70.00 67.00 59.00
Smartphone Power OFF
Skewness -0.39 -0.22 -0.42
Kurtosis -0.86 -0.78 0.10
n 58.00 65.00 64.00
  • Therefore, based on all the normality assumption checks, normality was assumed to be within the acceptable range for the ANOVA analysis. Additionally, it should be noted that ANOVA analyses can be robust to violations of normality (Gardner & Tremblay, 2007).

3. Homogeneity of Variance

This assumption was tested by conducting Levene’s Test for Homogeneity of Variance for each between-subjects variable (i.e., smartphone location and smartphone power).

  • The homogeneity assumption was met (based on \(\alpha = 0.05\)) for smartphone location, F(2, 380) = 0.5, p = = .61 and smartphone power, F(1, 381) = 0.22, p = .64 (see code below).
# conduct levene's
  # for location
  ospan_lev_loc = leveneTest(data = ospan_anova_data, score ~ as.factor(location), center = mean)
  ospan_lev_loc
Levene's Test for Homogeneity of Variance (center = mean)
       Df F value Pr(>F)
group   2  0.4966  0.609
      380               
  # for power
  ospan_lev_pow = leveneTest(data = ospan_anova_data, score ~ as.factor(power), center = mean)
  ospan_lev_pow
Levene's Test for Homogeneity of Variance (center = mean)
       Df F value Pr(>F)
group   1  0.2177 0.6411
      381               
Results

All results were based on \(\alpha = 0.05\). Below are the reported statistics for both main effects and the interactions, followed by a table depicting the ANOVA results.

Main Effect of Smartphone Location

There was no significant main effect of smartphone location on Ospan absolute score, F(2, 377) = 0.1, p = .91, \(\eta^{2}\) = .001.

Main Effect of Smartphone Power

There was no significant main effect of smartphone power on Ospan absolute score, F(1, 377) = 0.21, p = .65, \(\eta^{2}\) = .001.

Interaction of Smartphone Location & Smartphone Power

There was no significant interaction between smartphone location and smartphone power on Ospan absolute score, F(2, 377) = 1.19, p = .3, \(\eta^{2}\) = .01.

Post-hoc Tests

Since there was no significant main or interaction effects, no post-hocs were completed.

# show ANOVA results in kable table 
kable(ospan_anova$ANOVA, caption = "3x2 ANOVA on Ospan Absolute Score", digits = 4, align = 'c') %>%
  row_spec(0, bold = T) %>% 
  kable_styling(bootstrap_options = "striped", "hover", full_width = F) %>% 
  kable_classic()
3x2 ANOVA on Ospan Absolute Score
Effect DFn DFd SSn SSd F p p<.05 ges
(Intercept) 1 377 656179.2350 108330.1 2283.5723 0.0000 * 0.8583
location 2 377 56.1861 108330.1 0.0978 0.9069 0.0005
power 1 377 59.0225 108330.1 0.2054 0.6507 0.0005
location:power 2 377 684.6542 108330.1 1.1913 0.3050 0.0063

Visualizations

The following shows a visualization of the OSpan absolute score based on the two smartphone conditions: location and power. (click to see code).

# create data frame with location conditions as characters rather than integers 
  # First, get a subset of the descriptive data
  ospan_bar_data <- ospan_analysis[, c("location", "power", "ospan_absolute_score")]
    colnames(ospan_bar_data) = c("location", "power", "score")
    rownames(ospan_bar_data) = NULL
    
  ospan_bar_data$location = factor(ospan_bar_data$location, 
                  levels = c(1, 2, 3),
                  labels = c("Desk", "Pocket/Bag", "Outside"))
  ospan_bar_data$power = factor(ospan_bar_data$power, 
                  levels = c(1, 2),
                  labels = c("ON", "OFF"))
# plot bar graph - GOOD
ospan_bar <- 
  ggplot(ospan_bar_data, aes(x = location, y = score, fill = power), show.legend = FALSE) +
  geom_bar(stat = "summary", fun = mean, position = "dodge") + 
    guides(shape = FALSE) + 
  scale_fill_manual(values = c("cadetblue3", "deepskyblue4")) + 
  labs(
       title = "Performance on the OSpan Task",
       subtitle = "Average Absolute OSpan Score",
       caption = "Note: Score is the number of correct letters recalled based \n on if all letters in a given block were recalled correctly.",
       x = "Smartphone Location", 
       y = "OSpan Absolute Score", 
       fill = "Smartphone Power"
       ) +
        # the "\n" here denote that you want a new line formed in the text
  scale_y_continuous(limits = c(min(ospan_bar_data$score), max(ospan_bar_data$score)), breaks = ) +
  #scale_colour_manual(values = cols) +
  geom_point(stat = "identity", aes(color = power), size = 1, 
             position = position_jitterdodge(jitter.width = .53), show.legend = FALSE) +
  scale_color_manual(values = c("deepskyblue4", "purple4")) + 
  geom_point(stat="summary", fun=mean, position = position_dodge(.9), size = 1.5, show.legend = FALSE) +
  geom_errorbar(data = ospan_bar_data, stat = "summary", funmin = function(x) mean(x) - sd(x)/sqrt(length(x)), 
                funmax = function(x) mean(x) + sd(x)/sqrt(length(x)), size= .5, width= .25, position = position_dodge(.9)) +
  theme_classic() +
  theme(plot.title = element_text(color = "black", size = 14, face = "bold"), 
        plot.subtitle = element_text(color = "black", size = 13), 
        plot.caption = element_text(hjust = 0, size = 12, face = "italic"),
        text = element_text(size = 13))

# show fig
ospan_bar
No summary function supplied, defaulting to `mean_se()`

# export plot
# ggsave(filename="fig_ospan_bar_june16", plot = ospan_bar, device = "jpeg", height = 5, width = 8, units = "in", dpi = 1000)

Cue-Dependent Go/No-Go Task

An error (i.e., omission errors) and accuracy (i.e., RT) analysis was completed using the omission errors and average RT for each participant.

Error Analysis

A 3(Smartphone location: desk, pocket/bag, outside) x 2(Smartphone power: ON, OFF) x 2(Cue type: Vertical, Horizontal) mixed ANOVA was completed (DV = omission error(s)). The between-subjects factors were smartphone location and power, and the within-subjects factor was pre-target cue type (i.e., “cue type”; either “vertical” or “horizontal”).

Prior to completing the analysis, planned exclusion criteria was applied to the data. Exclusion criteria for this analysis helped to control for any participants who did not follow the instructions. Only data from those who scored higher than chance performance (i.e., responded to at least 50% of “Go” trials and withheld response to at least 50% of “No-Go” trials) were used for the final analysis. View the Data Cleaning section under the data Construct Working Dataframes heading above. There was a total of 383 participants in the Go/No-Go Error analysis.

Analysis

To assess if the omission errors statistically differed, we conducted a 3x2x2 mixed ANOVA with the between-subjects factors of smartphone location (i.e., on desk, in pocket/bag, or outside) and power (i.e. on or off), and the within-subjects factor of cue type (i.e., “vertical” or “horizontal”). Since there are unequal sample sizes, we used Type III sum of squares.

Assumptions

The mixed ANOVA made four assumptions: independent random sampling, normality, homogeneity of variance (i.e., for the between-subject factors), and sphericity (i.e., for the within-subjects factor). Normality was outside of the acceptable range, however, ANOVA tests are typically robust to violations of normality (Gardner & Tremblay, 2007). The homogeneity of variance assumption was not met. Due to this and since there are unequal sample sizes, a White-corrected F-test was completed for the between-subject factors (i.e., smartphone location, power) (click to see details).

1. Independent Random Sampling

This assumption was met during testing.

2. Normality

This assumption was tested by visualizing the residuals, applying a Shapiro-Wilk test to the resdiuals, and observing the Skewness and Kurtosis of the data (extracted from the ANOVA output; click to see code).


# get the required data from the gng_err_analysis data: participant, location, power, and score
gng_err_anova_data = gng_err_analysis[,c("participant", "location", "power", "cue", "gng_err_omi_long")]
    colnames(gng_err_anova_data) = c("participant", "location", "power", "cue", "err_omi")
    rownames(gng_err_anova_data) = NULL
    gng_err_anova_data$location = as.factor(gng_err_anova_data$location)
    gng_err_anova_data$power = as.factor(gng_err_anova_data$power)
    gng_err_anova_data$cue = as.factor(gng_err_anova_data$cue)

# run mixed ANOVA (IV b/w: Smartphone Power & Location; IV w/in: Target Type; DV: Omission Errors)
gng_err_anova = ezANOVA(
  data = gng_err_anova_data
  , dv = .(err_omi)
  , wid = .(participant)
  , within = .(cue)
  , between = .(location, power)
  , type = 3 # unequal sample sizes, therefore use type 3
  , detailed = TRUE
  , return_aov = TRUE
)
Warning: Converting "participant" to factor for ANOVA.
Warning: Data is unbalanced (unequal N per group). Make sure you specified a well-considered value for the type argument to ezANOVA().
# calculate & extract the residuals from the ANOVA
gng_err_res = data.frame("residuals" = proj(gng_err_anova$aov)[[3]][, "Residuals"])
  • It appears that the residuals lie closely to a straight line (i.e., only a couple lie away form the line), which suggests normality (see visualization below).

qqnorm(gng_err_res$residuals, main = "Q-Q Plot of the Go/No-Go Error Residuals") 
qqline(gng_err_res$residuals)

  • It appears that residuals are leptokurtic (i.e., there is a high peak near the center). This was expected since the Cue-Dependent Go/No-Go task is an easy response inhibition task, resulting in low error rates (see visualization below).

qplot(gng_err_res$residuals, main = "Histogram of Go/No-Go Error Residuals", binwidth = .15) + 
  theme_classic()

  • The Shapiro-Wilk test of normality was significant (based on \(\alpha = 0.05\)), W = 0.53, p < .001 (see code below).

gng_err_shap = shapiro.test(gng_err_res$residuals)
gng_err_shap

    Shapiro-Wilk normality test

data:  gng_err_res$residuals
W = 0.53223, p-value < 2.2e-16
  • The Skewness and Kurtosis was examined. For the vertical cue type, both the Skewness (range: 2.5 to 6.98) and Kurtosis (range: 5.8 to 50.22) were outside of the acceptable range of \(\pm 2.0\) and \(\pm 9.0\), respectively (Schmider, Ziegler, Danay, Beyer, & Bühner, 2010). For the horizontal cue type, both the Skewness (range: 2.76 to 6.18) and Kurtosis (range: 5.99 to 41.42) were outside of the acceptable range of \(\pm 2.0\) and \(\pm 9.0\), respectively (Schmider, Ziegler, Danay, Beyer, & Bühner, 2010) (see table below).

# create a table of the skewness & kurtosis for the gng data, separated by cue type (i.e. one for the vertical and horizontal trials)

#--FOR THE VERTICAL / Go CUE TYPE
# use the "des_task_cont_cue_v" data frame from the descriptives

# organize the data into a new data frame depicting the Skew, kurtosis, and sample size (n).
  gng_err_sk_v_t <- rbind((cbind(t(des_task_cont_cue_v["gng_err_omi_long1", c("skew", "kurtosis", "n")]),
                                 t(des_task_cont_cue_v["gng_err_omi_long2", c("skew", "kurtosis", "n")]),
                                 t(des_task_cont_cue_v["gng_err_omi_long3", c("skew", "kurtosis", "n")]))), 
                          (cbind(t(des_task_cont_cue_v["gng_err_omi_long4", c("skew", "kurtosis", "n")]),
                                 t(des_task_cont_cue_v["gng_err_omi_long5", c("skew", "kurtosis", "n")]),
                                 t(des_task_cont_cue_v["gng_err_omi_long6", c("skew", "kurtosis", "n")]))))
  colnames(gng_err_sk_v_t) = c("On Desk", "In Pocket/Bag", "Outside")
  rownames(gng_err_sk_v_t) = rep(c("Skewness", "Kurtosis", "n"), 2)
  
#--FOR THE HORIZONTAL / No-Go CUE TYPE
# use the "gng_err_des_h" data frame from the descriptives

# organize the data into a new data frame depicting the Skew, kurtosis, and sample size (n).
  gng_err_sk_h_t <- rbind((cbind(t(des_task_cont_cue_h["gng_err_omi_long1", c("skew", "kurtosis", "n")]),
                                 t(des_task_cont_cue_h["gng_err_omi_long2", c("skew", "kurtosis", "n")]),
                                 t(des_task_cont_cue_h["gng_err_omi_long3", c("skew", "kurtosis", "n")]))), 
                          (cbind(t(des_task_cont_cue_h["gng_err_omi_long4", c("skew", "kurtosis", "n")]),
                                 t(des_task_cont_cue_h["gng_err_omi_long5", c("skew", "kurtosis", "n")]),
                                 t(des_task_cont_cue_h["gng_err_omi_long6", c("skew", "kurtosis", "n")]))))
    colnames(gng_err_sk_h_t) = c("On Desk", "In Pocket/Bag", "Outside")
    rownames(gng_err_sk_h_t) = rep(c("Skewness", "Kurtosis", "n"), 2)
# show a table of the skewness & kurtosis for the gng data, separated by cue type (i.e. one for the vertical and horizontal trials)
    
# for vertical cue type
# use kable to show the descriptives table
kable(gng_err_sk_v_t, caption = "Skewness and Kurtosis of the Omission Errors on the Go/No-Go Task for 'Go' (Vertical) Trials", digits = 2, align = 'c') %>%
  add_header_above(c(" ", "Smartphone Location" = 3), bold = T) %>%
  pack_rows("Smartphone Power ON", 1, 3) %>% 
  pack_rows("Smartphone Power OFF", 4, 6) %>%
  row_spec(0, bold = T) %>% 
  kable_styling(bootstrap_options = "striped", "hover", full_width = F) %>%
  kable_classic()
Skewness and Kurtosis of the Omission Errors on the Go/No-Go Task for 'Go' (Vertical) Trials
Smartphone Location
On Desk In Pocket/Bag Outside
Smartphone Power ON
Skewness 4.38 2.50 2.52
Kurtosis 21.24 5.80 6.18
n 70.00 67.00 59.00
Smartphone Power OFF
Skewness 4.51 6.98 2.81
Kurtosis 22.88 50.22 8.96
n 58.00 65.00 64.00
    
# for horizontal cue type
# use kable to show the descriptives table
kable(gng_err_sk_h_t, caption = "Skewness and Kurtosis of the Omission Errors on the Go/No-Go Task for 'No-Go' (Horizonatal) Trials", digits = 2, align = 'c') %>%
  add_header_above(c(" ", "Smartphone Location" = 3), bold = T) %>%
  pack_rows("Smartphone Power ON", 1, 3) %>% 
  pack_rows("Smartphone Power OFF", 4, 6) %>%
  row_spec(0, bold = T) %>% 
  kable_styling(bootstrap_options = "striped", "hover", full_width = F) %>%
  kable_classic()
Skewness and Kurtosis of the Omission Errors on the Go/No-Go Task for 'No-Go' (Horizonatal) Trials
Smartphone Location
On Desk In Pocket/Bag Outside
Smartphone Power ON
Skewness 3.22 2.81 2.81
Kurtosis 10.47 5.99 7.34
n 70.00 67.00 59.00
Smartphone Power OFF
Skewness 4.43 6.18 2.76
Kurtosis 20.04 41.42 7.34
n 58.00 65.00 64.00
  • Therefore, based on all the normality assumption checks, normality was not assumed to be within the acceptable range, however, ANOVA tests are typically robust to violations of normality (Gardner & Tremblay, 2007).

3. Homogeneity of Variance

  • This assumption was tested by conducting Levene’s Test for Homogeneity of Variance for each between-subjects variable (i.e., smartphone location, smartphone power, and balance).

  • The homogeneity assumption was met (based on \(\alpha = 0.05\)) for smartphone power, F(1, 764) = 0.97, p = .32. However, it was not met for smartphone location, F(2, 763) = 3.72, p = .02,(see code below).


# conduct levene's
  # for location
  gng_err_lev_loc = leveneTest(data = gng_err_anova_data, err_omi ~ as.factor(location), center = mean)
  gng_err_lev_loc
Levene's Test for Homogeneity of Variance (center = mean)
       Df F value  Pr(>F)  
group   2   3.724 0.02458 *
      763                  
---
Signif. codes:  0 ‘***’ 0.001 ‘**’ 0.01 ‘*’ 0.05 ‘.’ 0.1 ‘ ’ 1
  # for power
  gng_err_lev_pow = leveneTest(data = gng_err_anova_data, err_omi ~ as.factor(power), center = mean)
  gng_err_lev_pow
Levene's Test for Homogeneity of Variance (center = mean)
       Df F value Pr(>F)
group   1  0.9704 0.3249
      764               

4. Sphericity

This assumption was met since sphericity is always met for two levels of a repeated measure factor and is, therefore, unnecessary to evaluate (Girden, 1992).

Results

Since the homogeneity assumption was not met for the mixed ANOVA, a White-corrected F-test was completed. The results from this analysis are shown below, followed by a table depicting the statistics (click to see code).

Since the ezANOVA function can only complete a White-corrected F-test on between-subject factors, a separate ANOVA was computed using this correction for the between-subject effects. Additionally, the effect sizes were compluted separately using the EtaSq function.

All results were base on \(\alpha = 0.05\). Below are the reported statistics for both main effects and the interaction, followed by a table depicting the ANOVA results (click to see code).

Main Effect of Smartphone Location

There was no significant main effect of smartphone location on omission errors, F(2, 377) = 0.67, p = .51, \(\eta_{G}^2\) = = .002.

Main Effect of Smartphone Power

There was no significant main effect of smartphone power on omission errors, F(1, 377) = 0.15, p = .7, \(\eta_{G}^2\) = = .001.

Main Effect of Smartphone Cue Type

There was a significant main effect of cue type on omission errors, F(1, 377) = 23.22, p < .001, \(\eta_{G}^2\) = = .02.

Interaction of Smartphone Location & Power

There was no significant interaction between smartphone location and power on omission errors, F(2, 377) = 1.01, p = .37, \(\eta_{G}^2\) = = .003.

Interaction of Smartphone Location & Cue Type

There was a significant interaction between smartphone location and cue type on omission errors, F(2, 377) = 3.19, p = .04, \(\eta_{G}^2\) = = .01. Due to a significant interaction, the significant main effect of cue type was not explored further.

Interaction of Smartphone Power & Cue Type

There was no significant interaction between smartphone power and cue type on omission errors, F(1, 377) = 0.26, p = .61, \(\eta_{G}^2\) = < .001.

Interaction of Smartphone Location, Smartphone Power, & Cue Type

There was no significant interaction between smartphone location, smartphone power, and cue type on omission errors, F(2, 377) = 0.32, p = .72, \(\eta_{G}^2\) = = .001.

Post-hoc Tests

Since there was a significant interaction between cue type and smartphone location, a post-hocs assessing the simple main effects of cue type (i.e., vertical/go, horizontal/no-go) across the levels of smartphone location (i.e., desk, pocket/bag, outside) was completed.

# create data frame from mixed ANOVA results to display (including both white-corr for b/w sub effects)
gng_err_anova_t <- 
  data.frame("effect" = gng_err_anova$ANOVA$Effect[2:8], 
             "DFn" = gng_err_anova$ANOVA$DFn[2:8], 
             "DFd" = gng_err_anova$ANOVA$DFd[2:8], 
             "F" = c(gng_err_anova_white$ANOVA$F[2:3], gng_err_anova$ANOVA$F[4], gng_err_anova_white$ANOVA$F[4], gng_err_anova$ANOVA$F[6:8]), 
             "p" = c(gng_err_anova_white$ANOVA$p[2:3], gng_err_anova$ANOVA$p[4], gng_err_anova_white$ANOVA$p[4], gng_err_anova$ANOVA$p[6:8]), 
             "p<.05" = c(gng_err_anova_white$ANOVA$`p<.05`[2:3], gng_err_anova$ANOVA$`p<.05`[4], gng_err_anova_white$ANOVA$`p<.05`[4], gng_err_anova$ANOVA$`p<.05`[6:8]), 
             "ges" = c(gng_err_anova_white_es[1:2, 2], gng_err_anova$ANOVA$ges[4], gng_err_anova_white_es[3, 2], gng_err_anova$ANOVA$ges[6:8])
             )
# rename rows
rownames(gng_err_anova_t) = NULL
# show ANOVA results in kable table 
kable(gng_err_anova_t, caption = "3x2x2 Mixed ANOVA on Omission Error in the Go/No-Go Task", digits = 4, align = 'c', col.names = c("Effect", "DFn", "DFd", "F", "p", "p < .05", "ges")) %>%
  row_spec(0, bold = T) %>% 
  footnote(general = "Note: between-subject effects (i.e., location, power, location:power) are White-corrected.") %>% 
  kable_styling(bootstrap_options = "striped", "hover", full_width = F) %>% 
  kable_classic()
3x2x2 Mixed ANOVA on Omission Error in the Go/No-Go Task
Effect DFn DFd F p p < .05 ges
location 2 377 0.6684 0.5131 0.0023
power 1 377 0.1490 0.6997 0.0010
cue 1 377 23.2213 0.0000 * 0.0190
location:power 2 377 1.0102 0.3651 0.0034
location:cue 2 377 3.1866 0.0424 * 0.0053
power:cue 1 377 0.2592 0.6109 0.0002
location:power:cue 2 377 0.3246 0.7230 0.0005
Note:
Note: between-subject effects (i.e., location, power, location:power) are White-corrected.
Post-Hocs

An ANOVA was conducted to investigate the effect of cue type on omission errors for each smartphone location separately (with a Holm-Bonferroni adjustment).

PH: Assumptions

The two between-subjects ANOVA made three assumptions: independent random sampling, normality, and homogeneity of variance. For both ANOVAs, normality was outside of the acceptable range, however, ANOVA tests are typically robust to violations of normality (Gardner & Tremblay, 2007). The homogeneity of variance assumption was met for the horizontal (i.e., “no-go”) cue type, but not for the vertical (i.e., “go”) cue type. Due to this and since there are unequal sample sizes, a White-corrected F-test was completed for the vertical (i.e., “go”) cue type ANOVA and a normal ANOVA for the horizontal (i.e., “no-go”) cue type (click to see details).

1. Independent Random Sampling

This assumption was met during testing for both ANOVAs.

2. Normality

This assumption was tested by visualizing the residuals, applying a Shapiro-Wilk test to the residuals, and observing the Skewness and Kurtosis of the data (extracted from the ANOVA output; click to see code).


# get the required data from the gng_err_analysis data: participant, location, and score
  # for vertical/go trials
  gng_err_anova_data_V <- 
    gng_err_anova_data %>% 
    # get cue type = "go" or vertical
    filter(cue == 1) %>% 
    # remove power factor
    select(-power)
  # for horizontal/no-go trials
  gng_err_anova_data_H <- 
    gng_err_anova_data %>% 
    # get cue type = "no-go" or horizontal 
    filter(cue == 2) %>% 
    # remove power factor
    select(-power)

# run ANOVA (IV: Smartphone Location; DV: Omission Errors)
  # for vertical/go trials
  gng_err_anova_V = ezANOVA(
    data = gng_err_anova_data_V
    , dv = .(err_omi)
    , wid = .(participant)
    , between = .(location)
    , type = 3 # unequal sample sizes, therefore use type 3
    , detailed = TRUE
    , return_aov = TRUE
    )
Warning: Converting "participant" to factor for ANOVA.
Warning: Data is unbalanced (unequal N per group). Make sure you specified a well-considered value for the type argument to ezANOVA().
Coefficient covariances computed by hccm()
  # for horizontal/no-go trials
  gng_err_anova_H = ezANOVA(
    data = gng_err_anova_data_H
    , dv = .(err_omi)
    , wid = .(participant)
    , between = .(location)
    , type = 3 # unequal sample sizes, therefore use type 3
    , detailed = TRUE
    , return_aov = TRUE
    )
Warning: Converting "participant" to factor for ANOVA.
Warning: Data is unbalanced (unequal N per group). Make sure you specified a well-considered value for the type argument to ezANOVA().
Coefficient covariances computed by hccm()
# calculate & extract the residuals from the ANOVA
  # for vertical/go trials
  gng_err_res_V = data.frame("residuals" = gng_err_anova_V$aov$residuals)
  # for horizontal/no-go trials
  gng_err_res_H = data.frame("residuals" = gng_err_anova_H$aov$residuals)
  • It appears that the residuals do not lie closely to a straight line: the data are heavy-tailed on the right (see visualization below).
# for vertical/go trials
qqnorm(gng_err_res_V$residuals, main = "Q-Q Plot of the Go/No-Go Error Residuals for Vertical/Go Trials") 
qqline(gng_err_res_V$residuals)


# for horizontal/no-go trials
qqnorm(gng_err_res_H$residuals, main = "Q-Q Plot of the Go/No-Go Error Residuals for Horizontal/No-Go Trials") 
qqline(gng_err_res_H$residuals)

  • It appears that residuals are leptokurtic (i.e., there is a high peak near the center). This was expected since the Cue-Dependent Go/No-Go task is an easy response inhibition task, resulting in low error rates (see visualization below).
# for vertical/go trials
qplot(gng_err_res_V$residuals, main = "Histogram of Go/No-Go Error Residuals for Vertical/Go Trials", binwidth = .5) + 
  theme_classic()

# for horizontal/no-go trials
qplot(gng_err_res_H$residuals, main = "Histogram of Go/No-Go Error Residuals for Horizontal/No-Go Trials", binwidth = .5) + 
  theme_classic()

  • The Shapiro-Wilk test of normality was significant (based on \(\alpha = 0.05\)) for the vertical (i.e., “go”) trials, W = 0.35, p < .001, and horizontal (i.e., “no-go”) trials, W = 0.34, p < .001 (see code below).
# for vertical/go trials
gng_err_shap_V <- shapiro.test(gng_err_res_V$residuals)
gng_err_shap_V

    Shapiro-Wilk normality test

data:  gng_err_res_V$residuals
W = 0.34529, p-value < 2.2e-16
# for horizontal/no-go trials
gng_err_shap_H <- shapiro.test(gng_err_res_H$residuals)
gng_err_shap_H

    Shapiro-Wilk normality test

data:  gng_err_res_H$residuals
W = 0.33566, p-value < 2.2e-16
  • The Skewness and Kurtosis was examined. For the vertical cue type, both the Skewness (range: 2.5 to 6.98) and Kurtosis (range: 5.8 to 50.22) were outside of the acceptable range of \(\pm 2.0\) and \(\pm 9.0\), respectively (Schmider, Ziegler, Danay, Beyer, & Bühner, 2010). For the horizontal cue type, both the Skewness (range: 2.76 to 6.18) and Kurtosis (range: 5.99 to 41.42) were outside of the acceptable range of \(\pm 2.0\) and \(\pm 9.0\), respectively (Schmider, Ziegler, Danay, Beyer, & Bühner, 2010) (see table below).
### NOTE: same table as before post-hocs

# show a table of the skewness & kurtosis for the gng data, separated by cue type (i.e. one for the vertical and horizontal trials)

# for vertical cue type
# use kable to show the descriptives table
kable(gng_err_sk_v_t, caption = "Skewness and Kurtosis of the Omission Errors on the Go/No-Go Task for 'Go' (Vertical) Trials", digits = 2, align = 'c') %>%
  add_header_above(c(" ", "Smartphone Location" = 3), bold = T) %>%
  pack_rows("Smartphone Power ON", 1, 3) %>% 
  pack_rows("Smartphone Power OFF", 4, 6) %>%
  row_spec(0, bold = T) %>%
  kable_styling(bootstrap_options = "striped", "hover", full_width = F) %>%
  kable_classic()
Skewness and Kurtosis of the Omission Errors on the Go/No-Go Task for 'Go' (Vertical) Trials
Smartphone Location
On Desk In Pocket/Bag Outside
Smartphone Power ON
Skewness 4.38 2.50 2.52
Kurtosis 21.24 5.80 6.18
n 70.00 67.00 59.00
Smartphone Power OFF
Skewness 4.51 6.98 2.81
Kurtosis 22.88 50.22 8.96
n 58.00 65.00 64.00

# for horizontal cue type
# use kable to show the descriptives table
kable(gng_err_sk_h_t, caption = "Skewness and Kurtosis of the Omission Errors on the Go/No-Go Task for 'No-Go' (Horizonatal) Trials", digits = 2, align = 'c') %>%
  add_header_above(c(" ", "Smartphone Location" = 3), bold = T) %>%
  pack_rows("Smartphone Power ON", 1, 3) %>% 
  pack_rows("Smartphone Power OFF", 4, 6) %>%
  row_spec(0, bold = T) %>%
  kable_styling(bootstrap_options = "striped", "hover", full_width = F) %>%
  kable_classic()
Skewness and Kurtosis of the Omission Errors on the Go/No-Go Task for 'No-Go' (Horizonatal) Trials
Smartphone Location
On Desk In Pocket/Bag Outside
Smartphone Power ON
Skewness 3.22 2.81 2.81
Kurtosis 10.47 5.99 7.34
n 70.00 67.00 59.00
Smartphone Power OFF
Skewness 4.43 6.18 2.76
Kurtosis 20.04 41.42 7.34
n 58.00 65.00 64.00
  • Therefore, based on all the normality assumption checks, normality was not assumed to be within the acceptable range, however, ANOVA tests are typically robust to violations of normality (Gardner & Tremblay, 2007).

3. Homogeneity of Variance

  • This assumption was tested by conducting Levene’s Test for Homogeneity of Variance for vertical (i.e., “go”) trials and horizontal (i.e., “no-go”) trials.

  • The homogeneity assumption was met (based on \(\alpha = 0.05\)) for the horizontal (i.e., “no-go”) cue type, F(2, 380) = 1.93, p = .15. However, it was not met for vertical (i.e., “go”) cue type, F(2, 380) = 4.74, p = .01,(see code below).


# conduct levene's
  # for location
  gng_err_lev_V = leveneTest(data = gng_err_anova_data_V, err_omi ~ as.factor(location), center = mean)
  gng_err_lev_V
Levene's Test for Homogeneity of Variance (center = mean)
       Df F value   Pr(>F)   
group   2  4.7449 0.009218 **
      380                    
---
Signif. codes:  0 ‘***’ 0.001 ‘**’ 0.01 ‘*’ 0.05 ‘.’ 0.1 ‘ ’ 1
  # for power
  gng_err_lev_H = leveneTest(data = gng_err_anova_data_H, err_omi ~ as.factor(location), center = mean)
  gng_err_lev_H
Levene's Test for Homogeneity of Variance (center = mean)
       Df F value Pr(>F)
group   2  1.9329 0.1462
      380               
PH: SME Results

Since the homogeneity assumption was not met for the vertical (i.e., “go”) cue type ANOVA, a White-corrected F-test was completed. The ANOVA for the horizontal (i.e., “no-g”) cue type was completed without a correction. Both ANOVAs had a Holm-Bonferroni adjustment. The results from this analysis are shown below, followed by a table depicting the statistics (click to see code).

Run the white-corrected ANOVA for the vertical (i.e., “go”) cue type ANOVA.

Complete the Holm-Bonferroni adjustment for both ANOVAs.

# get corrected p-values for each ANOVA
gng_err_anova_cue_padj <- 
  p.adjust(c(gng_err_anova_V_white$ANOVA$p[2], # p-value from white-corrected V/go ANOVA
             gng_err_anova_H$ANOVA$p[2]), # p-value from H/no-go ANOVA
         method = c("holm"), # method: Holm-Bonferroni
         n = 2 # number of comparisons
         )

All results were base on \(\alpha = 0.05\). Below are the reported statistics for the main effects for each SME ANOVA, followed by a table depicting the SME ANOVA results (click to see code).

Cue Type: “Go” / Vertical

Main Effect of Smartphone Location

There was no significant main effect of smartphone location on omission errors for the vertical/“go” cue type, F(2, 380) = 1.42, p = .49, \(\eta_{G}^2\) = = .01.

Cue Type: “No-Go” / Horizontal

Main Effect of Smartphone Location

There was no significant main effect of smartphone location on omission errors for the horizontal/“no-go” cue type, F(2, 380) = 0.6, p = .55, \(\eta_{G}^2\) = = .003.

Post-hoc Tests

Since there was no significant main effect for either cue type, no post-hocs were completed.

# create data frame from mixed ANOVA results to display (including both white-corr for b/w sub effects)
gng_err_anova_cue_t <- 
  data.frame("effect" = c(gng_err_anova_V_white$ANOVA$Effect[2], gng_err_anova_H$ANOVA$Effect[2]), 
             "DFn" = c(gng_err_anova_V_white$ANOVA$DFn[2], gng_err_anova_H$ANOVA$DFn[2]), 
             "DFd" = c(gng_err_anova_V_white$ANOVA$DFd[2], gng_err_anova_H$ANOVA$DFd[2]), 
             "F" = c(gng_err_anova_V_white$ANOVA$F[2], gng_err_anova_H$ANOVA$F[2]), 
             "p" = gng_err_anova_cue_padj,
             "p<.05" = c(gng_err_anova_V_white$ANOVA$`p<.05`[2], gng_err_anova_H$ANOVA$`p<.05`[2]), 
             "ges" = c(gng_err_anova_V_white_es[2], gng_err_anova_H$ANOVA$ges[2])
             )
# rename rows
rownames(gng_err_anova_cue_t) = NULL
# show ANOVA results in kable table 
kable(gng_err_anova_cue_t, caption = "Tests of simple main effects on Go/No-Go omission errors for cue type (i.e., 'go' or 'no-go') across smartphone location levels (i.e., desk, pocket/bag, outside).", digits = 4, align = 'c', col.names = c("Effect", "DFn", "DFd", "F", "p", "p < .05", "ges")) %>%
  row_spec(0, bold = T) %>% 
  pack_rows("Cue Type: 'Go'", 1, 1) %>% 
  pack_rows("Cue Type: 'No-Go'", 2, 2) %>% 
  footnote(general = "Note: 'Go' cue type effect was White-corrected. Holm-Bonferroni adjusted p-values depicted.") %>% 
  kable_styling(bootstrap_options = "striped", "hover", full_width = F) %>% 
  kable_classic()
Tests of simple main effects on Go/No-Go omission errors for cue type (i.e., 'go' or 'no-go') across smartphone location levels (i.e., desk, pocket/bag, outside).
Effect DFn DFd F p p < .05 ges
Cue Type: 'Go'
location 2 380 1.4177 0.4871 0.0081
Cue Type: 'No-Go'
location 2 380 0.5976 0.5506 0.0031
Note:
Note: 'Go' cue type effect was White-corrected. Holm-Bonferroni adjusted p-values depicted.
Visualizations

The following shows a visualization of the Go/No-Go omission error data based on the three factors: smartphone location, smartphone power, and cue type.

  • To visualize the mixed design, two plots were created (i.e., one for each cue type).
#--FOR THE VERTICAL CUE TYPE-- 

# create data frame with location conditions as characters rather than integers 
  # First, get a subset of the data
  gng_err_bar_v_data = gng_err_analysis[gng_err_analysis$cue == 1, c("location", "power", "gng_err_omi_long")]
    colnames(gng_err_bar_v_data) = c("location", "power", "err_omi")
    rownames(gng_err_bar_v_data) = NULL 
  # reformat location & power as factors  
  gng_err_bar_v_data$location = factor(gng_err_bar_v_data$location, 
                  levels = c(1, 2, 3),
                  labels = c("On Desk", "Pocket/Bag", "Outside"))
  gng_err_bar_v_data$power = factor(gng_err_bar_v_data$power, 
                  levels = c(1, 2),
                  labels = c("ON", "OFF"))
       
# create plot
gng_err_bar_v = ggplot(gng_err_bar_v_data, aes(x = location, y = err_omi, fill = power), show.legend = FALSE) +
  geom_bar(stat = "summary", fun = "mean", position = "dodge") + 
    guides(shape = FALSE) + 
  scale_fill_manual(values = c("cadetblue3", "deepskyblue4")) + 
  labs(title = "Performance on the Go/No-Go Task: 'Vertical' Cue Type",
       subtitle = "Omission Errors",
       x = "Smartphone Location", 
       y = "Omission Error", 
       fill = "Smartphone Power",
       caption = paste0(strwrap("Figure X. Distributions of scores on the vertical cue type on the Cue-Dependent Go/No-Go Task, arranged by smartphone location (along the x-axis) and smartphone power (different bars). Omission errors occurred when a participant failed to respond to a non-target stimulus. Coloured dots represent individual data points. Black dots and error bars depict means and standard errors, respectively.", width = 80), collapse = "\n")) +
        # the "\n" here denote that you want a new line formed in the text
  scale_y_continuous(limits = c(min(gng_err_bar_v_data$err_omi), max(gng_err_bar_v_data$err_omi))) +
  #scale_colour_manual(values = cols) +
  geom_point(stat = "identity", aes(color = power), size = 1, 
             position = position_jitterdodge(jitter.width = .53), show.legend = FALSE) +
  scale_color_manual(values = c("deepskyblue4", "purple4")) + 
  geom_point(stat="summary", fun="mean", position = position_dodge(.9), size = 1.5, show.legend = FALSE) +
  geom_errorbar(data = gng_err_bar_v_data, stat = "summary", funmin = function(x) mean(x) - sd(x)/sqrt(length(x)), 
                funmax = function(x) mean(x) + sd(x)/sqrt(length(x)), size= .5, width= .25, position = position_dodge(.9)) +
  theme_classic() +
  theme(plot.title = element_text(color = "black", size = 14, face = "bold"), 
        plot.subtitle = element_text(color = "black", size = 13), 
        plot.caption = element_text(hjust = 0, size = 12, face = "italic"),
        text = element_text(size = 13))

#--FOR THE HORIZONTAL CUE TYPE-- 

# create data frame with location conditions as characters rather than integers 
  # First, get a subset of the data
  gng_err_bar_h_data = gng_err_analysis[gng_err_analysis$cue == 2, c("location", "power", "gng_err_omi_long")]
    colnames(gng_err_bar_h_data) = c("location", "power", "err_omi")
    rownames(gng_err_bar_h_data) = NULL 
  # reformat location & power as factors  
  gng_err_bar_h_data$location = factor(gng_err_bar_h_data$location, 
                  levels = c(1, 2, 3),
                  labels = c("On Desk", "Pocket/Bag", "Outside"))
  gng_err_bar_h_data$power = factor(gng_err_bar_h_data$power, 
                  levels = c(1, 2),
                  labels = c("ON", "OFF"))
       
# create plot
gng_err_bar_h = ggplot(gng_err_bar_h_data, aes(x = location, y = err_omi, fill = power), show.legend = FALSE) +
  geom_bar(stat = "summary", fun = "mean", position = "dodge") + 
    guides(shape = FALSE) + 
  scale_fill_manual(values = c("cadetblue3", "deepskyblue4")) + 
  labs(title = "Performance on the Go/No-Go Task: 'Horizontal' Cue Type",
       subtitle = "Omission Errors",
       x = "Smartphone Location", 
       y = "Omission Error", 
       fill = "Smartphone Power",
       caption = paste0(strwrap("Figure X. Distributions of scores on the horizontal cue type on the Cue-Dependent Go/No-Go Task, arranged by smartphone location (along the x-axis) and smartphone power (different bars). Omission errors occurred when a participant failed to respond to a non-target stimulus. Coloured dots represent individual data points. Black dots and error bars depict means and standard errors, respectively.", width = 80), collapse = "\n")) +
        # the "\n" here denote that you want a new line formed in the text
  scale_y_continuous(limits = c(min(gng_err_bar_h_data$err_omi), max(gng_err_bar_h_data$err_omi))) +
  #scale_colour_manual(values = cols) +
  geom_point(stat = "identity", aes(color = power), size = 1, 
             position = position_jitterdodge(jitter.width = .53), show.legend = FALSE) +
  scale_color_manual(values = c("deepskyblue4", "purple4")) + 
  geom_point(stat="summary", fun="mean", position = position_dodge(.9), size = 1.5, show.legend = FALSE) +
  geom_errorbar(data = gng_err_bar_h_data, stat = "summary", fun.ymin = function(x) mean(x) - sd(x)/sqrt(length(x)), 
                funmax = function(x) mean(x) + sd(x)/sqrt(length(x)), size= .5, width= .25, position = position_dodge(.9)) +
  theme_classic() +
  theme(plot.title = element_text(color = "black", size = 14, face = "bold"), 
        plot.subtitle = element_text(color = "black", size = 13), 
        plot.caption = element_text(hjust = 0, size = 12, face = "italic"),
        text = element_text(size = 13))

# Show plots
  # for the Vertical cue type
  gng_err_bar_v
No summary function supplied, defaulting to `mean_se()`

  # for the Horizontal cue type
  gng_err_bar_h
No summary function supplied, defaulting to `mean_se()`

  
# export plots
  # for the Vertical cue type
  # ggsave(filename="fig_gng_err_bar_v", plot = gng_err_bar_v, device = "png", height = 5, width = 8, units = "in", dpi = 500)
  # for the Horizontal cue type
  # ggsave(filename="fig_gng_err_bar_h", plot = gng_err_bar_h, device = "png", height = 5, width = 8, units = "in", dpi = 500)
  • Shown as one plot…

# create data frame with location conditions as characters rather than integers 
gng_err_bar_data <- 
  gng_err_anova_data %>% 
  # make location a factor
  mutate(location = factor(location, levels = c(1:3), labels = c("Desk", "Pocket/Bag", "Outside"))) %>% 
  # make power a factor
  mutate(power = factor(power, levels = c(1:2), labels = c("ON", "OFF"))) %>% 
  # make cue a factor
  mutate(cue = factor(cue, levels = c(1:2), labels = c("(A) 'Go' Trials", "(B) 'No-Go' Trials")))

# create plot
gng_err_bar <- 
  ggplot(gng_err_bar_data, aes(x = location, y = err_omi, fill = power), show.legend = FALSE) +
  geom_bar(stat = "summary", fun = "mean", position = "dodge") + 
    guides(shape = FALSE) + 
  scale_fill_manual(values = c("cadetblue3", "deepskyblue4")) + 
  labs(
       title = "Performance on the Go/No-Go Task",
       subtitle = "Omission Errors",
       caption = paste0(strwrap("Figure X. Distributions of scores on the vertical cue type on the Cue-Dependent Go/No-Go Task, arranged by smartphone location (along the x-axis) and smartphone power (different bars). Omission errors occurred when a participant failed to respond to a non-target stimulus. Coloured dots represent individual data points. Black dots and error bars depict means and standard errors, respectively.", width = 80), collapse = "\n"),
       x = "Smartphone Location", 
       y = "Omission Error", 
       fill = "Smartphone Power"
       ) +
        # the "\n" here denote that you want a new line formed in the text
  scale_y_continuous(limits = c(min(gng_err_bar_data$err_omi), max(gng_err_bar_data$err_omi))) +
  #scale_colour_manual(values = cols) +
  geom_point(stat = "identity", aes(color = power), size = 1, 
             position = position_jitterdodge(jitter.width = .53), show.legend = FALSE) +
  scale_color_manual(values = c("deepskyblue4", "purple4")) + 
  geom_point(stat="summary", fun="mean", position = position_dodge(.9), size = 1.5, show.legend = FALSE) +
  geom_errorbar(data = gng_err_bar_data, stat = "summary", funmin = function(x) mean(x) - sd(x)/sqrt(length(x)), 
                funmax = function(x) mean(x) + sd(x)/sqrt(length(x)), size= .5, width= .25, position = position_dodge(.9)) +
  facet_wrap(~cue, ncol = 2) +
  theme_classic() +
  theme(plot.title = element_text(color = "black", size = 14, face = "bold"), 
        plot.subtitle = element_text(color = "black", size = 13), 
        plot.caption = element_text(hjust = 0, size = 12, face = "italic"),
        text = element_text(size = 13))
# show plot
gng_err_bar
No summary function supplied, defaulting to `mean_se()`
No summary function supplied, defaulting to `mean_se()`

# export plots
# ggsave(filename="fig_gng_err_bar_june16", plot = gng_err_bar, device = "jpeg", height = 5, width = 8, units = "in", dpi = 1000)

Accuracy Analysis

A 3(smartphone location: desk, pocket/bag, outside) x 2(smartphone power: ON, OFF) between-subjects ANOVA was completed (DV = average RT). The between-subjects factors were smartphone location and power (click to see code).

Prior to completing the analysis, planned exclusion critera was applied to the data. Exclusion criteria for this analysis helped to control for any participants who did not follow the instructions. Any participants who had a reaction time that was higher than two standard deviations (SD) from the mean reaction time were not included in the final analysis. View the Data Cleaning section under the data Construct Working Dataframes heading above. There was a total of 383 participants in the Go/No-Go Accuracy analysis.

Analysis

To assess if the reaction time statistically differed, we conducted a 3x2 ANOVA with the between-subjects factors of smartphone location (i.e., on desk, in pocket/bag, or outside) and power (i.e., on or off). Since there are unequal sample sizes, we used Type III sum of squares.

Assumptions

The ANOVA made three assumptions: independent random sampling, normality, and homogeneity of variance. Normality was met but there was a slight skew to the right in the histogram and kurtosis was outside of the typical range, however, ANOVA tests are typically robust to violations of normality (Gardner & Tremblay, 2007). (click to see details).

1. Independent Random Sampling

This assumption was met during testing.

2. Normality

This assumption was tested by visualizing the residuals, applying a Shapiro-Wilk test to the residuals, and observing the Skewness and Kurtosis of the data (extracted from the ANOVA output).


# get the required data from the ospan_analysis data: participant, location, power, and score
gng_ac_anova_data <- gng_ac_analysis[, c("participant", "location", "power", "gng_rt_av")]
    colnames(gng_ac_anova_data) = c("participant", "location", "power", "rt_av")
    rownames(gng_ac_anova_data) = NULL
    # reformat location,  power, and balance as factors  
    gng_ac_anova_data$location = factor(gng_ac_anova_data$location, 
                  levels = c(1, 2, 3),
                  labels = c("On Desk", "Pocket/Bag", "Outside"))
    gng_ac_anova_data$power = factor(gng_ac_anova_data$power, 
                  levels = c(1, 2),
                  labels = c("ON", "OFF"))

# run between-subjects ANOVA (IV: Smartphone Power & Location, Balance; DV: OSpan Abolute Score)
gng_ac_anova = ezANOVA(
  data = gng_ac_anova_data
  , dv = .(rt_av)
  , wid = .(participant)
  , between = .(location, power)
  , type = 3 # unequal sample sizes, therfore use type 3
  , detailed = TRUE
  , return_aov = TRUE
)
Warning: Converting "participant" to factor for ANOVA.
Warning: Data is unbalanced (unequal N per group). Make sure you specified a well-considered value for the type argument to ezANOVA().
Coefficient covariances computed by hccm()
# calculate & extract the residuals from the ANOVA
gng_ac_res <- data.frame("residuals" = proj(gng_ac_anova$aov)[,"Residuals"])   
  • It appears that the residuals lie closely to a straight line, which suggests normality (see visualization below).

qqnorm(gng_ac_res$residuals, main = "Q-Q Plot of the Go/No-Go Reaction Time Residuals") 
qqline(gng_ac_res$residuals)

  • It appears that residuals tend to follow the shape of a normal distribution, with a slight skew to the right and some data on the far right (see visualization below).

qplot(gng_ac_res$residuals, main = "Histogram of Go/No-Go Reaction Time Residuals", binwidth = .01) + 
  theme_classic()

  • The Shapiro-Wilk test of normality was significant (based on \(\alpha = 0.05\)), W = 0.86, p < .001 (see code below).

gng_ac_shap = shapiro.test(gng_ac_res$residuals)
gng_ac_shap

    Shapiro-Wilk normality test

data:  gng_ac_res$residuals
W = 0.86237, p-value < 2.2e-16
  • The data’s Skewness and Kurtosis was examined and showed that the Skewness (range: 0.2 to 3.74) was within the acceptable range of \(\pm 2.0\) (Schmider, Ziegler, Danay, Beyer, & Bühner, 2010). For the Kurtosis, the on desk/powered off condition was outside the acceptable range of \(\pm 9.0\) (Schmider, Ziegler, Danay, Beyer, & Bühner, 2010; overall range: -0.6 to 18.3) (see table below).
# use the "gng_ac_des" data frame from the descriptives

# organize the data into a new data frame depicting the Skew, kurtosis, and sample size (n).
  gng_ac_sk_t = rbind((cbind(t(des_task_cont["gng_rt_av1", c("skew", "kurtosis", "n")]),
                             t(des_task_cont["gng_rt_av2", c("skew", "kurtosis", "n")]),
                             t(des_task_cont["gng_rt_av3", c("skew", "kurtosis", "n")]))), 
                      (cbind(t(des_task_cont["gng_rt_av4", c("skew", "kurtosis", "n")]),
                             t(des_task_cont["gng_rt_av5", c("skew", "kurtosis", "n")]),
                             t(des_task_cont["gng_rt_av6", c("skew", "kurtosis", "n")]))))
    colnames(gng_ac_sk_t) = c("On Desk", "In Pocket/Bag", "Outside")
    rownames(gng_ac_sk_t) = rep(c("Skewness", "Kurtosis", "n"), 2)
    
# use kable to show the descriptives table
  kable(gng_ac_sk_t, caption = "Skewness and Kurtosis of the Reaction Time on the Go/No-Go Task", digits = 2, align = 'c') %>%
  kable_styling(bootstrap_options = "striped", "hover", full_width = F) %>%
  add_header_above(c(" ", "Smartphone Location" = 3)) %>%
  pack_rows("Smartphone Power ON", 1, 3) %>% 
  pack_rows("Smartphone Power OFF", 4, 6) %>%
  kable_classic()
Skewness and Kurtosis of the Reaction Time on the Go/No-Go Task
Smartphone Location
On Desk In Pocket/Bag Outside
Smartphone Power ON
Skewness 1.50 0.44 0.20
Kurtosis 3.40 -0.60 -0.30
n 70.00 67.00 59.00
Smartphone Power OFF
Skewness 3.74 0.81 0.76
Kurtosis 18.30 0.12 0.51
n 58.00 65.00 64.00
NA
  • Therefore, based on all the normality assumption checks, normality was assumed to be within the acceptable range for the ANOVA analysis. Although there was a slight skew to the right in the histogram and kurtosis was outside of the typical range, ultimately, ANOVA tests are typically robust to violations of normality (Gardner & Tremblay, 2007).

3. Homogeneity of Variance

This assumption was tested by conducting Levene’s Test for Homogeneity of Variance for each between-subjects variable (i.e. smartphone location and smartphone power).

  • The homogeneity assumption was met (based on \(\alpha = 0.05\)) for smartphone location, F(2, 380) = 0.26, p = .77 and smartphone power, F(1, 381) = 3.52, p = .06 (see code below).

# conduct levene's
  # for location
  gng_ac_lev_loc = leveneTest(data = gng_ac_anova_data, rt_av ~ as.factor(location), center = median)
  gng_ac_lev_loc
Levene's Test for Homogeneity of Variance (center = median)
       Df F value Pr(>F)
group   2  0.2647 0.7675
      380               
  # for power
  gng_ac_lev_pow = leveneTest(data = gng_ac_anova_data, rt_av ~ as.factor(power), center = median)
  gng_ac_lev_pow
Levene's Test for Homogeneity of Variance (center = median)
       Df F value  Pr(>F)  
group   1  3.5235 0.06127 .
      381                  
---
Signif. codes:  0 ‘***’ 0.001 ‘**’ 0.01 ‘*’ 0.05 ‘.’ 0.1 ‘ ’ 1
  • The homogeneity of variance assumption was met. Based on all the normality assumption checks, normality was assumed to be within the acceptable range for the ANOVA analysis. Additionally, it should be noted that ANOVA analyses can be robust to violations of normality (Gardner & Tremblay, 2007).
Results

All results were based on \(\alpha = 0.05\). Below are the reported statistics for the main effects and the interactions, followed by a table depicting the ANOVA results.

Main Effect of Smartphone Location

There was no significant main effect of smartphone location on reaction time, F(2, 377) = 0, p = 1.49, \(\eta^{2}\) = .77.

Main Effect of Smartphone Power

There was no significant main effect of smartphone power on reaction time, F(1, 377) = 0, p = 1.49, \(\eta^{2}\) = .96.

Interaction of Smartphone Location & Smartphone Power

There was no significant interaction between smartphone location and smartphone power on reaction time, F(2, 377) = 0, p = 1.49, \(\eta^{2}\) = .96.

Post-hoc Tests

Since there was a significant three-way interaction (i.e smartphone location, smartphone power, and balance), further analyses were completed (see “Post-Hocs: Simple Two-Way Interactions” below).

# create table from ANOVA results
gng_ac_t <- gng_ac_anova$ANOVA

# show ANOVA results in kable table 
kable(gng_ac_anova$ANOVA, caption = "3x2 ANOVA on Average Reaction Time in the Go/No-Go Task", digits = 3, align = 'c') %>%
  row_spec(0, bold = T) %>% 
  kable_styling(bootstrap_options = "striped", "hover", full_width = F) %>% 
  kable_classic()
3x2 ANOVA on Average Reaction Time in the Go/No-Go Task
Effect DFn DFd SSn SSd F p p<.05 ges
(Intercept) 1 377 1054.389 1.491 266672.599 0.000 * 0.999
location 2 377 0.002 1.491 0.261 0.770 0.001
power 1 377 0.000 1.491 0.002 0.962 0.000
location:power 2 377 0.000 1.491 0.044 0.957 0.000
Visualizations

The following shows a visualization of Accuracy (i.e. reaction time) on the Cue-Dependent Go/No-go Task. There was no significant main effect of smartphone location or smartphone power and no significant interaction between smartphone location and power.

# First, get a subset of the data
gng_ac_bar_data <- 
  gng_ac_anova_data %>% 
  select(-participant) 
# %>% 
#   mutate(location = factor(location, labels = c("On Desk", "Pocket/Bag", "Outside"))) # didnt need this, its already a factor... 
  
# remove row names
rownames(gng_ac_bar_data) <- NULL

# plot bar graph
gng_ac_bar <- 
  ggplot(gng_ac_bar_data, aes(x = location, y = rt_av, fill = power), show.legend = FALSE) +
  geom_bar(stat = "summary", fun = "mean", position = "dodge") + 
    guides(shape = FALSE) + 
  scale_fill_manual(values = c("cadetblue3", "deepskyblue4")) + 
  labs(
       title = "Performance on the Go/No-Go Task \nfor the OSpan First Balance Condition",
       subtitle = "Accuracy (average reaction time)",
       caption = "Note: ...",
       x = "Smartphone Location", 
       y = "Average Reaction Time (sec.)", 
       fill = "Smartphone Power"
       ) +
        # the "\n" here denote that you want a new line formed in the text
  geom_point(stat = "identity", aes(color = power), size = 1, 
             position = position_jitterdodge(jitter.width = .53), show.legend = FALSE) +
  scale_color_manual(values = c("deepskyblue4", "purple4")) + 
  geom_point(stat="summary", fun="mean", position = position_dodge(.9), size = 1.5, show.legend = FALSE) +
  geom_errorbar(data = gng_ac_bar_data, stat = "summary", fun.ymin = function(x) mean(x) - sd(x)/sqrt(length(x)), 
                funmax = function(x) mean(x) + sd(x)/sqrt(length(x)), size= .5, width= .25, position = position_dodge(.9)) +
  theme_classic() +
  theme(plot.title = element_text(color = "black", size = 14, face = "bold"), 
        plot.subtitle = element_text(color = "black", size = 13), 
        plot.caption = element_text(hjust = 0, size = 12, face = "italic"),
        text = element_text(size = 13)) #+
  #scale_y_continuous(limits = c(min(gng_ac_bar_ospan_data$rt_av), max(gng_ac_bar_ospan_data$rt_av)))


# show the figures
gng_ac_bar
No summary function supplied, defaulting to `mean_se()`

  
# export plots
# ggsave(filename="fig_gng_ac_bar_june16", plot = gng_ac_bar, device = "jpeg", height = 5, width = 8, units = "in", dpi = 1000)

3. Testing the Moderation Hypothesis:

Exploratory Factor Analysis of the SAD

To examine the underlying factors measured by the Smartphone Attachment and Dependency Inventory (SAD) from Ward et al. (2017), a principal components factor analysis with a Varimax rotation was completed on participant’s SAD data. The analysis was exploratory and was compared to Ward et al.’s findings. There was a total of 383 participants in the factor analysis.

Checking the Data

To perform the principal components factor analysis, we first examined the data for (1) missing data, (2) correlations between the items, and (3) the factorability of the data.

1. Missing Data
  • From the descriptives table, it seems that there is no missing or incomplete data. This was confirmed by finding the complete cases, which were equal to the total sample size (i.e. 383). Therefore, no data was removed due to missing values for this analysis (see descriptives table below).

# get a data frame with the SAD items only for the factor analysis
fa_data = dplyr::select(factor_analysis, SAD_1:SAD_13)

# find the total complete cases to find any missing data from the data -- any participant with missing data must be removed
fa_complete = sum(complete.cases(fa_data))

# Show descriptives for the factor analysis data
fa_des = describe(fa_data)

# show descriptives in kable table 
kable(describe.by(fa_data), caption = "Descriptives for Items in the SAD", digits = 4, align = 'c', escape = FALSE) %>%
  kable_styling(bootstrap_options = "striped", "hover", full_width = F) %>% 
  kable_classic()
Descriptives for Items in the SAD
vars n mean sd median trimmed mad min max range skew kurtosis se
SAD_1 1 383 4.7415 1.7301 5 4.8534 1.4826 1 7 6 -0.5343 -0.7942 0.0884
SAD_2 2 383 4.1018 1.8412 5 4.1336 1.4826 1 7 6 -0.1719 -1.2594 0.0941
SAD_3 3 383 3.7885 1.8978 4 3.7557 2.9652 1 7 6 0.1022 -1.2704 0.0970
SAD_4 4 383 4.6449 1.7722 5 4.7459 1.4826 1 7 6 -0.5174 -0.8707 0.0906
SAD_5 5 383 4.6188 1.7338 5 4.7003 1.4826 1 7 6 -0.4236 -0.9274 0.0886
SAD_6 6 383 4.6319 1.6556 5 4.7296 1.4826 1 7 6 -0.6095 -0.6438 0.0846
SAD_7 7 383 5.7598 1.1621 6 5.9283 1.4826 1 7 6 -1.2825 2.0867 0.0594
SAD_8 8 383 3.3420 1.7872 3 3.2606 2.9652 1 7 6 0.2816 -1.1532 0.0913
SAD_9 9 383 4.0209 1.7589 4 4.0684 1.4826 1 7 6 -0.1584 -1.0277 0.0899
SAD_10 10 383 4.7337 1.4783 5 4.8664 1.4826 1 7 6 -0.7401 0.0883 0.0755
SAD_11 11 383 4.6319 1.3586 5 4.7524 1.4826 1 7 6 -0.6467 0.2508 0.0694
SAD_12 12 383 4.7050 1.7692 5 4.8208 1.4826 1 7 6 -0.5285 -0.7616 0.0904
SAD_13 13 383 5.2715 1.3746 6 5.4332 1.4826 1 7 6 -1.0673 1.0619 0.0702
NA
2. Correlations between the Items
  • The correlations between the items were examined to determine if they were either not high enough (i.e., r < .30) or too high (i.e., r > .80).

    • The correlations for the 13 items in the SAD ranged from r(381) = .14 to r(381) = .72, p = .01.
    • A visual inspection of the correlations showed that items 12 and 13 might not be correlated enough with the rest of the items (i.e. 11 and 10 correlations below .30, respectively). There were no issues associated with too many high correations for any items.
    • The lower correlations for items 12 and 13 should be kept in mind when completing the factor analysis and these items may be removed.
  • To check for multicollinearity, the determinant of the correlation matrix was calculated. The determinant was greater than the necessary value of 0.00001 (i.e., 0.00576), so, there was no issues with multicollinearity (Field et al., 2012).

  • Therefore, all items in the SAD correlated reasonably with each other and without an excessively large correlation so no items were removed at this stage.


# get the pearson correlations between all items (and corresponding p-values)
fa_cor = rcorr(as.matrix(fa_data), type = "pearson") 

# make a table with the data to display it better
fa_cor_t = as.data.frame(fa_cor$r)
  # include only the lower triangle of the data
  fa_cor_t[upper.tri(fa_cor_t)] = NA
  
# find the determinant of the correlation matrix
fa_det = det(fa_cor$r)

# Print table using kable
  # make any NAs appear as blank within kable
  opts = options(knitr.kable.NA = "")
  # print table
  kable(fa_cor_t, caption = "Pearson Correlations between Items in the SAD", digits = 4, align = 'c', escape = FALSE) %>%
    kable_styling(bootstrap_options = "striped", "hover", full_width = F) %>%
    add_footnote("p < .001", notation = "alphabet") %>% 
    kable_classic()
Pearson Correlations between Items in the SAD
SAD_1 SAD_2 SAD_3 SAD_4 SAD_5 SAD_6 SAD_7 SAD_8 SAD_9 SAD_10 SAD_11 SAD_12 SAD_13
SAD_1 1.0000
SAD_2 0.7150 1.0000
SAD_3 0.5510 0.6010 1.0000
SAD_4 0.5557 0.5727 0.5590 1.0000
SAD_5 0.4357 0.4525 0.4480 0.5249 1.0000
SAD_6 0.4319 0.4074 0.3767 0.4782 0.5365 1.0000
SAD_7 0.2763 0.3063 0.3081 0.3449 0.3988 0.4723 1.0000
SAD_8 0.3546 0.3545 0.2645 0.3889 0.3717 0.3974 0.2791 1.0000
SAD_9 0.3226 0.3906 0.3127 0.3677 0.3889 0.2831 0.3418 0.3733 1.0000
SAD_10 0.3179 0.3428 0.2598 0.3125 0.2708 0.3577 0.3329 0.5230 0.4381 1.0000
SAD_11 0.3826 0.4504 0.3758 0.3620 0.3148 0.3271 0.2721 0.3010 0.5575 0.5637 1.0000
SAD_12 0.2401 0.2302 0.2620 0.2470 0.2329 0.2569 0.2583 0.2986 0.2207 0.2341 0.2455 1.0000
SAD_13 0.1980 0.1390 0.2218 0.2138 0.2643 0.3167 0.2818 0.2200 0.1590 0.2070 0.1826 0.4701 1
a p < .001
    # restore the original options so that "NA" shows up again
    options(opts)
  • Another way to visualize this is using a Scatter Plot Matrix

    • This shows: (1) a scatter plot bellow the lower diagonal, with a line of best fit (red); (2) a historgram and density plot of the item on the diagonal; and (3) the correlation values (pearson r) above the diagonal, with corresponding significance shown with asterisks. The x-axis in each scatter plot represents the column variable and the y-axis the row variable.
# show a SPLOM (Scatter Plot Matrix) of the Data 
pairs.panels(fa_data, pch = ".", density = TRUE, ellipses = FALSE, lm = TRUE, digits = 2, method = "pearson", jiggle = TRUE, factor = 1, hist.col = "cadetblue3", stars = TRUE, gap = .5, verOdd = TRUE, horOdd = TRUE, main = "Scatter Plot Matrix of Items in the SAD")

3. Factorability of the Data
  • This assessed if there were meaningful latent factors to be found within the data, which was assessed in two ways: Barlett’s test of sphericity for correlation adequacy and the Kaiser-Meyer-Olkin measure of sampling adequacy.

    • Barlett’s test of sphericity for correlation adequacy:

      • This test was statistically significant, which suggested good factorability, \(\chi^{2}\)(78) = 1943.16, p < .001
    • Kaiser-Meyer-Olkin measure of sampling adequacy:

      • This measure showed an overall measure of sampling adequacy (MSA) and an MSA per item. The overall MSA was higher than both the minimum and preferred criteria (0.50 and 0.60, respectively; Kaiser, 1974, Hutcheson & Sofroniou, 1999), \(MSA_{overall}\) = 0.88. Each item’s MSA was also higher than the minimum and preferred criteria, \(MSA_{1-13}\) = 0.79 - 0.94.
  • Therefore, both factorability tests confirmed that the planned factor analysis was appropriate.

# run the Barlett Test
fa_bartest = cortest.bartlett(fa_data)
R was not square, finding R from data
fa_bartest
$chisq
[1] 1943.16

$p.value
[1] 0

$df
[1] 78
# run the KMO 
fa_kmo = KMO(fa_data)
fa_kmo
Kaiser-Meyer-Olkin factor adequacy
Call: KMO(r = fa_data)
Overall MSA =  0.88
MSA for each item = 
 SAD_1  SAD_2  SAD_3  SAD_4  SAD_5  SAD_6  SAD_7  SAD_8  SAD_9 SAD_10 SAD_11 SAD_12 SAD_13 
  0.88   0.87   0.92   0.94   0.92   0.90   0.90   0.86   0.88   0.82   0.83   0.83   0.79 

Extract Factors

Now that factorability was confirmed, we found the appropriate number of factors to extract. This was done using: (1) a parallel analysis scree plot, (2) eigen values (i.e., from the parallel analysis), and (3) the factor structure between the two solutions (i.e., using factor diagrams).

1. Parallel Analysis Scree Plot.

This plotted the eigenvalues from the data and a parallel analysis which generated random correlation matrices. The number of factors is then determined by comparing the resulting eigenvalues to the observed data, that is, observed eigenvalues that were higher than the corresponding random eigenvalues were more likely to form meaningful factors.

  • The parallel analysis scree plot showed that there are NA appropriate factors (i.e. using the principal components extraction method).

    • From closer inspection of the plot, the last factor was very close to the random eigenvalue cut-off line, therefore, the 2, 3, and 4 factor solutions were compared to determine the best fit (i.e., the eigen values and factor structures were examined).

fa_par <- fa.parallel(fa_data, fa = "fa", main = "Parallel Analysis Scree Plot for Items in the SAD")
Parallel analysis suggests that the number of factors =  4  and the number of components =  NA 

2. Eigen Values

For a sample size greater than 250 (i.e., 383 complete cases) the average communalities (i.e., the proportion of common variance within the items) for the two factor (0.43) and three factor (0.49) solutions were wihtin the acceptable value (i.e., 0.60; Field et al., 2012). Therefore, Kaiser’s criterion (i.e., a meaningful factor has an eigen value greater than 1) was used for the eigen values. It should be noted that although the average communalities were not very high, the scree plot and factor diagrams were used to choose the appropriate number of factors.

  • The eigen values (unrotated solution) showed three meaningful factors (i.e., 5.34, 1.3, 1.2), which supported the three-factor solution as a better representation of the data.
# Eigen values
  # view eigen values from the parallel analysis scree plot
  round(fa_par$pc.values, 3)
 [1] 5.340 1.303 1.196 0.903 0.763 0.611 0.566 0.479 0.472 0.416 0.377 0.306 0.268
3. Factor Diagrams

A factor diagram for the two-, three-, and four-factor solutions (rotated solution) was used to determine which solution represents the most interpretable solution. The diagrams show the factor loadings for each item on the given factor for which they load the highest on. The items were ordered (i.e., descending) based on these loadings.
Note: The factor diagrams are based on the rotated solutions to better distribute the factor loadings

  • The factor diagrams showed that the two factor solution resulted in most items loading onto only one factor and was not easily interpreted. The four-factor solution showed a 2-item factor, which is not easily interpretable as it did not meet the 3-item minumum (Costello & Osborn, 2005). The three and four factor solutions showed better distributions and were more interpretable. After reviewing the items, the four-factor solution provided a logical structure compared to the three-factor solution. That is, the third factor in the 3-factor solution grouped items 6 & 7 with 12 & 13, however items 6 & 7 ask about emotional associations whereas items 12 & 13 are regarding attention. Although the fourth factor in the four-factor solution only contained 2 items, these items were considered unique in that they are the only items in the SAD which asked participants about attention with respect to their smartphone. Therefore, the four-factor solution was chosen.

Therefore, the three factor solution was chosen to better represent the data.

##-- VARIMAX ROTATION  -- 
# run a factor analysis to compare the two- and three-factor solutions (no rotation)
  # 2 factor solution
  fa_two_v = fa(fa_data, nfactors = 2, fm = "pa", rotate = "varimax", scores = T)
    # change name in loadings for clarity when visualizing the model
    colnames(fa_two_v$loadings) = c("Factor 1", "Factor 2")
  
  # 3 factor solution
  fa_three_v = fa(fa_data, nfactors = 3, fm = "pa", rotate = "varimax", scores = T)
    # change name in loadings for clarity when visualizing the model
    colnames(fa_three_v$loadings) = c("Factor 1", "Factor 2", "Factor 3")
  
  # 4 factor solution
  fa_four_v = fa(fa_data, nfactors = 4, fm = "pa", rotate = "varimax", scores = T)
    # change name in loadings for clarity when visualizing the model
    colnames(fa_four_v$loadings) = c("Factor 1", "Factor 2", "Factor 3", "Factor 4")

# Factor Diagram
  # 2 factor solution
  fa.diagram(fa_two_v, main = "Two Factor Solution for items in the SAD (Varimax Rotation)", sort = TRUE, adj = 1, digits = 2)


  # 3 factor solution
  fa.diagram(fa_three_v, main = "Three Factor Solution for Items in the SAD (Varimax Rotation)", sort = TRUE, adj = 1, digits = 2)

  
  # 4 factor solution
  fa.diagram(fa_four_v, main = "Three Factor Solution for Items in the SAD (Varimax Rotation)", sort = TRUE, adj = 1, digits = 2)

Achieving Simple Structure

Now that we determined the number of factors (i.e., four), we assessed for simple structure (i.e., a final solution of the factor analysis with the simplest solution). Simple structure was achieved by reviewing assessing the loadings for strong loadings (i.e., using a cut-off of > .40) and removing: (1) split variables (i.e., variables that loaded onto more than one factor), (2) non-loading variables, and (3) weak factors (i.e., factor with only 1 variable). Factor loadings are the correlations between each variable and the factor, that is, they represent the relationship (i.e., positive or negative) and strength (i.e., strong, moderate, or weak) between the variable and the factor.

First Solution: 13 SAD Items

First, we looked at the rotated factor loadings (i.e., varimax) for all 13 items in the SAD for the three-factor solution. The loadings are shown with a cut-off of 0.40 and rounded to three decimal places.

    1. Split variables: Item #4 loaded strongly onto factor 1 (0.58) and factor 3 (0.41).
    1. Non-Loading Variables: All items loaded onto at least one factor.
    1. Weak Factors: All factors had at least 2 variables loading strongly, which was discussed earlier.
  • Therefore, since item #46 was split between factor 1 and factor 3, it was removed from the analysis. This new solution was assessed with the same criteria to see if simple structure could be achieved.

# Rotated (i.e., varimax) Factor Loadings 
print(fa_four_v$loadings, cutoff = 0.40, digits = 3)

Loadings:
       Factor 1 Factor 2 Factor 3 Factor 4
SAD_1  0.721                              
SAD_2  0.800                              
SAD_3  0.646                              
SAD_4  0.580             0.405            
SAD_5                    0.567            
SAD_6                    0.650            
SAD_7                    0.469            
SAD_8           0.431                     
SAD_9           0.572                     
SAD_10          0.733                     
SAD_11          0.677                     
SAD_12                            0.659   
SAD_13                            0.647   

               Factor 1 Factor 2 Factor 3 Factor 4
SS loadings       2.399    1.887    1.516    1.094
Proportion Var    0.185    0.145    0.117    0.084
Cumulative Var    0.185    0.330    0.446    0.530
# make data frame to show values
fa_load_v <- as.data.frame(fa_four_v$loadings[1:13,])
Second Solution: 12 SAD Items (removed item 4)

Now, we removed item #4 and looked at the rotated factor loadings (i.e., varimax) for the remaining 12 items in the SAD for the four-factor solution. The loadings are shown with a cut-off of 0.40 and rounded to three decimal places.

    1. Split variables: No items loaded strongly onto more than one factor.
    1. Non-Loading Variables: All items loaded onto at least one factor.
    1. Weak Factors: All factors had at least 2 items loading strongly.
  • Therefore, simple structure was achieved and this final solution was used for further analysis.
# create data set without item # 6
fa_data2 <- 
  fa_data %>% 
  select(-SAD_4)

# run FA with 4 factors and a varimax rotation
  # rotated (varimax)
  fa_four_v2 <- fa(fa_data2, nfactors = 4, fm = "pa", rotate = "varimax", scores = T) 
  # change name in loadings for clarity when visualizing the model
    colnames(fa_four_v2$loadings) = c("Factor 1", "Factor 2", "Factor 3", "Factor 4")

# Rotated (i.e., varimax) Factor Loadings 
print(fa_four_v2$loadings, cutoff = 0.40, digits = 3)

Loadings:
       Factor 1 Factor 2 Factor 3 Factor 4
SAD_1  0.715                              
SAD_2  0.822                              
SAD_3  0.613                              
SAD_5                    0.557            
SAD_6                    0.684            
SAD_7                    0.480            
SAD_8           0.437                     
SAD_9           0.578                     
SAD_10          0.733                     
SAD_11          0.675                     
SAD_12                            0.674   
SAD_13                            0.631   

               Factor 1 Factor 2 Factor 3 Factor 4
SS loadings       2.001    1.858    1.421    1.076
Proportion Var    0.167    0.155    0.118    0.090
Cumulative Var    0.167    0.322    0.440    0.530
# make data frame to show values
fa_load_v2 <- as.data.frame(fa_four_v2$loadings[1:12,])

Describing the Final Solution

Now that we have acheived simple structure, we looked at the (1) Communalities, (2) Variance Accounted for in percentage, and (3) Factor Loadings. This was done using the final solution (i.e., four-factor, varimax rotation, excluding item #4).

(1) Communalities

First. we observed the communalities. Each item’s communality was the percentage of variance that was explained by the retained factors for each item. These ranged from 36.12% to 79.25%, with an average of 52.97 (see table for more details).

# get the communalities from the model
fa_com = as.data.frame(fa_four_v2$communality)
  
# create matrix to present communalities in a table
fa_com_t = rbind((fa_com[,1])*100)
  # rename row and column names
  colnames(fa_com_t) = rownames(fa_com)
  rownames(fa_com_t) = NULL

# Print table using kable
  # print table
  kable(fa_com_t, caption = "Communalities for the Three-Factor Solution for Items in the SAD (%)", digits = 4, align = 'c', escape = FALSE) %>%
    kable_styling(bootstrap_options = "striped", "hover", full_width = F) %>%
    add_footnote("Communalities show the percentage of variance explained by each factor for each item on the SAD.", notation = "alphabet") %>% 
    kable_classic()
Communalities for the Three-Factor Solution for Items in the SAD (%)
SAD_1 SAD_2 SAD_3 SAD_5 SAD_6 SAD_7 SAD_8 SAD_9 SAD_10 SAD_11 SAD_12 SAD_13
62.9032 79.2493 49.7133 50.4155 61.0745 36.1421 36.116 43.8967 61.0182 56.9063 51.6665 46.5197
a Communalities show the percentage of variance explained by each factor for each item on the SAD.
(2) Variance Accounted (%)

Next, we looked at the percentage of variance accounted for by using the eigenvalues (compute the variance associated with the factors). This was done for both the unrotated and rotated (i.e., varimax) final four-factor solution. As you can see in the table below, a total of 52.97% of variance was accounted for by the three extracted factors (the same before and after rotation). This variance was mostly accounted for by factor one in the unrotated solution (factor 1 = 36.65%; factor 2 = 6.86%; factor 3 = 5.9%), but was more equally distributed after rotation (factor 1 = 16.68%; factor 2 = 15.48%; factor 3 = 11.84%; see table for more details).


# Run an unrotated three-factor solution
  fa_four2 <- fa(fa_data2, nfactors = 4, fm = "pa", rotate = "none", scores = T) 
    # change name in loadings for clarity when visualizing the model
    colnames(fa_four2$loadings) = c("Factor 1", "Factor 2", "Factor 3", "Factor 4") 
    
  
# To find the variance accounted for, you can either (1) calculate it from the eigen values for each factor, or (2) grab the calculated values from the model 
  # Both are shown below, but one is commented out to avoid creating excess variables
  # (1) Calculated 
    # For the unrotated solution
    #var_per = 100*fa_four2$values[1:3]/length(fa_four2$values)
    
    # For the rotated solution
    #var_per_v = 100*(as.numeric(fa_four_v2$Vaccounted[1,]))/length(fa_four_v2$values)
      
  # (2) The information is found in the "Vaccounted" table from the output, where "SS loadings" (row 1) shows the eigen values and "Proportion Var" (row 2) shows the variance accounted for.
    # For the unrotated solution
    var_per = 100*as.numeric(fa_four2$Vaccounted[2,])
    
    # For the rotated solution
    var_per_v = 100*as.numeric(fa_four_v2$Vaccounted[2,])

# Create & Show a table with eigen values, percent var explained, and total percent var explained for unrotated & rotated three-factor solutions
    # unrotated solution
    var_per_t = as.data.frame(cbind(rbind(as.numeric(fa_four2$Vaccounted[1,]), var_per), 
                                    rbind(as.numeric(fa_four_v2$Vaccounted[1,]), var_per_v)))
    # name the columns & rows accordingly
    colnames(var_per_t) = c(rep(c("Factor 1", "Factor 2", "Factor 3", "Factor 4"),2))
    rownames(var_per_t) = c("Eigen Values", "Variance Explained (%)")

  # use kable to show the table
  kable(var_per_t, caption = "Eigen Values and Percent of Variance Explained for the Unrotated and Rotated (varimax) for the Three-Factor Solution for Items in the SAD", digits = 4, align = 'c') %>%
  kable_styling(bootstrap_options = "striped", "hover", full_width = T) %>%
  add_header_above(c(" ", "Unrotated" = 4, "Varimax Rotation" = 4)) %>%
    kable_classic()
Eigen Values and Percent of Variance Explained for the Unrotated and Rotated (varimax) for the Three-Factor Solution for Items in the SAD
Unrotated
Varimax Rotation
Factor 1 Factor 2 Factor 3 Factor 4 Factor 1 Factor 2 Factor 3 Factor 4
Eigen Values 4.3977 0.8230 0.7082 0.4274 2.0014 1.8580 1.4209 1.0759
Variance Explained (%) 36.6472 6.8581 5.9014 3.5618 16.6784 15.4836 11.8406 8.9658
NA
(3) Factor Loadings

Lastly, we looked at the factor loadings of the final four-factor solution. Below you can see the rotated (i.e., varimax) and unrotated factor loadings. The loadings are shown with a cut-off of 0.40 and rounded to three decimal places. The unrotated factor loadings are difficult to interpret: all items loaded strongly onto factor 1 (i.e., loading was greater than 0.40). The rotated factor loadings were more easy to interpret (see below for more details).


# Unrotated Factor Loadings 
print(fa_four2$loadings, cutoff = 0.40, digits = 3)

Loadings:
       Factor 1 Factor 2 Factor 3 Factor 4
SAD_1   0.694                             
SAD_2   0.746   -0.455                    
SAD_3   0.634                             
SAD_5   0.645                             
SAD_6   0.661                             
SAD_7   0.534                             
SAD_8   0.567                             
SAD_9   0.590                             
SAD_10  0.616            -0.444           
SAD_11  0.636                             
SAD_12  0.451                             
SAD_13  0.404    0.441                    

               Factor 1 Factor 2 Factor 3 Factor 4
SS loadings       4.398    0.823    0.708    0.427
Proportion Var    0.366    0.069    0.059    0.036
Cumulative Var    0.366    0.435    0.494    0.530
# Rotated (i.e., varimax) Factor Loadings 
print(fa_four_v2$loadings, cutoff = 0.40, digits = 3)

Loadings:
       Factor 1 Factor 2 Factor 3 Factor 4
SAD_1  0.715                              
SAD_2  0.822                              
SAD_3  0.613                              
SAD_5                    0.557            
SAD_6                    0.684            
SAD_7                    0.480            
SAD_8           0.437                     
SAD_9           0.578                     
SAD_10          0.733                     
SAD_11          0.675                     
SAD_12                            0.674   
SAD_13                            0.631   

               Factor 1 Factor 2 Factor 3 Factor 4
SS loadings       2.001    1.858    1.421    1.076
Proportion Var    0.167    0.155    0.118    0.090
Cumulative Var    0.167    0.322    0.440    0.530

Reliability Analyses

In order to validate the SAD, we completed a reliability analysis (i.e., is the scale consistent) by looking at the split-half reliability using Cronbach’s alpha. This method is the most common measure of scale reliability (Field et al., 2012): the data for each participant is randomly split into two sets, which forms a “half” split for the scale. Then, a correlation is calculated between these sets to determine how similar they are: higher reliability would be seen with highly correlated sets. Since the way that the data is split can impact the correlations, Cronbach’s alpha represents an average of all the possible splits using the following equation:

\[\alpha = \frac{N^2 \overline{Cov}}{\sum s_{item}^2 + \sum Cov_{item}} \]

To obtain the best reliability measure, we must divide our data into any subscales, which in this case, are our three factors from our factor analysis. Therefore, we split the data into three sets for the reliability analysis. (click to see code)


# split the data respective to the 3 factors from the FA to perform the reliability analysis
  # factor 1 (items: 1, 2, 3)
  f1_data = fa_data2 %>% select(SAD_1:SAD_3)
  # factor 2 (items: 8, 9, 10, 11)
  f2_data = fa_data2 %>% select(SAD_8:SAD_11)
  # factor 3 (items: 5, 6, 7)
  f3_data = fa_data2 %>% select(SAD_5:SAD_7)
  # factor 4 (items: 12, 13)
  f4_data = fa_data2 %>% select(SAD_12, SAD_13)

Once we have our subscales, we completed the reliability analysis on each subscale. (click to see code)


# compute reliability analysis for each subscale using "alpha()" function in "psych" package
  # factor 1
  f1_rel = psych::alpha(f1_data)
  # factor 2
  f2_rel = psych::alpha(f2_data)
  # factor 3
  f3_rel = psych::alpha(f3_data)
  # factor 4
  f4_rel = psych::alpha(f4_data)

To view the analysis, we broke it down into three components: (1) the alpha values and overall mean and standard deviation of the subscales, (2) alpha drop values (i.e., the alpha values if an item was dropped, and (3) the item statistics.

(1) Alpha Values

The alpha values from the reliability analyses included Cronbach’s \(\alpha\) (i.e., based on the covariances), a standardized \(\alpha\) (i.e., based on the correlations), and Guttman’s Lambda 6. Additionally, the overall average correlation, mean, and standard deviation for the subscale is shown (see table below for more details).

  • As seen in the table, Cronbach’s \(\alpha\) was within the acceptable range (i.e., greater than .70; Kline, 1999) for Factor 1 (\(\alpha_C\) = 0.83), Factor 2 (\(\alpha_C\) = 0.76), and Factor 3 (\(\alpha_C\) = 0.72), but not for Factor 4 (\(\alpha_C\) = 0.63). The overall correlation for Factor 1, r(381) 0.62, was strong. Factor 2, r(381) 0.46, Factor 3, r(381) 0.47, and Factor 4, r(381) 0.47, were moderate. All correlations were positive. These statistics were further explored with dropped items.
# show the alpha values from the 3 subscales
  # create data frame with the information from the 3 reliability analyses
  fa_rel_alphas = as.data.frame(rbind("Factor 1" = dplyr::select(f1_rel$total, -c("S/N", "ase", "median_r")), 
                                      "Factor 2" = dplyr::select(f2_rel$total, -c("S/N", "ase", "median_r")), 
                                      "Factor 3" = dplyr::select(f3_rel$total, -c("S/N", "ase", "median_r")),
                                      "Factor 4" = dplyr::select(f4_rel$total, -c("S/N", "ase", "median_r")))
                                )
  # add an extra column for the "overall" or average alpha
  fa_rel_alphas = cbind("Overall alpha" = c(mean(as.numeric(dplyr::select(f1_rel$total, c("raw_alpha", "std.alpha")))), 
                                            mean(as.numeric(dplyr::select(f2_rel$total, c("raw_alpha", "std.alpha")))), 
                                            mean(as.numeric(dplyr::select(f3_rel$total, c("raw_alpha", "std.alpha")))),
                                            mean(as.numeric(dplyr::select(f4_rel$total, c("raw_alpha", "std.alpha"))))),
                        fa_rel_alphas)

  # use kable to show the table
  kable(fa_rel_alphas, caption = "Reliability Analysis for Subscales in the SAD: Alpha Values", digits = 4, align = 'c', 
        col.names = c("Overall alpha", "Cronbach's alpha", "Standardized alpha", "Guttman's lambda 6", "Average r", "M", "SD")) %>%
  kable_styling(bootstrap_options = "striped", "hover", full_width = T) %>% 
    kable_classic()
Reliability Analysis for Subscales in the SAD: Alpha Values
Overall alpha Cronbach's alpha Standardized alpha Guttman's lambda 6 Average r M SD
Factor 1 0.8309 0.8301 0.8317 0.7770 0.6223 4.2106 1.5762
Factor 2 0.7676 0.7624 0.7727 0.7438 0.4594 4.1821 1.2273
Factor 3 0.7217 0.7173 0.7261 0.6449 0.4692 5.0035 1.2294
Factor 4 0.6327 0.6259 0.6395 0.4701 0.4701 4.9883 1.3515
NA
(2) Alpha Drop Values

To further explore the reliability of each subscale, we looked at the alpha values after removing each item. This allowed us to determine whether a given item was reducing the reliability of the subscale. That is, if dropping an item increased \(\alpha\) or the correlation, then removing the item from the analysis should be considered. The data presented in the table below shows Cronbach’s \(\alpha\), standardized \(\alpha\), Guttman’s Lambda 6, and the average correlation.

  • As seen in the table below, dropping an item did not improve reliability for Factor 1 (range of \(\alpha_C\) = 0.71 - 0.83), Factor 2 (range of \(\alpha_C\) = 0.66 - 0.75), Factor 3 (range of \(\alpha_C\) = 0.54 - 0.7), and Factor 4 (range of \(\alpha_C\) = 0.37 - 0.61).
  • With respect to the average correlation, dropping an item only showed a substantially improvement for Factor 1, where dropping item 3 resulted in a stronger correlation, r(381) = 0.72. This was explored further with the items statistics.
# show the alpha drop values from the 3 subscales
  # create data frame with the information
  fa_rel_adrop = as.data.frame(rbind(dplyr::select(f1_rel$alpha.drop, c("raw_alpha", "std.alpha", "G6(smc)", "average_r")), 
                                     dplyr::select(f2_rel$alpha.drop, c("raw_alpha", "std.alpha", "G6(smc)", "average_r")), 
                                     dplyr::select(f3_rel$alpha.drop, c("raw_alpha", "std.alpha", "G6(smc)", "average_r")), 
                                     dplyr::select(f4_rel$alpha.drop, c("raw_alpha", "std.alpha", "G6(smc)", "average_r"))
                                     ))

  # use kable to show the table
  kable(fa_rel_adrop, caption = "Reliability Analysis for Subscales in the SAD: Alpha Drop Values", digits = 4, align = 'c', 
        col.names = c("Cronbach's alpha", "Standardized alpha", "Guttman's lambda 6", "Average r")) %>%
  kable_styling(bootstrap_options = "striped", "hover", full_width = T) %>%
  pack_rows(index = c("Factor 1" = ncol(f1_data), "Factor 2" = ncol(f2_data), "Factor 3" = ncol(f3_data), "Factor 4" = ncol(f4_data))) %>% 
    kable_classic()
Reliability Analysis for Subscales in the SAD: Alpha Drop Values
Cronbach's alpha Standardized alpha Guttman's lambda 6 Average r
Factor 1
SAD_1 0.7506 0.7508 0.6010 0.6010
SAD_2 0.7085 0.7105 0.5510 0.5510
SAD_3 0.8329 0.8338 0.7150 0.7150
Factor 2
SAD_8 0.7543 0.7645 0.6912 0.5198
SAD_9 0.7098 0.7208 0.6617 0.4626
SAD_10 0.6648 0.6764 0.6028 0.4106
SAD_11 0.6987 0.7062 0.6228 0.4448
Factor 3
SAD_5 0.6151 0.6416 0.4723 0.4723
SAD_6 0.5389 0.5702 0.3988 0.3988
SAD_7 0.6978 0.6983 0.5365 0.5365
Factor 4
SAD_12 0.6050 0.4701 0.2210 0.4701
SAD_13 0.3652 0.4701 0.2210 0.4701
NA
(3) Item Statistics

The item statistics were explored to determine whether a certain item did not correlate with the overall subscale. The table below shows: r, which is the correlations between each item and the total score from the subscale (i.e., the item-total correlations), which includes the item itself; and Drop, which is the correlation of an item with the subscale after removing the item (i.e., the item-rest correlation or the corrected item-total correlation). Additionally, the sample size, mean, and standard deviation is shown.

  • No issues were found with the item statistics for Factor 1, r(381) = .62 - = .74, Factor 2, r(381) = .49 - = .65, Factor 3, r(381) = .5 - = .61, and Factor 4, r(381) = .47 - = .47. That is, no items had a item-rest correlation (i.e., the drop column in the table below) below the cut-off (i.e., .30; Field et al., 2012).

# show the correlations for the 3 subscales
  # create data frame with the information
  fa_rel_item = as.data.frame(rbind(dplyr::select(f1_rel$item.stats, -c("std.r", "r.cor")), 
                                    dplyr::select(f2_rel$item.stats, -c("std.r", "r.cor")), 
                                    dplyr::select(f3_rel$item.stats, -c("std.r", "r.cor")),
                                    dplyr::select(f4_rel$item.stats, -c("std.r", "r.cor"))
                                    ))

  # use kable to show the table
  kable(fa_rel_item, caption = "Reliability Analysis for Subscales in the SAD: Item Statistics", digits = 4, align = 'c', 
        col.names = c("N", "r", "Drop", "M", "SD")) %>%
  kable_styling(bootstrap_options = "striped", "hover", full_width = T) %>%
  pack_rows(index = c("Factor 1" = ncol(f1_data), "Factor 2" = ncol(f2_data), "Factor 3" = ncol(f3_data), "Factor 4" = ncol(f4_data)))  %>% 
    kable_classic()
Reliability Analysis for Subscales in the SAD: Item Statistics
N r Drop M SD
Factor 1
SAD_1 383 0.8654 0.7061 4.7415 1.7301
SAD_2 383 0.8922 0.7440 4.1018 1.8412
SAD_3 383 0.8370 0.6228 3.7885 1.8978
Factor 2
SAD_8 383 0.7386 0.4857 3.3420 1.7872
SAD_9 383 0.7804 0.5595 4.0209 1.7589
SAD_10 383 0.8045 0.6466 4.7337 1.4783
SAD_11 383 0.7559 0.5905 4.6319 1.3586
Factor 3
SAD_5 383 0.8366 0.5560 4.6188 1.7338
SAD_6 383 0.8499 0.6056 4.6319 1.6556
SAD_7 383 0.7146 0.4959 5.7598 1.1621
Factor 4
SAD_12 383 0.8936 0.4701 4.7050 1.7692
SAD_13 383 0.8162 0.4701 5.2715 1.3746
NA

Overall, the reliability for Factor 1, Factor 2, Factor 3, and Factor 4 was adequate. Although Factor 1 showed mixed results for item 3 (i.e., item was not as stable for alpha), Factor 1 was stable for all other comparisons/measures, therefore item 3 was not removed from the subscale. Similarly, Factor 4 was inspected carefully since it was a two-tem factor. These results suggets that it is a stable factor and so it was included in further analyses.

Conclusions

The Present Study

A principal axis factor analysis was used to analyze the underlying factors in the 13-item SAD inventory (Ward et al., 2017) using an orthogonal rotation (i.e., varimax rotation). Factorability of the data was confirmed using: (1) Bartlett’s test for correlation adequacy, \(\chi^{2}\)(78) = 1943.16, p < .001, which confirmed that correlations between the items were sufficiently large; and (2) the Kaiser-Meyer-Olkin measure for sampling adequacy (MSA), \(MSA_{overall}\) = 0.88, \(MSA_{1-13}\) = 0.79 - 0.94, which confirmed that both the overall MSA and each item’s MSA were above their required criteria (0.60 and 0.77, respectively; Kaiser, 1974). The appropriate number of factors (i.e., four) was confirmed using (1) a parallel analysis scree plot, where the two- , three-, and four-factor solutions were chosen to compare for the best fit for the data; (2) the eigen values (e.g., 5.34, 1.303, 1.196, 0.903, for factors 1 to 4, respectively) for the unrotated solution, where three meaningful factors were found using Kaiser’s criterion (i.e., eigen values greater than one); and (3) the factor structure of the two-, three-, and four-factor solutions, where the four-factor solution showed a better distribution of the data and was more interpretable. Therefore, the four-factor solution was chosen for the final analysis and an orthogonal (i.e., varimax) rotation was chosen for the final analysis. After an initial four-factor solution was completed using all 13 items in the SAD, however, item 4 was split-loaded (i.e., between factor 1 and 3). Therefore, this item was dropped and a final four-factor solution was completed with the remaining 12 items in the SAD. This final solution achieved simple structure and was used for subsequent analyses.

The for each reliability of the four factors was assessed using Cronbach’s \(\alpha\), which was within the acceptable range (i.e., greater than .70; Kline, 1999) for Factor 1 (\(\alpha_C\) = 0.83), Factor 2 (\(\alpha_C\) = 0.76), and Factor 3 (\(\alpha_C\) = 0.72), but not for Factor 4 (\(\alpha_C\) = 0.63). Therefore, the reliability for Factor 1, Factor 2, and Factor 3 was adequate. Factor 1 did show a less stable item (i.e., 3), which showed a small improvement in correlation when dropped (from 0.62 to 0.72 ). Factor 4 was a weaker factor with only two items. Overall, the four factors all showed reliability and stability. Each was used as a separate subscale for following analyses. The mean scores for each factor were: Factor 1 M = 12.63 (SD = 4.73), Factor 2 M = 16.73 (SD = 4.91), Factor 3 M = 9.98 (SD = 2.7), and Factor 4 M = 6.25 (SD = 1.03). The model also showed a moderate fit: the Root Mean Square of the Residual (RMSR; 0.03). Further descriptive statistics can be found in the Descriptive Statistics section above.

The items in the final solution’s factors explained 52.97% of the variance and suggested the following four factors: dependence, emotional attachment, accessibility, and distractibility. Dependence was relate to the degree of dependence on one’s smartphone, it consisted of 3 items (i.e., SAD 1, SAD 2, and SAD 3) and explained 16.68% of the variance (e.g., “I would have trouble getting through a normal day without my smartphone.”, and “I feel like I could not live without my smartphone.”). Emotional attachment was related to one’s smartphone use for emotional support, it consisted of 4 items (i.e, SAD 8, 9, 10, and 11) and explained 15.48% of the variance (e.g., “I feel lonely when my smartphone does not ring or vibrate for several hours.”, and “Using my smartphone makes me feel happy.”). Accessibility was related to the ability to acees the utility of one’s phone (e.g., powered on, internet access), it consisted of 3 items (i.e., SAD 5, 6, & 7) and explained 11.84% of the variance (e.g., “It drives me crazy when my smartphone runs out of battery.”, and “I feel impatient when the Internet connection speed on my smartphone is slow.”). Distractibility was related to one’s smartphone retaining one’s attention, it consisted of 2 items and explained 8.97% of the variance (e.g., “I find it tough to focus whenever my smartphone is nearby.”, and “I become less attentive to my surroundings when I’m using my smartphone.”). One item did not clearly load onto any one factor and was excluded from further analyses (i.e., item 4; Costello & Osborne, 2005). The tables below depict the final and initial (i.e., including item 4) four-factor solutions. three

# create table with all items, factor loadings, eigen values, % of variance explained, and Cronbach's alpha 

# get factor loadings
  fa_load <- as.data.frame(fa_four_v2$loadings[1:12,])
  # change row names
    # row.names(fa_load) <- c(1:nrow(fa_load))
    
  # reorder loadings to match final table
    fa_load <- 
      fa_load %>%
      # add temporary column used to reorder
      mutate(item = c(row.names(fa_four_v2$loadings[1:12,]))) %>% 
      # reorder using slice & match
      slice(match(c("SAD_1", "SAD_2", "SAD_3", "SAD_8", "SAD_9", "SAD_10", "SAD_11", "SAD_5", "SAD_6", "SAD_7", "SAD_12", "SAD_13"), item)) %>% 
      # remove temp column
      select(-item)

# make list of the items & other info
  fa_table_list <- c("I would have trouble getting through a normal day without my smartphone.",
                    "It would be painful for me to give up my smartphone for a day.",
                    "I feel like I could not live without my smartphone.",
                    "I feel lonely when my smartphone does not ring or vibrate for several hours.",
                    "Using my smartphone relieves me of my stress.",
                    "I feel excited when I have a new message or notification.",
                    "Using my smartphone makes me feel happy.",
                    "It drives me crazy when my smartphone runs out of battery.",
                    "I am upset and annoyed when I find I do not have reception on my smartphone.",
                    "I feel impatient when the Internet connection speed on my smartphone is slow.",
                    "I find it tough to focus whenever my smartphone is nearby.",
                    "I become less attentive to my surroundings when I’m using my smartphone.",
                    "Eigen Values", 
                    "Variance Explained (%)")
  
# create the final table
  # bind the loadings and items
    fa_four_t = cbind(c(1:3, 8:11, 5:7, 12, 13, "", ""), fa_table_list, rbind(fa_load, var_per_t[5:8]))
    
    # rename columns & rows
      colnames(fa_four_t) = c("item_num", "item_desc", "f1", "f2", "f3", "f4")
      rownames(fa_four_t) = 1:nrow(fa_four_t)
      
# format cells
# fa_four_t$f1 <- cell_spec(fa_four_t$f1, bold = ifelse(fa_four_t$f1 > 0.40, T, F))
      
# use kable to show the table
table_fa_four_t <- 
kable(fa_four_t, caption = "Final Solution: Summary of exploratory varimax rotated factor analysis results for the 13-Items in the SAD", 
      digits = 2, align = "llcccc", col.names = c("", "", "1", "2", "3", "4")) %>%
    kable_styling(bootstrap_options = "striped", "hover", full_width = F) %>%
    add_header_above(c(" ", " ", "Factor Loading" = 4), bold = T) %>% 
    pack_rows("Factor 1: Dependence", 1, 3) %>% 
    pack_rows("Factor 2: Emotional Attachment", 4, 7) %>% 
    pack_rows("Factor 3: Accessibility", 8, 10) %>% 
    pack_rows("Factor 4: Distractibility", 11, 12) %>% 
    pack_rows("", 13, 14) %>% 
    footnote(general = "Values represent rotated factor loadings from items in the Smartphone Attachment and Dependency Inventory (Ward et al., 2017).
             Factor loadings have been sorted based on strong loading (> 0.40). Item #4 was split-loaded and removed from the final solution.", 
             symbol = c("N = 383")) %>% 
    kable_classic()

# show table
table_fa_four_t
Final Solution: Summary of exploratory varimax rotated factor analysis results for the 13-Items in the SAD
Factor Loading
1 2 3 4
Factor 1: Dependence
1 I would have trouble getting through a normal day without my smartphone. 0.72 0.21 0.24 0.12
2 It would be painful for me to give up my smartphone for a day. 0.82 0.28 0.19 0.05
3 I feel like I could not live without my smartphone. 0.61 0.18 0.24 0.17
Factor 2: Emotional Attachment
8 I feel lonely when my smartphone does not ring or vibrate for several hours. 0.18 0.44 0.32 0.19
9 Using my smartphone relieves me of my stress. 0.24 0.58 0.20 0.09
10 I feel excited when I have a new message or notification. 0.10 0.73 0.21 0.13
11 Using my smartphone makes me feel happy. 0.30 0.67 0.09 0.11
Factor 3: Accessibility
5 It drives me crazy when my smartphone runs out of battery. 0.37 0.20 0.56 0.14
6 I am upset and annoyed when I find I do not have reception on my smartphone. 0.26 0.21 0.68 0.19
7 I feel impatient when the Internet connection speed on my smartphone is slow. 0.15 0.25 0.48 0.21
Factor 4: Distractibility
12 I find it tough to focus whenever my smartphone is nearby. 0.15 0.18 0.09 0.67
13 I become less attentive to my surroundings when I’m using my smartphone. 0.06 0.08 0.24 0.63
Eigen Values 2.00 1.86 1.42 1.08
Variance Explained (%) 16.68 15.48 11.84 8.97
Note:
Values represent rotated factor loadings from items in the Smartphone Attachment and Dependency Inventory (Ward et al., 2017).
Factor loadings have been sorted based on strong loading (> 0.40). Item #4 was split-loaded and removed from the final solution.
* N = 383

#export table
#install PhantomJS using: "webshot::install_phantomjs()"
# save_kable(table_fa_four_t, "table_fa_four_t.pdf")



## INITIAL SOLUTION -- for reference not the final solution

# use initial rotated 4-factor solution from above -- "fa_four_v"
    
# To find the eigenvalues & variance accounted for, you can either grab the calculated values from the model 
    # For the initial 4-factor solution
    var_ev_per_in <- data.frame(rbind(fa_four_v$e.values[1:4],  #eigen values 
                       100*as.numeric(fa_four_v$Vaccounted[2,]) # variance accounted for, in percentage
                       ))
    colnames(var_ev_per_in) = c("Factor 1", "Factor 2", "Factor 3", "Factor 4")
    rownames(var_ev_per_in) = c("Eigen Values", "Variance Explained (%)")

# create table with all items, factor loadings, eigen values, % of variance explained, and Cronbach's alpha 

# get factor loadings
  fa_load_in = as.data.frame(fa_four_v$loadings[1:13,])

# make list of the items & other info
  fa_table_list_in = data.frame("details" = c("I would have trouble getting through a normal day without my smartphone.",
                                     "It would be painful for me to give up my smartphone for a day.",
                                     "I feel like I could not live without my smartphone.",
                                     "If I forgot to bring my smartphone with me, I would feel anxious.",
                                     "It drives me crazy when my smartphone runs out of battery.",
                                     "I am upset and annoyed when I find I do not have reception on my smartphone.",
                                     "I feel impatient when the Internet connection speed on my smartphone is slow.",
                                     "I feel lonely when my smartphone does not ring or vibrate for several hours.",
                                     "Using my smartphone relieves me of my stress.",
                                     "I feel excited when I have a new message or notification.",
                                     "Using my smartphone makes me feel happy.",
                                     "I find it tough to focus whenever my smartphone is nearby.",
                                     "I become less attentive to my surroundings when I’m using my smartphone.", 
                                     "Eigen Values", 
                                     "Variance Explained (%)"))
  
# create the final table
  # bind the loadings and items
    fa_four_t_in = cbind(fa_table_list_in, rbind(fa_load_in, var_ev_per_in))
    
    # rename columns & rows
      colnames(fa_four_t_in) = c("Item", "Factor 1", "Factor 2", "Factor 3", "Factor 4")
      rownames(fa_four_t_in) = c(1:13, "", " ")
      
# use kable to show the table
  kable(fa_four_t_in, caption = "First Solution: Summary of exploratory varimax rotated factor analysis results for the 13-Items in the SAD.", digits = 2, align = "lccc") %>%
    kable_styling(bootstrap_options = "striped", "hover", full_width = F) %>%
    pack_rows("", 14, 15) %>% 
    footnote(general = "The initial solution is shown, including item #4, which was dropped as a split variable between factor 1 and factor 3.", symbol = "N = 383") %>% 
    kable_classic()
First Solution: Summary of exploratory varimax rotated factor analysis results for the 13-Items in the SAD.
Item Factor 1 Factor 2 Factor 3 Factor 4
1 I would have trouble getting through a normal day without my smartphone. 0.72 0.22 0.21 0.12
2 It would be painful for me to give up my smartphone for a day. 0.80 0.29 0.16 0.06
3 I feel like I could not live without my smartphone. 0.65 0.17 0.23 0.17
4 If I forgot to bring my smartphone with me, I would feel anxious. 0.58 0.21 0.41 0.12
5 It drives me crazy when my smartphone runs out of battery. 0.39 0.19 0.57 0.14
6 I am upset and annoyed when I find I do not have reception on my smartphone. 0.28 0.21 0.65 0.20
7 I feel impatient when the Internet connection speed on my smartphone is slow. 0.16 0.25 0.47 0.22
8 I feel lonely when my smartphone does not ring or vibrate for several hours. 0.19 0.43 0.33 0.19
9 Using my smartphone relieves me of my stress. 0.25 0.57 0.21 0.09
10 I feel excited when I have a new message or notification. 0.11 0.73 0.21 0.14
11 Using my smartphone makes me feel happy. 0.31 0.68 0.08 0.11
12 I find it tough to focus whenever my smartphone is nearby. 0.15 0.17 0.09 0.66
13 I become less attentive to my surroundings when I’m using my smartphone. 0.06 0.08 0.23 0.65
Eigen Values 5.34 1.30 1.20 0.90
Variance Explained (%) 18.45 14.52 11.66 8.41
Note:
The initial solution is shown, including item #4, which was dropped as a split variable between factor 1 and factor 3.
* N = 383
NA
Ward et al. (2017)

Ward et al. (2017) found two factors: smartphone dependence (i.e. six items; items #1-6; 31.02% of the variance) and emotional attachment (i.e. five items; items #8-11; 21.65% of the variance), which explained 52.67% of the total variance. Two items (i.e. item #7 and 13) were dropped from their subsequent analyses since they did not strongly load onto a factor (see Web Appendix Table 1 below).


Note: this table is from Ward et al. (2017) Web Appendix. Items are listed in order (i.e., 1-13). Only strongly-loaded values are shown.

Comparing Findings

The present study’s final factor solution showed some similarities with Ward et al.’s solution:

  • Factor 1 contained items # 1, 2, 3, 4, and 5
  • Factor 2 contained items # 8, 9, 10, and 11

Some differences between the solutions:

  • Only one item (i.e., item #4 due to split-loading) was dropped in the present study compared to the two items dropped in Ward et al. (i.e., items 7 due to split-loading to and 13 due to being in a weak, single-item factor).
  • Factor 3 did have more than one item load strongly (items 5, 6, 7) and was not dropped.
  • Four factors were in the final solution in the present study.

Overall, the analyses were very similar. We compared the total variances explained and found that both the initial (53.04%) and final (52.97%) solution in the present study explained more total variance than the Ward et al. (2017) solution (52.67%). Comparing the reliability tests showed that Ward et al.’s subscales (\(\alpha_{Factor1}\) = 0.89; \(\alpha_{Factor2}\) = 0.79) had slightly larger values than the resent study (\(\alpha_{Factor1}\) = 0.83; \(\alpha_{Factor2}\) = 0.76; \(\alpha_{Factor3}\) = 0.72; \(\alpha_{Factor4}\) = 0.63).

Correlations: SAD Subscales vs. OSpan

# set up the data & run correlation
SAD_corr <- 
  all_data %>% 
  select(ospan_absolute_score, gng_err_omi, gng_rt_av , SAD_score, SAD_dep_score, SAD_ea_score, SAD_access_score, SAD_dist_score, age) %>% 
  as.matrix() %>% 
  rcorr(type = "pearson")

# create new SAD_corr to shown only lower triangle... 
SAD_corr2 <- SAD_corr
# round to 4 decimals... 
SAD_corr2$r <- round(SAD_corr2$r, 4)
SAD_corr2$P <- round(SAD_corr2$P, 4)
SAD_corr2$n <- round(SAD_corr2$n, 4)
# remove upper triangle form r, p, and n
SAD_corr2$r[upper.tri(SAD_corr2$r)] <- "-"
SAD_corr2$P[upper.tri(SAD_corr2$P)] <- "-"
SAD_corr2$n[upper.tri(SAD_corr2$n)] <- "-"

# show corr table with flattenCorr

kable(flattenCorrMatrix(SAD_corr$r, SAD_corr$P), caption = "Correlations between SAD Score and Age with OSpan Performance") %>%
  footnote(general = "SAD overall and Smartphone Dependency, Emotional Attachment, Accessibility, and Distractibility shown. ") %>% 
  kable_styling(bootstrap_options = c("hover", "striped"), full_width = F) %>% 
  kable_classic()
Correlations between SAD Score and Age with OSpan Performance
row column cor p
ospan_absolute_score gng_err_omi -0.0804066 0.1161850
ospan_absolute_score gng_rt_av -0.0236680 0.6442656
gng_err_omi gng_rt_av 0.0749570 0.1431385
ospan_absolute_score SAD_score -0.0068892 0.8930979
gng_err_omi SAD_score -0.0969179 0.0580945
gng_rt_av SAD_score 0.0488313 0.3405428
ospan_absolute_score SAD_dep_score 0.0451542 0.3781849
gng_err_omi SAD_dep_score -0.0877913 0.0861970
gng_rt_av SAD_dep_score 0.0738880 0.1489460
SAD_score SAD_dep_score 0.8250672 0.0000000
ospan_absolute_score SAD_ea_score -0.0746627 0.1447200
gng_err_omi SAD_ea_score -0.0673728 0.1882756
gng_rt_av SAD_ea_score 0.0519168 0.3108716
SAD_score SAD_ea_score 0.8095738 0.0000000
SAD_dep_score SAD_ea_score 0.5135457 0.0000000
ospan_absolute_score SAD_access_score -0.0244905 0.6327969
gng_err_omi SAD_access_score -0.0457629 0.3717835
gng_rt_av SAD_access_score 0.0159967 0.7549969
SAD_score SAD_access_score 0.8055359 0.0000000
SAD_dep_score SAD_access_score 0.5607329 0.0000000
SAD_ea_score SAD_access_score 0.5338376 0.0000000
ospan_absolute_score SAD_dist_score 0.0422234 0.4099421
gng_err_omi SAD_dist_score -0.0890381 0.0818112
gng_rt_av SAD_dist_score 0.0118260 0.8175549
SAD_score SAD_dist_score 0.5604164 0.0000000
SAD_dep_score SAD_dist_score 0.2946769 0.0000000
SAD_ea_score SAD_dist_score 0.3406335 0.0000000
SAD_access_score SAD_dist_score 0.3810652 0.0000000
ospan_absolute_score age -0.0798192 0.1188839
gng_err_omi age -0.0698051 0.1727843
gng_rt_av age 0.0372320 0.4675230
SAD_score age -0.1713676 0.0007579
SAD_dep_score age -0.1407677 0.0057867
SAD_ea_score age -0.1580994 0.0019125
SAD_access_score age -0.1891536 0.0001966
SAD_dist_score age 0.0114289 0.8235785
Note:
SAD overall and Smartphone Dependency, Emotional Attachment, Accessibility, and Distractibility shown.

# print tables using kable
kable(as.data.frame(format(SAD_corr2$r, scientific = FALSE)), caption = "Correlations between SAD Score and Age with OSpan Performance: r values") %>%
  footnote(general = "SAD overall and Smartphone Dependency, Emotional Attachment, Accessibility, and Distractibility shown. ") %>% 
  kable_styling(bootstrap_options = c("hover", "striped"), full_width = F) %>% 
  kable_classic()
Correlations between SAD Score and Age with OSpan Performance: r values
ospan_absolute_score gng_err_omi gng_rt_av SAD_score SAD_dep_score SAD_ea_score SAD_access_score SAD_dist_score age
ospan_absolute_score 1 - - - - - - - -
gng_err_omi -0.0804 1 - - - - - - -
gng_rt_av -0.0237 0.075 1 - - - - - -
SAD_score -0.0069 -0.0969 0.0488 1 - - - - -
SAD_dep_score 0.0452 -0.0878 0.0739 0.8251 1 - - - -
SAD_ea_score -0.0747 -0.0674 0.0519 0.8096 0.5135 1 - - -
SAD_access_score -0.0245 -0.0458 0.016 0.8055 0.5607 0.5338 1 - -
SAD_dist_score 0.0422 -0.089 0.0118 0.5604 0.2947 0.3406 0.3811 1 -
age -0.0798 -0.0698 0.0372 -0.1714 -0.1408 -0.1581 -0.1892 0.0114 1
Note:
SAD overall and Smartphone Dependency, Emotional Attachment, Accessibility, and Distractibility shown.
  

kable(as.data.frame(format(SAD_corr2$P, scientific = FALSE)), caption = "Correlations between SAD Score and Age with OSpan Performance: p-values") %>%
  footnote(general = "SAD overall and Smartphone Dependency, Emotional Attachment, Accessibility, and Distractibility shown. ") %>% 
  kable_styling(bootstrap_options = c("hover", "striped"), full_width = F) %>% 
  kable_classic()
Correlations between SAD Score and Age with OSpan Performance: p-values
ospan_absolute_score gng_err_omi gng_rt_av SAD_score SAD_dep_score SAD_ea_score SAD_access_score SAD_dist_score age
ospan_absolute_score NA - - - - - - - -
gng_err_omi 0.1162 NA - - - - - - -
gng_rt_av 0.6443 0.1431 NA - - - - - -
SAD_score 0.8931 0.0581 0.3405 NA - - - - -
SAD_dep_score 0.3782 0.0862 0.1489 0 NA - - - -
SAD_ea_score 0.1447 0.1883 0.3109 0 0 NA - - -
SAD_access_score 0.6328 0.3718 0.755 0 0 0 NA - -
SAD_dist_score 0.4099 0.0818 0.8176 0 0 0 0 NA -
age 0.1189 0.1728 0.4675 8e-04 0.0058 0.0019 2e-04 0.8236 NA
Note:
SAD overall and Smartphone Dependency, Emotional Attachment, Accessibility, and Distractibility shown.

kable(as.data.frame(format(SAD_corr2$n, scientific = FALSE)), caption = "Correlations between SAD Score and Age with OSpan Performance: n values") %>%
  footnote(general = "SAD overall and Smartphone Dependency, Emotional Attachment, Accessibility, and Distractibility shown. ") %>% 
  kable_styling(bootstrap_options = c("hover", "striped"), full_width = F) %>% 
  kable_classic()
Correlations between SAD Score and Age with OSpan Performance: n values
ospan_absolute_score gng_err_omi gng_rt_av SAD_score SAD_dep_score SAD_ea_score SAD_access_score SAD_dist_score age
ospan_absolute_score 383 - - - - - - - -
gng_err_omi 383 383 - - - - - - -
gng_rt_av 383 383 383 - - - - - -
SAD_score 383 383 383 383 - - - - -
SAD_dep_score 383 383 383 383 383 - - - -
SAD_ea_score 383 383 383 383 383 383 - - -
SAD_access_score 383 383 383 383 383 383 383 - -
SAD_dist_score 383 383 383 383 383 383 383 383 -
age 383 383 383 383 383 383 383 383 383
Note:
SAD overall and Smartphone Dependency, Emotional Attachment, Accessibility, and Distractibility shown.

# does not fit nicely on page, possibly change later...
corrplot(SAD_corr$r, method = "circle", col = (colorRampPalette(c("purple", "grey", "blue"))(50)),
         type = "upper",
         # addCoef.col = "black", # Add coefficient of correlation
         tl.col = "darkblue", tl.srt = 90, tl.cex = .8, #Text label color and rotation
         # Combine with significance level
         p.mat = SAD_corr$P, sig.level = 0.05,
         addgrid.col = "white",
         # insig = "blank",
         insig = "pch", pch = 10, pch.col = "red", pch.cex = .1, # add this instead of insig above to denot insig p values with red dot
         # hide correlation coefficient on the principal diagonal
         diag = FALSE,
         win.asp = 1
         )

Exploratory Moderation Analyses

Potential moderators of OSpan performance were explored. These included both (1) continuous and (2) categorical moderators. Note: These analysis were completed regardless of the results for the location and power hypotheses.

1. Continuous Moderators

These variables were examined using a general linear model for each potential moderator: DV = OSpan Absolute Score, IV: Smartphone Location (desk, pocket/bag, outside), M: see list below. Simple slopes for each moderator x IV comparison were explored.

  • SAD Score: Smartphone Dependence, Emotional Attachment, Distractibilty
  • SUQ Score: Smartphone use with absence of other stimulation, Smartphone use during other actions
  • Demographic Info: Age

Each moderation analysis was completed using the MeMoBootR package’s moderation model 1 using a categorical IV (from Hayes, 2017; see figures below). This package provided: assumption tests, outliers, simple slopes, and a graph of the simple slopes. Since the IV was categorical (i.e., smartphone location: desk, pocket/bag, or outside), the variable is coded as a factor so that it is not misinterpreted as a continuous variable. This means that the IV is dummy coded and the analysis uses the first of the factors as the “comparison group”. In order to assess all possible comparions between the three location groups, the factor is re-ordered and run again. Additionally, to allow for better interpretation, the continuous moderator was mean-centered. The dummy coding and mean-centered scoring of the continuous moderator is done within the MeMoBootR package.


Conceptual diagram of a simple moderation model from Hayes (2017)




Statistical diagram of a simple moderation model with multicategorical independent variable with g categories from Hayes (2017)

2. Categorical Moderators

These variables were examined using a two-way ANOVA for each potential moderator: DV = OSpan Absolute Score, \(IV_{1}\): Smartphone Location (desk, pocket/bag, outside), \(IV_{M}\): see list below. Simple main effects for each moderator x IV comparison were explored.

  • SUQ Score: Texts per Day, Social Media Messages per Day, Social Media Posts per Day, Subjective Smartphone Value, Notification Type, Phantom Vibrations
  • Demographic Info: Gender, Program, Academic Year, First Language

Continuous Moderators

To examine the potential moderation of smartphone reliance (i.e. Smartphone Dependence, Emotional Attachment, and Distractibility) and age on the relationship between the experimental manipulation and task performance on the OSpan, a generalized linear model was used. Each potential moderator was evaluated separately, along with the relevant assumption tests (e.g., normality). The data was screened for outliers (i.e., cooks distance and leverage) and participants were removed from the analysis the criteria for both cooks and leverage was met.

Set up the Data

First, in order to provide all possible comparisons in the analyses, 2 new location variables were created in the mod_analysis data frame. Each was a factored version of the original smartphone location variable, where the order of the factors were either (1) desk > pocket/bag > outside (i.e., “location_desk”) or (2) outside > pocket/bag > desk (i.e., “location_out”). (click to see code)

# change location var as factor with different orders -- 2 new vars
mod_analysis <- 
  mod_analysis %>% 
  # "location_desk": desk vs. pocket/bag &  desk vs. outside
  mutate(location_desk = factor(location, levels = c(1, 2, 3), 
                                labels = c("desk", "pocketbag", "outside"))) %>% 
  # "location_out": outside vs. desk &  outside vs. pocket/bag
  mutate(location_out = factor(location, levels = c(3, 2, 1), 
                               labels = c("outside", "pocketbag", "desk")))
Run Analyses

Next, each moderation analysis was run to use the outputs (click to see code).

Run the moderation analyses for SAD variable(s): Smartphone Dependency, Emotional Attachment, and Distractibility.

#-- Run moderation analysis using moderation1() in the MeMoBootR package
  # note: moderator (m) and IV (x) are switched since moderator is continuous & IV is categorical in present study
    # was checked vs. manual lm() version & computes exact same values
  # note: analysis mean-centers the continuous var automatically
  # note: this analysis will compare: desk vs. pocket/bag&  desk vs. outside initially 
    # then, the data is re-organized to compare: pocket/bag vs. outside 
    # this was saved into 2 separate analyses, but most output is common b/w them

## SMARTPHONE DEPENDENCE ##
# run & save analysis to view components separately 
# for desk 1st
mod_SAD_dep1 <-  
  moderation1(y = "ospan_absolute_score", # DV
              x = "SAD_dep_score", # continuous var
              m = "location_desk", # categorical var -- will show simple slopes
              cvs = NULL, # covariates (if any)
              df = mod_analysis, # data frame of columns
              with_out = F) # remove outliers


# Can view the full data set, including the identified outliers (last columns)
# View(mod_SAD_dep1$datascreening$fulldata)

# for outside 1st
mod_SAD_dep2 <-  
  moderation1(y = "ospan_absolute_score", # DV
              x = "SAD_dep_score", # continuous var
              m = "location_out", #categorical var -- will show simple slopes
              cvs = NULL, # covariates (if any)
              df = mod_analysis, # data frame of columns
              with_out = F) # remove outliers


# Can view the full data set, including the identified outliers (last columns)
# View(mod_SAD_dep2$datascreening$fulldata)

## EMOTIONAL ATTACHMENT ##
# run & save analysis to view components separately 
# for desk 1st
mod_SAD_ea1 <-  
  moderation1(y = "ospan_absolute_score", # DV
              x = "SAD_ea_score", # continuous var
              m = "location_desk", # categorical var -- will show simple slopes
              cvs = NULL, # covariates (if any)
              df = mod_analysis, # data frame of columns
              with_out = F) # remove outliers


# Can view the full data set, including the identified outliers (last columns)
# View(mod_SAD_ea1$datascreening$fulldata)

# for outside 1st
mod_SAD_ea2 <-  
  moderation1(y = "ospan_absolute_score", # DV
              x = "SAD_ea_score", # continuous var
              m = "location_out", #categorical var -- will show simple slopes
              cvs = NULL, # covariates (if any)
              df = mod_analysis, # data frame of columns
              with_out = F) # remove outliers


# Can view the full data set, including the identified outliers (last columns)
# View(mod_SAD_ea2$datascreening$fulldata)

## ACCESSIBILITY ##
# run & save analysis to view components separately 
# for desk 1st
mod_SAD_access1 <-  
  moderation1(y = "ospan_absolute_score", # DV
              x = "SAD_access_score", # continuous var
              m = "location_desk", # categorical var -- will show simple slopes
              cvs = NULL, # covariates (if any)
              df = mod_analysis, # data frame of columns
              with_out = F) # remove outliers


# Can view the full data set, including the identified outliers (last columns)
# View(mod_SAD_access1$datascreening$fulldata)

# for outside 1st
mod_SAD_access2 <-  
  moderation1(y = "ospan_absolute_score", # DV
              x = "SAD_access_score", # continuous var
              m = "location_out", #categorical var -- will show simple slopes
              cvs = NULL, # covariates (if any)
              df = mod_analysis, # data frame of columns
              with_out = F) # remove outliers


# Can view the full data set, including the identified outliers (last columns)
# View(mod_SAD_access2$datascreening$fulldata)

## SMARTPHONE DISTRACTIBILITY ##
# run & save analysis to view components separately 
# for desk 1st
mod_SAD_dist1 <-  
  moderation1(y = "ospan_absolute_score", # DV
              x = "SAD_dist_score", # continuous var
              m = "location_desk", # categorical var -- will show simple slopes
              cvs = NULL, # covariates (if any)
              df = mod_analysis, # data frame of columns
              with_out = F) # remove outliers


# Can view the full data set, including the identified outliers (last columns)
# View(mod_SAD_dist1$datascreening$fulldata)

# for outside 1st
mod_SAD_dist2 <-  
  moderation1(y = "ospan_absolute_score", # DV
              x = "SAD_dist_score", # continuous var
              m = "location_out", #categorical var -- will show simple slopes
              cvs = NULL, # covariates (if any)
              df = mod_analysis, # data frame of columns
              with_out = F) # remove outliers


# Can view the full data set, including the identified outliers (last columns)
# View(mod_SAD_dist2$datascreening$fulldata)

Run the moderation analyses for SUQ variable(s): Age.

#-- Run moderation analysis using moderation1() in the MeMoBootR package
  # note: moderator (m) and IV (x) are switched since moderator is continuous & IV is categorical in present study
    # was checked vs. manual lm() version & computes exact same values
  # note: analysis mean-centers the continuous var automatically
  # note: this analysis will compare: desk vs. pocket/bag&  desk vs. outside initially 
    # then, the data is re-organized to compare: pocket/bag vs. outside 
    # this was saved into 2 separate analyses, but most output is common b/w them

## AGE ##
# run & save analysis to view components separately 
# for desk 1st
mod_SUQ_age1 <-  
  moderation1(y = "ospan_absolute_score", # DV
              x = "age", # continuous var
              m = "location_desk", # categorical var -- will show simple slopes
              cvs = NULL, # covariates (if any)
              df = mod_analysis, # data frame of columns
              with_out = F) # remove outliers


# Can view the full data set, including the identified outliers (last columns)
# View(mod_SUQ_age1$datascreening$fulldata)

# for outside 1st
mod_SUQ_age2 <-  
  moderation1(y = "ospan_absolute_score", # DV
              x = "age", # continuous var
              m = "location_out", #categorical var -- will show simple slopes
              cvs = NULL, # covariates (if any)
              df = mod_analysis, # data frame of columns
              with_out = F) # remove outliers


# Can view the full data set, including the identified outliers (last columns)
# View(mod_SUQ_age2$datascreening$fulldata)
Outliers

The model identified outliers based on leverage and cook’s distance and removed participants who were outside of the criteria for both measures.

  • Leverage

    • Tests for how much influence over the slope a single observation (i.e., participant) has: how much an individual score will change the b-values
    • Scores that fall outside of the “cut-off” criteria have a large effect on the slope independent of other score and are therefore considered outliers. These are defined with the equation \(\frac{2K+2}{N}\), where K is the number of IVs/predictors and N is the sample size.
  • Cook’s Distance

    • Tests for influential observations: the combination of leverage and discrepancy (i.e., how far a score is from other data points).
    • Scores that fall outside of the “cut-off” criteria have a large effect on the slope independent of other score and/or are far from the rest of the data and are therefore considered outliers. These are defined with the equation \(\frac{4}{N-K-1}\), where K is the number of IVs/predictors and N is the sample size.
  • Participants Removed

    • Any participant who fell outside of both the leverage and cook criteria were removed from each analysis. This was done separately for each analysis (see table below).
# create a table depicting the total outliers removed for each continuous moderation analysis
# details shown for 1st (dependency)
mod_cont_out_t <- 
  as.data.frame(cbind(c("Dependency", "Emotional Attachment", "Accessibility", "Distractibility", "Age"), 
                      rbind(
                      # for SAD_dep
                      c(sum(mod_SAD_dep1$datascreening$fulldata$badleverage), # total ps that fell outside of the leverage criteria
                        sum(mod_SAD_dep1$datascreening$fulldata$badcooks), # total ps that fell outside of the cooks criteria
                        sum(mod_SAD_dep1$datascreening$fulldata$totalout == 1), # total ps that fell outside of ONE outlier criteria (i.e., leverage OR cooks)
                        sum(mod_SAD_dep1$datascreening$fulldata$totalout == 2), # total ps that fell outside of BOTH outlier criteria (i.e., leverage AND cooks)
                        nrow(mod_SAD_dep1$datascreening$fulldata) - sum(mod_SAD_dep1$datascreening$fulldata$totalout == 2) # total sample size used in given analysis
                        ), 
                      # for SAD_ea
                      c(sum(mod_SAD_ea1$datascreening$fulldata$badleverage), 
                        sum(mod_SAD_ea1$datascreening$fulldata$badcooks), 
                        sum(mod_SAD_ea1$datascreening$fulldata$totalout == 1), 
                        sum(mod_SAD_ea1$datascreening$fulldata$totalout == 2), 
                        nrow(mod_SAD_ea1$datascreening$fulldata) - sum(mod_SAD_ea1$datascreening$fulldata$totalout == 2)
                        ),
                      # for SAD_access
                      c(sum(mod_SAD_access1$datascreening$fulldata$badleverage), 
                        sum(mod_SAD_access1$datascreening$fulldata$badcooks), 
                        sum(mod_SAD_access1$datascreening$fulldata$totalout == 1), 
                        sum(mod_SAD_access1$datascreening$fulldata$totalout == 2), 
                        nrow(mod_SAD_access1$datascreening$fulldata) - sum(mod_SAD_access1$datascreening$fulldata$totalout == 2)
                        ),
                      # for SAD_dist
                      c(sum(mod_SAD_dist1$datascreening$fulldata$badleverage), 
                        sum(mod_SAD_dist1$datascreening$fulldata$badcooks), 
                        sum(mod_SAD_dist1$datascreening$fulldata$totalout == 1), 
                        sum(mod_SAD_dist1$datascreening$fulldata$totalout == 2), 
                        nrow(mod_SAD_dist1$datascreening$fulldata) - sum(mod_SAD_dist1$datascreening$fulldata$totalout == 2)
                        ),
                      # for SUQ_age
                      c(sum(mod_SUQ_age1$datascreening$fulldata$badleverage), 
                        sum(mod_SUQ_age1$datascreening$fulldata$badcooks), 
                        sum(mod_SUQ_age1$datascreening$fulldata$totalout == 1), 
                        sum(mod_SUQ_age1$datascreening$fulldata$totalout == 2), 
                        nrow(mod_SUQ_age1$datascreening$fulldata) - sum(mod_SUQ_age1$datascreening$fulldata$totalout == 2)
                        )
                      )
                      )
                )

# rename columns
colnames(mod_cont_out_t) <- c(" ", "Leverage", "Cooks", "Both", "Removed", "N")

# use kable to show table
kable(mod_cont_out_t, caption = "Summary of identified and or removed outliers in GLM mderation analysis.", align = 'rccccc') %>%
  footnote(general = "Sample size prior to removing outlier data for all analyses was 383.") %>% 
  column_spec(1, bold = T) %>%
  add_header_above(c(" ", "Criteria" = 2, "Total Criteria" = 2, "")) %>% 
  kable_styling(bootstrap_options = "striped", "hover", full_width = F) %>%
  kable_classic()
Summary of identified and or removed outliers in GLM mderation analysis.
Criteria
Total Criteria
Leverage Cooks Both Removed N
Dependency 31 18 35 7 376
Emotional Attachment 27 18 37 4 379
Accessibility 21 16 25 6 377
Distractibility 25 17 26 8 375
Age 26 17 17 13 370
Note:
Sample size prior to removing outlier data for all analyses was 383.
Assumptions

The model completed assumption tests for: (1) additivity/multicollinearity, (2) linearity, (3) normality, (4) homogeneity and homoscedasticity. All assumptions were met for all moderators. (click to see details)

(1) Additivity / Multicollinearity

Additivity / Multicollinearity was evaluated for each moderator. Correlations between the moderator and each IV comparison was visually inspected to ensure the correlations were not too high or low (i.e., approximately ranging between .30 to .80). All moderators’ correlations were within an acceptable range.


Dependency Moderator: correlations ranged from 0.43 to 0.73.

# grab table from output
mod_SAD_dep_corr_t <- 
  as.data.frame(cbind(c("Dependency Score", "Desk vs. Pocket/Bag", "Desk vs. Outside", "Dep x DvPB", "Dep x DvO"), 
                      mod_SAD_dep1$datascreening$correl[2:6, 2:6],
                      c(6:10),
                      c("Dependency Score", "Outside vs. Pocket/Bag", "Outside vs. Desk", "Dep x OvPB", "Dep x OvD"),
                      mod_SAD_dep2$datascreening$correl[2:6, 2:6])
                )
# rename rows & columns
colnames(mod_SAD_dep_corr_t) <- c("var1", "Dependency Score", "Desk vs. Pocket/Bag", "Desk vs. Outside", "Dep x DvPB", "Dep x DvO",
                                  "row", "var2",
                             "Dependency Score", "Outside vs. Pocket/Bag", "Outside vs. Desk", "Dep x OvPB", "Dep x OvD")
rownames(mod_SAD_dep_corr_t) <- 1:nrow(mod_SAD_dep_corr_t)

# reset values as numeric
mod_SAD_dep_corr_t[,2] <- as.numeric(mod_SAD_dep_corr_t[,2])
mod_SAD_dep_corr_t[,3] <- as.numeric(mod_SAD_dep_corr_t[,3])
mod_SAD_dep_corr_t[,4] <- as.numeric(mod_SAD_dep_corr_t[,4])
mod_SAD_dep_corr_t[,5] <- as.numeric(mod_SAD_dep_corr_t[,5])
mod_SAD_dep_corr_t[,6] <- as.numeric(mod_SAD_dep_corr_t[,6])
mod_SAD_dep_corr_t[,9] <- as.numeric(mod_SAD_dep_corr_t[,9])
mod_SAD_dep_corr_t[,10] <- as.numeric(mod_SAD_dep_corr_t[,10])
mod_SAD_dep_corr_t[,11] <- as.numeric(mod_SAD_dep_corr_t[,11])
mod_SAD_dep_corr_t[,12] <- as.numeric(mod_SAD_dep_corr_t[,12])
mod_SAD_dep_corr_t[,13] <- as.numeric(mod_SAD_dep_corr_t[,13])

# Print table using kable
kable(mod_SAD_dep_corr_t, caption = "Correlations for the dependency moderator.", 
      digits = 2, align = 'rccccccrrcccc', row.names = T, col.names = c(" ", 1:5, "", "", 6:10)) %>%
  # footnote(general = "Interaction variables ") %>% 
  kable_styling(bootstrap_options = "striped", "hover", full_width = F) %>%
  add_header_above(c(" " = 2, "Moderator", "Location" = 2, "Interaction" = 2, " " = 2, "Moderator", "Location" = 2, "Interaction" = 2), bold = T) %>%
  add_header_above(c(" " = 2, "Comparison Group: \nDesk" = 5, " " = 2, "Comparison Group: \nOutside" = 5)) %>%
  column_spec(2, bold = T) %>%
  column_spec(9, bold = T) %>%
  column_spec(7, border_right = T) %>%
  kable_classic()
Correlations for the dependency moderator.
Comparison Group:
Desk
Comparison Group:
Outside
Moderator
Location
Interaction
Moderator
Location
Interaction
1 2 3 4 5 6 7 8 9 10
1 Dependency Score 1.00 0.62 0.60 -0.66 -0.65 6 Dependency Score 1.00 0.70 0.73 -0.72 -0.76
2 Desk vs. Pocket/Bag 0.62 1.00 0.43 -0.94 -0.41 7 Outside vs. Pocket/Bag 0.70 1.00 0.56 -0.95 -0.53
3 Desk vs. Outside 0.60 0.43 1.00 -0.40 -0.94 8 Outside vs. Desk 0.73 0.56 1.00 -0.52 -0.94
4 Dep x DvPB -0.66 -0.94 -0.40 1.00 0.43 9 Dep x OvPB -0.72 -0.95 -0.52 1.00 0.54
5 Dep x DvO -0.65 -0.41 -0.94 0.43 1.00 10 Dep x OvD -0.76 -0.53 -0.94 0.54 1.00


Emotional Attachment Moderator: correlations ranged from 0.51 to 0.72.

# grab table from output
mod_SAD_ea_corr_t <- 
  as.data.frame(cbind(c("Emotional Attachment Score", "Desk vs. Pocket/Bag", "Desk vs. Outside", "EA x DvPB", "EA x DvO"), 
                      mod_SAD_ea1$datascreening$correl[2:6, 2:6],
                      c(6:10),
                      c("Emotional Attachment Score", "Outside vs. Pocket/Bag", "Outside vs. Desk", "EA x OvPB", "EA x OvD"),
                      mod_SAD_ea2$datascreening$correl[2:6, 2:6])
                )
# rename rows & columns
colnames(mod_SAD_ea_corr_t) <- c("var1", "Emotional Attachment Score", "Desk vs. Pocket/Bag", "Desk vs. Outside", "EA x DvPB", "EA x DvO",
                                  "row", "var2",
                             "Emotional Attachment Score", "Outside vs. Pocket/Bag", "Outside vs. Desk", "EA x OvPB", "EA x OvD")
rownames(mod_SAD_ea_corr_t) <- 1:nrow(mod_SAD_ea_corr_t)

# reset values as numeric
mod_SAD_ea_corr_t[,2] <- as.numeric(mod_SAD_ea_corr_t[,2])
mod_SAD_ea_corr_t[,3] <- as.numeric(mod_SAD_ea_corr_t[,3])
mod_SAD_ea_corr_t[,4] <- as.numeric(mod_SAD_ea_corr_t[,4])
mod_SAD_ea_corr_t[,5] <- as.numeric(mod_SAD_ea_corr_t[,5])
mod_SAD_ea_corr_t[,6] <- as.numeric(mod_SAD_ea_corr_t[,6])
mod_SAD_ea_corr_t[,9] <- as.numeric(mod_SAD_ea_corr_t[,9])
mod_SAD_ea_corr_t[,10] <- as.numeric(mod_SAD_ea_corr_t[,10])
mod_SAD_ea_corr_t[,11] <- as.numeric(mod_SAD_ea_corr_t[,11])
mod_SAD_ea_corr_t[,12] <- as.numeric(mod_SAD_ea_corr_t[,12])
mod_SAD_ea_corr_t[,13] <- as.numeric(mod_SAD_ea_corr_t[,13])

# Print table using kable
kable(mod_SAD_ea_corr_t, caption = "Correlations for the emotional attachment moderator.", 
      digits = 2, align = 'rccccccrrcccc', row.names = T, col.names = c(" ", 1:5, "", "", 6:10)) %>%
  # footnote(general = "Interaction variables ") %>% 
  kable_styling(bootstrap_options = "striped", "hover", full_width = F) %>%
  add_header_above(c(" " = 2, "Moderator", "Location" = 2, "Interaction" = 2, " " = 2, "Moderator", "Location" = 2, "Interaction" = 2), bold = T) %>%
  add_header_above(c(" " = 2, "Comparison Group: \nDesk" = 5, " " = 2, "Comparison Group: \nOutside" = 5)) %>%
  column_spec(2, bold = T) %>%
  column_spec(9, bold = T) %>%
  column_spec(7, border_right = T) %>%
  kable_classic()
Correlations for the emotional attachment moderator.
Comparison Group:
Desk
Comparison Group:
Outside
Moderator
Location
Interaction
Moderator
Location
Interaction
1 2 3 4 5 6 7 8 9 10
1 Emotional Attachment Score 1.00 0.71 0.66 -0.73 -0.70 6 Emotional Attachment Score 1.00 0.72 0.70 -0.74 -0.71
2 Desk vs. Pocket/Bag 0.71 1.00 0.51 -0.96 -0.49 7 Outside vs. Pocket/Bag 0.72 1.00 0.54 -0.96 -0.52
3 Desk vs. Outside 0.66 0.51 1.00 -0.48 -0.96 8 Outside vs. Desk 0.70 0.54 1.00 -0.51 -0.96
4 EA x DvPB -0.73 -0.96 -0.48 1.00 0.51 9 EA x OvPB -0.74 -0.96 -0.51 1.00 0.53
5 EA x DvO -0.70 -0.49 -0.96 0.51 1.00 10 EA x OvD -0.71 -0.52 -0.96 0.53 1.00


Accessibility Moderator: correlations ranged from 0.46 to 0.73.

# grab table from output
mod_SAD_access_corr_t <- 
  as.data.frame(cbind(c("Emotional Attachment Score", "Desk vs. Pocket/Bag", "Desk vs. Outside", "Access x DvPB", "Access x DvO"), 
                      mod_SAD_access1$datascreening$correl[2:6, 2:6],
                      c(6:10),
                      c("Emotional Attachment Score", "Outside vs. Pocket/Bag", "Outside vs. Desk", "Access x OvPB", "Access x OvD"),
                      mod_SAD_access2$datascreening$correl[2:6, 2:6])
                )
# rename rows & columns
colnames(mod_SAD_access_corr_t) <- c("var1", "Emotional Attachment Score", "Desk vs. Pocket/Bag", "Desk vs. Outside", "Access x DvPB", "Access x DvO",
                                  "row", "var2",
                             "Emotional Attachment Score", "Outside vs. Pocket/Bag", "Outside vs. Desk", "Access x OvPB", "Access x OvD")
rownames(mod_SAD_access_corr_t) <- 1:nrow(mod_SAD_access_corr_t)

# reset values as numeric
mod_SAD_access_corr_t[,2] <- as.numeric(mod_SAD_access_corr_t[,2])
mod_SAD_access_corr_t[,3] <- as.numeric(mod_SAD_access_corr_t[,3])
mod_SAD_access_corr_t[,4] <- as.numeric(mod_SAD_access_corr_t[,4])
mod_SAD_access_corr_t[,5] <- as.numeric(mod_SAD_access_corr_t[,5])
mod_SAD_access_corr_t[,6] <- as.numeric(mod_SAD_access_corr_t[,6])
mod_SAD_access_corr_t[,9] <- as.numeric(mod_SAD_access_corr_t[,9])
mod_SAD_access_corr_t[,10] <- as.numeric(mod_SAD_access_corr_t[,10])
mod_SAD_access_corr_t[,11] <- as.numeric(mod_SAD_access_corr_t[,11])
mod_SAD_access_corr_t[,12] <- as.numeric(mod_SAD_access_corr_t[,12])
mod_SAD_access_corr_t[,13] <- as.numeric(mod_SAD_access_corr_t[,13])

# Print table using kable
kable(mod_SAD_access_corr_t, caption = "Correlations for the accessibility moderator.", 
      digits = 2, align = 'rccccccrrcccc', row.names = T, col.names = c(" ", 1:5, "", "", 6:10)) %>%
  # footnote(general = "Interaction variables ") %>% 
  kable_styling(bootstrap_options = "striped", "hover", full_width = F) %>%
  add_header_above(c(" " = 2, "Moderator", "Location" = 2, "Interaction" = 2, " " = 2, "Moderator", "Location" = 2, "Interaction" = 2), bold = T) %>%
  add_header_above(c(" " = 2, "Comparison Group: \nDesk" = 5, " " = 2, "Comparison Group: \nOutside" = 5)) %>%
  column_spec(2, bold = T) %>%
  column_spec(9, bold = T) %>%
  column_spec(7, border_right = T) %>%
  kable_classic()
Correlations for the accessibility moderator.
Comparison Group:
Desk
Comparison Group:
Outside
Moderator
Location
Interaction
Moderator
Location
Interaction
1 2 3 4 5 6 7 8 9 10
1 Emotional Attachment Score 1.00 0.67 0.64 -0.68 -0.67 6 Emotional Attachment Score 1.00 0.72 0.73 -0.72 -0.74
2 Desk vs. Pocket/Bag 0.67 1.00 0.46 -0.97 -0.45 7 Outside vs. Pocket/Bag 0.72 1.00 0.55 -0.97 -0.53
3 Desk vs. Outside 0.64 0.46 1.00 -0.44 -0.97 8 Outside vs. Desk 0.73 0.55 1.00 -0.53 -0.97
4 Access x DvPB -0.68 -0.97 -0.44 1.00 0.46 9 Access x OvPB -0.72 -0.97 -0.53 1.00 0.54
5 Access x DvO -0.67 -0.45 -0.97 0.46 1.00 10 Access x OvD -0.74 -0.53 -0.97 0.54 1.00


Distractibility Moderator: correlations ranged from 0.45 to 0.7.

# grab table from output
mod_SAD_dist_corr_t <- 
  as.data.frame(cbind(c("Distractibility Score", "Desk vs. Pocket/Bag", "Desk vs. Outside", "Dist x DvPB", "Dist x DvO"), 
                      mod_SAD_dist1$datascreening$correl[2:6, 2:6],
                      c(6:10),
                      c("Distractibility Score", "Outside vs. Pocket/Bag", "Outside vs. Desk", "Dist x OvPB", "Dist x OvD"),
                      mod_SAD_dist2$datascreening$correl[2:6, 2:6])
                )
# rename rows & columns
colnames(mod_SAD_dist_corr_t) <- c("var1", "Distractibility Score", "Desk vs. Pocket/Bag", "Desk vs. Outside", "Dist x DvPB", "Dist x DvO",
                                  "row", "var2",
                             "Distractibility Score", "Outside vs. Pocket/Bag", "Outside vs. Desk", "Dist x OvPB", "Dist x OvD")
rownames(mod_SAD_dist_corr_t) <- 1:nrow(mod_SAD_dist_corr_t)

# reset values as numeric
mod_SAD_dist_corr_t[,2] <- as.numeric(mod_SAD_dist_corr_t[,2])
mod_SAD_dist_corr_t[,3] <- as.numeric(mod_SAD_dist_corr_t[,3])
mod_SAD_dist_corr_t[,4] <- as.numeric(mod_SAD_dist_corr_t[,4])
mod_SAD_dist_corr_t[,5] <- as.numeric(mod_SAD_dist_corr_t[,5])
mod_SAD_dist_corr_t[,6] <- as.numeric(mod_SAD_dist_corr_t[,6])
mod_SAD_dist_corr_t[,9] <- as.numeric(mod_SAD_dist_corr_t[,9])
mod_SAD_dist_corr_t[,10] <- as.numeric(mod_SAD_dist_corr_t[,10])
mod_SAD_dist_corr_t[,11] <- as.numeric(mod_SAD_dist_corr_t[,11])
mod_SAD_dist_corr_t[,12] <- as.numeric(mod_SAD_dist_corr_t[,12])
mod_SAD_dist_corr_t[,13] <- as.numeric(mod_SAD_dist_corr_t[,13])

# Print table using kable
kable(mod_SAD_dist_corr_t, caption = "Correlations for the distractibility moderator.", 
      digits = 2, align = 'rccccccrrcccc', row.names = T, col.names = c(" ", 1:5, "", "", 6:10)) %>%
  # footnote(general = "Interaction variables ") %>% 
  kable_styling(bootstrap_options = "striped", "hover", full_width = F) %>%
  add_header_above(c(" " = 2, "Moderator", "Location" = 2, "Interaction" = 2, " " = 2, "Moderator", "Location" = 2, "Interaction" = 2), bold = T) %>%
  add_header_above(c(" " = 2, "Comparison Group: \nDesk" = 5, " " = 2, "Comparison Group: \nOutside" = 5)) %>%
  column_spec(2, bold = T) %>%
  column_spec(9, bold = T) %>%
  column_spec(7, border_right = T) %>%
  kable_classic()
Correlations for the distractibility moderator.
Comparison Group:
Desk
Comparison Group:
Outside
Moderator
Location
Interaction
Moderator
Location
Interaction
1 2 3 4 5 6 7 8 9 10
1 Distractibility Score 1.00 0.66 0.70 -0.67 -0.71 6 Distractibility Score 1.00 0.64 0.66 -0.67 -0.70
2 Desk vs. Pocket/Bag 0.66 1.00 0.49 -0.97 -0.47 7 Outside vs. Pocket/Bag 0.64 1.00 0.45 -0.97 -0.45
3 Desk vs. Outside 0.70 0.49 1.00 -0.47 -0.97 8 Outside vs. Desk 0.66 0.45 1.00 -0.44 -0.97
4 Dist x DvPB -0.67 -0.97 -0.47 1.00 0.48 9 Dist x OvPB -0.67 -0.97 -0.44 1.00 0.47
5 Dist x DvO -0.71 -0.47 -0.97 0.48 1.00 10 Dist x OvD -0.70 -0.45 -0.97 0.47 1.00


Age Moderator: correlations ranged from 0.47 to 0.77.

# grab table from output
mod_SUQ_age_corr_t <- 
  as.data.frame(cbind(c("Age", "Desk vs. Pocket/Bag", "Desk vs. Outside", "Age x DvPB", "Age x DvO"), 
                      mod_SUQ_age1$datascreening$correl[2:6, 2:6],
                      c(6:10),
                      c("Age", "Outside vs. Pocket/Bag", "Outside vs. Desk", "Age x OvPB", "Age x OvD"),
                      mod_SUQ_age2$datascreening$correl[2:6, 2:6])
                )
# rename rows & columns
colnames(mod_SUQ_age_corr_t) <- c("var1", "Age", "Desk vs. Pocket/Bag", "Desk vs. Outside", "Age x DvPB", "Age x DvO",
                                   "row", "var2",
                                   "Age", "Outside vs. Pocket/Bag", "Outside vs. Desk", "Age x OvPB", "Age x OvD")
rownames(mod_SUQ_age_corr_t) <- 1:nrow(mod_SUQ_age_corr_t)

# reset values as numeric
mod_SUQ_age_corr_t[,2] <- as.numeric(mod_SUQ_age_corr_t[,2])
mod_SUQ_age_corr_t[,3] <- as.numeric(mod_SUQ_age_corr_t[,3])
mod_SUQ_age_corr_t[,4] <- as.numeric(mod_SUQ_age_corr_t[,4])
mod_SUQ_age_corr_t[,5] <- as.numeric(mod_SUQ_age_corr_t[,5])
mod_SUQ_age_corr_t[,6] <- as.numeric(mod_SUQ_age_corr_t[,6])
mod_SUQ_age_corr_t[,9] <- as.numeric(mod_SUQ_age_corr_t[,9])
mod_SUQ_age_corr_t[,10] <- as.numeric(mod_SUQ_age_corr_t[,10])
mod_SUQ_age_corr_t[,11] <- as.numeric(mod_SUQ_age_corr_t[,11])
mod_SUQ_age_corr_t[,12] <- as.numeric(mod_SUQ_age_corr_t[,12])
mod_SUQ_age_corr_t[,13] <- as.numeric(mod_SUQ_age_corr_t[,13])

# Print table using kable
kable(mod_SUQ_age_corr_t, caption = "Correlations for the age moderator.", 
      digits = 2, align = 'rccccccrrcccc', row.names = T, col.names = c(" ", 1:5, "", "", 6:10)) %>%
  # footnote(general = "Interaction variables ") %>% 
  kable_styling(bootstrap_options = "striped", "hover", full_width = F) %>%
  add_header_above(c(" " = 2, "Moderator", "Location" = 2, "Interaction" = 2, " " = 2, "Moderator", "Location" = 2, "Interaction" = 2), bold = T) %>%
  add_header_above(c(" " = 2, "Comparison Group: \nDesk" = 5, " " = 2, "Comparison Group: \nOutside" = 5)) %>%
  column_spec(2, bold = T) %>%
  column_spec(9, bold = T) %>%
  column_spec(7, border_right = T) %>%
  kable_classic()
Correlations for the age moderator.
Comparison Group:
Desk
Comparison Group:
Outside
Moderator
Location
Interaction
Moderator
Location
Interaction
1 2 3 4 5 6 7 8 9 10
1 Age 1.00 0.77 0.76 -0.78 -0.76 6 Age 1.00 0.72 0.65 -0.72 -0.65
2 Desk vs. Pocket/Bag 0.77 1.00 0.59 -1.00 -0.59 7 Outside vs. Pocket/Bag 0.72 1.00 0.47 -1.00 -0.47
3 Desk vs. Outside 0.76 0.59 1.00 -0.59 -1.00 8 Outside vs. Desk 0.65 0.47 1.00 -0.47 -1.00
4 Age x DvPB -0.78 -1.00 -0.59 1.00 0.59 9 Age x OvPB -0.72 -1.00 -0.47 1.00 0.47
5 Age x DvO -0.76 -0.59 -1.00 0.59 1.00 10 Age x OvD -0.65 -0.47 -1.00 0.47 1.00
NA
(2) Linearity

Linearity was evaluated for each moderator using Q-Q plots. For all moderators, the data fell along or close to the line. Therefore, linearity was met for all moderators.

# note: this is the same across the 2 IV orders (i.e., desk or outside first)
 # each moderator's q-q plot can also be found independently using: "mod_SAD_dep1$datascreening$linearity"
grid.arrange(ggqqplot(mod_SAD_dep1$model1$residuals, title = "Q-Q Plot for \nDependence", 
                      shape = 1, ggtheme = theme_classic()),
             ggqqplot(mod_SAD_ea1$model1$residuals, title = "Q-Q Plot for \nEmotional Attachment", 
                      shape = 1, ggtheme = theme_classic()),
             ggqqplot(mod_SAD_access1$model1$residuals, title = "Q-Q Plot for \nAccessibility", 
                      shape = 1, ggtheme = theme_classic()),
             ggqqplot(mod_SAD_dist1$model1$residuals, title = "Q-Q Plot for \nDistractibility", 
                      shape = 1, ggtheme = theme_classic()),
             ggqqplot(mod_SUQ_age1$model1$residuals, title = "Q-Q Plot for Age", 
                      shape = 1, ggtheme = theme_classic()),
             # labels = c("A", "B", "C", "D"),
             nrow = 2,
             top = text_grob("Q-Q Plots For Continuous Moderators", face = "bold")
             )

(3) Normality

Normality was evaluated for each moderator by plotting a histogram of residuals. For all moderators, the data fell along or close to a normal curve. Therefore, normality was met for all moderators.

# note: this is the same across the 2 IV orders (i.e., desk or outside first)
 # each moderator's q-q plot can also be found independently using: "mod_SAD_dep1$datascreening$normality"
grid.arrange(qplot(mod_SAD_dep1$model1$residuals, main = "Dependency", xlab = "Residuals", ylab = "Frequency", binwidth = 5),
             qplot(mod_SAD_ea1$model1$residuals, main = "Emotional Attachment", xlab = "Residuals", ylab = "Frequency", binwidth = 5), 
             qplot(mod_SAD_access1$model1$residuals, main = "Accessibility", xlab = "Residuals", ylab = "Frequency", binwidth = 5), 
             qplot(mod_SAD_dist1$model1$residuals, main = "Distractibility", xlab = "Residuals", ylab = "Frequency", binwidth = 5), 
             qplot(mod_SUQ_age1$model1$residuals, main = "Age", xlab = "Residuals", ylab = "Frequency", binwidth = 5),
             # labels = c("A", "B", "C", "D"),
             nrow = 2,
             top = text_grob("Histogram of Residuals For Continuous Moderators", face = "bold")
             )

(4) Homogeneity and Homoscedasticity

Homogeneity and homoscedasticity were evaluated for each moderator by plotting a scatter plot of the fitted values vs. residuals. For all moderators, the line of best fit showed an overall linear trend. Except for age, all moderators showed a good distribution throughout the plot. This was not surprising since reported age was whole-numbers only and the sample size did not cover a large age distribution. Therefore, homogeneity and homoscedasticity were met for all moderators.

# note: this is the same across the 2 IV orders (i.e., desk or outside first)
 # each moderator's q-q plot can also be found independently using: "mod_SAD_dep1$datascreening$linearity"
grid.arrange(qplot(x = mod_SAD_dep1$model1$fitted.values, y = mod_SAD_dep1$model1$residuals, 
                   main = "Dependency", xlab = "Fitted", ylab = "Residuals", geom = "point") + geom_smooth(se = F) + theme_classic(),
             qplot(x = mod_SAD_ea1$model1$fitted.values, y = mod_SAD_ea1$model1$residuals, 
                   main = "Emotional Attachment", xlab = "Fitted", ylab = "Residuals", geom = "point") + geom_smooth(se = F) + theme_classic(), 
             qplot(x = mod_SAD_access1$model1$fitted.values, y = mod_SAD_access1$model1$residuals, 
                   main = "Accessibility", xlab = "Fitted", ylab = "Residuals", geom = "point") + geom_smooth(se = F) + theme_classic(), 
             qplot(x = mod_SAD_dist1$model1$fitted.values, y = mod_SAD_dist1$model1$residuals, 
                   main = "Distractibility", xlab = "Fitted", ylab = "Residuals", geom = "point") + geom_smooth(se = F) + theme_classic(), 
             qplot(x = mod_SUQ_age1$model1$fitted.values, y = mod_SUQ_age1$model1$residuals, 
                   main = "Age", xlab = "Fitted", ylab = "Residuals", geom = "point") + geom_smooth(se = F) + theme_classic(),
             # labels = c("A", "B", "C", "D"),
             nrow = 2,
             top = text_grob("Residual-Fitted Plot For Continuous Moderators", face = "bold")
             )
`geom_smooth()` using method = 'loess' and formula 'y ~ x'
`geom_smooth()` using method = 'loess' and formula 'y ~ x'
`geom_smooth()` using method = 'loess' and formula 'y ~ x'
`geom_smooth()` using method = 'loess' and formula 'y ~ x'
`geom_smooth()` using method = 'loess' and formula 'y ~ x'

Overall Model

Now that assumptions were checked and met, each model was reviewed. For each moderator’s model showed: b-values, se, t, p-value, over all model fit (i.e., f-value with p-value), and \(R^{2}\).

Dependence

For dependency the moderator, IV, and interactions were not significant.

# create data frame with model values -- all values rounded to 5 decimals
  # note: you can see non-formatted model summary using "summary(mod_SAD_dep1$model1)" or "summary(mod_SAD_dep2$model1)"
mod_SAD_dep_t <- 
  # add b estimate, se, t, p
  rbind(data.frame("var" = c("Dependence", "Desk vs. Pocket/Bag", "Desk vs. Outside", "Outside vs. Pocket/Bag", 
                                   "Dep x DvPB", "Dep x DvO", "Dep x OvPB"), 
                   "b" = round(c(mod_SAD_dep1$model1$coefficients[2:4], mod_SAD_dep2$model1$coefficients[3], mod_SAD_dep1$model1$coefficients[5:6],
                                 mod_SAD_dep2$model1$coefficients[5]), 5), 
                   "se" = round(c(summary(mod_SAD_dep1$model1)$coefficients[2:4,2], summary(mod_SAD_dep2$model1)$coefficients[3,2], 
                                  summary(mod_SAD_dep1$model1)$coefficients[5:6,2], summary(mod_SAD_dep2$model1)$coefficients[5,2]), 5), 
                   "t" = round(c(summary(mod_SAD_dep1$model1)$coefficients[2:4,3], summary(mod_SAD_dep2$model1)$coefficients[3,3], 
                                 summary(mod_SAD_dep1$model1)$coefficients[5:6,3], summary(mod_SAD_dep2$model1)$coefficients[5,3]), 5),
                   "p" = round(c(summary(mod_SAD_dep1$model1)$coefficients[2:4,4], summary(mod_SAD_dep2$model1)$coefficients[3,4], 
                                 summary(mod_SAD_dep1$model1)$coefficients[5:6,4], summary(mod_SAD_dep2$model1)$coefficients[5,4]), 5)
                         ), 
        # add labels for: F, df1, df2, p-value, and Rsq values
        c(" ", "df1", "df2", "F", "p"), 
        # add F, df1, df2, p-value, and Rsq values
        c("Overall Model Fit", round(c(summary(mod_SAD_dep1$model1)$fstatistic[2], summary(mod_SAD_dep1$model1)$fstatistic[3], summary(mod_SAD_dep1$model1)$fstatistic[1], 
                pf(summary(mod_SAD_dep1$model1)$fstatistic[1], summary(mod_SAD_dep1$model1)$fstatistic[2], summary(mod_SAD_dep1$model1)$fstatistic[3],lower.tail=FALSE)), 5)), 
        # add R squared value
        c("R-squared", round(summary(mod_SAD_dep1$model1)$r.squared, 5), rep(c(""), 3))
        )

# rename rows
rownames(mod_SAD_dep_t) <- c(1:7, " ", 8:9)

# Print table using kable
kable(mod_SAD_dep_t, caption = "Summary of moderation analysis with IV: Smartphone Location and M: Dependence.", align = 'rcccc', row.names = T, col.names = c(" ", "b", "SE", "t", "p")) %>%
  # footnote(general = "Interaction variables ") %>%
  pack_rows("Moderator", 1, 1) %>%
  pack_rows("IV: Smartphone Location", 2, 4) %>%
  pack_rows("Interactions: Moderator x IV", 5, 7) %>%
  pack_rows(" ", 8, 10, indent = F) %>%
  pack_rows("Model", 8, 10) %>% 
  row_spec(0, bold = T, italic = T) %>%
  row_spec(8, bold = T, italic = T) %>%
  column_spec(2, bold = T) %>%
  kable_styling(bootstrap_options = "striped", "hover", full_width = F) %>%
  kable_classic()
Summary of moderation analysis with IV: Smartphone Location and M: Dependence.
b SE t p
Moderator
1 Dependence 0.30817 0.29519 1.04398 0.29718
IV: Smartphone Location
2 Desk vs. Pocket/Bag 0.7494 2.06902 0.3622 0.71741
3 Desk vs. Outside 1.05897 2.11485 0.50073 0.61686
4 Outside vs. Pocket/Bag -0.30957 2.10022 -0.1474 0.8829
Interactions: Moderator x IV
5 Dep x DvPB -0.40457 0.4453 -0.90852 0.36419
6 Dep x DvO -0.2315 0.45315 -0.51085 0.60976
7 Dep x OvPB -0.17307 0.47893 -0.36137 0.71803
Model
df1 df2 F p
8 Overall Model Fit 5 370 0.3079 0.90811
9 R-squared 0.00414
Emotional Attachment

For emotional attachment the moderator was significant, xxxxx. However, the IV, and interactions were not significant.

# create data frame with model values -- all values rounded to 5 decimals
  # note: you can see non-formatted model summary using "summary(mod_SAD_ea1$model1)" or "summary(mod_SAD_ea2$model1)"
mod_SAD_ea_t <- 
  # add b estimate, se, t, p
  rbind(data.frame("var" = c("Emotional Attachment", "Desk vs. Pocket/Bag", "Desk vs. Outside", "Outside vs. Pocket/Bag", 
                                   "EA x DvPB", "EA x DvO", "EA x OvPB"), 
                   "b" = round(c(mod_SAD_ea1$model1$coefficients[2:4], mod_SAD_ea2$model1$coefficients[3], mod_SAD_ea1$model1$coefficients[5:6],
                                 mod_SAD_ea2$model1$coefficients[5]), 5), 
                   "se" = round(c(summary(mod_SAD_ea1$model1)$coefficients[2:4,2], summary(mod_SAD_ea2$model1)$coefficients[3,2], 
                                  summary(mod_SAD_ea1$model1)$coefficients[5:6,2], summary(mod_SAD_ea2$model1)$coefficients[5,2]), 5), 
                   "t" = round(c(summary(mod_SAD_ea1$model1)$coefficients[2:4,3], summary(mod_SAD_ea2$model1)$coefficients[3,3], 
                                 summary(mod_SAD_ea1$model1)$coefficients[5:6,3], summary(mod_SAD_ea2$model1)$coefficients[5,3]), 5),
                   "p" = round(c(summary(mod_SAD_ea1$model1)$coefficients[2:4,4], summary(mod_SAD_ea2$model1)$coefficients[3,4], 
                                 summary(mod_SAD_ea1$model1)$coefficients[5:6,4], summary(mod_SAD_ea2$model1)$coefficients[5,4]), 5)
                         ), 
        # add labels for: F, df1, df2, p-value, and Rsq values
        c(" ", "df1", "df2", "F", "p"), 
        # add F, df1, df2, p-value, and Rsq values
        c("Overall Model Fit", round(c(summary(mod_SAD_ea1$model1)$fstatistic[2], summary(mod_SAD_ea1$model1)$fstatistic[3], summary(mod_SAD_ea1$model1)$fstatistic[1], 
                pf(summary(mod_SAD_ea1$model1)$fstatistic[1], summary(mod_SAD_ea1$model1)$fstatistic[2], summary(mod_SAD_ea1$model1)$fstatistic[3],lower.tail=FALSE)), 5)), 
        # add R squared value
        c("R-squared", round(summary(mod_SAD_ea1$model1)$r.squared, 5), rep(c(""), 3))
        )

# rename rows
rownames(mod_SAD_ea_t) <- c(1:7, " ", 8:9)

# Print table using kable
kable(mod_SAD_ea_t, caption = "Summary of moderation analysis with IV: Smartphone Location and M: Emotional Attachment.", align = 'rcccc', row.names = T, col.names = c(" ", "b", "SE", "t", "p")) %>%
  # footnote(general = "Interaction variables ") %>%
  pack_rows("Moderator", 1, 1) %>%
  pack_rows("IV: Smartphone Location", 2, 4) %>%
  pack_rows("Interactions: Moderator x IV", 5, 7) %>%
  pack_rows(" ", 8, 10, indent = F) %>%
  pack_rows("Model", 8, 10) %>% 
  row_spec(0, bold = T, italic = T) %>%
  row_spec(8, bold = T, italic = T) %>%
  column_spec(2, bold = T) %>%
  kable_styling(bootstrap_options = "striped", "hover", full_width = F) %>%
  kable_classic()
Summary of moderation analysis with IV: Smartphone Location and M: Emotional Attachment.
b SE t p
Moderator
1 Emotional Attachment -0.62853 0.31548 -1.9923 0.04707
IV: Smartphone Location
2 Desk vs. Pocket/Bag 0.79885 2.08812 0.38257 0.70226
3 Desk vs. Outside 0.26949 2.13268 0.12636 0.89951
4 Outside vs. Pocket/Bag 0.52936 2.12 0.2497 0.80296
Interactions: Moderator x IV
5 EA x DvPB 0.31867 0.4328 0.73629 0.46201
6 EA x DvO 0.60612 0.45097 1.34403 0.17975
7 EA x OvPB -0.28745 0.43776 -0.65663 0.51182
Model
df1 df2 F p
8 Overall Model Fit 5 373 1.05259 0.38647
9 R-squared 0.01391
Accessibility

For accessibility the moderator, IV, and interactions were not significant.

# create data frame with model values -- all values rounded to 5 decimals
  # note: you can see non-formatted model summary using "summary(mod_SAD_access1$model1)" or "summary(mod_SAD_access2$model1)"
mod_SAD_access_t <- 
  # add b estimate, se, t, p
  rbind(data.frame("var" = c("Accessibility", "Desk vs. Pocket/Bag", "Desk vs. Outside", "Outside vs. Pocket/Bag", 
                                   "Access x DvPB", "Access x DvO", "Access x OvPB"), 
                   "b" = round(c(mod_SAD_access1$model1$coefficients[2:4], mod_SAD_access2$model1$coefficients[3], mod_SAD_access1$model1$coefficients[5:6],
                                 mod_SAD_access2$model1$coefficients[5]), 5), 
                   "se" = round(c(summary(mod_SAD_access1$model1)$coefficients[2:4,2], summary(mod_SAD_access2$model1)$coefficients[3,2], 
                                  summary(mod_SAD_access1$model1)$coefficients[5:6,2], summary(mod_SAD_access2$model1)$coefficients[5,2]), 5), 
                   "t" = round(c(summary(mod_SAD_access1$model1)$coefficients[2:4,3], summary(mod_SAD_access2$model1)$coefficients[3,3], 
                                 summary(mod_SAD_access1$model1)$coefficients[5:6,3], summary(mod_SAD_access2$model1)$coefficients[5,3]), 5),
                   "p" = round(c(summary(mod_SAD_access1$model1)$coefficients[2:4,4], summary(mod_SAD_access2$model1)$coefficients[3,4], 
                                 summary(mod_SAD_access1$model1)$coefficients[5:6,4], summary(mod_SAD_access2$model1)$coefficients[5,4]), 5)
                         ), 
        # add labels for: F, df1, df2, p-value, and Rsq values
        c(" ", "df1", "df2", "F", "p"), 
        # add F, df1, df2, p-value, and Rsq values
        c("Overall Model Fit", round(c(summary(mod_SAD_access1$model1)$fstatistic[2], summary(mod_SAD_access1$model1)$fstatistic[3], summary(mod_SAD_access1$model1)$fstatistic[1], 
                pf(summary(mod_SAD_access1$model1)$fstatistic[1], summary(mod_SAD_access1$model1)$fstatistic[2], summary(mod_SAD_access1$model1)$fstatistic[3],lower.tail=FALSE)), 5)), 
        # add R squared value
        c("R-squared", round(summary(mod_SAD_access1$model1)$r.squared, 5), rep(c(""), 3))
        )

# rename rows
rownames(mod_SAD_access_t) <- c(1:7, " ", 8:9)

# Print table using kable
kable(mod_SAD_access_t, caption = "Summary of moderation analysis with IV: Smartphone Location and M: Accessibility", align = 'rcccc', row.names = T, col.names = c(" ", "b", "SE", "t", "p")) %>%
  # footnote(general = "Interaction variables ") %>%
  pack_rows("Moderator", 1, 1) %>%
  pack_rows("IV: Smartphone Location", 2, 4) %>%
  pack_rows("Interactions: Moderator x IV", 5, 7) %>%
  pack_rows(" ", 8, 10, indent = F) %>%
  pack_rows("Model", 8, 10) %>% 
  row_spec(0, bold = T, italic = T) %>%
  row_spec(8, bold = T, italic = T) %>%
  column_spec(2, bold = T) %>%
  kable_styling(bootstrap_options = "striped", "hover", full_width = F) %>%
  kable_classic()
Summary of moderation analysis with IV: Smartphone Location and M: Accessibility
b SE t p
Moderator
1 Accessibility -0.31986 0.39716 -0.80538 0.42112
IV: Smartphone Location
2 Desk vs. Pocket/Bag 1.00154 2.09887 0.47718 0.63351
3 Desk vs. Outside 0.51756 2.14074 0.24177 0.80909
4 Outside vs. Pocket/Bag 0.48398 2.11938 0.22836 0.81949
Interactions: Moderator x IV
5 Access x DvPB 0.61708 0.58046 1.06308 0.28844
6 Access x DvO 0.28644 0.59532 0.48115 0.63069
7 Access x OvPB 0.33064 0.61309 0.5393 0.59001
Model
df1 df2 F p
8 Overall Model Fit 5 371 0.26925 0.92978
9 R-squared 0.00362
Distractibility

For distractibility the moderator, IV, and interactions were not significant.

# create data frame with model values -- all values rounded to 5 decimals
  # note: you can see non-formatted model summary using "summary(mod_SAD_dist1$model1)" or "summary(mod_SAD_dist2$model1)"
mod_SAD_dist_t <- 
  # add b estimate, se, t, p
  rbind(data.frame("var" = c("Distractibility", "Desk vs. Pocket/Bag", "Desk vs. Outside", "Outside vs. Pocket/Bag", 
                                   "Dist x DvPB", "Dist x DvO", "Dist x OvPB"), 
                   "b" = round(c(mod_SAD_dist1$model1$coefficients[2:4], mod_SAD_dist2$model1$coefficients[3], mod_SAD_dist1$model1$coefficients[5:6],
                                 mod_SAD_dist2$model1$coefficients[5]), 5), 
                   "se" = round(c(summary(mod_SAD_dist1$model1)$coefficients[2:4,2], summary(mod_SAD_dist2$model1)$coefficients[3,2], 
                                  summary(mod_SAD_dist1$model1)$coefficients[5:6,2], summary(mod_SAD_dist2$model1)$coefficients[5,2]), 5), 
                   "t" = round(c(summary(mod_SAD_dist1$model1)$coefficients[2:4,3], summary(mod_SAD_dist2$model1)$coefficients[3,3], 
                                 summary(mod_SAD_dist1$model1)$coefficients[5:6,3], summary(mod_SAD_dist2$model1)$coefficients[5,3]), 5),
                   "p" = round(c(summary(mod_SAD_dist1$model1)$coefficients[2:4,4], summary(mod_SAD_dist2$model1)$coefficients[3,4], 
                                 summary(mod_SAD_dist1$model1)$coefficients[5:6,4], summary(mod_SAD_dist2$model1)$coefficients[5,4]), 5)
                         ), 
        # add labels for: F, df1, df2, p-value, and Rsq values
        c(" ", "df1", "df2", "F", "p"), 
        # add F, df1, df2, p-value, and Rsq values
        c("Overall Model Fit", round(c(summary(mod_SAD_dist1$model1)$fstatistic[2], summary(mod_SAD_dist1$model1)$fstatistic[3], summary(mod_SAD_dist1$model1)$fstatistic[1], 
                pf(summary(mod_SAD_dist1$model1)$fstatistic[1], summary(mod_SAD_dist1$model1)$fstatistic[2], summary(mod_SAD_dist1$model1)$fstatistic[3],lower.tail=FALSE)), 5)), 
        # add R squared value
        c("R-squared", round(summary(mod_SAD_dist1$model1)$r.squared, 5), rep(c(""), 3))
        )

# rename rows
rownames(mod_SAD_dist_t) <- c(1:7, " ", 8:9)

# Print table using kable
kable(mod_SAD_dist_t, caption = "Summary of moderation analysis with IV: Smartphone Location and M: Distractibility.", align = 'rcccc', row.names = T, col.names = c(" ", "b", "SE", "t", "p")) %>%
  # footnote(general = "Interaction variables ") %>%
  pack_rows("Moderator", 1, 1) %>%
  pack_rows("IV: Smartphone Location", 2, 4) %>%
  pack_rows("Interactions: Moderator x IV", 5, 7) %>%
  pack_rows(" ", 8, 10, indent = F) %>%
  pack_rows("Model", 8, 10) %>% 
  row_spec(0, bold = T, italic = T) %>%
  row_spec(8, bold = T, italic = T) %>%
  column_spec(2, bold = T) %>%
  kable_styling(bootstrap_options = "striped", "hover", full_width = F) %>%
  kable_classic()
Summary of moderation analysis with IV: Smartphone Location and M: Distractibility.
b SE t p
Moderator
1 Distractibility 0.31015 0.57377 0.54054 0.58915
IV: Smartphone Location
2 Desk vs. Pocket/Bag 1.18539 2.09766 0.5651 0.57235
3 Desk vs. Outside 0.82823 2.13942 0.38713 0.69888
4 Outside vs. Pocket/Bag 0.35716 2.11276 0.16905 0.86585
Interactions: Moderator x IV
5 Dist x DvPB 0.14489 0.85393 0.16967 0.86536
6 Dist x DvO -0.21229 0.80601 -0.26339 0.7924
7 Dist x OvPB 0.35718 0.84878 0.42082 0.67413
Model
df1 df2 F p
8 Overall Model Fit 5 369 0.22999 0.94932
9 R-squared 0.00311
Age

For age the moderator, IV, and interactions were not significant.

# create data frame with model values -- all values rounded to 5 decimals
  # note: you can see non-formatted model summary using "summary(mod_SUQ_age1$model1)" or "summary(mod_SUQ_age2$model1)"
mod_SUQ_age_t <- 
  # add b estimate, se, t, p
  rbind(data.frame("var" = c("Age", "Desk vs. Pocket/Bag", "Desk vs. Outside", "Outside vs. Pocket/Bag", 
                                   "Age x DvPB", "Age x DvO", "Age x OvPB"), 
                   "b" = round(c(mod_SUQ_age1$model1$coefficients[2:4], mod_SUQ_age2$model1$coefficients[3], mod_SUQ_age1$model1$coefficients[5:6],
                                 mod_SUQ_age2$model1$coefficients[5]), 5), 
                   "se" = round(c(summary(mod_SUQ_age1$model1)$coefficients[2:4,2], summary(mod_SUQ_age2$model1)$coefficients[3,2], 
                                  summary(mod_SUQ_age1$model1)$coefficients[5:6,2], summary(mod_SUQ_age2$model1)$coefficients[5,2]), 5), 
                   "t" = round(c(summary(mod_SUQ_age1$model1)$coefficients[2:4,3], summary(mod_SUQ_age2$model1)$coefficients[3,3], 
                                 summary(mod_SUQ_age1$model1)$coefficients[5:6,3], summary(mod_SUQ_age2$model1)$coefficients[5,3]), 5),
                   "p" = round(c(summary(mod_SUQ_age1$model1)$coefficients[2:4,4], summary(mod_SUQ_age2$model1)$coefficients[3,4], 
                                 summary(mod_SUQ_age1$model1)$coefficients[5:6,4], summary(mod_SUQ_age2$model1)$coefficients[5,4]), 5)
                         ), 
        # add labels for: F, df1, df2, p-value, and Rsq values
        c(" ", "df1", "df2", "F", "p"), 
        # add F, df1, df2, p-value, and Rsq values
        c("Overall Model Fit", round(c(summary(mod_SUQ_age1$model1)$fstatistic[2], summary(mod_SUQ_age1$model1)$fstatistic[3], summary(mod_SUQ_age1$model1)$fstatistic[1], 
                pf(summary(mod_SUQ_age1$model1)$fstatistic[1], summary(mod_SUQ_age1$model1)$fstatistic[2], summary(mod_SUQ_age1$model1)$fstatistic[3],lower.tail=FALSE)), 5)), 
        # add R squared value
        c("R-squared", round(summary(mod_SUQ_age1$model1)$r.squared, 5), rep(c(""), 3))
        )

# rename rows
rownames(mod_SUQ_age_t) <- c(1:7, " ", 8:9)

# Print table using kable
kable(mod_SUQ_age_t, caption = "Summary of moderation analysis with IV: Smartphone Location and M: Age", align = 'rcccc', row.names = T, col.names = c(" ", "b", "SE", "t", "p")) %>%
  # footnote(general = "Interaction variables ") %>%
  pack_rows("Moderator", 1, 1) %>%
  pack_rows("IV: Smartphone Location", 2, 4) %>%
  pack_rows("Interactions: Moderator x IV", 5, 7) %>%
  pack_rows(" ", 8, 10, indent = F) %>%
  pack_rows("Model", 8, 10) %>% 
  row_spec(0, bold = T, italic = T) %>%
  row_spec(8, bold = T, italic = T) %>%
  column_spec(2, bold = T) %>%
  kable_styling(bootstrap_options = "striped", "hover", full_width = F) %>%
  kable_classic()
Summary of moderation analysis with IV: Smartphone Location and M: Age
b SE t p
Moderator
1 Age -1.97546 1.77501 -1.11293 0.26647
IV: Smartphone Location
2 Desk vs. Pocket/Bag 1.55526 2.09747 0.74149 0.45887
3 Desk vs. Outside 1.00269 2.12117 0.47271 0.6367
4 Outside vs. Pocket/Bag 0.55256 2.09603 0.26362 0.79222
Interactions: Moderator x IV
5 Age x DvPB 0.02068 2.28762 0.00904 0.99279
6 Age x DvO -0.64199 2.32517 -0.2761 0.78263
7 Age x OvPB 0.66267 2.08285 0.31815 0.75055
Model
df1 df2 F p
8 Overall Model Fit 5 364 1.30937 0.25925
9 R-squared 0.01767
Simple Slopes
Dependence
Click to see output for simple slopes, including generated interpretations.
# summary of SS
lapply(mod_SAD_dep1$slopemodels, summary)
$desk

Call:
lm(formula = eqsim, data = finaldata)

Residuals:
    Min      1Q  Median      3Q     Max 
-40.098 -12.254   2.744  12.939  33.977 

Coefficients:
              Estimate Std. Error t value Pr(>|t|)    
(Intercept)    41.2565     1.5256  27.042   <2e-16 ***
SAD_dep_score   0.3082     0.3056   1.008    0.315    
---
Signif. codes:  0 ‘***’ 0.001 ‘**’ 0.01 ‘*’ 0.05 ‘.’ 0.1 ‘ ’ 1

Residual standard error: 17.12 on 124 degrees of freedom
Multiple R-squared:  0.008133,  Adjusted R-squared:  0.0001336 
F-statistic: 1.017 on 1 and 124 DF,  p-value: 0.3153


$pocketbag

Call:
lm(formula = eqsim, data = finaldata)

Residuals:
    Min      1Q  Median      3Q     Max 
-39.272 -10.501   2.114  11.969  33.403 

Coefficients:
              Estimate Std. Error t value Pr(>|t|)    
(Intercept)    42.0059     1.4226  29.528   <2e-16 ***
SAD_dep_score  -0.0964     0.3266  -0.295    0.768    
---
Signif. codes:  0 ‘***’ 0.001 ‘**’ 0.01 ‘*’ 0.05 ‘.’ 0.1 ‘ ’ 1

Residual standard error: 16.19 on 128 degrees of freedom
Multiple R-squared:  0.0006804, Adjusted R-squared:  -0.007127 
F-statistic: 0.08715 on 1 and 128 DF,  p-value: 0.7683


$outside

Call:
lm(formula = eqsim, data = finaldata)

Residuals:
    Min      1Q  Median      3Q     Max 
-39.257 -11.025   1.474  12.455  32.589 

Coefficients:
              Estimate Std. Error t value Pr(>|t|)    
(Intercept)   42.31547    1.49293  28.344   <2e-16 ***
SAD_dep_score  0.07667    0.33836   0.227    0.821    
---
Signif. codes:  0 ‘***’ 0.001 ‘**’ 0.01 ‘*’ 0.05 ‘.’ 0.1 ‘ ’ 1

Residual standard error: 16.27 on 118 degrees of freedom
Multiple R-squared:  0.0004349, Adjusted R-squared:  -0.008036 
F-statistic: 0.05135 on 1 and 118 DF,  p-value: 0.8211
lapply(mod_SAD_dep2$slopemodels, summary)
$outside

Call:
lm(formula = eqsim, data = finaldata)

Residuals:
    Min      1Q  Median      3Q     Max 
-39.257 -11.025   1.474  12.455  32.589 

Coefficients:
              Estimate Std. Error t value Pr(>|t|)    
(Intercept)   42.31547    1.49293  28.344   <2e-16 ***
SAD_dep_score  0.07667    0.33836   0.227    0.821    
---
Signif. codes:  0 ‘***’ 0.001 ‘**’ 0.01 ‘*’ 0.05 ‘.’ 0.1 ‘ ’ 1

Residual standard error: 16.27 on 118 degrees of freedom
Multiple R-squared:  0.0004349, Adjusted R-squared:  -0.008036 
F-statistic: 0.05135 on 1 and 118 DF,  p-value: 0.8211


$pocketbag

Call:
lm(formula = eqsim, data = finaldata)

Residuals:
    Min      1Q  Median      3Q     Max 
-39.272 -10.501   2.114  11.969  33.403 

Coefficients:
              Estimate Std. Error t value Pr(>|t|)    
(Intercept)    42.0059     1.4226  29.528   <2e-16 ***
SAD_dep_score  -0.0964     0.3266  -0.295    0.768    
---
Signif. codes:  0 ‘***’ 0.001 ‘**’ 0.01 ‘*’ 0.05 ‘.’ 0.1 ‘ ’ 1

Residual standard error: 16.19 on 128 degrees of freedom
Multiple R-squared:  0.0006804, Adjusted R-squared:  -0.007127 
F-statistic: 0.08715 on 1 and 128 DF,  p-value: 0.7683


$desk

Call:
lm(formula = eqsim, data = finaldata)

Residuals:
    Min      1Q  Median      3Q     Max 
-40.098 -12.254   2.744  12.939  33.977 

Coefficients:
              Estimate Std. Error t value Pr(>|t|)    
(Intercept)    41.2565     1.5256  27.042   <2e-16 ***
SAD_dep_score   0.3082     0.3056   1.008    0.315    
---
Signif. codes:  0 ‘***’ 0.001 ‘**’ 0.01 ‘*’ 0.05 ‘.’ 0.1 ‘ ’ 1

Residual standard error: 17.12 on 124 degrees of freedom
Multiple R-squared:  0.008133,  Adjusted R-squared:  0.0001336 
F-statistic: 1.017 on 1 and 124 DF,  p-value: 0.3153
# generated interpretations
cat(mod_SAD_dep1$interpretation)
At desk levels of location_desk, you see that every unit increase in SAD_dep_score predicts 0.31 unit change in ospan_absolute_score. 

At pocketbag levels of location_desk, you see that every unit increase in SAD_dep_score predicts -0.1 unit change in ospan_absolute_score. 

At outside levels of location_desk, you see that every unit increase in SAD_dep_score predicts 0.08 unit change in ospan_absolute_score. 
cat(mod_SAD_dep2$interpretation)
At outside levels of location_out, you see that every unit increase in SAD_dep_score predicts 0.08 unit change in ospan_absolute_score. 

At pocketbag levels of location_out, you see that every unit increase in SAD_dep_score predicts -0.1 unit change in ospan_absolute_score. 

At desk levels of location_out, you see that every unit increase in SAD_dep_score predicts 0.31 unit change in ospan_absolute_score. 
Emotional Attachment
Click to see output for simple slopes, including generated interpretations.
# summary of SS
lapply(mod_SAD_ea1$slopemodels, summary)
$desk

Call:
lm(formula = eqsim, data = finaldata)

Residuals:
    Min      1Q  Median      3Q     Max 
-40.524 -13.081   2.847  13.190  33.219 

Coefficients:
             Estimate Std. Error t value Pr(>|t|)    
(Intercept)   41.2539     1.5288  26.984   <2e-16 ***
SAD_ea_score  -0.6285     0.3247  -1.936   0.0551 .  
---
Signif. codes:  0 ‘***’ 0.001 ‘**’ 0.01 ‘*’ 0.05 ‘.’ 0.1 ‘ ’ 1

Residual standard error: 17.23 on 125 degrees of freedom
Multiple R-squared:  0.02911,   Adjusted R-squared:  0.02134 
F-statistic: 3.748 on 1 and 125 DF,  p-value: 0.05513


$pocketbag

Call:
lm(formula = eqsim, data = finaldata)

Residuals:
    Min      1Q  Median      3Q     Max 
-40.552 -11.348   2.237  10.927  33.927 

Coefficients:
             Estimate Std. Error t value Pr(>|t|)    
(Intercept)   42.0528     1.4290  29.428   <2e-16 ***
SAD_ea_score  -0.3099     0.2886  -1.074    0.285    
---
Signif. codes:  0 ‘***’ 0.001 ‘**’ 0.01 ‘*’ 0.05 ‘.’ 0.1 ‘ ’ 1

Residual standard error: 16.3 on 129 degrees of freedom
Multiple R-squared:  0.00886,   Adjusted R-squared:  0.001177 
F-statistic: 1.153 on 1 and 129 DF,  p-value: 0.2849


$outside

Call:
lm(formula = eqsim, data = finaldata)

Residuals:
    Min      1Q  Median      3Q     Max 
-41.385 -10.609   2.391  13.346  33.660 

Coefficients:
             Estimate Std. Error t value Pr(>|t|)    
(Intercept)  41.52341    1.52544   27.22   <2e-16 ***
SAD_ea_score -0.02242    0.32127   -0.07    0.944    
---
Signif. codes:  0 ‘***’ 0.001 ‘**’ 0.01 ‘*’ 0.05 ‘.’ 0.1 ‘ ’ 1

Residual standard error: 16.69 on 119 degrees of freedom
Multiple R-squared:  4.091e-05, Adjusted R-squared:  -0.008362 
F-statistic: 0.004868 on 1 and 119 DF,  p-value: 0.9445
lapply(mod_SAD_ea2$slopemodels, summary)
$outside

Call:
lm(formula = eqsim, data = finaldata)

Residuals:
    Min      1Q  Median      3Q     Max 
-41.385 -10.609   2.391  13.346  33.660 

Coefficients:
             Estimate Std. Error t value Pr(>|t|)    
(Intercept)  41.52341    1.52544   27.22   <2e-16 ***
SAD_ea_score -0.02242    0.32127   -0.07    0.944    
---
Signif. codes:  0 ‘***’ 0.001 ‘**’ 0.01 ‘*’ 0.05 ‘.’ 0.1 ‘ ’ 1

Residual standard error: 16.69 on 119 degrees of freedom
Multiple R-squared:  4.091e-05, Adjusted R-squared:  -0.008362 
F-statistic: 0.004868 on 1 and 119 DF,  p-value: 0.9445


$pocketbag

Call:
lm(formula = eqsim, data = finaldata)

Residuals:
    Min      1Q  Median      3Q     Max 
-40.552 -11.348   2.237  10.927  33.927 

Coefficients:
             Estimate Std. Error t value Pr(>|t|)    
(Intercept)   42.0528     1.4290  29.428   <2e-16 ***
SAD_ea_score  -0.3099     0.2886  -1.074    0.285    
---
Signif. codes:  0 ‘***’ 0.001 ‘**’ 0.01 ‘*’ 0.05 ‘.’ 0.1 ‘ ’ 1

Residual standard error: 16.3 on 129 degrees of freedom
Multiple R-squared:  0.00886,   Adjusted R-squared:  0.001177 
F-statistic: 1.153 on 1 and 129 DF,  p-value: 0.2849


$desk

Call:
lm(formula = eqsim, data = finaldata)

Residuals:
    Min      1Q  Median      3Q     Max 
-40.524 -13.081   2.847  13.190  33.219 

Coefficients:
             Estimate Std. Error t value Pr(>|t|)    
(Intercept)   41.2539     1.5288  26.984   <2e-16 ***
SAD_ea_score  -0.6285     0.3247  -1.936   0.0551 .  
---
Signif. codes:  0 ‘***’ 0.001 ‘**’ 0.01 ‘*’ 0.05 ‘.’ 0.1 ‘ ’ 1

Residual standard error: 17.23 on 125 degrees of freedom
Multiple R-squared:  0.02911,   Adjusted R-squared:  0.02134 
F-statistic: 3.748 on 1 and 125 DF,  p-value: 0.05513
# generated interpretations
cat(mod_SAD_ea1$interpretation)
At desk levels of location_desk, you see that every unit increase in SAD_ea_score predicts -0.63 unit change in ospan_absolute_score. 

At pocketbag levels of location_desk, you see that every unit increase in SAD_ea_score predicts -0.31 unit change in ospan_absolute_score. 

At outside levels of location_desk, you see that every unit increase in SAD_ea_score predicts -0.02 unit change in ospan_absolute_score. 
cat(mod_SAD_ea2$interpretation)
At outside levels of location_out, you see that every unit increase in SAD_ea_score predicts -0.02 unit change in ospan_absolute_score. 

At pocketbag levels of location_out, you see that every unit increase in SAD_ea_score predicts -0.31 unit change in ospan_absolute_score. 

At desk levels of location_out, you see that every unit increase in SAD_ea_score predicts -0.63 unit change in ospan_absolute_score. 
Accessibility
Click to see output for simple slopes, including generated interpretations.
# summary of SS
lapply(mod_SAD_access1$slopemodels, summary)
$desk

Call:
lm(formula = eqsim, data = finaldata)

Residuals:
    Min      1Q  Median      3Q     Max 
-42.917 -13.277   2.962  13.682  35.282 

Coefficients:
                 Estimate Std. Error t value Pr(>|t|)    
(Intercept)       40.9536     1.5324  26.725   <2e-16 ***
SAD_access_score  -0.3199     0.4059  -0.788    0.432    
---
Signif. codes:  0 ‘***’ 0.001 ‘**’ 0.01 ‘*’ 0.05 ‘.’ 0.1 ‘ ’ 1

Residual standard error: 17.13 on 123 degrees of freedom
Multiple R-squared:  0.005023,  Adjusted R-squared:  -0.003066 
F-statistic: 0.621 on 1 and 123 DF,  p-value: 0.4322


$pocketbag

Call:
lm(formula = eqsim, data = finaldata)

Residuals:
    Min      1Q  Median      3Q     Max 
-38.320  -9.685   0.977  12.046  34.572 

Coefficients:
                 Estimate Std. Error t value Pr(>|t|)    
(Intercept)       41.9552     1.4344  29.249   <2e-16 ***
SAD_access_score   0.2972     0.4134   0.719    0.474    
---
Signif. codes:  0 ‘***’ 0.001 ‘**’ 0.01 ‘*’ 0.05 ‘.’ 0.1 ‘ ’ 1

Residual standard error: 16.37 on 129 degrees of freedom
Multiple R-squared:  0.00399,   Adjusted R-squared:  -0.003731 
F-statistic: 0.5168 on 1 and 129 DF,  p-value: 0.4735


$outside

Call:
lm(formula = eqsim, data = finaldata)

Residuals:
    Min      1Q  Median      3Q     Max 
-41.643 -10.609   2.457  13.324  33.691 

Coefficients:
                 Estimate Std. Error t value Pr(>|t|)    
(Intercept)      41.47118    1.53100  27.088   <2e-16 ***
SAD_access_score -0.03342    0.44437  -0.075     0.94    
---
Signif. codes:  0 ‘***’ 0.001 ‘**’ 0.01 ‘*’ 0.05 ‘.’ 0.1 ‘ ’ 1

Residual standard error: 16.8 on 119 degrees of freedom
Multiple R-squared:  4.754e-05, Adjusted R-squared:  -0.008355 
F-statistic: 0.005657 on 1 and 119 DF,  p-value: 0.9402
lapply(mod_SAD_access2$slopemodels, summary)
$outside

Call:
lm(formula = eqsim, data = finaldata)

Residuals:
    Min      1Q  Median      3Q     Max 
-41.643 -10.609   2.457  13.324  33.691 

Coefficients:
                 Estimate Std. Error t value Pr(>|t|)    
(Intercept)      41.47118    1.53100  27.088   <2e-16 ***
SAD_access_score -0.03342    0.44437  -0.075     0.94    
---
Signif. codes:  0 ‘***’ 0.001 ‘**’ 0.01 ‘*’ 0.05 ‘.’ 0.1 ‘ ’ 1

Residual standard error: 16.8 on 119 degrees of freedom
Multiple R-squared:  4.754e-05, Adjusted R-squared:  -0.008355 
F-statistic: 0.005657 on 1 and 119 DF,  p-value: 0.9402


$pocketbag

Call:
lm(formula = eqsim, data = finaldata)

Residuals:
    Min      1Q  Median      3Q     Max 
-38.320  -9.685   0.977  12.046  34.572 

Coefficients:
                 Estimate Std. Error t value Pr(>|t|)    
(Intercept)       41.9552     1.4344  29.249   <2e-16 ***
SAD_access_score   0.2972     0.4134   0.719    0.474    
---
Signif. codes:  0 ‘***’ 0.001 ‘**’ 0.01 ‘*’ 0.05 ‘.’ 0.1 ‘ ’ 1

Residual standard error: 16.37 on 129 degrees of freedom
Multiple R-squared:  0.00399,   Adjusted R-squared:  -0.003731 
F-statistic: 0.5168 on 1 and 129 DF,  p-value: 0.4735


$desk

Call:
lm(formula = eqsim, data = finaldata)

Residuals:
    Min      1Q  Median      3Q     Max 
-42.917 -13.277   2.962  13.682  35.282 

Coefficients:
                 Estimate Std. Error t value Pr(>|t|)    
(Intercept)       40.9536     1.5324  26.725   <2e-16 ***
SAD_access_score  -0.3199     0.4059  -0.788    0.432    
---
Signif. codes:  0 ‘***’ 0.001 ‘**’ 0.01 ‘*’ 0.05 ‘.’ 0.1 ‘ ’ 1

Residual standard error: 17.13 on 123 degrees of freedom
Multiple R-squared:  0.005023,  Adjusted R-squared:  -0.003066 
F-statistic: 0.621 on 1 and 123 DF,  p-value: 0.4322
# generated interpretations
cat(mod_SAD_access1$interpretation)
At desk levels of location_desk, you see that every unit increase in SAD_access_score predicts -0.32 unit change in ospan_absolute_score. 

At pocketbag levels of location_desk, you see that every unit increase in SAD_access_score predicts 0.3 unit change in ospan_absolute_score. 

At outside levels of location_desk, you see that every unit increase in SAD_access_score predicts -0.03 unit change in ospan_absolute_score. 
cat(mod_SAD_access2$interpretation)
At outside levels of location_out, you see that every unit increase in SAD_access_score predicts -0.03 unit change in ospan_absolute_score. 

At pocketbag levels of location_out, you see that every unit increase in SAD_access_score predicts 0.3 unit change in ospan_absolute_score. 

At desk levels of location_out, you see that every unit increase in SAD_access_score predicts -0.32 unit change in ospan_absolute_score. 
Distractibility
Click to see output for simple slopes, including generated interpretations.
# summary of SS
lapply(mod_SAD_dist1$slopemodels, summary)
$desk

Call:
lm(formula = eqsim, data = finaldata)

Residuals:
    Min      1Q  Median      3Q     Max 
-39.832 -12.090   3.393  13.307  33.928 

Coefficients:
               Estimate Std. Error t value Pr(>|t|)    
(Intercept)     40.7987     1.5357  26.567   <2e-16 ***
SAD_dist_score   0.3101     0.5865   0.529    0.598    
---
Signif. codes:  0 ‘***’ 0.001 ‘**’ 0.01 ‘*’ 0.05 ‘.’ 0.1 ‘ ’ 1

Residual standard error: 17.06 on 122 degrees of freedom
Multiple R-squared:  0.002287,  Adjusted R-squared:  -0.005891 
F-statistic: 0.2796 on 1 and 122 DF,  p-value: 0.5979


$pocketbag

Call:
lm(formula = eqsim, data = finaldata)

Residuals:
    Min      1Q  Median      3Q     Max 
-39.386 -10.727   1.659  12.069  33.069 

Coefficients:
               Estimate Std. Error t value Pr(>|t|)    
(Intercept)     41.9841     1.4292  29.376   <2e-16 ***
SAD_dist_score   0.4550     0.6174   0.737    0.462    
---
Signif. codes:  0 ‘***’ 0.001 ‘**’ 0.01 ‘*’ 0.05 ‘.’ 0.1 ‘ ’ 1

Residual standard error: 16.29 on 128 degrees of freedom
Multiple R-squared:  0.004226,  Adjusted R-squared:  -0.003554 
F-statistic: 0.5432 on 1 and 128 DF,  p-value: 0.4624


$outside

Call:
lm(formula = eqsim, data = finaldata)

Residuals:
    Min      1Q  Median      3Q     Max 
-38.420 -10.713   1.993  12.993  33.287 

Coefficients:
               Estimate Std. Error t value Pr(>|t|)    
(Intercept)    41.62692    1.52668  27.266   <2e-16 ***
SAD_dist_score  0.09785    0.56734   0.172    0.863    
---
Signif. codes:  0 ‘***’ 0.001 ‘**’ 0.01 ‘*’ 0.05 ‘.’ 0.1 ‘ ’ 1

Residual standard error: 16.73 on 119 degrees of freedom
Multiple R-squared:  0.0002499, Adjusted R-squared:  -0.008151 
F-statistic: 0.02975 on 1 and 119 DF,  p-value: 0.8634
lapply(mod_SAD_dist2$slopemodels, summary)
$outside

Call:
lm(formula = eqsim, data = finaldata)

Residuals:
    Min      1Q  Median      3Q     Max 
-38.420 -10.713   1.993  12.993  33.287 

Coefficients:
               Estimate Std. Error t value Pr(>|t|)    
(Intercept)    41.62692    1.52668  27.266   <2e-16 ***
SAD_dist_score  0.09785    0.56734   0.172    0.863    
---
Signif. codes:  0 ‘***’ 0.001 ‘**’ 0.01 ‘*’ 0.05 ‘.’ 0.1 ‘ ’ 1

Residual standard error: 16.73 on 119 degrees of freedom
Multiple R-squared:  0.0002499, Adjusted R-squared:  -0.008151 
F-statistic: 0.02975 on 1 and 119 DF,  p-value: 0.8634


$pocketbag

Call:
lm(formula = eqsim, data = finaldata)

Residuals:
    Min      1Q  Median      3Q     Max 
-39.386 -10.727   1.659  12.069  33.069 

Coefficients:
               Estimate Std. Error t value Pr(>|t|)    
(Intercept)     41.9841     1.4292  29.376   <2e-16 ***
SAD_dist_score   0.4550     0.6174   0.737    0.462    
---
Signif. codes:  0 ‘***’ 0.001 ‘**’ 0.01 ‘*’ 0.05 ‘.’ 0.1 ‘ ’ 1

Residual standard error: 16.29 on 128 degrees of freedom
Multiple R-squared:  0.004226,  Adjusted R-squared:  -0.003554 
F-statistic: 0.5432 on 1 and 128 DF,  p-value: 0.4624


$desk

Call:
lm(formula = eqsim, data = finaldata)

Residuals:
    Min      1Q  Median      3Q     Max 
-39.832 -12.090   3.393  13.307  33.928 

Coefficients:
               Estimate Std. Error t value Pr(>|t|)    
(Intercept)     40.7987     1.5357  26.567   <2e-16 ***
SAD_dist_score   0.3101     0.5865   0.529    0.598    
---
Signif. codes:  0 ‘***’ 0.001 ‘**’ 0.01 ‘*’ 0.05 ‘.’ 0.1 ‘ ’ 1

Residual standard error: 17.06 on 122 degrees of freedom
Multiple R-squared:  0.002287,  Adjusted R-squared:  -0.005891 
F-statistic: 0.2796 on 1 and 122 DF,  p-value: 0.5979
# generated interpretations
cat(mod_SAD_dist1$interpretation)
At desk levels of location_desk, you see that every unit increase in SAD_dist_score predicts 0.31 unit change in ospan_absolute_score. 

At pocketbag levels of location_desk, you see that every unit increase in SAD_dist_score predicts 0.46 unit change in ospan_absolute_score. 

At outside levels of location_desk, you see that every unit increase in SAD_dist_score predicts 0.1 unit change in ospan_absolute_score. 
cat(mod_SAD_dist2$interpretation)
At outside levels of location_out, you see that every unit increase in SAD_dist_score predicts 0.1 unit change in ospan_absolute_score. 

At pocketbag levels of location_out, you see that every unit increase in SAD_dist_score predicts 0.46 unit change in ospan_absolute_score. 

At desk levels of location_out, you see that every unit increase in SAD_dist_score predicts 0.31 unit change in ospan_absolute_score. 
Age
Click to see output for simple slopes, including generated interpretations.
# summary of SS
lapply(mod_SUQ_age1$slopemodels, summary)
$desk

Call:
lm(formula = eqsim, data = finaldata)

Residuals:
    Min      1Q  Median      3Q     Max 
-42.173 -11.941   2.803  11.827  34.803 

Coefficients:
            Estimate Std. Error t value Pr(>|t|)    
(Intercept)   40.683      1.550  26.254   <2e-16 ***
age           -1.975      1.833  -1.078    0.283    
---
Signif. codes:  0 ‘***’ 0.001 ‘**’ 0.01 ‘*’ 0.05 ‘.’ 0.1 ‘ ’ 1

Residual standard error: 16.98 on 120 degrees of freedom
Multiple R-squared:  0.00959,   Adjusted R-squared:  0.001337 
F-statistic: 1.162 on 1 and 120 DF,  p-value: 0.2832


$pocketbag

Call:
lm(formula = eqsim, data = finaldata)

Residuals:
    Min      1Q  Median      3Q     Max 
-35.758 -10.474   0.788  10.936  33.242 

Coefficients:
            Estimate Std. Error t value Pr(>|t|)    
(Intercept)   42.238      1.369   30.86   <2e-16 ***
age           -1.955      1.348   -1.45     0.15    
---
Signif. codes:  0 ‘***’ 0.001 ‘**’ 0.01 ‘*’ 0.05 ‘.’ 0.1 ‘ ’ 1

Residual standard error: 15.36 on 124 degrees of freedom
Multiple R-squared:  0.01668,   Adjusted R-squared:  0.008746 
F-statistic: 2.103 on 1 and 124 DF,  p-value: 0.1495


$outside

Call:
lm(formula = eqsim, data = finaldata)

Residuals:
    Min      1Q  Median      3Q     Max 
-41.042 -11.409   1.341  11.980  36.575 

Coefficients:
            Estimate Std. Error t value Pr(>|t|)    
(Intercept)   41.686      1.547  26.940   <2e-16 ***
age           -2.617      1.550  -1.688    0.094 .  
---
Signif. codes:  0 ‘***’ 0.001 ‘**’ 0.01 ‘*’ 0.05 ‘.’ 0.1 ‘ ’ 1

Residual standard error: 16.98 on 120 degrees of freedom
Multiple R-squared:  0.0232,    Adjusted R-squared:  0.01506 
F-statistic:  2.85 on 1 and 120 DF,  p-value: 0.09398
lapply(mod_SUQ_age2$slopemodels, summary)
$outside

Call:
lm(formula = eqsim, data = finaldata)

Residuals:
    Min      1Q  Median      3Q     Max 
-41.042 -11.409   1.341  11.980  36.575 

Coefficients:
            Estimate Std. Error t value Pr(>|t|)    
(Intercept)   41.686      1.547  26.940   <2e-16 ***
age           -2.617      1.550  -1.688    0.094 .  
---
Signif. codes:  0 ‘***’ 0.001 ‘**’ 0.01 ‘*’ 0.05 ‘.’ 0.1 ‘ ’ 1

Residual standard error: 16.98 on 120 degrees of freedom
Multiple R-squared:  0.0232,    Adjusted R-squared:  0.01506 
F-statistic:  2.85 on 1 and 120 DF,  p-value: 0.09398


$pocketbag

Call:
lm(formula = eqsim, data = finaldata)

Residuals:
    Min      1Q  Median      3Q     Max 
-35.758 -10.474   0.788  10.936  33.242 

Coefficients:
            Estimate Std. Error t value Pr(>|t|)    
(Intercept)   42.238      1.369   30.86   <2e-16 ***
age           -1.955      1.348   -1.45     0.15    
---
Signif. codes:  0 ‘***’ 0.001 ‘**’ 0.01 ‘*’ 0.05 ‘.’ 0.1 ‘ ’ 1

Residual standard error: 15.36 on 124 degrees of freedom
Multiple R-squared:  0.01668,   Adjusted R-squared:  0.008746 
F-statistic: 2.103 on 1 and 124 DF,  p-value: 0.1495


$desk

Call:
lm(formula = eqsim, data = finaldata)

Residuals:
    Min      1Q  Median      3Q     Max 
-42.173 -11.941   2.803  11.827  34.803 

Coefficients:
            Estimate Std. Error t value Pr(>|t|)    
(Intercept)   40.683      1.550  26.254   <2e-16 ***
age           -1.975      1.833  -1.078    0.283    
---
Signif. codes:  0 ‘***’ 0.001 ‘**’ 0.01 ‘*’ 0.05 ‘.’ 0.1 ‘ ’ 1

Residual standard error: 16.98 on 120 degrees of freedom
Multiple R-squared:  0.00959,   Adjusted R-squared:  0.001337 
F-statistic: 1.162 on 1 and 120 DF,  p-value: 0.2832
# generated interpretations
cat(mod_SUQ_age1$interpretation)
At desk levels of location_desk, you see that every unit increase in age predicts -1.98 unit change in ospan_absolute_score. 

At pocketbag levels of location_desk, you see that every unit increase in age predicts -1.95 unit change in ospan_absolute_score. 

At outside levels of location_desk, you see that every unit increase in age predicts -2.62 unit change in ospan_absolute_score. 
cat(mod_SUQ_age2$interpretation)
At outside levels of location_out, you see that every unit increase in age predicts -2.62 unit change in ospan_absolute_score. 

At pocketbag levels of location_out, you see that every unit increase in age predicts -1.95 unit change in ospan_absolute_score. 

At desk levels of location_out, you see that every unit increase in age predicts -1.98 unit change in ospan_absolute_score. 
Visualizations

The visualizations below are presented either in panel form (i.e., with all plots presented together) or separate.

Separate

Dependence was not a significant moderator for OSpan absolute score. No notable trends in the data were observed.

mod_SAD_dep_bar_data <- 
  mod_SAD_dep1$datascreening$fulldata %>% 
  # remove rows w/ outlier ps
  filter(totalout != 2) %>% 
  # select the data we want: IV, DV, M
  select(ospan_absolute_score, SAD_dep_score, location_desk) %>% 
  # replace moderator score with mean-centered
  mutate(SAD_dep_score = c(scale(SAD_dep_score, center = TRUE, scale = FALSE)))

mod_SAD_dep_bar <- ggplot(mod_SAD_dep_bar_data, aes(x = SAD_dep_score, y = ospan_absolute_score, color = location_desk)) +
  geom_point()+
  geom_smooth(method = lm, aes(fill = location_desk)) + 
  coord_cartesian(xlim = c(min(mod_SAD_dep_bar_data$SAD_dep_score), max(mod_SAD_dep_bar_data$SAD_dep_score)), 
                  ylim = c(min(mod_SAD_dep_bar_data$ospan_absolute_score), max(mod_SAD_dep_bar_data$ospan_absolute_score))) +
  labs(title = "Moderating OSpan Absolute Score with Dependence", 
       x = "Dependence", 
       y = "OSpan Absolute Score", 
       color = "Smartphone Location",
       fill = "Smartphone Location",
       caption = "Note: Smartphone dependency is mean-centered."
       ) +
  scale_fill_manual(values = c("lightseagreen", "purple3", "deepskyblue4")) +
  scale_colour_manual(values = c("lightseagreen", "purple4", "deepskyblue4")) +
  theme_classic()

#show fig
mod_SAD_dep_bar
`geom_smooth()` using formula 'y ~ x'

# export plot
# ggsave(filename="fig_mod_SAD_dep_bar_june14", plot = mod_SAD_dep_bar, device = "png", height = 5, width = 8, units = "in", dpi = 500)

Emotional Attachment was not a significant moderator for OSpan absolute score. A notable trend in the data was observed, where participants with less emotional attachment showed better OSpan performance in the on desk condition. The outside condition showed the same but smaller trned. The pocket/bag condition did not show this trend as the slope were closer to zero.

mod_SAD_ea_bar_data <- 
  mod_SAD_ea1$datascreening$fulldata %>% 
  # remove rows w/ outlier ps
  filter(totalout != 2) %>% 
  # select the data we want: IV, DV, M
  select(ospan_absolute_score, SAD_ea_score, location_desk) %>% 
  # replace moderator score with mean-centered
  mutate(SAD_ea_score = c(scale(SAD_ea_score, center = TRUE, scale = FALSE)))

mod_SAD_ea_bar <- ggplot(mod_SAD_ea_bar_data, aes(x = SAD_ea_score, y = ospan_absolute_score, color = location_desk)) +
  geom_point()+
  geom_smooth(method = lm, aes(fill = location_desk)) + 
  coord_cartesian(xlim = c(min(mod_SAD_ea_bar_data$SAD_ea_score), max(mod_SAD_ea_bar_data$SAD_ea_score)), 
                  ylim = c(min(mod_SAD_ea_bar_data$ospan_absolute_score), max(mod_SAD_ea_bar_data$ospan_absolute_score))) +
  labs(title = "Moderating OSpan Absolute Score with Emotional Attachment", 
       x = "Emotional Attachment", 
       y = "OSpan Absolute Score", 
       color = "Smartphone Location",
       fill = "Smartphone Location",
       caption = "Note: Smartphone Emotional Attachment is mean-centered."
       ) +
  scale_fill_manual(values = c("lightseagreen", "purple3", "deepskyblue4")) +
  scale_colour_manual(values = c("lightseagreen", "purple4", "deepskyblue4")) +
  theme_classic()

#show fig
mod_SAD_ea_bar
`geom_smooth()` using formula 'y ~ x'

# export plot
# ggsave(filename="fig_mod_SAD_ea_bar_june14", plot = mod_SAD_ea_bar, device = "png", height = 5, width = 8, units = "in", dpi = 500)

Accessibility was not a significant moderator for OSpan absolute score. No notable trends in the data were observed.

mod_SAD_access_bar_data <- 
  mod_SAD_access1$datascreening$fulldata %>% 
  # remove rows w/ outlier ps
  filter(totalout != 2) %>% 
  # select the data we want: IV, DV, M
  select(ospan_absolute_score, SAD_access_score, location_desk) %>% 
  # replace moderator score with mean-centered
  mutate(SAD_access_score = c(scale(SAD_access_score, center = TRUE, scale = FALSE)))

mod_SAD_access_bar <- ggplot(mod_SAD_access_bar_data, aes(x = SAD_access_score, y = ospan_absolute_score, color = location_desk)) +
  geom_point()+
  geom_smooth(method = lm, aes(fill = location_desk)) + 
  coord_cartesian(xlim = c(min(mod_SAD_access_bar_data$SAD_access_score), max(mod_SAD_access_bar_data$SAD_access_score)), 
                  ylim = c(min(mod_SAD_access_bar_data$ospan_absolute_score), max(mod_SAD_access_bar_data$ospan_absolute_score))) +
  labs(title = "Moderating OSpan Absolute Score with Accessibility", 
       x = "Accessibility", 
       y = "OSpan Absolute Score", 
       color = "Smartphone Location",
       fill = "Smartphone Location",
       caption = "Note: Accessibility is mean-centered."
       ) +
  scale_fill_manual(values = c("lightseagreen", "purple3", "deepskyblue4")) +
  scale_colour_manual(values = c("lightseagreen", "purple4", "deepskyblue4")) +
  theme_classic()

#show fig
mod_SAD_access_bar
`geom_smooth()` using formula 'y ~ x'

# export plot
# ggsave(filename="fig_mod_SAD_access_bar_june14", plot = mod_SAD_access_bar, device = "png", height = 5, width = 8, units = "in", dpi = 500)

Distractibility was not a significant moderator for OSpan absolute score. No notable trends in the data were observed.

mod_SAD_dist_bar_data <- 
  mod_SAD_dist1$datascreening$fulldata %>% 
  # remove rows w/ outlier ps
  filter(totalout != 2) %>% 
  # select the data we want: IV, DV, M
  select(ospan_absolute_score, SAD_dist_score, location_desk) %>% 
  # replace moderator score with mean-centered
  mutate(SAD_dist_score = c(scale(SAD_dist_score, center = TRUE, scale = FALSE)))

mod_SAD_dist_bar <- ggplot(mod_SAD_dist_bar_data, aes(x = SAD_dist_score, y = ospan_absolute_score, color = location_desk)) +
  geom_point()+
  geom_smooth(method = lm, aes(fill = location_desk)) + 
  coord_cartesian(xlim = c(min(mod_SAD_dist_bar_data$SAD_dist_score), max(mod_SAD_dist_bar_data$SAD_dist_score)), 
                  ylim = c(min(mod_SAD_dist_bar_data$ospan_absolute_score), max(mod_SAD_dist_bar_data$ospan_absolute_score))) +
  labs(title = "Moderating OSpan Absolute Score with Distractibility", 
       x = "Distractibility", 
       y = "OSpan Absolute Score", 
       color = "Smartphone Location",
       fill = "Smartphone Location",
       caption = "Note: Smartphone Distractibility is mean-centered."
       ) +
  scale_fill_manual(values = c("lightseagreen", "purple3", "deepskyblue4")) +
  scale_colour_manual(values = c("lightseagreen", "purple4", "deepskyblue4")) +
  theme_classic()

#show fig
mod_SAD_dist_bar
`geom_smooth()` using formula 'y ~ x'

# export plot
# ggsave(filename="fig_mod_SAD_dist_bar_june14", plot = mod_SAD_dist_bar, device = "png", height = 5, width = 8, units = "in", dpi = 500)

Age was not a significant moderator for OSpan absolute score. A notable trend in the data was observed: for all location conditions, younger participants tended to show better task performance.

mod_SUQ_age_bar_data <- 
  mod_SUQ_age1$datascreening$fulldata %>% 
  # remove rows w/ outlier ps
  filter(totalout != 2) %>% 
  # select the data we want: IV, DV, M
  select(ospan_absolute_score, age, location_desk) %>% 
  # replace moderator score with mean-centered
  mutate(age = c(scale(age, center = TRUE, scale = FALSE)))

mod_SUQ_age_bar <- ggplot(mod_SUQ_age_bar_data, aes(x = age, y = ospan_absolute_score, color = location_desk)) +
  geom_point()+
  geom_smooth(method = lm, aes(fill = location_desk)) + 
  coord_cartesian(xlim = c(min(mod_SUQ_age_bar_data$age), max(mod_SUQ_age_bar_data$age)), 
                  ylim = c(min(mod_SUQ_age_bar_data$ospan_absolute_score), max(mod_SUQ_age_bar_data$ospan_absolute_score))) +
  labs(title = "Moderating OSpan Absolute Score with Age", 
       x = "Age (years)", 
       y = "OSpan Absolute Score", 
       color = "Smartphone Location",
       fill = "Smartphone Location",
       caption = "Note: Age is mean-centered."
       ) +
  scale_fill_manual(values = c("lightseagreen", "purple3", "deepskyblue4")) +
  scale_colour_manual(values = c("lightseagreen", "purple4", "deepskyblue4")) +
  theme_classic()

#show fig
mod_SUQ_age_bar
`geom_smooth()` using formula 'y ~ x'

# export plot
# ggsave(filename="fig_mod_SUQ_age_bar", plot = mod_SUQ_age_bar, device = "png", height = 5, width = 8, units = "in", dpi = 500)
Panel
# create join data set with all moderator data
mod_all_data_plot <- 
  cbind(data.frame("moderator" = c(rep(1, nrow(mod_SAD_dep_bar_data)), rep(2, nrow(mod_SAD_ea_bar_data)), rep(3, nrow(mod_SAD_access_bar_data)), rep(4, nrow(mod_SAD_dist_bar_data)))), 
        rbind(
        # SAD_dep data
        mod_SAD_dep_bar_data %>% dplyr::rename(mod_score = SAD_dep_score), # rename column to join correctly
        mod_SAD_ea_bar_data %>% dplyr::rename(mod_score = SAD_ea_score), # rename column to join correctly
        mod_SAD_access_bar_data %>% dplyr::rename(mod_score = SAD_access_score), # rename column to join correctly
        mod_SAD_dist_bar_data %>% dplyr::rename(mod_score = SAD_dist_score) # rename column to join correctly
        )
        ) %>% 
  mutate(moderator = factor(moderator, levels = c(1:4), labels = c("(A) Dependency", "(B) Emotional Attachment", "(C) Accessibility", "(D) Distractibility"))) %>% 
  mutate(location_desk = factor(location_desk, labels = c("Desk", "Pocket/Bag", "Outside")))

glimpse(mod_all_data_plot)
Rows: 1,507
Columns: 4
$ moderator            <fct> (A) Dependency, (A) Dependency, (A) Dependency, (A) Dependency, (A) Dependency, (A) Dependency, (A) Dependency, (A) Dependency, (A) Dependency, (A) Dependency, (A) Dependency, (A…
$ ospan_absolute_score <int> 38, 37, 28, 51, 37, 55, 29, 43, 29, 35, 46, 50, 39, 61, 13, 37, 61, 43, 24, 52, 8, 29, 62, 29, 34, 15, 68, 37, 37, 27, 55, 12, 55, 68, 48, 44, 48, 19, 15, 55, 10, 26, 61, 15, 52,…
$ mod_score            <dbl> -1.7579787, 4.2420213, -4.7579787, -1.7579787, -2.7579787, 0.2420213, 1.2420213, -2.7579787, -2.7579787, -1.7579787, 8.2420213, 0.2420213, 2.2420213, -2.7579787, 8.2420213, 4.242…
$ location_desk        <fct> Desk, Desk, Pocket/Bag, Desk, Desk, Pocket/Bag, Pocket/Bag, Outside, Outside, Desk, Desk, Pocket/Bag, Outside, Outside, Desk, Desk, Pocket/Bag, Outside, Desk, Pocket/Bag, Outside…
# create panel fig
mod_all_plot <-
  ggplot(mod_all_data_plot, aes(x = mod_score, y = ospan_absolute_score, group = location_desk)) +
  geom_point(stat = "identity", aes(shape = location_desk, color = location_desk), size = 1.5)+
  geom_smooth(method = lm, aes(fill = location_desk, color = location_desk), size = 1) + 
  labs(
    # title = "Moderating OSpan Absolute Score",
       x = "Mean-Centered Score", 
       y = "OSpan Absolute Score", 
       color = "Smartphone Location",
       fill = "Smartphone Location",
       shape = "Smartphone Location") +
  facet_wrap(~moderator, ncol = 2) +
  scale_fill_manual(values = c("#18BAB3", "#62129B", "#007298")) +
  scale_colour_manual(values = c("#18BAB3", "#62129B", "#007298")) +
  scale_y_continuous(limits = c(0, 75), breaks = c(seq(0,75, 25))) + 
  theme_classic() +
  theme(legend.position = "bottom")
  

# show plot
mod_all_plot
`geom_smooth()` using formula 'y ~ x'

# export plots
# ggsave(filename="fig_mod_all_plot_june16", plot = mod_all_plot, device = "jpeg", height = 5, width = 8, units = "in", dpi = 1000)
---
title: 'Reexamining the “Brain Drain” Effect: Does Ward et al. (2017) replicate?'
author: "A. C. Ruiz Pardo and  Dr. J. P. Minda"
date: "October 15, 2021"
output:
  html_notebook:
    toc: yes
    toc_depth: '6'
    toc_float:
      collapsed: yes
    theme: lumen
    code_folding: hide
  pdf_document:
    toc: yes
    toc_depth: '6'
  html_document:
    toc: yes
    toc_depth: '6'
    toc_float:
      collapsed: yes
    theme: lumen
    code_folding: hide
---

```{=html}
<style type="text/css">

body{ /* Normal  */
      font-size: 15px;
      color: black;}
td {  /* Table  */
  font-size: 14px;}
h1.title {
  font-size: 40px;
  color: #4F2683}
h1 { /* Header 1 */
  font-size: 35px;
  color: #4F2683;}
h2 { /* Header 2 */
  font-size: 33px;
  color: #4F2683;}
h3 { /* Header 3 */
  font-size: 31px;
  color: #4F2683;}
h4 { /* Header 4 */
  font-size: 29px;
  color: #4F2683;}
h5 { /* Header 5 */
  font-size: 27px;
  color: #4F2683;}
h6 { /* Header 6 */
  font-size: 25px;
  color: #4F2683;}

</style>
```
<div style="text-align: left">

```{r include=FALSE}
knitr::opts_chunk$set(warning=FALSE, message=FALSE)
```

# Introduction 

This document outlines the analyses from the paper **Reexamining the "Brain Drain" Effect: Does Ward et al. (2017) replicate?** (Ruiz Pardo & Minda, Oct 2021 preprint, <https://psyarxiv.com/5y34g/>). The purpose of this study was to investigate Ward et al.'s (2017) second experiment by completing a direct replication. Therefore, the present study investigated how the mere presence of one's smartphone effects cognition. This project can be found on OSF (<https://osf.io/ubys7/>) and was pre-registered through OSF (<https://osf.io/5fq4r>) .

The main goal of the current study was to see if Ward et al.'s findings replicated (experiment two). Therefore, as in Ward et al., participants were randomly assigned to one of *six* possible conditions. These conditions were based on two independent variables: **smartphone location** and **smartphone power**. For smartphone location, a participant's smartphone was in one of three locations: (1) on the participant's desk ("on desk"), (2) in their pocket/bag ("pocket/bag"), or (3) outside the testing room ("outside"). For smartphone power, a participant's smartphone was either (1) powered ON or (2) powered OFF, in their respective location. For all conditions, participants were instructed to keep their smartphones on silent (i.e., no vibrations if any notifications are received during the study). Also, as per Ward et al., participants in the "on desk" location conditions were instructed to keep their devices facing up.

Each participant completed **two main tasks** (counter-balanced):

**1. An Automated Operation Span (OSpan) Task** (Unsworth et al., 2005), which required participants to retain letter strings (i.e., ranging from three to seven letters in length) in memory while solving some simple math problems (e.g., "(7/7) + 6 = ?"). This task was a behavioural measure of the attentional control component of working memory.

-   Scoring:

    -   Performance, measured with the OSpan absolute score, was shown by how many trials a participant correctly recalled all the letters in a given block (75 blocks in total). For example, a participant who recalled three letters (in a block with three letters), five letters (in a block with five letters), and two letters (in a block with six letters) would have an OSpan absolute score of eight for those blocks (i.e., 3 + 5 + 0 = 8). Therefore, OSpan absolute score showed performance where higher scores represented better performance.

**2. The Cue-Dependent Go/No-Go Task** (Bezdjia et al., 2009), which required participants to respond to "go" targets (i.e., green rectangle) while withholding any response to "no-go" targets (i.e. blue rectangle). This task included a cue component, where vertical targets (i.e., rectangles) were more likely to be "go" targets (i.e., 80% "go" and 20% "no-go") and horizontal targets were more likely to be "no-go" targets (i.e., 80% "no-go" and 20% "go"). Participants were not explicitly made aware of the cue component. This task was a behavioural measure of sustained attention.

-   Scoring:

    -   Performance was measured with mean omission errors and RT. It should be noted that mean errors can be divided into total error, commission error, and omission error. Commission errors occurred when a participant responded to a target stimulus. Omission errors occurred when a participant failed to respond to a non-target stimulus. Total errors were the sum of commission and omission errors. The present study focused on mean omission errors. Therefore, for each participant, higher mean omission errors represented lower performance. Additionally, higher mean RT also showed lower performance (i.e., indicative of greater interference).

Finally, after completing both tasks, all participants completed **three** survey measures:

**1. The Demographic Questionnaire**, which was created for the present study and consisted of four items: age (i.e., in years), gender (i.e., male, female, other, or prefer not to say), program (e.g., psychology, engineering), and year of study (e.g., first, fourth). The purpose of these items was to give a brief description of the sample.

**2. The Smartphone Usage Questionnaire (SUQ)**, which was created for the present study and consisted of modified items from Ward et al.'s survey measures (i.e., found in the "BRAIN DRAIN" WEB APPENDIX"). Some items were forced-choice and some were on a 7-point likert scale ranging from 1 ("Never") to 7 ("Always"). There were 10 items in total and there were five types of items, which measured: (1) smartphone use frequency (three items; e.g., "On average, how many text messages do you send per day?"); (2) smartphone use without external stimulation (two items; e.g., "If I am waiting to meet a friend, I pass the time by using my smartphone."), or during other activities (two items; e.g., "I use my smartphone while driving."); (3) smartphone subjective value (one item; e.g., "How much money would it take for you to give up your phone for a full day?"); (4) smartphone notification type (one item; e.g., "Do you receive notifications (a sound or vibration) on your phone? Please indicate all that apply."); and (5) phantom vibrations (one item; e.g., "Have you ever thought you heard your phone ring or thought you felt it vibrate, only to find out you were wrong?"). The purpose of the SUQ was to measure participants' typical smartphone use.

**3. Smartphone Attachment and Dependency Inventory (SAD)**, which was from Ward et al.'s (2017) study. The SAD used 13-items (i.e., 7-point likert scale) that asked participants to rate (i.e., from 1 = *Strongly Disagree* to 7 = *Strongly Agree*) how much they agreed with statements regarding their emotional attachment and dependency on their smartphones. The purpose of the SAD was to measure each participant's reliance on their smartphone.

-   Scoring:

    -   Smartphone reliance was calculated with a sum total for all items with a range of 13 to 91. Higher scores indicated a higher level of reliance with three levels: low reliance (13-39), moderate reliance (40-65), and high reliance (66+).

A total of **`r length(all_participants$all_participants)`** undergraduate students participated in this study. Of the total sample size, `r length(all_remove$all_participants)` participants were excluded due to either testing error (`r length(all_remove$all_participants[which(all_remove$reason_remove == "gng")])`; e.g., incomplete task data), or experimenter or external confounds (`r length(all_remove$all_participants[which(all_remove$reason_remove == "track")])`; e.g., interruption during testing, distracting noise during testing). Additionally, `r length(all_remove_dc_list$participant)` participants were removed during the data cleaning phase, which removed participants who met an exclusion criteria (OSpan math criteria: `r sum(all_remove_dc$reason == "ospan_ec")`; Go/No-Go response criteria, horizontal/no-go cue: `r sum(all_remove_dc$reason == "gng_h_ec")`; Go/No-Go response criteria, vertical/go cue: `r sum(all_remove_dc$reason == "gng_v_ec")`), were identified as having outlier data (OSpan: `r sum(all_remove_dc$reason == "ospan_out")`; Go/No-Go Error Analysis: `r sum(all_remove_dc$reason == "gng_err_out")`; Go/No-Go Accuracy Analysis: `r sum(all_remove_dc$reason == "gng_ac_out")`), and had incomplete or missing data (OSpan: `r sum(all_remove_dc$reason == "ospan_miss")`; Go/No-Go Error Analysis: `r sum(all_remove_dc$reason == "gng_err_miss")`; Go/No-Go Accuracy Analysis: `r sum(all_remove_dc$reason == "gng_ac_miss")`). Overall, **`r nrow(all_participants) - nrow(all_data)`** participants were removed from the analysis. Note: a participant may have been removed due to multiple criteria.

Therefore, **`r length(all_data$participant)`** participants were included in the analyses for the location (on desk = `r length(which(all_data$location == 1))`; pocket/bag = `r length(which(all_data$location == 2))`; outside = `r length(which(all_data$location == 3))`) and power (ON = `r length(which(all_data$power == 1))`; OFF = `r length(which(all_data$power == 2))`) condition.

# Data Analysis Plan

The present study completed a pre-registered data analysis (pre-registered direct replication on Open Science Framework; OSF). Below is a summary of the data analysis plan (view full version at <https://osf.io/5fq4r> or <https://osf.io/ubys7/>).

## Hypotheses

1.  The **location effect hypothesis** predicted that those who were closest in proximity to their smartphone (i.e., those with their smartphones on their desk) would show lower performance on the OSpan task.

2.  The **power effect hypothesis** predicted that smartphone power (i.e., either ON or OFF) would not affect performance on both cognitive tasks.

3.  The **moderation effect hypothesis** predicted that smartphone attachment and dependency (i.e., SAD score) would moderate the location effect: those who reported higher smartphone attachment and dependency would perform worse on the OSpan task.

*Note: All hypotheses were based on the findings from Ward et al. (2017)*

## Inclusion / Exclusion Criteria

There were **two** exclusion criteria in the present study:

**1. OSpan Task**

-   Only data from participants who scored 85% accuracy on the math component or above was used.

    -   Criteria helped control for any participants who did not follow instructions and neglected the math component of the task. *Note: This was the same criteria as in Ward et al. (2017)*

**2. Cue-Dependent Go/No-Go Task**

-   Only data from those who scored higher than chance performance was used (e.g., responded to at least 50% of "Go" trials and withheld response to at least 50% of "No-Go" trials).

    -   Criteria helped control for any participants who did not complete the task as instructed (e.g., guessing). *Note: There was no inclusion/exclusion criteria for thsi task in Ward et al. (2017)*

## Analysis

### Testing the Location and Power Hypotheses:

**1. OSpan Task**

-   The OSpan absolute score was used to measure OSpan performance.

-   A **3(smartphone location: desk, pocket/bag, outside) x 2(smartphone power: ON, OFF)** between-subjects ANOVA was conducted **(DV = OSpan absolute score)**.

    -   Post-hocs completed if there was a significant ANOVA ($\alpha = 0.05$) to determine which location(s) and/or power status were significantly different.

**2. Cue-Dependent Go/No-Go Task**

-   An error (i.e., omission errors) and accuracy (i.e., RT) analysis was completed using the average omission errors and RT for each participant.

    **(a) Error Analysis**

    -   A **3(Smartphone location: desk, pocket/bag, outside) x 2(Smartphone power: ON, OFF) x 2(Cue type: Go, No-Go) mixed ANOVA** was completed **(DV = average omission error(s))**.

        -   The between-subjects factors:smartphone location and power\
        -   The within-subjects factor: pre-target cue type\
        -   Post-hocs completed if there was a significant ANOVA ($\alpha = 0.05$) to determine which location(s), power status, and/or cue type(s) were significantly different.

    **(b) Accuracy Analysis**

    -   A **3(smartphone location: desk, pocket/bag, outside) x 2(smartphone power: ON, OFF)** between-subjects ANOVA was conducted **(DV = average RT)**.

        -   The between-subjects factors:smartphone location and power\
        -   The within-subjects factor: pre-target cue type\
        -   Post-hocs completed if there was a significant ANOVA ($\alpha = 0.05$) to determine which location(s) and/or power status were significantly different.

### Testing the Moderation Hypothesis:

**3. SAD**

**(a) Factor Analysis**

-   A principal components factor analysis with a Varimax rotation was completed for participant's SAD score.

    -   This was done to confirm the two main factors found by Ward et al. (2017): smartphone dependence and emotional attachment.

**(b) Generalized Linear Model**

-   A univariate generalized linear model was used to examine if smartphone reliance (i.e., SAD score) was a moderator of the relationship between the experimental manipulation and task performance.

*Note: These analysis were completed regardless of the results for the location and power hypotheses.*

## References

- Bezdjian, S., Baker, L. A., Lozano, D. I., & Raine, A. (2009). Assessing inattention and impulsivity in children during the Go/NoGo task. The British Journal of Developmental Psychology, 27(2), 365–383. <https://doi.org/10.1348/026151008X314919>
- Peirce, J. W. (2007). PsychoPy—Psychophysics software in Python. Journal of Neuroscience Methods, 162(1), 8–13. <https://doi.org/10.1016/j.jneumeth.2006.11.017>
- Ruiz Pardo, A. C., & Minda, J. P. (2021). Reexamining the “brain drain” effect: Does Ward et al. (2017) replicate? In PsyArXiv. Western University. <https://doi.org/10.31234/osf.io/5y34g>
- Unsworth, N., Heitz, R. P., Schrock, J. C., & Engle, R. W. (2005). An automated version of the operation span task. Behavior Research Methods, 37(3), 498–505. <https://www.ncbi.nlm.nih.gov/pubmed/16405146>
- Ward, A. F., Duke, K., Gneezy, A., & Bos, M. W. (2017). Brain drain: The mere presence of one’s own smartphone reduces available cognitive capacity. Journal of the Association for Consumer Research, 2, 140–154. <https://doi.org/10.1086/691462>

# Analysis Prep

## Load Required Libraries

The following packages were used to complete the analyses, visualizations, data wrangling, etc. in this document.

```{r load_libs, message=FALSE, warning=FALSE, include=FALSE}
# Note: Code has been commented out to reduce computation time

# library(welchADF)
library(schoRsch)
library(plyr)
library(dplyr)
library(lattice)
library(car)
library(knitr)
library(ggplot2) # to plot figures
library(ggpubr) # to plot figures (e.g., qqplots)
library(kableExtra) # to make nice tables
library(magrittr)
library(ez) # to run ANOVAs
library(Hmisc)
library(purrr)
library(Formula)
library(gridExtra)
library(psych)
library(survival)
library(tidyverse)
library(DescTools)
library(GPArotation)
library(apaTables)
library(stats) # to perform t-tests
library(effsize) # to calculate cohen's d effect size
library(coin) # to run Mann-Whitney U test
# install.packages("devtools")
library(devtools)
# devtools::install_github("doomlab/MeMoBootR") # install - used to run moderation
library(MeMoBootR) # for moderation
library(gridExtra) # to plot multiple plots together

library(corrplot)

# library(rstatix) # to run t-tests with easy output for visualizations
# library(tidyutils)
# library(devtools)
# install_github("vqv/ggbiplot")
#library(ggbiplot)
# library(rmdformats)
```

## Additional Functions

### Rounding p-values

<details>

<summary>

This rounding function was adapted from [Dr. Emily Nielsen's Rpubs](https://rpubs.com/egnielsen). The function ("**p_round(x)**") was created to assess and print p-values. If $p \ge .005$, the function will display "\$p = \$" and the value rounded to two decimal places. If \$ .0005 \le p \<.005$, the function will display “$p = \$" and the value rounded to three decimal places. If $p < .0005$, the function will display "$p < .001$". *(click to expand)*

</summary>

```{r p_round function}

p_round <- function(x){
  if(x > .005)
    {x1 = (paste("= ", gsub("0\\.","\\.", round(x, digits = 2)), sep = ''))
  }  
  else if(x == .005){x1 = (paste("=", gsub("0\\.","\\.", 0.01)))
  }
  else if(x > .0005 & x < .005)
    {x1 = (paste("= ", gsub("0\\.","\\.", round(x, digits = 3)), sep = ''))
  }  
  else if(x == .0005){x1 = (paste("=", gsub("0\\.","\\.", 0.001)))
  }
  else{x1 = (paste("<", gsub("0\\.","\\.", 0.001)))
  } 
  (x1)
}

```

</details>

### Flatten Correlation Output

```{r create_flattenCorr_function}
# ++++++++++++++++++++++++++++
# flattenCorrMatrix
# ++++++++++++++++++++++++++++
# cormat : matrix of the correlation coefficients
# pmat : matrix of the correlation p-values
flattenCorrMatrix <- function(cormat, pmat) {
  ut <- upper.tri(cormat)
  data.frame(
    row = rownames(cormat)[row(cormat)[ut]],
    column = rownames(cormat)[col(cormat)[ut]],
    cor  =(cormat)[ut],
    p = pmat[ut]
    )
}
```

## Raw Data

There are **five** raw data files that were imported for the present study (see the *Import Raw Data* section).

**Task Data**

1.  OSpan Raw Data

    -   spreadsheet, collected manually from each participant

2.  Go/No-Go Raw Data

    -   psychopy excel files
    -   *Note:* raw data was extracted from folder into one file (see the *Import Raw Data* section)

**Survey Measures**

3.  Demographic & SUQ Raw Data

    -   csv files exported from Qualtrics

4.  SAD Raw Data

    -   csv files exported from Qualtrics

5.  Testing Tracking Sheet

    -   csv files, collected during testing

### Import Raw Data

#### 1. OSpan Raw Data

<details>

<summary>

The **OSpan Raw Data** was collected using the [Inquisit 5 software](https://www.millisecond.com/products/inquisit6/weboverview.aspx), which used the same automated OSpan task as in Ward et al. (2017). The raw data from the OSpan was imported as a data frame, "**ospan_raw**", and included the following: *(click to see description)*

</summary>

-   **Participant number [*ospan_participant*]**\

-   **OSpan ID [*ospan_ID*]**

    -   A 9-digit ID given by the Iquisit 5 software

-   **OSpan Absolute Score [*ospan_absolute_score*]**

    -   Performance measured by how many trials a participant correctly recalled all the letters in a given block (75 blocks in total). For example, a participant who recalled three letters (in a block with three letters), five letters (in a block with five letters), and two letters (in a block with six letters) would have an OSpan absolute score of eight for those blocks (i.e., 3 + 5 + 0 = 8).\

-   **Total Letters Recalled [*ospan_total_corr*]**

    -   Performance measured by how many letters were recalled in total, regardless of whether all letters were correct within a given block.\

-   **Total Math Errors [*ospan_math_err_tot*]**

    -   the sum of all math errors (i.e., speed and accuracy)\

-   **Speed Errors [*ospan_math_err_speed*]**

    -   the sum of the speed related math errors

        -   where a participant exceeded their time limit to respond to the math component of the task\

-   **Accuracy Errors [*ospan_math_err_accuracy*]**

    -   the sum of the accuracy related math errors

        -   where a participant answered incorrectly on the math component of the task\

-   **Math Percentage [*ospan_math_percent_corr*]**

    -   the percent of all math trials that were answered correctly

        -   Participants were asked to keep their math percentage at 85% or above to ensure participants did not ignore the math component of the task to improve on the letter recall component

-   **Math Percentage Criteria [*ospan_math_criteria_85*]**

    -   whether a participant met the math criteria (i.e., 85% or above; yes or no)\

        <details>

        <summary>

        coding information *(click to expand)*

        </summary>

        -   1 = yes
        -   0 = no \* *including any participant that did not complete the OSpan task*

</details>

</details>

<details>

<summary>

This code imported the OSpan raw data. *(click to expand)*

</summary>

```{r import_ospan_raw, message=FALSE, warning=FALSE}
ospan_raw = utils::read.csv("raw_ospan.csv", header = TRUE)

# reorganize rows so that the data is ordered by the participant number (e.g. 1, 2, 3, 4...)
ospan_raw = dplyr::arrange(ospan_raw, ospan_participant)

ospan_raw
```

</details>

#### 2. Go/No-Go Raw Data {#gng_raw}

<details>

<summary>

The **Go/No-Go Raw Data** was collected using PsychoPy2 Experiment Builder (v1.85.4). Each participant's data was stored in a separate folder (default from psychopy) as an excel file. The following code re-organized the raw data into a single wide format of the data as a data frame, "**gng_data_extracted**", and two data frames that separate the raw data by cue type (i.e."vertical" or "horizontal"), "**gng_data_extracted_v**" and "**gng_data_extracted_h**". *(click to see descriptions)*

</summary>

For the **gng_data_extracted** data frame:

-   **Participant number [*gng_participant*]**\

-   **Age [*age*]**\

-   **First Language [*lang*]**

    -   self-reported whether their fist language was English (i.e., yes or no)\

        <details>

        <summary>

        coding information *(click to expand)*

        </summary>

        |       code |  1  |  2  |
        |-----------:|:---:|:---:|
        | *response* | Yes | No  |

-   **Total trials completed [*gng_trial_tot*]**\

-   **GnG Score [*gng_score*]**

    -   the sum of the correct answers\

-   **Total Responses [*gng_resp_tot*]**

    -   the sum of the responses for all trials

-   **Total No Responses [*gng_no_resp_tot*]**

    -   the sum of the trials without a response\

-   **Total Errors [*gng_err_tot*]**

    -   the sum of all errors for all trials\

-   **Commission Errors [*gng_err_com*]**

    -   the sum of all commission errors, defined as:

        -   where a response was made during a "no-go" trial (i.e., committing an error)\

-   **Omission Errors [*gng_err_omi*]**

    -   the sum of all omission errors, defined as:

        -   where a response was NOT made during a "go" trial (i.e., failing to respond)\

-   **Average Reaction Time (RT) [*gng_rt_av*]**

For the **gng_data_extracted_v** data frame:

-   **Participant number [*gng_participant*]**\

-   **Age [*age_long*]**\

-   **First Language [*lang_long*]**

    -   self-reported whether their fist language was English (i.e., yes or no)\

        <details>

        <summary>

        coding information *(click to expand)*

        </summary>

        |       code |  1  |  2  |
        |-----------:|:---:|:---:|
        | *response* | Yes | No  |

-   **Total trials completed [*trial_tot_v*]**\

-   **GnG Score [*score_v*]**

    -   the sum of the correct answers\

-   **Total Responses [*resp_tot_v*]**

    -   the sum of the responses for all trials

-   **Total No Responses [*no_resp_tot_v*]**

    -   the sum of the trials without a response\

-   **Total Errors [*err_tot_v*]**

    -   the sum of all errors for all trials\

-   **Commission Errors [*err_com_v*]**

    -   the sum of all commission errors, defined as:

        -   where a response was made during a "no-go" trial (i.e., committing an error)\

-   **Omission Errors [*err_omi_v*]**

    -   the sum of all omission errors, defined as:

        -   where a response was NOT made during a "go" trial (i.e., failing to respond)\

-   **Average Reaction Time (RT) [*rt_av_v*]**\

-   **Response Exclusion Criteria [*exclude_v*]**

    -   This depicts whether a participant should be excluded from analysis based on the response exclusion criteria

        -   To be included, participants should score higher than chance performance (i.e., responded to at least 50% of "Go" trials and withheld response to at least 50% of "No-Go" trials)

            <details>

            <summary>

            coding information *(click to expand)*

            </summary>

            |       code |               1                |               2               |
            |-----------:|:------------------------------:|:-----------------------------:|
            | *response* | Yes (participant **excluded**) | No (participant **included**) |

For the **gng_data_extracted_h** data frame:

-   **Participant number [*gng_participant*]**\

-   **Age [*age_long*]**\

-   **First Language [*lang_long*]**

    -   self-reported whether their fist language was English (i.e., yes or no)\

        <details>

        <summary>

        coding information *(click to expand)*

        </summary>

        |       code |  1  |  2  |
        |-----------:|:---:|:---:|
        | *response* | Yes | No  |

-   **Total trials completed [*trial_tot_h*]**\

-   **GnG Score [*score_h*]**

    -   the sum of the correct answers\

-   **Total Responses [*resp_tot_h*]**

    -   the sum of the responses for all trials

-   **Total No Responses [*no_resp_tot_h*]**

    -   the sum of the trials without a response\

-   **Total Errors [*err_tot_h*]**

    -   the sum of all errors for all trials\

-   **Commission Errors [*err_com_h*]**

    -   the sum of all commission errors, defined as:

        -   where a response was made during a "no-go" trial (i.e., committing an error)\

-   **Omission Errors [*err_omi_h*]**

    -   the sum of all omission errors, defined as:

        -   where a response was NOT made during a "go" trial (i.e., failing to respond)\

-   **Average Reaction Time (RT) [*rt_av_h*]**\

-   **Response Exclusion Criteria [*exclude_h*]**

    -   This depicts whether a participant should be excluded from analysis based on the response exclusion criteria

        -   To be included, participants should score higher than chance performance (i.e., responded to at least 50% of "Go" trials and withheld response to at least 50% of "No-Go" trials)

            <details>

            <summary>

            coding information *(click to expand)*

            </summary>

            |       code |               1                |               2               |
            |-----------:|:------------------------------:|:-----------------------------:|
            | *response* | Yes (participant **excluded**) | No (participant **included**) |

</details>

</details>

<details>

<summary>

This code imported the Go/No-Go raw data without separating the data by cue type. *(click to expand)*

</summary>

```{r gng_organize, message=FALSE, warning=FALSE}

#---WIDE DATA FRAME---"gng_data_extracted"  

#--This code completes the following tasks:
# (1) Create a function that extracts any required information from all psychopy folders at once
# (2) Create a list of all the directory paths to each individual file 
# (3) Apply the function to the files 
# (4) Create a data frame of the result of the function on the files

#--The FINAL OUTPUT = a wide format of the raw GnG data including:
# participant, age, lang, trial_tot, score, resp_tot, no_resp_tot, err_tot, err_com, err_omi, rt_av

#--NOTES:

# Step (3) will give warnings if there is any missing data that the function was not specified how to handle
# example: Any participant that did not respond to an item or during all trials will have an "NA" response in the data
# example: Any participant who reported their age or first language in an irregular manner (e.g. "ninteen" rather than "19")
# Either example will result in a warning, since the function would have to force that output to be "NA" as well

# Individual participant files do not stay in chronological order 
# (they are re-ordered so that all ps that start with the digit "1" go first, then the "2"...)

#--(1) Create a function that extracts any required information from all psychopy folders at once

# create a function that extracts all the required information from each excel file
# details for what is extracted in this function is found within it, 
# to use it to extract any other &/ additional information, edit within the function itself
extract_gng = function(data) {
  temp_data = readxl::read_excel(data)
  
  # Participant number (participant)
  participant = as.numeric(temp_data[which(temp_data == "participant"),2])
  
  # Age (age)
  age = as.integer(temp_data[which(temp_data == "Age"),2])
  
  # Gender (gender)
  # will be populated from survey measures
  
  # First Language (lang)
  # this finds if the language reported was English: if yes = 1; no = 0
  lang = ifelse(stringr::str_detect(as.character(temp_data[which(temp_data == "First Language"),2]), stringr::fixed("english", ignore_case=TRUE)), 1, 0)
  
  # Location Condition (location; either: (1) desk, (2) pocket/bag, or (3) outside)
  # will be populated from survey measures
  
  # Power Condition (power; either: (1) ON,or (2) OFF)
  # will be populated from survey measures
  
  # Total trials completed (trial_tot)
  # count the "order" column, which shows the order in which the trials were showed (random for each participant)
  # note: must ignore the NA values
  trial_tot = sum((is.na(temp_data$cue) == "FALSE" & is.na(temp_data$order) == "FALSE"), na.rm = TRUE) #alternative: sum(complete.cases(temp_data$order))
  
  # GnG Score (score)
  # sum of the correct answers from the "key_resp_slash.corr_raw" column
  # note: must ignore the NA values
  score = sum(temp_data$key_resp_slash.corr_raw, na.rm = TRUE)
  
  # Total Responses (resp_tot)
  # sum of the responses from the "key_resp_slash.keys_raw" column
  # note: must ignore the NA values
  resp_tot = sum((is.na(temp_data$cue) == "FALSE" & is.na(temp_data$key_resp_slash.keys_raw) == "FALSE"), na.rm = TRUE) #alternative: sum(complete.cases(temp_data$key_resp_slash.keys_raw))
  
  # Total No Responses (no_resp_tot)
  # subtract total responses made from completed trials 
  # note: must ignore the NA values
  no_resp_tot = (trial_tot - resp_tot)
  
  # Total Errors (err_tot)
  # sum of the incorrect answers from the "key_resp_slash.corr_raw"
  err_tot = sum((temp_data$key_resp_slash.corr_raw == "0") == "TRUE", na.rm = TRUE) # alternative: sum(!temp_data$key_resp_slash.corr_raw, na.rm = TRUE)
  
  # Commission Errors (err_com)
  # sum any cases where a reponse was made & the answer was incorrect (i.e. committing an erorr)
  err_com = sum((temp_data$key_resp_slash.corr_raw == "0" & !is.na(temp_data$key_resp_slash.keys_raw)) == "TRUE", na.rm = TRUE)
  
  # Omission Errors (err_omi)
  # sum any cases where a reponse was NOT made & the answer was incorrect (i.e. failing to respond)
  err_omi = sum((temp_data$key_resp_slash.corr_raw == "0" & is.na(temp_data$key_resp_slash.keys_raw)) == "TRUE", na.rm = TRUE)
  
  # Average Reaction Time (RT) (rt_av)
  # calculate the average / mean RT for all responses
  # Note: the "no-go" trials required no response for a correct trial (for 1,000ms)
  rt_av = mean(temp_data$key_resp_slash.rt_raw, na.rm = TRUE)
  
  temp_dataframe = as.data.frame(c(participant, age, lang, trial_tot, 
                                   score, resp_tot, no_resp_tot, 
                                   err_tot, err_com, err_omi, rt_av))
}

#--(2) Create a list of all the directory paths to each individual file 

# Find the location of each excel (.xlsx) files
# the folders containing the files should be in 1 folder (e.g. "" in this case)
# this returns a list of each file's location
# Note: the pattern is set to ignore any hidden / temporary files by adding the start of each file name (e.g. "^NoGo")
files_psychopy = dir("/Users/anaceciliaruizpardo/RStudio/Smartphone_study/Study 1- Smartphone Replication/Dissertation (04:20:20)/Raw Data Organizing/gng_raw_folders", 
                     recursive = TRUE, full.names=TRUE, pattern="^NoGo(.*)xlsx$")

#--(3) Apply the function to the files 

# Apply the function to all excel files
result = sapply(files_psychopy, extract_gng)

#--(4) Create a data frame of the result of the function on the files

# take the lists of the extracted data and turn it into a data frame
gng_data_extracted = data.frame(Reduce(rbind, result)) # alternative method: x = data.frame(t(sapply(result,c)))
# rename the columns
colnames(gng_data_extracted) = c("gng_participant", "age", "lang", "gng_trial_tot", "gng_score", 
                                 "gng_resp_tot", "gng_no_resp_tot", "gng_err_tot", "gng_err_com", 
                                 "gng_err_omi", "gng_rt_av") 
# remove row names to avoid confusion
rownames(gng_data_extracted) = NULL

# reorganize rows so that the data is ordered by the participant number (e.g. 1, 2, 3, 4...)
gng_data_extracted = dplyr::arrange(gng_data_extracted, gng_participant)

gng_data_extracted
```

</details>

<details>

<summary>

This code imported the Go/No-Go raw data and separated the data by cue type (i.e. "vertical" or "horizontal"). *(click to expand)*

</summary>

```{r gng_organize_target, message=FALSE, warning=FALSE}

#--- for the LONG DATA FRAME - CUE TYPE (vertical vs. horizontal) --- "gng_data_extracted_v" & "gng_data_extracted_h" 
  
#-- This code differs from the above code in that it organizes the gng data based the CUE type (i.e. either a "vertical" (v) or "horizontal" (h)).
  # This means that the data is in a long format to test the within-subjects factor of cue type.

#--This code completes the following tasks:
# (1) Create a function that extracts any required information from all psychopy folders at once
# (2) Create a list of all the directory paths to each individual file 
# (3) Apply the function to the files 
# (4) Create a data frame of the result of the function on the files
# (5) Re-structure the data into 1 data frame in a long format for a repeated measures analysis

#--The FINAL OUTPUT = a wide format of the raw GnG data including:
# participant, age, lang, trial_tot, score, resp_tot, no_resp_tot, err_tot, err_com, err_omi, rt_av

#--NOTES:

# Step (3) will give warnings if there is any missing data that the function was not specified how to handle
# example: Any participant that did not respond to an item or during all trials will have an "NA" response in the data
# example: Any participant who reported their age or first language in an irregular manner (e.g. "ninteen" rather than "19")
# Either example will result in a warning, since the function would have to force that output to be "NA" as well

# Individual participant files do not stay in chronological order 
# (they are re-ordered so that all ps that start with the digit "1" go first, then the "2"...)

#--(1) Create a function that extracts any required information from all psychopy folders at once...
# for the "vertical" (v) (extract_gng_v) and "horizontal" (h) (extract_gng_h) trials

# create a function that extracts all the required information from each excel file
# details for what is extracted in this function is found within it, 
# to use it to extract any other &/ additional information, edit within the function itself
# NOTE: this function extracts 2 versions of each value: one for the "vertical" (v) trials, and one for the "horizontal" (h) trials

extract_gng_v = function(data) {
  temp_data = readxl::read_excel(data)
  
  # Participant number (participant_long)
  participant_long = as.numeric(temp_data[which(temp_data == "participant"),2])
  # only one is extracted since this will not change b/w CUE type
  
  # Age (age_long)
  age_long = as.integer(temp_data[which(temp_data == "Age"),2])
  # only one is extracted since this will not change b/w CUE type
  
  # Gender (gender)
  # will be populated from survey measures
  # only one is extracted since this will not change b/w CUE type
  
  # First Language (lang_long)
  # this finds if the language reported was English: if yes = 1; no = 0
  lang_long = ifelse(stringr::str_detect(as.character(temp_data[which(temp_data == "First Language"),2]), stringr::fixed("english", ignore_case=TRUE)), 1, 0)
  # only one is extracted since this will not change b/w CUE type
  
  # Location Condition (location; either: (1) desk, (2) pocket/bag, or (3) outside)
  # will be populated from survey measures
  # only one is extracted since this will not change b/w CUE type
  
  # Power Condition (power; either: (1) ON,or (2) OFF)
  # will be populated from survey measures
  # only one is extracted since this will not change b/w CUE type
  
  # Total trials completed for the "vertical" (v) (trial_tot_v)
  # count the "order" column, which shows the order in which the trials were showed (random for each participant)
  # note: must ignore the NA values
  trial_tot_v = sum((temp_data$cue == "v_b.png" & is.na(temp_data$order) == "FALSE"), na.rm = TRUE)
  
  # GnG Score for the "vertical" (v) (score_v)  
  # sum of the correct answers from the "key_resp_slash.corr_raw" column
  # note: must ignore the NA values
  score_v = sum(temp_data$cue == "v_b.png" & temp_data$key_resp_slash.corr_raw, na.rm = TRUE)  
  
  # Total Responses for the "vertical" (v) (resp_tot_v)
  # sum of the responses from the "key_resp_slash.keys_raw" column
  # note: must ignore the NA values
  resp_tot_v = sum((temp_data$cue == "v_b.png" & temp_data$key_resp_slash.keys_raw == "'slash'"), na.rm = TRUE)  
  
  # Total No Responses for the "vertical" (v) (no_resp_tot_v)  
  # subtract total responses made from completed trials 
  # note: must ignore the NA values
  no_resp_tot_v = (trial_tot_v - resp_tot_v)  
  
  # Total Errors for the "vertical" (v) (err_tot_v) 
  # sum of the incorrect answers from the "key_resp_slash.corr_raw"
  err_tot_v = sum((temp_data$cue == "v_b.png" & temp_data$key_resp_slash.corr_raw == "0"), na.rm = TRUE)  
  
  # Commission Errors for the "vertical" (v) (err_com_v) 
  # sum any cases where a reponse was made & the answer was incorrect (i.e. committing an error)
  err_com_v = sum(temp_data$cue == "v_b.png" & temp_data$key_resp_slash.corr_raw == "0" & !is.na(temp_data$key_resp_slash.keys_raw), na.rm = TRUE)  
  
  # Omission Errors for the "vertical" (v) (err_omi_v)  
  # sum any cases where a reponse was NOT made & the answer was incorrect (i.e. failing to respond)
  err_omi_v = sum(temp_data$cue == "v_b.png" & temp_data$key_resp_slash.corr_raw == "0" & is.na(temp_data$key_resp_slash.keys_raw), na.rm = TRUE)  

  # Average Reaction Time (RT) for the "vertical" (v) (rt_av_v)  
  # calculate the average / mean RT for all responses
  # Note: the "no-go" trials required no response for a correct trial (for 1,000ms)
  rt_av_v = mean(temp_data$key_resp_slash.rt_raw[which(temp_data$cue == "v_b.png")], na.rm = TRUE)  
  
  # Exclusion Criteria for the "vertical" (v) (exclude_v)
  # calculate whether participants scored higher than chance performance 
  # (i.e. responded to at least 50% of “Go” trials and withheld response to at least 50% of “No-Go” trials)
  # code: exclude = 1; include (do not exclude) = 2
  # NOTE: this calculation is meant to exclude the participant as a whole, therefore, 
  # it does not calculate it for the cue type specifically
    # first, get the total trials and response totals for the "go" and "no-go" trials
      # "go"
      trial_tot_go = sum((temp_data$corrAns == "slash" & is.na(temp_data$order) == "FALSE"), na.rm = TRUE)
      resp_tot_go = sum((temp_data$corrAns == "slash" & temp_data$key_resp_slash.keys_raw == "'slash'"), na.rm = TRUE)
      # "no-go"
      trial_tot_nogo = sum((temp_data$corrAns == "none" & is.na(temp_data$order) == "FALSE"), na.rm = TRUE)
      resp_tot_nogo = sum((temp_data$corrAns == "none" & temp_data$key_resp_slash.keys_raw == "'slash'"), na.rm = TRUE)
    # next, find the minimum response needed based on the 50% criteria
    min_resp = trial_tot_go * .50
    # find if the participant meets exclusion criteria for the "go" and "no-go" trials
    meets_min_go = resp_tot_go > (min_resp)
    meets_min_nogo = resp_tot_nogo < (min_resp)
    # find if the participant meets exclusion criteria (exclude = 1; include = 2)
    exclude_v = ifelse((meets_min_go == "TRUE" & meets_min_nogo == "TRUE") == "FALSE", 1, 2)
  
  temp_dataframe_v = as.data.frame(c(participant_long, age_long, lang_long, trial_tot_v, score_v, resp_tot_v, 
                                        no_resp_tot_v, err_tot_v, err_com_v, err_omi_v, rt_av_v, exclude_v))
}  

extract_gng_h = function(data) {
  temp_data = readxl::read_excel(data)
  
  # Participant number (participant_long)
  participant_long = as.numeric(temp_data[which(temp_data == "participant"),2])
  # only one is extracted since this will not change b/w CUE type
  
  # Age (age_long)
  age_long = as.integer(temp_data[which(temp_data == "Age"),2])
  # only one is extracted since this will not change b/w CUE type
  
  # Gender (gender)
  # will be populated from survey measures
  # only one is extracted since this will not change b/w CUE type
  
  # First Language (lang_long)
  # this finds if the language reported was English: if yes = 1; no = 0
  lang_long = ifelse(stringr::str_detect(as.character(temp_data[which(temp_data == "First Language"),2]), stringr::fixed("english", ignore_case=TRUE)), 1, 0)
  # only one is extracted since this will not change b/w CUE type
  
  # Location Condition (location; either: (1) desk, (2) pocket/bag, or (3) outside)
  # will be populated from survey measures
  # only one is extracted since this will not change b/w CUE type
  
  # Power Condition (power; either: (1) ON,or (2) OFF)
  # will be populated from survey measures
  # only one is extracted since this will not change b/w CUE type
  
  # Total trials completed for the "horizontal" (h) (trial_tot_h)
  # count the "order" column, which shows the order in which the trials were showed (random for each participant)
  # note: must ignore the NA values
  trial_tot_h = sum((temp_data$cue == "h_b.png" & is.na(temp_data$order) == "FALSE"), na.rm = TRUE)
  
  # GnG Score for the "horizontal" (h) (score_h)  
  # sum of the correct answers from the "key_resp_slash.corr_raw" column
  # note: must ignore the NA values
  score_h = sum(temp_data$cue == "h_b.png" & temp_data$key_resp_slash.corr_raw, na.rm = TRUE)  
  
  # Total Responses for the "horizontal" (h) (resp_tot_h)
  # sum of the responses from the "key_resp_slash.keys_raw" column
  # note: must ignore the NA values
  resp_tot_h = sum((temp_data$cue == "h_b.png" & temp_data$key_resp_slash.keys_raw == "'slash'"), na.rm = TRUE)  
  
  # Total No Responses for the "horizontal" (h) (no_resp_tot_h)  
  # subtract total responses made from completed trials 
  # note: must ignore the NA values
  no_resp_tot_h = (trial_tot_h - resp_tot_h)  
  
  # Total Errors for the "horizontal" (h) (err_tot_h) 
  # sum of the incorrect answers from the "key_resp_slash.corr_raw"
  err_tot_h = sum((temp_data$cue == "h_b.png" & temp_data$key_resp_slash.corr_raw == "0"), na.rm = TRUE)  
  
  # Commission Errors for the "horizontal" (h) (err_com_h) 
  # sum any cases where a reponse was made & the answer was incorrect (i.e. committing an error)
  err_com_h = sum(temp_data$cue == "h_b.png" & temp_data$key_resp_slash.corr_raw == "0" & !is.na(temp_data$key_resp_slash.keys_raw), na.rm = TRUE)  
  
  # Omission Errors for the "horizontal" (h) (err_omi_h)  
  # sum any cases where a reponse was NOT made & the answer was incorrect (i.e. failing to respond)
  err_omi_h = sum(temp_data$cue == "h_b.png" & temp_data$key_resp_slash.corr_raw == "0" & is.na(temp_data$key_resp_slash.keys_raw), na.rm = TRUE)  
  
  # Average Reaction Time (RT) for the "horizontal" (h) (rt_av_h)  
  # calculate the average / mean RT for all responses
  # Note: the "no-go" trials required no response for a correct trial (for 1,000ms)
  rt_av_h = mean(temp_data$key_resp_slash.rt_raw[which(temp_data$cue == "h_b.png")], na.rm = TRUE)  
  
  # Exclusion Criteria for the "horizontal" (h) (exclude_h)
  # calculate whether participants scored higher than chance performance 
  # (i.e. responded to at least 50% of “Go” trials and withheld response to at least 50% of “No-Go” trials)
  # code: exclude = 1; include (do not exclude) = 2
  # NOTE: this calculation is meant to exclude the participant as a whole, therefore, 
  # it does not calculate it for the cue type specifically
  # first, get the total trials and response totals for the "go" and "no-go" trials
  # "go"
  trial_tot_go = sum((temp_data$corrAns == "slash" & is.na(temp_data$order) == "FALSE"), na.rm = TRUE)
  resp_tot_go = sum((temp_data$corrAns == "slash" & temp_data$key_resp_slash.keys_raw == "'slash'"), na.rm = TRUE)
  # "no-go"
  trial_tot_nogo = sum((temp_data$corrAns == "none" & is.na(temp_data$order) == "FALSE"), na.rm = TRUE)
  resp_tot_nogo = sum((temp_data$corrAns == "none" & temp_data$key_resp_slash.keys_raw == "'slash'"), na.rm = TRUE)
  # next, find the minimum response needed based on the 50% criteria
  min_resp = trial_tot_go * .50
  # find if the participant meets exclusion criteria for the "go" and "no-go" trials
  meets_min_go = resp_tot_go > (min_resp)
  meets_min_nogo = resp_tot_nogo < (min_resp)
  # find if the participant meets exclusion criteria (exclude = 1; include = 2)
  exclude_h = ifelse((meets_min_go == "TRUE" & meets_min_nogo == "TRUE") == "FALSE", 1, 2)
  
  temp_dataframe_h = as.data.frame(c(participant_long, age_long, lang_long, trial_tot_h, score_h, resp_tot_h, 
                                     no_resp_tot_h, err_tot_h, err_com_h, err_omi_h, rt_av_h, exclude_h))
}  

#--(2) Create a list of all the directory paths to each individual file 

# Find the location of each excel (.xlsx) files
# the folders containing the files should be in 1 folder (e.g. "" in this case)
# this returns a list of each file's location
# Note: the pattern is set to ignore any hidden / temporary files by adding the start of each file name (e.g. "^NoGo")
# THIS IS THE SAME AS IN THE ABOVE CODE, SO IT IS COMMENTED OUT TO AVOID EXTRA COMPUTING POWER

# files_psychopy = dir("/Users/anaceciliaruizpardo/RStudio/Smartphone_study/Study 1- Smartphone Replication/Dissertation (04:20:20)/Raw Data Organizing/gng_raw_folders", 
#                     recursive = TRUE, full.names=TRUE, pattern="^NoGo(.*)xlsx$")

#--(3) Apply the function to the files 

# Apply the function to all excel files for the "vertical" (v) and "horizontal" (h) trials  
result_v = sapply(files_psychopy, extract_gng_v)
result_h = sapply(files_psychopy, extract_gng_h)

#--(4) Create a data frame of the result of the function on the files...
# for the "vertical" (gng_data_extracted_v) and "horizontal" (gng_data_extracted_h) trials  

# take the lists of the extracted data and turn it into a data frame for the "vertical" (h) trials
gng_data_extracted_v = data.frame(Reduce(rbind, result_v)) 
# rename the columns
colnames(gng_data_extracted_v) = c("participant_long", "age_long", "lang_long", "trial_tot_v", "score_v", 
                                         "resp_tot_v", "no_resp_tot_v", "err_tot_v", "err_com_v", 
                                         "err_omi_v", "rt_av_v", "exclude_v")

# take the lists of the extracted data and turn it into a data frame for the "horizontal" (h) trials
gng_data_extracted_h = data.frame(Reduce(rbind, result_h)) 
# rename the columns
colnames(gng_data_extracted_h) = c("participant_long", "age_long", "lang_long", "trial_tot_h", "score_h", 
                                           "resp_tot_h", "no_resp_tot_h", "err_tot_h", "err_com_h", 
                                           "err_omi_h", "rt_av_h", "exclude_h")
# remove row names to avoid confusion
rownames(gng_data_extracted_v) = NULL
rownames(gng_data_extracted_h) = NULL

# reorganize rows so that the data is ordered by the participant number (e.g. 1, 2, 3, 4...)
gng_data_extracted_v = dplyr::arrange(gng_data_extracted_v, participant_long)
gng_data_extracted_h = dplyr::arrange(gng_data_extracted_h, participant_long)
```

</details>

#### 3. & 4. Demographic, SUQ, and SAD Raw Data

The **Demographic & SUQ Raw Data and SAD Raw Data** was collected using [Qualtrics](https://www.qualtrics.com/). The raw survey data was imported as a data frame, "**survey_raw**", and included the following: *(click to see description)*

<details>

<summary>

[General, Manipulation, and Demographic Information]{style="color:DarkBlue; font-size:18px;"}

</summary>

<details>

<summary>

**General Information:** *(click to expand)*

</summary>

-   **Start Date [*date_start*]**

    -   The date and time when a participant began their survey on Qualtrics (Y/M/D H:M)\

-   **End Date [*date_end*]**

    -   The date and time when a participant finished their survey on Qualtrics (Y/M/D H:M)\

-   **Duration (in seconds) [*duration(s)*]**

    -   The duration of the survey for the participant in seconds

-   **Recorded Date [*date_rec*]**

    -   The date and time when a participant's data was recorded on Qualtrics (Y/M/D H:M)\

-   **Participant number [*sur_participant*]**\

</details>

<details>

<summary>

**Manipulation Condition:** *(click to expand)*

</summary>

-   **Smartphone Location [*sur_location*]**

    -   The smartphone location assigned to the participant (i.e., desk, pocket/bag, or outside)\

        <details>

        <summary>

        coding information *(click to expand)*

        </summary>

        |       code |  1   |     2      |    3    |
        |-----------:|:----:|:----------:|:-------:|
        | *response* | Desk | Pocket/Bag | Outside |

-   **Smartphone Power [*sur_power*]**

    -   The smartphone power assigned to the participant (i.e., ON or OFF)\

        <details>

        <summary>

        coding information *(click to expand)*

        </summary>

        |       code |  1  |  2  |
        |-----------:|:---:|:---:|
        | *response* | On  | Off |

-   **Smartphone Condition [*sur_condition*]**

    -   The smartphone condition (i.e., combination of location & power) assigned to the participant\

        <details>

        <summary>

        coding information *(click to expand)*

        </summary>

        +------------+-----------+-----------------+--------------+------------+------------------+---------------+
        | code       | 1         | 2               | 3            | 4          | 5                | 6             |
        +===========:+:=========:+:===============:+:============:+:==========:+:================:+:=============:+
        | *response* | ON & Desk | ON & Pocket/Bag | ON & Outside | OFF & Desk | OFF & Pocket/Bag | OFF & Outside |
        +------------+-----------+-----------------+--------------+------------+------------------+---------------+

</details>

<details>

<summary>

**Demographics:** *(click to expand)*

</summary>

-   **Gender [*gender*]**

    -   Participant's self-reported gender (i.e., male, female, other, or prefer not to say)\

        <details>

        <summary>

        coding information *(click to expand)*

        </summary>

        |       code |  1   |   2    |   3   |         4         |
        |-----------:|:----:|:------:|:-----:|:-----------------:|
        | *response* | Male | Female | Other | Prefer not to say |

-   **Year [*year*]**

    -   Participant's self-reported year of study in their respective program (e.g., 1, 2, 3, 4)\

-   **Program [*program*]**

    -   Participant's self-reported program of study (e.g., biology, BMOS, psychology)\

        <details>

        <summary>

        coding information *(click to expand)*

        </summary>

        +------------------------+-------------------------+----------------------------+---------------------+
        | 1 = actuarial          | 11 = economics          | 21 = management            | 31 = psychology     |
        +------------------------+-------------------------+----------------------------+---------------------+
        | 2 = anthropology       | science                 | 22 = mathematics           | 32 = SASAH          |
        +------------------------+-------------------------+----------------------------+---------------------+
        | 3 = arts               | 13 = english            | 23 = medical science       | 33 = science        |
        +------------------------+-------------------------+----------------------------+---------------------+
        | 4 = arts & humanities  | 14 = film               | 24 = MIT                   | 34 = social science |
        +------------------------+-------------------------+----------------------------+---------------------+
        | 5 = biology            | 15 = FIMS               | 25 = MOS                   | 35 = sociology      |
        +------------------------+-------------------------+----------------------------+---------------------+
        | 6 = BMOS               | 16 = geography          | 26 = music                 | 36 = software       |
        +------------------------+-------------------------+----------------------------+---------------------+
        | 7 = chemistry          | 17 = health science     | 27 = nutrition & dietetics | 37 = visual arts    |
        +------------------------+-------------------------+----------------------------+---------------------+
        | 8 = computer science   | 18 = integrated science | 28 = philosophy            |                     |
        +------------------------+-------------------------+----------------------------+---------------------+
        | 9 = consumer behaviour | 19 = kinesiology        | 29 = physics               |                     |
        +------------------------+-------------------------+----------------------------+---------------------+
        | 10 = criminology       | 20 = linguistics        | 30 = political science     |                     |
        +------------------------+-------------------------+----------------------------+---------------------+

</details>

</details>

</details>

##### The Smartphone Use Questionnaire (SUQ)

The purpose of the SUQ was to measure participants' typical smartphone use. The SUQ was created for the present study and consisted of modified items from Ward and colleagues' (2017) survey measures. (i.e. found in the "BRAIN DRAIN" WEB APPENDIX"). Some items were forced-choice and some were on a 7-point likert scale ranging from 1 ("Never") to 7 ("Always"). There were 10 items in total and there were **five types of items**, which measured:

<details>

<summary>

**1. Smartphone Use Frequency: (*SUQ_use_1 - SUQ_use_3*)**

</summary>

Each of the following items used the following coding:

|       CODE |  1  |  2   |   3   |   4   |
|-----------:|:---:|:----:|:-----:|:-----:|
| *response* | 0-5 | 6-10 | 11-15 | \> 15 |

-   **Text Messages (per day) [*SUQ_use_1*]**

    -   Answer to the following question: "*On average, how many text messages do you send per day?*"\

-   **Social Media Messages (per day) [*SUQ_use_2*]**

    -   Answer to the following question: "*On average, how many social media based messages do you send per day from your smartphone? (iMessage, Facebook Messenger, WhatsApp, WeChat, direct messages within social media platforms, etc.)*"\

-   **Social Media Posts (per day) [*SUQ_use_3*]**

    -   Answer to the following question: "*On average, how many social media posts (e.g. written post, picture, article, etc.) do you send per day from your smartphone? (Facebook, Twitter, Instagram, etc.)*"\

</details>

<details>

<summary>

**2. Smartphone Use With External Stimulation or During Other Activities: (*SUQ_use_4 - SUQ_use_7*)**

</summary>

Each of the following items used the following coding:

|                   code |   1   |  2  |  3  |            4            | 5   | 6   | 7      |
|-----------------------:|:-----:|:---:|:---:|:-----------------------:|-----|-----|--------|
|             *response* | Never | --  | --  | About half <br>the time | --  | --  | Always |
| -- *no text specified* |       |     |     |                         |     |     |        |

-   **Tendency to turn to one's smartphone in the absence of external stimulation [*SUQ_use_4* & *SUQ_use_5*]**

    -   **SUQ_use_4**: Answer to the following question: "*I look at my smartphone before I roll out of bed in the morning.*"\
    -   **SUQ_use_5**: Answer to the following question: "*If I am waiting to meet a friend, I pass the time by using my smartphone.*"

-   **Tendency to turn to one's smartphone in the midst of other activities [*SUQ_use_6SUQ_use_7*]**

    -   **SUQ_use_6**: Answer to the following question: "*I use my smartphone while driving.*"\
    -   **SUQ_use_7**: Answer to the following question: "*If my smartphone rings or vibrates in the middle of personal business, I look at it.*"\

</details>

<details>

<summary>

**3. Smartphone Subjective Value: (*SUQ_exp_1*)**

</summary>

-   **Subjective Value [*SUQ_use_1*]**

    -   Answer to the following question: "*How much money would it take for you to give up your phone for a full day?*"\

        <details>

        <summary>

        coding information *(click to expand)*

        </summary>

        |       code |    1     |     2     |     3     |    4    |
        |-----------:|:--------:|:---------:|:---------:|:-------:|
        | *response* | \$0-\$20 | \$21-\$40 | \$41-\$60 | \> \$60 |

</details>

</details>

<details>

<summary>

**4. Smartphone Notification Type: (*SUQ_exp_2 & SUQ_exp_3*)**

</summary>

-   **Notification Type [*SUQ_use_2*]**

    -   Answer to the following question: "*Do you receive notifications (a sound or vibration) on your phone? Please indicate all that apply.*"\

        <details>

        <summary>

        coding information *(click to expand)*

        </summary>

        |       code |   1   |    2     |    3    |     4     |    5     |    6     |        7        |
        |-----------:|:-----:|:--------:|:-------:|:---------:|:--------:|:--------:|:---------------:|
        | *response* | Email | Facebook | Twitter | Instagram | LinkedIn | Snapchat | Other (specify) |

-   **Specify Notification Type [*SUQ_exp_3*]**

    -   Answer to the following question: "*Please Specify: Where you receive notifications (a sound or vibration) on your phone?*"

</details>

</details>

<details>

<summary>

**5. Phantom Vibrations: (*SUQ_exp_4*)**

</summary>

-   **Phantom Vibrations [*SUQ_use_4*]**

    -   Answer to the following question: "*Have you ever thought you heard your phone ring or thought you felt it vibrate, only to find out you were wrong?*"\

        <details>

        <summary>

        coding information *(click to expand)*

        </summary>

        |       code |  1  |  2  |
        |-----------:|:---:|:---:|
        | *response* | Yes | No  |

</details>

</details>

##### The Smartphone Attachment and Dependency Inventory (SAD; Ward et al., 2017)

<details>

<summary>

The SAD asked participants to respond to 13 items **(*SAD_1-SAD_13*)** using a 7-point likert scale. *(click to expand)*

</summary>

+------------+----------------------+----------+----------------------+---------+-------------------+-------+-------------------+
| code       | 1                    | 2        | 3                    | 4       | 5                 | 6     | 7                 |
+===========:+:====================:+:========:+:====================:+:=======:+:=================:+:=====:+:=================:+
| *response* | Strongly<br>Disagree | Disagree | Somewhat<br>Disagree | Neutral | Somewhat<br>Agree | Agree | Strongly<br>Agree |
+------------+----------------------+----------+----------------------+---------+-------------------+-------+-------------------+

-   **SAD_1:** I would have trouble getting through a normal day without my smartphone.\
-   **SAD_2:** It would be painful for me to give up my smartphone for a day.\
-   **SAD_3:** I feel like I could not live without my smartphone.\
-   **SAD_4:** If I forgot to bring my smartphone with me, I would feel anxious.\
-   **SAD_5:** It drives me crazy when my smartphone runs out of battery.\
-   **SAD_6:** I am upset and annoyed when I find I do not have reception on my smartphone.\
-   **SAD_7:** I feel impatient when the Internet connection speed on my smartphone is slow.\
-   **SAD_8:** I feel lonely when my smartphone does not ring or vibrate for several hours.\
-   **SAD_9:** Using my smartphone relieves me of my stress.\
-   **SAD_10:** I feel excited when I have a new message or notification.\
-   **SAD_11:** Using my smartphone makes me feel happy.\
-   **SAD_12:** I find it tough to focus whenever my smartphone is nearby.\
-   **SAD_13:** I become less attentive to my surroundings when I'm using my smartphone.

</details>

</details>

<details>

<summary>

This code imported the Demographic, SUQ, and SAD raw data. *(click to expand)*

</summary>

```{r import_survey_raw, message=FALSE, warning=FALSE}
survey_raw = utils::read.csv("raw_survey.csv", header = TRUE)

# reorganize rows so that the data is ordered by the participant number (e.g. 1, 2, 3, 4...)
survey_raw = dplyr::arrange(survey_raw, sur_participant)

survey_raw
```

</details>

</details>

#### 5. Testing Tracking Sheet Raw Data

<details>

<summary>

The **Testing Tracking Sheet Raw Data** was collected during testing *by the experimenter* to store relevant information on participant's (1) General Information (e.g. testing start date), (2) Manipulation Condition (e.g. assigned location and power manipulation), and (3) Data Cleaning (e.g. exclusion critera from testing error or other non-task-specific criteria). The raw data from the tracking sheet was imported as a dataframe, "**track_raw**", and included the following: *(click to see description)*

</summary>

<details>

<summary>

**1. General Information:**

</summary>

-   **Testing Date [*track_date*]**

    -   The date a participant completed the study (Y/M/D)\

-   **Testing Time [*date_end*]**

    -   The time a participant was scheduled to complete the study (24hr-time). *Note: Many participants were tested at the same time, but in separate cubicle-type desks to avoid any group-testing dynamic*\

-   **Participant number [*track_participant*]**\

</details>

<details>

<summary>

**2. Manupulation Condition:**

</summary>

-   **Smartphone Location [*track_location*]**

    -   The smartphone location assigned to the participant (i.e., desk, pocket/bag, or outside)\

        <details>

        <summary>

        coding information *(click to expand)*

        </summary>

        |       code |  1   |     2      |    3    |
        |-----------:|:----:|:----------:|:-------:|
        | *response* | Desk | Pocket/Bag | Outside |

-   **Smartphone Power [*track_power*]**

    -   The smartphone power assigned to the participant (i.e., ON or OFF)\

        <details>

        <summary>

        coding information *(click to expand)*

        </summary>

        |       code |  1  |  2  |
        |-----------:|:---:|:---:|
        | *response* | On  | Off |

-   **Smartphone Condition [*track_condition*]**

    -   The smartphone condition (i.e., combination of location & power) assigned to the participant\

        <details>

        <summary>

        coding information *(click to expand)*

        </summary>

        +------------+-----------+-----------------+--------------+------------+------------------+---------------+
        | code       | 1         | 2               | 3            | 4          | 5                | 6             |
        +===========:+:=========:+:===============:+:============:+:==========:+:================:+:=============:+
        | *response* | ON & Desk | ON & Pocket/Bag | ON & Outside | OFF & Desk | OFF & Pocket/Bag | OFF & Outside |
        +------------+-----------+-----------------+--------------+------------+------------------+---------------+

-   **Counter Balanced Order [*balance*]**

    -   This depicts which of the main tasks (i.e., either the OSpan task or the Cue-Dependent Go/No-Go Task) was completed first\

        <details>

        <summary>

        coding information *(click to expand)*

        </summary>

        +------------+----------------------------+---------------------------------------------+
        | code       | 1                          | 2                                           |
        +===========:+:==========================:+:===========================================:+
        | *response* | OSpan Task completed first | Cue-Dependent Go/No-Go Task completed first |
        +------------+----------------------------+---------------------------------------------+

</details>

<details>

<summary>

**3. Data Cleaning:**

</summary>

-   **Exclude from Analysis [*exclude*]**

    -   This depicts whether a participant should be excluded from analysis due to testing error or other non-task-specific criteria (i.e., for reasons other than the task exclusion criteria in the data analysis plan).

        <details>

        <summary>

        coding information *(click to expand)*

        </summary>

        |       code |               1                |               2               |
        |-----------:|:------------------------------:|:-----------------------------:|
        | *response* | Yes (participant **excluded**) | No (participant **included**) |

-   **Reason for Exclusion from Analysis [*reason*]**

    -   This depicts the reason (summarized into 5 categories) a participant should be excluded from analysis due to testing error or other non-task-specific criteria (i.e., for reasons other than the task exclusion criteria in the data analysis plan). *Note: this includes a category for "no exclusion issues", where a participant was not excluded*\

        <details>

        <summary>

        coding information *(click to expand)*

        </summary>

        +------------+---------------------+-----------------------------+----------------+-------------+---------------+
        | code       | 1                   | 2                           | 3              | 4           | 5             |
        +===========:+:===================:+:===========================:+:==============:+:===========:+:=============:+
        | *response* | no exclusion issues | external distractor (noise) | gng incomplete | gng missing | ospan missing |
        +------------+---------------------+-----------------------------+----------------+-------------+---------------+

-   **Possible Outlier [*pos_out*]**

    -   This depicts whether a participant is a possible outlier for exploratory analyses. *Note: this was not applied to any analysis unless explicitly stated*

        <details>

        <summary>

        coding information *(click to expand)*

        </summary>

        +------------+------------------------------------------+---------------------------------------------+
        | code       | 1                                        | 2                                           |
        +===========:+:========================================:+:===========================================:+
        | *response* | Yes (does qualify as a possible outlier) | No (does not qualify as a possible outlier) |
        +------------+------------------------------------------+---------------------------------------------+

-   **Experimenter [*experimenter*]**

    -   This depicts which experimenter (3 total) collected the data for a participant. *Note: this was collected for exploratory analyses and was not used unless explicitly stated*

        <details>

        <summary>

        coding information *(click to expand)*

        </summary>

        |       code |  1  |  2  |  3  |
        |-----------:|:---:|:---:|:---:|
        | *response* | ARP | SW  | LM  |

</details>

</details>

<details>

<summary>

This code imported the study tracking sheet raw data. *(click to expand)*

</summary>

```{r}
track_raw = utils::read.csv("raw_tracking.csv", header = TRUE)

# reorganize rows so that the data is ordered by the participant number (e.g. 1, 2, 3, 4...)
track_raw = dplyr::arrange(track_raw, track_participant)

track_raw
```

</details>

</details>

## Construct Working Dataframes {#makeDFs}

In order to complete the present study's analyses, **eight** data frames were created:

1.  All Data
2.  All Data - Go/No-Go separated by Cue Type (WIDE FORMAT)
3.  All Data - Go/No-Go separated by Cue Type (Long FORMAT)
4.  OSpan Analysis
5.  Go/No-Go: Error Analysis
6.  Go/No-Go: Accuracy Analysis
7.  SAD: Factor Analysis
8.  Moderation Analysis

**Data Cleaning** was completed for (1)-(3) based on: (a) Exclusion criteria, (b) Outlier data, and (c) Incomplete/Missing data:

a.  **Exclusion Criteria**

    -   **OSpan Data: Math Criteria** Planned exclusion criteria was applied to the data. Exclusion criteria for this analysis helped to control for any participants who did not follow the instructions, specifically, that participant payed attention to both the math component and letter recall component of the task. Only data from those who scored 85% or higher on the math component of the task were included.

    -   **Go/No-Go Data: Chance Performance Criteria** Planned exclusion criteria was applied to the data. Exclusion criteria for this analysis helped to control for any participants who did not follow the instructions. Only data from those who scored higher than chance performance (i.e., responded to at least 50% of "Go" trials and withheld response to at least 50% of "No-Go" trials) were used for the final analysis.

        -   Note: This data cleaning was completed in the [Go/No-Go Raw Data section](#gng_raw).

    -   **Go/No-Go Data: Standard Deviation Criteria** Planned exclusion criteria was applied to the data. Exclusion criteria for this analysis helped to control for any participants who did not follow the instructions. Only data from those who scored higher than chance performance (i.e., responded to at least 50% of "Go" trials and withheld response to at least 50% of "No-Go" trials) were used for the final analysis.

        -   Note: This data cleaning was completed in the [Go/No-Go Raw Data section](#gng_raw).

b.  **Outlier Data**

    -   **OSpan & Go/No-Go Data:** For all task data, any *extreme outliers* were removed. The outliers were identified and removed using the *standard deviation (SD)* method. Participants who's score was greater than 2SDs from the mean were excluded.

c.  **Incomplete/Missing Data**

    -   For all task data, any participants with incomplete or missing data were removed.

### 1. All Data

<details>

<summary>

All the data from the present study in one data frame: **"all_data"** *(click to see more)*

</summary>

-   This data included the following:

    -   `r colnames(all_data)`

<details>

<summary>

This code organized the raw data into one data frame. *(click to expand)*

</summary>

```{r create_all_data_temp, message=FALSE, warning=FALSE}

# Find the final participant list for the analyses  
  # get a list of participants that (1) completed the gng task and (2) are not excluded (i.e., inclusion list) based on the "exclude" column in the tracking sheet  
  # (1) First get the gng participant list  
  gng_par_list = gng_data_extracted$gng_participant  
  # (2) Next, get the inclusion list  
  include_par_list = data.frame(track_raw$track_participant[which(track_raw$exclude %in% 2)])  
    colnames(include_par_list) = c("par_list")
  # Get a list of all the participants who completed the study (from the testing tracking sheet)
  all_participants = as.data.frame(track_raw$track_participant)
    colnames(all_participants) = c("all_participants")
  
# find the participants to remove from the data
  # Filter each data list from the list of all participants  
  gng_remove = data.frame("all_participants" = filter(all_participants, !(all_participants %in% gng_par_list)))
  track_remove = data.frame("all_participants" = filter(all_participants, !(all_participants %in% include_par_list$par_list)))
  reason_remove = c(rep("gng", length(gng_remove$all_participants)), rep("track", length(track_remove$all_participants)))
  
  # Make a list of all the removed participants -- "all_remove"
  all_remove_notclean = data.frame(rbind(gng_remove, track_remove), reason_remove)
    # remove any duplicates from the list
    all_remove_nodup = all_remove_notclean[!duplicated(all_remove_notclean$all_participants), ]
    # reorganize rows so that the data is ordered by the participant number (e.g. 1, 2, 3, 4...)  
    all_remove = dplyr::arrange(all_remove_nodup, all_participants)  
    
# Filter raw data using the final participant list for: ospan_raw, gng_raw, survey_raw, and track_raw
  # This will now be the working data for the study (i.e. the "final" data)  
    # OSpan Data  
    ospan_final = dplyr::filter(ospan_raw, !ospan_participant %in% all_remove$all_participants)
    # Gng Data  
    gng_final = dplyr::filter(gng_data_extracted, !gng_participant %in% all_remove$all_participants)
    # Survey Data  
    survey_final = dplyr::filter(survey_raw, !sur_participant %in% all_remove$all_participants)
    # Tracking Data  
    track_final = dplyr::filter(track_raw, !track_participant %in% all_remove$all_participants)

# Combine all the final data into 1 data frame with the following:
  # Start by creating a data frame with the final participant list in the 1st column: "participant"
  all_par = as.data.frame(track_final$track_participant)
    colnames(all_par) = c("participant")

  # to use the merge function, change the respective "participant" column name in each data frame to "participant"
    # OSpan Data
    colnames(ospan_final)[which(names(ospan_final) == "ospan_participant")] = c("participant")
    # Gng Data
    colnames(gng_final)[which(names(gng_final) == "gng_participant")] = c("participant")
    # Survey Data
    colnames(survey_final)[which(names(survey_final) == "sur_participant")] = c("participant")
    # Tracking Data
    colnames(track_final)[which(names(track_final) == "track_participant")] = c("participant")

# Merge all 4 data frames into 1 data frame -- "all_data_temp" 
  # this will be refined with data cleaning later
  all_data_temp <-  Reduce(function(x, y) merge(x, y, by="participant"), list(track_final, ospan_final, gng_final, dplyr::select(survey_final, -c(sur_location, sur_power, sur_condition))))
    # rename the tracking sheet columns for the location, power, and condition 
    colnames(all_data_temp)[which(names(all_data_temp) == c("track_location", "track_power", "track_condition"))] = c("location", "power", "condition")
    
# Add score for SAD items
  #SAD score over all 13 items
    all_data_temp <- 
      all_data_temp %>% rowwise() %>% dplyr::mutate(SAD_score = sum(c_across(SAD_1:SAD_13)))
    # all_data_temp %>% 
    # # for each participant
    # rowwise() %>% 
    # # score is calculated by adding up all 13 items for a total sum score for each p
    # mutate(SAD_score = sum(SAD_1, SAD_2, SAD_3, SAD_4, SAD_5, SAD_6, SAD_7, SAD_8, SAD_9, SAD_10, SAD_11, SAD_12, SAD_13))
  
  # Smartphone Dependence subscale (items: 1, 2, 3) -- based on FA 
    all_data_temp <- all_data_temp %>% rowwise() %>% dplyr::mutate(SAD_dep_score = sum(c_across(SAD_1:SAD_3)))

  # Emotional Attachment subscale (items: 8, 9, 10, 11) -- based on FA 
    all_data_temp <- all_data_temp %>% rowwise() %>% dplyr::mutate(SAD_ea_score = sum(c_across(SAD_8:SAD_11)))
  
  # Smartphone Accessibility subscale (items: 5, 6, 7) -- based on FA 
    all_data_temp <- all_data_temp %>% rowwise() %>% dplyr::mutate(SAD_access_score = sum(c_across(SAD_5:SAD_7)))
    
  # Smartphone Distractibility subscale (items: 12, 13) -- based on FA 
    all_data_temp <- all_data_temp %>% rowwise() %>% dplyr::mutate(SAD_dist_score = sum(c_across(SAD_12:SAD_13)))
    
# Add score for SUQ_use_4-5
all_data_temp <- all_data_temp %>% rowwise() %>% dplyr::mutate(SUQ_abext_score = sum(SUQ_use_4, SUQ_use_5)/2)
  all_data_temp %>% 
  # for each participant
  rowwise() %>% 
  # score is calculated as average between 2 items -- measures ps tendency to turn to smartphone in absence of external stimulation
  mutate(SUQ_abext_score = (sum(SUQ_use_4, SUQ_use_5))/2)

# Add score for SUQ_use_6-7
all_data_temp <- all_data_temp %>% rowwise() %>% dplyr::mutate(SUQ_duract_score = sum(SUQ_use_6, SUQ_use_7)/2)
  all_data_temp %>% 
  # for each participant  
  rowwise() %>% 
  # score is calculated as average between 2 items -- measures ps tendency to turn to smartphone during other activities
  mutate(SUQ_duract_score = (sum(SUQ_use_6, SUQ_use_7))/2)
  
all_data_temp
```

</details>

</details>

#### Data Cleaning: Exclusion Criteria, Outliers, and Missing Data {#all_data_DC}

<details>

<summary>

The final sample size (*N* = `r nrow(all_data)`) for this data frame excluded `r sum(all_remove_dc$reason == "ospan_ec") + sum(all_remove_dc$reason == "gng_h_ec") + sum(all_remove_dc$reason == "gng_v_ec")` participants who met the exclusion criteria (OSpan math criteria: `r sum(all_remove_dc$reason == "ospan_ec")`; Go/No-Go response criteria, horizontal/no-go cue: `r sum(all_remove_dc$reason == "gng_h_ec")`; Go/No-Go response criteria, vertical/go cue: `r sum(all_remove_dc$reason == "gng_v_ec")`). A total of `r sum(all_remove_dc$reason == "ospan_out") + sum(all_remove_dc$reason == "gng_err_out") + sum(all_remove_dc$reason == "gng_ac_out")` participants were identified as outliers (OSpan: `r sum(all_remove_dc$reason == "ospan_out")`; Go/No-Go Error Analysis: `r sum(all_remove_dc$reason == "gng_err_out")`; Go/No-Go Accuracy Analysis: `r sum(all_remove_dc$reason == "gng_ac_out")`) and removed. A total of `r sum(all_remove_dc$reason == "ospan_miss") + sum(all_remove_dc$reason == "gng_err_miss") + sum(all_remove_dc$reason == "gng_ac_miss")` participants were removed due to incomplete or missing data (OSpan: `r sum(all_remove_dc$reason == "ospan_miss")`; Go/No-Go Error Analysis: `r sum(all_remove_dc$reason == "gng_err_miss")`; Go/No-Go Accuracy Analysis: `r sum(all_remove_dc$reason == "gng_ac_miss")`). This created the **"all_data"** data frame *(click to see more)*.

</summary>

<details>

<summary>

This code cleaned *all_data* based on (1) exclusion criteria, (2) outliers, and (3) incomplete/missing data. *(click to see code)*

</summary>

```{r create_all_remove_dc&all_remove_dc_list, message=FALSE, warning=FALSE}
# get list of ps to remove for various reasons...
  
# Exclusion criteria
  # For OSPAN
  # ps scored <85% on math
  dc_ospan_ec <- 
    all_data_temp %>% 
    # apply filter to math criteria
    filter(ospan_math_criteria_85 == 0) %>% 
    # remove all rows except participant ID
    select(participant)

  # For GNG -- based on raw data separated by cue type (i.e., either vertical or horizontal)
  # Vertical cue type ("go")
  dc_gng_v_ec <- 
    gng_data_extracted_v %>% 
    # apply filter to response criteria
    filter(exclude_v == 1) %>% 
    # remove all rows except participant ID
    select(participant_long) %>% 
    # rename column as "participant"
    dplyr::rename(participant = participant_long)
  # Vertical cue type ("no-go")
  dc_gng_h_ec <- 
    gng_data_extracted_h %>% 
    # apply filter to response criteria
    filter(exclude_h == 1) %>% 
    # remove all rows except participant ID
    select(participant_long) %>% 
    # rename column as "participant"
    dplyr::rename(participant = participant_long)

# Outliers
  # For OSPAN
  dc_ospan_out <- 
    all_data_temp %>% 
    # remove ps with score > 2SD from mean
    filter(ospan_absolute_score > (mean(ospan_absolute_score, na.rm = T) + (2 * sd(ospan_absolute_score, na.rm = T))) |
           ospan_absolute_score < (mean(ospan_absolute_score, na.rm = T) - (2 * sd(ospan_absolute_score, na.rm = T)))
           ) %>% 
    # remove all rows except participant ID
    select(participant)
  
  # For GNG error (omission)
  dc_gng_err_out <- 
    all_data_temp %>% 
    # remove ps with score > 2SD from mean
    filter(gng_err_omi > (mean(gng_err_omi, na.rm = T) + (2 * sd(gng_err_omi, na.rm = T))) |
           gng_err_omi < (mean(gng_err_omi, na.rm = T) - (2 * sd(gng_err_omi, na.rm = T)))
           ) %>% 
    # remove all rows except participant ID
    select(participant)
  
  # For GNG accuracy (average RT)
  dc_gng_ac_out <- 
    all_data_temp %>% 
    # remove ps with score > 2SD from mean
    filter(gng_rt_av > (mean(gng_rt_av, na.rm = T) + (2 * sd(gng_rt_av, na.rm = T))) |
           gng_rt_av < (mean(gng_rt_av, na.rm = T) - (2 * sd(gng_rt_av, na.rm = T)))
           ) %>% 
    # remove all rows except participant ID
    select(participant)
  
# Missing/incomplete data
  # For OSPAN
  dc_ospan_miss <- all_data_temp %>% filter(is.na(ospan_absolute_score)) %>% select(participant)
  # For GNG error (omission)
  dc_gng_err_miss <- all_data_temp %>% filter(is.na(gng_err_omi)) %>% select(participant)
  # For GNG RT (average)
  dc_gng_ac_miss <- all_data_temp %>% filter(is.na(gng_rt_av)) %>% select(participant)
    
  
# create list of ps to remove with reason for removal -- all_remove_dc
all_remove_dc <- data.frame("participant" = bind_rows(dc_ospan_ec, dc_ospan_out, dc_ospan_miss, 
                                                      dc_gng_v_ec, dc_gng_h_ec, 
                                                      dc_gng_err_out, dc_gng_err_miss, 
                                                      dc_gng_ac_out, dc_gng_ac_miss), 
                            "reason" = c(rep("ospan_ec", nrow(dc_ospan_ec)), rep("ospan_out", nrow(dc_ospan_out)), 
                                         rep("ospan_miss", nrow(dc_ospan_miss)), rep("gng_v_ec", nrow(dc_gng_v_ec)),
                                         rep("gng_h_ec", nrow(dc_gng_h_ec)),rep("gng_err_out", nrow(dc_gng_err_out)), 
                                         rep("gng_err_miss", nrow(dc_gng_err_miss)), rep("gng_ac_out", nrow(dc_gng_ac_out)), 
                                         rep("gng_ac_miss", nrow(dc_gng_ac_miss)) 
                                         )
                            )
# create list of ps to remove without duplicates to use to filter final data
all_remove_dc_list <- 
  all_remove_dc %>% 
  # remove reason column
  select(participant) %>% 
  # remove any duplicated participant IDs
  distinct()
```

```{r clean_all_data, message=FALSE, warning=FALSE}
# clean all_data using all_remove_dc_list
all_data <- 
  all_data_temp %>% 
  filter(!participant %in% all_remove_dc_list$participant)

all_data
```

</details>

</details>

### 2. All Data - Go/No-Go separated by Cue Type (WIDE FORMAT)

<details>

<summary>

All the data from the present study in one data frame, with the Go/No-Go data separated by target type (i.e., either "go" or "no-go"), presented in WIDE FORMAT: **"all_data_cue"** *(click to see more)*

</summary>

-   This data included the following:

    -   `r colnames(all_data_cue)`

<details>

<summary>

This code organized the raw data into one data frame. *(click to expand)*

</summary>

```{r create_all_data_cue, message=FALSE, warning=FALSE}
# use the list of all the participants to remove -- "all_remove" & "all_remove_dc_list" -- to filter the raw gng long data
  # for the "go" targets
  gng_v_final <- 
  gng_data_extracted_v %>% 
  # remove using use all_remove
  filter(!participant_long %in% all_remove$all_participants) %>% 
  # remove using use all_remove_dc_list
  filter(!participant_long %in% all_remove_dc_list$participant)
  
  # for the "no-go" targets
  gng_h_final <- 
  gng_data_extracted_h %>% 
  # remove using use all_remove
  filter(!participant_long %in% all_remove$all_participants) %>% 
  # remove using use all_remove_dc_list
  filter(!participant_long %in% all_remove_dc_list$participant)

# Combine all the final data into 1 data frame with the following:
  # use the final participant list -- "all_par" -- as the 1st column in the data frames

  # to use the merge function, change the respective "participant" column name in each data frame to "participant"
    # this was already done for the other data frames (e.g. ospan_final, survey_final)
    # for the "vertical" targets
    colnames(gng_v_final)[which(names(gng_v_final) == "participant_long")] = c("participant")
    # for the "horizontal" targets
    colnames(gng_h_final)[which(names(gng_h_final) == "participant_long")] = c("participant")

# Merge all 4 data frames into 1 data frame -- "all_data_cue"
  all_data_cue = Reduce(function(x, y) merge(x, y, by="participant"), list(track_final, ospan_final, gng_v_final, dplyr::select(gng_h_final, -c(age_long, lang_long)), dplyr::select(survey_final, -c(sur_location, sur_power, sur_condition))))
    # rename the tracking sheet columns for the location, power, and condition 
    colnames(all_data_cue)[which(names(all_data_cue) == c("track_location", "track_power", "track_condition"))] = c("location", "power", "condition")

# Add score for SAD items
  #SAD score over all 13 items
    all_data_cue <- 
    all_data_cue %>% 
    # for each participant
    rowwise() %>% 
    # score is calculated by adding up all 13 items for a total sum score for each p
    mutate(SAD_score = sum(SAD_1, SAD_2, SAD_3, SAD_4, SAD_5, SAD_6, SAD_7, SAD_8, SAD_9, SAD_10, SAD_11, SAD_12, SAD_13))
  
  # Smartphone Dependence subscale (items: 1, 2, 3, 4, 5) -- based on FA 
    all_data_cue <- 
    all_data_cue %>% 
    # for each participant
    rowwise() %>% 
    # score is calculated by adding up items 1-5 for a total sum score for each p
    mutate(SAD_dep_score = sum(SAD_1, SAD_2, SAD_3))
    
  # Emotional Attachment subscale (items: 8, 9, 10, 11) -- based on FA 
    all_data_cue <- 
    all_data_cue %>% 
    # for each participant
    rowwise() %>% 
    # score is calculated by adding up items 8-11 for a total sum score for each p
    mutate(SAD_ea_score = sum(SAD_8, SAD_9, SAD_10, SAD_11))
    
  # Smartphone Accessability subscale (items: 5, 6, 7) -- based on FA 
    all_data_cue <- 
    all_data_cue %>% 
    # for each participant
    rowwise() %>% 
    # score is calculated by adding up all 13 items for a total sum score for each p
    mutate(SAD_access_score = sum(SAD_5, SAD_6, SAD_7))
    
  # Smartphone Distractibility subscale (items: 7, 12, 13) -- based on FA 
    all_data_cue <- 
    all_data_cue %>% 
    # for each participant
    rowwise() %>% 
    # score is calculated by adding up all 13 items for a total sum score for each p
    mutate(SAD_dist_score = sum(SAD_12, SAD_13))

# Add score for SUQ_use_4-5
all_data_cue <- 
  all_data_cue %>% 
  # for each participant
  rowwise() %>% 
  # score is calculated as average between 2 items -- measures ps tendency to turn to smartphone in absence of external stimulation
  mutate(SUQ_abext_score = (sum(SUQ_use_4, SUQ_use_5))/2)

# Add score for SUQ_use_6-7
all_data_cue <- 
  all_data_cue %>% 
  # for each participant  
  rowwise() %>% 
  # score is calculated as average between 2 items -- measures ps tendency to turn to smartphone during other activities
  mutate(SUQ_duract_score = (sum(SUQ_use_6, SUQ_use_7))/2)

all_data_cue
    
```

</details>

</details>

### 3. All Data - Go/No-Go separated by Cue Type (LONG FORMAT)

<details>

<summary>

All the data from the present study in one data frame, with the Go/No-Go data separated by target type (i.e., either "go" or "no-go"), presented in LONG FORMAT: **"all_data_cue_long"** *(click to see more)*

</summary>

-   This data included the following:

    -   `r colnames(all_data_cue_long)`

<details>

<summary>

This code organized the raw data into one data frame. *(click to expand)*

</summary>

```{r create_all_data_cue_long, message=FALSE, warning=FALSE}
# Re-structure the data into 1 data frame in a long format for a repeated measures analysis

  # using the "final" data frames of all the data, make 2 data frames, one for the "vertical" (all_data_v) and one for the "horizontal" (all_data_h) targets
    # for the "vertical" targets -- all_data_v
    all_data_v = Reduce(function(x, y) merge(x, y, by="participant"), list(track_final, ospan_final, gng_v_final, dplyr::select(survey_final, -c(sur_location, sur_power, sur_condition))))
      # rename the tracking sheet columns for the location, power, and condition 
      colnames(all_data_v)[which(names(all_data_v) == c("track_location", "track_power", "track_condition"))] = c("location", "power", "condition")
    # for the "horizontal" targets -- all_data_h
    all_data_h = Reduce(function(x, y) merge(x, y, by="participant"), list(track_final, ospan_final, gng_h_final, dplyr::select(survey_final, -c(sur_location, sur_power, sur_condition))))
      # rename the tracking sheet columns for the location, power, and condition 
      colnames(all_data_h)[which(names(all_data_h) == c("track_location", "track_power", "track_condition"))] = c("location", "power", "condition")

  # rename target-specific columns for both data frames (to bind)
    # for the "go" trials
    names(all_data_v)[names(all_data_v) == "trial_tot_v"] = "gng_trial_tot_long" 
    names(all_data_v)[names(all_data_v) == "score_v"] = "gng_score_long" 
    names(all_data_v)[names(all_data_v) == "resp_tot_v"] = "gng_resp_tot_long" 
    names(all_data_v)[names(all_data_v) == "no_resp_tot_v"] = "gng_no_resp_tot_long" 
    names(all_data_v)[names(all_data_v) == "err_tot_v"] = "gng_err_tot_long" 
    names(all_data_v)[names(all_data_v) == "err_com_v"] = "gng_err_com_long" 
    names(all_data_v)[names(all_data_v) == "err_omi_v"] = "gng_err_omi_long" 
    names(all_data_v)[names(all_data_v) == "rt_av_v"] = "gng_rt_av_long" 
    names(all_data_v)[names(all_data_v) == "exclude_v"] = "gng_exclude_long"
    # for the "no-go" trials
    names(all_data_h)[names(all_data_h) == "trial_tot_h"] = "gng_trial_tot_long" 
    names(all_data_h)[names(all_data_h) == "score_h"] = "gng_score_long" 
    names(all_data_h)[names(all_data_h) == "resp_tot_h"] = "gng_resp_tot_long" 
    names(all_data_h)[names(all_data_h) == "no_resp_tot_h"] = "gng_no_resp_tot_long" 
    names(all_data_h)[names(all_data_h) == "err_tot_h"] = "gng_err_tot_long" 
    names(all_data_h)[names(all_data_h) == "err_com_h"] = "gng_err_com_long" 
    names(all_data_h)[names(all_data_h) == "err_omi_h"] = "gng_err_omi_long" 
    names(all_data_h)[names(all_data_h) == "rt_av_h"] = "gng_rt_av_long" 
    names(all_data_h)[names(all_data_h) == "exclude_h"] = "gng_exclude_long"
  
  # Create a data frame depicting the target type: either "go" (coded as 1) or "no-go" (coded as 2)
  cue_type = as.data.frame(c(rep(1, length(all_data_v$participant)), rep(2, length(all_data_h$participant))))
    colnames(cue_type) = c("cue")  

  # Bind the go and no-go data frames into 1 (in long format)
  gng_cue_bind = dplyr::bind_rows(all_data_v, all_data_h)
  
  # add the target type column depicting the target type: either "go" (coded as 1) or "no-go" (coded as 2)
  all_data_cue_long = dplyr::bind_cols(cue_type, gng_cue_bind)
  
  all_data_cue_long
```

</details>

</details>

### 4. OSpan Analysis

<details>

<summary>

All the data related to the OSpan Analysis from the present study in one data frame: **"ospan_analysis"** *(click to see more)*

</summary>

-   This data included the following:

    -   `r colnames(ospan_analysis)`

<details>

<summary>

This code organized the final data into the ospan-related data frame. *(click to expand)*

</summary>

```{r create_ospan_analysis}

# make a new data frame for the ospan analysis
ospan_analysis = dplyr::select(all_data, participant, age, gender, lang, program, year, location, power, balance, ospan_absolute_score, ospan_math_percent_corr, ospan_math_criteria_85)


ospan_analysis

```

</details>

</details>

### 5. Go/No-Go: Error Analysis

<details>

<summary>

All the data related to the Go/No-Go Error Analysis from the present study in one data frame: **"gng_err_analysis"**

Note: this data is in a long format for a repeated measures analysis (i.e., based on target type: either "go" or "no-go"). *(click to see more)*

</summary>

-   This data included the following:

    -   `r colnames(gng_err_analysis)`

<details>

<summary>

This code organized the final long data into a data frame for the Go/No-Go Error Analysis. *(click to expand)*

</summary>

```{r create_gng_err_analysis}
# make a new data frame for the gng error analysis
gng_err_analysis = dplyr::select(all_data_cue_long, participant, age_long, gender, lang_long, program, year, location, power, condition, balance, cue, gng_exclude_long, gng_trial_tot_long, gng_resp_tot_long, gng_no_resp_tot_long, gng_err_tot_long, gng_err_com_long, gng_err_omi_long)

gng_err_analysis

```

</details>

</details>

### 6. Go/No-Go: Accuracy Analysis

<details>

<summary>

All the data related to the Go/No-Go Accuracy Analysis from the present study in one data frame: **"gng_ac_analysis"** *(click to see more)*

</summary>

-   This data included the following:

    -   `r colnames(gng_ac_analysis)`

<details>

<summary>

This code organized the final data into a data frame related to the Go/No-Go Accuracy Analysis. *(click to expand)*

</summary>

```{r create_gng_ac_analysis}

# make a new data frame for the gng accuracy analysis
gng_ac_analysis = dplyr::select(all_data, participant, age, gender, lang, program, year, location, power, balance, gng_trial_tot, gng_resp_tot, gng_no_resp_tot, gng_rt_av) 

gng_ac_analysis
```

</details>

</details>

### 7. SAD: Factor Analysis

<details>

<summary>

All the data related to the Factor Analysis from the present study in one data frame: **"factor_analysis"** *(click to see more)*

</summary>

-   This data included the following:

    -   `r colnames(factor_analysis)`

<details>

<summary>

This code organized the final data into the factor analysis-related data frame. *(click to expand)*

</summary>

```{r create_factor_analysis}

# make a new data frame for the factor analysis
factor_analysis = dplyr::select(all_data, participant, age, gender, lang, program, year, location, power, balance, SAD_1:SAD_13, SAD_score)

factor_analysis

```

</details>

</details>

### 8. Moderation Analysis

<details>

<summary>

All the data related to the Moderation Analysis from the present study in one data frame: **"mod_analysis"** *(click to see more)*

</summary>

-   This data included the following:

    -   `r colnames(mod_analysis)`

<details>

<summary>

This code organized the final data into the moderation-related data frame. *(click to expand)*

</summary>

```{r create_mod_analysis}

# make a new data frame for the glm analysis
mod_analysis = dplyr::select(all_data, participant, ospan_absolute_score, age, gender, lang, program, year, location, power, balance, SAD_score, SAD_dep_score, SAD_ea_score, SAD_access_score, SAD_dist_score, SUQ_abext_score, SUQ_duract_score, SUQ_use_1, SUQ_use_2, SUQ_use_3, SUQ_exp_1, SUQ_exp_2, SUQ_exp_4)

mod_analysis

```

</details>

</details>

# Descriptive Statistics {#DS}

The following figures and tables depict the descriptive statistics for the present study's data. This was done across all conditions (*Toltal*) and divided by condition: smartphone location (i.e., on desk, in pocket/bag, or outside) and power (i.e., on or off). For any **continuous variables**, the table(s) depict the mean (*M*), standard deviation (*SD*), minimum score (*min.*), maximum score (*max.*), and the sample size (*n*). For any **nominal variables**, the table(s) depict the variable, levels, and frequency for each level. This was done for the following:

1.  Demographics Data

    -   Continuous variables (i.e., age)
    -   Nominal variables (i.e., Sample Size, Balance, Gender, Language, Program, and Year)

2.  Task Data

    -   OSpan Data (i.e., absolute score)
    -   Go/No-Go Data (i.e., omission errors, average reaction time)

3.  Questionnaire Data

    -   SUQ Data (e.g., typical smartphone usage)
    -   SAD Data (i.e., SAD score)

<details>

<summary>

This code organized the descriptive statistic data sets *(click to see code)*

</summary>

<details>

<summary>

This code organized the final data into the descriptive statistics for the **Demographic Data** *(click to see code)*.

</summary>

```{r create_all_data_demo_des, message=FALSE, warning=FALSE}
# make new data frame for descriptive stats
all_data_demo_des <- 
  all_data %>% 
  
  # subset data for descriptive variables only
  select(location, power, condition, balance, age, gender, lang, program, year) %>% 
  
  # set some variables as factors
  mutate(location = factor(location, levels = c(1:3), labels = c("Desk", "Pocket/Bag", "Outside"))) %>% 
  mutate(power = factor(power, levels = c(1:2), labels = c("On", "Off"))) %>% 
  mutate(condition = factor(condition, levels = c(1:6), labels = c("On & Desk", "On & Pocket/Bag", "On & Outside", "Off & Desk", "Off & Pocket/Bag", "Off & Outside"))) %>% 
  mutate(balance = factor(balance, levels = c(1:2), labels = c("Ospan Frist", "GnG First"))) %>% 
  mutate(gender = factor(gender, levels = c(1:2), labels = c("Male", "Female"))) %>% 
  mutate(lang = factor(lang, levels = c(0, 1), labels = c("Other", "English"))) %>% 
  mutate(program = factor(program, levels = c(3, 4, 13, 14, 28, 32, 37, 12, 17, 19, 15, 20, 24, 23, 26, 1, 2, 5, 7, 8, 11, 16, 18, 22, 27, 29, 33, 36, 6, 9, 10, 21, 25, 30, 31, 34, 35),
                          labels = c(rep("Arts & Humanities", 7), "Engineering", rep("Health Sciences", 2), rep("Information & Media Studies", 3), 
                                     "Medicine & Dentistry", "Music", rep("Science", 13), rep("Social Science", 9)))) %>% 
  mutate(year = factor(year, levels = c(1:4), labels = c("First Year", "Second Year", "Third Year", "Fourth Year")))

all_data_demo_des
```

</details>

<details>

<summary>

This code organized the final data into the descriptive statistics for the **Task Data** *(click to see code)*.

</summary>

```{r create_all_data_task_des, message=FALSE, warning=FALSE}
# make new data frame for descriptive stats
all_data_task_des <- 
  all_data %>% 
  # subset data for descriptive variables only
  select(location, power, condition, ospan_absolute_score, ospan_total_corr, ospan_math_percent_corr, gng_score, gng_err_omi, gng_rt_av)

all_data_task_des
```

</details>

<details>

<summary>

This code organized the final data into the descriptive statistics for the **Questionnaire Data** *(click to see code)*.

</summary>

```{r create_all_data_ques_des, message=FALSE, warning=FALSE}
# make new data frame for descriptive stats
all_data_ques_des <- 
  all_data %>% 
  
  # subset data for descriptive variables only
  select(location, power, condition, SUQ_use_1:SUQ_use_3, SUQ_exp_1, SUQ_exp_2, SUQ_exp_4, 
         SAD_score, SAD_dep_score, SAD_ea_score, SAD_access_score, SAD_dist_score, SUQ_abext_score, SUQ_duract_score) %>% 
  
  # set some variables as factors
  mutate(SUQ_use_1 = factor(SUQ_use_1, levels = c(1:4), labels = c("0-5", "6-10", "11-15", ">15"))) %>% 
  mutate(SUQ_use_2 = factor(SUQ_use_2, levels = c(1:4), labels = c("0-5", "6-10", "11-15", ">15"))) %>% 
  mutate(SUQ_use_3 = factor(SUQ_use_3, levels = c(1:4), labels = c("0-5", "6-10", "11-15", ">15"))) %>% 
  mutate(SUQ_exp_1 = factor(SUQ_exp_1, levels = c(1:4), labels = c("$0-$20", "$21-$40", "$41-$60", ">$60")))

all_data_ques_des
```

</details>

</details>

## Demographic Data {#DSdemo}

<details>

<summary>

The following table depicts the **continuous variable(s)** (i.e., age) from all participants in the present study *(click to see code)*.

</summary>

```{r create_des_demo_cont, message=FALSE, warning=FALSE, include=FALSE}
# create a data frame with the descriptive stats of all the data, GROUPED by the location & power conditions
  # this creates information for the 6 possible combined conditions
des_demo_cont <- 
  # create this as a data frame rather than matrix
  as.data.frame(all_data_demo_des %>% 
                  # select only age & the condition columns -- since age is the only continuous demographic variable
                  select(c(location, power, age)) %>% 
                  # get descriptive stats for age & output as a matrix ("mat = T")
                  describeBy(group = list(all_data_demo_des$location, all_data_demo_des$power), mat = T)
                )

# create a data frame with the descriptive stats of all the data, ACROSS the location & power conditions
des_demo_cont_across <- 
  # get descriptive stats for age
    # create this as a data frame rather than matrix
  as.data.frame(psych::describe(all_data_demo_des$age))
  
# organize the data into a new data frame depicting the M, SD, min., max., and n.
des_demo_cont_t <- 
  as.data.frame(cbind(c("M", "SD", "min.", "max.", "n"), 
                      rbind(c(des_demo_cont[c("age1", "age2", "age3", "age4", "age5", "age6"),]$mean), 
                            c(des_demo_cont[c("age1", "age2", "age3", "age4", "age5", "age6"),]$sd), 
                            c(des_demo_cont[c("age1", "age2", "age3", "age4", "age5", "age6"),]$min), 
                            c(des_demo_cont[c("age1", "age2", "age3", "age4", "age5", "age6"),]$max), 
                            c(des_demo_cont[c("age1", "age2", "age3", "age4", "age5", "age6"),]$n)
                            ), 
                      c(des_demo_cont_across$mean, des_demo_cont_across$sd, des_demo_cont_across$min, 
                        des_demo_cont_across$max, des_demo_cont_across$n)
                      )
                )
# rename columns  
colnames(des_demo_cont_t) = c("var", "dON", "pbON", "oON", "dOFF", "pbOFF", "oOFF", "overall")
# change all columns except "var" to numeric
des_demo_cont_t$dON <- as.numeric(des_demo_cont_t$dON)
des_demo_cont_t$pbON <- as.numeric(des_demo_cont_t$pbON)
des_demo_cont_t$oON <- as.numeric(des_demo_cont_t$oON)
des_demo_cont_t$dOFF <- as.numeric(des_demo_cont_t$dOFF)
des_demo_cont_t$pbOFF <- as.numeric(des_demo_cont_t$pbOFF)
des_demo_cont_t$oOFF <- as.numeric(des_demo_cont_t$oOFF)
des_demo_cont_t$overall <- as.numeric(des_demo_cont_t$overall)
```

</details>

```{r show_des_demo_cont_t, echo=FALSE, message=FALSE, warning=FALSE}
# show table using kable
kable(des_demo_cont_t, caption = "Demographic Information by Condition and Overall: Age", digits = 2, align = rep('c'), col.names = c(" ", rep(c("On Desk", "In Pocket/Bag", "Outside"),2), "TOTAL"), row.names = c(1:7)) %>% 
  footnote(general = "Continuous descriptive statistics shown. 'TOTAL' column shows statistics across all conditions.") %>% 
  pack_rows("Age (years)", 1, 5) %>% 
  # add_header_above(c(" ", " ", "Smartphone Location" = 6, " ")) %>%
  add_header_above(c(" "," ", "ON" = 3, "OFF" = 3, " "), bold = T) %>%
  row_spec(0, bold = T) %>%
  column_spec(6, border_left = T) %>%
  column_spec(9, border_left = T) %>%
  kable_styling(bootstrap_options = "striped", "hover", full_width = F) %>%
  kable_classic()
```

<details>

<summary>

The following table depicts the **nominal variables** (i.e., Sample Size, Balance, Gender, Language, Program, and Year) from all participants in the present study *(click to see code)*.

</summary>

```{r create_des_demo_nom, message=FALSE, warning=FALSE, include=FALSE}
# create counts for each nominal variable divided by condition
  # details shown for first variable only des_nom
des_demo_nom_bal <- 
  all_data_demo_des %>% 
  # select only the variables of interest: condition & balance
  select(condition, balance) %>% 
  # divide by the condition type (i.e., mix of smartphone power and location) & balance (i.e., Ospan 1st or GnG 1st)
  group_by(condition, balance) %>% 
  # count the frequencies for balance
  dplyr::count() %>% 
  # transpose the table
  spread(condition, n) %>% 
  # rename the column to join later
  dplyr::rename(Variable = balance) %>% 
  # for each row...
  rowwise() %>% 
  # add sum column across conditions
  dplyr::mutate(Total = sum(`On & Desk`, `On & Pocket/Bag`, `On & Outside`, `Off & Desk`, `Off & Pocket/Bag`, `Off & Outside`, na.rm = T))

des_demo_nom_gender <- all_data_demo_des %>% select(condition, gender) %>% group_by(condition, gender) %>% dplyr::count() %>% spread(condition, n) %>% dplyr::rename(Variable = gender) %>% rowwise() %>% dplyr::mutate(Total = sum(`On & Desk`, `On & Pocket/Bag`, `On & Outside`, `Off & Desk`, `Off & Pocket/Bag`, `Off & Outside`, na.rm = T))

des_demo_nom_lang <- all_data_demo_des %>% select(condition, lang) %>% group_by(condition, lang) %>% dplyr::count() %>% spread(condition, n) %>% dplyr::rename(Variable = lang) %>% rowwise() %>% dplyr::mutate(Total = sum(`On & Desk`, `On & Pocket/Bag`, `On & Outside`, `Off & Desk`, `Off & Pocket/Bag`, `Off & Outside`, na.rm = T))

des_demo_nom_program <- all_data_demo_des %>% select(condition, program) %>% group_by(condition, program) %>% dplyr::count() %>% spread(condition, n) %>% dplyr::rename(Variable = program) %>% rowwise() %>% dplyr::mutate(Total = sum(`On & Desk`, `On & Pocket/Bag`, `On & Outside`, `Off & Desk`, `Off & Pocket/Bag`, `Off & Outside`, na.rm = T))

des_demo_nom_year <- all_data_demo_des %>% select(condition, year) %>% group_by(condition, year) %>% dplyr::count() %>% spread(condition, n) %>% dplyr::rename(Variable = year) %>% rowwise() %>% dplyr::mutate(Total = sum(`On & Desk`, `On & Pocket/Bag`, `On & Outside`, `Off & Desk`, `Off & Pocket/Bag`, `Off & Outside`, na.rm = T))

des_demo_nom_tot <- bind_cols(tibble(Variable = "n"),
                                all_data_demo_des %>% select(condition) %>% group_by(condition) %>% dplyr::count() %>% spread(condition, n) %>% 
                                rowwise() %>% dplyr::mutate(Total = sum(`On & Desk`, `On & Pocket/Bag`, `On & Outside`, 
                                                                 `Off & Desk`, `Off & Pocket/Bag`, `Off & Outside`, na.rm = T))
                              )

# bind all variables into one data frame -- "des_demo_nom"
des_demo_nom <- 
  rbind(des_demo_nom_tot, des_demo_nom_bal, des_demo_nom_gender, des_demo_nom_lang, des_demo_nom_year, des_demo_nom_program)
# change "NA" variables to "No Selection"
des_demo_nom$Variable[which(is.na(des_demo_nom$Variable))] <- c(rep("No Selection", length(des_demo_nom$Variable[which(is.na(des_demo_nom$Variable))])))
```

</details>

```{r show_des_demo_nom, echo=FALSE, message=FALSE, warning=FALSE}
# show "NA" as "0"
options(knitr.kable.NA = 0)  

# show table using kable
kable(des_demo_nom, caption = "Demographic Data by Condition and Overall: Sample Size, Balance, Gender, Language, Program, and Year", align = rep('c'), col.names = c(" ", rep(c(" On Desk", "In Pocket/Bag", "Outside"), 2), "TOTAL"), row.names = c(1:7), digits = 2) %>% 
  footnote(general = "Nominal descriptive statistics shown as frequencies. Row 1 depicts sample size across all variables. 'TOTAL' column depicts frequencies across all conditions.") %>% 
  pack_rows("Sample Size", 1, 1) %>% 
  pack_rows("Balance", 2, 3) %>% 
  pack_rows("Gender", 4, 5) %>% 
  pack_rows("First Language", 6, 7) %>% 
  pack_rows("Year", 8, 12) %>% 
  pack_rows("Program", 13, 21) %>% 
  add_header_above(c(" ", " ", "ON" = 3,  "OFF" = 3, " "), bold = T) %>%
  row_spec(0, bold = T) %>% 
  column_spec(6, border_left = T) %>%
  column_spec(9, border_left = T) %>%
  kable_styling(bootstrap_options = "striped", "hover", full_width = T) %>%
  kable_classic()

# restore global R options
options(knitr.kable.NA = NA)  

# kable(list(des_cont, des_cont))
```

## Task Data

<details>

<summary>

The following table depicts the **OSpan and Go/No-Go variable(s)** (i.e., OSpan Absolute Score, OSpan Total Correct, OSpan Math Percentage Correct, Go/No-Go Score, Go/No-Go Omission Errors, Go/No-Go Average Reaction Time) from all participants in the present study *(click to see code)*.

</summary>

```{r create_des_task_cont, message=FALSE, warning=FALSE, include=FALSE}
# create a data frame with the descriptive stats of all the data, GROUPED by the location & power conditions
  # this creates information for the 6 possible combined conditions

# not divided by cue type
des_task_cont <- 
  # create this as a data frame rather than matrix
  as.data.frame(all_data_task_des %>% 
                  # select variables & the condition columns
                  select(-condition) %>% 
                  # get descriptive stats & output as a matrix ("mat = T")
                  describeBy(group = list(all_data_task_des$location, all_data_task_des$power), mat = T)
                )

# divided by cue type -- for GnG error analysis
# for the vertical/go cue type
des_task_cont_cue_v <- 
  # create this as a data frame rather than matrix
  as.data.frame(gng_err_analysis %>% 
                  # select variables & the condition columns
                  select(c(condition, cue, gng_err_omi_long)) %>% 
                  # filter for vertical/go cue type
                  filter(cue == 1) %>%
                  # get descriptive stats & output as a matrix ("mat = T")
                  describeBy(group = list(filter(gng_err_analysis, cue == 1)$condition), mat = T)
                )
# for the horizontal/no-go cue type
des_task_cont_cue_h <- 
  # create this as a data frame rather than matrix
  as.data.frame(gng_err_analysis %>% 
                  # select variables & the condition columns
                  select(c(condition, cue, gng_err_omi_long)) %>% 
                  # filter for horizontal/no-go cue type
                  filter(cue == 2) %>%
                  # get descriptive stats & output as a matrix ("mat = T")
                  describeBy(group = list(filter(gng_err_analysis, cue == 1)$condition), mat = T)
                )

# create a data frame with the descriptive stats of all the data, ACROSS the location & power conditions

# not divided by cue type
des_task_cont_across <- 
  # get descriptive stats for variables
    # create this as a data frame rather than matrix
  as.data.frame(psych::describe(select(all_data_task_des, -condition)))

# divided by cue type -- for GnG error analysis
# for the vertical/go cue type
des_task_cont_across_cue_v <- 
  # get descriptive stats for variables
    # create this as a data frame rather than matrix
  as.data.frame(psych::describe(select(filter(gng_err_analysis, cue == 1), c(condition, cue, gng_err_omi_long))))
# for the horizontal/no-go cue type
des_task_cont_across_cue_h <- 
  # get descriptive stats for variables
    # create this as a data frame rather than matrix
  as.data.frame(psych::describe(select(filter(gng_err_analysis, cue == 2), c(condition, cue, gng_err_omi_long))))
  

# organize the data into a new data frame depicting the M, SD, min., max., and n.
des_task_cont_t <- 
  as.data.frame(cbind(rep(c("M", "SD", "min.", "max.", "n"), 8), 
                      rbind(c(des_task_cont[c("ospan_absolute_score1", "ospan_absolute_score2", "ospan_absolute_score3", 
                                              "ospan_absolute_score4", "ospan_absolute_score5", "ospan_absolute_score6"),]$mean), 
                            c(des_task_cont[c("ospan_absolute_score1", "ospan_absolute_score2", "ospan_absolute_score3",
                                              "ospan_absolute_score4", "ospan_absolute_score5", "ospan_absolute_score6"),]$sd), 
                            c(des_task_cont[c("ospan_absolute_score1", "ospan_absolute_score2", "ospan_absolute_score3",
                                              "ospan_absolute_score4", "ospan_absolute_score5", "ospan_absolute_score6"),]$min), 
                            c(des_task_cont[c("ospan_absolute_score1", "ospan_absolute_score2", "ospan_absolute_score3",
                                              "ospan_absolute_score4", "ospan_absolute_score5", "ospan_absolute_score6"),]$max), 
                            c(des_task_cont[c("ospan_absolute_score1", "ospan_absolute_score2", "ospan_absolute_score3",
                                              "ospan_absolute_score4", "ospan_absolute_score5", "ospan_absolute_score6"),]$n), 
                            c(des_task_cont[c("ospan_total_corr1", "ospan_total_corr2", "ospan_total_corr3", 
                                              "ospan_total_corr4", "ospan_total_corr5", "ospan_total_corr6"),]$mean), 
                            c(des_task_cont[c("ospan_total_corr1", "ospan_total_corr2", "ospan_total_corr3",
                                              "ospan_total_corr4", "ospan_total_corr5", "ospan_total_corr6"),]$sd), 
                            c(des_task_cont[c("ospan_total_corr1", "ospan_total_corr2", "ospan_total_corr3",
                                              "ospan_total_corr4", "ospan_total_corr5", "ospan_total_corr6"),]$min), 
                            c(des_task_cont[c("ospan_total_corr1", "ospan_total_corr2", "ospan_total_corr3",
                                              "ospan_total_corr4", "ospan_total_corr5", "ospan_total_corr6"),]$max), 
                            c(des_task_cont[c("ospan_total_corr1", "ospan_total_corr2", "ospan_total_corr3",
                                              "ospan_total_corr4", "ospan_total_corr5", "ospan_total_corr6"),]$n), 
                            c(des_task_cont[c("ospan_math_percent_corr1", "ospan_math_percent_corr2", "ospan_math_percent_corr3", 
                                              "ospan_math_percent_corr4", "ospan_math_percent_corr5", "ospan_math_percent_corr6"),]$mean), 
                            c(des_task_cont[c("ospan_math_percent_corr1", "ospan_math_percent_corr2", "ospan_math_percent_corr3",
                                              "ospan_math_percent_corr4", "ospan_math_percent_corr5", "ospan_math_percent_corr6"),]$sd), 
                            c(des_task_cont[c("ospan_math_percent_corr1", "ospan_math_percent_corr2", "ospan_math_percent_corr3",
                                              "ospan_math_percent_corr4", "ospan_math_percent_corr5", "ospan_math_percent_corr6"),]$min), 
                            c(des_task_cont[c("ospan_math_percent_corr1", "ospan_math_percent_corr2", "ospan_math_percent_corr3",
                                              "ospan_math_percent_corr4", "ospan_math_percent_corr5", "ospan_math_percent_corr6"),]$max), 
                            c(des_task_cont[c("ospan_math_percent_corr1", "ospan_math_percent_corr2", "ospan_math_percent_corr3",
                                              "ospan_math_percent_corr4", "ospan_math_percent_corr5", "ospan_math_percent_corr6"),]$n), 
                            c(des_task_cont[c("gng_score1", "gng_score2", "gng_score3", 
                                              "gng_score4", "gng_score5", "gng_score6"),]$mean), 
                            c(des_task_cont[c("gng_score1", "gng_score2", "gng_score3",
                                              "gng_score4", "gng_score5", "gng_score6"),]$sd), 
                            c(des_task_cont[c("gng_score1", "gng_score2", "gng_score3",
                                              "gng_score4", "gng_score5", "gng_score6"),]$min), 
                            c(des_task_cont[c("gng_score1", "gng_score2", "gng_score3",
                                              "gng_score4", "gng_score5", "gng_score6"),]$max), 
                            c(des_task_cont[c("gng_score1", "gng_score2", "gng_score3",
                                              "gng_score4", "gng_score5", "gng_score6"),]$n), 
                            c(des_task_cont[c("gng_err_omi1", "gng_err_omi2", "gng_err_omi3", 
                                              "gng_err_omi4", "gng_err_omi5", "gng_err_omi6"),]$mean), 
                            c(des_task_cont[c("gng_err_omi1", "gng_err_omi2", "gng_err_omi3",
                                              "gng_err_omi4", "gng_err_omi5", "gng_err_omi6"),]$sd), 
                            c(des_task_cont[c("gng_err_omi1", "gng_err_omi2", "gng_err_omi3",
                                              "gng_err_omi4", "gng_err_omi5", "gng_err_omi6"),]$min), 
                            c(des_task_cont[c("gng_err_omi1", "gng_err_omi2", "gng_err_omi3",
                                              "gng_err_omi4", "gng_err_omi5", "gng_err_omi6"),]$max), 
                            c(des_task_cont[c("gng_err_omi1", "gng_err_omi2", "gng_err_omi3",
                                              "gng_err_omi4", "gng_err_omi5", "gng_err_omi6"),]$n), 
                            c(des_task_cont_cue_v[c("gng_err_omi_long1", "gng_err_omi_long2", "gng_err_omi_long3",
                                                    "gng_err_omi_long4", "gng_err_omi_long5", "gng_err_omi_long6"),]$mean), 
                            c(des_task_cont_cue_v[c("gng_err_omi_long1", "gng_err_omi_long2", "gng_err_omi_long3",
                                                    "gng_err_omi_long4", "gng_err_omi_long5", "gng_err_omi_long6"),]$sd), 
                            c(des_task_cont_cue_v[c("gng_err_omi_long1", "gng_err_omi_long2", "gng_err_omi_long3",
                                                    "gng_err_omi_long4", "gng_err_omi_long5", "gng_err_omi_long6"),]$min), 
                            c(des_task_cont_cue_v[c("gng_err_omi_long1", "gng_err_omi_long2", "gng_err_omi_long3",
                                                    "gng_err_omi_long4", "gng_err_omi_long5", "gng_err_omi_long6"),]$max), 
                            c(des_task_cont_cue_v[c("gng_err_omi_long1", "gng_err_omi_long2", "gng_err_omi_long3",
                                                    "gng_err_omi_long4", "gng_err_omi_long5", "gng_err_omi_long6"),]$n),
                            c(des_task_cont_cue_h[c("gng_err_omi_long1", "gng_err_omi_long2", "gng_err_omi_long3",
                                                    "gng_err_omi_long4", "gng_err_omi_long5", "gng_err_omi_long6"),]$mean), 
                            c(des_task_cont_cue_h[c("gng_err_omi_long1", "gng_err_omi_long2", "gng_err_omi_long3",
                                                    "gng_err_omi_long4", "gng_err_omi_long5", "gng_err_omi_long6"),]$sd), 
                            c(des_task_cont_cue_h[c("gng_err_omi_long1", "gng_err_omi_long2", "gng_err_omi_long3",
                                                    "gng_err_omi_long4", "gng_err_omi_long5", "gng_err_omi_long6"),]$min), 
                            c(des_task_cont_cue_h[c("gng_err_omi_long1", "gng_err_omi_long2", "gng_err_omi_long3",
                                                    "gng_err_omi_long4", "gng_err_omi_long5", "gng_err_omi_long6"),]$max), 
                            c(des_task_cont_cue_h[c("gng_err_omi_long1", "gng_err_omi_long2", "gng_err_omi_long3",
                                                    "gng_err_omi_long4", "gng_err_omi_long5", "gng_err_omi_long6"),]$n), 
                            c(des_task_cont[c("gng_rt_av1", "gng_rt_av2", "gng_rt_av3", 
                                              "gng_rt_av4", "gng_rt_av5", "gng_rt_av6"),]$mean), 
                            c(des_task_cont[c("gng_rt_av1", "gng_rt_av2", "gng_rt_av3",
                                              "gng_rt_av4", "gng_rt_av5", "gng_rt_av6"),]$sd), 
                            c(des_task_cont[c("gng_rt_av1", "gng_rt_av2", "gng_rt_av3",
                                              "gng_rt_av4", "gng_rt_av5", "gng_rt_av6"),]$min), 
                            c(des_task_cont[c("gng_rt_av1", "gng_rt_av2", "gng_rt_av3",
                                              "gng_rt_av4", "gng_rt_av5", "gng_rt_av6"),]$max), 
                            c(des_task_cont[c("gng_rt_av1", "gng_rt_av2", "gng_rt_av3",
                                              "gng_rt_av4", "gng_rt_av5", "gng_rt_av6"),]$n)
                            ),
                      c(des_task_cont_across["ospan_absolute_score", c("mean", "sd", "min", "max", "n")], 
                        des_task_cont_across["ospan_total_corr", c("mean", "sd", "min", "max", "n")], 
                        des_task_cont_across["ospan_math_percent_corr", c("mean", "sd", "min", "max", "n")], 
                        des_task_cont_across["gng_score", c("mean", "sd", "min", "max", "n")], 
                        des_task_cont_across["gng_err_omi", c("mean", "sd", "min", "max", "n")], 
                        des_task_cont_across_cue_v["gng_err_omi_long", c("mean", "sd", "min", "max", "n")],
                        des_task_cont_across_cue_h["gng_err_omi_long", c("mean", "sd", "min", "max", "n")],
                        des_task_cont_across["gng_rt_av", c("mean", "sd", "min", "max", "n")])
                      )
                )
# rename columns  
colnames(des_task_cont_t) = c("var", "dON", "pbON", "oON", "dOFF", "pbOFF", "oOFF", "overall")
# change all columns except "var" to numeric
des_task_cont_t$dON <- as.numeric(des_task_cont_t$dON)
des_task_cont_t$pbON <- as.numeric(des_task_cont_t$pbON)
des_task_cont_t$oON <- as.numeric(des_task_cont_t$oON)
des_task_cont_t$dOFF <- as.numeric(des_task_cont_t$dOFF)
des_task_cont_t$pbOFF <- as.numeric(des_task_cont_t$pbOFF)
des_task_cont_t$oOFF <- as.numeric(des_task_cont_t$oOFF)
des_task_cont_t$overall <- as.numeric(des_task_cont_t$overall)
# rename rows
rownames(des_task_cont_t) = c(1:nrow(des_task_cont_t))

# write.table(des_task_cont_t,"/Users/anaceciliaruizpardo/Downloads/df.txt",sep="|")
# x = read.table("/Users/anaceciliaruizpardo/Downloads/df.txt",sep="|")
# write.csv(x, "Path/df.csv")
```

</details>

```{r show_des_task_cont_t, echo=FALSE, message=FALSE, warning=FALSE}
# show table using kable
kable(des_task_cont_t, caption = "Task Data by Condition and Overall: OSpan Absolute Score, OSpan Total Correct, OSpan Math Percentage Correct, Go/No-Go Score, Go/No-Go Omission Errors, Go/No-Go Average Reaction Time", digits = 2, align = rep('c'), col.names = c(" ", rep(c("On Desk", "In Pocket/Bag", "Outside"),2), "TOTAL"), row.names = c(1:nrow(des_task_cont_t))) %>% 
  footnote(general = "Continuous descriptive statistics shown for Task data (i.e., OSpan Task and Cue-Dependent Go/No-Go Task). 'TOTAL' column shows statistics across all conditions.") %>% 
  pack_rows("OSpan Task ", 1, 15) %>% 
  pack_rows("Absolute Score", 1, 5) %>% 
  pack_rows("Total Score", 6, 10) %>% 
  pack_rows("Math Correct (%)", 11, 15) %>% 
  pack_rows(" Cue-Dependent Go/No-Go Task", 16, 40) %>% 
  pack_rows("Total Score", 16, 20) %>% 
  pack_rows("Omission Errors", 21, 25) %>% 
  pack_rows("Omission Errors for 'Go' Trials", 26, 30) %>% 
  pack_rows("Omission Errors for 'No-Go' Trials", 31, 35) %>% 
  pack_rows("Average Reaction Time", 36, 40) %>% 
  # add_header_above(c(" ", " ", "Smartphone Location" = 6, " ")) %>%
  add_header_above(c(" "," ", "ON" = 3, "OFF" = 3, " "), bold = T) %>%
  row_spec(0, bold = T) %>%
  column_spec(6, border_left = T) %>%
  column_spec(9, border_left = T) %>%
  kable_styling(bootstrap_options = "striped", "hover", full_width = F) %>%
  kable_classic()
```

## Questionnaire Data

The following table depicts the **SUQ and SAD variable(s)** for both *continuous* (i.e., SUQ score for smartphone use with absence of other stimulation, SUQ score for smartphone use during other actions, SAD score) and *nominal* (i.e., Average text messages per day, Average social media based messages sent per day, Average social media based posts per day, Smartphone subjective value, Types of notifications, Phantom vibrations) variables from all participants in the present study *(click to see code)*.

The **continuous** data includes:

-   SUQ score for smartphone use with absence of other stimulation

-   SUQ score for smartphone use during other actions

-   SAD score (i.e., attachment and dependency to one's smartphone):

    -   $SAD_{overall}$: SAD score for all 13 items

    -   $SAD_{Dep}$: SAD score for the *Smartphone Dependence* subscale (items: 1, 2, 3, 4, 5)

    -   $SAD_{EA}$: SAD score for the *Emotional Attachment* subscale (items: 8, 9, 10, 11)

    -   $SAD_{Dist}$: SAD score for the *Smartphone Distractibility* subscale (items: 7, 12, 13)

        <details>

        <summary>

        This code organized the continuous data *(click to see code)*. \</summary)

    ```{r create_des_ques_cont, message=FALSE, warning=FALSE, include=FALSE}
    # create a data frame with the descriptive stats of all the data, GROUPED by the location & power conditions
      # this creates information for the 6 possible combined conditions
    des_ques_cont <- 
      # create this as a data frame rather than matrix
      as.data.frame(all_data_ques_des %>% 
                      # select variables & the condition columns
                      select(c(location, power, SUQ_abext_score, SUQ_duract_score, SAD_score, SAD_dep_score, SAD_ea_score, SAD_access_score, SAD_dist_score)) %>% 
                      # get descriptive stats & output as a matrix ("mat = T")
                      describeBy(group = list(all_data_ques_des$location, all_data_ques_des$power), mat = T)
                    )

    # create a data frame with the descriptive stats of all the data, ACROSS the location & power conditions
    des_ques_cont_across <- 
      # get descriptive stats for variables
        # create this as a data frame rather than matrix
      as.data.frame(psych::describe(select(all_data_ques_des, c(location, power, SUQ_abext_score, SUQ_duract_score, SAD_score, SAD_dep_score, SAD_ea_score, SAD_dist_score))))
      
    # organize the data into a new data frame depicting the M, SD, min., max., and n.
    des_ques_cont_t <- 
      as.data.frame(cbind(rep(c("M", "SD", "min.", "max.", "n"), 3), 
                          rbind(c(des_ques_cont[c("SUQ_abext_score1", "SUQ_abext_score2", "SUQ_abext_score3", 
                                                  "SUQ_abext_score4", "SUQ_abext_score5", "SUQ_abext_score6"),]$mean), 
                                c(des_ques_cont[c("SUQ_abext_score1", "SUQ_abext_score2", "SUQ_abext_score3",
                                                  "SUQ_abext_score4", "SUQ_abext_score5", "SUQ_abext_score6"),]$sd), 
                                c(des_ques_cont[c("SUQ_abext_score1", "SUQ_abext_score2", "SUQ_abext_score3",
                                                  "SUQ_abext_score4", "SUQ_abext_score5", "SUQ_abext_score6"),]$min), 
                                c(des_ques_cont[c("SUQ_abext_score1", "SUQ_abext_score2", "SUQ_abext_score3",
                                                  "SUQ_abext_score4", "SUQ_abext_score5", "SUQ_abext_score6"),]$max), 
                                c(des_ques_cont[c("SUQ_abext_score1", "SUQ_abext_score2", "SUQ_abext_score3",
                                                  "SUQ_abext_score4", "SUQ_abext_score5", "SUQ_abext_score6"),]$n), 
                                c(des_ques_cont[c("SUQ_duract_score1", "SUQ_duract_score2", "SUQ_duract_score3", 
                                                  "SUQ_duract_score4", "SUQ_duract_score5", "SUQ_duract_score6"),]$mean), 
                                c(des_ques_cont[c("SUQ_duract_score1", "SUQ_duract_score2", "SUQ_duract_score3",
                                                  "SUQ_duract_score4", "SUQ_duract_score5", "SUQ_duract_score6"),]$sd), 
                                c(des_ques_cont[c("SUQ_duract_score1", "SUQ_duract_score2", "SUQ_duract_score3",
                                                  "SUQ_duract_score4", "SUQ_duract_score5", "SUQ_duract_score6"),]$min), 
                                c(des_ques_cont[c("SUQ_duract_score1", "SUQ_duract_score2", "SUQ_duract_score3",
                                                  "SUQ_duract_score4", "SUQ_duract_score5", "SUQ_duract_score6"),]$max), 
                                c(des_ques_cont[c("SUQ_duract_score1", "SUQ_duract_score2", "SUQ_duract_score3",
                                                  "SUQ_duract_score4", "SUQ_duract_score5", "SUQ_duract_score6"),]$n),
                                c(des_ques_cont[c("SAD_score1", "SAD_score2", "SAD_score3", 
                                                  "SAD_score4", "SAD_score5", "SAD_score6"),]$mean), 
                                c(des_ques_cont[c("SAD_score1", "SAD_score2", "SAD_score3",
                                                  "SAD_score4", "SAD_score5", "SAD_score6"),]$sd), 
                                c(des_ques_cont[c("SAD_score1", "SAD_score2", "SAD_score3",
                                                  "SAD_score4", "SAD_score5", "SAD_score6"),]$min), 
                                c(des_ques_cont[c("SAD_score1", "SAD_score2", "SAD_score3",
                                                  "SAD_score4", "SAD_score5", "SAD_score6"),]$max), 
                                c(des_ques_cont[c("SAD_score1", "SAD_score2", "SAD_score3",
                                                  "SAD_score4", "SAD_score5", "SAD_score6"),]$n),
                                c(des_ques_cont[c("SAD_dep_score1", "SAD_dep_score2", "SAD_dep_score3", 
                                                  "SAD_dep_score4", "SAD_dep_score5", "SAD_dep_score6"),]$mean), 
                                c(des_ques_cont[c("SAD_dep_score1", "SAD_dep_score2", "SAD_dep_score3",
                                                  "SAD_dep_score4", "SAD_dep_score5", "SAD_dep_score6"),]$sd), 
                                c(des_ques_cont[c("SAD_dep_score1", "SAD_dep_score2", "SAD_dep_score3",
                                                  "SAD_dep_score4", "SAD_dep_score5", "SAD_dep_score6"),]$min), 
                                c(des_ques_cont[c("SAD_dep_score1", "SAD_dep_score2", "SAD_dep_score3",
                                                  "SAD_dep_score4", "SAD_dep_score5", "SAD_dep_score6"),]$max), 
                                c(des_ques_cont[c("SAD_dep_score1", "SAD_dep_score2", "SAD_dep_score3",
                                                  "SAD_dep_score4", "SAD_dep_score5", "SAD_dep_score6"),]$n), 
                                c(des_ques_cont[c("SAD_ea_score1", "SAD_ea_score2", "SAD_ea_score3", 
                                                  "SAD_ea_score4", "SAD_ea_score5", "SAD_ea_score6"),]$mean), 
                                c(des_ques_cont[c("SAD_ea_score1", "SAD_ea_score2", "SAD_ea_score3",
                                                  "SAD_ea_score4", "SAD_ea_score5", "SAD_ea_score6"),]$sd), 
                                c(des_ques_cont[c("SAD_ea_score1", "SAD_ea_score2", "SAD_ea_score3",
                                                  "SAD_ea_score4", "SAD_ea_score5", "SAD_ea_score6"),]$min), 
                                c(des_ques_cont[c("SAD_ea_score1", "SAD_ea_score2", "SAD_ea_score3",
                                                  "SAD_ea_score4", "SAD_ea_score5", "SAD_ea_score6"),]$max), 
                                c(des_ques_cont[c("SAD_ea_score1", "SAD_ea_score2", "SAD_ea_score3",
                                                  "SAD_ea_score4", "SAD_ea_score5", "SAD_ea_score6"),]$n), 
                                c(des_ques_cont[c("SAD_access_score1", "SAD_access_score2", "SAD_access_score3", 
                                                  "SAD_access_score4", "SAD_access_score5", "SAD_access_score6"),]$mean), 
                                c(des_ques_cont[c("SAD_access_score1", "SAD_access_score2", "SAD_access_score3",
                                                  "SAD_access_score4", "SAD_access_score5", "SAD_access_score6"),]$sd), 
                                c(des_ques_cont[c("SAD_access_score1", "SAD_access_score2", "SAD_access_score3",
                                                  "SAD_access_score4", "SAD_access_score5", "SAD_access_score6"),]$min), 
                                c(des_ques_cont[c("SAD_access_score1", "SAD_access_score2", "SAD_access_score3",
                                                  "SAD_access_score4", "SAD_access_score5", "SAD_access_score6"),]$max), 
                                c(des_ques_cont[c("SAD_access_score1", "SAD_access_score2", "SAD_access_score3",
                                                  "SAD_access_score4", "SAD_access_score5", "SAD_access_score6"),]$n),
                                c(des_ques_cont[c("SAD_dist_score1", "SAD_dist_score2", "SAD_dist_score3", 
                                                  "SAD_dist_score4", "SAD_dist_score5", "SAD_dist_score6"),]$mean), 
                                c(des_ques_cont[c("SAD_dist_score1", "SAD_dist_score2", "SAD_dist_score3",
                                                  "SAD_dist_score4", "SAD_dist_score5", "SAD_dist_score6"),]$sd), 
                                c(des_ques_cont[c("SAD_dist_score1", "SAD_dist_score2", "SAD_dist_score3",
                                                  "SAD_dist_score4", "SAD_dist_score5", "SAD_dist_score6"),]$min), 
                                c(des_ques_cont[c("SAD_dist_score1", "SAD_dist_score2", "SAD_dist_score3",
                                                  "SAD_dist_score4", "SAD_dist_score5", "SAD_dist_score6"),]$max), 
                                c(des_ques_cont[c("SAD_dist_score1", "SAD_dist_score2", "SAD_dist_score3",
                                                  "SAD_dist_score4", "SAD_dist_score5", "SAD_dist_score6"),]$n)
                                ),
                          c(des_ques_cont_across["SUQ_abext_score", c("mean", "sd", "min", "max", "n")], 
                            des_ques_cont_across["SUQ_duract_score", c("mean", "sd", "min", "max", "n")], 
                            des_ques_cont_across["SAD_score", c("mean", "sd", "min", "max", "n")], 
                            des_ques_cont_across["SAD_dep_score", c("mean", "sd", "min", "max", "n")], 
                            des_ques_cont_across["SAD_ea_score", c("mean", "sd", "min", "max", "n")], 
                            des_ques_cont_across["SAD_dist_score", c("mean", "sd", "min", "max", "n")]
                            )
                          )
                    )
    # rename columns  
    colnames(des_ques_cont_t) <-  c("var", "dON", "pbON", "oON", "dOFF", "pbOFF", "oOFF", "overall")
    # change all columns except "var" to numeric
    des_ques_cont_t$dON <- as.numeric(des_ques_cont_t$dON)
    des_ques_cont_t$pbON <- as.numeric(des_ques_cont_t$pbON)
    des_ques_cont_t$oON <- as.numeric(des_ques_cont_t$oON)
    des_ques_cont_t$dOFF <- as.numeric(des_ques_cont_t$dOFF)
    des_ques_cont_t$pbOFF <- as.numeric(des_ques_cont_t$pbOFF)
    des_ques_cont_t$oOFF <- as.numeric(des_ques_cont_t$oOFF)
    des_ques_cont_t$overall <- as.numeric(des_ques_cont_t$overall)
    # rename rows
    rownames(des_ques_cont_t) = c(1:nrow(des_ques_cont_t))
    ```

</details>

```{r show_des_ques_cont_t, echo=FALSE, message=FALSE, warning=FALSE}
# show table using kable
kable(des_ques_cont_t, caption = "Questionnaire Data by Condition and Overall: Continuous Variables", digits = 2, align = rep('c'), col.names = c(" ", rep(c("On Desk", "In Pocket/Bag", "Outside"),2), "TOTAL"), row.names = c(1:nrow(des_ques_cont_t))) %>% 
  footnote(general = "Continuous descriptive statistics shown for Questionnaire data (i.e., SUQ score for smartphone use with absence of other stimulation, SUQ score for smartphone use during other actions, SAD score). 'TOTAL' column shows statistics across all conditions.") %>% 
  pack_rows("Smartphone Usage Questionnaire (SUQ)", 1, 10) %>% 
  pack_rows("Tendency to use one's smartphone in the abscence of other stimulation", 1, 5) %>% 
  pack_rows("Tendency to use one's smartphone in the during other actions", 6, 10) %>% 
  pack_rows("Smartphone Attachment and Dependency Inventory (SAD)", 11, 35) %>% 
  pack_rows("Overall: Items #1-13", 11, 15) %>% 
  pack_rows("Smartphone Dependence: Items #1-3", 16, 20) %>% 
  pack_rows("Emotional Attachment: Items #8-11", 21, 25) %>% 
  pack_rows("Accessibility: Items #5, 6, 7", 26, 30) %>% 
  pack_rows("Smartphone Distractability: Items #12, 13", 31, 35) %>% 
  # add_header_above(c(" ", " ", "Smartphone Location" = 6, " ")) %>%
  add_header_above(c(" "," ", "ON" = 3, "OFF" = 3, " "), bold = T) %>%
  row_spec(0, bold = T) %>%
  column_spec(6, border_left = T) %>%
  column_spec(9, border_left = T) %>%
  kable_styling(bootstrap_options = "striped", "hover", full_width = F) %>%
  kable_classic()
```

\
The **nominal** data includes:

-   Average text messages per day
-   Average social media based messages sent per day
-   Average social media based posts per day
-   Smartphone subjective value
-   Types of notifications
-   Phantom vibrations

<details>

<summary>

This code organized the nominal data *(click to see code)*. \</summary)

```{r create_des_ques_nom, message=FALSE, warning=FALSE, include=FALSE}
#nom (SUQ use1, SUQ use2, SUQ use3, SUQ exp1, SUQ exp2, SUQ exp4)
# create counts for each nominal variable divided by condition
  # details shown for first variable only des_nom
des_ques_nom_text <- 
  all_data_ques_des %>% 
  # select only the variables of interest: condition & SUQ_use_1
  select(condition, SUQ_use_1) %>%
  # divide by the condition type (i.e., mix of smartphone power and location)
  group_by(condition, SUQ_use_1) %>% 
  # count the frequencies for SUQ_use_1
  dplyr::count() %>% 
  # transpose the table
  spread(condition, n) %>% 
  # rename the column to join later
  dplyr::rename(Variable = SUQ_use_1) %>% 
  # for each row...
  rowwise() %>% 
  # add sum column across conditions
  dplyr::mutate(Total = sum(c_across(`1`:`6`), na.rm = T))
  
   all_data_temp %>% rowwise() %>% dplyr::mutate(SAD_score = sum(c_across(SAD_1:SAD_13)))

des_ques_nom_message <- all_data_ques_des %>% select(condition, SUQ_use_2) %>% group_by(condition, SUQ_use_2) %>% dplyr::count() %>% spread(condition, n) %>% dplyr::rename(Variable = SUQ_use_2) %>% rowwise() %>% dplyr::mutate(Total = sum(`1`, `2`, `3`, `4`, `5`, `6`, na.rm = T))

des_ques_nom_post <- all_data_ques_des %>% select(condition, SUQ_use_3) %>% group_by(condition, SUQ_use_3) %>% dplyr::count() %>% spread(condition, n) %>% dplyr::rename(Variable = SUQ_use_3) %>% rowwise() %>% dplyr::mutate(Total = sum(`1`, `2`, `3`, `4`, `5`, `6`, na.rm = T))

des_ques_nom_value <- all_data_ques_des %>% select(condition, SUQ_exp_1) %>% group_by(condition, SUQ_exp_1) %>% dplyr::count() %>% spread(condition, n) %>% dplyr::rename(Variable = SUQ_exp_1) %>% rowwise() %>% dplyr::mutate(Total = sum(`1`, `2`, `3`, `4`, `5`, `6`, na.rm = T))

des_nom_note <- 
  all_data_ques_des %>%
  select(condition, SUQ_exp_2) %>% 
  # split up multiple selections for each participant to count over all -- notification types
  separate_rows(SUQ_exp_2) %>%
  group_by(condition, SUQ_exp_2) %>% dplyr::count() %>% spread(condition, n) %>% 
  dplyr::rename(Variable = SUQ_exp_2) %>% rowwise() %>% dplyr::mutate(Total = sum(`1`, `2`, `3`, `4`, `5`, `6`, na.rm = T)) %>% 
  dplyr::mutate(Variable = factor(Variable, levels = c(1:7), labels = c("Email", "Facebook", "Twitter", "Instagram", "LinkedIn", "Snapchat", "Other")))

des_ques_nom_phantom <- all_data_ques_des %>% select(condition, SUQ_exp_4) %>% group_by(condition, SUQ_exp_4) %>% dplyr::count() %>% spread(condition, n) %>% dplyr::rename(Variable = SUQ_exp_4) %>% rowwise() %>% dplyr::mutate(Total = sum(`1`, `2`, `3`, `4`, `5`, `6`, na.rm = T)) %>% mutate(Variable = factor(Variable, levels = c(1:2), labels = c("Experienced", "Not Experienced")))

des_ques_nom_tot <- bind_cols(tibble(Variable = "n"), 
                         all_data_ques_des %>% group_by(condition) %>% dplyr::summarise(n = n()) %>% spread(condition, n) %>% rowwise() %>% 
                           dplyr::mutate(Total = sum(`1`, `2`, `3`, `4`, `5`, `6`, na.rm = T))
                         )

# bind all variables into one data frame -- "des_ques_nom"
des_ques_nom <- 
  rbind(des_ques_nom_tot, des_ques_nom_text, des_ques_nom_message, des_ques_nom_post, des_ques_nom_value, des_nom_note, des_ques_nom_phantom)

colnames(des_ques_nom) <- c("var", "dON", "pbON", "oON", "dOFF", "pbOFF", "oOFF", "overall")
```

</details>

```{r show_des_ques_nom, echo=FALSE, message=FALSE, warning=FALSE}
# show table using kable
kable(des_ques_nom, caption = "Questionnaire Data by Condition and Overall: Nominal Variables", digits = 2, align = rep('c'), col.names = c(" ", rep(c("On Desk", "In Pocket/Bag", "Outside"),2), "TOTAL"), row.names = c(1:nrow(des_ques_nom))) %>% 
  footnote(general = "Continuous descriptive statistics shown for Questionnaire data (i.e., Sample size, Average text messages per day, Average social media based messages sent per day, Average social media based posts per day, Smartphone subjective value, Types of notifications, Phantom vibrations). 'TOTAL' column shows statistics across all conditions.", 
           symbol = c("Each participant could choose multiple types of notifications.")) %>% 
  pack_rows("Sample Size", 1, 1) %>% 
  pack_rows("Smartphone Usage Questionnaire (SUQ)", 2, 26) %>% 
  pack_rows("Average Text Messages per Day", 2, 5) %>% 
  pack_rows("Average Social Media Based Messages Sent per Day", 6, 9) %>% 
  pack_rows("Average Social Media Based Posts per Day", 10, 13) %>% 
  pack_rows("Smartphone Subjective Value", 14, 17) %>% 
  pack_rows("Types of Notifications*", 18, 24) %>% 
  pack_rows("Phantom Vibrations", 25, 26) %>% 
  # add_header_above(c(" ", " ", "Smartphone Location" = 6, " ")) %>%
  add_header_above(c(" "," ", "ON" = 3, "OFF" = 3, " "), bold = T) %>%
  row_spec(0, bold = T) %>%
  column_spec(6, border_left = T) %>%
  column_spec(9, border_left = T) %>%
  kable_styling(bootstrap_options = "striped", "hover", full_width = F) %>%
  kable_classic()
```

# Planned Analyses

The following analyses were completed based on a pre-registered direct replication on OSF (view registration: <https://osf.io/ubys7/?view_only=5db76bba381a4dc4aac130f7cd3a0de5>) of Ward et al.'s (2017) second experiment. The analyses tested the three main hypotheses in the present study: (1) the location effect hypothesis, (2) the power effect hypothesis, and (3) the moderation effect hypothesis.

Each analysis contains the following:

a.  Analysis *(including any relevant assumption tests)*
b.  Visualizations

*Note: Descriptive statistics are shown in the [Descriptive Statistics section](#DS)*

## 1. & 2. Testing the Location and Power Hypotheses:

### OSpan Task

A **3(smartphone location: desk, pocket/bag, outside) x 2(smartphone power: ON, OFF)** between-subjects ANOVA was conducted *(DV = OSpan absolute score)*.

Prior to completing the analysis, planned exclusion criteria was applied to the data. Exclusion criteria for this analysis helped to control for any participants who did not follow the instructions. Specifically, only participants who payed attention to both the math component and letter recall component of the task were included. Only data from those who scored 85% or higher on the math component of the task were included. View the [Data Cleaning section](#all_data_DC) under the data [Construct Working Dataframes](#makeDFs) heading above. There was a total of `r nrow(ospan_analysis)` participants in the OSpan analysis.

#### Analysis

To assess if OSpan absolute score statistically differed, we conducted a 3x2 ANOVA with the between-subjects factors of smartphone location (i.e., on desk, in pocket/bag, or outside) and power (i.e., on or off). Since there are unequal sample sizes, we used Type III sum of squares.

##### Assumptions

<details>

<summary>

The ANOVA made three assumptions: independent random sampling, normality, and homogeneity of variance. All assumptions were met *(click to see details)*.

</summary>

<details>

<summary>

[1. Independent Random Sampling]{style="color:#4F2683; font-size:18px;"}

This assumption was met during testing.

[2. Normality]{style="color:#4F2683; font-size:18px;"}

This assumption was tested by visualizing the residuals, applying a Shapiro-Wilk test to the residuals, and observing the Skewness and Kurtosis of the data *(extracted from the ANOVA output; click to see code)*.

</summary>

```{r create_ospan_anova&res, message=FALSE, warning=FALSE}

# get the required data from the ospan_analysis data: participant, location, power, and score
ospan_anova_data = ospan_analysis[, c("participant", "location", "power", "ospan_absolute_score")]
    colnames(ospan_anova_data) = c("participant", "location", "power", "score")
    rownames(ospan_anova_data) = NULL
    # reformat location,  power, and balance as factors  
    ospan_anova_data$location = factor(ospan_anova_data$location, 
                  levels = c(1, 2, 3),
                  labels = c("On Desk", "Pocket/Bag", "Outside"))
    ospan_anova_data$power = factor(ospan_anova_data$power, 
                  levels = c(1, 2),
                  labels = c("ON", "OFF"))

# run between-subjects ANOVA (IV: Smartphone Power & Location, Balance; DV: OSpan Abolute Score)
ospan_anova = ezANOVA(
  data = ospan_anova_data
  , dv = .(score)
  , wid = .(participant)
  , between = .(location, power)
  , type = 3 # unequal sample sizes, therfore use type 3
  , detailed = TRUE
  , return_aov = TRUE
)

# calculate & extract the residuals from the ANOVA
ospan_res = data.frame("residuals" = proj(ospan_anova$aov)[,"Residuals"])   

```

</details>

-   It appears that the residuals lie closely to a straight line, which suggests normality *(see visualization below)*.

```{r show_ospan_qq, message=FALSE, warning=FALSE}

qqnorm(ospan_res$residuals, main = "Q-Q Plot of the OSpan Residuals") 
qqline(ospan_res$residuals)

```

-   It appears that residuals tend to follow the shape of a normal distribution, with a slight skew to the right *(see visualization below)*.

```{r show_ospan_res_plot, message=FALSE, warning=FALSE}

qplot(ospan_res$residuals, main = "Histogram of Ospan Residuals", binwidth = 5) + 
  theme_classic()

```

-   The Shapiro-Wilk test of normality was significant (based on $\alpha = 0.05$), *W* = `r round(as.numeric(ospan_shap$statistic["W"]), 2)`, *p* `r p_round(ospan_shap$p.value)` *(see code below)*.

```{r create&show_shap, message=FALSE, warning=FALSE}

ospan_shap = shapiro.test(ospan_res$residuals)
ospan_shap

```

<details>

<summary>

-   The Skewness and Kurtosis were examined and showed that both the Skewness (range: `r round(min(as.numeric(ospan_sk_t[c(1,4),])), 2)` to `r round(max(as.numeric(ospan_sk_t[c(1,4),])), 2)`) and Kurtosis (range: `r round(min(as.numeric(ospan_sk_t[c(2,5),])), 2)` to `r round(max(as.numeric(ospan_sk_t[c(2,5),])), 2)`) were within the acceptable range of $\pm 2.0$ and $\pm 9.0$, respectively (Schmider, Ziegler, Danay, Beyer, & Bühner, 2010) *(see table below; click to see code)*.

    </summary>

```{r create_ospan_sk_t, message=FALSE, warning=FALSE}
# use the "ospan_des" data frame from the descriptive stats

# organize the data into a new data frame depicting the Skew, kurtosis, and sample size (n).
  ospan_sk_t = rbind((cbind(t(des_task_cont["ospan_absolute_score1", c("skew", "kurtosis", "n")]),
                             t(des_task_cont["ospan_absolute_score2", c("skew", "kurtosis", "n")]),
                             t(des_task_cont["ospan_absolute_score3", c("skew", "kurtosis", "n")]))), 
                      (cbind(t(des_task_cont["ospan_absolute_score4", c("skew", "kurtosis", "n")]),
                             t(des_task_cont["ospan_absolute_score5", c("skew", "kurtosis", "n")]),
                             t(des_task_cont["ospan_absolute_score6", c("skew", "kurtosis", "n")]))))
    colnames(ospan_sk_t) = c("On Desk", "In Pocket/Bag", "Outside")
    rownames(ospan_sk_t) = rep(c("Skewness", "Kurtosis", "n"), 2)
```

</details>

```{r show_ospan_sk_t, message=FALSE, warning=FALSE}
# use kable to show the descriptive table
  kable(ospan_sk_t, caption = "Skewness and Kurtosis of the Absolute Score on the OSpan Task", digits = 2, align = 'c') %>%
  add_header_above(c(" ", "Smartphone Location" = 3), bold = T) %>%
  pack_rows("Smartphone Power ON", 1, 3) %>% 
  pack_rows("Smartphone Power OFF", 4, 6) %>%
  row_spec(0, bold = T) %>% 
  kable_styling(bootstrap_options = "striped", "hover", full_width = F) %>%
  kable_classic()
```

-   Therefore, based on all the normality assumption checks, normality was assumed to be within the acceptable range for the ANOVA analysis. Additionally, it should be noted that ANOVA analyses can be robust to violations of normality (Gardner & Tremblay, 2007).

[3. Homogeneity of Variance]{style="color:#4F2683; font-size:18px;"}

This assumption was tested by conducting Levene's Test for Homogeneity of Variance for each between-subjects variable (i.e., smartphone location and smartphone power).

-   The homogeneity assumption was met (based on $\alpha = 0.05$) for smartphone location, *F*(`r ospan_lev_loc$Df[1]`, `r ospan_lev_loc$Df[2]`) = `r round(ospan_lev_loc[2][1,1], 2)`, *p* = `r p_round(ospan_lev_loc[3][1,1])` and smartphone power, *F*(`r ospan_lev_pow$Df[1]`, `r ospan_lev_pow$Df[2]`) = `r round(ospan_lev_pow[2][1,1], 2)`, *p* `r p_round(ospan_lev_pow[3][1,1])` *(see code below)*.

```{r create&show_levenes, message=FALSE, warning=FALSE}
# conduct levene's
  # for location
  ospan_lev_loc = leveneTest(data = ospan_anova_data, score ~ as.factor(location), center = mean)
  ospan_lev_loc
  # for power
  ospan_lev_pow = leveneTest(data = ospan_anova_data, score ~ as.factor(power), center = mean)
  ospan_lev_pow
```

</details>

##### Results

All results were based on $\alpha = 0.05$. Below are the reported statistics for both main effects and the interactions, followed by a table depicting the ANOVA results.

**Main Effect of Smartphone Location**

There was no significant main effect of smartphone location on Ospan absolute score, *F*(`r ospan_anova$ANOVA[2,2]`, `r ospan_anova$ANOVA[2,3]`) = `r round(ospan_anova$ANOVA[2,6], 2)`, *p* `r p_round(ospan_anova$ANOVA[2,7])`, $\eta^{2}$ `r p_round(ospan_anova$ANOVA[2,9])`.

**Main Effect of Smartphone Power**

There was no significant main effect of smartphone power on Ospan absolute score, *F*(`r ospan_anova$ANOVA[3,2]`, `r ospan_anova$ANOVA[3,3]`) = `r round(ospan_anova$ANOVA[3,6], 2)`, *p* `r p_round(ospan_anova$ANOVA[3,7])`, $\eta^{2}$ `r p_round(ospan_anova$ANOVA[3,9])`.

**Interaction of Smartphone Location & Smartphone Power**

There was no significant interaction between smartphone location and smartphone power on Ospan absolute score, *F*(`r ospan_anova$ANOVA[4,2]`, `r ospan_anova$ANOVA[4,3]`) = `r round(ospan_anova$ANOVA[4,6], 2)`, *p* `r p_round(ospan_anova$ANOVA[4,7])`, $\eta^{2}$ `r p_round(ospan_anova$ANOVA[4,9])`.

**Post-hoc Tests**

Since there was no significant main or interaction effects, no post-hocs were completed.

```{r show_ospan_anova_table, message=FALSE, warning=FALSE}
# show ANOVA results in kable table 
kable(ospan_anova$ANOVA, caption = "3x2 ANOVA on Ospan Absolute Score", digits = 4, align = 'c') %>%
  row_spec(0, bold = T) %>% 
  kable_styling(bootstrap_options = "striped", "hover", full_width = F) %>% 
  kable_classic()
```

#### Visualizations

<details>

<summary>

The following shows a visualization of the OSpan absolute score based on the two smartphone conditions: location and power. *(click to see code)*.

</summary>

```{r create_ospan_bar, message=FALSE, warning=FALSE}
# create data frame with location conditions as characters rather than integers 
  # First, get a subset of the descriptive data
  ospan_bar_data <- ospan_analysis[, c("location", "power", "ospan_absolute_score")]
    colnames(ospan_bar_data) = c("location", "power", "score")
    rownames(ospan_bar_data) = NULL
    
  ospan_bar_data$location = factor(ospan_bar_data$location, 
                  levels = c(1, 2, 3),
                  labels = c("Desk", "Pocket/Bag", "Outside"))
  ospan_bar_data$power = factor(ospan_bar_data$power, 
                  levels = c(1, 2),
                  labels = c("ON", "OFF"))
```

</details>

```{r show_ospan_bar, echo=TRUE, message=FALSE, warning=FALSE}
# plot bar graph - GOOD
ospan_bar <- 
  ggplot(ospan_bar_data, aes(x = location, y = score, fill = power), show.legend = FALSE) +
  geom_bar(stat = "summary", fun = mean, position = "dodge") + 
    guides(shape = FALSE) + 
  scale_fill_manual(values = c("cadetblue3", "deepskyblue4")) + 
  labs(
       title = "Performance on the OSpan Task",
       subtitle = "Average Absolute OSpan Score",
       caption = "Note: Score is the number of correct letters recalled based \n on if all letters in a given block were recalled correctly.",
       x = "Smartphone Location", 
       y = "OSpan Absolute Score", 
       fill = "Smartphone Power"
       ) +
        # the "\n" here denote that you want a new line formed in the text
  scale_y_continuous(limits = c(min(ospan_bar_data$score), max(ospan_bar_data$score)), breaks = ) +
  #scale_colour_manual(values = cols) +
  geom_point(stat = "identity", aes(color = power), size = 1, 
             position = position_jitterdodge(jitter.width = .53), show.legend = FALSE) +
  scale_color_manual(values = c("deepskyblue4", "purple4")) + 
  geom_point(stat="summary", fun=mean, position = position_dodge(.9), size = 1.5, show.legend = FALSE) +
  geom_errorbar(data = ospan_bar_data, stat = "summary", funmin = function(x) mean(x) - sd(x)/sqrt(length(x)), 
                funmax = function(x) mean(x) + sd(x)/sqrt(length(x)), size= .5, width= .25, position = position_dodge(.9)) +
  theme_classic() +
  theme(plot.title = element_text(color = "black", size = 14, face = "bold"), 
        plot.subtitle = element_text(color = "black", size = 13), 
        plot.caption = element_text(hjust = 0, size = 12, face = "italic"),
        text = element_text(size = 13))

# show fig
ospan_bar

# export plot
# ggsave(filename="fig_ospan_bar_june16", plot = ospan_bar, device = "jpeg", height = 5, width = 8, units = "in", dpi = 1000)
```

### Cue-Dependent Go/No-Go Task

An error (i.e., omission errors) and accuracy (i.e., RT) analysis was completed using the omission errors and average RT for each participant.

#### Error Analysis

A **3(Smartphone location: desk, pocket/bag, outside) x 2(Smartphone power: ON, OFF) x 2(Cue type: Vertical, Horizontal)** mixed ANOVA was completed *(DV = omission error(s))*. The between-subjects factors were smartphone location and power, and the within-subjects factor was pre-target cue type (i.e., "cue type"; either "vertical" or "horizontal").

Prior to completing the analysis, planned exclusion criteria was applied to the data. Exclusion criteria for this analysis helped to control for any participants who did not follow the instructions. Only data from those who scored higher than chance performance (i.e., responded to at least 50% of "Go" trials and withheld response to at least 50% of "No-Go" trials) were used for the final analysis. View the [Data Cleaning section](#all_data_DC) under the data [Construct Working Dataframes](#makeDFs) heading above. There was a total of `r nrow(gng_err_analysis)/2` participants in the Go/No-Go Error analysis.

##### Analysis

To assess if the omission errors statistically differed, we conducted a 3x2x2 mixed ANOVA with the between-subjects factors of smartphone location (i.e., on desk, in pocket/bag, or outside) and power (i.e. on or off), and the within-subjects factor of cue type (i.e., "vertical" or "horizontal"). Since there are unequal sample sizes, we used Type III sum of squares.

###### Assumptions

<details>

<summary>

The mixed ANOVA made four assumptions: independent random sampling, normality, homogeneity of variance (i.e., for the between-subject factors), and sphericity (i.e., for the within-subjects factor). Normality was outside of the acceptable range, however, ANOVA tests are typically robust to violations of normality (Gardner & Tremblay, 2007). The homogeneity of variance assumption was not met. 
Due to this and since there are unequal sample sizes, a White-corrected F-test was completed for the between-subject factors (i.e., smartphone location, power) *(click to see details)*.

</summary>

[1. Independent Random Sampling]{style="color:#4F2683; font-size:18px;"}

This assumption was met during testing.

[2. Normality]{style="color:#4F2683; font-size:18px;"}

<details>

<summary>

This assumption was tested by visualizing the residuals, applying a Shapiro-Wilk test to the resdiuals, and observing the Skewness and Kurtosis of the data *(extracted from the ANOVA output; click to see code).*

</summary>

```{r create_gng_err_anova, message=FALSE, warning=FALSE, error=FALSE}

# get the required data from the gng_err_analysis data: participant, location, power, and score
gng_err_anova_data = gng_err_analysis[,c("participant", "location", "power", "cue", "gng_err_omi_long")]
    colnames(gng_err_anova_data) = c("participant", "location", "power", "cue", "err_omi")
    rownames(gng_err_anova_data) = NULL
    gng_err_anova_data$location = as.factor(gng_err_anova_data$location)
    gng_err_anova_data$power = as.factor(gng_err_anova_data$power)
    gng_err_anova_data$cue = as.factor(gng_err_anova_data$cue)

# run mixed ANOVA (IV b/w: Smartphone Power & Location; IV w/in: Target Type; DV: Omission Errors)
gng_err_anova = ezANOVA(
  data = gng_err_anova_data
  , dv = .(err_omi)
  , wid = .(participant)
  , within = .(cue)
  , between = .(location, power)
  , type = 3 # unequal sample sizes, therefore use type 3
  , detailed = TRUE
  , return_aov = TRUE
)

# calculate & extract the residuals from the ANOVA
gng_err_res = data.frame("residuals" = proj(gng_err_anova$aov)[[3]][, "Residuals"])

```

</details>

-   It appears that the residuals lie closely to a straight line (i.e., only a couple lie away form the line), which suggests normality *(see visualization below)*.

```{r show_gng_err_qq, message=FALSE, warning=FALSE}

qqnorm(gng_err_res$residuals, main = "Q-Q Plot of the Go/No-Go Error Residuals") 
qqline(gng_err_res$residuals)

```

-   It appears that residuals are leptokurtic (i.e., there is a high peak near the center). This was expected since the Cue-Dependent Go/No-Go task is an easy response inhibition task, resulting in low error rates *(see visualization below)*.

```{r show_gng_err_res, message=FALSE, warning=FALSE}

qplot(gng_err_res$residuals, main = "Histogram of Go/No-Go Error Residuals", binwidth = .15) + 
  theme_classic()

```

-   The Shapiro-Wilk test of normality was significant (based on $\alpha = 0.05$), *W* = `r round(as.numeric(gng_err_shap$statistic["W"]), 2)`, *p* `r p_round(gng_err_shap$p.value)` *(see code below)*.

```{r show_gng_err_shap, message=FALSE, warning=FALSE}

gng_err_shap = shapiro.test(gng_err_res$residuals)
gng_err_shap

```

<details>

<summary>

-   The Skewness and Kurtosis was examined. For the vertical cue type, both the Skewness (range: `r round(min(as.numeric(gng_err_sk_v_t[c(1,4),])), 2)` to `r round(max(as.numeric(gng_err_sk_v_t[c(1,4),])), 2)`) and Kurtosis (range: `r round(min(as.numeric(gng_err_sk_v_t[c(2,5),])), 2)` to `r round(max(as.numeric(gng_err_sk_v_t[c(2,5),])), 2)`) were outside of the acceptable range of $\pm 2.0$ and $\pm 9.0$, respectively (Schmider, Ziegler, Danay, Beyer, & Bühner, 2010). For the horizontal cue type, both the Skewness (range: `r round(min(as.numeric(gng_err_sk_h_t[c(1,4),])), 2)` to `r round(max(as.numeric(gng_err_sk_h_t[c(1,4),])), 2)`) and Kurtosis (range: `r round(min(as.numeric(gng_err_sk_h_t[c(2,5),])), 2)` to `r round(max(as.numeric(gng_err_sk_h_t[c(2,5),])), 2)`) were outside of the acceptable range of $\pm 2.0$ and $\pm 9.0$, respectively (Schmider, Ziegler, Danay, Beyer, & Bühner, 2010) *(see table below)*.

    </summary>

```{r create_gng_err_sk_v_t&gng_err_sk_h_t, message=FALSE, warning=FALSE}
# create a table of the skewness & kurtosis for the gng data, separated by cue type (i.e. one for the vertical and horizontal trials)

#--FOR THE VERTICAL / Go CUE TYPE
# use the "des_task_cont_cue_v" data frame from the descriptives

# organize the data into a new data frame depicting the Skew, kurtosis, and sample size (n).
  gng_err_sk_v_t <- rbind((cbind(t(des_task_cont_cue_v["gng_err_omi_long1", c("skew", "kurtosis", "n")]),
                                 t(des_task_cont_cue_v["gng_err_omi_long2", c("skew", "kurtosis", "n")]),
                                 t(des_task_cont_cue_v["gng_err_omi_long3", c("skew", "kurtosis", "n")]))), 
                          (cbind(t(des_task_cont_cue_v["gng_err_omi_long4", c("skew", "kurtosis", "n")]),
                                 t(des_task_cont_cue_v["gng_err_omi_long5", c("skew", "kurtosis", "n")]),
                                 t(des_task_cont_cue_v["gng_err_omi_long6", c("skew", "kurtosis", "n")]))))
  colnames(gng_err_sk_v_t) = c("On Desk", "In Pocket/Bag", "Outside")
  rownames(gng_err_sk_v_t) = rep(c("Skewness", "Kurtosis", "n"), 2)
  
#--FOR THE HORIZONTAL / No-Go CUE TYPE
# use the "gng_err_des_h" data frame from the descriptives

# organize the data into a new data frame depicting the Skew, kurtosis, and sample size (n).
  gng_err_sk_h_t <- rbind((cbind(t(des_task_cont_cue_h["gng_err_omi_long1", c("skew", "kurtosis", "n")]),
                                 t(des_task_cont_cue_h["gng_err_omi_long2", c("skew", "kurtosis", "n")]),
                                 t(des_task_cont_cue_h["gng_err_omi_long3", c("skew", "kurtosis", "n")]))), 
                          (cbind(t(des_task_cont_cue_h["gng_err_omi_long4", c("skew", "kurtosis", "n")]),
                                 t(des_task_cont_cue_h["gng_err_omi_long5", c("skew", "kurtosis", "n")]),
                                 t(des_task_cont_cue_h["gng_err_omi_long6", c("skew", "kurtosis", "n")]))))
    colnames(gng_err_sk_h_t) = c("On Desk", "In Pocket/Bag", "Outside")
    rownames(gng_err_sk_h_t) = rep(c("Skewness", "Kurtosis", "n"), 2)
```

</details>

```{r show_gng_err_sk_v_t&gng_err_sk_h_t, message=FALSE, warning=FALSE}
# show a table of the skewness & kurtosis for the gng data, separated by cue type (i.e. one for the vertical and horizontal trials)
    
# for vertical cue type
# use kable to show the descriptives table
kable(gng_err_sk_v_t, caption = "Skewness and Kurtosis of the Omission Errors on the Go/No-Go Task for 'Go' (Vertical) Trials", digits = 2, align = 'c') %>%
  add_header_above(c(" ", "Smartphone Location" = 3), bold = T) %>%
  pack_rows("Smartphone Power ON", 1, 3) %>% 
  pack_rows("Smartphone Power OFF", 4, 6) %>%
  row_spec(0, bold = T) %>% 
  kable_styling(bootstrap_options = "striped", "hover", full_width = F) %>%
  kable_classic()
    
# for horizontal cue type
# use kable to show the descriptives table
kable(gng_err_sk_h_t, caption = "Skewness and Kurtosis of the Omission Errors on the Go/No-Go Task for 'No-Go' (Horizonatal) Trials", digits = 2, align = 'c') %>%
  add_header_above(c(" ", "Smartphone Location" = 3), bold = T) %>%
  pack_rows("Smartphone Power ON", 1, 3) %>% 
  pack_rows("Smartphone Power OFF", 4, 6) %>%
  row_spec(0, bold = T) %>% 
  kable_styling(bootstrap_options = "striped", "hover", full_width = F) %>%
  kable_classic()
```

-   Therefore, based on all the normality assumption checks, normality was not assumed to be within the acceptable range, however, ANOVA tests are typically robust to violations of normality (Gardner & Tremblay, 2007).

[3. Homogeneity of Variance]{style="color:#4F2683; font-size:18px;"}

-   This assumption was tested by conducting Levene's Test for Homogeneity of Variance for each between-subjects variable (i.e., smartphone location, smartphone power, and balance).

-   The homogeneity assumption was met (based on $\alpha = 0.05$) for smartphone power, *F*(`r gng_err_lev_pow$Df[1]`, `r gng_err_lev_pow$Df[2]`) = `r round(gng_err_lev_pow[2][1,1], 2)`, *p* `r p_round(gng_err_lev_pow[3][1,1])`. However, it was not met for smartphone location, *F*(`r gng_err_lev_loc$Df[1]`, `r gng_err_lev_loc$Df[2]`) = `r round(gng_err_lev_loc[2][1,1], 2)`, *p* `r p_round(gng_err_lev_loc[3][1,1])`,*(see code below)*.

```{r show_gng_err_levenes, message=FALSE, warning=FALSE}

# conduct levene's
  # for location
  gng_err_lev_loc = leveneTest(data = gng_err_anova_data, err_omi ~ as.factor(location), center = mean)
  gng_err_lev_loc
  # for power
  gng_err_lev_pow = leveneTest(data = gng_err_anova_data, err_omi ~ as.factor(power), center = mean)
  gng_err_lev_pow

```

[4. Sphericity]{style="color:DarkBlue; font-size:18px;"}

This assumption was met since sphericity is always met for two levels of a repeated measure factor and is, therefore, unnecessary to evaluate (Girden, 1992).

</details>

###### Results

<details>

<summary>

Since the homogeneity assumption was not met for the mixed ANOVA, a White-corrected F-test was completed. The results from this analysis are shown below, followed by a table depicting the statistics *(click to see code)*.

</summary>

Since the ezANOVA function can only complete a White-corrected F-test on between-subject factors, a separate ANOVA was computed using this correction for the between-subject effects. Additionally, the effect sizes were compluted separately using the EtaSq function. 

```{r create_gng_err_robust, message=FALSE, warning=FALSE, include=FALSE, error=FALSE}
# complete white-corr ANOVA for b/w sub factors (location & power)
gng_err_anova_white <- 
  ezANOVA(data = gng_err_anova_data, 
          dv = .(err_omi), 
          wid = .(participant), 
          between = .(location, power), 
          detailed = TRUE, 
          type = "III", # for unequal sample sizes
          return_aov = TRUE, 
          white.adjust = TRUE, # add white-correction
          within_full = .(cue) # let function know the full ANOVA includes this w/in-sub factor
          )
# compute effect size
gng_err_anova_white_es <- EtaSq(gng_err_anova_white$aov, type = 3, anova = FALSE)

```

</details>

<details>

<summary>

All results were base on $\alpha = 0.05$. Below are the reported statistics for both main effects and the interaction, followed by a table depicting the ANOVA results *(click to see code)*.

**Main Effect of Smartphone Location**

There was no significant main effect of smartphone location on omission errors, *F*(`r gng_err_anova_t$DFn[1]`, `r gng_err_anova_t$DFd[1]`) = `r round(gng_err_anova_t[1,4], 2)`, *p* `r p_round(gng_err_anova_t$p[1])`, $\eta_{G}^2$ = `r p_round(gng_err_anova_t$ges[1])`.

**Main Effect of Smartphone Power**

There was no significant main effect of smartphone power on omission errors, *F*(`r gng_err_anova_t$DFn[2]`, `r gng_err_anova_t$DFd[2]`) = `r round(gng_err_anova_t[2,4], 2)`, *p* `r p_round(gng_err_anova_t$p[2])`, $\eta_{G}^2$ = `r p_round(gng_err_anova_t$ges[2])`.

**Main Effect of Smartphone Cue Type**

There was a significant main effect of cue type on omission errors, *F*(`r gng_err_anova_t$DFn[3]`, `r gng_err_anova_t$DFd[3]`) = `r round(gng_err_anova_t[3,4], 2)`, *p* `r p_round(gng_err_anova_t$p[3])`, $\eta_{G}^2$ = `r p_round(gng_err_anova_t$ges[3])`.

**Interaction of Smartphone Location & Power**

There was no significant interaction between smartphone location and power on omission errors, *F*(`r gng_err_anova_t$DFn[4]`, `r gng_err_anova_t$DFd[4]`) = `r round(gng_err_anova_t[4,4], 2)`, *p* `r p_round(gng_err_anova_t$p[4])`, $\eta_{G}^2$ = `r p_round(gng_err_anova_t$ges[4])`.

**Interaction of Smartphone Location & Cue Type**

There was a significant interaction between smartphone location and cue type on omission errors, *F*(`r gng_err_anova_t$DFn[5]`, `r gng_err_anova_t$DFd[5]`) = `r round(gng_err_anova_t[5,4], 2)`, *p* `r p_round(gng_err_anova_t$p[5])`, $\eta_{G}^2$ = `r p_round(gng_err_anova_t$ges[5])`. Due to a significant interaction, the significant main effect of cue type was not explored further. 

**Interaction of Smartphone Power & Cue Type**

There was no significant interaction between smartphone power and cue type on omission errors, *F*(`r gng_err_anova_t$DFn[6]`, `r gng_err_anova_t$DFd[6]`) = `r round(gng_err_anova_t[6,4], 2)`, *p* `r p_round(gng_err_anova_t$p[6])`, $\eta_{G}^2$ = `r p_round(gng_err_anova_t$ges[6])`.

**Interaction of Smartphone Location, Smartphone Power, & Cue Type**

There was no significant interaction between smartphone location, smartphone power, and cue type on omission errors, *F*(`r gng_err_anova_t$DFn[7]`, `r gng_err_anova_t$DFd[7]`) = `r round(gng_err_anova_t[7,4], 2)`, *p* `r p_round(gng_err_anova_t$p[7])`, $\eta_{G}^2$ = `r p_round(gng_err_anova_t$ges[7])`.

**Post-hoc Tests**

Since there was a significant interaction between cue type and smartphone location, a post-hocs assessing the simple main effects of cue type (i.e., vertical/go, horizontal/no-go) across the levels of smartphone location (i.e., desk, pocket/bag, outside) was completed. 

</summary>

```{r create_gng_err_robust_table, message=FALSE, warning=FALSE}
# create data frame from mixed ANOVA results to display (including both white-corr for b/w sub effects)
gng_err_anova_t <- 
  data.frame("effect" = gng_err_anova$ANOVA$Effect[2:8], 
             "DFn" = gng_err_anova$ANOVA$DFn[2:8], 
             "DFd" = gng_err_anova$ANOVA$DFd[2:8], 
             "F" = c(gng_err_anova_white$ANOVA$F[2:3], gng_err_anova$ANOVA$F[4], gng_err_anova_white$ANOVA$F[4], gng_err_anova$ANOVA$F[6:8]), 
             "p" = c(gng_err_anova_white$ANOVA$p[2:3], gng_err_anova$ANOVA$p[4], gng_err_anova_white$ANOVA$p[4], gng_err_anova$ANOVA$p[6:8]), 
             "p<.05" = c(gng_err_anova_white$ANOVA$`p<.05`[2:3], gng_err_anova$ANOVA$`p<.05`[4], gng_err_anova_white$ANOVA$`p<.05`[4], gng_err_anova$ANOVA$`p<.05`[6:8]), 
             "ges" = c(gng_err_anova_white_es[1:2, 2], gng_err_anova$ANOVA$ges[4], gng_err_anova_white_es[3, 2], gng_err_anova$ANOVA$ges[6:8])
             )
# rename rows
rownames(gng_err_anova_t) = NULL
```

</details>

```{r show_gng_err_robust_table, message=FALSE, warning=FALSE}
# show ANOVA results in kable table 
kable(gng_err_anova_t, caption = "3x2x2 Mixed ANOVA on Omission Error in the Go/No-Go Task", digits = 4, align = 'c', col.names = c("Effect", "DFn", "DFd", "F", "p", "p < .05", "ges")) %>%
  row_spec(0, bold = T) %>% 
  footnote(general = "Note: between-subject effects (i.e., location, power, location:power) are White-corrected.") %>% 
  kable_styling(bootstrap_options = "striped", "hover", full_width = F) %>% 
  kable_classic()
```

###### Post-Hocs

An ANOVA was conducted to investigate the effect of cue type on omission errors for each smartphone location separately (with a Holm-Bonferroni adjustment). 

###### PH: Assumptions

<details>

<summary>

The two between-subjects ANOVA made three assumptions: independent random sampling, normality, and homogeneity of variance. For both ANOVAs, normality was outside of the acceptable range, however, ANOVA tests are typically robust to violations of normality (Gardner & Tremblay, 2007). The homogeneity of variance assumption was met for the horizontal (i.e., "no-go") cue type, but not for the vertical (i.e., "go") cue type. Due to this and since there are unequal sample sizes, a White-corrected F-test was completed for the vertical (i.e., "go") cue type ANOVA and a normal ANOVA for the horizontal (i.e., "no-go") cue type *(click to see details)*.

</summary>

[1. Independent Random Sampling]{style="color:#4F2683; font-size:18px;"}

This assumption was met during testing for both ANOVAs.

[2. Normality]{style="color:#4F2683; font-size:18px;"}

<details>

<summary>

This assumption was tested by visualizing the residuals, applying a Shapiro-Wilk test to the residuals, and observing the Skewness and Kurtosis of the data *(extracted from the ANOVA output; click to see code).*

</summary>

```{r create_gng_err_anova_V&H, message=FALSE, warning=FALSE, error=FALSE}

# get the required data from the gng_err_analysis data: participant, location, and score
  # for vertical/go trials
  gng_err_anova_data_V <- 
    gng_err_anova_data %>% 
    # get cue type = "go" or vertical
    filter(cue == 1) %>% 
    # remove power factor
    select(-power)
  # for horizontal/no-go trials
  gng_err_anova_data_H <- 
    gng_err_anova_data %>% 
    # get cue type = "no-go" or horizontal 
    filter(cue == 2) %>% 
    # remove power factor
    select(-power)

# run ANOVA (IV: Smartphone Location; DV: Omission Errors)
  # for vertical/go trials
  gng_err_anova_V = ezANOVA(
    data = gng_err_anova_data_V
    , dv = .(err_omi)
    , wid = .(participant)
    , between = .(location)
    , type = 3 # unequal sample sizes, therefore use type 3
    , detailed = TRUE
    , return_aov = TRUE
    )
  # for horizontal/no-go trials
  gng_err_anova_H = ezANOVA(
    data = gng_err_anova_data_H
    , dv = .(err_omi)
    , wid = .(participant)
    , between = .(location)
    , type = 3 # unequal sample sizes, therefore use type 3
    , detailed = TRUE
    , return_aov = TRUE
    )

# calculate & extract the residuals from the ANOVA
  # for vertical/go trials
  gng_err_res_V = data.frame("residuals" = gng_err_anova_V$aov$residuals)
  # for horizontal/no-go trials
  gng_err_res_H = data.frame("residuals" = gng_err_anova_H$aov$residuals)

```

</details>

-   It appears that the residuals do not lie closely to a straight line: the data are heavy-tailed on the right *(see visualization below)*.

```{r show_gng_err_V&H_qq, message=FALSE, warning=FALSE}
# for vertical/go trials
qqnorm(gng_err_res_V$residuals, main = "Q-Q Plot of the Go/No-Go Error Residuals for Vertical/Go Trials") 
qqline(gng_err_res_V$residuals)

# for horizontal/no-go trials
qqnorm(gng_err_res_H$residuals, main = "Q-Q Plot of the Go/No-Go Error Residuals for Horizontal/No-Go Trials") 
qqline(gng_err_res_H$residuals)

```

-   It appears that residuals are leptokurtic (i.e., there is a high peak near the center). This was expected since the Cue-Dependent Go/No-Go task is an easy response inhibition task, resulting in low error rates *(see visualization below)*.

```{r show_gng_err_V&H histograms, message=FALSE, warning=FALSE}
# for vertical/go trials
qplot(gng_err_res_V$residuals, main = "Histogram of Go/No-Go Error Residuals for Vertical/Go Trials", binwidth = .5) + 
  theme_classic()
# for horizontal/no-go trials
qplot(gng_err_res_H$residuals, main = "Histogram of Go/No-Go Error Residuals for Horizontal/No-Go Trials", binwidth = .5) + 
  theme_classic()
```

-   The Shapiro-Wilk test of normality was significant (based on $\alpha = 0.05$) for the vertical (i.e., "go") trials, *W* = `r round(as.numeric(gng_err_shap_V$statistic["W"]), 2)`, *p* `r p_round(gng_err_shap_V$p.value)`, and horizontal (i.e., "no-go") trials, *W* = `r round(as.numeric(gng_err_shap_H$statistic["W"]), 2)`, *p* `r p_round(gng_err_shap_H$p.value)` *(see code below)*.

```{r show_gng_err_shap_V&H, message=FALSE, warning=FALSE}
# for vertical/go trials
gng_err_shap_V <- shapiro.test(gng_err_res_V$residuals)
gng_err_shap_V
# for horizontal/no-go trials
gng_err_shap_H <- shapiro.test(gng_err_res_H$residuals)
gng_err_shap_H
```

<details>

<summary>

-   The Skewness and Kurtosis was examined. For the vertical cue type, both the Skewness (range: `r round(min(as.numeric(gng_err_sk_v_t[c(1,4),])), 2)` to `r round(max(as.numeric(gng_err_sk_v_t[c(1,4),])), 2)`) and Kurtosis (range: `r round(min(as.numeric(gng_err_sk_v_t[c(2,5),])), 2)` to `r round(max(as.numeric(gng_err_sk_v_t[c(2,5),])), 2)`) were outside of the acceptable range of $\pm 2.0$ and $\pm 9.0$, respectively (Schmider, Ziegler, Danay, Beyer, & Bühner, 2010). For the horizontal cue type, both the Skewness (range: `r round(min(as.numeric(gng_err_sk_h_t[c(1,4),])), 2)` to `r round(max(as.numeric(gng_err_sk_h_t[c(1,4),])), 2)`) and Kurtosis (range: `r round(min(as.numeric(gng_err_sk_h_t[c(2,5),])), 2)` to `r round(max(as.numeric(gng_err_sk_h_t[c(2,5),])), 2)`) were outside of the acceptable range of $\pm 2.0$ and $\pm 9.0$, respectively (Schmider, Ziegler, Danay, Beyer, & Bühner, 2010) *(see table below)*.
</summary>

</details>

```{r show_gng_err_sk_v_t&gng_err_sk_h_t2, message=FALSE, warning=FALSE}
### NOTE: same table as before post-hocs

# show a table of the skewness & kurtosis for the gng data, separated by cue type (i.e. one for the vertical and horizontal trials)

# for vertical cue type
# use kable to show the descriptives table
kable(gng_err_sk_v_t, caption = "Skewness and Kurtosis of the Omission Errors on the Go/No-Go Task for 'Go' (Vertical) Trials", digits = 2, align = 'c') %>%
  add_header_above(c(" ", "Smartphone Location" = 3), bold = T) %>%
  pack_rows("Smartphone Power ON", 1, 3) %>% 
  pack_rows("Smartphone Power OFF", 4, 6) %>%
  row_spec(0, bold = T) %>%
  kable_styling(bootstrap_options = "striped", "hover", full_width = F) %>%
  kable_classic()

# for horizontal cue type
# use kable to show the descriptives table
kable(gng_err_sk_h_t, caption = "Skewness and Kurtosis of the Omission Errors on the Go/No-Go Task for 'No-Go' (Horizonatal) Trials", digits = 2, align = 'c') %>%
  add_header_above(c(" ", "Smartphone Location" = 3), bold = T) %>%
  pack_rows("Smartphone Power ON", 1, 3) %>% 
  pack_rows("Smartphone Power OFF", 4, 6) %>%
  row_spec(0, bold = T) %>%
  kable_styling(bootstrap_options = "striped", "hover", full_width = F) %>%
  kable_classic()
```

-   Therefore, based on all the normality assumption checks, normality was not assumed to be within the acceptable range, however, ANOVA tests are typically robust to violations of normality (Gardner & Tremblay, 2007).

[3. Homogeneity of Variance]{style="color:#4F2683; font-size:18px;"}

-   This assumption was tested by conducting Levene's Test for Homogeneity of Variance for vertical (i.e., "go") trials and horizontal (i.e., "no-go") trials.

-   The homogeneity assumption was met (based on $\alpha = 0.05$) for the horizontal (i.e., "no-go") cue type, *F*(`r gng_err_lev_H$Df[1]`, `r gng_err_lev_H$Df[2]`) = `r round(gng_err_lev_H[2][1,1], 2)`, *p* `r p_round(gng_err_lev_H[3][1,1])`. However, it was not met for vertical (i.e., "go") cue type, *F*(`r gng_err_lev_V$Df[1]`, `r gng_err_lev_V$Df[2]`) = `r round(gng_err_lev_V[2][1,1], 2)`, *p* `r p_round(gng_err_lev_V[3][1,1])`,*(see code below)*.

```{r show_gng_err_lev_V&H, message=FALSE, warning=FALSE}

# conduct levene's
  # for location
  gng_err_lev_V = leveneTest(data = gng_err_anova_data_V, err_omi ~ as.factor(location), center = mean)
  gng_err_lev_V
  # for power
  gng_err_lev_H = leveneTest(data = gng_err_anova_data_H, err_omi ~ as.factor(location), center = mean)
  gng_err_lev_H

```

</details>

###### PH: SME Results

<details>

<summary>

Since the homogeneity assumption was not met for the vertical (i.e., "go") cue type ANOVA, a White-corrected F-test was completed. The ANOVA for the horizontal (i.e., "no-g") cue type was completed without a correction. Both ANOVAs had a Holm-Bonferroni adjustment. The results from this analysis are shown below, followed by a table depicting the statistics *(click to see code)*.

</summary>

Run the white-corrected ANOVA for the vertical (i.e., "go") cue type ANOVA. 

```{r create_gng_err_anova_V_white, message=FALSE, warning=FALSE, include=FALSE, error=FALSE}
# complete white-corr ANOVA for b/w sub factors (location & power)
gng_err_anova_V_white <- 
  ezANOVA(data = gng_err_anova_data_V, 
          dv = .(err_omi), 
          wid = .(participant), 
          between = .(location), 
          detailed = TRUE, 
          type = "III", # for unequal sample sizes
          return_aov = TRUE, 
          white.adjust = TRUE # add white-correction
          )
# compute effect size
gng_err_anova_V_white_es <- EtaSq(gng_err_anova_V_white$aov, type = 3, anova = FALSE)
```

Complete the Holm-Bonferroni adjustment for both ANOVAs.
```{r create_gng_err_anova_cue_padj}
# get corrected p-values for each ANOVA
gng_err_anova_cue_padj <- 
  p.adjust(c(gng_err_anova_V_white$ANOVA$p[2], # p-value from white-corrected V/go ANOVA
             gng_err_anova_H$ANOVA$p[2]), # p-value from H/no-go ANOVA
         method = c("holm"), # method: Holm-Bonferroni
         n = 2 # number of comparisons
         )

```

</details>

<details>

<summary>

All results were base on $\alpha = 0.05$. Below are the reported statistics for the main effects for each SME ANOVA, followed by a table depicting the SME ANOVA results *(click to see code)*.


**Cue Type: "Go" / Vertical**

***Main Effect of Smartphone Location***

There was no significant main effect of smartphone location on omission errors for the vertical/"go" cue type, *F*(`r gng_err_anova_cue_t$DFn[1]`, `r gng_err_anova_cue_t$DFd[1]`) = `r round(gng_err_anova_cue_t$F[1], 2)`, *p* `r p_round(gng_err_anova_cue_t$p[1])`, $\eta_{G}^2$ = `r p_round(gng_err_anova_cue_t$ges[1])`.

**Cue Type: "No-Go" / Horizontal**

***Main Effect of Smartphone Location***

There was no significant main effect of smartphone location on omission errors for the horizontal/"no-go" cue type, *F*(`r gng_err_anova_cue_t$DFn[2]`, `r gng_err_anova_cue_t$DFd[2]`) = `r round(gng_err_anova_cue_t$F[2], 2)`, *p* `r p_round(gng_err_anova_cue_t$p[2])`, $\eta_{G}^2$ = `r p_round(gng_err_anova_cue_t$ges[2])`.

**Post-hoc Tests**

Since there was no significant main effect for either cue type, no post-hocs were completed. 

</summary>

```{r create_gng_err_anova_cue_t, message=FALSE, warning=FALSE}
# create data frame from mixed ANOVA results to display (including both white-corr for b/w sub effects)
gng_err_anova_cue_t <- 
  data.frame("effect" = c(gng_err_anova_V_white$ANOVA$Effect[2], gng_err_anova_H$ANOVA$Effect[2]), 
             "DFn" = c(gng_err_anova_V_white$ANOVA$DFn[2], gng_err_anova_H$ANOVA$DFn[2]), 
             "DFd" = c(gng_err_anova_V_white$ANOVA$DFd[2], gng_err_anova_H$ANOVA$DFd[2]), 
             "F" = c(gng_err_anova_V_white$ANOVA$F[2], gng_err_anova_H$ANOVA$F[2]), 
             "p" = gng_err_anova_cue_padj,
             "p<.05" = c(gng_err_anova_V_white$ANOVA$`p<.05`[2], gng_err_anova_H$ANOVA$`p<.05`[2]), 
             "ges" = c(gng_err_anova_V_white_es[2], gng_err_anova_H$ANOVA$ges[2])
             )
# rename rows
rownames(gng_err_anova_cue_t) = NULL
```

</details>

```{r show_gng_err_anova_cue_t, message=FALSE, warning=FALSE}
# show ANOVA results in kable table 
kable(gng_err_anova_cue_t, caption = "Tests of simple main effects on Go/No-Go omission errors for cue type (i.e., 'go' or 'no-go') across smartphone location levels (i.e., desk, pocket/bag, outside).", digits = 4, align = 'c', col.names = c("Effect", "DFn", "DFd", "F", "p", "p < .05", "ges")) %>%
  row_spec(0, bold = T) %>% 
  pack_rows("Cue Type: 'Go'", 1, 1) %>% 
  pack_rows("Cue Type: 'No-Go'", 2, 2) %>% 
  footnote(general = "Note: 'Go' cue type effect was White-corrected. Holm-Bonferroni adjusted p-values depicted.") %>% 
  kable_styling(bootstrap_options = "striped", "hover", full_width = F) %>% 
  kable_classic()
```


##### Visualizations

The following shows a visualization of the Go/No-Go omission error data based on the three factors: smartphone location, smartphone power, and cue type. 

- To visualize the mixed design, two plots were created (i.e., one for each cue type).

```{r gng_err_plot_separate, message=FALSE, warning=FALSE, out.width=c('50%', '50%'), fig.show='hold'}
#--FOR THE VERTICAL CUE TYPE-- 

# create data frame with location conditions as characters rather than integers 
  # First, get a subset of the data
  gng_err_bar_v_data = gng_err_analysis[gng_err_analysis$cue == 1, c("location", "power", "gng_err_omi_long")]
    colnames(gng_err_bar_v_data) = c("location", "power", "err_omi")
    rownames(gng_err_bar_v_data) = NULL 
  # reformat location & power as factors  
  gng_err_bar_v_data$location = factor(gng_err_bar_v_data$location, 
                  levels = c(1, 2, 3),
                  labels = c("On Desk", "Pocket/Bag", "Outside"))
  gng_err_bar_v_data$power = factor(gng_err_bar_v_data$power, 
                  levels = c(1, 2),
                  labels = c("ON", "OFF"))
       
# create plot
gng_err_bar_v = ggplot(gng_err_bar_v_data, aes(x = location, y = err_omi, fill = power), show.legend = FALSE) +
  geom_bar(stat = "summary", fun = "mean", position = "dodge") + 
    guides(shape = FALSE) + 
  scale_fill_manual(values = c("cadetblue3", "deepskyblue4")) + 
  labs(title = "Performance on the Go/No-Go Task: 'Vertical' Cue Type",
       subtitle = "Omission Errors",
       x = "Smartphone Location", 
       y = "Omission Error", 
       fill = "Smartphone Power",
       caption = paste0(strwrap("Figure X. Distributions of scores on the vertical cue type on the Cue-Dependent Go/No-Go Task, arranged by smartphone location (along the x-axis) and smartphone power (different bars). Omission errors occurred when a participant failed to respond to a non-target stimulus. Coloured dots represent individual data points. Black dots and error bars depict means and standard errors, respectively.", width = 80), collapse = "\n")) +
        # the "\n" here denote that you want a new line formed in the text
  scale_y_continuous(limits = c(min(gng_err_bar_v_data$err_omi), max(gng_err_bar_v_data$err_omi))) +
  #scale_colour_manual(values = cols) +
  geom_point(stat = "identity", aes(color = power), size = 1, 
             position = position_jitterdodge(jitter.width = .53), show.legend = FALSE) +
  scale_color_manual(values = c("deepskyblue4", "purple4")) + 
  geom_point(stat="summary", fun="mean", position = position_dodge(.9), size = 1.5, show.legend = FALSE) +
  geom_errorbar(data = gng_err_bar_v_data, stat = "summary", funmin = function(x) mean(x) - sd(x)/sqrt(length(x)), 
                funmax = function(x) mean(x) + sd(x)/sqrt(length(x)), size= .5, width= .25, position = position_dodge(.9)) +
  theme_classic() +
  theme(plot.title = element_text(color = "black", size = 14, face = "bold"), 
        plot.subtitle = element_text(color = "black", size = 13), 
        plot.caption = element_text(hjust = 0, size = 12, face = "italic"),
        text = element_text(size = 13))

#--FOR THE HORIZONTAL CUE TYPE-- 

# create data frame with location conditions as characters rather than integers 
  # First, get a subset of the data
  gng_err_bar_h_data = gng_err_analysis[gng_err_analysis$cue == 2, c("location", "power", "gng_err_omi_long")]
    colnames(gng_err_bar_h_data) = c("location", "power", "err_omi")
    rownames(gng_err_bar_h_data) = NULL 
  # reformat location & power as factors  
  gng_err_bar_h_data$location = factor(gng_err_bar_h_data$location, 
                  levels = c(1, 2, 3),
                  labels = c("On Desk", "Pocket/Bag", "Outside"))
  gng_err_bar_h_data$power = factor(gng_err_bar_h_data$power, 
                  levels = c(1, 2),
                  labels = c("ON", "OFF"))
       
# create plot
gng_err_bar_h = ggplot(gng_err_bar_h_data, aes(x = location, y = err_omi, fill = power), show.legend = FALSE) +
  geom_bar(stat = "summary", fun = "mean", position = "dodge") + 
    guides(shape = FALSE) + 
  scale_fill_manual(values = c("cadetblue3", "deepskyblue4")) + 
  labs(title = "Performance on the Go/No-Go Task: 'Horizontal' Cue Type",
       subtitle = "Omission Errors",
       x = "Smartphone Location", 
       y = "Omission Error", 
       fill = "Smartphone Power",
       caption = paste0(strwrap("Figure X. Distributions of scores on the horizontal cue type on the Cue-Dependent Go/No-Go Task, arranged by smartphone location (along the x-axis) and smartphone power (different bars). Omission errors occurred when a participant failed to respond to a non-target stimulus. Coloured dots represent individual data points. Black dots and error bars depict means and standard errors, respectively.", width = 80), collapse = "\n")) +
        # the "\n" here denote that you want a new line formed in the text
  scale_y_continuous(limits = c(min(gng_err_bar_h_data$err_omi), max(gng_err_bar_h_data$err_omi))) +
  #scale_colour_manual(values = cols) +
  geom_point(stat = "identity", aes(color = power), size = 1, 
             position = position_jitterdodge(jitter.width = .53), show.legend = FALSE) +
  scale_color_manual(values = c("deepskyblue4", "purple4")) + 
  geom_point(stat="summary", fun="mean", position = position_dodge(.9), size = 1.5, show.legend = FALSE) +
  geom_errorbar(data = gng_err_bar_h_data, stat = "summary", fun.ymin = function(x) mean(x) - sd(x)/sqrt(length(x)), 
                funmax = function(x) mean(x) + sd(x)/sqrt(length(x)), size= .5, width= .25, position = position_dodge(.9)) +
  theme_classic() +
  theme(plot.title = element_text(color = "black", size = 14, face = "bold"), 
        plot.subtitle = element_text(color = "black", size = 13), 
        plot.caption = element_text(hjust = 0, size = 12, face = "italic"),
        text = element_text(size = 13))

# Show plots
  # for the Vertical cue type
  gng_err_bar_v
  # for the Horizontal cue type
  gng_err_bar_h
  
# export plots
  # for the Vertical cue type
  # ggsave(filename="fig_gng_err_bar_v", plot = gng_err_bar_v, device = "png", height = 5, width = 8, units = "in", dpi = 500)
  # for the Horizontal cue type
  # ggsave(filename="fig_gng_err_bar_h", plot = gng_err_bar_h, device = "png", height = 5, width = 8, units = "in", dpi = 500)
```

- Shown as one plot...

```{r gng_err_plot_together, error=FALSE, warning=FALSE, message=FALSE}

# create data frame with location conditions as characters rather than integers 
gng_err_bar_data <- 
  gng_err_anova_data %>% 
  # make location a factor
  mutate(location = factor(location, levels = c(1:3), labels = c("Desk", "Pocket/Bag", "Outside"))) %>% 
  # make power a factor
  mutate(power = factor(power, levels = c(1:2), labels = c("ON", "OFF"))) %>% 
  # make cue a factor
  mutate(cue = factor(cue, levels = c(1:2), labels = c("(A) 'Go' Trials", "(B) 'No-Go' Trials")))

# create plot
gng_err_bar <- 
  ggplot(gng_err_bar_data, aes(x = location, y = err_omi, fill = power), show.legend = FALSE) +
  geom_bar(stat = "summary", fun = "mean", position = "dodge") + 
    guides(shape = FALSE) + 
  scale_fill_manual(values = c("cadetblue3", "deepskyblue4")) + 
  labs(
       title = "Performance on the Go/No-Go Task",
       subtitle = "Omission Errors",
       caption = paste0(strwrap("Figure X. Distributions of scores on the vertical cue type on the Cue-Dependent Go/No-Go Task, arranged by smartphone location (along the x-axis) and smartphone power (different bars). Omission errors occurred when a participant failed to respond to a non-target stimulus. Coloured dots represent individual data points. Black dots and error bars depict means and standard errors, respectively.", width = 80), collapse = "\n"),
       x = "Smartphone Location", 
       y = "Omission Error", 
       fill = "Smartphone Power"
       ) +
        # the "\n" here denote that you want a new line formed in the text
  scale_y_continuous(limits = c(min(gng_err_bar_data$err_omi), max(gng_err_bar_data$err_omi))) +
  #scale_colour_manual(values = cols) +
  geom_point(stat = "identity", aes(color = power), size = 1, 
             position = position_jitterdodge(jitter.width = .53), show.legend = FALSE) +
  scale_color_manual(values = c("deepskyblue4", "purple4")) + 
  geom_point(stat="summary", fun="mean", position = position_dodge(.9), size = 1.5, show.legend = FALSE) +
  geom_errorbar(data = gng_err_bar_data, stat = "summary", funmin = function(x) mean(x) - sd(x)/sqrt(length(x)), 
                funmax = function(x) mean(x) + sd(x)/sqrt(length(x)), size= .5, width= .25, position = position_dodge(.9)) +
  facet_wrap(~cue, ncol = 2) +
  theme_classic() +
  theme(plot.title = element_text(color = "black", size = 14, face = "bold"), 
        plot.subtitle = element_text(color = "black", size = 13), 
        plot.caption = element_text(hjust = 0, size = 12, face = "italic"),
        text = element_text(size = 13))
# show plot
gng_err_bar

# export plots
# ggsave(filename="fig_gng_err_bar_june16", plot = gng_err_bar, device = "jpeg", height = 5, width = 8, units = "in", dpi = 1000)
```


```{r eval=FALSE, include=FALSE}
# # install the package
# install.packages("ggstatsplot")
# # Load the package
# library(ggstatsplot)  
# ggbetweenstats(gng_err_analysis[,c("condition", "gng_err_omi_long")], condition, gng_err_omi_long, outlier.tagging = TRUE)
```

#### Accuracy Analysis

A **3(smartphone location: desk, pocket/bag, outside) x 2(smartphone power: ON, OFF)** between-subjects ANOVA was completed *(DV = average RT)*. The between-subjects factors were smartphone location and power *(click to see code)*.

Prior to completing the analysis, planned exclusion critera was applied to the data. Exclusion criteria for this analysis helped to control for any participants who did not follow the instructions. Any participants who had a reaction time that was higher than two standard deviations (SD) from the mean reaction time were not included in the final analysis. View the [Data Cleaning section](#all_data_DC) under the data [Construct Working Dataframes](#makeDFs) heading above. There was a total of `r nrow(gng_ac_analysis)` participants in the Go/No-Go Accuracy analysis.

##### Analysis

To assess if the reaction time statistically differed, we conducted a 3x2 ANOVA with the between-subjects factors of smartphone location (i.e., on desk, in pocket/bag, or outside) and power (i.e., on or off). Since there are unequal sample sizes, we used Type III sum of squares.

###### Assumptions

<details>

<summary>

The ANOVA made three assumptions: independent random sampling, normality, and homogeneity of variance. Normality was met but there was a slight skew to the right in the histogram and kurtosis was outside of the typical range, however, ANOVA tests are typically robust to violations of normality (Gardner & Tremblay, 2007). *(click to see details)*.

</summary>

[1. Independent Random Sampling]{style="color:#4F2683; font-size:18px;"}

This assumption was met during testing.

[2. Normality]{style="color:#4F2683; font-size:18px;"}

This assumption was tested by visualizing the residuals, applying a Shapiro-Wilk test to the residuals, and observing the Skewness and Kurtosis of the data *(extracted from the ANOVA output).*

```{r echo=TRUE, message=FALSE, warning=FALSE}

# get the required data from the ospan_analysis data: participant, location, power, and score
gng_ac_anova_data <- gng_ac_analysis[, c("participant", "location", "power", "gng_rt_av")]
    colnames(gng_ac_anova_data) = c("participant", "location", "power", "rt_av")
    rownames(gng_ac_anova_data) = NULL
    # reformat location,  power, and balance as factors  
    gng_ac_anova_data$location = factor(gng_ac_anova_data$location, 
                  levels = c(1, 2, 3),
                  labels = c("On Desk", "Pocket/Bag", "Outside"))
    gng_ac_anova_data$power = factor(gng_ac_anova_data$power, 
                  levels = c(1, 2),
                  labels = c("ON", "OFF"))

# run between-subjects ANOVA (IV: Smartphone Power & Location, Balance; DV: OSpan Abolute Score)
gng_ac_anova = ezANOVA(
  data = gng_ac_anova_data
  , dv = .(rt_av)
  , wid = .(participant)
  , between = .(location, power)
  , type = 3 # unequal sample sizes, therfore use type 3
  , detailed = TRUE
  , return_aov = TRUE
)

# calculate & extract the residuals from the ANOVA
gng_ac_res <- data.frame("residuals" = proj(gng_ac_anova$aov)[,"Residuals"])   
```

-   It appears that the residuals lie closely to a straight line, which suggests normality *(see visualization below)*.

```{r}

qqnorm(gng_ac_res$residuals, main = "Q-Q Plot of the Go/No-Go Reaction Time Residuals") 
qqline(gng_ac_res$residuals)

```

-   It appears that residuals tend to follow the shape of a normal distribution, with a slight skew to the right and some data on the far right *(see visualization below)*.

```{r message=FALSE, warning=FALSE}

qplot(gng_ac_res$residuals, main = "Histogram of Go/No-Go Reaction Time Residuals", binwidth = .01) + 
  theme_classic()

```

-   The Shapiro-Wilk test of normality was significant (based on $\alpha = 0.05$), *W* = `r round(as.numeric(gng_ac_shap$statistic["W"]), 2)`, *p* `r p_round(gng_ac_shap$p.value)` *(see code below)*.

```{r}

gng_ac_shap = shapiro.test(gng_ac_res$residuals)
gng_ac_shap

```

-   The data's Skewness and Kurtosis was examined and showed that the Skewness (range: `r round(min(as.numeric(gng_ac_sk_t[c(1,4),])), 2)` to `r round(max(as.numeric(gng_ac_sk_t[c(1,4),])), 2)`) was within the acceptable range of $\pm 2.0$ (Schmider, Ziegler, Danay, Beyer, & Bühner, 2010). For the Kurtosis, the on desk/powered off condition was outside the acceptable range of $\pm 9.0$ (Schmider, Ziegler, Danay, Beyer, & Bühner, 2010; overall range: `r round(min(as.numeric(gng_ac_sk_t[c(2,5),])), 2)` to `r round(max(as.numeric(gng_ac_sk_t[c(2,5),])), 2)`) *(see table below)*.

```{r}
# use the "gng_ac_des" data frame from the descriptives

# organize the data into a new data frame depicting the Skew, kurtosis, and sample size (n).
  gng_ac_sk_t = rbind((cbind(t(des_task_cont["gng_rt_av1", c("skew", "kurtosis", "n")]),
                             t(des_task_cont["gng_rt_av2", c("skew", "kurtosis", "n")]),
                             t(des_task_cont["gng_rt_av3", c("skew", "kurtosis", "n")]))), 
                      (cbind(t(des_task_cont["gng_rt_av4", c("skew", "kurtosis", "n")]),
                             t(des_task_cont["gng_rt_av5", c("skew", "kurtosis", "n")]),
                             t(des_task_cont["gng_rt_av6", c("skew", "kurtosis", "n")]))))
    colnames(gng_ac_sk_t) = c("On Desk", "In Pocket/Bag", "Outside")
    rownames(gng_ac_sk_t) = rep(c("Skewness", "Kurtosis", "n"), 2)
    
# use kable to show the descriptives table
  kable(gng_ac_sk_t, caption = "Skewness and Kurtosis of the Reaction Time on the Go/No-Go Task", digits = 2, align = 'c') %>%
  kable_styling(bootstrap_options = "striped", "hover", full_width = F) %>%
  add_header_above(c(" ", "Smartphone Location" = 3)) %>%
  pack_rows("Smartphone Power ON", 1, 3) %>% 
  pack_rows("Smartphone Power OFF", 4, 6) %>%
  kable_classic()

```

-   Therefore, based on all the normality assumption checks, normality was assumed to be within the acceptable range for the ANOVA analysis. Although there was a slight skew to the right in the histogram and kurtosis was outside of the typical range, ultimately, ANOVA tests are typically robust to violations of normality (Gardner & Tremblay, 2007).

[3. Homogeneity of Variance]{style="color:#4F2683; font-size:18px;"}

This assumption was tested by conducting Levene's Test for Homogeneity of Variance for each between-subjects variable (i.e. smartphone location and smartphone power).

-   The homogeneity assumption was met (based on $\alpha = 0.05$) for smartphone location, *F*(`r gng_ac_lev_loc$Df[1]`, `r gng_ac_lev_loc$Df[2]`) = `r round(gng_ac_lev_loc[2][1,1], 2)`, *p* `r p_round(gng_ac_lev_loc[3][1,1])` and smartphone power, *F*(`r gng_ac_lev_pow$Df[1]`, `r gng_ac_lev_pow$Df[2]`) = `r round(gng_ac_lev_pow[2][1,1], 2)`, *p* `r p_round(gng_ac_lev_pow[3][1,1])` *(see code below)*.

```{r message=FALSE, warning=FALSE}

# conduct levene's
  # for location
  gng_ac_lev_loc = leveneTest(data = gng_ac_anova_data, rt_av ~ as.factor(location), center = median)
  gng_ac_lev_loc
  # for power
  gng_ac_lev_pow = leveneTest(data = gng_ac_anova_data, rt_av ~ as.factor(power), center = median)
  gng_ac_lev_pow
```

-   The homogeneity of variance assumption was met. Based on all the normality assumption checks, normality was assumed to be within the acceptable range for the ANOVA analysis. Additionally, it should be noted that ANOVA analyses can be robust to violations of normality (Gardner & Tremblay, 2007).

```{r eval=FALSE, message=FALSE, warning=FALSE, include=FALSE}
## NOT SHOWN -- this would run w/ a white correction if homo not met... 

# run the corrected ANOVA (IV: Smartphone Power & Location, Balance; DV: OSpan Abolute Score)
# this will be used to report the main effect of smartphone power only 
# gng_ac_anova_cor = ezANOVA(
#   data = gng_ac_anova_data
#   , dv = .(rt_av)
#   , wid = .(participant)
#   , between = .(location, power)
#   , type = 3 # unequal sample sizes, therfore use type 3
#   , detailed = TRUE
#   , return_aov = TRUE
#   , white.adjust = F
# )

# Calculate effect size:
# this will be used to report the main effect of smartphone power only 
# gng_ac_pow_es = (EtaSq(gng_ac_anova$aov, type = 3, anova = FALSE))[2]
```

</details>

###### Results

All results were based on $\alpha = 0.05$. Below are the reported statistics for the main effects and the interactions, followed by a table depicting the ANOVA results.

**Main Effect of Smartphone Location**

There was no significant main effect of smartphone location on reaction time, *F*(`r gng_ac_t[2,2]`, `r gng_ac_t[2,3]`) = `r round(gng_ac_t[2,4], 2)`, *p* `r p_round(gng_ac_t[2,5])`, $\eta^{2}$ `r p_round(gng_ac_t[2,7])`.

**Main Effect of Smartphone Power**

There was no significant main effect of smartphone power on reaction time, *F*(`r gng_ac_t[3,2]`, `r gng_ac_t[3,3]`) = `r round(gng_ac_t[3,4], 2)`, *p* `r p_round(gng_ac_t[3,5])`, $\eta^{2}$ `r p_round(gng_ac_t[3,7])`.

**Interaction of Smartphone Location & Smartphone Power**

There was no significant interaction between smartphone location and smartphone power on reaction time, *F*(`r gng_ac_t[4,2]`, `r gng_ac_t[4,3]`) = `r round(gng_ac_t[4,4], 2)`, *p* `r p_round(gng_ac_t[4,5])`, $\eta^{2}$ `r p_round(gng_ac_t[4,7])`.

**Post-hoc Tests**

Since there was a significant three-way interaction (i.e smartphone location, smartphone power, and balance), further analyses were completed *(see "Post-Hocs: Simple Two-Way Interactions" below)*.

```{r}
# create table from ANOVA results
gng_ac_t <- gng_ac_anova$ANOVA

# show ANOVA results in kable table 
kable(gng_ac_anova$ANOVA, caption = "3x2 ANOVA on Average Reaction Time in the Go/No-Go Task", digits = 3, align = 'c') %>%
  row_spec(0, bold = T) %>% 
  kable_styling(bootstrap_options = "striped", "hover", full_width = F) %>% 
  kable_classic()
```

```{r eval=FALSE, include=FALSE}
## NOT SHOWN -- do if homo not met... 

# # create a table with the anova results including the corrected values for the main effect of power (i.e. since it did not meet the homogeneity of variance assumption)
#   # get the original anova data
#   gng_ac_t = gng_ac_anova$ANOVA
#     # since the corrected data does not have the SS, remove these columns from the original ANOVA table
#     gng_ac_t = dplyr::select(gng_ac_t, -SSn, -SSd)
#     # chance the data for the main effect of power to corrected numbers
#     gng_ac_t[gng_ac_t$Effect == "power",] = c(gng_ac_anova_cor$ANOVA[3,], gng_ac_pow_es)
# # show ANOVA results in kable table 
# kable(gng_ac_t, caption = "3x2x2 ANOVA on Go/No-Go Reaction Time", digits = 4, align = 'c') %>%
#   kable_styling(bootstrap_options = "striped", "hover", full_width = F) %>% 
#   kable_classic()
```

##### Visualizations

The following shows a visualization of Accuracy (i.e. reaction time) on the Cue-Dependent Go/No-go Task. There was no significant main effect of smartphone location or smartphone power and no significant interaction between smartphone location and power.

```{r message=FALSE, warning=FALSE}
# First, get a subset of the data
gng_ac_bar_data <- 
  gng_ac_anova_data %>% 
  select(-participant) 
# %>% 
#   mutate(location = factor(location, labels = c("On Desk", "Pocket/Bag", "Outside"))) # didnt need this, its already a factor... 
  
# remove row names
rownames(gng_ac_bar_data) <- NULL

# plot bar graph
gng_ac_bar <- 
  ggplot(gng_ac_bar_data, aes(x = location, y = rt_av, fill = power), show.legend = FALSE) +
  geom_bar(stat = "summary", fun = "mean", position = "dodge") + 
    guides(shape = FALSE) + 
  scale_fill_manual(values = c("cadetblue3", "deepskyblue4")) + 
  labs(
       title = "Performance on the Go/No-Go Task \nfor the OSpan First Balance Condition",
       subtitle = "Accuracy (average reaction time)",
       caption = "Note: ...",
       x = "Smartphone Location", 
       y = "Average Reaction Time (sec.)", 
       fill = "Smartphone Power"
       ) +
        # the "\n" here denote that you want a new line formed in the text
  geom_point(stat = "identity", aes(color = power), size = 1, 
             position = position_jitterdodge(jitter.width = .53), show.legend = FALSE) +
  scale_color_manual(values = c("deepskyblue4", "purple4")) + 
  geom_point(stat="summary", fun="mean", position = position_dodge(.9), size = 1.5, show.legend = FALSE) +
  geom_errorbar(data = gng_ac_bar_data, stat = "summary", fun.ymin = function(x) mean(x) - sd(x)/sqrt(length(x)), 
                funmax = function(x) mean(x) + sd(x)/sqrt(length(x)), size= .5, width= .25, position = position_dodge(.9)) +
  theme_classic() +
  theme(plot.title = element_text(color = "black", size = 14, face = "bold"), 
        plot.subtitle = element_text(color = "black", size = 13), 
        plot.caption = element_text(hjust = 0, size = 12, face = "italic"),
        text = element_text(size = 13)) #+
  #scale_y_continuous(limits = c(min(gng_ac_bar_ospan_data$rt_av), max(gng_ac_bar_ospan_data$rt_av)))


# show the figures
gng_ac_bar
  
# export plots
# ggsave(filename="fig_gng_ac_bar_june16", plot = gng_ac_bar, device = "jpeg", height = 5, width = 8, units = "in", dpi = 1000)
```

## 3. Testing the Moderation Hypothesis:

### Exploratory Factor Analysis of the SAD

To examine the underlying factors measured by the Smartphone Attachment and Dependency Inventory (SAD) from Ward et al. (2017), a principal components factor analysis with a Varimax rotation was completed on participant's SAD data. The analysis was exploratory and was compared to Ward et al.'s findings. There was a total of `r nrow(fa_data)` participants in the factor analysis.

#### Checking the Data

To perform the principal components factor analysis, we first examined the data for (1) missing data, (2) correlations between the items, and (3) the factorability of the data.

##### 1. Missing Data

-   From the descriptives table, it seems that there is no missing or incomplete data. This was confirmed by finding the complete cases, which were equal to the total sample size (i.e. `r fa_complete`). Therefore, no data was removed due to missing values for this analysis *(see descriptives table below)*.

```{r message=FALSE, warning=FALSE}

# get a data frame with the SAD items only for the factor analysis
fa_data = dplyr::select(factor_analysis, SAD_1:SAD_13)

# find the total complete cases to find any missing data from the data -- any participant with missing data must be removed
fa_complete = sum(complete.cases(fa_data))

# Show descriptives for the factor analysis data
fa_des = describe(fa_data)

# show descriptives in kable table 
kable(describe.by(fa_data), caption = "Descriptives for Items in the SAD", digits = 4, align = 'c', escape = FALSE) %>%
  kable_styling(bootstrap_options = "striped", "hover", full_width = F) %>% 
  kable_classic()

```

##### 2. Correlations between the Items

-   The correlations between the items were examined to determine if they were either not high enough (i.e., r \< .30) or too high (i.e., r \> .80).

    -   The correlations for the 13 items in the SAD ranged from *r*(`r fa_cor$n[1,1] - 2`) `r p_round(min(fa_cor$r[which(fa_cor$r < 1)]))` to *r*(`r fa_cor$n[1,1] - 2`) `r p_round(max(fa_cor$r[which(fa_cor$r < 1)]))`, *p* `r p_round(max(fa_cor$P[which(!is.na(fa_cor$P))]))`.
    -   A visual inspection of the correlations showed that items 12 and 13 might not be correlated enough with the rest of the items (i.e. `r length(which(fa_cor$r[,12] < .30))` and `r length(which(fa_cor$r[,13] < .30))` correlations below .30, respectively). There were no issues associated with too many high correations for any items.
    -   The lower correlations for items 12 and 13 should be kept in mind when completing the factor analysis and these items may be removed.\

-   To check for multicollinearity, the determinant of the correlation matrix was calculated. The determinant was greater than the necessary value of 0.00001 (i.e., `r round(fa_det, 5)`), so, there was no issues with multicollinearity (Field et al., 2012).

-   Therefore, all items in the SAD correlated reasonably with each other and without an excessively large correlation so no items were removed at this stage.

```{r}

# get the pearson correlations between all items (and corresponding p-values)
fa_cor = rcorr(as.matrix(fa_data), type = "pearson") 

# make a table with the data to display it better
fa_cor_t = as.data.frame(fa_cor$r)
  # include only the lower triangle of the data
  fa_cor_t[upper.tri(fa_cor_t)] = NA
  
# find the determinant of the correlation matrix
fa_det = det(fa_cor$r)

# Print table using kable
  # make any NAs appear as blank within kable
  opts = options(knitr.kable.NA = "")
  # print table
  kable(fa_cor_t, caption = "Pearson Correlations between Items in the SAD", digits = 4, align = 'c', escape = FALSE) %>%
    kable_styling(bootstrap_options = "striped", "hover", full_width = F) %>%
    add_footnote("p < .001", notation = "alphabet") %>% 
    kable_classic()
    # restore the original options so that "NA" shows up again
    options(opts)

```

-   Another way to visualize this is using a Scatter Plot Matrix

    -   This shows: (1) a scatter plot bellow the lower diagonal, with a line of best fit (red); (2) a historgram and density plot of the item on the diagonal; and (3) the correlation values (pearson r) above the diagonal, with corresponding significance shown with asterisks. The x-axis in each scatter plot represents the column variable and the y-axis the row variable.

```{r}
# show a SPLOM (Scatter Plot Matrix) of the Data 
pairs.panels(fa_data, pch = ".", density = TRUE, ellipses = FALSE, lm = TRUE, digits = 2, method = "pearson", jiggle = TRUE, factor = 1, hist.col = "cadetblue3", stars = TRUE, gap = .5, verOdd = TRUE, horOdd = TRUE, main = "Scatter Plot Matrix of Items in the SAD")

```

##### 3. Factorability of the Data

-   This assessed if there were meaningful latent factors to be found within the data, which was assessed in two ways: Barlett's test of sphericity for correlation adequacy and the Kaiser-Meyer-Olkin measure of sampling adequacy.

    -   **Barlett's test of sphericity for correlation adequacy**:

        -   This test was statistically significant, which suggested good factorability, $\chi^{2}$(`r fa_bartest$df`) = `r round(fa_bartest$chisq, 2)`, *p* `r p_round(fa_bartest$p.value)`

    -   **Kaiser-Meyer-Olkin measure of sampling adequacy**:

        -   This measure showed an overall measure of sampling adequacy (MSA) and an MSA per item. The overall MSA was higher than both the minimum and preferred criteria (0.50 and 0.60, respectively; Kaiser, 1974, Hutcheson & Sofroniou, 1999), $MSA_{overall}$ = `r round(fa_kmo$MSA, 2)`. Each item's MSA was also higher than the minimum and preferred criteria, $MSA_{1-13}$ = `r min(round(fa_kmo$MSAi, 2))` - `r max(round(fa_kmo$MSAi, 2))`.

-   Therefore, both factorability tests confirmed that the planned factor analysis was appropriate.

```{r message=FALSE, warning=FALSE}
# run the Barlett Test
fa_bartest = cortest.bartlett(fa_data)
fa_bartest

# run the KMO 
fa_kmo = KMO(fa_data)
fa_kmo

```

#### Extract Factors

Now that factorability was confirmed, we found the appropriate number of factors to extract. This was done using: (1) a parallel analysis scree plot, (2) eigen values (i.e., from the parallel analysis), and (3) the factor structure between the two solutions (i.e., using factor diagrams).

##### 1. Parallel Analysis Scree Plot.

This plotted the eigenvalues from the data and a parallel analysis which generated random correlation matrices. The number of factors is then determined by comparing the resulting eigenvalues to the observed data, that is, observed eigenvalues that were higher than the corresponding random eigenvalues were more likely to form meaningful factors.

-   The parallel analysis scree plot showed that there are `r fa_par$ncomp` appropriate factors (i.e. using the principal components extraction method).

    -   From closer inspection of the plot, the last factor was very close to the random eigenvalue cut-off line, therefore, the 2, 3, and 4 factor solutions were compared to determine the best fit (i.e., the eigen values and factor structures were examined).

```{r message=FALSE, warning=FALSE}

fa_par <- fa.parallel(fa_data, fa = "fa", main = "Parallel Analysis Scree Plot for Items in the SAD")
```

##### 2. Eigen Values

For a sample size greater than 250 (i.e., `r fa_complete` complete cases) the average communalities (i.e., the proportion of common variance within the items) for the two factor (`r round(mean(fa_two_v$communality), 2)`) and three factor (`r round(mean(fa_three_v$communality), 2)`) solutions were wihtin the acceptable value (i.e., 0.60; Field et al., 2012). Therefore, Kaiser's criterion (i.e., a meaningful factor has an eigen value greater than 1) was used for the eigen values. It should be noted that although the average communalities were not very high, the scree plot and factor diagrams were used to choose the appropriate number of factors.

-   The eigen values (unrotated solution) showed three meaningful factors (i.e., `r round(fa_par$pc.values[1:3], 2)`), which *supported the three-factor solution* as a better representation of the data.

```{r}
# Eigen values
  # view eigen values from the parallel analysis scree plot
  round(fa_par$pc.values, 3)
```

##### 3. Factor Diagrams

A factor diagram for the two-, three-, and four-factor solutions (rotated solution) was used to determine which solution represents the most interpretable solution. The diagrams show the factor loadings for each item on the given factor for which they load the highest on. The items were ordered (i.e., descending) based on these loadings.\
*Note: The factor diagrams are based on the rotated solutions to better distribute the factor loadings*

-   The factor diagrams showed that the two factor solution resulted in most items loading onto only one factor and was not easily interpreted. The four-factor solution showed a 2-item factor, which is not easily interpretable as it did not meet the 3-item minumum (Costello & Osborn, 2005). The three and four factor solutions showed better distributions and were more interpretable. After reviewing the items, the four-factor solution provided a logical structure compared to the three-factor solution. That is, the third factor in the 3-factor solution grouped items 6 & 7 with 12 & 13, however items 6 & 7 ask about emotional associations whereas items 12 & 13 are regarding attention. Although the fourth factor in the four-factor solution only contained 2 items, these items were considered unique in that they are the only items in the SAD which asked participants about attention with respect to their smartphone. Therefore, the **four-factor** solution was chosen. 

Therefore, the **three factor solution** was chosen to better represent the data.

```{r fa_diagrams, out.width=c('50%','50%'), fig.show='hold'}
##-- VARIMAX ROTATION  -- 
# run a factor analysis to compare the two- and three-factor solutions (no rotation)
  # 2 factor solution
  fa_two_v = fa(fa_data, nfactors = 2, fm = "pa", rotate = "varimax", scores = T)
    # change name in loadings for clarity when visualizing the model
    colnames(fa_two_v$loadings) = c("Factor 1", "Factor 2")
  
  # 3 factor solution
  fa_three_v = fa(fa_data, nfactors = 3, fm = "pa", rotate = "varimax", scores = T)
    # change name in loadings for clarity when visualizing the model
    colnames(fa_three_v$loadings) = c("Factor 1", "Factor 2", "Factor 3")
  
  # 4 factor solution
  fa_four_v = fa(fa_data, nfactors = 4, fm = "pa", rotate = "varimax", scores = T)
    # change name in loadings for clarity when visualizing the model
    colnames(fa_four_v$loadings) = c("Factor 1", "Factor 2", "Factor 3", "Factor 4")

# Factor Diagram
  # 2 factor solution
  fa.diagram(fa_two_v, main = "Two Factor Solution for items in the SAD (Varimax Rotation)", sort = TRUE, adj = 1, digits = 2)

  # 3 factor solution
  fa.diagram(fa_three_v, main = "Three Factor Solution for Items in the SAD (Varimax Rotation)", sort = TRUE, adj = 1, digits = 2)
  
  # 4 factor solution
  fa.diagram(fa_four_v, main = "Three Factor Solution for Items in the SAD (Varimax Rotation)", sort = TRUE, adj = 1, digits = 2)
```

```{r eval=FALSE, message=FALSE, warning=FALSE, include=FALSE}

##-- OBLIMIN ROTATION --  
# run a factor analysis to compare the two- and three-factor solutions (no rotation)
  # 2 factor solution
  fa_two_o = fa(fa_data, nfactors = 2, fm = "pa", rotate = "oblimin", scores = T)
    # change name in loadings for clarity when visualizing the model
    colnames(fa_two_o$loadings) = c("Factor 1", "Factor 2")
  
  # 3 factor solution
  fa_three_o = fa(fa_data, nfactors = 3, fm = "pa", rotate = "oblimin", scores = T)
    # change name in loadings for clarity when visualizing the model
    colnames(fa_three_o$loadings) = c("Factor 1", "Factor 2", "Factor 3")
  
  # 4 factor solution
  fa_four_o = fa(fa_data, nfactors = 4, fm = "pa", rotate = "oblimin", scores = T)
    # change name in loadings for clarity when visualizing the model
    colnames(fa_four_o$loadings) = c("Factor 1", "Factor 2", "Factor 3", "Factor 4")

# Factor Diagram
  # 2 factor solution
  fa.diagram(fa_two_o, main = "Two Factor Solution for items in the SAD (oblimin Rotation)", sort = TRUE, adj = 1, digits = 2)

  # 3 factor solution
  fa.diagram(fa_three_o, main = "Three Factor Solution for Items in the SAD (oblimin Rotation)", sort = TRUE, adj = 1, digits = 2)
  
  # 4 factor solution
  fa.diagram(fa_four_o, main = "Three Factor Solution for Items in the SAD (oblimin Rotation)", sort = TRUE, adj = 1, digits = 2)

```

#### Achieving Simple Structure

Now that we determined the number of factors (i.e., four), we assessed for **simple structure** (i.e., a final solution of the factor analysis with the simplest solution). Simple structure was achieved by reviewing assessing the loadings for strong loadings (i.e., using a cut-off of \> .40) and removing: (1) split variables (i.e., variables that loaded onto more than one factor), (2) non-loading variables, and (3) weak factors (i.e., factor with only 1 variable). Factor loadings are the correlations between each variable and the factor, that is, they represent the relationship (i.e., positive or negative) and strength (i.e., strong, moderate, or weak) between the variable and the factor.

##### First Solution: 13 SAD Items

First, we looked at the **rotated factor loadings** (i.e., varimax) for all 13 items in the SAD for the three-factor solution. The loadings are shown with a cut-off of 0.40 and rounded to three decimal places. 

- (1) Split variables: Item #4 loaded strongly onto factor 1 (`r round(fa_load_v[4,1], 2)`) and factor 3 (`r round(fa_load_v[4,3], 2)`). 

- (2) Non-Loading Variables: All items loaded onto at least one factor. 

- (3) Weak Factors: All factors had at least 2 variables loading strongly, which was discussed earlier. 

- Therefore, since item #46 was split between factor 1 and factor 3, it was removed from the analysis. This new solution was assessed with the same criteria to see if simple structure could be achieved.

```{r}

# Rotated (i.e., varimax) Factor Loadings 
print(fa_four_v$loadings, cutoff = 0.40, digits = 3)

# make data frame to show values
fa_load_v <- as.data.frame(fa_four_v$loadings[1:13,])

```

##### Second Solution: 12 SAD Items (removed item 4)

Now, we removed item #4 and looked at the **rotated factor loadings** (i.e., varimax) for the remaining 12 items in the SAD for the four-factor solution. The loadings are shown with a cut-off of 0.40 and rounded to three decimal places. 

- (1) Split variables: No items loaded strongly onto more than one factor.

- (2) Non-Loading Variables: All items loaded onto at least one factor.

- (3) Weak Factors: All factors had at least 2 items loading strongly. 

- Therefore, **simple structure was achieved and this final solution was used for further analysis**.

```{r}
# create data set without item # 6
fa_data2 <- 
  fa_data %>% 
  select(-SAD_4)

# run FA with 4 factors and a varimax rotation
  # rotated (varimax)
  fa_four_v2 <- fa(fa_data2, nfactors = 4, fm = "pa", rotate = "varimax", scores = T) 
  # change name in loadings for clarity when visualizing the model
    colnames(fa_four_v2$loadings) = c("Factor 1", "Factor 2", "Factor 3", "Factor 4")

# Rotated (i.e., varimax) Factor Loadings 
print(fa_four_v2$loadings, cutoff = 0.40, digits = 3)

# make data frame to show values
fa_load_v2 <- as.data.frame(fa_four_v2$loadings[1:12,])

```

#### Describing the Final Solution

Now that we have acheived simple structure, we looked at the (1) Communalities, (2) Variance Accounted for in percentage, and (3) Factor Loadings. This was done using the final solution (i.e., four-factor, varimax rotation, excluding item #4).

##### (1) Communalities

First. we observed the **communalities.** Each item's communality was the percentage of variance that was explained by the retained factors for each item. These ranged from `r round(min(fa_four_v2$communality)*100, 2)`% to `r round(max(fa_four_v2$communality)*100, 2)`%, with an average of `r round(mean(fa_four_v2$communality)*100, 2)` (see table for more details).

```{r}
# get the communalities from the model
fa_com = as.data.frame(fa_four_v2$communality)
  
# create matrix to present communalities in a table
fa_com_t = rbind((fa_com[,1])*100)
  # rename row and column names
  colnames(fa_com_t) = rownames(fa_com)
  rownames(fa_com_t) = NULL

# Print table using kable
  # print table
  kable(fa_com_t, caption = "Communalities for the Three-Factor Solution for Items in the SAD (%)", digits = 4, align = 'c', escape = FALSE) %>%
    kable_styling(bootstrap_options = "striped", "hover", full_width = F) %>%
    add_footnote("Communalities show the percentage of variance explained by each factor for each item on the SAD.", notation = "alphabet") %>% 
    kable_classic()
```

##### (2) Variance Accounted (%)

Next, we looked at the **percentage of variance accounted for** by using the eigenvalues (compute the variance associated with the factors). This was done for both the unrotated and rotated (i.e., varimax) final four-factor solution. As you can see in the table below, a total of `r round(sum(var_per), 2)`% of variance was accounted for by the three extracted factors (the same before and after rotation). This variance was mostly accounted for by factor one in the unrotated solution (factor 1 = `r round(var_per[1], 2)`%; factor 2 = `r round(var_per[2], 2)`%; factor 3 = `r round(var_per[3], 2)`%), but was more equally distributed after rotation (factor 1 = `r round(var_per_v[1], 2)`%; factor 2 = `r round(var_per_v[2], 2)`%; factor 3 = `r round(var_per_v[3], 2)`%; see table for more details).

```{r}

# Run an unrotated three-factor solution
  fa_four2 <- fa(fa_data2, nfactors = 4, fm = "pa", rotate = "none", scores = T) 
    # change name in loadings for clarity when visualizing the model
    colnames(fa_four2$loadings) = c("Factor 1", "Factor 2", "Factor 3", "Factor 4") 
    
  
# To find the variance accounted for, you can either (1) calculate it from the eigen values for each factor, or (2) grab the calculated values from the model 
  # Both are shown below, but one is commented out to avoid creating excess variables
  # (1) Calculated 
    # For the unrotated solution
    #var_per = 100*fa_four2$values[1:3]/length(fa_four2$values)
    
    # For the rotated solution
    #var_per_v = 100*(as.numeric(fa_four_v2$Vaccounted[1,]))/length(fa_four_v2$values)
      
  # (2) The information is found in the "Vaccounted" table from the output, where "SS loadings" (row 1) shows the eigen values and "Proportion Var" (row 2) shows the variance accounted for.
    # For the unrotated solution
    var_per = 100*as.numeric(fa_four2$Vaccounted[2,])
    
    # For the rotated solution
    var_per_v = 100*as.numeric(fa_four_v2$Vaccounted[2,])

# Create & Show a table with eigen values, percent var explained, and total percent var explained for unrotated & rotated three-factor solutions
    # unrotated solution
    var_per_t = as.data.frame(cbind(rbind(as.numeric(fa_four2$Vaccounted[1,]), var_per), 
                                    rbind(as.numeric(fa_four_v2$Vaccounted[1,]), var_per_v)))
    # name the columns & rows accordingly
    colnames(var_per_t) = c(rep(c("Factor 1", "Factor 2", "Factor 3", "Factor 4"),2))
    rownames(var_per_t) = c("Eigen Values", "Variance Explained (%)")

  # use kable to show the table
  kable(var_per_t, caption = "Eigen Values and Percent of Variance Explained for the Unrotated and Rotated (varimax) for the Three-Factor Solution for Items in the SAD", digits = 4, align = 'c') %>%
  kable_styling(bootstrap_options = "striped", "hover", full_width = T) %>%
  add_header_above(c(" ", "Unrotated" = 4, "Varimax Rotation" = 4)) %>%
    kable_classic()

```

##### (3) Factor Loadings

Lastly, we looked at the **factor loadings** of the final four-factor solution. Below you can see the rotated (i.e., varimax) and unrotated factor loadings. The loadings are shown with a cut-off of 0.40 and rounded to three decimal places. The unrotated factor loadings are difficult to interpret: all items loaded strongly onto factor 1 (i.e., loading was greater than 0.40). The rotated factor loadings were more easy to interpret (see below for more details).

```{r}

# Unrotated Factor Loadings 
print(fa_four2$loadings, cutoff = 0.40, digits = 3)

# Rotated (i.e., varimax) Factor Loadings 
print(fa_four_v2$loadings, cutoff = 0.40, digits = 3)

```

#### Reliability Analyses

In order to validate the SAD, we completed a reliability analysis (i.e., is the scale consistent) by looking at the split-half reliability using Cronbach's alpha. This method is the most common measure of scale reliability (Field et al., 2012): the data for each participant is randomly split into two sets, which forms a "half" split for the scale. Then, a correlation is calculated between these sets to determine how similar they are: higher reliability would be seen with highly correlated sets. Since the way that the data is split can impact the correlations, Cronbach's alpha represents an average of all the possible splits using the following equation:

$$\alpha =  \frac{N^2 \overline{Cov}}{\sum s_{item}^2 + \sum Cov_{item}} $$

<details>

<summary>

To obtain the best reliability measure, we must **divide our data into any subscales**, which in this case, are our **three factors** from our factor analysis. Therefore, we split the data into three sets for the reliability analysis. *(click to see code)*

</summary>

```{r}

# split the data respective to the 3 factors from the FA to perform the reliability analysis
  # factor 1 (items: 1, 2, 3)
  f1_data = fa_data2 %>% select(SAD_1:SAD_3)
  # factor 2 (items: 8, 9, 10, 11)
  f2_data = fa_data2 %>% select(SAD_8:SAD_11)
  # factor 3 (items: 5, 6, 7)
  f3_data = fa_data2 %>% select(SAD_5:SAD_7)
  # factor 4 (items: 12, 13)
  f4_data = fa_data2 %>% select(SAD_12, SAD_13)

```

</details>

<details>

<summary>

Once we have our subscales, we **completed the reliability analysis on *each subscale*.** *(click to see code)*

</summary>

```{r}

# compute reliability analysis for each subscale using "alpha()" function in "psych" package
  # factor 1
  f1_rel = psych::alpha(f1_data)
  # factor 2
  f2_rel = psych::alpha(f2_data)
  # factor 3
  f3_rel = psych::alpha(f3_data)
  # factor 4
  f4_rel = psych::alpha(f4_data)

```

</details>

To view the analysis, we broke it down into three components: (1) the alpha values and overall mean and standard deviation of the subscales, (2) alpha drop values (i.e., the alpha values if an item was dropped, and (3) the item statistics.

##### (1) Alpha Values

The alpha values from the reliability analyses included Cronbach's $\alpha$ (i.e., based on the covariances), a standardized $\alpha$ (i.e., based on the correlations), and Guttman's Lambda 6. Additionally, the overall average correlation, mean, and standard deviation for the subscale is shown (see table below for more details).

-   As seen in the table, Cronbach's $\alpha$ was within the acceptable range (i.e., greater than .70; Kline, 1999) for Factor 1 ($\alpha_C$ = `r round(fa_rel_alphas$raw_alpha[1], 2)`), Factor 2 ($\alpha_C$ = `r round(fa_rel_alphas$raw_alpha[2], 2)`), and Factor 3 ($\alpha_C$ = `r round(fa_rel_alphas$raw_alpha[3], 2)`), but not for Factor 4 ($\alpha_C$ = `r round(fa_rel_alphas$raw_alpha[4], 2)`). The overall correlation for Factor 1, *r*(`r f1_rel$item.stats[1, "n"] - 2`) `r round(fa_rel_alphas$average_r[1],2)`, was strong. Factor 2, *r*(`r f2_rel$item.stats[1, "n"] - 2`) `r round(fa_rel_alphas$average_r[2],2)`, Factor 3, *r*(`r f3_rel$item.stats[1, "n"] - 2`) `r round(fa_rel_alphas$average_r[3],2)`, and Factor 4, *r*(`r f3_rel$item.stats[1, "n"] - 2`) `r round(fa_rel_alphas$average_r[4],2)`, were moderate. All correlations were positive. These statistics were further explored with dropped items.

```{r}
# show the alpha values from the 3 subscales
  # create data frame with the information from the 3 reliability analyses
  fa_rel_alphas = as.data.frame(rbind("Factor 1" = dplyr::select(f1_rel$total, -c("S/N", "ase", "median_r")), 
                                      "Factor 2" = dplyr::select(f2_rel$total, -c("S/N", "ase", "median_r")), 
                                      "Factor 3" = dplyr::select(f3_rel$total, -c("S/N", "ase", "median_r")),
                                      "Factor 4" = dplyr::select(f4_rel$total, -c("S/N", "ase", "median_r")))
                                )
  # add an extra column for the "overall" or average alpha
  fa_rel_alphas = cbind("Overall alpha" = c(mean(as.numeric(dplyr::select(f1_rel$total, c("raw_alpha", "std.alpha")))), 
                                            mean(as.numeric(dplyr::select(f2_rel$total, c("raw_alpha", "std.alpha")))), 
                                            mean(as.numeric(dplyr::select(f3_rel$total, c("raw_alpha", "std.alpha")))),
                                            mean(as.numeric(dplyr::select(f4_rel$total, c("raw_alpha", "std.alpha"))))),
                        fa_rel_alphas)

  # use kable to show the table
  kable(fa_rel_alphas, caption = "Reliability Analysis for Subscales in the SAD: Alpha Values", digits = 4, align = 'c', 
        col.names = c("Overall alpha", "Cronbach's alpha", "Standardized alpha", "Guttman's lambda 6", "Average r", "M", "SD")) %>%
  kable_styling(bootstrap_options = "striped", "hover", full_width = T) %>% 
    kable_classic()
  
```

##### (2) Alpha Drop Values

To further explore the reliability of each subscale, we looked at the alpha values after removing each item. This allowed us to determine whether a given item was reducing the reliability of the subscale. That is, if dropping an item increased $\alpha$ or the correlation, then removing the item from the analysis should be considered. The data presented in the table below shows Cronbach's $\alpha$, standardized $\alpha$, Guttman's Lambda 6, and the average correlation.

-   As seen in the table below, dropping an item did not improve reliability for Factor 1 (range of $\alpha_C$ = `r round(min(fa_rel_adrop$raw_alpha[1:3]), 2)` - `r round(max(fa_rel_adrop$raw_alpha[1:3]), 2)`), Factor 2 (range of $\alpha_C$ = `r round(min(fa_rel_adrop$raw_alpha[4:7]), 2)` - `r round(max(fa_rel_adrop$raw_alpha[4:7]), 2)`), Factor 3 (range of $\alpha_C$ = `r round(min(fa_rel_adrop$raw_alpha[8:10]), 2)` - `r round(max(fa_rel_adrop$raw_alpha[8:10]), 2)`), and Factor 4 (range of $\alpha_C$ = `r round(min(fa_rel_adrop$raw_alpha[11:12]), 2)` - `r round(max(fa_rel_adrop$raw_alpha[11:12]), 2)`). 
- With respect to the average correlation, dropping an item only showed a substantially improvement for Factor 1, where dropping item 3 resulted in a stronger correlation, *r*(`r f2_rel$item.stats[1, "n"] - 2`) = `r round(fa_rel_adrop$average_r[3], 2)`. This was explored further with the items statistics.

```{r}
# show the alpha drop values from the 3 subscales
  # create data frame with the information
  fa_rel_adrop = as.data.frame(rbind(dplyr::select(f1_rel$alpha.drop, c("raw_alpha", "std.alpha", "G6(smc)", "average_r")), 
                                     dplyr::select(f2_rel$alpha.drop, c("raw_alpha", "std.alpha", "G6(smc)", "average_r")), 
                                     dplyr::select(f3_rel$alpha.drop, c("raw_alpha", "std.alpha", "G6(smc)", "average_r")), 
                                     dplyr::select(f4_rel$alpha.drop, c("raw_alpha", "std.alpha", "G6(smc)", "average_r"))
                                     ))

  # use kable to show the table
  kable(fa_rel_adrop, caption = "Reliability Analysis for Subscales in the SAD: Alpha Drop Values", digits = 4, align = 'c', 
        col.names = c("Cronbach's alpha", "Standardized alpha", "Guttman's lambda 6", "Average r")) %>%
  kable_styling(bootstrap_options = "striped", "hover", full_width = T) %>%
  pack_rows(index = c("Factor 1" = ncol(f1_data), "Factor 2" = ncol(f2_data), "Factor 3" = ncol(f3_data), "Factor 4" = ncol(f4_data))) %>% 
    kable_classic()

```

##### (3) Item Statistics

The item statistics were explored to determine whether a certain item did not correlate with the overall subscale. The table below shows: *r*, which is the correlations between each item and the total score from the subscale (i.e., the item-total correlations), which includes the item itself; and *Drop*, which is the correlation of an item with the subscale after removing the item (i.e., the item-rest correlation or the corrected item-total correlation). Additionally, the sample size, mean, and standard deviation is shown.

-   No issues were found with the item statistics for Factor 1, *r*(`r f1_rel$item.stats[1, "n"] - 2`) `r p_round(min(fa_rel_item$r.drop[1:3]))` - `r p_round(max(fa_rel_item$r.drop[1:3]))`, Factor 2, *r*(`r f2_rel$item.stats[1, "n"] - 2`) `r p_round(min(fa_rel_item$r.drop[4:7]))` - `r p_round(max(fa_rel_item$r.drop[4:7]))`, Factor 3, *r*(`r f3_rel$item.stats[1, "n"] - 2`) `r p_round(min(fa_rel_item$r.drop[8:10]))` - `r p_round(max(fa_rel_item$r.drop[8:10]))`, and Factor 4, *r*(`r f3_rel$item.stats[1, "n"] - 2`) `r p_round(min(fa_rel_item$r.drop[11:12]))` - `r p_round(max(fa_rel_item$r.drop[11:12]))`. That is, no items had a item-rest correlation (i.e., the drop column in the table below) below the cut-off (i.e., .30; Field et al., 2012).

```{r}

# show the correlations for the 3 subscales
  # create data frame with the information
  fa_rel_item = as.data.frame(rbind(dplyr::select(f1_rel$item.stats, -c("std.r", "r.cor")), 
                                    dplyr::select(f2_rel$item.stats, -c("std.r", "r.cor")), 
                                    dplyr::select(f3_rel$item.stats, -c("std.r", "r.cor")),
                                    dplyr::select(f4_rel$item.stats, -c("std.r", "r.cor"))
                                    ))

  # use kable to show the table
  kable(fa_rel_item, caption = "Reliability Analysis for Subscales in the SAD: Item Statistics", digits = 4, align = 'c', 
        col.names = c("N", "r", "Drop", "M", "SD")) %>%
  kable_styling(bootstrap_options = "striped", "hover", full_width = T) %>%
  pack_rows(index = c("Factor 1" = ncol(f1_data), "Factor 2" = ncol(f2_data), "Factor 3" = ncol(f3_data), "Factor 4" = ncol(f4_data)))  %>% 
    kable_classic()

```

Overall, the reliability for Factor 1, Factor 2, Factor 3, and Factor 4 was adequate. Although Factor 1 showed mixed results for item 3 (i.e., item was not as stable for alpha), Factor 1 was stable for all other comparisons/measures, therefore item 3 was not removed from the subscale. Similarly, Factor 4 was inspected carefully since it was a two-tem factor. These results suggets that it is a stable factor and so it was included in further analyses.

#### Conclusions

##### The Present Study

A principal axis factor analysis was used to analyze the underlying factors in the 13-item SAD inventory (Ward et al., 2017) using an orthogonal rotation (i.e., varimax rotation). Factorability of the data was confirmed using: (1) Bartlett's test for correlation adequacy, $\chi^{2}$(`r fa_bartest$df`) = `r round(fa_bartest$chisq, 2)`, *p* `r p_round(fa_bartest$p.value)`, which confirmed that correlations between the items were sufficiently large; and (2) the Kaiser-Meyer-Olkin measure for sampling adequacy (MSA), $MSA_{overall}$ = `r round(fa_kmo$MSA, 2)`, $MSA_{1-13}$ = `r min(round(fa_kmo$MSAi, 2))` - `r max(round(fa_kmo$MSAi, 2))`, which confirmed that both the overall MSA and each item's MSA were above their required criteria (0.60 and 0.77, respectively; Kaiser, 1974). The appropriate number of factors (i.e., four) was confirmed using (1) a parallel analysis scree plot, where the two- , three-, and four-factor solutions were chosen to compare for the best fit for the data; (2) the eigen values (e.g., `r round(fa_four_v$e.values[1:4], 3)`, for factors 1 to 4, respectively) for the unrotated solution, where three meaningful factors were found using Kaiser's criterion (i.e., eigen values greater than one); and (3) the factor structure of the two-, three-, and four-factor solutions, where the four-factor solution showed a better distribution of the data and was more interpretable. Therefore, the four-factor solution was chosen for the final analysis and an orthogonal (i.e., varimax) rotation was chosen for the final analysis. After an initial four-factor solution was completed using all 13 items in the SAD, however, item 4 was split-loaded (i.e., between factor 1 and 3). Therefore, this item was dropped and a final four-factor solution was completed with the remaining 12 items in the SAD. This final solution achieved simple structure and was used for subsequent analyses.

The for each reliability of the four factors was assessed using Cronbach's $\alpha$, which was within the acceptable range (i.e., greater than .70; Kline, 1999) for Factor 1 ($\alpha_C$ = `r round(fa_rel_alphas$raw_alpha[1], 2)`), Factor 2 ($\alpha_C$ = `r round(fa_rel_alphas$raw_alpha[2], 2)`), and Factor 3 ($\alpha_C$ = `r round(fa_rel_alphas$raw_alpha[3], 2)`), but not for Factor 4 ($\alpha_C$ = `r round(fa_rel_alphas$raw_alpha[4], 2)`). Therefore, the reliability for Factor 1, Factor 2, and Factor 3 was adequate. Factor 1 did show a less stable item (i.e., 3), which showed a small improvement in correlation when dropped (from `r round(fa_rel_alphas$average_r[1], 2)` to `r round(fa_rel_adrop$average_r[3], 2)` ). Factor 4 was a weaker factor with only two items. Overall, the four factors all showed reliability and stability. Each was used as a separate subscale for following analyses. The mean scores for each factor were: Factor 1 *M* = `r round(des_ques_cont_across["SAD_dep_score","mean"], 2)` (*SD* = `r round(des_ques_cont_t$overall[17], 2)`), Factor 2 M = `r round(des_ques_cont_t$overall[21], 2)` (*SD* = `r round(des_ques_cont_t$overall[22], 2)`), Factor 3 *M* = `r round(des_ques_cont_t$overall[26], 2)` (*SD* = `r round(des_ques_cont_t$overall[27], 2)`), and Factor 4 *M* = `r round(des_ques_cont_t$overall[31], 2)` (*SD* = `r round(des_ques_cont_t$overall[32], 2)`). The model also showed a moderate fit: the Root Mean Square of the Residual (RMSR; `r round(fa_four_v2$rms, 2)`). Further descriptive statistics can be found in the [Descriptive Statistics section](#DS) above.

The items in the final solution's factors explained `r round(sum(var_per), 2)`% of the variance and suggested the following four factors: dependence, emotional attachment, accessibility, and   distractibility. Dependence was relate to the degree of dependence on one's smartphone, it consisted of 3 items (i.e., SAD 1, SAD 2, and SAD 3) and explained `r round(var_per_v[1], 2)`% of the variance (e.g., "I would have trouble getting through a normal day without my smartphone.", and "I feel like I could not live without my smartphone."). Emotional attachment was related to one's smartphone use for emotional support, it consisted of 4 items (i.e, SAD 8, 9, 10, and 11) and explained `r round(var_per_v[2], 2)`% of the variance (e.g., "I feel lonely when my smartphone does not ring or vibrate for several hours.", and "Using my smartphone makes me feel happy."). Accessibility was related to the ability to acees the utility of one's phone (e.g., powered on, internet access), it consisted of 3 items (i.e., SAD 5, 6, & 7) and explained `r round(var_per_v[3], 2)`% of the variance (e.g., "It drives me crazy when my smartphone runs out of battery.", and "I feel impatient when the Internet connection speed on my smartphone is slow."). Distractibility was related to one's smartphone retaining one's attention, it consisted of 2 items and explained `r round(var_per_v[4], 2)`% of the variance (e.g., "I find it tough to focus whenever my smartphone is nearby.", and "I become less attentive to my surroundings when I'm using my smartphone."). One item did not clearly load onto any one factor and was excluded from further analyses (i.e., item 4; Costello & Osborne, 2005). The tables below depict the final and initial (i.e., including item 4) four-factor solutions. three

```{r create & show fa_three_t, message=FALSE, warning=FALSE}
# create table with all items, factor loadings, eigen values, % of variance explained, and Cronbach's alpha 

# get factor loadings
  fa_load <- as.data.frame(fa_four_v2$loadings[1:12,])
  # change row names
    # row.names(fa_load) <- c(1:nrow(fa_load))
    
  # reorder loadings to match final table
    fa_load <- 
      fa_load %>%
      # add temporary column used to reorder
      mutate(item = c(row.names(fa_four_v2$loadings[1:12,]))) %>% 
      # reorder using slice & match
      slice(match(c("SAD_1", "SAD_2", "SAD_3", "SAD_8", "SAD_9", "SAD_10", "SAD_11", "SAD_5", "SAD_6", "SAD_7", "SAD_12", "SAD_13"), item)) %>% 
      # remove temp column
      select(-item)

# make list of the items & other info
  fa_table_list <- c("I would have trouble getting through a normal day without my smartphone.",
                    "It would be painful for me to give up my smartphone for a day.",
                    "I feel like I could not live without my smartphone.",
                    "I feel lonely when my smartphone does not ring or vibrate for several hours.",
                    "Using my smartphone relieves me of my stress.",
                    "I feel excited when I have a new message or notification.",
                    "Using my smartphone makes me feel happy.",
                    "It drives me crazy when my smartphone runs out of battery.",
                    "I am upset and annoyed when I find I do not have reception on my smartphone.",
                    "I feel impatient when the Internet connection speed on my smartphone is slow.",
                    "I find it tough to focus whenever my smartphone is nearby.",
                    "I become less attentive to my surroundings when I’m using my smartphone.",
                    "Eigen Values", 
                    "Variance Explained (%)")
  
# create the final table
  # bind the loadings and items
    fa_four_t = cbind(c(1:3, 8:11, 5:7, 12, 13, "", ""), fa_table_list, rbind(fa_load, var_per_t[5:8]))
    
    # rename columns & rows
      colnames(fa_four_t) = c("item_num", "item_desc", "f1", "f2", "f3", "f4")
      rownames(fa_four_t) = 1:nrow(fa_four_t)
      
# format cells
# fa_four_t$f1 <- cell_spec(fa_four_t$f1, bold = ifelse(fa_four_t$f1 > 0.40, T, F))
      
# use kable to show the table
table_fa_four_t <- 
kable(fa_four_t, caption = "Final Solution: Summary of exploratory varimax rotated factor analysis results for the 13-Items in the SAD", 
      digits = 2, align = "llcccc", col.names = c("", "", "1", "2", "3", "4")) %>%
    kable_styling(bootstrap_options = "striped", "hover", full_width = F) %>%
    add_header_above(c(" ", " ", "Factor Loading" = 4), bold = T) %>% 
    pack_rows("Factor 1: Dependence", 1, 3) %>% 
    pack_rows("Factor 2: Emotional Attachment", 4, 7) %>% 
    pack_rows("Factor 3: Accessibility", 8, 10) %>% 
    pack_rows("Factor 4: Distractibility", 11, 12) %>% 
    pack_rows("", 13, 14) %>% 
    footnote(general = "Values represent rotated factor loadings from items in the Smartphone Attachment and Dependency Inventory (Ward et al., 2017).
             Factor loadings have been sorted based on strong loading (> 0.40). Item #4 was split-loaded and removed from the final solution.", 
             symbol = c("N = 383")) %>% 
    kable_classic()

# show table
table_fa_four_t

#export table
#install PhantomJS using: "webshot::install_phantomjs()"
# save_kable(table_fa_four_t, "table_fa_four_t.pdf")
```

\

```{r create & show fa_three_t_in, message=FALSE, warning=FALSE}

## INITIAL SOLUTION -- for reference not the final solution

# use initial rotated 4-factor solution from above -- "fa_four_v"
    
# To find the eigenvalues & variance accounted for, you can either grab the calculated values from the model 
    # For the initial 4-factor solution
    var_ev_per_in <- data.frame(rbind(fa_four_v$e.values[1:4],  #eigen values 
                       100*as.numeric(fa_four_v$Vaccounted[2,]) # variance accounted for, in percentage
                       ))
    colnames(var_ev_per_in) = c("Factor 1", "Factor 2", "Factor 3", "Factor 4")
    rownames(var_ev_per_in) = c("Eigen Values", "Variance Explained (%)")

# create table with all items, factor loadings, eigen values, % of variance explained, and Cronbach's alpha 

# get factor loadings
  fa_load_in = as.data.frame(fa_four_v$loadings[1:13,])

# make list of the items & other info
  fa_table_list_in = data.frame("details" = c("I would have trouble getting through a normal day without my smartphone.",
                                     "It would be painful for me to give up my smartphone for a day.",
                                     "I feel like I could not live without my smartphone.",
                                     "If I forgot to bring my smartphone with me, I would feel anxious.",
                                     "It drives me crazy when my smartphone runs out of battery.",
                                     "I am upset and annoyed when I find I do not have reception on my smartphone.",
                                     "I feel impatient when the Internet connection speed on my smartphone is slow.",
                                     "I feel lonely when my smartphone does not ring or vibrate for several hours.",
                                     "Using my smartphone relieves me of my stress.",
                                     "I feel excited when I have a new message or notification.",
                                     "Using my smartphone makes me feel happy.",
                                     "I find it tough to focus whenever my smartphone is nearby.",
                                     "I become less attentive to my surroundings when I’m using my smartphone.", 
                                     "Eigen Values", 
                                     "Variance Explained (%)"))
  
# create the final table
  # bind the loadings and items
    fa_four_t_in = cbind(fa_table_list_in, rbind(fa_load_in, var_ev_per_in))
    
    # rename columns & rows
      colnames(fa_four_t_in) = c("Item", "Factor 1", "Factor 2", "Factor 3", "Factor 4")
      rownames(fa_four_t_in) = c(1:13, "", " ")
      
# use kable to show the table
  kable(fa_four_t_in, caption = "First Solution: Summary of exploratory varimax rotated factor analysis results for the 13-Items in the SAD.", digits = 2, align = "lccc") %>%
    kable_styling(bootstrap_options = "striped", "hover", full_width = F) %>%
    pack_rows("", 14, 15) %>% 
    footnote(general = "The initial solution is shown, including item #4, which was dropped as a split variable between factor 1 and factor 3.", symbol = "N = 383") %>% 
    kable_classic()

```

##### Ward et al. (2017)

Ward et al. (2017) found two factors: smartphone dependence (i.e. six items; items #1-6; 31.02% of the variance) and emotional attachment (i.e. five items; items #8-11; 21.65% of the variance), which explained 52.67% of the total variance. Two items (i.e. item #7 and 13) were dropped from their subsequent analyses since they did not strongly load onto a factor (see Web Appendix Table 1 below).

\

<center>

![Note: this table is from Ward et al. (2017) Web Appendix. Items are listed in order (i.e., 1-13). Only strongly-loaded values are shown.](/Users/anaceciliaruizpardo/RStudio/Smartphone_study/Study%201-%20Smartphone%20Replication/Dissertation%20(04:20:20)/ward_FA_appendix_table.png)

</center>

##### Comparing Findings

The present study's final factor solution showed some similarities with Ward et al.'s solution:

-   Factor 1 contained items \# 1, 2, 3, 4, and 5
-   Factor 2 contained items \# 8, 9, 10, and 11

Some differences between the solutions:

- Only one item (i.e., item #4 due to split-loading) was dropped in the present study compared to the two items dropped in Ward et al. (i.e., items 7 due to split-loading to and 13 due to being in a weak, single-item factor).
- Factor 3 did have more than one item load strongly (items 5, 6, 7) and was not dropped.
- Four factors were in the final solution in the present study. 

Overall, the analyses were very similar. We compared the total variances explained and found that both the initial (`r round(sum(var_ev_per_in[2,]), 2)`%) and final (`r round(sum(var_per_v), 2)`%) solution in the present study explained more total variance than the Ward et al. (2017) solution (52.67%). Comparing the reliability tests showed that Ward et al.'s subscales ($\alpha_{Factor1}$ = 0.89; $\alpha_{Factor2}$ = 0.79) had slightly larger values than the resent study ($\alpha_{Factor1}$ = `r round(fa_rel_alphas$raw_alpha[1], 2)`; $\alpha_{Factor2}$ = `r round(fa_rel_alphas$raw_alpha[2], 2)`; $\alpha_{Factor3}$ = `r round(fa_rel_alphas$raw_alpha[3], 2)`; $\alpha_{Factor4}$ = `r round(fa_rel_alphas$raw_alpha[4], 2)`).


### Correlations: SAD Subscales vs. OSpan 
```{r main_corrs_overall}
# set up the data & run correlation
SAD_corr <- 
  all_data %>% 
  select(ospan_absolute_score, gng_err_omi, gng_rt_av , SAD_score, SAD_dep_score, SAD_ea_score, SAD_access_score, SAD_dist_score, age) %>% 
  as.matrix() %>% 
  rcorr(type = "pearson")

# create new SAD_corr to shown only lower triangle... 
SAD_corr2 <- SAD_corr
# round to 4 decimals... 
SAD_corr2$r <- round(SAD_corr2$r, 4)
SAD_corr2$P <- round(SAD_corr2$P, 4)
SAD_corr2$n <- round(SAD_corr2$n, 4)
# remove upper triangle form r, p, and n
SAD_corr2$r[upper.tri(SAD_corr2$r)] <- "-"
SAD_corr2$P[upper.tri(SAD_corr2$P)] <- "-"
SAD_corr2$n[upper.tri(SAD_corr2$n)] <- "-"

# show corr table with flattenCorr

kable(flattenCorrMatrix(SAD_corr$r, SAD_corr$P), caption = "Correlations between SAD Score and Age with OSpan Performance") %>%
  footnote(general = "SAD overall and Smartphone Dependency, Emotional Attachment, Accessibility, and Distractibility shown. ") %>% 
  kable_styling(bootstrap_options = c("hover", "striped"), full_width = F) %>% 
  kable_classic()

# print tables using kable
kable(as.data.frame(format(SAD_corr2$r, scientific = FALSE)), caption = "Correlations between SAD Score and Age with OSpan Performance: r values") %>%
  footnote(general = "SAD overall and Smartphone Dependency, Emotional Attachment, Accessibility, and Distractibility shown. ") %>% 
  kable_styling(bootstrap_options = c("hover", "striped"), full_width = F) %>% 
  kable_classic()
  

kable(as.data.frame(format(SAD_corr2$P, scientific = FALSE)), caption = "Correlations between SAD Score and Age with OSpan Performance: p-values") %>%
  footnote(general = "SAD overall and Smartphone Dependency, Emotional Attachment, Accessibility, and Distractibility shown. ") %>% 
  kable_styling(bootstrap_options = c("hover", "striped"), full_width = F) %>% 
  kable_classic()

kable(as.data.frame(format(SAD_corr2$n, scientific = FALSE)), caption = "Correlations between SAD Score and Age with OSpan Performance: n values") %>%
  footnote(general = "SAD overall and Smartphone Dependency, Emotional Attachment, Accessibility, and Distractibility shown. ") %>% 
  kable_styling(bootstrap_options = c("hover", "striped"), full_width = F) %>% 
  kable_classic()

# does not fit nicely on page, possibly change later...
corrplot(SAD_corr$r, method = "circle", col = (colorRampPalette(c("purple", "grey", "blue"))(50)),
         type = "upper",
         # addCoef.col = "black", # Add coefficient of correlation
         tl.col = "darkblue", tl.srt = 90, tl.cex = .8, #Text label color and rotation
         # Combine with significance level
         p.mat = SAD_corr$P, sig.level = 0.05,
         addgrid.col = "white",
         # insig = "blank",
         insig = "pch", pch = 10, pch.col = "red", pch.cex = .1, # add this instead of insig above to denot insig p values with red dot
         # hide correlation coefficient on the principal diagonal
         diag = FALSE,
         win.asp = 1
         )
```
### Exploratory Moderation Analyses

Potential moderators of OSpan performance were explored. These included both (1) continuous and (2) categorical moderators. *Note: These analysis were completed regardless of the results for the location and power hypotheses.*

**1. Continuous Moderators**

These variables were examined using a **general linear model for each potential moderator**: DV = OSpan Absolute Score, IV: Smartphone Location (desk, pocket/bag, outside), M: see list below. Simple slopes for each moderator x IV comparison were explored.

-   SAD Score: Smartphone Dependence, Emotional Attachment, Distractibilty
-   SUQ Score: Smartphone use with absence of other stimulation, Smartphone use during other actions
-   Demographic Info: Age

Each moderation analysis was completed using the MeMoBootR package's moderation model 1 using a categorical IV (from Hayes, 2017; see figures below). This package provided: assumption tests, outliers, simple slopes, and a graph of the simple slopes. Since the IV was categorical (i.e., smartphone location: desk, pocket/bag, or outside), the variable is coded as a factor so that it is not misinterpreted as a continuous variable. This means that the IV is *dummy coded* and the analysis uses the first of the factors as the "*comparison group*". In order to assess all possible comparions between the three location groups, the factor is re-ordered and run again. Additionally, to allow for better interpretation, the continuous moderator was *mean-centered*. The *dummy coding* and *mean-centered scoring* of the continuous moderator is done within the MeMoBootR package.

\

<center>

![Conceptual diagram of a simple moderation model from Hayes (2017)](/Users/anaceciliaruizpardo/RStudio/Smartphone_study/Study%201-%20Smartphone%20Replication/Dissertation%20(04:20:20)/hayes_moderation_model1.png)

</center>

\
\
\

<center>

![Statistical diagram of a simple moderation model with multicategorical independent variable with g categories from Hayes (2017)](/Users/anaceciliaruizpardo/RStudio/Smartphone_study/Study%201-%20Smartphone%20Replication/Dissertation%20(04:20:20)/hayes_moderation_model1_stats_catX.png)

</center>

**2. Categorical Moderators**

These variables were examined using a **two-way ANOVA for each potential moderator**: DV = OSpan Absolute Score, $IV_{1}$: Smartphone Location (desk, pocket/bag, outside), $IV_{M}$: see list below. Simple main effects for each moderator x IV comparison were explored.

-   SUQ Score: Texts per Day, Social Media Messages per Day, Social Media Posts per Day, Subjective Smartphone Value, Notification Type, Phantom Vibrations
-   Demographic Info: Gender, Program, Academic Year, First Language

#### Continuous Moderators

To examine the potential moderation of smartphone reliance (i.e. Smartphone Dependence, Emotional Attachment, and Distractibility) and age on the relationship between the experimental manipulation and task performance on the OSpan, a generalized linear model was used. Each potential moderator was evaluated separately, along with the relevant assumption tests (e.g., normality). The data was screened for outliers (i.e., cooks distance and leverage) and participants were removed from the analysis the criteria for both cooks and leverage was met.

##### Set up the Data

<details>

<summary>

First, in order to provide all possible comparisons in the analyses, 2 new *location* variables were created in the *mod_analysis* data frame. Each was a factored version of the original smartphone location variable, where the order of the factors were either (1) *desk \> pocket/bag \> outside* (i.e., "*location_desk*") or (2) *outside \> pocket/bag \> desk* (i.e., "*location_out*"). *(click to see code)*

</summary>

```{r modify_mod_analysis_add_loc_vars, message=FALSE, warning=FALSE}
# change location var as factor with different orders -- 2 new vars
mod_analysis <- 
  mod_analysis %>% 
  # "location_desk": desk vs. pocket/bag &  desk vs. outside
  mutate(location_desk = factor(location, levels = c(1, 2, 3), 
                                labels = c("desk", "pocketbag", "outside"))) %>% 
  # "location_out": outside vs. desk &  outside vs. pocket/bag
  mutate(location_out = factor(location, levels = c(3, 2, 1), 
                               labels = c("outside", "pocketbag", "desk")))
```

</details>

##### Run Analyses

<details>

<summary>

Next, each moderation analysis was run to use the outputs *(click to see code)*.

</summary>

Run the moderation analyses for SAD variable(s): Smartphone Dependency, Emotional Attachment, and Distractibility.

```{r run_mod_SAD_dep, message=FALSE, warning=FALSE, result='hide'}
#-- Run moderation analysis using moderation1() in the MeMoBootR package
  # note: moderator (m) and IV (x) are switched since moderator is continuous & IV is categorical in present study
    # was checked vs. manual lm() version & computes exact same values
  # note: analysis mean-centers the continuous var automatically
  # note: this analysis will compare: desk vs. pocket/bag&  desk vs. outside initially 
    # then, the data is re-organized to compare: pocket/bag vs. outside 
    # this was saved into 2 separate analyses, but most output is common b/w them

## SMARTPHONE DEPENDENCE ##
# run & save analysis to view components separately 
# for desk 1st
mod_SAD_dep1 <-  
  moderation1(y = "ospan_absolute_score", # DV
              x = "SAD_dep_score", # continuous var
              m = "location_desk", # categorical var -- will show simple slopes
              cvs = NULL, # covariates (if any)
              df = mod_analysis, # data frame of columns
              with_out = F) # remove outliers

# Can view the full data set, including the identified outliers (last columns)
# View(mod_SAD_dep1$datascreening$fulldata)

# for outside 1st
mod_SAD_dep2 <-  
  moderation1(y = "ospan_absolute_score", # DV
              x = "SAD_dep_score", # continuous var
              m = "location_out", #categorical var -- will show simple slopes
              cvs = NULL, # covariates (if any)
              df = mod_analysis, # data frame of columns
              with_out = F) # remove outliers

# Can view the full data set, including the identified outliers (last columns)
# View(mod_SAD_dep2$datascreening$fulldata)

## EMOTIONAL ATTACHMENT ##
# run & save analysis to view components separately 
# for desk 1st
mod_SAD_ea1 <-  
  moderation1(y = "ospan_absolute_score", # DV
              x = "SAD_ea_score", # continuous var
              m = "location_desk", # categorical var -- will show simple slopes
              cvs = NULL, # covariates (if any)
              df = mod_analysis, # data frame of columns
              with_out = F) # remove outliers

# Can view the full data set, including the identified outliers (last columns)
# View(mod_SAD_ea1$datascreening$fulldata)

# for outside 1st
mod_SAD_ea2 <-  
  moderation1(y = "ospan_absolute_score", # DV
              x = "SAD_ea_score", # continuous var
              m = "location_out", #categorical var -- will show simple slopes
              cvs = NULL, # covariates (if any)
              df = mod_analysis, # data frame of columns
              with_out = F) # remove outliers

# Can view the full data set, including the identified outliers (last columns)
# View(mod_SAD_ea2$datascreening$fulldata)

## ACCESSIBILITY ##
# run & save analysis to view components separately 
# for desk 1st
mod_SAD_access1 <-  
  moderation1(y = "ospan_absolute_score", # DV
              x = "SAD_access_score", # continuous var
              m = "location_desk", # categorical var -- will show simple slopes
              cvs = NULL, # covariates (if any)
              df = mod_analysis, # data frame of columns
              with_out = F) # remove outliers

# Can view the full data set, including the identified outliers (last columns)
# View(mod_SAD_access1$datascreening$fulldata)

# for outside 1st
mod_SAD_access2 <-  
  moderation1(y = "ospan_absolute_score", # DV
              x = "SAD_access_score", # continuous var
              m = "location_out", #categorical var -- will show simple slopes
              cvs = NULL, # covariates (if any)
              df = mod_analysis, # data frame of columns
              with_out = F) # remove outliers

# Can view the full data set, including the identified outliers (last columns)
# View(mod_SAD_access2$datascreening$fulldata)

## SMARTPHONE DISTRACTIBILITY ##
# run & save analysis to view components separately 
# for desk 1st
mod_SAD_dist1 <-  
  moderation1(y = "ospan_absolute_score", # DV
              x = "SAD_dist_score", # continuous var
              m = "location_desk", # categorical var -- will show simple slopes
              cvs = NULL, # covariates (if any)
              df = mod_analysis, # data frame of columns
              with_out = F) # remove outliers

# Can view the full data set, including the identified outliers (last columns)
# View(mod_SAD_dist1$datascreening$fulldata)

# for outside 1st
mod_SAD_dist2 <-  
  moderation1(y = "ospan_absolute_score", # DV
              x = "SAD_dist_score", # continuous var
              m = "location_out", #categorical var -- will show simple slopes
              cvs = NULL, # covariates (if any)
              df = mod_analysis, # data frame of columns
              with_out = F) # remove outliers

# Can view the full data set, including the identified outliers (last columns)
# View(mod_SAD_dist2$datascreening$fulldata)

```

Run the moderation analyses for SUQ variable(s): Age.

```{r run_mod_SUQ_age, message=FALSE, warning=FALSE, result='hide'}
#-- Run moderation analysis using moderation1() in the MeMoBootR package
  # note: moderator (m) and IV (x) are switched since moderator is continuous & IV is categorical in present study
    # was checked vs. manual lm() version & computes exact same values
  # note: analysis mean-centers the continuous var automatically
  # note: this analysis will compare: desk vs. pocket/bag&  desk vs. outside initially 
    # then, the data is re-organized to compare: pocket/bag vs. outside 
    # this was saved into 2 separate analyses, but most output is common b/w them

## AGE ##
# run & save analysis to view components separately 
# for desk 1st
mod_SUQ_age1 <-  
  moderation1(y = "ospan_absolute_score", # DV
              x = "age", # continuous var
              m = "location_desk", # categorical var -- will show simple slopes
              cvs = NULL, # covariates (if any)
              df = mod_analysis, # data frame of columns
              with_out = F) # remove outliers

# Can view the full data set, including the identified outliers (last columns)
# View(mod_SUQ_age1$datascreening$fulldata)

# for outside 1st
mod_SUQ_age2 <-  
  moderation1(y = "ospan_absolute_score", # DV
              x = "age", # continuous var
              m = "location_out", #categorical var -- will show simple slopes
              cvs = NULL, # covariates (if any)
              df = mod_analysis, # data frame of columns
              with_out = F) # remove outliers

# Can view the full data set, including the identified outliers (last columns)
# View(mod_SUQ_age2$datascreening$fulldata)

```

</details>

##### Outliers

The model identified outliers based on leverage and cook's distance and removed participants who were outside of the criteria for both measures.

-   **Leverage**

    -   Tests for how much influence over the slope a single observation (i.e., participant) has: how much an individual score will change the b-values
    -   Scores that fall outside of the "cut-off" criteria have a large effect on the slope independent of other score and are therefore considered outliers. These are defined with the equation $\frac{2K+2}{N}$, where *K* is the number of IVs/predictors and *N* is the sample size.

-   **Cook's Distance**

    -   Tests for influential observations: the combination of leverage and discrepancy (i.e., how far a score is from other data points).
    -   Scores that fall outside of the "cut-off" criteria have a large effect on the slope independent of other score and/or are far from the rest of the data and are therefore considered outliers. These are defined with the equation $\frac{4}{N-K-1}$, where *K* is the number of IVs/predictors and *N* is the sample size.

-   **Participants Removed**

    -   Any participant who fell outside of both the leverage and cook criteria were removed from each analysis. This was done separately for each analysis (see table below).

```{r create_mod_cont_out_t, message=FALSE, warning=FALSE}
# create a table depicting the total outliers removed for each continuous moderation analysis
# details shown for 1st (dependency)
mod_cont_out_t <- 
  as.data.frame(cbind(c("Dependency", "Emotional Attachment", "Accessibility", "Distractibility", "Age"), 
                      rbind(
                      # for SAD_dep
                      c(sum(mod_SAD_dep1$datascreening$fulldata$badleverage), # total ps that fell outside of the leverage criteria
                        sum(mod_SAD_dep1$datascreening$fulldata$badcooks), # total ps that fell outside of the cooks criteria
                        sum(mod_SAD_dep1$datascreening$fulldata$totalout == 1), # total ps that fell outside of ONE outlier criteria (i.e., leverage OR cooks)
                        sum(mod_SAD_dep1$datascreening$fulldata$totalout == 2), # total ps that fell outside of BOTH outlier criteria (i.e., leverage AND cooks)
                        nrow(mod_SAD_dep1$datascreening$fulldata) - sum(mod_SAD_dep1$datascreening$fulldata$totalout == 2) # total sample size used in given analysis
                        ), 
                      # for SAD_ea
                      c(sum(mod_SAD_ea1$datascreening$fulldata$badleverage), 
                        sum(mod_SAD_ea1$datascreening$fulldata$badcooks), 
                        sum(mod_SAD_ea1$datascreening$fulldata$totalout == 1), 
                        sum(mod_SAD_ea1$datascreening$fulldata$totalout == 2), 
                        nrow(mod_SAD_ea1$datascreening$fulldata) - sum(mod_SAD_ea1$datascreening$fulldata$totalout == 2)
                        ),
                      # for SAD_access
                      c(sum(mod_SAD_access1$datascreening$fulldata$badleverage), 
                        sum(mod_SAD_access1$datascreening$fulldata$badcooks), 
                        sum(mod_SAD_access1$datascreening$fulldata$totalout == 1), 
                        sum(mod_SAD_access1$datascreening$fulldata$totalout == 2), 
                        nrow(mod_SAD_access1$datascreening$fulldata) - sum(mod_SAD_access1$datascreening$fulldata$totalout == 2)
                        ),
                      # for SAD_dist
                      c(sum(mod_SAD_dist1$datascreening$fulldata$badleverage), 
                        sum(mod_SAD_dist1$datascreening$fulldata$badcooks), 
                        sum(mod_SAD_dist1$datascreening$fulldata$totalout == 1), 
                        sum(mod_SAD_dist1$datascreening$fulldata$totalout == 2), 
                        nrow(mod_SAD_dist1$datascreening$fulldata) - sum(mod_SAD_dist1$datascreening$fulldata$totalout == 2)
                        ),
                      # for SUQ_age
                      c(sum(mod_SUQ_age1$datascreening$fulldata$badleverage), 
                        sum(mod_SUQ_age1$datascreening$fulldata$badcooks), 
                        sum(mod_SUQ_age1$datascreening$fulldata$totalout == 1), 
                        sum(mod_SUQ_age1$datascreening$fulldata$totalout == 2), 
                        nrow(mod_SUQ_age1$datascreening$fulldata) - sum(mod_SUQ_age1$datascreening$fulldata$totalout == 2)
                        )
                      )
                      )
                )

# rename columns
colnames(mod_cont_out_t) <- c(" ", "Leverage", "Cooks", "Both", "Removed", "N")

# use kable to show table
kable(mod_cont_out_t, caption = "Summary of identified and or removed outliers in GLM mderation analysis.", align = 'rccccc') %>%
  footnote(general = "Sample size prior to removing outlier data for all analyses was 383.") %>% 
  column_spec(1, bold = T) %>%
  add_header_above(c(" ", "Criteria" = 2, "Total Criteria" = 2, "")) %>% 
  kable_styling(bootstrap_options = "striped", "hover", full_width = F) %>%
  kable_classic()
```

##### Assumptions

<details>

<summary>

The model completed assumption tests for: (1) additivity/multicollinearity, (2) linearity, (3) normality, (4) homogeneity and homoscedasticity. All assumptions were met for all moderators. *(click to see details)*

</summary>

###### (1) Additivity / Multicollinearity

Additivity / Multicollinearity was evaluated for each moderator. Correlations between the moderator and each IV comparison was visually inspected to ensure the correlations were not too high or low (i.e., approximately ranging between .30 to .80). All moderators' correlations were within an acceptable range.

\
**Dependency Moderator**: correlations ranged from `r round(min(mod_SAD_dep_corr_t[1:3, c(2:4, 9:11)]), 2)` to `r round(max(mod_SAD_dep_corr_t[1:3, c(2:4, 9:11)][mod_SAD_dep_corr_t[1:3, c(2:4, 9:11)] !=max(mod_SAD_dep_corr_t[1:3, c(2:4, 9:11)])]), 2)`.

```{r create & show mod_SAD_dep_corr_t, message=FALSE, warning=FALSE}
# grab table from output
mod_SAD_dep_corr_t <- 
  as.data.frame(cbind(c("Dependency Score", "Desk vs. Pocket/Bag", "Desk vs. Outside", "Dep x DvPB", "Dep x DvO"), 
                      mod_SAD_dep1$datascreening$correl[2:6, 2:6],
                      c(6:10),
                      c("Dependency Score", "Outside vs. Pocket/Bag", "Outside vs. Desk", "Dep x OvPB", "Dep x OvD"),
                      mod_SAD_dep2$datascreening$correl[2:6, 2:6])
                )
# rename rows & columns
colnames(mod_SAD_dep_corr_t) <- c("var1", "Dependency Score", "Desk vs. Pocket/Bag", "Desk vs. Outside", "Dep x DvPB", "Dep x DvO",
                                  "row", "var2",
                             "Dependency Score", "Outside vs. Pocket/Bag", "Outside vs. Desk", "Dep x OvPB", "Dep x OvD")
rownames(mod_SAD_dep_corr_t) <- 1:nrow(mod_SAD_dep_corr_t)

# reset values as numeric
mod_SAD_dep_corr_t[,2] <- as.numeric(mod_SAD_dep_corr_t[,2])
mod_SAD_dep_corr_t[,3] <- as.numeric(mod_SAD_dep_corr_t[,3])
mod_SAD_dep_corr_t[,4] <- as.numeric(mod_SAD_dep_corr_t[,4])
mod_SAD_dep_corr_t[,5] <- as.numeric(mod_SAD_dep_corr_t[,5])
mod_SAD_dep_corr_t[,6] <- as.numeric(mod_SAD_dep_corr_t[,6])
mod_SAD_dep_corr_t[,9] <- as.numeric(mod_SAD_dep_corr_t[,9])
mod_SAD_dep_corr_t[,10] <- as.numeric(mod_SAD_dep_corr_t[,10])
mod_SAD_dep_corr_t[,11] <- as.numeric(mod_SAD_dep_corr_t[,11])
mod_SAD_dep_corr_t[,12] <- as.numeric(mod_SAD_dep_corr_t[,12])
mod_SAD_dep_corr_t[,13] <- as.numeric(mod_SAD_dep_corr_t[,13])

# Print table using kable
kable(mod_SAD_dep_corr_t, caption = "Correlations for the dependency moderator.", 
      digits = 2, align = 'rccccccrrcccc', row.names = T, col.names = c(" ", 1:5, "", "", 6:10)) %>%
  # footnote(general = "Interaction variables ") %>% 
  kable_styling(bootstrap_options = "striped", "hover", full_width = F) %>%
  add_header_above(c(" " = 2, "Moderator", "Location" = 2, "Interaction" = 2, " " = 2, "Moderator", "Location" = 2, "Interaction" = 2), bold = T) %>%
  add_header_above(c(" " = 2, "Comparison Group: \nDesk" = 5, " " = 2, "Comparison Group: \nOutside" = 5)) %>%
  column_spec(2, bold = T) %>%
  column_spec(9, bold = T) %>%
  column_spec(7, border_right = T) %>%
  kable_classic()
```

\
**Emotional Attachment Moderator**: correlations ranged from `r round(min(mod_SAD_ea_corr_t[1:3, c(2:4, 9:11)]), 2)` to `r round(max(mod_SAD_ea_corr_t[1:3, c(2:4, 9:11)][mod_SAD_ea_corr_t[1:3, c(2:4, 9:11)] !=max(mod_SAD_ea_corr_t[1:3, c(2:4, 9:11)])]), 2)`.

```{r create & show mod_SAD_ea_corr_t, message=FALSE, warning=FALSE}
# grab table from output
mod_SAD_ea_corr_t <- 
  as.data.frame(cbind(c("Emotional Attachment Score", "Desk vs. Pocket/Bag", "Desk vs. Outside", "EA x DvPB", "EA x DvO"), 
                      mod_SAD_ea1$datascreening$correl[2:6, 2:6],
                      c(6:10),
                      c("Emotional Attachment Score", "Outside vs. Pocket/Bag", "Outside vs. Desk", "EA x OvPB", "EA x OvD"),
                      mod_SAD_ea2$datascreening$correl[2:6, 2:6])
                )
# rename rows & columns
colnames(mod_SAD_ea_corr_t) <- c("var1", "Emotional Attachment Score", "Desk vs. Pocket/Bag", "Desk vs. Outside", "EA x DvPB", "EA x DvO",
                                  "row", "var2",
                             "Emotional Attachment Score", "Outside vs. Pocket/Bag", "Outside vs. Desk", "EA x OvPB", "EA x OvD")
rownames(mod_SAD_ea_corr_t) <- 1:nrow(mod_SAD_ea_corr_t)

# reset values as numeric
mod_SAD_ea_corr_t[,2] <- as.numeric(mod_SAD_ea_corr_t[,2])
mod_SAD_ea_corr_t[,3] <- as.numeric(mod_SAD_ea_corr_t[,3])
mod_SAD_ea_corr_t[,4] <- as.numeric(mod_SAD_ea_corr_t[,4])
mod_SAD_ea_corr_t[,5] <- as.numeric(mod_SAD_ea_corr_t[,5])
mod_SAD_ea_corr_t[,6] <- as.numeric(mod_SAD_ea_corr_t[,6])
mod_SAD_ea_corr_t[,9] <- as.numeric(mod_SAD_ea_corr_t[,9])
mod_SAD_ea_corr_t[,10] <- as.numeric(mod_SAD_ea_corr_t[,10])
mod_SAD_ea_corr_t[,11] <- as.numeric(mod_SAD_ea_corr_t[,11])
mod_SAD_ea_corr_t[,12] <- as.numeric(mod_SAD_ea_corr_t[,12])
mod_SAD_ea_corr_t[,13] <- as.numeric(mod_SAD_ea_corr_t[,13])

# Print table using kable
kable(mod_SAD_ea_corr_t, caption = "Correlations for the emotional attachment moderator.", 
      digits = 2, align = 'rccccccrrcccc', row.names = T, col.names = c(" ", 1:5, "", "", 6:10)) %>%
  # footnote(general = "Interaction variables ") %>% 
  kable_styling(bootstrap_options = "striped", "hover", full_width = F) %>%
  add_header_above(c(" " = 2, "Moderator", "Location" = 2, "Interaction" = 2, " " = 2, "Moderator", "Location" = 2, "Interaction" = 2), bold = T) %>%
  add_header_above(c(" " = 2, "Comparison Group: \nDesk" = 5, " " = 2, "Comparison Group: \nOutside" = 5)) %>%
  column_spec(2, bold = T) %>%
  column_spec(9, bold = T) %>%
  column_spec(7, border_right = T) %>%
  kable_classic()
```

\
**Accessibility Moderator**: correlations ranged from `r round(min(mod_SAD_access_corr_t[1:3, c(2:4, 9:11)]), 2)` to `r round(max(mod_SAD_access_corr_t[1:3, c(2:4, 9:11)][mod_SAD_access_corr_t[1:3, c(2:4, 9:11)] !=max(mod_SAD_access_corr_t[1:3, c(2:4, 9:11)])]), 2)`.

```{r create & show mod_SAD_access_corr_t, message=FALSE, warning=FALSE}
# grab table from output
mod_SAD_access_corr_t <- 
  as.data.frame(cbind(c("Emotional Attachment Score", "Desk vs. Pocket/Bag", "Desk vs. Outside", "Access x DvPB", "Access x DvO"), 
                      mod_SAD_access1$datascreening$correl[2:6, 2:6],
                      c(6:10),
                      c("Emotional Attachment Score", "Outside vs. Pocket/Bag", "Outside vs. Desk", "Access x OvPB", "Access x OvD"),
                      mod_SAD_access2$datascreening$correl[2:6, 2:6])
                )
# rename rows & columns
colnames(mod_SAD_access_corr_t) <- c("var1", "Emotional Attachment Score", "Desk vs. Pocket/Bag", "Desk vs. Outside", "Access x DvPB", "Access x DvO",
                                  "row", "var2",
                             "Emotional Attachment Score", "Outside vs. Pocket/Bag", "Outside vs. Desk", "Access x OvPB", "Access x OvD")
rownames(mod_SAD_access_corr_t) <- 1:nrow(mod_SAD_access_corr_t)

# reset values as numeric
mod_SAD_access_corr_t[,2] <- as.numeric(mod_SAD_access_corr_t[,2])
mod_SAD_access_corr_t[,3] <- as.numeric(mod_SAD_access_corr_t[,3])
mod_SAD_access_corr_t[,4] <- as.numeric(mod_SAD_access_corr_t[,4])
mod_SAD_access_corr_t[,5] <- as.numeric(mod_SAD_access_corr_t[,5])
mod_SAD_access_corr_t[,6] <- as.numeric(mod_SAD_access_corr_t[,6])
mod_SAD_access_corr_t[,9] <- as.numeric(mod_SAD_access_corr_t[,9])
mod_SAD_access_corr_t[,10] <- as.numeric(mod_SAD_access_corr_t[,10])
mod_SAD_access_corr_t[,11] <- as.numeric(mod_SAD_access_corr_t[,11])
mod_SAD_access_corr_t[,12] <- as.numeric(mod_SAD_access_corr_t[,12])
mod_SAD_access_corr_t[,13] <- as.numeric(mod_SAD_access_corr_t[,13])

# Print table using kable
kable(mod_SAD_access_corr_t, caption = "Correlations for the accessibility moderator.", 
      digits = 2, align = 'rccccccrrcccc', row.names = T, col.names = c(" ", 1:5, "", "", 6:10)) %>%
  # footnote(general = "Interaction variables ") %>% 
  kable_styling(bootstrap_options = "striped", "hover", full_width = F) %>%
  add_header_above(c(" " = 2, "Moderator", "Location" = 2, "Interaction" = 2, " " = 2, "Moderator", "Location" = 2, "Interaction" = 2), bold = T) %>%
  add_header_above(c(" " = 2, "Comparison Group: \nDesk" = 5, " " = 2, "Comparison Group: \nOutside" = 5)) %>%
  column_spec(2, bold = T) %>%
  column_spec(9, bold = T) %>%
  column_spec(7, border_right = T) %>%
  kable_classic()
```

\
**Distractibility Moderator**: correlations ranged from `r round(min(mod_SAD_dist_corr_t[1:3, c(2:4, 9:11)]), 2)` to `r round(max(mod_SAD_dist_corr_t[1:3, c(2:4, 9:11)][mod_SAD_dist_corr_t[1:3, c(2:4, 9:11)] !=max(mod_SAD_dist_corr_t[1:3, c(2:4, 9:11)])]), 2)`.

```{r create & show mod_SAD_dist_corr_t, message=FALSE, warning=FALSE}
# grab table from output
mod_SAD_dist_corr_t <- 
  as.data.frame(cbind(c("Distractibility Score", "Desk vs. Pocket/Bag", "Desk vs. Outside", "Dist x DvPB", "Dist x DvO"), 
                      mod_SAD_dist1$datascreening$correl[2:6, 2:6],
                      c(6:10),
                      c("Distractibility Score", "Outside vs. Pocket/Bag", "Outside vs. Desk", "Dist x OvPB", "Dist x OvD"),
                      mod_SAD_dist2$datascreening$correl[2:6, 2:6])
                )
# rename rows & columns
colnames(mod_SAD_dist_corr_t) <- c("var1", "Distractibility Score", "Desk vs. Pocket/Bag", "Desk vs. Outside", "Dist x DvPB", "Dist x DvO",
                                  "row", "var2",
                             "Distractibility Score", "Outside vs. Pocket/Bag", "Outside vs. Desk", "Dist x OvPB", "Dist x OvD")
rownames(mod_SAD_dist_corr_t) <- 1:nrow(mod_SAD_dist_corr_t)

# reset values as numeric
mod_SAD_dist_corr_t[,2] <- as.numeric(mod_SAD_dist_corr_t[,2])
mod_SAD_dist_corr_t[,3] <- as.numeric(mod_SAD_dist_corr_t[,3])
mod_SAD_dist_corr_t[,4] <- as.numeric(mod_SAD_dist_corr_t[,4])
mod_SAD_dist_corr_t[,5] <- as.numeric(mod_SAD_dist_corr_t[,5])
mod_SAD_dist_corr_t[,6] <- as.numeric(mod_SAD_dist_corr_t[,6])
mod_SAD_dist_corr_t[,9] <- as.numeric(mod_SAD_dist_corr_t[,9])
mod_SAD_dist_corr_t[,10] <- as.numeric(mod_SAD_dist_corr_t[,10])
mod_SAD_dist_corr_t[,11] <- as.numeric(mod_SAD_dist_corr_t[,11])
mod_SAD_dist_corr_t[,12] <- as.numeric(mod_SAD_dist_corr_t[,12])
mod_SAD_dist_corr_t[,13] <- as.numeric(mod_SAD_dist_corr_t[,13])

# Print table using kable
kable(mod_SAD_dist_corr_t, caption = "Correlations for the distractibility moderator.", 
      digits = 2, align = 'rccccccrrcccc', row.names = T, col.names = c(" ", 1:5, "", "", 6:10)) %>%
  # footnote(general = "Interaction variables ") %>% 
  kable_styling(bootstrap_options = "striped", "hover", full_width = F) %>%
  add_header_above(c(" " = 2, "Moderator", "Location" = 2, "Interaction" = 2, " " = 2, "Moderator", "Location" = 2, "Interaction" = 2), bold = T) %>%
  add_header_above(c(" " = 2, "Comparison Group: \nDesk" = 5, " " = 2, "Comparison Group: \nOutside" = 5)) %>%
  column_spec(2, bold = T) %>%
  column_spec(9, bold = T) %>%
  column_spec(7, border_right = T) %>%
  kable_classic()
```

\
**Age Moderator**: correlations ranged from `r round(min(mod_SUQ_age_corr_t[1:3, c(2:4, 9:11)]), 2)` to `r round(max(mod_SUQ_age_corr_t[1:3, c(2:4, 9:11)][mod_SUQ_age_corr_t[1:3, c(2:4, 9:11)] !=max(mod_SUQ_age_corr_t[1:3, c(2:4, 9:11)])]), 2)`.

```{r create & show mod_SAD_age_corr_t, message=FALSE, warning=FALSE}
# grab table from output
mod_SUQ_age_corr_t <- 
  as.data.frame(cbind(c("Age", "Desk vs. Pocket/Bag", "Desk vs. Outside", "Age x DvPB", "Age x DvO"), 
                      mod_SUQ_age1$datascreening$correl[2:6, 2:6],
                      c(6:10),
                      c("Age", "Outside vs. Pocket/Bag", "Outside vs. Desk", "Age x OvPB", "Age x OvD"),
                      mod_SUQ_age2$datascreening$correl[2:6, 2:6])
                )
# rename rows & columns
colnames(mod_SUQ_age_corr_t) <- c("var1", "Age", "Desk vs. Pocket/Bag", "Desk vs. Outside", "Age x DvPB", "Age x DvO",
                                   "row", "var2",
                                   "Age", "Outside vs. Pocket/Bag", "Outside vs. Desk", "Age x OvPB", "Age x OvD")
rownames(mod_SUQ_age_corr_t) <- 1:nrow(mod_SUQ_age_corr_t)

# reset values as numeric
mod_SUQ_age_corr_t[,2] <- as.numeric(mod_SUQ_age_corr_t[,2])
mod_SUQ_age_corr_t[,3] <- as.numeric(mod_SUQ_age_corr_t[,3])
mod_SUQ_age_corr_t[,4] <- as.numeric(mod_SUQ_age_corr_t[,4])
mod_SUQ_age_corr_t[,5] <- as.numeric(mod_SUQ_age_corr_t[,5])
mod_SUQ_age_corr_t[,6] <- as.numeric(mod_SUQ_age_corr_t[,6])
mod_SUQ_age_corr_t[,9] <- as.numeric(mod_SUQ_age_corr_t[,9])
mod_SUQ_age_corr_t[,10] <- as.numeric(mod_SUQ_age_corr_t[,10])
mod_SUQ_age_corr_t[,11] <- as.numeric(mod_SUQ_age_corr_t[,11])
mod_SUQ_age_corr_t[,12] <- as.numeric(mod_SUQ_age_corr_t[,12])
mod_SUQ_age_corr_t[,13] <- as.numeric(mod_SUQ_age_corr_t[,13])

# Print table using kable
kable(mod_SUQ_age_corr_t, caption = "Correlations for the age moderator.", 
      digits = 2, align = 'rccccccrrcccc', row.names = T, col.names = c(" ", 1:5, "", "", 6:10)) %>%
  # footnote(general = "Interaction variables ") %>% 
  kable_styling(bootstrap_options = "striped", "hover", full_width = F) %>%
  add_header_above(c(" " = 2, "Moderator", "Location" = 2, "Interaction" = 2, " " = 2, "Moderator", "Location" = 2, "Interaction" = 2), bold = T) %>%
  add_header_above(c(" " = 2, "Comparison Group: \nDesk" = 5, " " = 2, "Comparison Group: \nOutside" = 5)) %>%
  column_spec(2, bold = T) %>%
  column_spec(9, bold = T) %>%
  column_spec(7, border_right = T) %>%
  kable_classic()

```

###### (2) Linearity

Linearity was evaluated for each moderator using Q-Q plots. For all moderators, the data fell along or close to the line. Therefore, linearity was met for all moderators.

```{r show mod_cont_qqs, message=FALSE, warning=FALSE}
# note: this is the same across the 2 IV orders (i.e., desk or outside first)
 # each moderator's q-q plot can also be found independently using: "mod_SAD_dep1$datascreening$linearity"
grid.arrange(ggqqplot(mod_SAD_dep1$model1$residuals, title = "Q-Q Plot for \nDependence", 
                      shape = 1, ggtheme = theme_classic()),
             ggqqplot(mod_SAD_ea1$model1$residuals, title = "Q-Q Plot for \nEmotional Attachment", 
                      shape = 1, ggtheme = theme_classic()),
             ggqqplot(mod_SAD_access1$model1$residuals, title = "Q-Q Plot for \nAccessibility", 
                      shape = 1, ggtheme = theme_classic()),
             ggqqplot(mod_SAD_dist1$model1$residuals, title = "Q-Q Plot for \nDistractibility", 
                      shape = 1, ggtheme = theme_classic()),
             ggqqplot(mod_SUQ_age1$model1$residuals, title = "Q-Q Plot for Age", 
                      shape = 1, ggtheme = theme_classic()),
             # labels = c("A", "B", "C", "D"),
             nrow = 2,
             top = text_grob("Q-Q Plots For Continuous Moderators", face = "bold")
             )
```

###### (3) Normality

Normality was evaluated for each moderator by plotting a histogram of residuals. For all moderators, the data fell along or close to a normal curve. Therefore, normality was met for all moderators.

```{r show mod_cont_norms, message=FALSE, warning=FALSE}
# note: this is the same across the 2 IV orders (i.e., desk or outside first)
 # each moderator's q-q plot can also be found independently using: "mod_SAD_dep1$datascreening$normality"
grid.arrange(qplot(mod_SAD_dep1$model1$residuals, main = "Dependency", xlab = "Residuals", ylab = "Frequency", binwidth = 5),
             qplot(mod_SAD_ea1$model1$residuals, main = "Emotional Attachment", xlab = "Residuals", ylab = "Frequency", binwidth = 5), 
             qplot(mod_SAD_access1$model1$residuals, main = "Accessibility", xlab = "Residuals", ylab = "Frequency", binwidth = 5), 
             qplot(mod_SAD_dist1$model1$residuals, main = "Distractibility", xlab = "Residuals", ylab = "Frequency", binwidth = 5), 
             qplot(mod_SUQ_age1$model1$residuals, main = "Age", xlab = "Residuals", ylab = "Frequency", binwidth = 5),
             # labels = c("A", "B", "C", "D"),
             nrow = 2,
             top = text_grob("Histogram of Residuals For Continuous Moderators", face = "bold")
             )
```

###### (4) Homogeneity and Homoscedasticity

Homogeneity and homoscedasticity were evaluated for each moderator by plotting a scatter plot of the fitted values vs. residuals. For all moderators, the line of best fit showed an overall linear trend. Except for age, all moderators showed a good distribution throughout the plot. This was not surprising since reported age was whole-numbers only and the sample size did not cover a large age distribution. Therefore, homogeneity and homoscedasticity were met for all moderators.

```{r message=FALSE, warning=FALSE}
# note: this is the same across the 2 IV orders (i.e., desk or outside first)
 # each moderator's q-q plot can also be found independently using: "mod_SAD_dep1$datascreening$linearity"
grid.arrange(qplot(x = mod_SAD_dep1$model1$fitted.values, y = mod_SAD_dep1$model1$residuals, 
                   main = "Dependency", xlab = "Fitted", ylab = "Residuals", geom = "point") + geom_smooth(se = F) + theme_classic(),
             qplot(x = mod_SAD_ea1$model1$fitted.values, y = mod_SAD_ea1$model1$residuals, 
                   main = "Emotional Attachment", xlab = "Fitted", ylab = "Residuals", geom = "point") + geom_smooth(se = F) + theme_classic(), 
             qplot(x = mod_SAD_access1$model1$fitted.values, y = mod_SAD_access1$model1$residuals, 
                   main = "Accessibility", xlab = "Fitted", ylab = "Residuals", geom = "point") + geom_smooth(se = F) + theme_classic(), 
             qplot(x = mod_SAD_dist1$model1$fitted.values, y = mod_SAD_dist1$model1$residuals, 
                   main = "Distractibility", xlab = "Fitted", ylab = "Residuals", geom = "point") + geom_smooth(se = F) + theme_classic(), 
             qplot(x = mod_SUQ_age1$model1$fitted.values, y = mod_SUQ_age1$model1$residuals, 
                   main = "Age", xlab = "Fitted", ylab = "Residuals", geom = "point") + geom_smooth(se = F) + theme_classic(),
             # labels = c("A", "B", "C", "D"),
             nrow = 2,
             top = text_grob("Residual-Fitted Plot For Continuous Moderators", face = "bold")
             )
```

</details>

##### Overall Model

Now that assumptions were checked and met, each model was reviewed. For each moderator's model showed: b-values, se, t, p-value, over all model fit (i.e., f-value with p-value), and $R^{2}$.

###### Dependence

For dependency the moderator, IV, and interactions were not significant.

```{r create&show mod_SAD_dep_t, message=FALSE, warning=FALSE}
# create data frame with model values -- all values rounded to 5 decimals
  # note: you can see non-formatted model summary using "summary(mod_SAD_dep1$model1)" or "summary(mod_SAD_dep2$model1)"
mod_SAD_dep_t <- 
  # add b estimate, se, t, p
  rbind(data.frame("var" = c("Dependence", "Desk vs. Pocket/Bag", "Desk vs. Outside", "Outside vs. Pocket/Bag", 
                                   "Dep x DvPB", "Dep x DvO", "Dep x OvPB"), 
                   "b" = round(c(mod_SAD_dep1$model1$coefficients[2:4], mod_SAD_dep2$model1$coefficients[3], mod_SAD_dep1$model1$coefficients[5:6],
                                 mod_SAD_dep2$model1$coefficients[5]), 5), 
                   "se" = round(c(summary(mod_SAD_dep1$model1)$coefficients[2:4,2], summary(mod_SAD_dep2$model1)$coefficients[3,2], 
                                  summary(mod_SAD_dep1$model1)$coefficients[5:6,2], summary(mod_SAD_dep2$model1)$coefficients[5,2]), 5), 
                   "t" = round(c(summary(mod_SAD_dep1$model1)$coefficients[2:4,3], summary(mod_SAD_dep2$model1)$coefficients[3,3], 
                                 summary(mod_SAD_dep1$model1)$coefficients[5:6,3], summary(mod_SAD_dep2$model1)$coefficients[5,3]), 5),
                   "p" = round(c(summary(mod_SAD_dep1$model1)$coefficients[2:4,4], summary(mod_SAD_dep2$model1)$coefficients[3,4], 
                                 summary(mod_SAD_dep1$model1)$coefficients[5:6,4], summary(mod_SAD_dep2$model1)$coefficients[5,4]), 5)
                         ), 
        # add labels for: F, df1, df2, p-value, and Rsq values
        c(" ", "df1", "df2", "F", "p"), 
        # add F, df1, df2, p-value, and Rsq values
        c("Overall Model Fit", round(c(summary(mod_SAD_dep1$model1)$fstatistic[2], summary(mod_SAD_dep1$model1)$fstatistic[3], summary(mod_SAD_dep1$model1)$fstatistic[1], 
                pf(summary(mod_SAD_dep1$model1)$fstatistic[1], summary(mod_SAD_dep1$model1)$fstatistic[2], summary(mod_SAD_dep1$model1)$fstatistic[3],lower.tail=FALSE)), 5)), 
        # add R squared value
        c("R-squared", round(summary(mod_SAD_dep1$model1)$r.squared, 5), rep(c(""), 3))
        )

# rename rows
rownames(mod_SAD_dep_t) <- c(1:7, " ", 8:9)

# Print table using kable
kable(mod_SAD_dep_t, caption = "Summary of moderation analysis with IV: Smartphone Location and M: Dependence.", align = 'rcccc', row.names = T, col.names = c(" ", "b", "SE", "t", "p")) %>%
  # footnote(general = "Interaction variables ") %>%
  pack_rows("Moderator", 1, 1) %>%
  pack_rows("IV: Smartphone Location", 2, 4) %>%
  pack_rows("Interactions: Moderator x IV", 5, 7) %>%
  pack_rows(" ", 8, 10, indent = F) %>%
  pack_rows("Model", 8, 10) %>% 
  row_spec(0, bold = T, italic = T) %>%
  row_spec(8, bold = T, italic = T) %>%
  column_spec(2, bold = T) %>%
  kable_styling(bootstrap_options = "striped", "hover", full_width = F) %>%
  kable_classic()
```

###### Emotional Attachment

For emotional attachment the moderator was significant, xxxxx. However, the IV, and interactions were not significant.

```{r create&show mod_SAD_ea_t, message=FALSE, warning=FALSE}
# create data frame with model values -- all values rounded to 5 decimals
  # note: you can see non-formatted model summary using "summary(mod_SAD_ea1$model1)" or "summary(mod_SAD_ea2$model1)"
mod_SAD_ea_t <- 
  # add b estimate, se, t, p
  rbind(data.frame("var" = c("Emotional Attachment", "Desk vs. Pocket/Bag", "Desk vs. Outside", "Outside vs. Pocket/Bag", 
                                   "EA x DvPB", "EA x DvO", "EA x OvPB"), 
                   "b" = round(c(mod_SAD_ea1$model1$coefficients[2:4], mod_SAD_ea2$model1$coefficients[3], mod_SAD_ea1$model1$coefficients[5:6],
                                 mod_SAD_ea2$model1$coefficients[5]), 5), 
                   "se" = round(c(summary(mod_SAD_ea1$model1)$coefficients[2:4,2], summary(mod_SAD_ea2$model1)$coefficients[3,2], 
                                  summary(mod_SAD_ea1$model1)$coefficients[5:6,2], summary(mod_SAD_ea2$model1)$coefficients[5,2]), 5), 
                   "t" = round(c(summary(mod_SAD_ea1$model1)$coefficients[2:4,3], summary(mod_SAD_ea2$model1)$coefficients[3,3], 
                                 summary(mod_SAD_ea1$model1)$coefficients[5:6,3], summary(mod_SAD_ea2$model1)$coefficients[5,3]), 5),
                   "p" = round(c(summary(mod_SAD_ea1$model1)$coefficients[2:4,4], summary(mod_SAD_ea2$model1)$coefficients[3,4], 
                                 summary(mod_SAD_ea1$model1)$coefficients[5:6,4], summary(mod_SAD_ea2$model1)$coefficients[5,4]), 5)
                         ), 
        # add labels for: F, df1, df2, p-value, and Rsq values
        c(" ", "df1", "df2", "F", "p"), 
        # add F, df1, df2, p-value, and Rsq values
        c("Overall Model Fit", round(c(summary(mod_SAD_ea1$model1)$fstatistic[2], summary(mod_SAD_ea1$model1)$fstatistic[3], summary(mod_SAD_ea1$model1)$fstatistic[1], 
                pf(summary(mod_SAD_ea1$model1)$fstatistic[1], summary(mod_SAD_ea1$model1)$fstatistic[2], summary(mod_SAD_ea1$model1)$fstatistic[3],lower.tail=FALSE)), 5)), 
        # add R squared value
        c("R-squared", round(summary(mod_SAD_ea1$model1)$r.squared, 5), rep(c(""), 3))
        )

# rename rows
rownames(mod_SAD_ea_t) <- c(1:7, " ", 8:9)

# Print table using kable
kable(mod_SAD_ea_t, caption = "Summary of moderation analysis with IV: Smartphone Location and M: Emotional Attachment.", align = 'rcccc', row.names = T, col.names = c(" ", "b", "SE", "t", "p")) %>%
  # footnote(general = "Interaction variables ") %>%
  pack_rows("Moderator", 1, 1) %>%
  pack_rows("IV: Smartphone Location", 2, 4) %>%
  pack_rows("Interactions: Moderator x IV", 5, 7) %>%
  pack_rows(" ", 8, 10, indent = F) %>%
  pack_rows("Model", 8, 10) %>% 
  row_spec(0, bold = T, italic = T) %>%
  row_spec(8, bold = T, italic = T) %>%
  column_spec(2, bold = T) %>%
  kable_styling(bootstrap_options = "striped", "hover", full_width = F) %>%
  kable_classic()
```


###### Accessibility

For accessibility the moderator, IV, and interactions were not significant.

```{r create&show mod_SAD_access_t, message=FALSE, warning=FALSE}
# create data frame with model values -- all values rounded to 5 decimals
  # note: you can see non-formatted model summary using "summary(mod_SAD_access1$model1)" or "summary(mod_SAD_access2$model1)"
mod_SAD_access_t <- 
  # add b estimate, se, t, p
  rbind(data.frame("var" = c("Accessibility", "Desk vs. Pocket/Bag", "Desk vs. Outside", "Outside vs. Pocket/Bag", 
                                   "Access x DvPB", "Access x DvO", "Access x OvPB"), 
                   "b" = round(c(mod_SAD_access1$model1$coefficients[2:4], mod_SAD_access2$model1$coefficients[3], mod_SAD_access1$model1$coefficients[5:6],
                                 mod_SAD_access2$model1$coefficients[5]), 5), 
                   "se" = round(c(summary(mod_SAD_access1$model1)$coefficients[2:4,2], summary(mod_SAD_access2$model1)$coefficients[3,2], 
                                  summary(mod_SAD_access1$model1)$coefficients[5:6,2], summary(mod_SAD_access2$model1)$coefficients[5,2]), 5), 
                   "t" = round(c(summary(mod_SAD_access1$model1)$coefficients[2:4,3], summary(mod_SAD_access2$model1)$coefficients[3,3], 
                                 summary(mod_SAD_access1$model1)$coefficients[5:6,3], summary(mod_SAD_access2$model1)$coefficients[5,3]), 5),
                   "p" = round(c(summary(mod_SAD_access1$model1)$coefficients[2:4,4], summary(mod_SAD_access2$model1)$coefficients[3,4], 
                                 summary(mod_SAD_access1$model1)$coefficients[5:6,4], summary(mod_SAD_access2$model1)$coefficients[5,4]), 5)
                         ), 
        # add labels for: F, df1, df2, p-value, and Rsq values
        c(" ", "df1", "df2", "F", "p"), 
        # add F, df1, df2, p-value, and Rsq values
        c("Overall Model Fit", round(c(summary(mod_SAD_access1$model1)$fstatistic[2], summary(mod_SAD_access1$model1)$fstatistic[3], summary(mod_SAD_access1$model1)$fstatistic[1], 
                pf(summary(mod_SAD_access1$model1)$fstatistic[1], summary(mod_SAD_access1$model1)$fstatistic[2], summary(mod_SAD_access1$model1)$fstatistic[3],lower.tail=FALSE)), 5)), 
        # add R squared value
        c("R-squared", round(summary(mod_SAD_access1$model1)$r.squared, 5), rep(c(""), 3))
        )

# rename rows
rownames(mod_SAD_access_t) <- c(1:7, " ", 8:9)

# Print table using kable
kable(mod_SAD_access_t, caption = "Summary of moderation analysis with IV: Smartphone Location and M: Accessibility", align = 'rcccc', row.names = T, col.names = c(" ", "b", "SE", "t", "p")) %>%
  # footnote(general = "Interaction variables ") %>%
  pack_rows("Moderator", 1, 1) %>%
  pack_rows("IV: Smartphone Location", 2, 4) %>%
  pack_rows("Interactions: Moderator x IV", 5, 7) %>%
  pack_rows(" ", 8, 10, indent = F) %>%
  pack_rows("Model", 8, 10) %>% 
  row_spec(0, bold = T, italic = T) %>%
  row_spec(8, bold = T, italic = T) %>%
  column_spec(2, bold = T) %>%
  kable_styling(bootstrap_options = "striped", "hover", full_width = F) %>%
  kable_classic()
```


###### Distractibility

For distractibility the moderator, IV, and interactions were not significant.

```{r create&show mod_SAD_dist_t, message=FALSE, warning=FALSE}
# create data frame with model values -- all values rounded to 5 decimals
  # note: you can see non-formatted model summary using "summary(mod_SAD_dist1$model1)" or "summary(mod_SAD_dist2$model1)"
mod_SAD_dist_t <- 
  # add b estimate, se, t, p
  rbind(data.frame("var" = c("Distractibility", "Desk vs. Pocket/Bag", "Desk vs. Outside", "Outside vs. Pocket/Bag", 
                                   "Dist x DvPB", "Dist x DvO", "Dist x OvPB"), 
                   "b" = round(c(mod_SAD_dist1$model1$coefficients[2:4], mod_SAD_dist2$model1$coefficients[3], mod_SAD_dist1$model1$coefficients[5:6],
                                 mod_SAD_dist2$model1$coefficients[5]), 5), 
                   "se" = round(c(summary(mod_SAD_dist1$model1)$coefficients[2:4,2], summary(mod_SAD_dist2$model1)$coefficients[3,2], 
                                  summary(mod_SAD_dist1$model1)$coefficients[5:6,2], summary(mod_SAD_dist2$model1)$coefficients[5,2]), 5), 
                   "t" = round(c(summary(mod_SAD_dist1$model1)$coefficients[2:4,3], summary(mod_SAD_dist2$model1)$coefficients[3,3], 
                                 summary(mod_SAD_dist1$model1)$coefficients[5:6,3], summary(mod_SAD_dist2$model1)$coefficients[5,3]), 5),
                   "p" = round(c(summary(mod_SAD_dist1$model1)$coefficients[2:4,4], summary(mod_SAD_dist2$model1)$coefficients[3,4], 
                                 summary(mod_SAD_dist1$model1)$coefficients[5:6,4], summary(mod_SAD_dist2$model1)$coefficients[5,4]), 5)
                         ), 
        # add labels for: F, df1, df2, p-value, and Rsq values
        c(" ", "df1", "df2", "F", "p"), 
        # add F, df1, df2, p-value, and Rsq values
        c("Overall Model Fit", round(c(summary(mod_SAD_dist1$model1)$fstatistic[2], summary(mod_SAD_dist1$model1)$fstatistic[3], summary(mod_SAD_dist1$model1)$fstatistic[1], 
                pf(summary(mod_SAD_dist1$model1)$fstatistic[1], summary(mod_SAD_dist1$model1)$fstatistic[2], summary(mod_SAD_dist1$model1)$fstatistic[3],lower.tail=FALSE)), 5)), 
        # add R squared value
        c("R-squared", round(summary(mod_SAD_dist1$model1)$r.squared, 5), rep(c(""), 3))
        )

# rename rows
rownames(mod_SAD_dist_t) <- c(1:7, " ", 8:9)

# Print table using kable
kable(mod_SAD_dist_t, caption = "Summary of moderation analysis with IV: Smartphone Location and M: Distractibility.", align = 'rcccc', row.names = T, col.names = c(" ", "b", "SE", "t", "p")) %>%
  # footnote(general = "Interaction variables ") %>%
  pack_rows("Moderator", 1, 1) %>%
  pack_rows("IV: Smartphone Location", 2, 4) %>%
  pack_rows("Interactions: Moderator x IV", 5, 7) %>%
  pack_rows(" ", 8, 10, indent = F) %>%
  pack_rows("Model", 8, 10) %>% 
  row_spec(0, bold = T, italic = T) %>%
  row_spec(8, bold = T, italic = T) %>%
  column_spec(2, bold = T) %>%
  kable_styling(bootstrap_options = "striped", "hover", full_width = F) %>%
  kable_classic()
```

###### Age

For age the moderator, IV, and interactions were not significant.

```{r create&show mod_SUQ_age_t, message=FALSE, warning=FALSE}
# create data frame with model values -- all values rounded to 5 decimals
  # note: you can see non-formatted model summary using "summary(mod_SUQ_age1$model1)" or "summary(mod_SUQ_age2$model1)"
mod_SUQ_age_t <- 
  # add b estimate, se, t, p
  rbind(data.frame("var" = c("Age", "Desk vs. Pocket/Bag", "Desk vs. Outside", "Outside vs. Pocket/Bag", 
                                   "Age x DvPB", "Age x DvO", "Age x OvPB"), 
                   "b" = round(c(mod_SUQ_age1$model1$coefficients[2:4], mod_SUQ_age2$model1$coefficients[3], mod_SUQ_age1$model1$coefficients[5:6],
                                 mod_SUQ_age2$model1$coefficients[5]), 5), 
                   "se" = round(c(summary(mod_SUQ_age1$model1)$coefficients[2:4,2], summary(mod_SUQ_age2$model1)$coefficients[3,2], 
                                  summary(mod_SUQ_age1$model1)$coefficients[5:6,2], summary(mod_SUQ_age2$model1)$coefficients[5,2]), 5), 
                   "t" = round(c(summary(mod_SUQ_age1$model1)$coefficients[2:4,3], summary(mod_SUQ_age2$model1)$coefficients[3,3], 
                                 summary(mod_SUQ_age1$model1)$coefficients[5:6,3], summary(mod_SUQ_age2$model1)$coefficients[5,3]), 5),
                   "p" = round(c(summary(mod_SUQ_age1$model1)$coefficients[2:4,4], summary(mod_SUQ_age2$model1)$coefficients[3,4], 
                                 summary(mod_SUQ_age1$model1)$coefficients[5:6,4], summary(mod_SUQ_age2$model1)$coefficients[5,4]), 5)
                         ), 
        # add labels for: F, df1, df2, p-value, and Rsq values
        c(" ", "df1", "df2", "F", "p"), 
        # add F, df1, df2, p-value, and Rsq values
        c("Overall Model Fit", round(c(summary(mod_SUQ_age1$model1)$fstatistic[2], summary(mod_SUQ_age1$model1)$fstatistic[3], summary(mod_SUQ_age1$model1)$fstatistic[1], 
                pf(summary(mod_SUQ_age1$model1)$fstatistic[1], summary(mod_SUQ_age1$model1)$fstatistic[2], summary(mod_SUQ_age1$model1)$fstatistic[3],lower.tail=FALSE)), 5)), 
        # add R squared value
        c("R-squared", round(summary(mod_SUQ_age1$model1)$r.squared, 5), rep(c(""), 3))
        )

# rename rows
rownames(mod_SUQ_age_t) <- c(1:7, " ", 8:9)

# Print table using kable
kable(mod_SUQ_age_t, caption = "Summary of moderation analysis with IV: Smartphone Location and M: Age", align = 'rcccc', row.names = T, col.names = c(" ", "b", "SE", "t", "p")) %>%
  # footnote(general = "Interaction variables ") %>%
  pack_rows("Moderator", 1, 1) %>%
  pack_rows("IV: Smartphone Location", 2, 4) %>%
  pack_rows("Interactions: Moderator x IV", 5, 7) %>%
  pack_rows(" ", 8, 10, indent = F) %>%
  pack_rows("Model", 8, 10) %>% 
  row_spec(0, bold = T, italic = T) %>%
  row_spec(8, bold = T, italic = T) %>%
  column_spec(2, bold = T) %>%
  kable_styling(bootstrap_options = "striped", "hover", full_width = F) %>%
  kable_classic()
```

##### Simple Slopes

###### Dependence

<details>
<summary> Click to see output for simple slopes, including generated interpretations. </summary>
```{r}
# summary of SS
lapply(mod_SAD_dep1$slopemodels, summary)
lapply(mod_SAD_dep2$slopemodels, summary)

# generated interpretations
cat(mod_SAD_dep1$interpretation)
cat(mod_SAD_dep2$interpretation)
```
</details> 

###### Emotional Attachment

<details>
<summary> Click to see output for simple slopes, including generated interpretations. </summary>
```{r}
# summary of SS
lapply(mod_SAD_ea1$slopemodels, summary)
lapply(mod_SAD_ea2$slopemodels, summary)

# generated interpretations
cat(mod_SAD_ea1$interpretation)
cat(mod_SAD_ea2$interpretation)
```
</details> 

###### Accessibility 

<details>
<summary> Click to see output for simple slopes, including generated interpretations. </summary>
```{r}
# summary of SS
lapply(mod_SAD_access1$slopemodels, summary)
lapply(mod_SAD_access2$slopemodels, summary)

# generated interpretations
cat(mod_SAD_access1$interpretation)
cat(mod_SAD_access2$interpretation)
```
</details>


###### Distractibility

<details>
<summary> Click to see output for simple slopes, including generated interpretations. </summary>
```{r}
# summary of SS
lapply(mod_SAD_dist1$slopemodels, summary)
lapply(mod_SAD_dist2$slopemodels, summary)

# generated interpretations
cat(mod_SAD_dist1$interpretation)
cat(mod_SAD_dist2$interpretation)
```
</details>

###### Age

<details>
<summary> Click to see output for simple slopes, including generated interpretations. </summary>
```{r}
# summary of SS
lapply(mod_SUQ_age1$slopemodels, summary)
lapply(mod_SUQ_age2$slopemodels, summary)

# generated interpretations
cat(mod_SUQ_age1$interpretation)
cat(mod_SUQ_age2$interpretation)
```
</details> 

##### Visualizations

The visualizations below are presented either in panel form (i.e., with all plots presented together) or separate.

###### Separate
**Dependence** was **not** a significant moderator for OSpan absolute score. No notable trends in the data were observed.

```{r create & show mod_SAD_dep_bar, message=FALSE, warning=FALSE, error=FALSE}
mod_SAD_dep_bar_data <- 
  mod_SAD_dep1$datascreening$fulldata %>% 
  # remove rows w/ outlier ps
  filter(totalout != 2) %>% 
  # select the data we want: IV, DV, M
  select(ospan_absolute_score, SAD_dep_score, location_desk) %>% 
  # replace moderator score with mean-centered
  mutate(SAD_dep_score = c(scale(SAD_dep_score, center = TRUE, scale = FALSE)))

mod_SAD_dep_bar <- ggplot(mod_SAD_dep_bar_data, aes(x = SAD_dep_score, y = ospan_absolute_score, color = location_desk)) +
  geom_point()+
  geom_smooth(method = lm, aes(fill = location_desk)) + 
  coord_cartesian(xlim = c(min(mod_SAD_dep_bar_data$SAD_dep_score), max(mod_SAD_dep_bar_data$SAD_dep_score)), 
                  ylim = c(min(mod_SAD_dep_bar_data$ospan_absolute_score), max(mod_SAD_dep_bar_data$ospan_absolute_score))) +
  labs(title = "Moderating OSpan Absolute Score with Dependence", 
       x = "Dependence", 
       y = "OSpan Absolute Score", 
       color = "Smartphone Location",
       fill = "Smartphone Location",
       caption = "Note: Smartphone dependency is mean-centered."
       ) +
  scale_fill_manual(values = c("lightseagreen", "purple3", "deepskyblue4")) +
  scale_colour_manual(values = c("lightseagreen", "purple4", "deepskyblue4")) +
  theme_classic()

#show fig
mod_SAD_dep_bar

# export plot
# ggsave(filename="fig_mod_SAD_dep_bar_june14", plot = mod_SAD_dep_bar, device = "png", height = 5, width = 8, units = "in", dpi = 500)
```

**Emotional Attachment** was **not** a significant moderator for OSpan absolute score. A notable trend in the data was observed, where participants with less emotional attachment showed better OSpan performance in the on desk condition. The outside condition showed the same but smaller trned. The pocket/bag condition did not show this trend as the slope were closer to zero.

```{r create & show mod_SAD_ea_bar, message=FALSE, warning=FALSE, error=FALSE}
mod_SAD_ea_bar_data <- 
  mod_SAD_ea1$datascreening$fulldata %>% 
  # remove rows w/ outlier ps
  filter(totalout != 2) %>% 
  # select the data we want: IV, DV, M
  select(ospan_absolute_score, SAD_ea_score, location_desk) %>% 
  # replace moderator score with mean-centered
  mutate(SAD_ea_score = c(scale(SAD_ea_score, center = TRUE, scale = FALSE)))

mod_SAD_ea_bar <- ggplot(mod_SAD_ea_bar_data, aes(x = SAD_ea_score, y = ospan_absolute_score, color = location_desk)) +
  geom_point()+
  geom_smooth(method = lm, aes(fill = location_desk)) + 
  coord_cartesian(xlim = c(min(mod_SAD_ea_bar_data$SAD_ea_score), max(mod_SAD_ea_bar_data$SAD_ea_score)), 
                  ylim = c(min(mod_SAD_ea_bar_data$ospan_absolute_score), max(mod_SAD_ea_bar_data$ospan_absolute_score))) +
  labs(title = "Moderating OSpan Absolute Score with Emotional Attachment", 
       x = "Emotional Attachment", 
       y = "OSpan Absolute Score", 
       color = "Smartphone Location",
       fill = "Smartphone Location",
       caption = "Note: Smartphone Emotional Attachment is mean-centered."
       ) +
  scale_fill_manual(values = c("lightseagreen", "purple3", "deepskyblue4")) +
  scale_colour_manual(values = c("lightseagreen", "purple4", "deepskyblue4")) +
  theme_classic()

#show fig
mod_SAD_ea_bar

# export plot
# ggsave(filename="fig_mod_SAD_ea_bar_june14", plot = mod_SAD_ea_bar, device = "png", height = 5, width = 8, units = "in", dpi = 500)
```


**Accessibility** was **not** a significant moderator for OSpan absolute score. No notable trends in the data were observed.

```{r create & show mod_SAD_access_bar, message=FALSE, warning=FALSE, error=FALSE}
mod_SAD_access_bar_data <- 
  mod_SAD_access1$datascreening$fulldata %>% 
  # remove rows w/ outlier ps
  filter(totalout != 2) %>% 
  # select the data we want: IV, DV, M
  select(ospan_absolute_score, SAD_access_score, location_desk) %>% 
  # replace moderator score with mean-centered
  mutate(SAD_access_score = c(scale(SAD_access_score, center = TRUE, scale = FALSE)))

mod_SAD_access_bar <- ggplot(mod_SAD_access_bar_data, aes(x = SAD_access_score, y = ospan_absolute_score, color = location_desk)) +
  geom_point()+
  geom_smooth(method = lm, aes(fill = location_desk)) + 
  coord_cartesian(xlim = c(min(mod_SAD_access_bar_data$SAD_access_score), max(mod_SAD_access_bar_data$SAD_access_score)), 
                  ylim = c(min(mod_SAD_access_bar_data$ospan_absolute_score), max(mod_SAD_access_bar_data$ospan_absolute_score))) +
  labs(title = "Moderating OSpan Absolute Score with Accessibility", 
       x = "Accessibility", 
       y = "OSpan Absolute Score", 
       color = "Smartphone Location",
       fill = "Smartphone Location",
       caption = "Note: Accessibility is mean-centered."
       ) +
  scale_fill_manual(values = c("lightseagreen", "purple3", "deepskyblue4")) +
  scale_colour_manual(values = c("lightseagreen", "purple4", "deepskyblue4")) +
  theme_classic()

#show fig
mod_SAD_access_bar

# export plot
# ggsave(filename="fig_mod_SAD_access_bar_june14", plot = mod_SAD_access_bar, device = "png", height = 5, width = 8, units = "in", dpi = 500)
```

**Distractibility** was **not** a significant moderator for OSpan absolute score. No notable trends in the data were observed.

```{r create & show mod_SAD_dist_bar, message=FALSE, warning=FALSE, error=FALSE}
mod_SAD_dist_bar_data <- 
  mod_SAD_dist1$datascreening$fulldata %>% 
  # remove rows w/ outlier ps
  filter(totalout != 2) %>% 
  # select the data we want: IV, DV, M
  select(ospan_absolute_score, SAD_dist_score, location_desk) %>% 
  # replace moderator score with mean-centered
  mutate(SAD_dist_score = c(scale(SAD_dist_score, center = TRUE, scale = FALSE)))

mod_SAD_dist_bar <- ggplot(mod_SAD_dist_bar_data, aes(x = SAD_dist_score, y = ospan_absolute_score, color = location_desk)) +
  geom_point()+
  geom_smooth(method = lm, aes(fill = location_desk)) + 
  coord_cartesian(xlim = c(min(mod_SAD_dist_bar_data$SAD_dist_score), max(mod_SAD_dist_bar_data$SAD_dist_score)), 
                  ylim = c(min(mod_SAD_dist_bar_data$ospan_absolute_score), max(mod_SAD_dist_bar_data$ospan_absolute_score))) +
  labs(title = "Moderating OSpan Absolute Score with Distractibility", 
       x = "Distractibility", 
       y = "OSpan Absolute Score", 
       color = "Smartphone Location",
       fill = "Smartphone Location",
       caption = "Note: Smartphone Distractibility is mean-centered."
       ) +
  scale_fill_manual(values = c("lightseagreen", "purple3", "deepskyblue4")) +
  scale_colour_manual(values = c("lightseagreen", "purple4", "deepskyblue4")) +
  theme_classic()

#show fig
mod_SAD_dist_bar

# export plot
# ggsave(filename="fig_mod_SAD_dist_bar_june14", plot = mod_SAD_dist_bar, device = "png", height = 5, width = 8, units = "in", dpi = 500)
```

**Age** was **not** a significant moderator for OSpan absolute score. A notable trend in the data was observed: for all location conditions, younger participants tended to show better task performance.

```{r create & show mod_SUQ_age_bar, message=FALSE, warning=FALSE, error=FALSE}
mod_SUQ_age_bar_data <- 
  mod_SUQ_age1$datascreening$fulldata %>% 
  # remove rows w/ outlier ps
  filter(totalout != 2) %>% 
  # select the data we want: IV, DV, M
  select(ospan_absolute_score, age, location_desk) %>% 
  # replace moderator score with mean-centered
  mutate(age = c(scale(age, center = TRUE, scale = FALSE)))

mod_SUQ_age_bar <- ggplot(mod_SUQ_age_bar_data, aes(x = age, y = ospan_absolute_score, color = location_desk)) +
  geom_point()+
  geom_smooth(method = lm, aes(fill = location_desk)) + 
  coord_cartesian(xlim = c(min(mod_SUQ_age_bar_data$age), max(mod_SUQ_age_bar_data$age)), 
                  ylim = c(min(mod_SUQ_age_bar_data$ospan_absolute_score), max(mod_SUQ_age_bar_data$ospan_absolute_score))) +
  labs(title = "Moderating OSpan Absolute Score with Age", 
       x = "Age (years)", 
       y = "OSpan Absolute Score", 
       color = "Smartphone Location",
       fill = "Smartphone Location",
       caption = "Note: Age is mean-centered."
       ) +
  scale_fill_manual(values = c("lightseagreen", "purple3", "deepskyblue4")) +
  scale_colour_manual(values = c("lightseagreen", "purple4", "deepskyblue4")) +
  theme_classic()

#show fig
mod_SUQ_age_bar

# export plot
# ggsave(filename="fig_mod_SUQ_age_bar", plot = mod_SUQ_age_bar, device = "png", height = 5, width = 8, units = "in", dpi = 500)
```

```{r eval=FALSE, message=FALSE, warning=FALSE, include=FALSE}
# THESE ARE THE FIGS THE MODERATION1 FUNCTION GIVES... 

# Dependency
mod_SAD_dep1$graphslopes

# Emotional Attachment
mod_SAD_ea1$graphslopes


# Accessibility
mod_SAD_access1$graphslopes

# Distractibility
mod_SAD_dist1$graphslopes

# Age
mod_SUQ_age1$graphslopes


# add geom_smooth(formula = y ~ x, method = "lm") to avoid errors
```

```{r eval=FALSE, message=FALSE, warning=FALSE, include=FALSE}
##overall model statistics
#F(5, 377) = 0.15, p = .981, R2 = .002

##each predictor one at a time
  #Dependency score b = 0.008, t(377) = 0.039, p = .969
    #Dependency score does not predict income rates
  
  #P/B vs. Desk b = 0.872, t(377) = 0.413, p = .680
    #not a significant difference between desk and pocket/bag location for OSpan ab score
  #outside vs. Desk b = 0.141, t(377) = 0.065, p = .948
    #not a significant difference between desk and outside location for OSpan ab score

# get means for OSpan ab score by categorical variable
  tapply(mod_analysis$ospan_absolute_score, mod_analysis$location, mean)
 # desk      pocketbag  outside 
 # 41.14062  42.01515  41.43089 

##Interaction
#Location by p/b versus desk -- not significant
  # b = -0.008, t(377) = -0.026, p = .979
#Location by outside versus desk -- not significant
  # b = 0.158, t(377) = 0.533, p = .594

#break down by M
#Dependency score predicting OSpan ab score for each group/level of location

#Simple slopes by levels of M
saved$slopemodels

##view with lapply
lapply(saved$slopemodels, summary)

#Low murder levels, illiteracy b = 622.70 not significant
#Average b = 125.00 not significant
#High b = -902.60 is significant

#at low and average murder rates, illiteracy does not predict income
#at high murder rates, illiteracy negatively predicts income

##or pull one at a time
summary(saved$slopemodels$Low)
coef(saved$slopemodels$Low)

summary(saved$slopemodels$Average)

#Interpretation of Slopes
cat(saved$interpretation)

#Graph of the Slopes
saved$graphslopes

#remember, you can run the function with out ()
#to view the graph code if you want to tweak it more
moderation1

####Power####
library(pwr)
##power runs on cohen's f - not to be confused with anova f.
##take the R squared to convert
R2 =  .20
feta = R2 / (1-R2)

#u is df model, which is number of predictors
#v is df error, but we are trying to figure out
#sample size so we leave this one blank.
#f2 is cohen f squared

##all the predictors
pwr.f2.test(u = 5, v = NULL, f2 = feta, sig.level = .05, power = .80)
#remember that you need to add u + v to get final sample size

##addition of the interaction only
pwr.f2.test(u = 2, v = NULL, f2 = feta, sig.level = .05, power = .80)
#remember that you need to add u + v to get final sample size

```

```{r manual_lm, eval=FALSE, message=FALSE, warning=FALSE, include=FALSE}
# Analysis can be run manually using lm()
# this example goes through how you would run this on 3 moderators in total
# does not include graph or assumption checks

# set up the data -- add mean-centered scores
  # note: make sure categorical var (IV = location) is a factor or will not run correctly
y <-
  # use original data file
  mod_analysis %>%
  # add mean-centered vars for each mod -- note: each mod will be used in separate analysis
  bind_cols(SAD_depC = c(scale(mod_analysis$SAD_dep_score,  center=TRUE, scale=FALSE)), # for dependency
            SAD_eaC = c(scale(mod_analysis$SAD_ea_score,  center=TRUE, scale=FALSE)), # for emotional attachment
            SAD_distC = c(scale(mod_analysis$SAD_dist_score,  center=TRUE, scale=FALSE)) # for distractibility
            ) %>%
  rename(ospan = ospan_absolute_score) # rename DV shorter name

# FOR SMARTPHONE DEPENDENCY
# to run the overall model
  summary(lm(data = y, formula = ospan ~  SAD_depC + location + location*SAD_depC))

# to run the simple slopes (i.e., post-hocs / pair-wise comparisons)
  summary(lm(formula = ospan ~ SAD_depC, data = dplyr::filter(y, location == "desk")))
  summary(lm(formula = ospan ~ SAD_depC, data = dplyr::filter(y, location == "pocket/bag")))
  summary(lm(formula = ospan ~ SAD_depC, data = dplyr::filter(y, location == "outside")))

# FOR EMOTIONAL ATTACHMENT
# to run the overall model
  summary(lm(data = y, formula = ospan ~  SAD_eaC + location + location*SAD_eaC))

# to run the simple slopes (i.e., post-hocs / pair-wise comparisons)
  summary(lm(formula = ospan ~ SAD_eaC, data = dplyr::filter(y, location == "desk")))
  summary(lm(formula = ospan ~ SAD_eaC, data = dplyr::filter(y, location == "pocket/bag")))
  summary(lm(formula = ospan ~ SAD_eaC, data = dplyr::filter(y, location == "outside")))

# FOR SMARTPHONE DISTRACTIBILITY
# to run the overall model
  summary(lm(data = y, formula = ospan ~  SAD_distC + location + location*SAD_distC))

# to run the simple slopes (i.e., post-hocs / pair-wise comparisons)
  summary(lm(formula = ospan ~ SAD_distC, data = dplyr::filter(y, location == "desk")))
  summary(lm(formula = ospan ~ SAD_distC, data = dplyr::filter(y, location == "pocket/bag")))
  summary(lm(formula = ospan ~ SAD_distC, data = dplyr::filter(y, location == "outside")))
```

```{r run_mod_ward, eval=FALSE, include=FALSE}
# exploratory -- view Ward et al.'s (2017) "Smartphone Dependency" & "Emotional Attachment" subscales
mod_ward <- 
  all_data %>% 
  rowwise() %>% 
  mutate(SAD_dep = sum(SAD_1, SAD_2, SAD_3, SAD_4, SAD_5, SAD_6), 
         SAD_ea = sum(SAD_8, SAD_9, SAD_10, SAD_11, SAD_12))
# make location factor
mod_ward$location <- factor(mod_ward$location, levels = c(1:3), labels = c("desk", "pocketbag", "outside"))

# run moderation for Smartphone Dependency 
mod_ward_dep1 <- moderation1(y = "ospan_absolute_score", #DV
                             x = "SAD_dep", #IV
                             m = "location", #Moderator for simple slopes
                             cvs = NULL, # covariates
                             df = mod_ward, # data frame of columns
                             with_out = T)

# run moderation for Emotional Attachment
mod_ward_ea1 <- moderation1(y = "ospan_absolute_score", #DV
                    x = "SAD_ea", #IV
                    m = "location", #Moderator for simple slopes
                    cvs = NULL, #covariates
                    df = mod_ward, # data frame of columns
                    with_out = T)

```


###### Panel

```{r}
# create join data set with all moderator data
mod_all_data_plot <- 
  cbind(data.frame("moderator" = c(rep(1, nrow(mod_SAD_dep_bar_data)), rep(2, nrow(mod_SAD_ea_bar_data)), rep(3, nrow(mod_SAD_access_bar_data)), rep(4, nrow(mod_SAD_dist_bar_data)))), 
        rbind(
        # SAD_dep data
        mod_SAD_dep_bar_data %>% dplyr::rename(mod_score = SAD_dep_score), # rename column to join correctly
        mod_SAD_ea_bar_data %>% dplyr::rename(mod_score = SAD_ea_score), # rename column to join correctly
        mod_SAD_access_bar_data %>% dplyr::rename(mod_score = SAD_access_score), # rename column to join correctly
        mod_SAD_dist_bar_data %>% dplyr::rename(mod_score = SAD_dist_score) # rename column to join correctly
        )
        ) %>% 
  mutate(moderator = factor(moderator, levels = c(1:4), labels = c("(A) Dependency", "(B) Emotional Attachment", "(C) Accessibility", "(D) Distractibility"))) %>% 
  mutate(location_desk = factor(location_desk, labels = c("Desk", "Pocket/Bag", "Outside")))

glimpse(mod_all_data_plot)
```

```{r}
# create panel fig
mod_all_plot <-
  ggplot(mod_all_data_plot, aes(x = mod_score, y = ospan_absolute_score, group = location_desk)) +
  geom_point(stat = "identity", aes(shape = location_desk, color = location_desk), size = 1.5)+
  geom_smooth(method = lm, aes(fill = location_desk, color = location_desk), size = 1) + 
  labs(
    # title = "Moderating OSpan Absolute Score",
       x = "Mean-Centered Score", 
       y = "OSpan Absolute Score", 
       color = "Smartphone Location",
       fill = "Smartphone Location",
       shape = "Smartphone Location") +
  facet_wrap(~moderator, ncol = 2) +
  scale_fill_manual(values = c("#18BAB3", "#62129B", "#007298")) +
  scale_colour_manual(values = c("#18BAB3", "#62129B", "#007298")) +
  scale_y_continuous(limits = c(0, 75), breaks = c(seq(0,75, 25))) + 
  theme_classic() +
  theme(legend.position = "bottom")
  

# show plot
mod_all_plot

# export plots
# ggsave(filename="fig_mod_all_plot_june16", plot = mod_all_plot, device = "jpeg", height = 5, width = 8, units = "in", dpi = 1000)

```