Overview

Previously on…

The goals of this project are to establish:

if children and adults form inaccurate beliefs about a social group from seeing a structurally skewed sample that they take on face value
if children and adults correct their beliefs given information about structural skew.

Previously in pilot 1, adults watched a sample of novel group members come off of a boat that probabilistically restricts those taller than its boat height from boarding. The observed sample was fixed across participants, but participants were randomly assigned to one of 5 different boat heights, ranging from shorter to taller (5 units through 9 units tall). Participants did not see anything about where the boat originated from, or any population information.

Adults were asked for their population inferences (how tall most novel group members are), as well as a check of their representation of the sample (how tall most of those who visited are). We hypothesized:

If adults do adjust their inferences for structural skew, then we would see different population inferences across boat height conditions, such that adults would adjust their inferences to be taller when the sample is suspiciously just under the boat height.
If adults do not adjust their inferences, we would not see different population inferences across boat height condition.

We found the latter: adults simply report the sample mean (or midpoint of the scale) for both sample means and population means.

However, it is unclear if adults were genuinely failing to adjusting their responses in response to structural skew (null hypothesis), or if this paradigm simply does not elicit any population inferences at all. We were also concerned adults could have been rationalizing the boat height as informative about the population height.

Goals

The primary goal of this pilot was to validate the paradigm and measures to confirm that adults do successfully form accurate population inferences when given population information in this paradigm. Specifically, we give participants full transparent access to the boarding sequence, allowing participants to see the height of those who attempted to board (the population), as well as those who successfully board and deboard (the sample). We also specified that the boat height was accidental, to eliminate rationalizing.

If this paradigm and measures successfully elicit people’s representation of the population, we should see that participants accurately report the population mean in their responses in this pilot, where they literally see the population.
If this paradigm and measures does not successfully elicit people’s representation of the population…. maybe we need a new paradigm/measures??

Results

Disappointingly, participants’ population representation were not sensitive to the differing population information presented in the different conditions. Generally participants either selected the medium Zarpie (6) or the tallest Zarpie (8).

As expected, and like in the previous pilot, participants’ sample representation did not differ across conditions, since all participants saw the same sample. Strangely and like in the previous pilot, their sample judgments were significantly taller than the true mean/mode of the sample (6). Generally participants either selected the medium Zarpie (6) or the tallest Zarpie (8).

Interpretation

Encoding and memory issues

Participants could have simply failed to pay attention to the Zarpies on Zarpie island.
- But, participants largely passed the memory check immediately after the boarding sequence. They remembered that all the Zarpies got onto the boat in the short condition, and that not all the Zarpies got onto the boat in the medium and tall conditions. (see Memory check)
Participants could have paid attention to the Zarpies on Zarpie island, enough to pass memory check immediately after, but then failed to take away anything from that scene, or forgot later about Zarpies on Zarpie island. Participants then simply treated the sample as the population, forgetting about the Zarpies in the boarding sequence.
- Although this could be true for the short condition (where the Zarpies on Zarpie island board successfully and become the sample), participants’ population judgments differ from their sample judgments in the medium and tall conditions (where some of the Zarpies who tried to board could not board) (see Sample vs population). This suggests participants in the medium and tall conditions did remember something about the Zarpies on Zarpie island that entered into their population judgments.
- But, participants in the medium and tall conditions distinguished sample from population, in providing taller responses for population than sample (see sample vs population), and generally saying ‘taller’ when explicitly asked to compare population and sample (see Explicit comparison).
Participants could have paid attention to the Zarpies on Zarpie island, enough to recall whether all Zarpies boarded, but come away with only a vague encoding/memory of the Zarpies on Zarpie island.
- This would explain how the population judgments in medium versus tall conditions were not different from each other, if participants only have a vague sense that they are taller, but not by how much.
Participants could have visually integrated across the parade of Zarpires on Zarpie island, but overweighted perceptually salient Zarpies, such as the ones who stoop and the ones who do not board.
Participants could have failed to visually integrate across the parade of Zarpies on Zarpie island. (In the short condition, there are only a few Zarpies in line to board and they are all visible on screen at once. In the medium and tall conditions, there are many Zarpies in line to board who parade through; the whole parade is never visible on screen at the same time.)
- Does not explain why in the short condition, participants still thought Zarpies on Zarpie island were taller than the actual mean. In the medium and tall conditions, if they only fixated on the Zarpies visible in the initial frame, their responses should be on the short end of the scale, which did not occur.
Participants could have treated the Zarpies on the island as yet “more sample”, lumping together the Zarpies they saw on the island with the Zarpies they saw who visited.
- But, participants in the medium and tall conditions distinguished sample from population, in providing taller responses for population than sample (see sample vs population), and generally saying ‘taller’ when explicitly asked to compare population and sample (see explicit comparison).
- In addition, this should still predict condition differences, since the Zarpies on the island were different across conditions, even though the sample of Zarpies who came on the boat was the same.

Measurement issues

Participants could have been split on whether the questions were about the average height or about the tallest height of the Zarpies they saw.
- For the sample question, this would explain why responses have a bimodal shape, clustered around 6 (under the average reading, since 6 is sample average) and 8 (under the tallest reading, since 8 is the tallest in the sample).
- This seems likely based on participants’ free responses to the sample representation and population representation questions.
- This does get a bit messy for the population question, and would require something else at work (e.g., imprecise visual encoding). Under the average reading, 6 is the population average in the short condition, but the population average in med and tall conditions are 7.27 and 8. This means that we should expect to see some clustering of population judgments around 7 for the medium condition, representing those who took the average reading, but there are very few responses there. Under the tallest reading, 8 is the tallest of the Zarpies on Zarpie island in the short condition, but 10 and 12 respectively are the tallest of the Zarpies on Zarpie island in the medium and tall conditions. 8 is the highest value on the response scale, so a ceiling effect could explain why responses cluster at 8. (But my intuition as a participant is that if the response I want to make is not on the scale, I must have interpreted the question wrongly, and to switch to an alternative reading of the question (e.g., the average reading.)
Participants could have interpreted the population question (“Which picture shows how tall Zarpies are on Zarpie island?”) as a question about the remainder of Zarpies on Zarpie island after those who boarded left.
- There is some evidence that supports this interpretation in free response explanations to the explicit comparison question, where several people said the Zarpies on Zarpie island must be taller because the shorter ones all left.
  - Could fix this by rewording the question to ask about “Zarpies in general” instead of “Zarpies on Zarpie island”, by signaling that the Zarpies seen lining up to board are only a small random sample of a larger population boarding, or by enlarging the population of Zarpie island such that those who leave are an insignificant number of people.

Takeaways

The sanity check pilot was not successful, likely because adults are sensitive - but only in a coarse way - to height differences in the Zarpies they saw boarding. Adults who see Zarpies failing to board (medium and tall conditions) vaguely know that the population is taller than sample, but this judgment is not precise enough to precisely distinguish how much taller (see Encoding and memory issues > #3).

In addition, there were likely measurement issues, with the wording of the question either eliciting judgments about the average versus the extreme (see Measurement issues > #1), and the population question either eliciting judgments about Zarpies as population or the remainder of Zarpies left behind on Zarpie island after those who leave are gone (see Measurement issues > #2).

Methods

Participants

Data was collected from 150 adults recruited via Prolific on Tues 3/18/2025. Participants were required to be in the United States, fluent in English, and have not participated in the earlier pilot of this study.

Participants were paid $2.25 for an estimated 8-10.5 minute task.

The final sample included 136 adults (n = 44-46 in each of the 3 conditions).

pop	n
short	46
med	46
tall	44

Exclusion criteria

14 participants (9.3% of all participants) were excluded for meeting at least 1 of the following exclusion criteria:

failing the sound check (n = 1 participants)
failing to check an item mentioned in the task (i.e., did not select “A boat” and “Zarpies”) (n = 13 participants)
failing to select the correct task description (i.e., did not select “Watching videos about fictional people from an island”) (n = 0 participants)

Demographics

mean	sd	n
age
36.61	12.50	136

The sample skewed young in age.

gender	n	prop
Female	79	58.1%
Male	52	38.2%
Non-binary	3	2.2%
31	1	0.7%
Prefer not to specify	1	0.7%

The sample reflected the diversity of the gender identities in the US.

race	n	prop
White, Caucasian, or European American	85	62.5%
Black or African American	23	16.9%
South or Southeast Asian	8	5.9%
Hispanic or Latino/a	7	5.1%
East Asian	4	2.9%
White, Caucasian, or European American,Black or African American	2	1.5%
White, Caucasian, or European American,Hispanic or Latino/a	2	1.5%
Middle Eastern or North African	1	0.7%
Mixed	1	0.7%
White, Caucasian, or European American,Middle Eastern or North African	1	0.7%
White, Caucasian, or European American,South or Southeast Asian	1	0.7%
biracial	1	0.7%

The sample was also racially diverse.

education	n	prop
Less than high school	1	0.7%
High school/GED	16	11.8%
Some college	32	23.5%
Bachelor's (B.A., B.S.)	66	48.5%
Master's (M.A., M.S.)	18	13.2%
Doctoral (Ph.D., J.D., M.D.)	3	2.2%

The sample was mostly college-educated.

Procedure

This study was administered as a Qualtrics survey, and approved by the NYU IRB (IRB-FY2024-9169).

After providing their consent, participants completed a captcha and sound check, and were asked to watch videos sound on. Participants then watched the following videos in order:

Links to short, medium, tall condition videos.

In the prior setting and familiarization phase, participants saw an actual picture of 5 human adults and then another picture of a different 5 adults appear on screen against a grid. These adults were all 10 gridline units tall.
In the boat introduction, all participants saw a boat that was 7 units tall. The boat height was specified to be accidental (“When the boat builders were building the boat, they started building the boat from the bottom, but ran out of the special wood they needed for the boat! So the boat ended up being this tall. It might be hard for anyone who is taller than the boat to get on the boat.”), to avoid any justificatory reasoning about the height of the boat being informative about the height of Zarpies or vice versa.
In the boat boarding phase, participants saw a parade of Zarpies attempt to board the boat to visit us, one at a time. Participants were told that they were all grown-up Zarpies.

Unlike the last pilot, the boat height was fixed across conditions:

boat height 7

Like the last pilot, the sample (the Zarpies who successfully boarded the boat) was held constant across conditions:

(4, 5, 6, 6, 7, 8)

Unlike the last pilot, the population (the parade of Zarpies who attempted to board the boat) was visible and differed across conditions. Bold indicates successful boarding.

In the short population condition, the parade was Zarpies of heights (4, 5, 6, 6, 7, 8). All Zarpies who attempted to board successfully boarded (6 out of 6 successful = 100% successful), with the last Zarpie (height 8) stooping to board, since they are a bit taller than the boat ceiling (7 units tall).
In the medium population condition, the parade was Zarpies of heights (4, 5, 6, 6, 7, 8, 8, 8, 9, 9, 10). Not all Zarpies who attempted to board were successful in boarding (6 out of 11 successful = 54.5% successful). The second Zarpie of height 8 stooped to board, since they are a bit taller than the boat ceiling (7 units tall).
In the tall population condition, the parade was Zarpies of heights (4, 5, 6, 6, 7, 8, 8, 8, 8, 8, 8, 9, 10, 10, 11, 12). Not all Zarpies who attempted to board were successful in boarding (6 out of 16 successful = 37.5% successful). The second Zarpie of height 8 stooped to board, since they are a bit taller than the boat ceiling (7 units tall).

After the boat training phase, participants were asked a memory check: “Did all of the Zarpies board the boat?” (yes/no), and received either an affirmation or correction.
In the sample observation phase, all participants saw the Zarpies who successfully boarded the boat get off the boat to visit us. The Zarpies got off one at a time, and each waved/descrunched if relevant. The height of this observed sample (4, 5, 6, 6, 7, 8) was held constant across conditions.

$Sample.$

Sample.

To emphasize the height of the Zarpies relative to the boat, participants watched Zarpies deboard the boat, wave, reboard the boat (with any Zarpies taller than the boat stooping down again to board again), and deboard again (with any Zarpies taller than the boat straightening up again).

Participants were asked how tall the Zarpies who visited were (Sample representation) and how tall the Zarpies on Zarpie island are (Population representation), in counterbalanced order.
Participants were asked an [explicit comparison] question asking them to compare the heights of Zarpies on Zarpie island to that of Zarpies who visited: shorter, about the same, or taller. This question always came after the sample and population questions.

Finally, participants were also asked for feedback at the end of the task: any problems or confusion they had, and what they thought the task was about (see Participant feedback).

Task comprehension

Memory check

The correct answer to this question depends on condition:

In the short population condition, the correct answer is “yes”, all of the Zarpies made it onto the boat.
In the medium and tall population conditions, the correct answer is “no”, not all of the Zarpies made it onto the boat.

Participant feedback

Participants by and large did not report any problems or confusion with the task.

Primary results

Sample representation

As a check that they could retrieve the mean of the sample they observed, participants were asked, “Which picture shows how tall the Zarpies who visited are?” Response options were a Zarpie of height 4, 5, 6, 7, or 8.

Sample question.

Since all participants saw the same sample (4, 5, 6, 6, 7, 8), all participants should provide the same response regardless of condition. This response is expected to be the mean of the sample: 6.

As expected, there was no main effect of population condition on sample representations (in a simple linear regression), since all participants observed the same sample (6 Zarpies: 4, 5, 6, 6, 7, 8).

lm(dv_sample ~ pop,
   data = data) %>% 
  Anova()

## Anova Table (Type II tests)
## 
## Response: dv_sample
##            Sum Sq  Df F value Pr(>F)
## pop         3.537   2  1.3035  0.275
## Residuals 180.456 133

Strangely, and like the last pilot, participants overall thought the sample was significantly taller than 6, the true mean & mode. Broken down by condition, participants in the short and tall conditions gave responses significantly taller than 6, while participants in the medium condition were trending in that direction (t(45) = 1.73, p = .09).

t.test(data %>% 
         select(dv_sample), 
       mu = mean(observed_sample)) # true mean of observed sample = 6

## 
##  One Sample t-test
## 
## data:  data %>% select(dv_sample)
## t = 4.9212, df = 135, p-value = 0.000002464
## alternative hypothesis: true mean is not equal to 6
## 95 percent confidence interval:
##  6.294666 6.690628
## sample estimates:
## mean of x 
##  6.492647

t.test(data %>% 
         filter(pop == "short") %>% 
         select(dv_sample), 
       mu = mean(observed_sample)) # true mean of observed sample = 6

## 
##  One Sample t-test
## 
## data:  data %>% filter(pop == "short") %>% select(dv_sample)
## t = 3.9644, df = 45, p-value = 0.0002606
## alternative hypothesis: true mean is not equal to 6
## 95 percent confidence interval:
##  6.342225 7.049079
## sample estimates:
## mean of x 
##  6.695652

t.test(data %>% 
         filter(pop == "med") %>% 
         select(dv_sample), 
       mu = mean(observed_sample)) # true mean of observed sample = 6

## 
##  One Sample t-test
## 
## data:  data %>% filter(pop == "med") %>% select(dv_sample)
## t = 1.7344, df = 45, p-value = 0.08969
## alternative hypothesis: true mean is not equal to 6
## 95 percent confidence interval:
##  5.950921 6.657775
## sample estimates:
## mean of x 
##  6.304348

t.test(data %>% 
         filter(pop == "tall") %>% 
         select(dv_sample), 
       mu = mean(observed_sample)) # true mean of observed sample = 6

## 
##  One Sample t-test
## 
## data:  data %>% filter(pop == "tall") %>% select(dv_sample)
## t = 2.8522, df = 43, p-value = 0.006648
## alternative hypothesis: true mean is not equal to 6
## 95 percent confidence interval:
##  6.139812 6.814734
## sample estimates:
## mean of x 
##  6.477273

When asked why they selected the picture they did, participants’ responses generally described:

selecting the average, or mode:
- “because that appears to be the average height, out of all the Harpies” (6)
- “This Zarpie still represents the average height of all the Zarpies that boarded the boat and visited.” (6)
- “There were lots of different sizes but this one is the most common” (7)
selecting the tallest extreme, e.g.:
- “It was the tallest zarpie so felt like the best choice.” (8)
- “Chose the one that is the tallest” (8)
- “Because I was not allowed to select all of them and”how tall” would typically be asking for maximum height if you could only choose one” (8)
- “I don’t understand the question, all of those zarpies came and they were all different heights. I chose the tallest one because it had to crouch which made it mildly more memorable.” (8)

Looking at the participants who chose 6 and 8 (the mode responses):

Participants who chose 6 generally explained their choice as choosing the “average” Zarpie.
Participants who chose 8 generally explained their choice as choosing the “tallest” Zarpie.

Population representation

As a check for their representation of the population, participants were asked: “Which picture shows how tall Zarpies are on Zarpie island?” Response options were a Zarpie of height 4, 5, 6, 7, or 8.

Population question.

If this question is a valid measure of participants’ representation of the average height of Zarpies, and participants remember how tall Zarpies are in the boarding scene and use that as their representation of Zarpies on Zarpie island, the expected response in each condition is:

pop short: (4, 5, 6, 6, 7, 8) –> population mean = 6
pop med: (4, 5, 6, 6, 7, 8, 8, 8, 9, 9, 10) –> population mean = 7.27
pop tall: (4, 5, 6, 6, 7, 8, 8, 8, 8, 8, 8, 9, 10, 10, 11, 12) –> population mean = 8

lm(dv_pop ~ pop,
   data = data) %>% 
  Anova()

## Anova Table (Type II tests)
## 
## Response: dv_pop
##           Sum Sq  Df F value Pr(>F)
## pop          1.6   2  0.6963 0.5002
## Residuals  152.8 133

When asked why they selected the picture they did, participants’ responses generally described: * selecting the average, e.g.: + “It’s the average of the height of all the zarpies I saw” + “There were varying heights to the Zarpies so I picked one that was a middle or medium height.” + “I chose option 3 because there were varying sizes of Zarpies on the island, so this would be about the average.” + “The Zarpies are all different heights so I chose one that seems to be the average of all the heights shown.”

selecting the tallest extreme, e.g.:
- “He represents tallest”
- “Shows that all the others are shorter than him/her.”
- “This is the tallest zarpie”
- “It shows the tallest a zarpie can be.”

Participants gave different explanations for their responses by condition ($p$ = .004, Fisher’s exact).

## 
##  Fisher's Exact Test for Count Data
## 
## data:  .
## p-value = 0.003973
## alternative hypothesis: two.sided

Looking at the participants who chose 6 and 8 (the mode responses):

Participants who chose 6 generally explained their choice as choosing the “average” Zarpie.
Participants who chose 8 generally explained their choice as choosing the “tallest” Zarpie.

Explicit comparison

Participants were explicitly asked to compare the population to the sample: “Do you think the Zarpies on Zarpie island are shorter, about the same, or taller than the Zarpies who visited?”

	shorter	about the same	taller
short	4%	91%	4%
med	9%	28%	63%
tall	14%	27%	59%

Should we be worried that participants in the medium and tall conditions were not at ceiling for reporting that the Zarpies on Zarpie island are “taller” than the Zarpies who visited (59-63%)?

They might be “about the same”, in the sense that they are all Zarpies at the end of the day?

## 
##  Fisher's Exact Test for Count Data
## 
## data:  .
## p-value = 0.000000000003984
## alternative hypothesis: two.sided

## 
##  Fisher's Exact Test for Count Data
## 
## data:  .
## p-value = 0.00000000009606
## alternative hypothesis: two.sided

## 
##  Fisher's Exact Test for Count Data
## 
## data:  .
## p-value = 0.0000000001825
## alternative hypothesis: two.sided

## 
##  Fisher's Exact Test for Count Data
## 
## data:  .
## p-value = 0.7802
## alternative hypothesis: two.sided

Participants’ explicit comparison responses differed by condition ($p$ < .001, Fisher’s exact). Specifically, responses in the short population condition differed from responses in the medium and tall conditions ($p$s < .001, Fisher’s exact); responses in the medium and tall conditions did not differ from each other ($p$ = .78, Fisher’s exact).

These results suggest that participants are sensitive to the fact that the population must be taller if taller Zarpies got cut-off, as in the medium and tall conditions, but the difference between the medium and tall conditions may be too subtle for them to pick up on.

Looking at participants’ explanations for their responses, many participants reasoned about those remaining on the island, rather than Zarpies as a whole, which suggests a measurement issue with what the question is eliciting.

Secondary results

Sample vs population

We can compare participants’ responses to the sample question to their responses to the population question using paired t-tests.

## 
##  Paired t-test
## 
## data:  data %>% filter(pop == "short") %>% pull(dv_sample) and data %>% filter(pop == "short") %>% pull(dv_pop)
## t = -0.45486, df = 45, p-value = 0.6514
## alternative hypothesis: true mean difference is not equal to 0
## 95 percent confidence interval:
##  -0.4719978  0.2980848
## sample estimates:
## mean difference 
##     -0.08695652

As expected, participants in the short condition did not give different responses to sample and population questions ($t$(45) = -0.45, $p$ = .65). This is expected since in the short condition, the sample and the population are identical.

## 
##  Paired t-test
## 
## data:  data %>% filter(pop == "med") %>% pull(dv_sample) and data %>% filter(pop == "med") %>% pull(dv_pop)
## t = -4.0383, df = 45, p-value = 0.000207
## alternative hypothesis: true mean difference is not equal to 0
## 95 percent confidence interval:
##  -1.0751910 -0.3595917
## sample estimates:
## mean difference 
##      -0.7173913

## 
##  Paired t-test
## 
## data:  data %>% filter(pop == "tall") %>% pull(dv_sample) and data %>% filter(pop == "tall") %>% pull(dv_pop)
## t = -2.9607, df = 43, p-value = 0.004979
## alternative hypothesis: true mean difference is not equal to 0
## 95 percent confidence interval:
##  -0.8787858 -0.1666687
## sample estimates:
## mean difference 
##      -0.5227273

In contrast, participants in the medium and tall conditions gave taller responses to the population question than to the sample question (medium: $t$(45) = -4.04, $p$ < .001, tall: $t$(43) = -2.96, $p$ = .005). This makes sense, because in those conditions, the taller portion of the population got cut off from boarding.

This result supports the idea that participants in the medium and tall conditions know the population differs from the sample, i.e., that the population is taller than the sample.

We can also compare participants’ explanations for their responses to sample vs population questions - do participants maintain the same “reading” of the question (e.g., “average” vs “tallest”) across the two questions? Note the order of the two questions was counter-balanced.

Indeed, participants generally gave the same explanations for their sample and population responses, e.g., people who explained their sample responses as the “average” also tended to explain their population responses as the “average”.

Order effects

Participants saw the two DVs in counterbalanced order:

pop_sample = population DV first, then sample DV
sample_pop = sample DV first, then population DV

There was no effect of DV order on sample responses, nor on population responses.

lm(dv_sample ~ pop * cb_dvorder,
   data = data) %>% 
  Anova()

## Anova Table (Type II tests)
## 
## Response: dv_sample
##                 Sum Sq  Df F value Pr(>F)
## pop              2.945   2  1.0869 0.3403
## cb_dvorder       2.063   1  1.5223 0.2195
## pop:cb_dvorder   2.248   2  0.8297 0.4385
## Residuals      176.144 130

lm(dv_pop ~ pop * cb_dvorder,
   data = data) %>% 
  Anova()

## Anova Table (Type II tests)
## 
## Response: dv_pop
##                 Sum Sq  Df F value  Pr(>F)  
## pop              2.052   2  0.9130 0.40386  
## cb_dvorder       3.106   1  2.7644 0.09879 .
## pop:cb_dvorder   3.625   2  1.6131 0.20323  
## Residuals      146.073 130                  
## ---
## Signif. codes:  0 '***' 0.001 '**' 0.01 '*' 0.05 '.' 0.1 ' ' 1

Session info

## R version 4.4.2 (2024-10-31)
## Platform: aarch64-apple-darwin20
## Running under: macOS Sequoia 15.3.1
## 
## Matrix products: default
## BLAS:   /Library/Frameworks/R.framework/Versions/4.4-arm64/Resources/lib/libRblas.0.dylib 
## LAPACK: /Library/Frameworks/R.framework/Versions/4.4-arm64/Resources/lib/libRlapack.dylib;  LAPACK version 3.12.0
## 
## locale:
## [1] en_US.UTF-8/en_US.UTF-8/en_US.UTF-8/C/en_US.UTF-8/en_US.UTF-8
## 
## time zone: America/New_York
## tzcode source: internal
## 
## attached base packages:
## [1] stats     graphics  grDevices utils     datasets  methods   base     
## 
## other attached packages:
##  [1] lmerTest_3.1-3  lme4_1.1-35.5   Matrix_1.7-1    car_3.1-3      
##  [5] carData_3.0-5   ggtext_0.1.2    lubridate_1.9.3 forcats_1.0.0  
##  [9] stringr_1.5.1   dplyr_1.1.4     purrr_1.0.2     readr_2.1.5    
## [13] tidyr_1.3.1     tibble_3.2.1    ggplot2_3.5.1   tidyverse_2.0.0
## [17] gt_0.11.1       scales_1.3.0    janitor_2.2.0   here_1.0.1     
## 
## loaded via a namespace (and not attached):
##  [1] tidyselect_1.2.1    viridisLite_0.4.2   farver_2.1.2       
##  [4] fastmap_1.2.0       digest_0.6.37       rpart_4.1.23       
##  [7] timechange_0.3.0    lifecycle_1.0.4     cluster_2.1.6      
## [10] magrittr_2.0.3      compiler_4.4.2      rlang_1.1.4        
## [13] Hmisc_5.1-3         sass_0.4.9          tools_4.4.2        
## [16] yaml_2.3.10         data.table_1.15.4   knitr_1.49         
## [19] htmlwidgets_1.6.4   labeling_0.4.3      bit_4.5.0.1        
## [22] xml2_1.3.6          abind_1.4-8         withr_3.0.2        
## [25] foreign_0.8-87      numDeriv_2016.8-1.1 nnet_7.3-19        
## [28] grid_4.4.2          colorspace_2.1-1    MASS_7.3-61        
## [31] cli_3.6.3           rmarkdown_2.29      crayon_1.5.3       
## [34] ragg_1.3.2          generics_0.1.3      rstudioapi_0.17.1  
## [37] tzdb_0.4.0          commonmark_1.9.2    minqa_1.2.8        
## [40] cachem_1.1.0        splines_4.4.2       ggthemes_5.1.0     
## [43] parallel_4.4.2      base64enc_0.1-3     vctrs_0.6.5        
## [46] boot_1.3-31         jsonlite_1.8.9      hms_1.1.3          
## [49] bit64_4.5.2         htmlTable_2.4.3     Formula_1.2-5      
## [52] systemfonts_1.1.0   jquerylib_0.1.4     glue_1.8.0         
## [55] nloptr_2.1.1        stringi_1.8.4       gtable_0.3.5       
## [58] munsell_0.5.1       pillar_1.10.0       htmltools_0.5.8.1  
## [61] R6_2.5.1            textshaping_0.4.0   rprojroot_2.0.4    
## [64] vroom_1.6.5         evaluate_1.0.1      lattice_0.22-6     
## [67] markdown_1.13       backports_1.5.0     gridtext_0.1.5     
## [70] snakecase_0.11.1    bslib_0.8.0         Rcpp_1.0.13        
## [73] checkmate_2.3.2     gridExtra_2.3       nlme_3.1-166       
## [76] xfun_0.49           pkgconfig_2.0.3

Structural skew

Pilot 2 (sanity check with adults)

Marianna Zhang

2025-03-18