Specialized Research Designs

RSM2074 Lecture Week 10

Dr. Jun Ho Chai

Specialized Research Designs

The Plan for Today

  • Developmental Designs: Tracking change over time
  • Longitudinal, Cross-sectional, Cohort-sequential
  • Small-n Designs: Deep individual focus
  • Understanding trade-offs and biases

Why Study Development?

Language Development

How does vocabulary grow from 50 to 300 words between 18-24 months?

Cognitive Abilities

When does Theory of Mind emerge? How does it relate to language?

Individual Differences

Why do some children show vocabulary spurts while others don’t?

The Central Challenge

The Fundamental Dilemma

To understand development, we need to track changes.
But tracking takes time, money, and commitment.

Two Main Approaches:

  • Same Children? Follow over time (Longitudinal = Within-Subjects)
  • Different Children? Test once at different ages (Cross-sectional = Between-Subjects)

Quasi-Experimental Designs

What is Quasi-Experimental?

Key Characteristics:

🚫 No random assignment

📊 Some comparison group

🔧 Researcher manipulates treatment

🌍 More realistic settings

Why Use Quasi-Experiments?

💼 Real-world practicality

⚖️ Ethical constraints

👥 Can’t randomize naturally occurring groups

🏛️ Policy/program evaluation

Quasi-Experiment vs True Experiment

True Experiment

🎲 Random assignment

🔐 Experimental control

📈 Internal validity

📍 Limited real-world applicability

Quasi-Experiment

🚫 No random assignment

📊 Some control

⚖️ Moderate internal validity

🌍 Real-world relevance

Common Quasi-Experimental Designs

Non-Equivalent Control Group

👥 Two existing groups

💊 One gets intervention

📉 Compare pre-post changes

Interrupted Time Series

📍 Baseline established

⚡ Intervention introduced

📈 Track outcome over time

🔍 Look for “interruption”

Longitudinal Design

What is Longitudinal Design?

Key Principle:

Within-Subjects Variable

  • Same person at multiple ages
  • Track individual change
  • Can take weeks, months, or years

Visual:

Child A: ●──●──●──●
         2  3  4  5

Child B: ●──●──●──●
         2  3  4  5

Child C: ●──●──●──●
         2  3  4  5

Follow same children over time

Track a Child’s Development

Individual Developmental Paths

Fernald et al. (2013): Not all children grow the same way

Four Growth Patterns:

  • 30% Fast starters
  • 25% Late bloomers
  • 35% Steady growers
  • 10% Early plateau

All reached normal by age 3!

Why This Matters:

✓ Only within-subjects design captures individual paths

✓ Different routes to same outcome

✓ Distinguish growth patterns hidden in averages

Advantages of Longitudinal

🔗 Causal Order

• Pretest-Posttest proves causation

• Early → Later outcomes

👤 Within-Subjects

• Same genes & family

• Changes = development

📊 Individual Paths

• Not just group averages

• Real trajectories

Disadvantages of Longitudinal

Practice Effects

• Repeated testing improves scores

• Not real development!

👋 Attrition Bias

• 15-30% drop out

• Busiest families leave first

💰 Expensive

• Multi-year funding

• Staff & tracking

Real Study Attrition Rates

Cross-Sectional Design

What is Cross-Sectional Design?

Key Principle:

Between-Subjects Variable

  • Different children at each age
  • Each tested ONCE
  • Compare groups

Visual:

Age 3: ●  ●  ●  (A, B, C)

Age 4: ●  ●  ●  (D, E, F)

Age 5: ●  ●  ●  (G, H, I)

Different children, one snapshot

Wordbank Vocabulary Data

Real data from 92,771 children (wordbank.stanford.edu)

Advantages of Cross-Sectional

Fast

• Test all ages in 1 month

• No waiting!

No Attrition

• Each child tested once

• No dropout bias

💵 Cheap

• Single wave

• Lower cost

Disadvantages of Cross-Sectional

⚠️ Cohort Effects

• Born 2010 vs 2020?

• Can’t separate age from generation

📊 No Individuals

• Only between-subjects comparison

• Hide individual paths

No Causation

• Can’t establish temporal order

• No pretest-posttest possible

Cohort Effect Problem

Same age, different childhoods = different experiences

Cohort-Sequential Design

What is Sequential Design?

The Strategy:

  1. Enroll multiple age groups
  2. Follow each over time
  3. Compare overlapping ages

= Mixed-Subjects Design

Visual:

Cohort A: ●──●──●
          3  4  5

Cohort B:       ●──●──●
                5  6  7

Overlap at age 5 reveals cohort vs age!

Sequential Structure

Time-Series Approach: Ages 3-9 in just 3 Calendar Years

Year 1

Cohort A → 3
Cohort B → 5
Cohort C → 7

Year 2

Cohort A → 4
Cohort B → 6
Cohort C → 8

Year 3

Cohort A → 5
Cohort B → 7
Cohort C → 9

✓ Result

Ages 3-9
In 3 years!
7 years saved!

🔍 Key Insight: Cohorts A & B both reach ages 5, 6, 7. If results match → age effect. If different → cohort effect!

Example: Math Motivation

Chouinard & Roy (2008): Does motivation drop at Grade 9?

Quasi-Experiment Design:

Cohort Yr 1 Yr 2 Yr 3
Cohort 1 Gr 7 Gr 8 Gr 9 ⚠️
Cohort 2 Gr 9 ⚠️ Gr 10 Gr 11

Both cohorts pass through Grade 9

Finding:

📉 Both cohorts showed sharp motivation decline at Grade 9

Conclusion:

✓ High school transition effect
✗ NOT birth cohort

This is an age/grade effect!

When to Use Which Design?

Longitudinal (Within-Subjects + Pretest-Posttest)

“Does early language predict reading?”
Need time order + tracking same children

Cross-Sectional (Between-Subjects)

“How does vocab differ by age?”
Limited time/budget, no causation

Sequential (Mixed-Subjects + Time-Series)

“Is decline age or generation?”
Need to separate cohort effects

Small-n Designs

What Are Small-n Designs?

Small sample (n=1-15), MANY measurements (20-100+ per person)

Traditional Large-n:

100 people × 2 times
= Between-subjects focus

Small-n Design:

3 people × 80 times
= Within-subjects intensive

Visual:

Person A: ●●●●●●●●●●●●●... (80 obs)
Person B: ●●●●●●●●●●●●●... (80 obs)
Person C: ●●●●●●●●●●●●●... (80 obs)

When to Use Small-n?

🎯 Individual Therapy

Does THIS intervention work for THIS person?

🔬 Rare Conditions

Can’t find 100 participants

🏫 Applied Settings

Teachers need quick feedback

Memory Retention

Ebbinghaus tested his memory 300+ times (within-subjects)

A-B-A-B Design

Does training increase vocabulary?

Example: Pretest-Posttest

Does training increase vocabulary? (Within-subjects comparison)

Strengths of Small-n

🎯 Individual Proof

• Shows effect for THIS person

• Within-subjects power

🔄 Flexible

• Can adapt real-time

• Responsive to change

Feasible

• Only need 1-15 participants

• Practical for applied settings

Limitations of Small-n

Generalization?

• Works here, but elsewhere?

• Can’t test between-subjects

⏱️ Fatigue Effects

• Need 20-100+ measurements

• Risk of exhaustion bias

🔁 Carryover

• Can’t truly “un-learn”

• Later trials affected

Sources of Bias

Four Major Threats

1. Attrition
People drop out (not random)

2. Practice Effects
Repeated testing improves scores

3. Cohort Effects
Different generations

4. History Effects
External events

Threat 1: Attrition

Who leaves? NOT random!

👨‍👩‍👧‍👦 Busiest families
😰 Struggling children
💸 Lower SES families

Visual:

Started: ●●●●●●●●●● (100)
         ↓ ↓ ↓
Ended:   ●●●●●●●     (70)

Biased sample!

Affects: Longitudinal, Sequential designs

Threat 2: Testing Effects

Repeated testing = Practice!

📝 Learn test format
🧠 Remember items
✍️ Develop strategies

Visual:

Test 1: 70% ────┐
Test 2: 75% ────┤ Practice!
Test 3: 80% ────┘ Not development!

Affects: Longitudinal, Sequential, Small-n (within-subjects designs)

Threat 3: Cohort Effects

Different childhoods!

📱 2010: No tablets
📱 2020: 5h screen/day

🏞️ 2010: 4.5h outdoor
🏠 2020: 1.2h outdoor

Visual:

Born 2010:  🏃‍♂️🌳📖
Born 2020:  📱💻🎮

Same age, different worlds!

Affects: Cross-sectional (between-subjects designs)

Threat 4: History Effects

External events!

🦠 Pandemic
📉 Economic crisis
🌊 Natural disasters

Time-Series Confound:

2018: Normal ──●──●──●
2019: Normal ──●──●──●
2020: Event! 🦠
2021: Changed ──●──●──●

Changes = Event, not development!

Affects: All longitudinal and time-series designs

Key Takeaway

The best design answers YOUR question!

Consider:

  • 🎯 Your research question
  • ⏰ Time and money available
  • 👥 Sample accessibility
  • 🔍 What biases matter most?

Naturalistic Recordings

Why Record Natural Interactions?

Lab Testing Limitations:

🏫 Artificial environment

📸 One-off behavior snapshot

👨‍🔬 Researcher-guided tasks

😟 Stress effects

Naturalistic Recording Benefits:

🏠 Real-world behavior

📈 Ongoing development

🎨 Child-directed activity

💬 Authentic language/interaction

Methods of Recording

🎙️ Audio Recording

Easiest to collect

Portable devices

Language focus

Privacy friendly

📹 Video Recording

Captures nonverbal behavior

Joint attention markers

Motor skills visible

Privacy concerns

🔀 Hybrid Approaches

Audio + observer notes

Wearable cameras

Multiple perspectives

Mixed data streams

Analyzing Naturalistic Data

📝 Transcription

Word-by-word or phrase-based

Coding conventions (CHAT, ELAN)

Time-aligned to audio/video

Intensive & expensive

🏷️ Coding Schemes

Developmental milestone markers

Social interaction patterns

Language complexity measures

Context-dependent categories

AI-Driven Automated System: LENA System

What is LENA?

📊 Language Environment Analysis

🎤 Wearable recorder (child)

⏱️ 16+ hours daily recording

🤖 Automated + manual analysis

Captures:

📢 Adult word count

🗣️ Child vocalization rate

💬 Conversational turns

🔊 TV/noise exposure

Finding: 30M word gap!

🔗 lena.org

Advantages of Naturalistic Data

🌍 Ecological Validity

Real-world behavior observed

Natural developmental contexts

Longitudinal Feasibility

Easy to repeat over time

Captures growth trajectories

📚 Rich Contextual Info

Environmental influences visible

Family practices documented

Challenges of Naturalistic Data

⏱️ Time Intensive

Hours of recording per family

Days of transcription work

🔍 Reliability Issues

Observer bias in coding

Intercoder agreement critical

🔐 Privacy & Ethics

Informed consent for recording

Data security requirements

Thank You

Questions?