Tests of Significance and Measures of Association

Harriet Goers

Learning Objectives

  • Understand how relationships are affected by sampling error
  • Use informal and formal tests of significance
  • Interpret confidence intervals
  • Distinguish between difference of means and proportions tests
  • Measure strength of relationships with association statistics

Why Test for Significance?

  • Sample result: Women mean = 54, Men mean = 49
  • Is the 5-point gap real or just noise?
  • We use statistical significance testing to decide

The Null Hypothesis

  • H₀: No relationship exists in the population
    • Any observed relationship is due to chance
  • Hₐ: A relationship exists
    • Use logic of contradiction: reject H₀ only if evidence is strong

Confidence Intervals (CIs)

  • CI shows a range where the true population parameter likely falls
  • If CI includes H₀ value → fail to reject H₀

Formula: \[ SE = \frac{s}{\sqrt{n}} \\ CI_{95\%} = \bar{x} \pm 1.96 \cdot SE \]

One-Sample Significance Test

  • Compare sample statistic to fixed value (mean or proportion)
  • Use CI or compute a P-value

Example: Is average quiz score > 15? If CI includes 15 → no significant difference

Two-Sample Tests

We now examine whether differences between groups are statistically significant.

Difference of Means

  • Use when DV is continuous (e.g., feeling thermometer)
  • Compare group means

Formula: \[ SE_{\text{diff}} = \sqrt{SE_1^2 + SE_2^2} \]

Significant if: \[ \frac{\text{Difference}}{SE_{\text{diff}}} \geq 2 \]

Difference of Proportions

  • Use when DV is binary (e.g., support/oppose)
  • Compare group proportions
  • Test statistic: Z-score

Formulas: \[ SE = \sqrt{\frac{pq}{n}}, \quad q = 1 - p \\ SE_{\text{diff}} = \sqrt{SE_1^2 + SE_2^2} \]

Key Distinction

Difference of Means Difference of Proportions
Outcome Type Continuous Binary/Categorical
Test Used t-test Z-test
Distribution Student’s t Normal

The Chi-Square Test (\(\chi^2\))

  • Use with two categorical variables
  • Tests if distributions differ across categories

Formula: \[ \chi^2 = \sum \frac{(f_o - f_e)^2}{f_e} \] Reject H₀ if \(\chi^2\) is large enough.

Measures of Association

  • Significance = whether a relationship exists
  • Association = how strong that relationship is

Preferred qualities: - Based on PRE - Asymmetric (cause → effect)

Lambda (\(\lambda\))

Use for nominal variables

Formula: \[ \lambda = \frac{E_1 - E_2}{E_1} \]

( \(E_1\) ): prediction errors without IV
( \(E_2\) ): prediction errors with IV

Somers’ dyx

Use for ordinal variables

Formula: \[ d_{yx} = \frac{C - D}{C + D + T_y} \]

  • ( C ): Concordant pairs
  • ( D ): Discordant pairs
  • ( T_y ): Ties on DV

Ranges from -1 to +1

Recap

  • Test whether differences are real using t, Z, or ( ^2 )
  • Use confidence intervals to assess uncertainty
  • Match the test to your variable types
  • Use lambda and Somers’ dyx to assess strength

Next time you see a gap, ask: > Is it real? Is it strong? Or just noise?