Lecture 8: Sampling Distributions, Law of Large Numbers, and Central Limit Theorem

Tom Hanna

2026-03-01

Sampling Distributions, Law of Large Numbers, and Central Limit Theorem: Putting it all together

Populations and Samples

Population - The entire group we want to draw conclusions about
Sample - The subset of the population that we draw data from
The sample should be a random subset of the population
A good sample is representative of the population

Getting from sample statistic to population parameter

Two tools tie sample statistics to estimates of the true population parameters: standard error and z-score

The standard error is a special case of the standard deviation
The z-score is a fairly simple problem involving subtracting two numbers and dividing by the standard error which you won’t have to do in this class

Bonuses from the Z-Score

Bonus: the z-score is used in hypothesis testing
Extra bonus: The rounded cutoff point for Z-Scores in hypothesis testing is really easy

Z ≥ 1.96

Z ≥ 2

Two important rules

Two rules to tie the sample to probability distributions and population estimates:

    + **The Central Limit Theorem** 
    + **The Law of Large Numbers**

Central Limit Theorem

For a large number of trials, the means of the trials approach a normal distribution regardless of the underlying distribution of the data
This means that for a sufficient number of trials, we can apply the normal distribution to the sample means.
This allows us to apply the 68-95-99.7 rule!

Central Limit Theorem Simulation with Uniform Distribution Data

Central Limit Theorem Simulation with Uniform Distribution Data

Central Limit Theorem Simulation with Uniform Distribution Data

Central Limit Theorem Simulation with Uniform Distribution Data

Law of Large Numbers

Averages from independent, identically distributed samples converge to the population means that they are estimating.
In its strongest form, the law states that this “almost surely” happens.
This means that for a sufficiently large sample size, we can assume with a degree of certainty that our sample statistics accurately represent the population parameters!

Law of Large Numbers

The mean of the sampling distribution is a good estimate of the mean of the population distribution:
\(\bar{x} = \mu\)
The standard error (the standard deviation of the sampling distribution) will be equal to the standard deviation of the population distribution divided by the sample size:
\(s = \frac{\sigma}{\sqrt{n}}\)

Law of Large Numbers simulation using coin flips

The mean of a single flipped coin is 0

The mean of two flips is 0.5

The mean of ten flips is 0.3

The mean of twentyfive flips is 0.44

The mean of twenty five thousand flips is 0.5023333

What’s this tell us about sample size?

The CLT allows us to apply Z at a sample size around 30
The sample size we need is determined by a number of things including the degree of certainty we are looking for
Want to do some polling? This is where margin of error comes from:

margin of error formula.

margin of error

Authorship, License, Credits

Author: Tom Hanna
Website: tomhanna.me
License: This work is licensed under a Creative Commons Attribution-NonCommercial-ShareAlike 4.0 International License.</>