The sampling distribution of a statistic is the probability distribution of a statistic, i.e. what values can the statistic take on and how often will we see these values if we took every possible sample of size \(n\) from the population.
Consider taking repeated samples from a population and computing the statistic for each sample. You would get many different values of the statistic and some values would be more common than others.
You can try out the simulation yourself here
Let \(\bar X\) denote the sample mean of a random sample of \(n\) observations from a population with mean \(\mu\) and standard deviation \(\sigma\), then the mean of the sampling distribution of \(\bar X\) is equal to the population mean \(\mu\) \[E(\bar X) = \mu\]
Let \(\bar X\) denote the sample mean of a random sample of \(n\) observations from a population with mean \(\mu\) and standard deviation \(\sigma\), then the standard deviation of the sampling distribution of \(\bar X\) is equal to the population standard deviation divided by the square root of \(n\) \[\sigma_{\bar X} = \frac{\sigma}{\sqrt{n}}\] Note: The standard error of \(\bar X\) is another name for the standard deviation of the sampling distribution of \(\bar X\).
Let \(\bar X\) denote the sample mean of a random sample of \(n\) observations from a population with mean \(\mu\) and standard deviation \(\sigma\), then the sampling distribution of \(\bar X\) will be approximately normal if either of the following is true:
Let \(\bar X\) denote the sample mean of a random sample of \(n\) observations from a population with mean \(\mu\) and standard deviation \(\sigma\), then \[\bar X \sim \text{ approximately } N(\mu, \frac{\sigma}{\sqrt{n}})\] as long as one of the following is true
NOTE: From now on, we will say that \(\bar X\) has a normal distribution when we more accurately mean an approximately normal distribution
Example 1: A random sample of size 16 is taken from a normal population with mean \(\mu = 100\) and standard deviation \(\sigma = 9\).
Note: You can do the whole thing with the following R code
Example 2: Carbon monoxide (CO) emissions for a certain kind of car vary with a mean 2.9 grams per mile (g/mi) and standard deviation 0.9 g/mi. A company has 81 cars in its fleet.
Note: You can do the whole thing with the following R code