MTH 119-04 (10:00am)

Section 4.4: Means and Variances of Random Variables

Load the data set Birthdays from the MosaicData package. Note that the data set is really large with 372864 observations.

birthdays<-Birthdays

Find the mean number of births per day using the entire data set:

mean(~births, data=birthdays)

## [1] 189.0409

Now, take random samples of various sizes and compute the sample mean.

mean(~births,data=sample(birthdays, 10))

## [1] 168.7

mean(~births,data=sample(birthdays, 10))

## [1] 151

mean(~births,data=sample(birthdays, 10))

## [1] 223.3

mean(~births,data=sample(birthdays, 100))

## [1] 174.98

mean(~births,data=sample(birthdays, 100))

## [1] 195.04

mean(~births,data=sample(birthdays, 100))

## [1] 198.25

mean(~births,data=sample(birthdays, 1000))

## [1] 189.67

mean(~births,data=sample(birthdays, 1000))

## [1] 188.152

mean(~births,data=sample(birthdays, 1000))

## [1] 202.809

mean(~births,data=sample(birthdays, 10000))

## [1] 187.4937

mean(~births,data=sample(birthdays, 10000))

## [1] 190.13

mean(~births,data=sample(birthdays, 10000))

## [1] 188.4456

mean(~births,data=sample(birthdays, 10000))

## [1] 186.5359

mean(~births,data=sample(birthdays, 10000))

## [1] 189.248

Notice, that the variability decreases and the sample mean gets closer to the mean of the entire data set as the size of the sample increases.