Transport for London reports that 68% of commuters use the
Underground at least once per week.
You take a random sample of \(n = 120\)
London residents.
We are dealing with sample proportions, since each person either uses or does not use the Underground weekly. The 68% is a clue (with proportion p=0.68). So the CLT for proportions is the way to go. We get that our sampling distribution should follow the normal model N(0.68, sqrt(0.68*0.32/120)), which rounds to N(0.68, 0.043). The calculations are below:
p <- 0.68 # population proportion
n <- 120 # sample size
mean_p <- p
sd_p <- sqrt(p*(1-p)/n)
mean_p
## [1] 0.68
sd_p
## [1] 0.04258325
The probability that there are fewer than 75 out of 120 sampled residents using the underground can be found by calculating the z-score 75/120 = 0.625. We then calculate the probability of a sample having a z-score as small, or smaller, than 0.625. It turns out that the z-score is -1.291588, which we round to -1.29. The probability that a random sample has a z-score less than -1.29 is equal to 0.09852533. The relevant calculations are below.
sample_p <- 75/120
sample_z <- (sample_p - mean_p)/sd_p
sample_p
## [1] 0.625
sample_z
## [1] -1.291588
pnorm(-1.29, mean=0, sd=1)
## [1] 0.09852533
Therefore: the probability that fewer than 75 of the sampled residents use the Underground weekly is equal to 0.09852533, or roughly 9.9%.
Tourist time in the British Museum (minutes) has population mean
\(\mu = 90\) and standard deviation
\(\sigma = 30\).
We take a random sample of \(n =
50\).
# Given values
mu <- 90 # population mean (minutes)
sigma <- 30 # population standard deviation (minutes)
n <- 50 # sample size
# Sampling distribution parameters
mean_x <- mu
sd_x <- sigma / sqrt(n)
mean_x
## [1] 90
# [1] 90
sd_x
## [1] 4.242641
# [1] 4.242641
# Part B: Probability that the sample mean > 100 minutes
sample_mean <- 100
# Compute z-score
sample_z <- (sample_mean - mean_x) / sd_x
sample_z
## [1] 2.357023
# [1] 2.355
# Compute probability (area to the right of z = 2.355)
probability <- 1 - pnorm(sample_z)
probability
## [1] 0.009211063
# [1] 0.009322
The number of selfies taken per capsule ride is approximately normal
with mean \(\mu = 12\) and standard
deviation \(\sigma = 5\).
We take a random sample of \(n = 16\)
capsule rides and consider the average number of selfies taken in this
sample.
# Given values
mu <- 12 # population mean (average selfies per ride)
sigma <- 5 # population standard deviation
n <- 16 # sample size
# Sampling distribution parameters
mean_x <- mu
sd_x <- sigma / sqrt(n)
mean_x
## [1] 12
# [1] 12
sd_x
## [1] 1.25
# [1] 1.25
sample_mean <- 15
# Compute z-score
sample_z <- (sample_mean - mean_x) / sd_x
sample_z
## [1] 2.4
# [1] 2.4
# Compute probability (area to the right of z = 2.4)
probability <- 1 - pnorm(sample_z)
probability
## [1] 0.008197536
# [1] 0.008198
In the UK, about \(p = 0.74\) of
households report recycling regularly.
Suppose you randomly sample \(n = 120\)
households.
# Given values
p <- 0.74 # population proportion (households that recycle regularly)
n <- 120 # sample size
# Sampling distribution parameters
mean_p <- p
sd_p <- sqrt(p * (1 - p) / n)
mean_p
## [1] 0.74
# [1] 0.74
sd_p
## [1] 0.04004164
# [1] 0.03999583
sample_p <- 90 / 120 # sample proportion = 0.75
# Compute z-score
sample_z <- (sample_p - mean_p) / sd_p
sample_z
## [1] 0.24974
# [1] 0.2501041
# Compute probability (area to the right of z = 0.25)
probability <- 1 - pnorm(sample_z)
probability
## [1] 0.4013942
# [1] 0.4012937
A scientist finds that bumblebee flight distances between flowers, in
meters, follow a normal distribution with mean \(\mu = 3.2\) and standard deviation \(\sigma = 2.1\).
Suppose \(n = 64\).
# Given values
mu <- 3.2 # population mean (meters)
sigma <- 2.1 # population standard deviation (meters)
n <- 64 # sample size
# Sampling distribution parameters
mean_x <- mu
sd_x <- sigma / sqrt(n)
mean_x
## [1] 3.2
# [1] 3.2
sd_x
## [1] 0.2625
# [1] 0.2625
sample_mean <- 3.5
# Compute z-score
sample_z <- (sample_mean - mean_x) / sd_x
sample_z
## [1] 1.142857
# [1] 1.142857
# Compute probability (area to the right of z = 1.142857)
probability <- 1 - pnorm(sample_z)
probability
## [1] 0.126549
# [1] 0.12654895