Answer:
\[ Variance: \sigma^2 = 40, 000 \\ mean: \mu = 1, 300 lbs \\ \text{standard deviation: } sd = \sqrt{4000} \]
x <- 979
m <- 1300
sd <- 200
paste0("The probability of weight > 979 lbs: ", round(pnorm(x, m, sd, lower.tail = FALSE),4))
## [1] "The probability of weight > 979 lbs: 0.9458"
OR
1 - round(pnorm(x, m, sd), 4)
## [1] 0.9458
Answer:
x <- 8340
m <- 11000 ## mean
sd <- sqrt(1960000) ## standard deviation, variance is 1960000
paste0("The probability of lifespan > 8340 hrs: ", round(pnorm(x, m, sd, lower.tail = FALSE),4))
## [1] "The probability of lifespan > 8340 hrs: 0.9713"
Answer:
x1 <- 83000000
x2 <- 85000000
m <- 80000000
sd <- 3000000
## P = p(83million) - p(85million)
paste0("The probability to earn between $83m and $85m: ", round(pnorm(x1, m, sd, lower.tail = FALSE),4) - round(pnorm(x2, m, sd, lower.tail = FALSE),4))
## [1] "The probability to earn between $83m and $85m: 0.1109"
Answer:
This requires finding the score to be at the 86th percentile. qnorm will be useful in this aspect as it can can be thought of as the inverse of pnorm. It can return the score at the the 86th percentile as the probability is known.
p <- 1 - 0.14
m <- 456
sd <- 123
paste0("Minimum score required is: ", round(qnorm(p, m, sd), 0))
## [1] "Minimum score required is: 589"
Answer:
m <- 6.13
sd <- 0.06
lowerPercentile <- 0.07
uppPercentile <- 0.93
paste0("The upper limit lenght is: ", round(qnorm(uppPercentile, m, sd), 2), " centimeters")
## [1] "The upper limit lenght is: 6.22 centimeters"
paste0("The lower limit lenght is: ", round(qnorm(lowerPercentile, m, sd), 2), " centimeters")
## [1] "The lower limit lenght is: 6.04 centimeters"
Answer:
For Scores below the top 45% (100 - 45 = 55%) and above the bottom 20% =>
paste0("The upper limit for a C grade is: ", round(qnorm(.55, 78.8, 9.8), 0))
## [1] "The upper limit for a C grade is: 80"
paste0("The lower limit for a C grade is: ", round(qnorm(.20, 78.8, 9.8), 0))
## [1] "The lower limit for a C grade is: 71"
Answer:
paste0("The minimum score required for admission is: ", round(qnorm(.55, 21.2, 5.4), 1))
## [1] "The minimum score required for admission is: 21.9"
Answer:
This is a Binomial Distribution and can be conventionally interpreted as the number of ‘successes’ in size trials.
Usage
pbinom(x, size, prob)
x <- 10 ## Less than 11 students not to graduate on time
N <- 151 ## size in trials
p <- 0.09 ## probability to not graduate on time
paste0("The probability approximates to: ", round(pbinom(x, N, p),4))
## [1] "The probability approximates to: 0.192"
Answer:
This involves the sample of 147 tires, The standard error of the mean is involved and it is the standard deviation of the sampling distribution of the mean. It is denoted by \[\sigma_M = \frac {\sigma}{\sqrt{N}}\] Where N = sample size = 147
x <- 48.83
m <- 48
sd <- 7
N <- 147
sdm <- sd/sqrt(N)
## Using pnorm()
paste0("The probability that samples mean > 48.83 months is: ", round(pnorm(x, m, sdm, lower.tail=FALSE),4))
## [1] "The probability that samples mean > 48.83 months is: 0.0753"
Answer:
x <- 93.54
m <- 91
sd <- 10
N <- 68
sdm <- sd/sqrt(N)
## Using pnorm()
paste0("The probability that samples mean > 93.54 months is: ", round(pnorm(x, m, sdm, lower.tail=FALSE),4))
## [1] "The probability that samples mean > 93.54 months is: 0.0181"
Answer:
\[F(\rho) = \frac{1}{2}ln\bigg(\frac{1+\rho}{1-\rho}\bigg)\]
\[se = \frac{1}{\sqrt{N - 3}}\]
Where:
To find the probability of no shows between the 4th percentile (7 - 3) to the 10th percentile (7 + 3) given N => P(4%
m <- 3.9
sd <- 0.8
N <- 208
p <- ( 1 - 0.80) / 2
# using qt()
t <- abs(qt(p, N-1))
paste0("The lower and upper bounds are: ", round(m - t * sd / sqrt(N),1), " and ", round(m + t * sd / sqrt(N),1))
## [1] "The lower and upper bounds are: 3.8 and 4"
An economist wants to estimate the mean per capita income (in thousands of dollars) in a major city in California. Suppose a sample of size 7472 is drawn with x(mean) = 16.6. Assume ?? = 11 . Construct the 98% confidence interval for the mean per capita income. (Round your answers to 1 decimal place) Answer: This is similar to the previous question
m <- 16.6
sd <- 11
N <- 7472
p <- ( 1 - 0.98) / 2
# using qt()
t <- abs(qt(p, N-1))
paste0("The lower and upper bounds are: ", round(m - t * sd / sqrt(N),1), " and ", round(m + t * sd / sqrt(N),1))
## [1] "The lower and upper bounds are: 16.3 and 16.9"
Step 1. Choose the picture which best describes the problem.
Answer:
Step 1:
N <- 26
p <- 0.05
abs(round(qt(p, N-1),4))
## [1] 1.7081
step 2: This is a one-sided distribution and the picture which best describes the problem is at the top-right corner.
Using these measurements, construct a 90% confidence interval for the mean level of helium gas present in the facility. Assume the population is normally distributed.
Step 1. Calculate the sample mean for the given sample data. (Round answer to 2 decimal places)
## let sm = sample mean
data <- c(383.6, 347.1, 371.9, 347.6, 325.8, 337)
sm <- round(mean(data), 2)
paste0("The sample mean is: ", sm)
## [1] "The sample mean is: 352.17"
Step 2. Calculate the sample standard deviation for the given sample data. (Round answer to 2 decimal places)
## let sd = standard deviation of the sample data
sd <- round(sd(data), 2)
paste0("The standard deviation of the sample data is: ", sd)
## [1] "The standard deviation of the sample data is: 21.68"
Step 3. Find the critical value that should be used in constructing the confidence interval. (Round answer to 3 decimal places)
Answer: Since the 90% confidence interval is required to be constructed in step 4, it will be sensible to use 100 - 90 = 10 % = 0.01 =>
N <- 6
p <- (1 - 0.90)/2
t <- abs(round(qt(p, N-1), 3))
paste0("The critical value is: ", t)
## [1] "The critical value is: 2.015"
Step 4. Construct the 90% confidence interval. (Round answer to 2 decimal places)
c1 <- sm + t * sd / sqrt(N)
c2 <- sm - t * sd / sqrt(N)
paste0("The 90% confidence interval:")
## [1] "The 90% confidence interval:"
paste0("lower bound = ", round(c2, 2))
## [1] "lower bound = 334.34"
paste0("upper bound = ", round(c1, 2))
## [1] "upper bound = 370"
Step 1. Find the critical value that should be used in constructing the confidence interval. (Round answer to 3 decimal places)
m <- 46.4
sd <- 2.45
N <- 16
p <- ( 1 - 0.80) / 2
t <- abs(qt(p, N-1))
paste0("The critical value is: ", round(t,3))
## [1] "The critical value is: 1.341"
Step 2. Construct the 80% confidence interval. (Round answer to 1 decimal place)
c1 <- m + t * sd / sqrt(N)
c2 <- m - t * sd / sqrt(N)
paste0("The 80% confidence interval:")
## [1] "The 80% confidence interval:"
paste0("lower bound = ", round(c2, 1))
## [1] "lower bound = 45.6"
paste0("upper bound = ", round(c1, 1))
## [1] "upper bound = 47.2"
Answer: \[SE = \frac{z . \sigma}{\sqrt{N}}\\z = 2.576 = \text{value deuced from the Z-table for confidence level 99%}\\ \text{SE = the standard error}\] \[\text{making N the subject of the formula, this then gives}\\N=\bigg(\frac{z . \sigma}{SE}\bigg)^2\]
m <- 8
sd <- 1.9
z <- 2.576
se <- 0.13
N <- (z*sd/se)^2
paste0("A sample size should be around: ", round(N, 0), " toys")
## [1] "A sample size should be around: 1417 toys"
m <- 12.6
v <- 3.61
sd <- sqrt(v)
z95 <- 1.96
se <- 0.19
N <- (z95*sd/se)^2
paste0("The number of reproductions per hour should be: ", round(N, 0))
## [1] "The number of reproductions per hour should be: 384"
Step 1. Suppose a sample of 2089 tenth graders is drawn. Of the students sampled, 1734 read above the eighth grade level. Using the data, estimate the proportion of tenth graders reading at or below the eighth grade level. (Write your answer as a fraction or a decimal number rounded to 3 decimal places)
N <- 2089
N_above_8 <- 1734
## p = let probability of 10th graders reading at/below the 8th grade
p <- 1 - N_above_8/N
paste0("The proportion is: ", round(p, 3))
## [1] "The proportion is: 0.17"
Step 2. Suppose a sample of 2089 tenth graders is drawn. Of the students sampled, 1734 read above the eighth grade level. Using the data, construct the 98% confidence interval for the population proportion of tenth graders reading at or below the eighth grade level. (Round your answers to 3 decimal places)
Answer:
Since p has been gotten from step 1, then the Standard Error => \[SE = \sqrt{\frac{p(1-p)}{n}}\]
z <- 2.326 ## from the z table for confidence intervals
b8 <- N - 1734
se <- sqrt(p*(1-p)/b8)
c1 <- p+z*se
c2 <- p-z*se
paste0("lower bound: ", round(c2, 3))
## [1] "lower bound: 0.124"
paste0("upper bound: ", round(c1, 3))
## [1] "upper bound: 0.216"
Step 1. Suppose a sample of 474 tankers is drawn. Of these ships, 156 had spills. Using the data, estimate the proportion of oil tankers that had spills. (Write your answer as a fraction or a decimal number rounded to 3 decimal places)
N <- 474
n_sp <- 474 - 156
p <- 156/474
paste0("The proportion of tankers with spills is: ", round(p, 3))
## [1] "The proportion of tankers with spills is: 0.329"
Step 2. Suppose a sample of 474 tankers is drawn. Of these ships, 156 had spills. Using the data, construct the 95% confidence interval for the population proportion of oil tankers that have spills each month. (Round your answers to 3 decimal places)
z_95 <- 1.96
Se <- sqrt((p*(1-p))/n_sp)
c1 <- p+z_95*se
c2 <- p-z_95*se
paste0("lower bound: ", round(c2, 3))
## [1] "lower bound: 0.29"
paste0("upper bound: ", round(c1, 3))
## [1] "upper bound: 0.368"