3.27 Underage drinking, Part I. The Substance Abuse and Mental Health Services Administration estimated that 70% of 18-20 year olds consumed alcoholic beverages in 2008.
(a) Suppose a random sample of ten 18-20 year olds is taken. Is the use of the binomial distribution appropriate for calculating the probability that exactly six consumed alcoholic beverages? Explain.
1. The trials are independent (SRS)
2. The number of trials is fixed at 10.
3. The outcomes are either drank or not. Success is that they drank.
4. Yes.
dbinom(6,10,.7)
## [1] 0.2001209
dbinom(4,10,.3)
## [1] 0.2001209
pbinom(2,5,0.7)
## [1] 0.16308
1-dbinom(0,5,0.7)
## [1] 0.99757
Chickenpox, Part I. The National Vaccine Information Center estimates that 90% of Americans have had chickenpox by the time they reach adulthood.
(a) Suppose we take a random sample of 100 American adults. Is the use of the binomial distribution appropriate for calculating the probability that exactly 97 had chickenpox before they reached adulthood? Explain.
1. SRS
2. The number of trials is fixed to 100.
3. Either they had chickenpox or not.
4. We’re sticking with yes.
dbinom(97,100,.9)
## [1] 0.005891602
dbinom(3,100,0.1)
## [1] 0.005891602
1-dbinom(0,10,.9)
## [1] 1
pbinom(3,10,.1)
## [1] 0.9872048
Underage drinking, Part II. We learned in Exercise 3.27 that about 70% of 18-20 year olds consumed alcoholic beverages in 2008. We now consider a random sample of fifty 18-20 year olds.
50*0.7
## [1] 35
sqrt(50*.7*.3)
## [1] 3.24037
We would expect about 35 to drink with a standard deviation of 3.24.
Yes, it would be unusual because the probability of 45 or more people consuming alcoholic beverages is .07%.
1-pbinom(44,50,.7)
## [1] 0.0007228617
.07%
(45-35)/3.24
## [1] 3.08642
How does this probability relate to your answer to part (b)
Chickenpox, Part II. We learned in Exercise 3.28 that about 90% of American adults had chickenpox before adulthood. We now consider a random sample of 120 American adults.
(a) How many people in this sample would you expect to have had chickenpox in their childhood? And with what standard deviation?
120*.9
## [1] 108
sqrt(120*.9*.1)
## [1] 3.286335
pbinom(105,120,.9)
## [1] 0.2181634
No, because the probability is over 20%.
I wouldn’t be that surprised because 20% is a low but reasonable percentage.
Survey response rate. Pew Research reported in 2012 that the typical response rate to their surveys is only 9%. If for a particular survey 15,000 households are contacted, what is the probability that at least 1,500 will agree to respond?
1-dbinom(1500,15000,.09)
## [1] 0.9999985
Game of dreidel. A dreidel is a four-sided spinning top with the Hebrew letters nun, gimel, hei, and shin, one on each side. Each side is equally likely to come up in a single spin of the dreidel. Suppose you spin a dreidel three times. Calculate the probability of getting
18/52
## [1] 0.3461538
about 35%
9/52
## [1] 0.1730769
about 17%
21/52
## [1] 0.4038462
about 40%
51/52
## [1] 0.9807692
about 98%
Arachnophobia. A 2005 Gallup Poll found that that 7% of teenagers (ages 13 to 17) suffer from arachnophobia and are extremely afraid of spiders. At a summer camp there are 10 teenagers sleeping in each tent. Assume that these 10 teenagers are independent of each other.
1-pbinom(1,10,.07)
## [1] 0.1517299
There is a 15% chance that at least 1 suffers from arachnaphobia.
dbinom(2,10,.07)
## [1] 0.1233878
There is a 12% chance that exactly 2 of them suffer from arachnaphobia.
pbinom(1,10,.07)
## [1] 0.8482701
There is an 84% chance that at most one of them sufferes from arachnophobia.
Yes, it makes sense, because the probability is so high (84%) that at most only 1 teenager per tent is arachnophobic.
4.2 Identify the parameter, Part II. For each of the following situations, state whether the parameter of interest is a mean or a proportion.
Proportion, because worry or not worry is categorical, you can’t 60% not worried.
It’s a mean, because it’s showing revenue increase or decrease over time.
Proportion, because yes or no is categorical.
Same as C.
Mean, because it is a number over time, and not categorical.
College credits. A college counselor is interested in estimating how many credits a student typically enrolls in each semester. The counselor decides to randomly sample 100 students by using the registrar’s database of students. The histogram below shows the distribution of the number of credits taken by these students. Sample statistics for this distribution are also provided.
What is the point estimate for the average number of credits taken per semester by students at this college? What about the median?
What is the point estimate for the standard deviation of the number of credits taken per semester by students at this college? What about the IQR?
Is a load of 16 credits unusually high for this college? What about 18 credits? Explain your reasoning. Hint: Observations farther than two standard deviations from the mean are usually considered to be unusual.
(16-13.65)/1.91
## [1] 1.230366
The load of 16 units would not be unusual, but 18 units would be unusual.
(18-13.65)/1.91
## [1] 2.277487
No there is natural variability in the sample statistic. We would be more surprised if it was the same.
Heights of adults. Researchers studying anthropometry collected body girth measurements and skeletal diameter measurements, as well as age, weight, height and gender, for 507 physically active individuals. The histogram below shows the sample distribution of heights in centimeters.
What is the point estimate for the average height of active individuals? What about the median? 171.1, 170.3
What is the point estimate for the standard deviation of the heights of active individuals? What about the IQR? 9.4, 14
Is a person who is 1m 80cm (180 cm) tall considered unusually tall? And is a person who is 1m 55cm (155cm) considered unusually short? Explain your reasoning.
(180-171.1)/9.4
## [1] 0.9468085
(155-171.1)/9.4
## [1] -1.712766
180 tall would not be unusually tall. 155 cm tall would not be considered unusually short, but maybe a little short.
No, I would not expect another sample to be the same. Because it is a small number taken from the population it could be any number of different combinations.
We would use the Standard Error for the sample mean to quantify the variability of the estimate.
(171.1)/sqrt(507)
## [1] 7.598818
The Standard Error is about 7.6
Wireless routers. John is shopping for wireless routers and is overwhelmed by the number of available options. In order to get a feel for the average price, he takes a random sample of 75 routers and finds that the average price for this sample is $75 and the standard deviation is $25.
25/sqrt(75)
## [1] 2.886751
(75-80)/(25/sqrt(75))
## [1] -1.732051
No, it does not appear that John’s estimate would be unusual.
Chocolate chip cookies. Students are asked to count the number of chocolate chips in 22 cookies for a class activity. They found that the cookies on average had 14.77 chocolate chips with a standard deviation of 4.37 chocolate chips.
4.37/sqrt(22)
## [1] 0.9316871
.93 or about 1 chocolate chip.
(14.77-20)/(4.37/sqrt(22))
## [1] -5.613472
It is over 5 chocolate chips below the standard deviation, so yes, it is unreasonably high.