Topic 4: Sampling Distributions


In Topic 4, we learnt about sampling distributions. In this computer lab, we will consider different calculations relating to the Central Limit Theorem and the distribution of the sample mean.

In Lecture 4, we covered material that will be very helpful for today’s computer lab. If you have not already watched Lecture 4, you may wish to do so now.

After working through the questions in this computer lab, you will be ready to complete Quiz 5. If you have time during today’s lab, you may like to work on the quiz.

Important: For all the questions in this lab, where relevant, please round your answers to 4 decimal places of accuracy. When using the Rj Editor, remember to do the following:

  • Adjust the settings to “Show code and output”
  • Use options(digits = 6) to control the level of accuracy displayed in the Rj Editor results window.

If you need a refresher on the above, please watch the following video before proceeding.


1 Sample mean distribution
(normal population distribution)

🏡 Imagine that the year is 2040, and you are running an inventory on deep-space mining craft for your company. You know that the number of fuel tanks on all the different crafts are normally distributed, with population mean \(\mu = 12\) fuel tanks per craft, and population variance \(\sigma^2 = 34.42\).

Suppose that you select a random sample of \(n=25\) spacecraft in the shipyard to assess.

1.1

🏡 Let \(\overline{X}\) denote the sample mean. Write down the distribution of \(\overline{X}\) in mathematical notation.


🎧 Online students 💬 Enter your answer at the relevant location of the shared google doc.

1.2

💻 Using this distribution, carry out the following calculations in jamovi. Remember, it can be helpful to sketch pictures of the distribution when working out these different probabilities.

Hint: The pnorm R function will be helpful.

1.2.1

Find the probability \(P(\overline{X} < 10)\). 💬

1.2.2

Find the probability \(P(4 \leq \overline{X} \leq 15)\). 💬

1.2.3

Find the probability \(P(\overline{X} > 13)\). 💬

1.3

🏡 Explain in words what each of the probabilities you have obtained in 1.2 above represent, in this context.

🎧 Online students 💬 Volunteer to share your screen and explain your answers to this question.

1.4

💻 Suppose that the probability that \(\overline{X}\) is less than a certain number of fuel tanks is equal to 40%. If the true population mean is \(\mu = 12\) fuel tanks per craft, and we are assessing our sample of \(n=25\) spacecraft, what must this number of fuel tanks be? 💬

Hint: You can use jamovi for this question too (think back to Computer Lab 4). Refer to the code chunk below for some guidance if you are stuck - you will have to fill in the missing ...’s:

# We can use the qnorm function for this type of question
qnorm(..., ..., ...)

1.5

💻 Suppose that there is only a 10% probability of observing more than a certain number of fuel tanks for \(\overline{X}\) . If the true population mean is \(\mu = 12\), and we are assessing our sample of \(n=25\) spacecraft, what must this number of fuel tanks be? 💬

1.6

💻 Repeat 1.5, but this time suppose that there is only a 5% probability of observing more than a certain number of fuel tanks for \(\overline{X}\). 💬

1.7

💻 Suppose that, just as in 1.5 and 1.6, we are assessing the fuel tanks in our sample of \(n=25\) spacecraft, and we know that the true population mean is \(\mu = 12\) fuel tanks per craft. This time however, we would like to determine a range of values for the number of fuel tanks per craft, within which the majority of space craft will lie.

Firstly, we would like to determine the number of fuel tanks for \(\overline{X}\) such that there is only a 2.5% probability of observing less than this number of fuel tanks on a craft.

Secondly, we would like to determine the number of fuel tanks for \(\overline{X}\) such that there is only a 2.5% probability of observing more than this number of fuel tanks on a craft.

Compute these two values now, and report them as an interval (e.g. if the lower number is 8 and the higher number is 10, write the interval as (8, 10).) 💬

1.8

💻 Now that you have finished your inventory of this shipyard, you travel to another shipyard in a neighbouring state. Here, most of the craft are in use, so you are only able to assess \(20\) craft. You do not know the distribution of fuel tanks for the craft in this shipyard, but are told they have a mean of \(14\), and a standard deviation of \(7.55\).

Let \(\overline{X}\) denote the sample mean number of fuel tanks for craft from this new shipyard. Write down the distribution of \(\overline{X}\) in mathematical notation.

🎧 Online students 💬 Volunteer to share your screen and explain your answers to this question.

2 Sample mean distribution
(unknown population distribution)

Suppose that you are engaged in an asteroid belt mining operation. \(60\) asteroids are currently being mined, and these represent a random sample taken from a population with an unknown distribution. However, scanning tools have revealed that the mean diameter of the asteroids in the belt is \(35\) kms, with \(\sigma = 32.5\).

2.1

💻 Let \(\overline{X}\) denote the sample mean. Write down the distribution of \(\overline{X}\) in mathematical notation. What special result does your answer rely upon? 💬

2.2

💻 Using this distribution, carry out the following calculations in jamovi. Remember, it can be helpful to sketch pictures of the distribution when working out these different probabilities.

For this question, for more accuracy, try entering the standard deviation as a fraction rather than a rounded number. For example, you hopefully found in 2.1 that \(\dfrac{\sigma^2}{n} = \dfrac{32.5^2}{60}\). Using R code, we can use ^ to represent powers (e.g. for squaring a number) and / for dividing, so that for the standard deviation we can use sd = sqrt(32.5^2 / 60).

2.2.1

Find the probability \(P(\overline{X} < 42)\). 💬

2.2.2

Find the probability \(P(18.5 \leq \overline{X} \leq 38)\). 💬

2.2.3

Find the probability \(P(\overline{X} > 26)\). 💬

2.2.4

\(P(\overline{X} < 30) + P(\overline{X} > 36)\) 💬

2.3

🏡 Explain in words what each of the probabilities you have obtained in 2.2 above represent, in this context.


🎧 Online students 💬 Volunteer to share your screen and explain your answers to this question.

2.4

🏡 A fellow miner claims that the last time they mined this asteroid belt, all the asteroids were at least 42 kms in diameter. Based on your sample data, do you believe they are telling the truth?


🎧 Online students 💬 Volunteer to share your screen and explain your answers to this question.

2.5

🏡 Suppose that another mining team, operating on a nearby asteroid cluster, has had to haul \(15\) asteroids back to your station for further processing. The distribution of the asteroid cluster is unknown, with \(\mu = 19.25\) and \(\sigma^2 = 8.1\). Let \(\overline{X}\) denote the sample mean. What is the distribution of \(\overline{X}\)? Make sure to clearly explain your answer.


🎧 Online students 💬 Volunteer to share your screen and explain your answers to this question.

2.6

🏡 To further consolidate your understanding of the central limit theorem, check out the interactive web application in the Topic 4 readings.

If you have any questions, discuss with your classmates and/or your computer lab demonstrator.


Well done, that’s everything for today! If you still have time, you may like to have a go at Quiz 5, which is based on the Topic 5 readings.

Before you finish up, remember to save your work (e.g. your jamovi and Word files) somewhere safe (e.g. OneDrive) so that you can access it at a later time.


References


These notes have been prepared by Amanda Shaker and Rupert Kuveke. The copyright for the material in these notes resides with the authors named above, with the Department of Mathematical and Physical Sciences and with La Trobe University. Copyright in this work is vested in La Trobe University including all La Trobe University branding and naming. Unless otherwise stated, material within this work is licensed under a Creative Commons Attribution-Non Commercial-Non Derivatives License BY-NC-ND.