Exercise 2

Let \(n\) be a positive integer. Let \(S\) be the set of integers between \(1\) and \(n\). Consider the following process: We remove a number from \(S\) at random and write it down. We repeat this until \(S\) is empty. The result is a permutation of the integers from \(1\) to \(n\). Let \(X\) denote this permutation. Is \(X\) uniformly distributed?

Explanation of Answer

The permutation \(X\) that results from the described process is uniformly distributed among all possible permutations of the set \(S\). In this context, a uniformly distributed permutation means that each possible permutation of the set \(S\) is equally likely to occur.

Since there are \(n!\) (n factorial) possible permutations of the set \(S\) and each permutation has a probability of \(\frac{1}{n!}\) of being selected, the process generates a uniformly distributed permutation of the set \(S\). This means that each of the \(n!\) permutations is equally likely to be the result \(X\) of the described process.

Exercise 4

Suppose we are attending a college which has 3000 students. We wish to choose a subset of size 100 from the student body. Let \(X\) represent the subset, chosen using the following possible strategies. For which strategies would it be appropriate to assign the uniform distribution to \(X\)? If it is appropriate, what probability should we assign to each outcome?

(a) Take the first 100 students who enter the cafeteria to eat lunch.

(b) Ask the Registrar to sort the students by their Social Security number, and then take the first 100 in the resulting list.

(c) Ask the Registrar for a set of cards, with each card containing the name of exactly one student, and with each student appearing on exactly one card. Throw the cards out of a third-story window, then walk outside and pick up the first 100 cards that you find.

Explanation of Answer

To determine whether it’s appropriate to assign the uniform distribution to \(X\) in each scenario, we need to assess if every subset of 100 students has an equal chance of being chosen. If it is appropriate, then each subset of size 100 out of 3000 students should have the same probability of selection.

  1. Taking the first 100 students who enter the cafeteria to eat lunch:

This method is likely not uniform because the selection is influenced by the time students choose to eat, which can be affected by their schedules, habits, or preferences. Some students might have classes during lunch hours or prefer to eat at different times, reducing their chances of being included in the subset.
Uniform distribution assignment: Not appropriate.

  1. Sorting students by their Social Security number and then taking the first 100:

This method could be considered uniform if the Social Security numbers are randomly distributed across students, which they typically are. Since the sorting criterion (Social Security number) is independent of any student’s likelihood of being chosen, each student has an equal chance of being in any position in the list and thus an equal chance of being in the first 100.
Uniform distribution assignment: Appropriate.
Probability assignment: If uniform, each subset of 100 students out of 3000 should have a probability of \(\frac{1}{\binom{3000}{100}}\). \[ \frac{1}{\binom{3000}{100}} \approx 9.6055 \times 10^{-190} \]

  1. Throwing cards with student names out of a window and picking up the first 100:

This method seems to introduce randomness, assuming that the way the cards scatter is random and picking them up doesn’t introduce any bias. Each card (and thus each student) should have an equal chance of being in any position in the scattered arrangement and being among the first 100 picked up.
Uniform distribution assignment: Potentially appropriate, but this could depend on the specifics of how the cards are thrown and collected. If truly random, it’s appropriate.
Probability assignment: If uniform, each subset of 100 students out of 3000 should have a probability of \(\frac{1}{\binom{3000}{100}}\). \[ \frac{1}{\binom{3000}{100}} \approx 9.6055 \times 10^{-190} \]

In summary, strategies (b) and (c) could be appropriate for the uniform distribution assignment under certain conditions, with each outcome having a probability of \(\frac{1}{\binom{3000}{100}}\). Strategy (a) is not suitable for a uniform distribution assignment due to the likely time-based biases in the selection process.