Probability MSSA

Harold Nelson

2/5/2019

Basic Concepts of Probability

Probability is always based on an experiment. The set of all possible outcomes of an experiment is known as a sample space.

The individual members of the sample space are described as elementary outcomes. The probability of an elementary outcome is best described as the fraction of the time that the elementary outcome will occur when the experiment is performed a large number of times.

The sum of all of the probabilities of the elementary outcomes must be 1.0.

Events

An event is a subset of a sample space. The probability of an event is what we expect its relative frequency to approach as we run the experiment a large number of times. The probability of an event is the sum of the probabilities of its elementary outcomes.

Probabilities are always numbers in the range \(0\) to \(1\).

Equally Likely Elementary Outcomes

If the elementary outcomes of an experiment are discrete and all equally likely, the theoretical probability of an event is defined as

\[\frac{\text{Number of elementary outcomes in the event}}{\text{Total number of elementary outcomes}}\]

As an example, consider the probability of getting a head when you flip a fair coin. There are two elementary outcomes of this experiment, a head, or a tail. Since the coin is fair, the outcomes are equally likely. There are two possible outcomes and only one is in our event. So, the probability is \(1/2\). In everyday language, you may hear “fifty-fifty.” To stick with the language of probability we say that the probability is \(.5\).

Using equally likely elementary outcomes with discrete experiments is clearly advantageous. Consider the probability of getting a sum of seven dots when rolling a pair of dice. If we think of the elementary outcomes of this experiment as the numbers between two and twelve, the outcomes are not equally likely. There is only one way to get a sum of two and only one way to get sum of twelve, but there are many ways to get a sum of seven.

Exercise

Create a table showing the possible sums of dots showing when you throw a pair of dice.

Are the elementary outcomes in this table equally likley?

What is the probability of each elementary outcome?

How many elementary outcomes are in the event “7”?

Solutions

dice = matrix(rep(0,36),nrow=6)
rownames(dice) = 1:6
for( i in 1:6){
  for (j in 1:6){
    dice[i,j] = i + j
  }
}
colnames(dice) = 1:6
dice
##   1 2 3  4  5  6
## 1 2 3 4  5  6  7
## 2 3 4 5  6  7  8
## 3 4 5 6  7  8  9
## 4 5 6 7  8  9 10
## 5 6 7 8  9 10 11
## 6 7 8 9 10 11 12

There are 36 possible equally likely elemementary outcomes when we consider the orderd pairs of outcomes describing what happened to die 1 and what happened to die 2. These are \((2,5),(5,2),(3,4),(4,3),(6,1) \text{ and } (1,6)\). Since there are six of them, the probabiliy of getting a seven is \(6/36\), or \(1/6\).

Empirical

This is really just noting that you can describe some empirical facts using the language of probability. An example or two should help.

Based on the Current Population Survey, about 60% of the US population over the age of 16 meets the definition of participating in the labor force. We can say that if a member of the population over the age of 16 were picked at random, the probability of that person being a labor force participant is about .6.

Based on data from the CDC, there were 2,515,458 deaths in the US in 2010. Of those the primary cause of death was heart disease in 596,577 cases. We can say that if we were to pick a 2010 death certificate at random the probability of the death having a primary cause of heart disease is .237.

Subjective

This is really just using the language of probability to express a personal degree of belief. Here are some examples based on my own personal experiences.

Basic Principles

There are several rules for constructing the probabilities of events defined in terms of other events.

Addition Law - General Case

\[P(A\text{ or }B)=P(A)+P(B)-P(A\text{ and }B)\] Example Draw a single card from a 52 card deck. What is the probability of a heart or an ace?
\[P(\text{Heart or Ace})=P(\text{Heart})+P(\text{Ace})-P(\text{Heart and Ace})\]

\[P(\text{Heart or Ace})=\frac{13}{52}+\frac{4}{52}-\frac{1}{52}=\frac{16}{52}=\frac{4}{13}\]

The subtraction of the third term is necessary because the first two terms would have counted the ace of hearts twice.

Addition Law - Special Case

If the two events are mutually exclsuive (disjoint), the third term may be omitted since its value is zero. \[P(A\text{ or }B)=P(A)+P(B)\] Example Draw a single card from a 52 card deck. What is the probability of a heart or a spade?
\[P(\text{Heart or Ace})=P(\text{Heart})+P(\text{Spade})\]

\[P(\text{Heart or Spade})=\frac{13}{52}+\frac{13}{52}=\frac{26}{52}=\frac{1}{2}\] The simpler version is applicable because no card can be both a heart and a spade.We say that these two events are disjoint or mutually exclusive.

The Complements Law

If A is an event, either it occcurs or does not occur whenever the experiment we are thinking of occurs. If we think of events as subsets of a sample space, then every outcome is in the subset A or not in the subset A. The set of all outcomes not in the event A is called the complement of A, sometimes denoted \(A^{c}\). No outcome is both in A and in \(A^{c}\) (mutually exclusive). Also, every outcome is in one of these (exhaustive). In set theoretic terms \(A \cup A^{c} = \text{Sample Space}\).

The practical consequence is that \[P(\text{A}) + P(\text{not A}) = 1.0\]

Example What is the probability of not getting a heart when you draw a single card from a 52 card deck?

\[P(\text{not Heart}) = 1 - P(\text{Heart}) = 1 -\frac{13}{52} = \frac{3}{4}\]

Multiplication Law - General Case

\[P(\text{A and B}) = P(\text{A})*P(\text{B given that A has occured}).\] The phrasing in the second expression is generally replaced with a single vertical bar, ‘|’ between the two event names. The law is then written as \[P(\text{A and B}) = P(\text{A})*P(\text{B|A}).\]

Example What is the probability of drawing two hearts if you draw two cards from a 52 card deck without replacement. There are two events involved, getting a heart on the first draw (H1) and getting a heart on the second draw (H2).

\[P(\text{H1 and H2}) = P(\text{H1})*P(\text{H1|H2})\]

\[P(\text{H1 and H2}) = \frac{13}{52}*\frac{12}{51}=\frac{3}{51}=\frac{1}{17}\]

Multiplication Law - Special Case

\[P(\text{A and B}) = P(\text{A})*P(\text{B}).\] If the probability that B will occur does not depend on whether A has occurred or not, we say that the two events are independent. In this case, we use the simple probability rather than the conditional probability.

Example What is the probability of drawing two hearts if you draw two cards from a 52 card deck with replacement. There are two events involved, getting a heart on the first draw (H1) and getting a heart on the second draw (H2). However, when we include the stipulation “with replacement,” the condition of the deck on the second draw is the same as it was on the first draw.

\[P(\text{H1 and H2}) = \frac{13}{52}*\frac{13}{52}=\frac{1}{4} *\frac{1}{4}=\frac{1}{16}\]

Bayes Law

In the description of the multiplication law above, we have assumed that A occurs first and may (or may not) influence the probability that a second event, B occurs. We imagine that an observer, having observed A, obtains an improved estimate of the probability that B will occur. In some circumstances, it makes sense to reverse our thinking and become more like a detective. If B has been observed, what is the probability that A occurred?

Write two equivalent expressions for \(P(\text{A and B}).\) \[P(\text{A and B}) = P(\text{A})*P(\text{B|A}) = P(\text{B}) *P(\text{A|B}).\]

Now solve the equation relating the second and third equations for \(P(\text{A|B})\).

\[P(\text{A})*P(\text{B|A}) = P(\text{B}) *P(\text{A|B})\]

\[P(\text{A|B}) = \frac{P(\text{A})*P(\text{B|A})}{P(\text{B})}\]

In many cases, we don’t know the probability of B directly. However we can construct it. If B occured, either A occurred and then B occured or; A did not occur and then B occurred anyway. These two possibilities are mutually exclusive and exhaustive. Therefore we can write

\[P(\text{B}) = P(\text{A}) * P(\text{B|A})+P(\text{ not A}) *P(\text{B| not A})\]

Example Return to the problem of drawing two hearts from a 52 card deck without replacement. Suppose the second card was a heart. What is the probability that the first card was also a heart?

\[P(\text{H1|H2}) = \frac{P(\text{H1})*P(\text{H2|H1})}{P(\text{H2})}\]

Now rewrite the denominator

\[P(\text{H1|H2}) = \frac{P(\text{H1})*P(\text{H2|H1})}{P(\text{H1})*P(\text{H2|H1})+P(\text{not H1})*P(\text{H2 | not H1})}\]

Now let’s insert the numerical values.

\[P(\text{H1|H2}) = \frac{\frac{13}{52}*\frac{12}{51}}{\frac{13}{52}*\frac{12}{51}+\frac{39}{52}*\frac{13}{51}}\]

Contingency Tables

Your most likely use of probability concepts will be in interpreting a contingency table. Such a table is a display of the outcomes in terms of two categorical variables when taking a random sample from a population.

Exercise

In a population of 100 people, 60 are men and 40 are women. 40 of the men smoke and 10 of the women smoke. Display the 4 cell table describing this data. Label the columns with Men/Women and the rows with Smoker/Non-Smoker.

Solution

##            Men Women
## Smoker      40    10
## Non-Smoker  20    30

Exercise

Calulate the following.

  1. The probability of Man and Smoker.
  2. The probability of Man given Smoker.
  3. The probability of Smoker given Man.