The Poisson distribution

Used to model counts

\[ P(X = x;\lambda) = \frac{\lambda^x e^{-\lambda}}{x!}\]

Where \(x\) is defined in the non-negative integers.

The mean of this distribution is \(\lambda\).

The variance of this distribution is also \(\lambda\)

Some uses for the Poisson distribution

Contingency tables are used when you classify for example people following their hair color by race. An example of contingency table would be

Body red yellow magenta
black 1 0 0
blue 0 1 0
green 0 0 1

So the Poisson distribution is the default distribution for modeling contingency table data. And it turns out that it has a deep connection with some of the other models such as multinomial, binomials, and Bernoullis

The last use of the Poisson is done quite often in epidemiology. Where \(n\), the population, is so large, but \(p\), or the events, are not so common.

Rates and Poisson distribution

Example

The number of people that show up at bus stop is Poisson with a mean of \(2.5\) per hour. If watching the bus stop for \(4\) hours, what is the probability that \(3\) or fewer people show up for the whole time?

ppois(3, lambda = 2.5 * 4)
## [1] 0.01033605

Poisson approximation to the binomial

Example, Poisson approximation to the binomial

We flip a coin with success probability \(0.01\) five hundred times. What’s the probability of 2 or fewer successes?

The binomial version

pbinom(2, size = 500, prob = 0.01)
## [1] 0.1233858

The poisson approximation with \(\lambda = np\)

ppois(2, lambda = 500 * 0.01)
## [1] 0.124652