Binomial and Poisson Distribution

Comprehensive study notes, diagrams, and exam preparation for Binomial and Poisson Distribution.

Binomial and Poisson Distribution

Definition

The Binomial and Poisson distributions are fundamental discrete probability distributions used to model random events. The Binomial distribution describes the number of successes in a fixed number of independent Bernoulli trials, while the Poisson distribution models the number of times an event occurs in a fixed interval of time or space, provided these events occur with a known constant mean rate and independently of the time since the last event.


Main Content

1. The Binomial Distribution

  • It requires four conditions (BINS): Binary outcomes (Success/Failure), Independent trials, Number of trials is fixed, and Success probability is constant.
  • The formula is $P(X = k) = \binom{n}{k} p^k (1-p)^{n-k}$, where $n$ is the number of trials and $p$ is the probability of success.

2. The Poisson Distribution

  • It is often used to model the number of rare events occurring in a large population or over a long duration.
  • The formula is $P(X = k) = \frac{\lambda^k e^{-\lambda}}{k!}$, where $\lambda$ is the average rate of occurrence.

3. Visual Representation of Distributions

  • The Binomial distribution is symmetric when $p=0.5$ and skewed otherwise.
  • The Poisson distribution is typically right-skewed, especially for small values of $\lambda$.
Binomial (n=5, p=0.5)        Poisson (λ=2)
      _                        _
     | |                      | |
   __| |__                  __| |__
  |       |                |       |
  |       |                |       |
 _|_     _|_              _|_     _|_
  0 1 2 3 4 5              0 1 2 3 4 5

Working / Process

1. Identifying the Distribution Type

  • Determine if the experiment has a fixed number of trials (Binomial) or an open-ended observation window (Poisson).
  • Check if the events are independent; if they are not, these models may not apply.

2. Defining Parameters

  • For Binomial: Identify $n$ (total trials) and $p$ (probability of success in a single trial).
  • For Poisson: Identify $\lambda$ (the expected average count over the given interval).

3. Calculating the Probability

  • Substitute the variables into the respective probability mass function (PMF) formula.
  • Use a calculator or statistical table to solve for $k$ (the specific number of successes desired).

Advantages / Applications

  • Quality Control: Binomial distribution is used to estimate the number of defective items in a batch produced by a machine.
  • Risk Management: Poisson distribution models insurance claims, call center traffic, or number of accidents per month.
  • Predictive Analytics: Both are used in machine learning to categorize outcomes or predict frequencies of user engagement.

Summary

The Binomial distribution models "successes" in a set number of independent trials, whereas the Poisson distribution models the number of occurrences of an event within a fixed interval. Key terms to remember include $n$ (trials), $p$ (probability), $\lambda$ (mean rate), and $e$ (Euler's number). These theoretical distributions allow statisticians to approximate real-world behavior and fit curves to observed data points.