Binomial and Poisson Distribution
Definition
The Binomial and Poisson distributions are fundamental discrete probability distributions used to model random events. The Binomial distribution describes the number of successes in a fixed number of independent Bernoulli trials, while the Poisson distribution models the number of times an event occurs in a fixed interval of time or space, provided these events occur with a known constant mean rate and independently of the time since the last event.
Main Content
1. The Binomial Distribution
- It requires four conditions (BINS): Binary outcomes (Success/Failure), Independent trials, Number of trials is fixed, and Success probability is constant.
- The formula is $P(X = k) = \binom{n}{k} p^k (1-p)^{n-k}$, where $n$ is the number of trials and $p$ is the probability of success.
2. The Poisson Distribution
- It is often used to model the number of rare events occurring in a large population or over a long duration.
- The formula is $P(X = k) = \frac{\lambda^k e^{-\lambda}}{k!}$, where $\lambda$ is the average rate of occurrence.
3. Visual Representation of Distributions
- The Binomial distribution is symmetric when $p=0.5$ and skewed otherwise.
- The Poisson distribution is typically right-skewed, especially for small values of $\lambda$.
Binomial (n=5, p=0.5) Poisson (λ=2)
_ _
| | | |
__| |__ __| |__
| | | |
| | | |
_|_ _|_ _|_ _|_
0 1 2 3 4 5 0 1 2 3 4 5
Working / Process
1. Identifying the Distribution Type
- Determine if the experiment has a fixed number of trials (Binomial) or an open-ended observation window (Poisson).
- Check if the events are independent; if they are not, these models may not apply.
2. Defining Parameters
- For Binomial: Identify $n$ (total trials) and $p$ (probability of success in a single trial).
- For Poisson: Identify $\lambda$ (the expected average count over the given interval).
3. Calculating the Probability
- Substitute the variables into the respective probability mass function (PMF) formula.
- Use a calculator or statistical table to solve for $k$ (the specific number of successes desired).
Advantages / Applications
- Quality Control: Binomial distribution is used to estimate the number of defective items in a batch produced by a machine.
- Risk Management: Poisson distribution models insurance claims, call center traffic, or number of accidents per month.
- Predictive Analytics: Both are used in machine learning to categorize outcomes or predict frequencies of user engagement.
Summary
The Binomial distribution models "successes" in a set number of independent trials, whereas the Poisson distribution models the number of occurrences of an event within a fixed interval. Key terms to remember include $n$ (trials), $p$ (probability), $\lambda$ (mean rate), and $e$ (Euler's number). These theoretical distributions allow statisticians to approximate real-world behavior and fit curves to observed data points.