Distribution Functions and Densities
Definition
A distribution function of a random variable , also called the cumulative distribution function (CDF), is defined as
for every real number .
A density function, more precisely a probability density function (PDF) for a continuous random variable , is a function such that the probability that lies in an interval is
and the distribution function is obtained by
for all .
In simple terms:
- The distribution function tells the probability that the random variable has reached or passed a value.
- The density function tells how probability is distributed over the number line for continuous variables.
Main Content
1. Distribution Function
Meaning and interpretation
- The distribution function gives the cumulative probability up to .
- It answers “What is the probability that the random variable is at most ?”
- It is used for both discrete and continuous random variables.
- Example: If represents the waiting time for a bus, then means there is a 75% chance the bus arrives within 10 minutes.
Important properties
- is always between 0 and 1.
- It is non-decreasing: if , then .
- and .
- It is right-continuous.
-
For any ,
-
For discrete variables, the CDF has jumps at the possible values.
Illustration of a typical CDF shape
F(x)
1.0 | ________
| __/
| __/
| __/
| __/
| __/
| __/
0.0 |________/____________________________ x
values of x increasing
This shape shows how cumulative probability increases as increases.
2. Probability Density Function
Meaning and interpretation
- A density function describes how probability is distributed continuously over values.
- Unlike discrete probabilities, itself is not a probability.
-
For a continuous random variable, for any exact point .
-
Probabilities are obtained by integrating the density over intervals.
- Example: If is large near , values near 5 are more likely than values where is small.
Important properties
- for all .
-
The total area under the curve is 1:
-
Probability over an interval is the area under the curve:
-
The CDF and PDF are connected by: and, where differentiable,
Illustration of a density curve
f(x)
^
| /\
| / \
| / \
| / \
|____________/__________\____________> x
a b
The shaded area between and represents .
3. Relationship Between Distribution Function and Density
How they are connected
- The distribution function is the accumulated area of the density function.
- The density function is the slope of the distribution function wherever the CDF is differentiable.
- This relationship helps move from probabilities to functions and vice versa.
- If a density is known, the CDF can be found by integration.
- If the CDF is known and smooth, the density can be found by differentiation.
Discrete versus continuous cases
- For discrete random variables, the CDF increases in jumps.
- For continuous random variables, the CDF increases smoothly.
- In discrete cases, probabilities are assigned to exact values using a probability mass function rather than a density function.
- In continuous cases, the density function replaces point probabilities with interval probabilities.
- Example:
- A die roll is discrete.
- A person’s exact height is modeled as continuous.
Relationship sketch
Density f(x): CDF F(x):
/\ ________
/ \ __/
/ \ _/
/ \ _/
-------------------/------------------> x
The left side shows a density curve; the right side shows the corresponding cumulative increase.
Working / Process
1. Identify the type of random variable
- Determine whether the variable is discrete or continuous.
- If it is discrete, probabilities are found from a probability mass function and the CDF is built from cumulative sums.
- If it is continuous, probabilities are found using a density function and integration.
2. Use the correct function to compute probability
-
For a distribution function:
-
For a density function:
-
If only the density is given, first find the corresponding cumulative probability by integrating.
3. Check validity and interpret the result
- Ensure the CDF stays between 0 and 1 and is non-decreasing.
- Ensure the PDF is non-negative and has total area 1.
- Interpret the numerical result in context, such as waiting time, height, score, or measurement error.
- Example: If , then 30% of outcomes lie between 2 and 5.
Advantages / Applications
They provide a complete description of uncertainty
- Distribution functions summarize the total probability structure of a random variable.
- Densities show where values are concentrated more heavily.
They are widely used in real-world modeling
- Applied in economics, engineering, biology, weather prediction, and quality control.
- Useful for modeling continuous measurements such as time, weight, voltage, and temperature.
They support advanced statistical methods
- Used in expectation, variance, hypothesis testing, confidence intervals, and simulation.
- Help in comparing models and deriving distributions of transformed variables.
Summary
- Distribution functions describe cumulative probability, while densities describe how probability is spread over continuous values.
- A CDF gives , and a PDF gives interval probabilities through area under the curve.
- These functions are closely related by integration and differentiation, making them central to probability and statistics.
- Important terms to remember: random variable, cumulative distribution function, probability density function, interval probability, continuity