distribution functions and densities

Comprehensive study notes, diagrams, and exam preparation for distribution functions and densities.

Distribution Functions and Densities

Definition

A distribution function of a random variable , also called the cumulative distribution function (CDF), is defined as

for every real number .

A density function, more precisely a probability density function (PDF) for a continuous random variable , is a function such that the probability that lies in an interval is

and the distribution function is obtained by

for all .

In simple terms:

  • The distribution function tells the probability that the random variable has reached or passed a value.
  • The density function tells how probability is distributed over the number line for continuous variables.

Main Content

1. Distribution Function

Meaning and interpretation

  • The distribution function gives the cumulative probability up to .
  • It answers “What is the probability that the random variable is at most ?”
  • It is used for both discrete and continuous random variables.
  • Example: If represents the waiting time for a bus, then means there is a 75% chance the bus arrives within 10 minutes.

Important properties

  • is always between 0 and 1.
  • It is non-decreasing: if , then .
  • and .
  • It is right-continuous.
  • For any ,

  • For discrete variables, the CDF has jumps at the possible values.

Illustration of a typical CDF shape

F(x)
1.0 |                           ________
    |                        __/
    |                     __/
    |                  __/
    |               __/
    |            __/
    |         __/
0.0 |________/____________________________ x
         values of x increasing

This shape shows how cumulative probability increases as increases.


2. Probability Density Function

Meaning and interpretation

  • A density function describes how probability is distributed continuously over values.
  • Unlike discrete probabilities, itself is not a probability.
  • For a continuous random variable, for any exact point .

  • Probabilities are obtained by integrating the density over intervals.

  • Example: If is large near , values near 5 are more likely than values where is small.

Important properties

  • for all .
  • The total area under the curve is 1:

  • Probability over an interval is the area under the curve:

  • The CDF and PDF are connected by: and, where differentiable,

Illustration of a density curve

f(x)
 ^
 |                /\
 |               /  \
 |              /    \
 |             /      \
 |____________/__________\____________> x
              a    b

The shaded area between and represents .


3. Relationship Between Distribution Function and Density

How they are connected

  • The distribution function is the accumulated area of the density function.
  • The density function is the slope of the distribution function wherever the CDF is differentiable.
  • This relationship helps move from probabilities to functions and vice versa.
  • If a density is known, the CDF can be found by integration.
  • If the CDF is known and smooth, the density can be found by differentiation.

Discrete versus continuous cases

  • For discrete random variables, the CDF increases in jumps.
  • For continuous random variables, the CDF increases smoothly.
  • In discrete cases, probabilities are assigned to exact values using a probability mass function rather than a density function.
  • In continuous cases, the density function replaces point probabilities with interval probabilities.
  • Example:
    • A die roll is discrete.
    • A person’s exact height is modeled as continuous.

Relationship sketch

Density f(x):        CDF F(x):
    /\                   ________
   /  \                __/
  /    \             _/
 /      \          _/
-------------------/------------------> x

The left side shows a density curve; the right side shows the corresponding cumulative increase.


Working / Process

1. Identify the type of random variable

  • Determine whether the variable is discrete or continuous.
  • If it is discrete, probabilities are found from a probability mass function and the CDF is built from cumulative sums.
  • If it is continuous, probabilities are found using a density function and integration.

2. Use the correct function to compute probability

  • For a distribution function:

  • For a density function:

  • If only the density is given, first find the corresponding cumulative probability by integrating.

3. Check validity and interpret the result

  • Ensure the CDF stays between 0 and 1 and is non-decreasing.
  • Ensure the PDF is non-negative and has total area 1.
  • Interpret the numerical result in context, such as waiting time, height, score, or measurement error.
  • Example: If , then 30% of outcomes lie between 2 and 5.

Advantages / Applications

They provide a complete description of uncertainty

  • Distribution functions summarize the total probability structure of a random variable.
  • Densities show where values are concentrated more heavily.

They are widely used in real-world modeling

  • Applied in economics, engineering, biology, weather prediction, and quality control.
  • Useful for modeling continuous measurements such as time, weight, voltage, and temperature.

They support advanced statistical methods

  • Used in expectation, variance, hypothesis testing, confidence intervals, and simulation.
  • Help in comparing models and deriving distributions of transformed variables.

Summary

  • Distribution functions describe cumulative probability, while densities describe how probability is spread over continuous values.
  • A CDF gives , and a PDF gives interval probabilities through area under the curve.
  • These functions are closely related by integration and differentiation, making them central to probability and statistics.
  • Important terms to remember: random variable, cumulative distribution function, probability density function, interval probability, continuity