IB Math AA: Probability Distributions
AI-Generated Content
Probability distributions are the mathematical engines that power predictions, from forecasting election results to modeling genetic inheritance. In IB Math Analysis and Approaches, mastering these distributions moves you from counting simple outcomes to quantifying real-world uncertainty with precision. This knowledge is not just exam-critical; it forms the foundational language for data science, economics, and experimental research.
Discrete Foundations: The Binomial Distribution
Many real-world scenarios involve a fixed number of independent trials, each with the same two possible outcomes: success or failure. The binomial distribution is the discrete probability model for exactly these situations. It is defined by two parameters: n, the fixed number of trials, and p, the constant probability of success on a single trial.
A random variable following a binomial distribution is written as X ~ B(n, p). The probability of achieving exactly r successes is given by the formula P(X = r) = C(n, r) p^r (1 - p)^(n - r), where C(n, r) = n!/(r!(n - r)!) is the binomial coefficient, counting the number of ways to choose r successes from n trials.
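As a quick sketch, this formula can be computed directly with Python's standard library, using math.comb for the binomial coefficient (the die example here is illustrative, not from the text):

```python
from math import comb

def binomial_pmf(r: int, n: int, p: float) -> float:
    """P(X = r) for X ~ B(n, p): C(n, r) * p^r * (1 - p)^(n - r)."""
    return comb(n, r) * p**r * (1 - p)**(n - r)

# Illustrative example: probability of exactly 2 sixes in 10 rolls of a fair die
prob = binomial_pmf(2, 10, 1/6)
print(round(prob, 4))  # ≈ 0.2907
```

Summing the pmf over r = 0 to n gives 1, a useful sanity check that the model is a valid probability distribution.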
The mean, or expected value, and the variance of a binomial distribution are derived from its parameters: E(X) = np and Var(X) = np(1 - p). The standard deviation is simply the square root of the variance: σ = √(np(1 - p)). For example, if a fair die is rolled 60 times (n = 60, p = 1/6), the expected number of fours is 60 × 1/6 = 10, with a standard deviation of √(60 × 1/6 × 5/6) ≈ 2.89.
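The die-rolling example above can be checked numerically in a few lines (a minimal sketch, using only the formulas E(X) = np and Var(X) = np(1 - p)):

```python
from math import sqrt

n, p = 60, 1/6                  # 60 rolls, P(rolling a four) = 1/6
mean = n * p                    # E(X) = np
variance = n * p * (1 - p)      # Var(X) = np(1 - p)
sd = sqrt(variance)

print(round(mean, 2))  # 10.0 expected fours
print(round(sd, 2))    # ≈ 2.89
```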
Continuous Modeling: The Normal Distribution
While the binomial distribution handles discrete counts, many natural phenomena—like heights, exam scores, or measurement errors—are continuous and often cluster around a central value. The normal distribution models these continuous data with its iconic bell-shaped curve, symmetric about its mean.
A normal distribution is defined by its mean μ (the center) and its standard deviation σ (the spread), denoted X ~ N(μ, σ²). The total area under its probability density function is 1. To find probabilities, you must standardise the variable. This transforms any normal distribution into the standard normal distribution Z ~ N(0, 1) using the z-score formula z = (x - μ)/σ. The z-score tells you how many standard deviations an observation is from the mean. You then use your GDC or z-tables to find probabilities like P(Z < z).
Often, you need to work backwards from a probability to find a corresponding data value, which is an inverse normal calculation. For instance, if you know P(X < x) = 0.95, you first find the z-score where P(Z < z) = 0.95 (approximately 1.645), and then "un-standardise" using x = μ + zσ.
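Both directions of the calculation (standardising and inverse normal) can be sketched with Python's statistics.NormalDist; the 0.95 / 1.645 pair matches the worked example above, while μ = 100 and σ = 15 are hypothetical values chosen for illustration:

```python
from statistics import NormalDist

Z = NormalDist(0, 1)             # standard normal Z ~ N(0, 1)

# Forward direction: P(Z < 1.645)
print(round(Z.cdf(1.645), 3))    # ≈ 0.95

# Inverse direction: find z with P(Z < z) = 0.95
z = Z.inv_cdf(0.95)
print(round(z, 3))               # ≈ 1.645

# Un-standardise for X ~ N(100, 15^2): x = mu + z*sigma
mu, sigma = 100, 15              # hypothetical mean and spread
print(round(mu + z * sigma, 1))  # ≈ 124.7
```

This mirrors what a GDC's normal cdf and inverse normal functions do behind the scenes.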
A crucial application is using the normal distribution to approximate a binomial distribution when n is large. This requires a continuity correction because you are approximating a discrete distribution (binomial) with a continuous one (normal). If you want P(X ≤ k) for a binomial variable X, you approximate it as P(Y ≤ k + 0.5), where Y ~ N(np, np(1 - p)).
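To see how close the approximation gets, one can compare an exact binomial probability with its continuity-corrected normal approximation (a sketch with hypothetical values n = 100, p = 0.5, k = 55):

```python
from math import comb, sqrt
from statistics import NormalDist

n, p, k = 100, 0.5, 55  # hypothetical binomial setup

# Exact binomial: P(X <= 55) by summing the pmf
exact = sum(comb(n, r) * p**r * (1 - p)**(n - r) for r in range(k + 1))

# Normal approximation with continuity correction: P(Y <= 55.5),
# where Y ~ N(np, np(1 - p))
approx = NormalDist(n * p, sqrt(n * p * (1 - p))).cdf(k + 0.5)

print(round(exact, 3))   # ≈ 0.864
print(round(approx, 3))  # ≈ 0.864
```

The two values agree to about three decimal places here; omitting the +0.5 correction would noticeably widen the gap.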
HL Extension: The Poisson Distribution and Hypothesis Testing
At Higher Level, you encounter the Poisson distribution, which models the number of events occurring in a fixed interval of time or space. These events must occur independently and at a known constant average rate, λ. Examples include calls received by a call center per hour or typos per page. If X ~ Po(λ), then the probability of r events is P(X = r) = e^(-λ) λ^r / r!. Its expected value and variance are both equal to λ: E(X) = Var(X) = λ.
These distributions become powerful tools for formal hypothesis testing. This statistical method allows you to make inferences about a population parameter based on sample data. A typical test for a population mean using the normal distribution involves five steps:
- State the null hypothesis (H₀) and the alternative hypothesis (H₁).
- Choose the significance level α (commonly 5%).
- Calculate the test statistic (e.g., a z-score from sample data).
- Determine the p-value: the probability of obtaining results at least as extreme as the sample, assuming H₀ is true.
- Compare the p-value to α. If the p-value is less than α, you reject the null hypothesis.
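The five steps above can be sketched as a one-sample z-test for a mean (all numbers here are hypothetical, and the population standard deviation is assumed known):

```python
from math import sqrt
from statistics import NormalDist

# Step 1-2 (hypothetical): H0: mu = 50 vs H1: mu > 50, at alpha = 0.05
mu0, sigma, n, xbar, alpha = 50, 8, 36, 52.5, 0.05

# Step 3: test statistic z = (xbar - mu0) / (sigma / sqrt(n))
z = (xbar - mu0) / (sigma / sqrt(n))

# Step 4: p-value for this one-tailed test, P(Z > z)
p_value = 1 - NormalDist(0, 1).cdf(z)

# Step 5: compare the p-value with alpha
print(round(z, 3))        # 1.875
print(round(p_value, 4))  # ≈ 0.0304
print("reject H0" if p_value < alpha else "fail to reject H0")
```

Since 0.0304 < 0.05, this hypothetical sample would lead to rejecting H₀ at the 5% level.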
The p-value is the cornerstone of this decision. A small p-value provides evidence against the null hypothesis, suggesting your sample result is unlikely to have occurred by chance alone.
Common Pitfalls
- Misidentifying the Distribution: Assuming data is binomial when trials are not independent (like drawing cards without replacement) is a frequent error. Similarly, applying Poisson to events that are not independent or whose rate is not constant will lead to incorrect probabilities. Always check the conditions before selecting your model.
- Misusing the Continuity Correction: When using a normal distribution to approximate a binomial, forgetting the continuity correction leads to significant inaccuracy. Remember: for P(X ≤ k), use P(Y ≤ k + 0.5); for P(X ≥ k), use P(Y ≥ k - 0.5); for P(X = k), use P(k - 0.5 ≤ Y ≤ k + 0.5).
- Confusing the p-value and Significance Level: The significance level α is a pre-set threshold for rejection (e.g., 0.05). The p-value is a calculated probability from your sample data. You do not "accept" the null hypothesis; you either reject it or fail to reject it based on the comparison.
- Incorrect Inverse Normal Setup: When performing an inverse normal calculation, students often mix up which probability to input. If a question asks for the top 10%, you need to use P(X < x) = 0.90 or, equivalently, P(X > x) = 0.10 when looking up the critical value on your GDC.
Summary
- The binomial distribution models discrete counts of successes in n independent trials, with E(X) = np and Var(X) = np(1 - p).
- The normal distribution models continuous data; standardisation via z = (x - μ)/σ is essential for finding probabilities and performing inverse calculations.
- Hypothesis testing is a structured process using a test statistic to generate a p-value, which is compared to a significance level α to decide whether to reject a null hypothesis.
- HL Only: The Poisson distribution models the count of independent events occurring at a constant rate λ, where E(X) = Var(X) = λ.
- Always verify the conditions for each distribution (independence, fixed n and p, constant λ) and remember the continuity correction when approximating a binomial with a normal.