Back to Statistics
Mathematics & Statistics / Statistics

A Beginner's Guide to Probability Distributions: When to Use Normal, Binomial, and Poisson

A Beginner's Guide to Probability Distributions

When Netflix recommends shows, insurance companies price policies, or hospitals predict emergency room demand, they're using probability distributions—mathematical models that describe how likely different outcomes are. The most common question in applied statistics isn't "what's the probability?"—it's "which distribution should I use?" Choose the wrong distribution and your predictions will be systematically wrong. Choose correctly and you can model everything from manufacturing defects to election outcomes with remarkable accuracy. The three most widely used distributions—normal, binomial, and Poisson—each apply to specific types of real-world situations, and knowing which to use is essential for anyone working with data.

Quick Decision Guide: Which Distribution to Use

Your SituationDistributionKey QuestionExample
Measuring continuous data (height, weight, temperature)NormalIs it roughly symmetric and bell-shaped?Adult heights, measurement errors
Counting successes in fixed trials (yes/no outcomes)BinomialDo you have a fixed number of independent yes/no trials?20 coin flips, defect rates in 100-item batch
Counting events over time/space (rare events)PoissonAre events rare and independent?Customer arrivals per hour, accidents per month
Measuring time between eventsExponentialWhat's the time until the next occurrence?Time between customer arrivals
Modeling proportions/percentagesBetaDo you need probabilities of probabilities?Click-through rates, conversion rates

The Normal Distribution: When Measurements Cluster Around an Average

What It Models

The normal (Gaussian) distribution models continuous measurements that cluster symmetrically around a mean. It's the famous bell curve.

Mathematical properties:

  • Symmetric around the mean
  • 68% of data within ±1 standard deviation
  • 95% within ±2 standard deviations
  • 99.7% within ±3 standard deviations

When to Use It

Use the normal distribution when:

  1. Measuring natural characteristics: Height, weight, blood pressure
  2. Averaging many random factors: Test scores (combining many knowledge items)
  3. Measurement errors: Instrument readings, experimental errors
  4. Large sample sums/averages: Central Limit Theorem makes sums of random variables approximately normal

Real-world examples:

Manufacturing quality control: Bolt diameters from a machine process are normally distributed around target specification. If specifications require 10mm ± 0.5mm, you can calculate what percentage of bolts fall outside tolerance using the normal distribution.

Educational testing: SAT scores are designed to be normally distributed with mean 1000 (now 1050) and standard deviation 200. This allows percentile ranking—a score of 1250 is approximately 84th percentile (one standard deviation above mean).

Medical diagnostics: Blood pressure, cholesterol levels, and many biomarkers follow roughly normal distributions in healthy populations. Abnormal values are often defined as being more than 2 standard deviations from mean.

When NOT to Use It

Count data (number of events): Can't have negative values, may not be symmetric ❌ Proportions/percentages: Bounded between 0 and 1 ❌ Highly skewed data: Income, wealth, city populations (use log-normal instead) ❌ Small samples from non-normal populations: Central Limit Theorem requires adequate sample size

The Binomial Distribution: Counting Successes in Fixed Trials

What It Models

The binomial distribution models the number of successes in a fixed number of independent yes/no (Bernoulli) trials.

Requirements:

  1. Fixed number of trials (n)
  2. Each trial has only two outcomes (success/failure)
  3. Probability of success (p) is constant
  4. Trials are independent

Parameters: n (number of trials), p (probability of success)

When to Use It

Use the binomial distribution when:

  1. Quality control with fixed sample sizes: Inspecting 100 items, counting defects
  2. Survey responses: 50 people asked yes/no question
  3. Medical trials: 20 patients, counting how many respond to treatment
  4. Conversion tracking: 1000 website visitors, counting purchases

Real-world examples:

Clinical trials: A drug has 60% historical success rate. In a trial with 30 patients, what's the probability at least 20 respond positively?

Using binomial distribution with n=30, p=0.6:

  • Mean expected responders: 18
  • Probability of ≥20 responders: ~38%
  • Probability of ≤12 responders: ~2.5% (might indicate drug doesn't work as expected)

Manufacturing quality: A process produces 2% defective items. In batches of 100, what's the probability of more than 5 defects?

Binomial with n=100, p=0.02:

  • Expected defects: 2
  • Probability of >5 defects: ~1.8% (would trigger investigation if observed)

Email marketing: 5000 recipients, 3% typical click-through rate. What's the range of expected clicks?

Binomial with n=5000, p=0.03:

  • Mean: 150 clicks
  • 95% confidence interval: 133-167 clicks
  • Seeing <120 clicks would suggest campaign underperforming

When NOT to Use It

Unknown number of trials: Use Poisson instead ❌ Probability changes between trials: Not independent or constant p ❌ More than two outcomes: Use multinomial distribution ❌ Very large n with small p: Use Poisson approximation (more efficient)

The Poisson Distribution: Modeling Rare Events Over Time or Space

What It Models

The Poisson distribution models the count of rare, independent events occurring in a fixed interval of time or space.

Requirements:

  1. Events occur independently
  2. Events are rare (relative to opportunities)
  3. Average rate (λ) is constant
  4. You're counting events in a fixed interval

Parameter: λ (lambda) = average rate of events

When to Use It

Use the Poisson distribution when:

  1. Customer arrivals: People arriving at store/website per hour
  2. Rare defects: Flaws in a manufactured sheet of material
  3. Natural phenomena: Earthquakes per year, meteor impacts
  4. Call center volume: Calls received per minute
  5. Disease incidence: Rare disease cases in population per month

Real-world examples:

Hospital emergency department: Average 4 patients arrive per hour during overnight shift. What's the probability of more than 8 arrivals in one hour?

Poisson with λ=4:

  • Probability of ≤3 arrivals: ~43%
  • Probability of >8 arrivals: ~1.4%
  • This helps staffing decisions—how often will they be overwhelmed?

Website traffic: Server handles average 2 requests per second. What capacity is needed to handle 99% of seconds without queuing?

Poisson with λ=2:

  • Probability of ≤5 requests: ~98.3%
  • Probability of ≤6 requests: ~99.5%
  • Design for 6-7 concurrent requests to meet 99% reliability

Rare disease surveillance: A city averages 3 cases of a rare disease per year. If 8 cases appear, is this a significant outbreak?

Poisson with λ=3:

  • Probability of ≥8 cases: ~1.2%
  • This would trigger public health investigation

When NOT to Use It

Events occur in clusters: Poisson assumes independence; earthquakes have aftershocks ❌ Rate varies over time: Rush hour vs. overnight has different λ; need to model separately ❌ Very common events: For frequent events, normal approximation may be more appropriate ❌ Events have maximum limit: Binomial is more appropriate when there's a fixed n

Using Distribution Calculators for Practical Problems

When working with real data or planning studies, probability distribution calculators help you:

Answer "what if" questions:

  • "If my conversion rate is really 2%, what click range would I typically see in 10,000 visitors?"
  • "How often would I see 6+ defects in a batch if my process runs at 1% defect rate?"

Calculate probabilities for decision-making:

  • "What's the probability of exceeding capacity?"
  • "Is this observation unusually high or just normal variation?"

Validate distribution choice:

  • Plot your data against theoretical distribution
  • Check if your parameters (mean, standard deviation) match theoretical predictions

Design systems:

  • "What buffer capacity do I need to handle 95% of demand spikes?"
  • "How many trials do I need to detect a difference between 60% and 70% success rates?"

For example, using a binomial probability calculator with n=100, p=0.05, you can quickly find that:

  • Probability of exactly 5 defects: 18.0%
  • Probability of ≤3 defects: 25.9%
  • Probability of ≥8 defects: 13.4%

This helps you set quality control thresholds without hand-calculating every scenario.

Common Misconceptions About Distributions

Misconception 1: "Everything is normally distributed"

Reality: The normal distribution is common but not universal. Many real-world phenomena are skewed, bounded, or discrete.

Examples that aren't normal:

  • Income/wealth (heavily right-skewed, use log-normal)
  • Reaction times (right-skewed, can't be negative)
  • Proportions between 0-1 (use beta distribution)
  • Count data (discrete, use Poisson or binomial)

When normal works anyway: Due to Central Limit Theorem, sample means are approximately normal even when individual measurements aren't—but only with adequate sample size.

Misconception 2: "Poisson and binomial are the same for rare events"

Reality: They're related but different. Poisson is the limiting case of binomial when n is large and p is small, but:

Use binomial when: You have a fixed number of trials (inspecting 100 items) Use Poisson when: You don't have a fixed number of trials (customers arriving in an hour—you don't know how many could arrive)

Misconception 3: "If my data histogram looks bell-shaped, it's normal"

Reality: Many distributions look roughly bell-shaped. Check:

  • Skewness (is it symmetric?)
  • Kurtosis (are tails heavier or lighter than normal?)
  • Theoretical fit (do quantiles match normal distribution?)

Better approach: Use Q-Q plots or formal normality tests (Shapiro-Wilk, Anderson-Darling) rather than just eyeballing.

Misconception 4: "Distribution choice doesn't matter much"

Reality: Wrong distribution = systematically wrong predictions.

Example: Using normal distribution for Poisson data (customer arrivals):

  • Normal might predict negative arrivals (impossible)
  • Normal underestimates probability of extreme values
  • Capacity planning based on wrong distribution leads to system failures

Advanced Decision Framework

Step-by-Step Distribution Selection

1. What type of data do you have?

  • Continuous measurements → Consider normal, exponential, log-normal
  • Discrete counts → Consider Poisson, binomial, negative binomial
  • Proportions/rates → Consider beta, binomial
  • Time-to-event → Consider exponential, Weibull

2. Are there natural bounds?

  • No bounds → Normal might work
  • Lower bound at zero → Exponential, Poisson, log-normal
  • Bounded between 0 and 1 → Beta distribution
  • Bounded counts with maximum → Binomial

3. How are data generated?

  • Fixed trials with success/failure → Binomial
  • Rare events over time → Poisson
  • Sum/average of random variables → Normal (by CLT)
  • Time between events → Exponential

4. Check distributional assumptions

  • Plot histogram and compare to theoretical distribution
  • Calculate sample mean, variance, skewness
  • Use Q-Q plots for formal comparison
  • Consider domain knowledge (physical constraints, process understanding)

Key Takeaways

Choosing the right probability distribution isn't academic—it's essential for making accurate predictions. Use the wrong distribution and you'll systematically underestimate or overestimate probabilities, leading to poor decisions in quality control, capacity planning, risk assessment, and statistical inference.

The three most common distributions each model specific scenarios:

  • Normal: Continuous measurements clustering around a mean (heights, test scores, measurement errors)
  • Binomial: Counting successes in a fixed number of trials (defects in batch, survey responses)
  • Poisson: Counting rare, independent events (customer arrivals, accidents, rare defects)

Quick decision rule:

  • Continuous and symmetric? → Try normal
  • Fixed trials, yes/no outcomes? → Try binomial
  • Counting rare events? → Try Poisson

Always validate your choice with data visualization, parameter checks, and domain knowledge. A calculator can quickly compute probabilities—but only if you've chosen the right distribution. Understanding which model fits your situation is the critical first step that no calculator can do for you.