You have 3 free guides left 😟
Unlock your guides
You have 3 free guides left 😟
Unlock your guides

10.4 Hypergeometric distribution

3 min readjuly 19, 2024

The models from a . It's crucial for scenarios like or , where each draw changes the probability of success.

Unlike the , hypergeometric accounts for changing probabilities as items are removed. It's ideal for small populations or when the sample is a significant fraction of the whole.

Hypergeometric Distribution

Hypergeometric experiment and assumptions

Top images from around the web for Hypergeometric experiment and assumptions
Top images from around the web for Hypergeometric experiment and assumptions
  • Involves drawing a sample of size [n](https://www.fiveableKeyTerm:n)[n](https://www.fiveableKeyTerm:n) from a finite population of size NN without replacement (cards from a deck, defective items from a batch)
    • Population contains MM items with a specific characteristic () and NMN-M items without that characteristic (failures)
    • nn is fixed and known in advance
  • Key assumptions:
    • NN is finite and known (number of cards in a deck, total items in a batch)
    • Number of successes MM in the population is known (number of aces in a deck, defective items in a batch)
    • Each item is either a success or a failure (card is an ace or not, item is defective or not)
    • Sample is drawn without replacement, each item can only be selected once
    • Probability of success changes after each draw as the composition of the population changes (probability of drawing an ace decreases after each ace is drawn)

Hypergeometric vs binomial distributions

  • Hypergeometric distribution used when sampling is done without replacement, binomial distribution assumes sampling with replacement
    • In hypergeometric, probability of success changes after each draw
    • In binomial, probability of success remains constant for each trial (flipping a coin, rolling a die)
  • Population size is finite and known in hypergeometric, binomial assumes infinite population or sampling with replacement
  • Number of successes in population is known in hypergeometric, not a requirement for binomial
  • Hypergeometric more appropriate for smaller populations or when sample size is a significant fraction of the population (drawing 10 cards from a 52-card deck)

Hypergeometric probability calculations

  • Probability mass function (PMF) of hypergeometric distribution: P(X=[k](https://www.fiveableKeyTerm:k))=(Mk)(NMnk)(Nn)P(X=[k](https://www.fiveableKeyTerm:k)) = \frac{\binom{M}{k} \binom{N-M}{n-k}}{\binom{N}{n}}

    where:

    • XX = random variable representing number of successes in the sample
    • kk = number of successes in the sample
    • MM = number of successes in the population
    • NN = population size
    • nn = sample size
  • To compute probabilities:

    1. Identify values of NN, MM, nn, and kk from given problem (52 cards in deck, 4 aces, drawing 5 cards, probability of 2 aces)
    2. Substitute values into PMF formula
    3. Calculate binomial coefficients and perform arithmetic operations
    4. Result is probability of observing exactly kk successes in the sample

Applications of hypergeometric distribution

  • Real-world problems where hypergeometric distribution can be applied:
    • Quality control: Inspecting sample of items from production batch to determine number of defective items
    • Auditing: Selecting sample of financial transactions to check for errors or fraud
    • Genetics: Studying inheritance of specific traits in a population (pea plants, eye color)
    • Surveys: Analyzing characteristics of a sample drawn from a finite population
  • Steps to solve real-world problems:
    1. Identify population size (NN), number of successes in population (MM), and sample size (nn)
    2. Determine number of successes (kk) in sample to calculate probability for
    3. Use hypergeometric distribution formula to compute probability of observing kk successes in sample
    4. Interpret results in context of the problem

Appropriate use of hypergeometric distribution

  • Hypergeometric distribution appropriate when:
    • Population is finite and known (cards in a deck, items in a batch)
    • Sample is drawn without replacement
    • Number of successes in population is known
    • Probability of success changes after each draw
    • Sample size is a significant fraction of the population (more than 5% of population)
  • May not be appropriate when:
    • Population is very large compared to sample size, binomial distribution can be used as an approximation
    • Sampling is done with replacement or population is assumed to be infinite, binomial distribution is more suitable
    • Number of successes in population is unknown, other distributions or methods may be more appropriate (normal distribution for large samples)
© 2024 Fiveable Inc. All rights reserved.
AP® and SAT® are trademarks registered by the College Board, which is not affiliated with, and does not endorse this website.


© 2024 Fiveable Inc. All rights reserved.
AP® and SAT® are trademarks registered by the College Board, which is not affiliated with, and does not endorse this website.

© 2024 Fiveable Inc. All rights reserved.
AP® and SAT® are trademarks registered by the College Board, which is not affiliated with, and does not endorse this website.
Glossary
Glossary