Sampling Distributions

AP Statistics guide to sampling distributions: distribution of sample means and proportions, Central Limit Theorem, and standard error.

# Sampling Distributions — AP Statistics

Sampling distributions describe how a statistic (like xˉ\bar{x} or p^\hat{p}) varies across all possible samples. Understanding this concept is the bridge between probability and inference.

Key Concepts

Sampling Distribution of xˉ\bar{x} (Sample Mean)

  • Mean: μxˉ=μ\mu_{\bar{x}} = \mu
  • Standard deviation (Standard Error): σxˉ=σn\sigma_{\bar{x}} = \frac{\sigma}{\sqrt{n}}
  • Shape: If the population is normal, xˉ\bar{x} is normal. If not, the CLT applies for large nn.

Central Limit Theorem (CLT)

For large sample size (n30n \geq 30 as a rule of thumb), the sampling distribution of xˉ\bar{x} is approximately normal regardless of the population shape.

Sampling Distribution of p^\hat{p} (Sample Proportion)

  • Mean: μp^=p\mu_{\hat{p}} = p
  • Standard deviation: σp^=p(1p)n\sigma_{\hat{p}} = \sqrt{\frac{p(1-p)}{n}}
  • Shape: Approximately normal if np10np \geq 10 and n(1p)10n(1-p) \geq 10.

Key Ideas

  • Larger nn → smaller standard error → more precise estimates.
  • The sampling distribution is centered at the parameter.
  • Standard error measures the typical deviation of the statistic from the parameter.

Sampling Distribution of p^1p^2\hat{p}_1 - \hat{p}_2

μ=p1p2,σ=p1(1p1)n1+p2(1p2)n2\mu = p_1 - p_2, \quad \sigma = \sqrt{\frac{p_1(1-p_1)}{n_1} + \frac{p_2(1-p_2)}{n_2}}

Sampling Distribution of xˉ1xˉ2\bar{x}_1 - \bar{x}_2

μ=μ1μ2,σ=σ12n1+σ22n2\mu = \mu_1 - \mu_2, \quad \sigma = \sqrt{\frac{\sigma_1^2}{n_1} + \frac{\sigma_2^2}{n_2}}

Worked Example

Problem: A population has μ=50\mu = 50 and σ=12\sigma = 12. For samples of n=36n = 36, describe the sampling distribution of xˉ\bar{x}.

Solution:

  • Mean: μxˉ=50\mu_{\bar{x}} = 50.
  • Standard error: σxˉ=12/36=2\sigma_{\bar{x}} = 12/\sqrt{36} = 2.
  • Shape: By CLT (n=3630n = 36 \geq 30), approximately normal.
  • xˉN(50,2)\bar{x} \sim N(50, 2).

P(xˉ>53)=P(z>1.5)0.067P(\bar{x} > 53) = P(z > 1.5) \approx 0.067.

Practice Questions

  1. 1. If nn is quadrupled, what happens to the standard error of xˉ\bar{x}?

    It is halved (σ/4n=σ/(2n)\sigma/\sqrt{4n} = \sigma/(2\sqrt{n})).

    2. A poll finds p^=0.6\hat{p} = 0.6 with n=400n = 400. What is the standard error?

    SE=0.6(0.4)/400=0.00060.0245SE = \sqrt{0.6(0.4)/400} = \sqrt{0.0006} \approx 0.0245.

    3. Why is the CLT important for inference?

    It allows us to use normal-based methods (z and t tests) even when the population isn't normal, as long as the sample size is large enough.

Want to check your answers and get step-by-step solutions?

Get it on Google PlayDownload on the App Store

Summary

  • Sampling distributions describe statistic variability across samples.
  • SExˉ=σ/nSE_{\bar{x}} = \sigma/\sqrt{n}; SEp^=p(1p)/nSE_{\hat{p}} = \sqrt{p(1-p)/n}.
  • CLT: for large nn, sampling distributions are approximately normal.
  • Larger samples → smaller standard error → better estimates.

Ready to Ace Your AP STATISTICS statistics?

Get instant step-by-step solutions to any problem. Snap a photo and learn with Tutor AI — your personal exam prep companion.

Get it on Google PlayDownload on the App Store