Key Takeaways: Confidence Intervals: Estimating with Uncertainty
One-Sentence Summary
A confidence interval combines a point estimate with a margin of error to produce a range of plausible values for a population parameter, where the "95% confidence" describes the long-run reliability of the method (95% of intervals constructed this way would capture the true parameter) — not the probability that any single interval is correct.
Core Concepts at a Glance
| Concept | Definition | Why It Matters |
|---|---|---|
| Confidence interval | Point estimate ± margin of error; a range of plausible values for a parameter | First formal inference tool — lets you say something about the population, not just the sample |
| Confidence level | The long-run proportion of CIs that capture the true parameter (e.g., 95%) | Describes the reliability of the method, not the probability that a specific interval is correct |
| Margin of error | Critical value × standard error; the maximum likely error in the estimate | Quantifies the precision of an estimate — smaller MOE = more precise |
| t-distribution | A bell-shaped distribution with heavier tails than the normal, indexed by degrees of freedom | Accounts for the extra uncertainty when estimating $\sigma$ with $s$ |
The Confidence Interval Architecture
$$\boxed{\text{CI} = \text{Point Estimate} \pm \text{Critical Value} \times \text{Standard Error}}$$
| Component | For a Mean | For a Proportion |
|---|---|---|
| Point estimate | $\bar{x}$ | $\hat{p}$ |
| Critical value | $t^*$ (df = $n - 1$) | $z^*$ |
| Standard error | $s / \sqrt{n}$ | $\sqrt{\hat{p}(1-\hat{p})/n}$ |
Confidence Interval Formulas
For a Population Mean
$$\boxed{\bar{x} \pm t^* \cdot \frac{s}{\sqrt{n}}, \qquad df = n - 1}$$
Conditions: 1. Random sample (or random assignment) 2. Independence (10% condition: $n \leq 0.10 \times N$) 3. Nearly normal population or large sample ($n \geq 30$)
For a Population Proportion
$$\boxed{\hat{p} \pm z^* \cdot \sqrt{\frac{\hat{p}(1-\hat{p})}{n}}}$$
Conditions: 1. Random sample (or random assignment) 2. Independence (10% condition) 3. Success-failure condition: $n\hat{p} \geq 10$ and $n(1-\hat{p}) \geq 10$
Common Critical Values
| Confidence Level | $z^*$ | $\alpha$ (two-tailed) |
|---|---|---|
| 90% | 1.645 | 0.10 |
| 95% | 1.960 | 0.05 |
| 99% | 2.576 | 0.01 |
For $t^*$, use a t-table or software with $df = n - 1$. Key pattern: $t^* > z^*$ always, but $t^* \to z^*$ as $df \to \infty$.
The t-Distribution vs. Normal
| Feature | Normal ($z$) | t-distribution |
|---|---|---|
| Shape | Bell-shaped, symmetric | Bell-shaped, symmetric, heavier tails |
| Parameter | None (fixed) | Degrees of freedom ($df$) |
| Tails | Thinner | Heavier (wider intervals) |
| When to use | Known $\sigma$ (rare), or proportions | Unknown $\sigma$, estimated by $s$ (the usual case) |
| Convergence | — | Approaches normal as $df \to \infty$ |
What "95% Confidence" Really Means
| Statement | Correct? |
|---|---|
| "If we repeated this sampling procedure many times and constructed a 95% CI each time, about 95% of those intervals would contain the true parameter." | ✅ |
| "We are 95% confident that the true parameter is in this interval." | ✅ |
| "There is a 95% probability that the parameter is in this interval." | ❌ |
| "95% of the data falls within the CI." | ❌ |
| "The interval has a 95% chance of being correct." | ❌ |
The key insight: The parameter is fixed — it doesn't move. The interval is random — it depends on which sample you get. The 95% describes how often the random interval captures the fixed target, across many repetitions of the sampling process.
The Tradeoff Triangle
Confidence Level
╱╲
╱ ╲
╱Pick╲
╱ any ╲
╱ two; ╲
╱the third ╲
╱is determined╲
╱________________╲
Sample Size ──────────────── Margin of Error
| Change | Effect on MOE |
|---|---|
| ↑ Confidence level | ↑ MOE (wider interval) |
| ↑ Sample size | ↓ MOE (narrower interval) |
| ↑ Sample SD | ↑ MOE (wider interval) |
The diminishing returns rule: To halve the margin of error, you must quadruple the sample size. (Same $\sqrt{n}$ relationship as standard error in Chapter 11.)
Sample Size Determination
For a Mean
$$\boxed{n = \left(\frac{z^* \cdot \sigma}{E}\right)^2}$$
For a Proportion
$$\boxed{n = \left(\frac{z^*}{E}\right)^2 \cdot \hat{p}(1-\hat{p})}$$
Conservative approach: Use $\hat{p} = 0.5$ when you have no prior estimate (maximizes the required $n$, guaranteeing the desired MOE).
Always round up to the next whole number.
Python Quick Reference
import numpy as np
from scipy import stats
# --- CI for a Mean ---
x_bar, s, n = 128.3, 18.6, 120
# Method 1: Direct
ci = stats.t.interval(confidence=0.95, df=n-1,
loc=x_bar, scale=s/np.sqrt(n))
# Method 2: Manual
t_star = stats.t.ppf(0.975, df=n-1)
moe = t_star * s / np.sqrt(n)
ci = (x_bar - moe, x_bar + moe)
# --- CI for a Proportion ---
x, n = 96, 800
p_hat = x / n
z_star = stats.norm.ppf(0.975)
se = np.sqrt(p_hat * (1 - p_hat) / n)
moe = z_star * se
ci = (p_hat - moe, p_hat + moe)
# --- Sample Size ---
# For a mean:
n_mean = int(np.ceil((1.96 * sigma / E) ** 2))
# For a proportion:
n_prop = int(np.ceil((1.96 / E) ** 2 * p_hat * (1 - p_hat)))
Excel Quick Reference
| Task | Excel Formula |
|---|---|
| MOE for a mean | =CONFIDENCE.T(0.05, s, n) |
| $t^*$ critical value | =T.INV.2T(0.05, df) |
| $z^*$ critical value | =NORM.S.INV(0.975) |
| SE for a proportion | =SQRT(p*(1-p)/n) |
Common Misconceptions
| Misconception | Reality |
|---|---|
| "95% probability the parameter is in the interval" | The parameter is fixed — 95% describes the method's long-run performance |
| "95% of data falls in the CI" | The CI estimates the mean (or proportion), not individual values |
| "Wider interval = worse interval" | Width reflects honesty about uncertainty; narrowing requires more data |
| "Overlapping CIs means no difference" | Unreliable shortcut — use a proper two-sample test (Ch.16) |
| "Always use 95% confidence" | Choose based on context: 99% for medical decisions, 90% for preliminary studies |
| "Larger samples always worth the cost" | Diminishing returns: quadruple $n$ to halve MOE |
| "MOE covers all uncertainty" | MOE only covers sampling error, not bias, nonresponse, or measurement error |
Worked Example Summary: Maya's Blood Pressure CI
| Step | Calculation | Result |
|---|---|---|
| Data | $\bar{x} = 128.3$, $s = 18.6$, $n = 120$ | — |
| Conditions | Random ✓, 10% ✓, $n \geq 30$ ✓ | All met |
| Critical value | $t^*_{119, 0.025} = 1.980$ | 1.980 |
| Standard error | $18.6 / \sqrt{120} = 1.698$ | 1.698 |
| Margin of error | $1.980 \times 1.698 = 3.362$ | 3.36 |
| 95% CI | $128.3 \pm 3.36$ | (124.9, 131.7) |
| Interpretation | "We are 95% confident the true mean systolic BP is between 124.9 and 131.7 mmHg" | — |
How This Chapter Connects
| This Chapter | Builds On | Leads To |
|---|---|---|
| Point estimate $\bar{x}$ | Sample mean (Ch.6) | Hypothesis testing (Ch.13) |
| Standard error $s/\sqrt{n}$ | SE concept (Ch.11) | All inference procedures |
| Critical value $t^*$ | Normal distribution (Ch.10) | t-tests (Ch.15) |
| Conditions checking | CLT (Ch.11), Random sampling (Ch.4) | Every inference chapter |
| Margin of error | Sampling variability (Ch.11) | Polling, A/B testing (Ch.16) |
| Sample size planning | SE and $\sqrt{n}$ (Ch.11) | Power analysis (Ch.17) |
The One Thing to Remember
If you forget everything else from this chapter, remember this:
A confidence interval is your first inference tool: point estimate ± margin of error. The "95% confidence" means that if you used this method many times, 95% of the resulting intervals would capture the true parameter. The parameter is fixed; the interval is random. The randomness is in which interval you happen to get, not in where the parameter is. To narrow the interval, increase the sample size — but remember the diminishing returns: quadrupling $n$ only halves the margin of error. This tradeoff between precision, confidence, and sample size is the fundamental constraint of statistical estimation, and understanding it makes you a smarter consumer of every poll, study, and data-driven claim you'll ever encounter.
Key Terms
| Term | Definition |
|---|---|
| Confidence interval (CI) | A range of plausible values for a population parameter, computed as point estimate ± margin of error |
| Confidence level | The proportion of CIs from repeated sampling that would contain the true parameter (e.g., 95%) |
| Margin of error (MOE) | The maximum likely distance between the point estimate and the true parameter; equals critical value × standard error |
| Point estimate | A single value used to estimate a population parameter ($\bar{x}$ for $\mu$, $\hat{p}$ for $p$) |
| Interval estimate | A range of values used to estimate a population parameter (synonym for confidence interval) |
| Critical value | The value from a reference distribution ($z^*$ or $t^*$) that determines the CI width for a given confidence level |
| t-distribution | A symmetric, bell-shaped distribution with heavier tails than the normal; used when $\sigma$ is estimated by $s$; indexed by degrees of freedom |
| Degrees of freedom (df) | For a one-sample CI: $df = n - 1$; determines which t-distribution to use; controls how heavy the tails are |
| Sample size determination | The process of calculating the minimum $n$ needed to achieve a desired margin of error at a given confidence level |