Key Takeaways: Confidence Intervals: Estimating with Uncertainty

One-Sentence Summary

A confidence interval combines a point estimate with a margin of error to produce a range of plausible values for a population parameter, where the "95% confidence" describes the long-run reliability of the method (95% of intervals constructed this way would capture the true parameter) — not the probability that any single interval is correct.

Core Concepts at a Glance

Concept Definition Why It Matters
Confidence interval Point estimate ± margin of error; a range of plausible values for a parameter First formal inference tool — lets you say something about the population, not just the sample
Confidence level The long-run proportion of CIs that capture the true parameter (e.g., 95%) Describes the reliability of the method, not the probability that a specific interval is correct
Margin of error Critical value × standard error; the maximum likely error in the estimate Quantifies the precision of an estimate — smaller MOE = more precise
t-distribution A bell-shaped distribution with heavier tails than the normal, indexed by degrees of freedom Accounts for the extra uncertainty when estimating $\sigma$ with $s$

The Confidence Interval Architecture

$$\boxed{\text{CI} = \text{Point Estimate} \pm \text{Critical Value} \times \text{Standard Error}}$$

Component For a Mean For a Proportion
Point estimate $\bar{x}$ $\hat{p}$
Critical value $t^*$ (df = $n - 1$) $z^*$
Standard error $s / \sqrt{n}$ $\sqrt{\hat{p}(1-\hat{p})/n}$

Confidence Interval Formulas

For a Population Mean

$$\boxed{\bar{x} \pm t^* \cdot \frac{s}{\sqrt{n}}, \qquad df = n - 1}$$

Conditions: 1. Random sample (or random assignment) 2. Independence (10% condition: $n \leq 0.10 \times N$) 3. Nearly normal population or large sample ($n \geq 30$)

For a Population Proportion

$$\boxed{\hat{p} \pm z^* \cdot \sqrt{\frac{\hat{p}(1-\hat{p})}{n}}}$$

Conditions: 1. Random sample (or random assignment) 2. Independence (10% condition) 3. Success-failure condition: $n\hat{p} \geq 10$ and $n(1-\hat{p}) \geq 10$

Common Critical Values

Confidence Level $z^*$ $\alpha$ (two-tailed)
90% 1.645 0.10
95% 1.960 0.05
99% 2.576 0.01

For $t^*$, use a t-table or software with $df = n - 1$. Key pattern: $t^* > z^*$ always, but $t^* \to z^*$ as $df \to \infty$.

The t-Distribution vs. Normal

Feature Normal ($z$) t-distribution
Shape Bell-shaped, symmetric Bell-shaped, symmetric, heavier tails
Parameter None (fixed) Degrees of freedom ($df$)
Tails Thinner Heavier (wider intervals)
When to use Known $\sigma$ (rare), or proportions Unknown $\sigma$, estimated by $s$ (the usual case)
Convergence Approaches normal as $df \to \infty$

What "95% Confidence" Really Means

Statement Correct?
"If we repeated this sampling procedure many times and constructed a 95% CI each time, about 95% of those intervals would contain the true parameter."
"We are 95% confident that the true parameter is in this interval."
"There is a 95% probability that the parameter is in this interval."
"95% of the data falls within the CI."
"The interval has a 95% chance of being correct."

The key insight: The parameter is fixed — it doesn't move. The interval is random — it depends on which sample you get. The 95% describes how often the random interval captures the fixed target, across many repetitions of the sampling process.

The Tradeoff Triangle

                  Confidence Level
                       ╱╲
                      ╱  ╲
                     ╱Pick╲
                    ╱ any  ╲
                   ╱  two;  ╲
                  ╱the third ╲
                 ╱is determined╲
                ╱________________╲
   Sample Size ──────────────── Margin of Error
Change Effect on MOE
↑ Confidence level ↑ MOE (wider interval)
↑ Sample size ↓ MOE (narrower interval)
↑ Sample SD ↑ MOE (wider interval)

The diminishing returns rule: To halve the margin of error, you must quadruple the sample size. (Same $\sqrt{n}$ relationship as standard error in Chapter 11.)

Sample Size Determination

For a Mean

$$\boxed{n = \left(\frac{z^* \cdot \sigma}{E}\right)^2}$$

For a Proportion

$$\boxed{n = \left(\frac{z^*}{E}\right)^2 \cdot \hat{p}(1-\hat{p})}$$

Conservative approach: Use $\hat{p} = 0.5$ when you have no prior estimate (maximizes the required $n$, guaranteeing the desired MOE).

Always round up to the next whole number.

Python Quick Reference

import numpy as np
from scipy import stats

# --- CI for a Mean ---
x_bar, s, n = 128.3, 18.6, 120

# Method 1: Direct
ci = stats.t.interval(confidence=0.95, df=n-1,
                      loc=x_bar, scale=s/np.sqrt(n))

# Method 2: Manual
t_star = stats.t.ppf(0.975, df=n-1)
moe = t_star * s / np.sqrt(n)
ci = (x_bar - moe, x_bar + moe)

# --- CI for a Proportion ---
x, n = 96, 800
p_hat = x / n
z_star = stats.norm.ppf(0.975)
se = np.sqrt(p_hat * (1 - p_hat) / n)
moe = z_star * se
ci = (p_hat - moe, p_hat + moe)

# --- Sample Size ---
# For a mean:
n_mean = int(np.ceil((1.96 * sigma / E) ** 2))

# For a proportion:
n_prop = int(np.ceil((1.96 / E) ** 2 * p_hat * (1 - p_hat)))

Excel Quick Reference

Task Excel Formula
MOE for a mean =CONFIDENCE.T(0.05, s, n)
$t^*$ critical value =T.INV.2T(0.05, df)
$z^*$ critical value =NORM.S.INV(0.975)
SE for a proportion =SQRT(p*(1-p)/n)

Common Misconceptions

Misconception Reality
"95% probability the parameter is in the interval" The parameter is fixed — 95% describes the method's long-run performance
"95% of data falls in the CI" The CI estimates the mean (or proportion), not individual values
"Wider interval = worse interval" Width reflects honesty about uncertainty; narrowing requires more data
"Overlapping CIs means no difference" Unreliable shortcut — use a proper two-sample test (Ch.16)
"Always use 95% confidence" Choose based on context: 99% for medical decisions, 90% for preliminary studies
"Larger samples always worth the cost" Diminishing returns: quadruple $n$ to halve MOE
"MOE covers all uncertainty" MOE only covers sampling error, not bias, nonresponse, or measurement error

Worked Example Summary: Maya's Blood Pressure CI

Step Calculation Result
Data $\bar{x} = 128.3$, $s = 18.6$, $n = 120$
Conditions Random ✓, 10% ✓, $n \geq 30$ ✓ All met
Critical value $t^*_{119, 0.025} = 1.980$ 1.980
Standard error $18.6 / \sqrt{120} = 1.698$ 1.698
Margin of error $1.980 \times 1.698 = 3.362$ 3.36
95% CI $128.3 \pm 3.36$ (124.9, 131.7)
Interpretation "We are 95% confident the true mean systolic BP is between 124.9 and 131.7 mmHg"

How This Chapter Connects

This Chapter Builds On Leads To
Point estimate $\bar{x}$ Sample mean (Ch.6) Hypothesis testing (Ch.13)
Standard error $s/\sqrt{n}$ SE concept (Ch.11) All inference procedures
Critical value $t^*$ Normal distribution (Ch.10) t-tests (Ch.15)
Conditions checking CLT (Ch.11), Random sampling (Ch.4) Every inference chapter
Margin of error Sampling variability (Ch.11) Polling, A/B testing (Ch.16)
Sample size planning SE and $\sqrt{n}$ (Ch.11) Power analysis (Ch.17)

The One Thing to Remember

If you forget everything else from this chapter, remember this:

A confidence interval is your first inference tool: point estimate ± margin of error. The "95% confidence" means that if you used this method many times, 95% of the resulting intervals would capture the true parameter. The parameter is fixed; the interval is random. The randomness is in which interval you happen to get, not in where the parameter is. To narrow the interval, increase the sample size — but remember the diminishing returns: quadrupling $n$ only halves the margin of error. This tradeoff between precision, confidence, and sample size is the fundamental constraint of statistical estimation, and understanding it makes you a smarter consumer of every poll, study, and data-driven claim you'll ever encounter.

Key Terms

Term Definition
Confidence interval (CI) A range of plausible values for a population parameter, computed as point estimate ± margin of error
Confidence level The proportion of CIs from repeated sampling that would contain the true parameter (e.g., 95%)
Margin of error (MOE) The maximum likely distance between the point estimate and the true parameter; equals critical value × standard error
Point estimate A single value used to estimate a population parameter ($\bar{x}$ for $\mu$, $\hat{p}$ for $p$)
Interval estimate A range of values used to estimate a population parameter (synonym for confidence interval)
Critical value The value from a reference distribution ($z^*$ or $t^*$) that determines the CI width for a given confidence level
t-distribution A symmetric, bell-shaped distribution with heavier tails than the normal; used when $\sigma$ is estimated by $s$; indexed by degrees of freedom
Degrees of freedom (df) For a one-sample CI: $df = n - 1$; determines which t-distribution to use; controls how heavy the tails are
Sample size determination The process of calculating the minimum $n$ needed to achieve a desired margin of error at a given confidence level