Key Takeaways: Confidence Intervals: Estimating with Uncertainty

Contributors

Key Takeaways: Confidence Intervals: Estimating with Uncertainty

One-Sentence Summary

A confidence interval combines a point estimate with a margin of error to produce a range of plausible values for a population parameter, where the "95% confidence" describes the long-run reliability of the method (95% of intervals constructed this way would capture the true parameter) — not the probability that any single interval is correct.

Core Concepts at a Glance

Concept	Definition	Why It Matters
Confidence interval	Point estimate ± margin of error; a range of plausible values for a parameter	First formal inference tool — lets you say something about the population, not just the sample
Confidence level	The long-run proportion of CIs that capture the true parameter (e.g., 95%)	Describes the reliability of the method, not the probability that a specific interval is correct
Margin of error	Critical value × standard error; the maximum likely error in the estimate	Quantifies the precision of an estimate — smaller MOE = more precise
t-distribution	A bell-shaped distribution with heavier tails than the normal, indexed by degrees of freedom	Accounts for the extra uncertainty when estimating $\sigma$ with $s$

The Confidence Interval Architecture

$$\boxed{\text{CI} = \text{Point Estimate} \pm \text{Critical Value} \times \text{Standard Error}}$$

Component	For a Mean	For a Proportion
Point estimate	$\bar{x}$	$\hat{p}$
Critical value	$t^*$ (df = $n - 1$)	$z^*$
Standard error	$s / \sqrt{n}$	$\sqrt{\hat{p}(1-\hat{p})/n}$

Confidence Interval Formulas

For a Population Mean

$$\boxed{\bar{x} \pm t^* \cdot \frac{s}{\sqrt{n}}, \qquad df = n - 1}$$

Conditions: 1. Random sample (or random assignment) 2. Independence (10% condition: $n \leq 0.10 \times N$) 3. Nearly normal population or large sample ($n \geq 30$)

For a Population Proportion

$$\boxed{\hat{p} \pm z^* \cdot \sqrt{\frac{\hat{p}(1-\hat{p})}{n}}}$$

Conditions: 1. Random sample (or random assignment) 2. Independence (10% condition) 3. Success-failure condition: $n\hat{p} \geq 10$ and $n(1-\hat{p}) \geq 10$

Common Critical Values

Confidence Level	$z^*$	$\alpha$ (two-tailed)
90%	1.645	0.10
95%	1.960	0.05
99%	2.576	0.01

For $t^*$, use a t-table or software with $df = n - 1$. Key pattern: $t^* > z^*$ always, but $t^* \to z^*$ as $df \to \infty$.

The t-Distribution vs. Normal

Feature	Normal ($z$)	t-distribution
Shape	Bell-shaped, symmetric	Bell-shaped, symmetric, heavier tails
Parameter	None (fixed)	Degrees of freedom ($df$)
Tails	Thinner	Heavier (wider intervals)
When to use	Known $\sigma$ (rare), or proportions	Unknown $\sigma$, estimated by $s$ (the usual case)
Convergence	—	Approaches normal as $df \to \infty$

What "95% Confidence" Really Means

Statement	Correct?
"If we repeated this sampling procedure many times and constructed a 95% CI each time, about 95% of those intervals would contain the true parameter."	✅
"We are 95% confident that the true parameter is in this interval."	✅
"There is a 95% probability that the parameter is in this interval."	❌
"95% of the data falls within the CI."	❌
"The interval has a 95% chance of being correct."	❌

The key insight: The parameter is fixed — it doesn't move. The interval is random — it depends on which sample you get. The 95% describes how often the random interval captures the fixed target, across many repetitions of the sampling process.

The Tradeoff Triangle

                  Confidence Level
                       ╱╲
                      ╱  ╲
                     ╱Pick╲
                    ╱ any  ╲
                   ╱  two;  ╲
                  ╱the third ╲
                 ╱is determined╲
                ╱________________╲
   Sample Size ──────────────── Margin of Error

Change	Effect on MOE
↑ Confidence level	↑ MOE (wider interval)
↑ Sample size	↓ MOE (narrower interval)
↑ Sample SD	↑ MOE (wider interval)

The diminishing returns rule: To halve the margin of error, you must quadruple the sample size. (Same $\sqrt{n}$ relationship as standard error in Chapter 11.)

Sample Size Determination

For a Mean

$$\boxed{n = \left(\frac{z^* \cdot \sigma}{E}\right)^2}$$

For a Proportion

$$\boxed{n = \left(\frac{z^*}{E}\right)^2 \cdot \hat{p}(1-\hat{p})}$$

Conservative approach: Use $\hat{p} = 0.5$ when you have no prior estimate (maximizes the required $n$, guaranteeing the desired MOE).

Always round up to the next whole number.

Python Quick Reference

import numpy as np
from scipy import stats

# --- CI for a Mean ---
x_bar, s, n = 128.3, 18.6, 120

# Method 1: Direct
ci = stats.t.interval(confidence=0.95, df=n-1,
                      loc=x_bar, scale=s/np.sqrt(n))

# Method 2: Manual
t_star = stats.t.ppf(0.975, df=n-1)
moe = t_star * s / np.sqrt(n)
ci = (x_bar - moe, x_bar + moe)

# --- CI for a Proportion ---
x, n = 96, 800
p_hat = x / n
z_star = stats.norm.ppf(0.975)
se = np.sqrt(p_hat * (1 - p_hat) / n)
moe = z_star * se
ci = (p_hat - moe, p_hat + moe)

# --- Sample Size ---
# For a mean:
n_mean = int(np.ceil((1.96 * sigma / E) ** 2))

# For a proportion:
n_prop = int(np.ceil((1.96 / E) ** 2 * p_hat * (1 - p_hat)))

Excel Quick Reference

Task	Excel Formula
MOE for a mean	`=CONFIDENCE.T(0.05, s, n)`
$t^*$ critical value	`=T.INV.2T(0.05, df)`
$z^*$ critical value	`=NORM.S.INV(0.975)`
SE for a proportion	`=SQRT(p*(1-p)/n)`

Common Misconceptions

Misconception	Reality
"95% probability the parameter is in the interval"	The parameter is fixed — 95% describes the method's long-run performance
"95% of data falls in the CI"	The CI estimates the mean (or proportion), not individual values
"Wider interval = worse interval"	Width reflects honesty about uncertainty; narrowing requires more data
"Overlapping CIs means no difference"	Unreliable shortcut — use a proper two-sample test (Ch.16)
"Always use 95% confidence"	Choose based on context: 99% for medical decisions, 90% for preliminary studies
"Larger samples always worth the cost"	Diminishing returns: quadruple $n$ to halve MOE
"MOE covers all uncertainty"	MOE only covers sampling error, not bias, nonresponse, or measurement error

Worked Example Summary: Maya's Blood Pressure CI

Step	Calculation	Result
Data	$\bar{x} = 128.3$, $s = 18.6$, $n = 120$	—
Conditions	Random ✓, 10% ✓, $n \geq 30$ ✓	All met
Critical value	$t^*_{119, 0.025} = 1.980$	1.980
Standard error	$18.6 / \sqrt{120} = 1.698$	1.698
Margin of error	$1.980 \times 1.698 = 3.362$	3.36
95% CI	$128.3 \pm 3.36$	(124.9, 131.7)
Interpretation	"We are 95% confident the true mean systolic BP is between 124.9 and 131.7 mmHg"	—

How This Chapter Connects

This Chapter	Builds On	Leads To
Point estimate $\bar{x}$	Sample mean (Ch.6)	Hypothesis testing (Ch.13)
Standard error $s/\sqrt{n}$	SE concept (Ch.11)	All inference procedures
Critical value $t^*$	Normal distribution (Ch.10)	t-tests (Ch.15)
Conditions checking	CLT (Ch.11), Random sampling (Ch.4)	Every inference chapter
Margin of error	Sampling variability (Ch.11)	Polling, A/B testing (Ch.16)
Sample size planning	SE and $\sqrt{n}$ (Ch.11)	Power analysis (Ch.17)

The One Thing to Remember

If you forget everything else from this chapter, remember this:

A confidence interval is your first inference tool: point estimate ± margin of error. The "95% confidence" means that if you used this method many times, 95% of the resulting intervals would capture the true parameter. The parameter is fixed; the interval is random. The randomness is in which interval you happen to get, not in where the parameter is. To narrow the interval, increase the sample size — but remember the diminishing returns: quadrupling $n$ only halves the margin of error. This tradeoff between precision, confidence, and sample size is the fundamental constraint of statistical estimation, and understanding it makes you a smarter consumer of every poll, study, and data-driven claim you'll ever encounter.

Key Terms

Term	Definition
Confidence interval (CI)	A range of plausible values for a population parameter, computed as point estimate ± margin of error
Confidence level	The proportion of CIs from repeated sampling that would contain the true parameter (e.g., 95%)
Margin of error (MOE)	The maximum likely distance between the point estimate and the true parameter; equals critical value × standard error
Point estimate	A single value used to estimate a population parameter ($\bar{x}$ for $\mu$, $\hat{p}$ for $p$)
Interval estimate	A range of values used to estimate a population parameter (synonym for confidence interval)
Critical value	The value from a reference distribution ($z^$ or $t^$) that determines the CI width for a given confidence level
t-distribution	A symmetric, bell-shaped distribution with heavier tails than the normal; used when $\sigma$ is estimated by $s$; indexed by degrees of freedom
Degrees of freedom (df)	For a one-sample CI: $df = n - 1$; determines which t-distribution to use; controls how heavy the tails are
Sample size determination	The process of calculating the minimum $n$ needed to achieve a desired margin of error at a given confidence level