Definition
The Central Limit Theorem (CLT) states that when independent random variables are added, their properly normalized sum tends toward a normal distribution (a bell curve), even if the original variables themselves are not normally distributed.
How to read: Z equals the sample mean X bar minus the population mean mu, all divided by the population standard deviation sigma over the square root of the sample size n. Meaning / when to use: Used to standardize the sample mean, demonstrating that as grows large, the distribution of approaches the standard normal distribution .
Why It Matters
The CLT is the foundational bridge between probability and statistics. It explains why the normal distribution is ubiquitous in nature and society. Because of the CLT, we can use simple statistical methods to make highly accurate inferences about large populations based on relatively small samples, regardless of the underlying population’s original shape. Without it, predictive modeling, quality control, and scientific hypothesis testing would be incredibly complex and often impossible.
Core Concepts
- Sample Size Dependency: The approximation to a normal distribution improves as the sample size increases (typically, is considered sufficient for most practical purposes).
- Independence: The randomly sampled variables must be independent of one another.
- Identical Distribution: The variables are drawn from the same probability distribution with a well-defined mean () and finite variance ().
- Smoothing of Chaos: It acts as a macroscopic smoothing effect over microscopic randomness, distilling unpredictable individual events into a highly predictable collective pattern.