📐 Central Limit Theorem (CLT)
Last Updated: Jan 2026
The Central Limit Theorem (CLT) states that:
If we take sufficiently large random samples from any population, the distribution of the sample mean will be approximately normal, regardless of the population’s original distribution.
This is one of the most important theorems in statistics and data science.
🗣 Hinglish Tip: CLT = chahe data kaisa bhi ho, sample mean ka graph normal ho jaata hai (agar sample size bada ho)
Central Limit Theorem is Important?
- Allows use of normal distribution
- Basis of:
- Confidence Interval
- Hypothesis Testing
- Z-test, t-test
- Widely used in:
- Machine Learning
- Data Science
- Quality Control
Conditions for CLT
CLT works when:
- Samples are random
- Samples are independent
- Sample size is large enough
Rule of thumb:
n ≥ 30
Mathematical Statement of CLT
If population has:
- Mean →
μ - Standard deviation →
σ
Then sampling distribution of sample mean has:
- Mean:
μₓ̄ = μ
- Standard Deviation (Standard Error):
σₓ̄ = σ / √n
Where:
n= sample size
Example
Suppose we have a population:
2, 4, 6, 8, 10
Step 1: Calculate Population Mean (μ)
μ = (2 + 4 + 6 + 8 + 10) / 5 = 30 / 5 = 6
Step 2: Take Samples of Size n = 2
We take all possible samples of size 2 and calculate their means.
| Sample | Values | Sample Mean (x̄) |
|---|---|---|
| S₁ | 2, 4 | 3 |
| S₂ | 2, 6 | 4 |
| S₃ | 2, 8 | 5 |
| S₄ | 2, 10 | 6 |
| S₅ | 4, 6 | 5 |
| S₆ | 4, 8 | 6 |
| S₇ | 4, 10 | 7 |
| S₈ | 6, 8 | 7 |
| S₉ | 6, 10 | 8 |
| S₁₀ | 8, 10 | 9 |
Step 3: Sampling Distribution of Sample Mean
Sample means obtained:
3, 4, 5, 6, 5, 6, 7, 7, 8, 9
Now calculate mean of sample means:
μₓ̄ = (3 + 4 + 5 + 6 + 5 + 6 + 7 + 7 + 8 + 9) / 10
μₓ̄ = 60 / 10 = 6
✔ Same as population mean
CLT Observation
- Population Mean (μ) = 6
- Mean of Sample Means (μₓ̄) = 6
As sample size increases:
- Distribution of x̄ becomes more normal
- Spread becomes narrower
🗣 Hinglish Tip: Sample size badhao → curve smooth aur bell shape ho jaati hai