Statistics — Distribution of the Sample Mean and the Central Limit Theorem
Statistics — Distribution of the Sample Mean and the Central Limit Theorem. Practice questions to deepen understanding of the distribution of the sample mean and the Central Limit Theorem. Online statistics practice with full solutions and step-by-step explanations.
Sampling distribution and CLT practice — 50 questions: Central Limit Theorem, standard error, confidence intervals, normal approximation, Bootstrap. Advanced statistics.
Part A — Theory.
100 questions
Question 1
2.00 pts
🎯 :
(Central Limit Theorem)?
Explanation:
💡 :
! 🎯
📚 :
(CLT):
X₁, X₂, ..., Xₙ μ σ²,
n → ∞:
Question 2
10.00 pts
🎯 The most important theorem in statistics:
What does the Central Limit Theorem (CLT) state?
Explanation:
💡 Detailed explanation:
The Central Limit Theorem! 🎯
📚 Statement of the theorem:
Central Limit Theorem (CLT):
If X₁, X₂, ..., Xₙ is a random sample from a population with mean μ and variance σ²,
then as n → ∞:
x̄ ~ N(μ, σ²/n)
Or in standardized form:
Z = (x̄ - μ)/(σ/√n) ~ N(0, 1)
⭐ The magic:
This works regardless of the original distribution!
• The population can be: - Uniform - Exponential - Binomial - Asymmetric - With outliers - Any distribution!
• But x̄ always (when n is large) will be distributed normally!
💡 Conditions:
1️⃣ n large enough • Rule of thumb: n ≥ 30 • If the population is symmetric: n=15 is enough • If the population is normal: any n is enough!
2️⃣ The sample is random
3️⃣ μ and σ² exist and are finite
🎲 Intuitive example:
Rolling a die: • Distribution: uniform (1,2,3,4,5,6) • Not normal at all!
But: • Average of 30 rolls • Average of another 30 rolls • Etc...
→ The averages will be distributed normally!
Question 3
10.00 pts
📊 Parameters:
If X has any distribution with E(X) = μ and Var(X) = σ², what is the distribution of x̄?
Explanation:
💡 Detailed explanation:
Parameters of x̄! 📊
📐 The complete formulas:
Parameters of the distribution of x̄:
1️⃣ Expectation (mean): E(x̄) = μ
2️⃣ Variance: Var(x̄) = σ²/n
3️⃣ Standard deviation (standard error): SD(x̄) = SE = σ/√n
🔍 Proofs:
Proof 1: E(x̄) = μ
x̄ = (X₁ + X₂ + ... + Xₙ) / n
E(x̄) = E[(X₁ + X₂ + ... + Xₙ) / n]
= [E(X₁) + E(X₂) + ... + E(Xₙ)] / n
= [μ + μ + ... + μ] / n
= nμ / n = μ ✓
Proof 2: Var(x̄) = σ²/n
x̄ = (X₁ + X₂ + ... + Xₙ) / n
Var(x̄) = Var[(X₁ + ... + Xₙ) / n]
= (1/n²) × Var(X₁ + ... + Xₙ)
Since the variables are independent:
= (1/n²) × [Var(X₁) + ... + Var(Xₙ)]
= (1/n²) × [σ² + σ² + ... + σ²]
= (1/n²) × nσ²
= σ²/n ✓
⭐ Insights:
1️⃣ x̄ is an unbiased estimator: E(x̄) = μ → "on average we hit the target"
2️⃣ The spread shrinks with n: Var(x̄) = σ²/n → as n grows, the variance shrinks
3️⃣ Rate of convergence: SE = σ/√n → to halve SE you need 4 times the sample!
Question 4
2.00 pts
📊 :
X ~ E(X)=μ, Var(X)=σ², x̄?
Explanation:
💡 :
x̄! 📊
📐 No:
x̄:
1️⃣ (): E(x̄) = μ
Question 5
2.00 pts
📏 :
?
Explanation:
💡 :
-CLT! 📏
📊 :
n
Question 6
10.00 pts
📏 Rule of thumb:
What is the commonly used minimum sample size for x̄ to be approximately normally distributed?
Explanation:
💡 Detailed explanation:
Sample size for CLT! 📏
📊 Rules of thumb:
Population state
Minimum n
Explanation
Normal
any n!
x̄ is always normal
Symmetric (no outliers)
n ≥ 15
Fast convergence
General (any distribution)
n ≥ 30
The general rule!
Highly asymmetric (outliers)
n ≥ 50-100
More data needed
⭐ The golden rule:
n ≥ 30
This is the safest value that works in most cases
💡 Why 30?
• It is not a physical law, but a practical rule • Found empirically to work well • In scientific research: n=30 is considered a "significance threshold"
🎯 Examples:
📊 Case 1: Uniform distribution • Already at n=10 a good approximation • At n=30 excellent
📊 Case 2: Exponential distribution • Need n≥40-50 • Highly asymmetric
📊 Case 3: Normal • Even n=5 is enough • x̄ is always normal!
⚠️ Important:
If n < 30 and we don't know the population is normal → we should use the t distribution (not Z)
Question 7
10.00 pts
🎲 Independence:
Does the CLT depend on the shape of the population distribution?
Explanation:
💡 Detailed explanation:
Universality of CLT! 🎲
🌟 The magic of CLT:
The amazing principle:
The Central Limit Theorem works for any distribution!
• Uniform ✓ • Exponential ✓ • Binomial ✓ • Poisson ✓ • Beta ✓ • Gamma ✓ • Asymmetric ✓ • With outliers ✓ • Mixed ✓ • Any distribution that has finite μ and σ²!
📊 Visual illustration:
The population: Can be any shape ⬜ ▅ ⬛ ▂ ▄ (asymmetric)
After sampling: x̄ is distributed normally 🔔 (symmetric bell!)
🎯 Concrete examples:
Example 1: Uniform distribution
🎲 Rolling a die (1-6): • Distribution: flat rectangle ▭ • Not normal at all!
But average of 30 rolls: → Distributed normally! 🔔
Example 2: Exponential distribution
⏱️ Waiting time (highly asymmetric): • Most times short / • Few times long
But average of 50 customer times: → Distributed normally! 🔔
⭐ Why is this so important?
1️⃣ No need to know the original distribution!
2️⃣ We can always use the Z table
3️⃣ Allows statistical inference almost always
That's why CLT is called "the most important theorem in statistics"
Question 8
2.00 pts
🎲 :
?
Explanation:
💡 :
-CLT! 🎲
🌟 CLT:
:
!
• ✓ • be
Question 9
2.00 pts
✅ :
?
Explanation:
💡 :
CLT! ✅
📋 No:
:
1️⃣ X₁, X₂, ..., Xₙ ( )
2️⃣ E(X) = μ Var(X) = σ²
Question 10
10.00 pts
✅ Conditions:
What are the conditions required for the CLT?
Explanation:
💡 Detailed explanation:
CLT conditions! ✅
📋 The full conditions:
Conditions of the Central Limit Theorem:
1️⃣ Random sample X₁, X₂, ..., Xₙ is a random sample (independent and identically distributed)
2️⃣ Existence of mean and variance E(X) = μ exists and is finite Var(X) = σ² exists and is finite
3️⃣ Sufficient sample size n large enough (usually n ≥ 30)
🔍 Detailed explanation of each condition:
1️⃣ Random sample (i.i.d):
i.i.d = independent and identically distributed
• independent: The choice of one does not affect the other
• identically distributed: All from the same population
Example: sampling with replacement ✓ Counter-example: dependent sampling ✗
2️⃣ Finite μ and σ²:
Why needed? • If μ = ∞ → no defined mean • If σ² = ∞ → infinite variance
When is this a problem? In rare cases: • Cauchy distribution (no mean!) • Distributions with "heavy tails"
But in most practical distributions: μ and σ² exist and are finite ✓
3️⃣ n large enough:
We saw in question 3: • Safe: n ≥ 30 • If symmetric: n ≥ 15 • If normal: any n
The more "unusual" the population → the larger n needed
⭐ Summary:
The conditions are quite easily met in practical cases!
The theorem is very robust and powerful
Question 11
10.00 pts
➕ Important property:
If X₁ ~ N(μ₁, σ₁²) and X₂ ~ N(μ₂, σ₂²) are independent,
what is the distribution of X₁ + X₂?
Explanation:
💡 Detailed explanation:
Normal additivity! ➕
📐 The additivity property:
Independent normal variables:
If X₁ ~ N(μ₁, σ₁²) and X₂ ~ N(μ₂, σ₂²)
and X₁, X₂ are independent
then:
X₁ + X₂ ~ N(μ₁+μ₂, σ₁²+σ₂²)
📊 Generalization:
If X₁, X₂, ..., Xₙ are independent
where Xᵢ ~ N(μᵢ, σᵢ²)
then:
ΣXᵢ ~ N(Σμᵢ, Σσᵢ²)
🎯 Special case — sum of n identical:
If X₁, ..., Xₙ ~ N(μ, σ²) identical and independent
then:
S = X₁ + X₂ + ... + Xₙ
S ~ N(nμ, nσ²)
💡 Connection to CLT:
From sum to mean:
x̄ = S/n = (X₁ + ... + Xₙ)/n
If S ~ N(nμ, nσ²)
then x̄ ~ N(nμ/n, nσ²/n²)
= N(μ, σ²/n) ✓
⭐ Important to remember:
Means add: μ₁+μ₂
Variances add: σ₁²+σ₂²
(not standard deviations!)
⚠️ Common mistake:
❌ σ₁+σ₂ (wrong!)
✓ σ₁²+σ₂² (correct!)
Question 12
2.00 pts
➕ :
X₁ ~ N(μ₁, σ₁²) -X₂ ~ N(μ₂, σ₂²) ,
X₁ + X₂?
Explanation:
💡 :
! ➕
📐 :
:
X₁ ~ N(μ₁, σ₁²) -X₂ ~ N(μ₂, σ₂²)
-X₁, X₂
:
Question 13
2.00 pts
🔢 :
x̄ ?
Explanation:
💡 :
x̄ ! 🔢
📐 :
= partial sum n:
x̄ = (X₁+X₂+...+Xₙ)/n
:
Question 14
10.00 pts
🔢 Connection:
How can x̄ be written using a sum?
Explanation:
💡 Detailed explanation:
x̄ as a sum! 🔢
📐 The connection between sum and mean:
Mean = sum divided by n:
x̄ = (X₁+X₂+...+Xₙ)/n
Or in short notation:
x̄ = (1/n)ΣXᵢ
🔄 From sum to mean:
Let: S = ΣXᵢ = X₁+X₂+...+Xₙ
Then: x̄ = S/n
If each Xᵢ ~ N(μ, σ²):
1️⃣ The sum: S ~ N(nμ, nσ²)
2️⃣ The mean: x̄ = S/n ~ N(μ, σ²/n)
📊 Effect of division on the distribution:
If Y ~ N(μ, σ²)
then Y/c ~ N(μ/c, σ²/c²)
So:
S ~ N(nμ, nσ²)
→ S/n ~ N(nμ/n, nσ²/n²)
→ x̄ ~ N(μ, σ²/n) ✓
⭐ Why is this useful?
1️⃣ Conceptual understanding: x̄ is a linear transformation of a sum
2️⃣ Probability computation: Sometimes easier to work with the sum