Data Analysis Slides
https://www.probabilitycourse.com/
Introduction
\[ \bar{X} = \frac{X_1 + X_2 + \cdots + X_n}{n} \]
Sample Mean
▶ Variance:
\[ \mathrm{Var}(\bar{X}) = \frac{\mathrm{Var}(X_1 + X_2 + \cdots + X_n)}{n^2} = \frac{\mathrm{Var}(X)}{n} \]
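A quick simulation can make the 1/n scaling concrete. The sketch below assumes X_i ~ Uniform(0, 1), for which Var(X) = 1/12; that distribution, the function name sample_mean_variance, and the trial count are illustrative choices, not part of the slides.

```python
import random
import statistics

def sample_mean_variance(n, trials=20000, seed=0):
    """Estimate Var(X-bar) by simulation for X_i ~ Uniform(0, 1)."""
    rng = random.Random(seed)
    # Each entry is one realization of the sample mean of n draws.
    means = [sum(rng.random() for _ in range(n)) / n for _ in range(trials)]
    return statistics.pvariance(means)

var_x = 1 / 12  # variance of a Uniform(0, 1) random variable (assumed distribution)
for n in (1, 5, 25):
    # Columns: n, simulated Var(X-bar), theoretical Var(X) / n
    print(n, sample_mean_variance(n), var_x / n)
```

The two printed columns should agree up to simulation noise, and both shrink in proportion to 1/n.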
Weak Law of Large Numbers (WLLN)
\[ \lim_{n \to \infty} P(|\bar{X} - \mu| \ge \epsilon) = 0. \]
\[ P(|\bar{X} - \mu| \ge \epsilon) \le \frac{\mathrm{Var}(X)}{n \epsilon^2} \]
▶ This goes to zero as n → ∞.
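The bound and the actual tail probability can be compared numerically. This is a minimal sketch that assumes X_i ~ Uniform(0, 1), so that µ = 0.5 and Var(X) = 1/12; the function names, ϵ = 0.05, and the values of n are hypothetical choices made for illustration.

```python
import random

def tail_probability(n, eps, trials=2000, seed=1):
    """Estimate P(|X-bar - mu| >= eps) for X_i ~ Uniform(0, 1), where mu = 0.5."""
    rng = random.Random(seed)
    hits = 0
    for _ in range(trials):
        xbar = sum(rng.random() for _ in range(n)) / n
        if abs(xbar - 0.5) >= eps:
            hits += 1
    return hits / trials

def chebyshev_bound(n, eps, var_x=1 / 12):
    """Upper bound Var(X) / (n * eps^2), capped at 1 because it bounds a probability."""
    return min(1.0, var_x / (n * eps ** 2))

eps = 0.05  # illustrative tolerance
for n in (10, 100, 1000):
    # Columns: n, simulated tail probability, upper bound
    print(n, tail_probability(n, eps), chebyshev_bound(n, eps))
```

The simulated probability stays below the bound, and both decrease as n grows. The bound is typically loose, which is fine for the WLLN because only its limit matters.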
Central Limit Theorem (Slide 1)
▶ The central limit theorem (CLT) is one of the most important
results in probability theory.
\[ \bar{X} = \frac{X_1 + X_2 + \cdots + X_n}{n} \]
Central Limit Theorem (Slide 2)
\[ Z_n = \frac{\bar{X} - \mu}{\sigma / \sqrt{n}} = \frac{X_1 + X_2 + \cdots + X_n - n\mu}{\sqrt{n}\,\sigma} \]
Central Limit Theorem
▶ The Central Limit Theorem (CLT)
\[ Z_n = \frac{\bar{X} - \mu}{\sigma / \sqrt{n}} = \frac{X_1 + X_2 + \cdots + X_n - n\mu}{\sqrt{n}\,\sigma} \]
\[ Z_n = \frac{Y_n - np}{\sqrt{np(1 - p)}} \]
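The binomial form of Z_n can be checked by simulation. The sketch below assumes Y_n is a sum of n independent Bernoulli(p) variables and compares the empirical distribution of Z_n with the standard normal CDF Φ. The values n = 400 and p = 0.3, and both function names, are illustrative and not taken from the slides.

```python
import math
import random
from statistics import NormalDist

def standardized_binomial(n, p, rng):
    """Draw Y_n = X_1 + ... + X_n with X_i ~ Bernoulli(p) and return Z_n."""
    y = sum(1 for _ in range(n) if rng.random() < p)
    return (y - n * p) / math.sqrt(n * p * (1 - p))

def empirical_cdf_at(z, n, p, trials=3000, seed=2):
    """Fraction of simulated Z_n values that are <= z."""
    rng = random.Random(seed)
    return sum(standardized_binomial(n, p, rng) <= z for _ in range(trials)) / trials

n, p = 400, 0.3  # illustrative values, not from the slides
for z in (-1.0, 0.0, 1.0):
    # Columns: z, simulated P(Z_n <= z), Phi(z)
    print(z, empirical_cdf_at(z, n, p), NormalDist().cdf(z))
```

The simulated values should sit close to Φ(z). Small gaps remain because Y_n is discrete and the number of trials is finite.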
Understanding the Central Limit Theorem
▶ The importance of the CLT stems from the fact that, in many
real applications, a certain random variable of interest is a
sum of a large number of independent random variables.
▶ Examples include:
▶ Laboratory measurement errors modeled by normal random
variables.
▶ Gaussian noise in communication and signal processing.
▶ Percentage changes in asset prices modeled by normal random
variables.
▶ Random sampling from a population to obtain statistical
knowledge.
▶ The CLT simplifies computations significantly, especially when
dealing with sums of a large number of i.i.d. random variables.
▶ A common rule of thumb is that the normal approximation is
good when n ≥ 30, although the n actually required depends on
the distribution of the Xi.
Applying the Central Limit Theorem (CLT)
Finding Mean and Variance
\[ E[Y] = n\mu, \qquad \mathrm{Var}(Y) = n\sigma^2 \]
\[ \frac{Y - E[Y]}{\sqrt{\mathrm{Var}(Y)}} = \frac{Y - n\mu}{\sqrt{n}\,\sigma} \]
Finding Probability Using CLT
Problem Statement
Solution
\[ E[Y] = 50 \cdot E[X_i] = 50 \cdot 2 = 100 \]
Finding P(90 < Y < 110)
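The standardization step can be sketched in code. Here n = 50 and E[X_i] = 2 come from the slides, but the standard deviation of X_i does not survive in this text, so σ = 1 below is an assumed value chosen only to make the example runnable. The helper name clt_interval_probability is also hypothetical.

```python
import math
from statistics import NormalDist

def clt_interval_probability(n, mu, sigma, low, high):
    """Approximate P(low < Y < high) for Y = X_1 + ... + X_n via the CLT."""
    mean_y = n * mu                # E[Y] = n * mu
    sd_y = math.sqrt(n) * sigma    # sqrt(Var(Y)) = sqrt(n) * sigma
    phi = NormalDist().cdf         # standard normal CDF, i.e. Phi
    return phi((high - mean_y) / sd_y) - phi((low - mean_y) / sd_y)

# n = 50 and mu = 2 are from the slides; sigma = 1 is an ASSUMPTION.
prob = clt_interval_probability(n=50, mu=2, sigma=1, low=90, high=110)
print(round(prob, 4))
```

Under the assumed σ = 1 the approximation is Φ(√2) − Φ(−√2), about 0.8427. A different σ changes the answer, so the number should be read as conditional on that assumption.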
Conclusion
Defining the Problem
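Let us define X_i as the indicator random variable for the i-th bit in
the packet. That is, X_i = 1 if the i-th bit is received in error and
X_i = 0 otherwise, so that
\[ X_i \sim \mathrm{Bernoulli}(p = 0.1). \]
The total number of bit errors in the packet is then
\[ Y = X_1 + X_2 + \cdots + X_n. \]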
Mean and Variance of Xi
Using the Central Limit Theorem
Final Probability
Therefore, the probability that there are more than 120 errors in
the data packet is approximately 1.75%.
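The figure of about 1.75% can be reproduced with a short calculation. The per-bit error probability p = 0.1 is from the slides; the packet length is not stated in this text, so n = 1000 below is an assumption, chosen because it is consistent with the quoted result.

```python
import math
from statistics import NormalDist

n = 1000   # ASSUMED packet length in bits (not stated in the slides)
p = 0.1    # per-bit error probability, from X_i ~ Bernoulli(p = 0.1)

mean_y = n * p                      # E[Y] = n p
sd_y = math.sqrt(n * p * (1 - p))   # sqrt(Var(Y)) = sqrt(n p (1 - p))

# CLT approximation: P(Y > 120) is about 1 - Phi((120 - E[Y]) / sd(Y)).
prob_more_than_120 = 1 - NormalDist().cdf((120 - mean_y) / sd_y)
print(f"{prob_more_than_120:.4f}")
```

With these inputs the standardized threshold is 20/√90, and the approximation comes out near 0.0175.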
Z-Table or Standard Normal Table
\[ Y = X_1 + X_2 + \cdots + X_n. \]
Expectation and Variance
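\[ = P\left( \frac{8 - 10}{\sqrt{5}} < \frac{Y - n\mu}{\sqrt{n}\,\sigma} < \frac{10 - 10}{\sqrt{5}} \right) \approx \Phi(0) - \Phi\left( -\frac{2}{\sqrt{5}} \right) = 0.3145. \]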
Exact Probability Calculation
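For comparison with the normal approximation, the exact probability can be computed directly from the binomial PMF. The sketch assumes Y ~ Binomial(n = 20, p = 1/2), parameters inferred from E[Y] = 10 and Var(Y) = 5 rather than stated in this text, and takes the event to be 8 ≤ Y ≤ 10. The helper name binomial_pmf is hypothetical.

```python
import math
from statistics import NormalDist

n, p = 20, 0.5   # ASSUMED: chosen so that n p = 10 and n p (1 - p) = 5

def binomial_pmf(k, n, p):
    """P(Y = k) for Y ~ Binomial(n, p)."""
    return math.comb(n, k) * p ** k * (1 - p) ** (n - k)

# Exact P(8 <= Y <= 10): add the PMF over k = 8, 9, 10.
exact = sum(binomial_pmf(k, n, p) for k in range(8, 11))

# CLT approximation using the endpoints 8 and 10 without any correction.
sd_y = math.sqrt(n * p * (1 - p))
phi = NormalDist().cdf
plain_clt = phi((10 - n * p) / sd_y) - phi((8 - n * p) / sd_y)

print(f"exact = {exact:.4f}, CLT without correction = {plain_clt:.4f}")
```

Under these assumptions the exact value is about 0.4565, noticeably larger than the uncorrected approximation of about 0.3145.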
Approximation Error
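\[ = P\left( \frac{7.5 - 10}{\sqrt{5}} < \frac{Y - n\mu}{\sqrt{n}\,\sigma} < \frac{10.5 - 10}{\sqrt{5}} \right) \approx \Phi\left( \frac{0.5}{\sqrt{5}} \right) - \Phi\left( -\frac{2.5}{\sqrt{5}} \right) = 0.4567. \]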
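The effect of the continuity correction can be checked numerically. The sketch uses E[Y] = 10 and Var(Y) = 5 as they appear in the standardized expressions; the helper name normal_interval is a hypothetical choice.

```python
import math
from statistics import NormalDist

mean_y, var_y = 10, 5     # E[Y] and Var(Y) as used in the standardized expressions
sd_y = math.sqrt(var_y)
phi = NormalDist().cdf    # standard normal CDF, i.e. Phi

def normal_interval(low, high):
    """Phi((high - E[Y]) / sd) - Phi((low - E[Y]) / sd)."""
    return phi((high - mean_y) / sd_y) - phi((low - mean_y) / sd_y)

without_correction = normal_interval(8, 10)     # endpoints used as-is
with_correction = normal_interval(7.5, 10.5)    # each endpoint moved outward by 0.5

print(f"{without_correction:.4f} {with_correction:.4f}")
```

Moving each endpoint outward by 0.5 accounts for the probability mass sitting on the integers 8 and 10, which is why the corrected value of about 0.4567 is so much larger than the uncorrected one.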
Application of Continuity Correction
Continuity Correction for Discrete Random Variables
Let X_1, X_2, ..., X_n be independent discrete random variables and
let
\[ Y = X_1 + X_2 + \cdots + X_n. \]
Finding Probability Using CLT