EE 533 Information Theory: Barı S Nakibo Glu
EE 533 Information Theory: Barı S Nakibo Glu
Lecture 01
Barış Nakiboğlu
2/10
Definition (σ-algebra)
A set F is a σ-algebra of subsets of Ω iff it satisfies the following three conditions
i) Ω ∈ F
ii) Ω \ E ∈ F for all E ∈ F.
iii) If En ∈ F for all n ∈ Z+ , then ∪∞
i=1 En ∈ F.
Note that
I F is closed under countable unions, intersections, and complements.
I If the third condition holds for finite collection of sets rather than countable then F is an
algebra.
I Every σ-algebra is an algebra. But not every algebra is a σ-algebra.
I Third condition is for countable collections not arbitrary collections.
Definition
(Ω, F, P) is a probability space iff F is σ-algebra of subsets of Ω and P is a probability measure
on (Ω, F).
Ω Sample space, certain event
ω : ω ∈ Ω Outcomes, sample points, elementary events.
F σ-algebra of events, set of events
4/10
Axioms of Probability.
i) P(E) ≥ 0 for all E ∈ F.
ii) P(Ω) = 1
iii) If Ei ∩ Ej = ∅ for all i, j ∈ Z+ s.t. i 6= j then
[∞ X∞
P Ei = P(Ei ) .
i=1 i=1
Definition
The Borel σ-algebra on R, denoted by B(R), is the σ-algebra generated by open intervals on the
real line.
Since we assign probabilities to all open intervals via B(R), we know we can describe continuous
random variables using B(R).
On the other hand {x} ∈ B(R) for all x ∈ R because {x} is in the intersection of all open
intervals containing it that has rational end points.
Thus we can use B(R) to describe discrete random variables using B(R).
6/10
Definition
On a probability space (Ω, F, P) a function X : Ω → R is a random variable iff X−1 (A) ∈ F for
all A ∈ B(R), i.e. X is (F, B(R))-measurable.
I In plain English: for all events about X defined by open sets on the real line, I can find an
event whose probability I can calculate under P.
Thus (Ω, F, P) and X induce a probability measure P e on (R, B(R)) via
:= P X−1 (A)
P(A)
e ∀A ∈ B(R).
I If there exists a countable X ⊂ R s.t. P(X ∈ X) = 1, then X is a discrete random variable.
P(X = x) = pX (x) ∀x ∈ X.
I In the probability theory one is often interested in the random variables and algebraic
operations are performed on the random variables themselves.
I In information theory one often works with the distributions of the random variables and
algebraic operations are often performed on the distributions of the random variables
themselves.
I Since we are perform algebraic operations on the distributions of the random variables, we
need to work with the associated vector space structure.
I Note that
I For a discrete random variable X, its pmf, i.e. pX (X), is itself a discrete random variable.
I For a continuous random variable X, its pdf, i.e. fX (X), in not necessarily a continuous random
variable! (Consider a random variable that is uniformly distributed on [0, 1].)
I How about the cdf’s, i.e. FX (x) :=P(X ≤ x)’s? Can we make similar assertions?
8/10
Vector Spaces
10/10