
The document discusses lossless compression techniques such as Huffman coding. It defines entropy as a measure of uncertainty in a data source, and explains how entropy is calculated based on the probabilities of symbols in the source. Shannon's coding theorem establishes that the minimum average code length cannot be less than the source entropy. Huffman coding assigns variable-length codes to symbols based on their probabilities, with shorter codes for more probable symbols.


Lossless Compression:

Huffman Coding
Mikita Gandhi
Assistant Professor
ADIT
Application
In some applications, such as satellite
image analysis, medical and business
document archival, and medical images
for diagnosis, loss may not be tolerable
and lossless compression techniques
are to be used.
Compression techniques
Some of the popular lossless image
compression techniques in use are:
(a) Huffman coding,
(b) Arithmetic coding,
(c) Ziv-Lempel coding,
(d) Bit-plane coding,
(e) Run-length coding etc.
Source entropy- a measure of
information content
Generation of information is generally
modeled as a random process that has a
probability associated with it.
If P(E) is the probability of an event, its
information content I(E), also known as
self-information, is measured as

I(E) = log(1/P(E)) = -log P(E)

Source entropy- a measure of
information content
If P(E) = 1, that is, the event always
occurs (like saying "The sun rises in the
east"), then we obtain from the above that
I(E) = 0, which means that no information
is associated with it.
The base of the logarithm expresses the
unit of information: if the base is 2, the
unit is bits. For other values m of the base,
the information is expressed in m-ary
units. Unless otherwise mentioned, we
shall be using the base-2 system to
measure information content.
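
As a quick illustration of the definition above (not part of the original slides), the following Python sketch evaluates the self-information I(E) = -log2 P(E) for a few probabilities; a certain event (P(E) = 1) carries zero information:

    import math

    def self_information(p):
        """Self-information of an event with probability p, in bits (base-2 logarithm)."""
        return -math.log2(p)

    for p in (1.0, 0.5, 0.25, 0.125):
        print(f"P(E) = {p:5.3f}  ->  I(E) = {self_information(p):.3f} bits")
    # P(E) = 1.000 gives 0.000 bits; halving the probability adds one more bit each time.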

Source entropy- a measure of
information content
Now, suppose that we have an alphabet of n
symbols {a_i : i = 1, 2, ..., n} having
probabilities of occurrence P(a_1), P(a_2), ...,
P(a_n). If k is the number of source outputs
generated, which is considered to be
sufficiently large, then the average number of
occurrences of the symbol a_i is k P(a_i), and
the average self-information obtained from the
k outputs is given by

-k Σ_{i=1}^{n} P(a_i) log2 P(a_i)

and the average information per source output for
the source z is given by

H(z) = -Σ_{i=1}^{n} P(a_i) log2 P(a_i)

Source entropy- a measure of
information content
The above quantity is defined as the
entropy of the source and measures
the uncertainty of the source. The
relationship between uncertainty and
entropy can be illustrated by a simple
example of two symbols a_1 and a_2,
having probabilities P(a_1) and P(a_2)
respectively. Since the summation of
the probabilities is equal to 1,

P(a_1) + P(a_2) = 1

and, using the entropy equation above, we obtain

H(z) = -P(a_1) log2 P(a_1) - (1 - P(a_1)) log2 (1 - P(a_1))

Source entropy- a measure of
information content
If we plot H(z) versus P(a_1), we obtain
the graph shown on the slide (figure not
reproduced in this extract).

Source entropy- a measure of
information content
It is interesting to note that the entropy
is equal to zero for P(a_1) = 0 and
P(a_1) = 1. These correspond to the
cases where one of the two symbols is
certain to occur. H(z) assumes its
maximum value of 1 bit for P(a_1) = 1/2.
This corresponds to the most uncertain
case, where both symbols are equally
probable.
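
A short Python sketch (mine, not from the slides) that evaluates the binary entropy H(z) = -P(a_1) log2 P(a_1) - (1 - P(a_1)) log2 (1 - P(a_1)) at a few points, confirming the behaviour of the curve described above:

    import math

    def binary_entropy(p):
        """Entropy, in bits, of a two-symbol source with P(a1) = p and P(a2) = 1 - p."""
        if p in (0.0, 1.0):
            return 0.0          # the limit of x * log2(x) as x -> 0 is 0
        return -p * math.log2(p) - (1 - p) * math.log2(1 - p)

    for p in (0.0, 0.1, 0.25, 0.5, 0.75, 1.0):
        print(f"P(a1) = {p:4.2f}  ->  H(z) = {binary_entropy(p):.4f} bits")
    # H(z) is 0 at P(a1) = 0 and P(a1) = 1, and peaks at 1.0000 bit for P(a1) = 0.5.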
Source entropy- a measure of
information content
Example: Measurement of source entropy
If the probabilities of the source symbols are
known, the source entropy can be measured. Say
we have five symbols a_1, a_2, a_3, a_4, a_5
having the probabilities listed on the slide (the
values appear in the slide figure and are not
reproduced in this extract); the source entropy is
then given by

H(z) = -Σ_{i=1}^{5} P(a_i) log2 P(a_i)

evaluated with those probabilities.
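
As an illustration of the calculation (not the slide's own numbers), the sketch below applies H(z) = -Σ P(a_i) log2 P(a_i) to an assumed five-symbol distribution:

    import math

    # Assumed probabilities for a_1 ... a_5 (illustrative only, not the slide's values).
    probs = {"a1": 0.30, "a2": 0.15, "a3": 0.10, "a4": 0.40, "a5": 0.05}

    def source_entropy(p_values):
        """H(z) = -sum of P(a_i) * log2 P(a_i), in bits per symbol."""
        return -sum(p * math.log2(p) for p in p_values if p > 0)

    print(f"H(z) = {source_entropy(probs.values()):.4f} bits/symbol")   # about 2.01 bits/symbol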

Shannon's Coding Theorem for
noiseless channels
We are now going to present a very important
theorem due to Shannon, which expresses the lower
limit of the average code word length of a source in
terms of its entropy. Stated formally, the theorem
says that in any coding scheme, the average
code word length of a source of symbols can at
best be equal to the source entropy and can never
be less than it. The theorem assumes the
coding to be lossless and the channel to be
noiseless.
If m(z) is the minimum of the average code word
lengths obtained over the different uniquely
decipherable coding schemes, then as per
Shannon's theorem, we can state that

m(z) ≥ H(z)

Coding efficiency
The coding efficiency (η) of an
encoding scheme is expressed as the
ratio of the source entropy H(z) to the
average code word length L(z) and is
given by

η = H(z) / L(z)

Since, according to Shannon's coding
theorem, L(z) ≥ H(z), and both L(z) and
H(z) are positive,

0 < η ≤ 1
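
A tiny numeric illustration with assumed values (not from the slides): for the hypothetical five-symbol source used above, the entropy is about 2.01 bits/symbol, and the binary Huffman code constructed for it in a later sketch has an average length of 2.05 bits/symbol, giving

    H_z = 2.01   # assumed source entropy, bits/symbol (illustrative)
    L_z = 2.05   # assumed average code word length, bits/symbol (illustrative)
    print(f"Coding efficiency = {H_z / L_z:.3f}")   # about 0.980, i.e. roughly 98%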

Basic principles of Huffman
Coding
Huffman coding is a popular lossless
Variable Length Coding (VLC) scheme,
based on the following principles:
(a) Shorter code words are assigned to
more probable symbols and longer code
words are assigned to less probable
symbols.
(b) No code word of a symbol is a prefix of
another code word. This makes Huffman
coding uniquely decodable.
(c) Every source symbol must have a
unique code word assigned to it.
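
Principle (b), the prefix property, is easy to check mechanically. The sketch below (not from the slides; the code table is hypothetical) verifies that no code word in a table is a prefix of another:

    # Hypothetical prefix-free code table (illustrative only).
    code_table = {"a1": "00", "a2": "01", "a3": "100", "a4": "101", "a5": "11"}

    def is_prefix_free(codes):
        """Return True if no code word is a prefix of another code word."""
        words = list(codes.values())
        for i, w in enumerate(words):
            for j, v in enumerate(words):
                if i != j and v.startswith(w):
                    return False
        return True

    print(is_prefix_free(code_table))   # True for the table above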

Basic principles of Huffman
Coding
In image compression systems,
Huffman coding is performed on the
quantized symbols. Quite often,
Huffman coding is used in conjunction
with other lossless coding schemes,
such as run-length coding. In terms of
Shannon's noiseless coding theorem,
Huffman coding is optimal for a fixed
alphabet size, subject to the constraint
that the source symbols are coded one
at a time.
Assigning Binary Huffman codes
to a set of symbols
We shall now discuss how Huffman
codes are assigned to a set of source
symbols of known probability. If the
probabilities are not known a priori,
they should be estimated from a
sufficiently large set of samples. The
code assignment is based on a series of
source reductions, and we shall
illustrate this with reference to the
example. The steps are illustrated on
the slides that follow.
Assigning Binary Huffman codes
to a set of symbols
(The step-by-step source reduction and the resulting code
assignment table appear as figures on these slides; their
content is not reproduced in this extract.)
This completes the Huffman code assignment
pertaining to this example. From the table, it is evident
that the shortest code word (length = 1) is assigned to the
most probable symbol a_4 and the longest code words
(length = 4) are assigned to the two least probable
symbols a_3 and a_5. Also, each symbol has a unique
code word and no code word is a prefix of the code word
of another symbol. The coding has therefore fulfilled the
basic requirements of Huffman coding.
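
Because the reduction tables themselves are not reproduced here, the sketch below is my own: it uses Python's heapq with assumed probabilities, chosen so that a_4 is the most probable symbol and a_3 and a_5 the least probable, and it produces a code with the same structure as the result described above (a 1-bit code for a_4 and 4-bit codes for a_3 and a_5):

    import heapq
    from itertools import count

    # Assumed probabilities (illustrative only; the slide's actual values are in its figures).
    probs = {"a1": 0.30, "a2": 0.15, "a3": 0.10, "a4": 0.40, "a5": 0.05}

    def huffman_codes(p):
        """Build a binary Huffman code by repeatedly merging the two least probable groups."""
        order = count()                          # tie-breaker so dicts are never compared
        heap = [(prob, next(order), {sym: ""}) for sym, prob in p.items()]
        heapq.heapify(heap)
        while len(heap) > 1:
            p1, _, group1 = heapq.heappop(heap)  # least probable group
            p2, _, group2 = heapq.heappop(heap)  # second least probable group
            merged = {s: "0" + c for s, c in group1.items()}
            merged.update({s: "1" + c for s, c in group2.items()})
            heapq.heappush(heap, (p1 + p2, next(order), merged))
        return heap[0][2]

    codes = huffman_codes(probs)
    for sym in sorted(codes):
        print(sym, codes[sym])
    avg_len = sum(probs[s] * len(c) for s, c in codes.items())
    print(f"Average code word length = {avg_len:.2f} bits/symbol")   # 2.05 for these probabilities

With these assumed probabilities the code words come out as a4 -> 0, a1 -> 10, a2 -> 110, a5 -> 1110, a3 -> 1111, which satisfies the prefix property and matches the length pattern noted above.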

Encoding a string of symbols
using Huffman codes
Decoding a Huffman coded bit
stream
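
The worked encoding and decoding examples on these two slides are given as figures. As a sketch of the procedure (mine, reusing the code table obtained from the assumed probabilities above), encoding simply concatenates code words, and decoding walks the bit stream, emitting a symbol each time a complete code word is matched; this is unambiguous because the code is prefix-free:

    # Code table from the assumed probabilities used earlier (not the slide's table).
    codes = {"a1": "10", "a2": "110", "a3": "1111", "a4": "0", "a5": "1110"}

    def encode(symbols, codes):
        """Concatenate the code words of the symbols into one bit string."""
        return "".join(codes[s] for s in symbols)

    def decode(bits, codes):
        """Scan the bit stream and emit a symbol whenever a complete code word is matched."""
        reverse = {c: s for s, c in codes.items()}
        symbols, current = [], ""
        for b in bits:
            current += b
            if current in reverse:
                symbols.append(reverse[current])
                current = ""
        return symbols

    message = ["a4", "a1", "a4", "a2", "a3"]
    bits = encode(message, codes)
    print(bits)                   # 01001101111  (0 | 10 | 0 | 110 | 1111)
    print(decode(bits, codes))    # ['a4', 'a1', 'a4', 'a2', 'a3']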
Questions?
1) Define the entropy of a source of
symbols.
2) How is entropy related to
uncertainty?
3) State Shannon's coding theorem on
noiseless channels.
4) Define the coding efficiency of an
encoding scheme.
5) State the basic principles of Huffman
coding.
Multiple Choice
The entropy of a source of symbols is
dependent upon
(A) The number of source outputs
generated.
(B) The average codeword length.
(C) The probabilities of the source
symbols.
(D) The order in which the source
outputs are generated.
Multiple Choice
We have two sources of symbols and wish to
compare their entropies. Source-1 has three
symbols a_1, a_2 and a_3 with a given set of
probabilities. Source-2 also has three symbols
a_1, a_2 and a_3, but with its own set of
probabilities. (The probability values are shown
in the slide figures and are not reproduced in
this extract.)
(A) Entropy of source-1 is higher than that of
source-2.
(B) Entropy of source-1 is lower than that of
source-2.
(C) Entropy of source-1 and source-2 are the
same.
(D) It is not possible to compute the entropies
from the given data.

Multiple Choice
Shannon's coding theorem on
noiseless channels provides us with
(A) A lower bound on the average
codeword length.
(B) An upper bound on the average
codeword length
(C) A lower bound on the source
entropy.
(D) An upper bound on the source
entropy.
Multiple Choice
Which one of the following is not true for
Huffman coding?
(A) No codeword of an elementary symbol
is a prefix of the codeword of another
elementary symbol.
(B) Each symbol has a one-to-one
mapping with its corresponding
codeword.
(C) The symbols are encoded as a group,
rather than encoding one symbol at a
time.
(D) Shorter code words are assigned to
more probable symbols.
Multiple Choice
Which of the following must be
ensured before assigning binary
Huffman codes to a set of symbols?
(A) The channel is noiseless.
(B) There must be exactly 2^n symbols
to encode.
(C) No two symbols should have the
same probability.
(D) The probabilities of the symbols
should be known a priori.
(Further multiple-choice questions, a practice problem,
and its solution appear on the remaining slides; their
content is not reproduced in this extract.)
