CST 446 DATA COMPRESSION TECHNIQUES
SYLLABUS
Module-1
Modelling and Types of Compression
Introduction to Compression Techniques- Lossy compression &
Lossless compression, Measures of Performance, Modeling and coding.
Mathematical modelling for Lossless and lossy compression - Physical
models and probability models.
Data compression
• Data compression algorithms are used to reduce the number of bits
required to represent text, images, video sequences, or music
• The art or science of representing information in a compact form.
• compact representations are created by identifying and using structures
that exist in the data.
• early example of data compression is Morse code, developed by Samuel
Morse in the mid-19th century.
• Letters sent by telegraph are encoded with dots and dashes.
• Morse assigned shorter sequences to letters that occur more frequently, such
as e (·) and a (· −), and longer sequences to letters that occur less
frequently, such as q (− − · −) and j (· − − −).
Compression Techniques
• A compression algorithm takes an input x and generates a representation xc
that requires fewer bits
• A reconstruction algorithm operates on the compressed representation xc
to generate the reconstruction y
Compression Techniques
• Based on the requirements of reconstruction, data compression
schemes can be divided into two broad classes:
• lossless compression schemes, in which y is identical to x, and
• lossy compression schemes, which generally provide much higher
compression than lossless compression but allow y to be different
from x
Compression Techniques
• Lossless Compression
• involve no loss of information
• the original data can be recovered exactly from the compressed data
• used for applications that cannot tolerate any difference between
the original and reconstructed data
• Text compression is an important area for lossless compression
• Consider the sentences “Do not send money” and “Do now send
money.” A single changed character reverses the meaning, so even a tiny
reconstruction error in text can be unacceptable
• a radiological image
• Data obtained from satellites
Compression Techniques
• Lossy Compression
• involve some loss of information
• data cannot be recovered or reconstructed exactly
• higher compression ratios than in lossless compression
• Applications in which lack of exact reconstruction is not a
problem
• when storing or transmitting speech, the exact value of each sample is
not necessary
• reconstruction of a video sequence
Compression Techniques
• Measures of Performance
• A compression algorithm can be evaluated in a number of ways:
• the relative complexity of the algorithm
• the memory required to implement the algorithm
• how fast the algorithm performs on a given machine
• the amount of compression
• how closely the reconstruction resembles the original
Compression Techniques
• Measures of Performance
• compression ratio
• ratio of the number of bits required to represent the data before
compression to the number of bits required to represent the data after
compression
• Storing an image made up of a square array of 256×256 pixels requires
65,536 bytes.
• the compressed version requires 16,384 bytes.
• the compression ratio is 4:1.
• We can also express the compression achieved as the reduction in the amount
of data required, stated as a percentage of the size of the original data. In
this particular example, the size has been reduced by 75%.
Compression Techniques
• Measures of Performance
• Rate
• average number of bits required to represent a single sample
• if the original image uses 8 bits (one byte) per pixel, the average number of
bits per pixel in the compressed representation is 16,384 × 8 / 65,536 = 2.
Thus, we would say that the rate is 2 bits per pixel.
• Bit rate measures the average number of bits required to
represent one unit of information. A lower bit rate indicates
more efficient compression.
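Since the ratio, the percentage reduction, and the rate all come from the same
arithmetic, a short Python sketch of the 256×256 example may help (the
variable names are ours, not from the slides):

original_bytes = 256 * 256        # 65,536 bytes, one byte per pixel
compressed_bytes = 16_384

ratio = original_bytes / compressed_bytes                      # 4.0, i.e. 4:1
reduction_pct = 100 * (1 - compressed_bytes / original_bytes)  # 75.0 %
rate_bpp = compressed_bytes * 8 / (256 * 256)                  # 2.0 bits per pixel

print(f"compression ratio {ratio:.0f}:1, "
      f"reduction {reduction_pct:.0f}%, rate {rate_bpp:.1f} bits/pixel")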
Compression Techniques
Measures of Performance
• Time Complexity:
• Time complexity is a measure of the amount of time an algorithm takes to
complete as a function of the size of its input.
• It provides an estimation of how the algorithm's running time grows with the size
of the input.
• Lower time complexity is generally desirable, especially for real-time
applications.
• Space Complexity:
• This measures the amount of memory required by the compression algorithm.
Lower space complexity is preferable, particularly in resource-constrained
environments.
Compression Techniques
• Measures of Performance
• In lossy compression, the reconstruction differs from the original data
• To determine the efficiency of a compression algorithm, we have to quantify the difference
• The difference between the original and the reconstruction is called the distortion.
• In compression of speech and video, the final arbiter of quality is human.
• Approximate measures of distortion are used to determine the quality of the
reconstructed waveforms
• Fidelity and quality are two terms used to describe distortion
• When we say that the fidelity or quality of a reconstruction is high, we mean that the
difference between the reconstruction and the original is small
• Fidelity specifically relates to the accuracy of the compressed representation compared to
the original, while quality encompasses a broader set of criteria, including fidelity, and
may involve subjective judgments based on human perception and preferences.
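The slides do not name a specific approximate distortion measure; mean squared
error (MSE) is one common choice, sketched below under that assumption:

def mse(original, reconstruction):
    # Mean squared error: the average squared sample-wise difference.
    assert len(original) == len(reconstruction)
    return sum((x - y) ** 2 for x, y in zip(original, reconstruction)) / len(original)

# A toy lossy scheme: round every sample to the nearest even value.
original = [3, 7, 2, 9, 4]
reconstruction = [4, 8, 2, 8, 4]
print(mse(original, reconstruction))  # 0.6 -- small distortion, high fidelity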
Compression Techniques
• Modeling and Coding
• The exact compression scheme we use will depend on the characteristics of
the data to be compressed, in particular on the redundancies inherent in
the data.
• The development of data compression algorithms for a variety of data can be
divided into two phases.
• The first phase is modeling.
• extract information about any redundancy that exists in the data and describe the redundancy
in the form of a model.
• The second phase is called coding.
• A description of the model and a “description” of how the data differ from the model are
encoded, using a binary alphabet.
• The difference between the data and the model is referred to as the residual
Compression Techniques
• Modeling and Coding
• Example 1
• Consider a sequence of numbers x₁, x₂, x₃, …. If successive values are close
to one another, the encoder can send the first value followed by the
difference between each value and the one before it (the residual).
• The decoder adds each received value to the previous decoded value to obtain
the reconstruction, as the sketch after this list shows.
• Techniques that use the past values of a sequence to predict the current value
and then encode the error in prediction, or residual, are called predictive
coding schemes.
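A minimal Python sketch of this idea, using simple differencing as the
predictor (the example sequence is illustrative, not from the slides):

def delta_encode(xs):
    # Send the first value as-is, then each difference from its predecessor.
    return [xs[0]] + [cur - prev for prev, cur in zip(xs, xs[1:])]

def delta_decode(residuals):
    # Add each received value to the previous decoded value.
    xs = [residuals[0]]
    for r in residuals[1:]:
        xs.append(xs[-1] + r)
    return xs

xs = [9, 11, 11, 11, 14, 13, 15, 17, 16, 17, 20, 21]
enc = delta_encode(xs)          # residuals are small, so they need fewer bits
assert delta_decode(enc) == xs  # lossless: the reconstruction is exact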
Compression Techniques
• Example 3:
• Suppose we have the following sequence:
abarayaranbarraybranbfarbfaarbfaaarbaway
• sequence is made up of eight different symbols
• to represent eight symbols, we need to use 3 bits per symbol
• Some symbols occur more often than others
• Assign binary codes of different lengths to different symbols.
Compression Techniques
• If we substitute the codes for each symbol, we will use 106 bits to encode the
entire sequence. As there are 41 symbols in the sequence, this works out to
approximately 2.58 bits per symbol. Compared with the 41 × 3 = 123 bits needed by
the fixed-length code, this gives a compression ratio of about 1.16:1. A sketch of
one way to build such a code follows.
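The slides do not show the code table that was used; a Huffman code is one
standard way to construct such variable-length codes. A minimal sketch (the
exact bit counts depend on the code table and on the sequence as printed):

import heapq
from collections import Counter
from itertools import count

def huffman_code(text):
    # Start with one single-symbol "tree" per symbol, weighted by frequency.
    tie = count()  # unique tiebreaker so the heap never compares dicts
    heap = [(n, next(tie), {sym: ""}) for sym, n in Counter(text).items()]
    heapq.heapify(heap)
    # Repeatedly merge the two lightest trees, prefixing 0/1 to their codewords.
    while len(heap) > 1:
        n1, _, c1 = heapq.heappop(heap)
        n2, _, c2 = heapq.heappop(heap)
        merged = {s: "0" + w for s, w in c1.items()}
        merged.update({s: "1" + w for s, w in c2.items()})
        heapq.heappush(heap, (n1 + n2, next(tie), merged))
    return heap[0][2]

seq = "abarayaranbarraybranbfarbfaarbfaaarbaway"
code = huffman_code(seq)
bits = sum(len(code[s]) for s in seq)
print(f"{bits} bits total, {bits / len(seq):.2f} bits/symbol, "
      f"ratio {3 * len(seq) / bits:.2f}:1 versus a 3-bit fixed code")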
Mathematical modelling for Lossless
Compression
• Physical Models
• Physics of the data generation process
• In speech-related applications, knowledge about the physics of speech
production can be used to construct a mathematical model for the sampled
speech process
• Models for certain telemetry data can also be obtained through knowledge of
the underlying process
• If residential electrical meter readings taken at hourly intervals were to be
coded, knowledge about the living habits of the populace could be used to
predict when electricity usage would be high and when it would be low.
• Instead of the actual readings, the difference (residual) between the actual
readings and those predicted by the model could be coded.
Mathematical modelling for Lossless
Compression
• Probability Models
• Ignorance model
• assume that each letter that is generated by the source is independent
of every other letter, and each occurs with the same probability
• Probability model
• assume that each letter that is generated by the source is independent
of every other letter, and each occurs with a different probability
• For a source that generates letters from an alphabet A = {a₁, a₂, …, a_M},
we can have a probability model P = {P(a₁), P(a₂), …, P(a_M)}.
Mathematical modelling for Lossy
Compression
• When modeling sources in order to design or analyze lossy
compression schemes, we look more to the general rather than exact
correspondence.
• Certain probability distribution functions are more analytically
tractable than others, and we try to match the distribution of the source
with one of these “nice” distributions.
Mathematical modelling for Lossy
Compression
• Probability Models
• method for characterizing a particular source
• we look more to the general rather than exact correspondence
• Four probability models
• Uniform distribution
• Gaussian distribution
• Laplacian distribution
• Gamma distribution
Mathematical modelling for Lossy
Compression
• Uniform Distribution:
• This is an ignorance model.
• If we do not know anything about the distribution of the source output, except
possibly the range of values, we can use the uniform distribution to model the
source.
• For example, a random variable X may have a uniform distribution over (−2, 2).
• The probability density function for a random variable uniformly distributed
between a and b is
f_X(x) = 1/(b − a) for a ≤ x ≤ b, and 0 otherwise
Mathematical modelling for Lossy
Compression
• Gaussian Distribution
• most commonly used probability models
• it is mathematically tractable
• Most data points cluster toward the middle of the range, while the rest taper
off symmetrically toward either extreme.
• The middle of the range is also known as the mean of the distribution.
• The probability density function for a random variable with a Gaussian
distribution, mean μ, and variance σ² is
f_X(x) = (1/√(2πσ²)) e^(−(x−μ)²/(2σ²))
Mathematical modelling for Lossy
Compression
• Laplacian Distribution
• Many sources that we deal with have distributions that are quite peaked at
zero.
• For example, speech consists mainly of silence. Therefore, samples of speech
will be zero or close to zero with high probability.
• In images there is a high degree of correlation among neighboring pixels.
Therefore, a large number of the pixel-to-pixel differences will have values
close to zero.
• The probability density function for a zero-mean random variable with a
Laplacian distribution and variance σ² is
f_X(x) = (1/(√2 σ)) e^(−√2|x|/σ)
Mathematical modelling for Lossy
Compression
• Gamma Distribution
• A distribution that is even more peaked at zero, and considerably less
tractable, than the Laplacian.
• The probability density function for a Gamma-distributed random variable with
zero mean and variance σ² is
f_X(x) = (3^(1/4) / √(8πσ|x|)) e^(−√3|x|/(2σ))
Mathematical modelling for Lossy
Compression
• The shapes of these four distributions, assuming a mean of zero and a
variance of one: the uniform density is flat over its range, the Gaussian is
bell-shaped, and the Laplacian and Gamma densities are increasingly peaked
at zero.
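A small Python sketch evaluating the four density functions given above,
assuming zero mean and unit variance (the Gamma density diverges at x = 0, so
it is evaluated only at nonzero points):

import math

def uniform_pdf(x, a=-math.sqrt(3), b=math.sqrt(3)):
    # (b - a)**2 / 12 = 1, so this uniform density has unit variance.
    return 1 / (b - a) if a <= x <= b else 0.0

def gaussian_pdf(x, sigma=1.0):
    return math.exp(-x * x / (2 * sigma**2)) / math.sqrt(2 * math.pi * sigma**2)

def laplacian_pdf(x, sigma=1.0):
    return math.exp(-math.sqrt(2) * abs(x) / sigma) / (math.sqrt(2) * sigma)

def gamma_pdf(x, sigma=1.0):
    # Diverges at x = 0; only evaluate at nonzero x.
    return (3 ** 0.25 / math.sqrt(8 * math.pi * sigma * abs(x))
            * math.exp(-math.sqrt(3) * abs(x) / (2 * sigma)))

for x in (0.1, 0.5, 1.0, 2.0):
    print(x, uniform_pdf(x), gaussian_pdf(x), laplacian_pdf(x), gamma_pdf(x))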
Mathematical Preliminaries for Lossless and
Lossy Compression
• Information Theory
• Shannon defined a quantity called self-information
• Suppose we have an event A, which is a set of outcomes of some random experiment.
• If P(A) is the probability that the event A will occur, then the self-information
associated with A is given by
i(A) = log₂(1/P(A)) = −log₂ P(A) bits
• The average self-information associated with the source output is called the
first-order entropy:
H = −Σᵢ P(aᵢ) log₂ P(aᵢ) bits per symbol
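A minimal sketch estimating letter probabilities from a sequence and computing
self-information and first-order entropy (the sequence is the one from Example 3):

import math
from collections import Counter

def first_order_entropy(text):
    # H = -sum_i P(a_i) * log2 P(a_i), estimated from letter frequencies.
    n = len(text)
    return -sum((c / n) * math.log2(c / n) for c in Counter(text).values())

seq = "abarayaranbarraybranbfarbfaarbfaaarbaway"
p_a = seq.count("a") / len(seq)
print(f"i(a) = {-math.log2(p_a):.2f} bits")               # self-information of 'a'
print(f"H = {first_order_entropy(seq):.2f} bits/symbol")  # first-order entropy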
• Markov Models
• the probability of the next letter is heavily influenced by the preceding letters
• It is assumed that future states depend only on the current state
• The use of the Markov model does not require the assumption of linearity. For example,
consider a binary image.
• The image has only two types of pixels, white pixels and black pixels.
• We know that the appearance of a white pixel as the next observation depends, to some
extent, on whether the current pixel is white or black.
• For models used in lossless compression, we use a specific type of Markov
process called a discrete time Markov chain. A sequence is said to follow a
kth-order Markov model if
P(xₙ | xₙ₋₁, …, xₙ₋ₖ) = P(xₙ | xₙ₋₁, …, xₙ₋ₖ, …)
• that is, knowledge of the past k symbols is equivalent to knowledge of the
entire past history of the process.
• Markov Models
• Suppose we have already processed the string precedin and are going to
encode the next letter.
• If we take no account of the context and treat each letter as a surprise, the
probability of the letter g occurring is relatively low.
• If we use a first-order Markov model or single-letter context (that is, we look
at the probability model given n), we can see that the probability of g would
increase substantially.
• As we increase the context size (go from n to in to din and so on), the
probability of the alphabet becomes more and more skewed, which results in
lower entropy.
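A small sketch of a single-letter-context (first-order Markov) model, estimated
by counting letter pairs; the sample string is ours, purely for illustration:

from collections import Counter, defaultdict

def conditional_probs(text):
    # Estimate P(next letter | previous letter) from adjacent-pair counts.
    pair_counts = defaultdict(Counter)
    for prev, cur in zip(text, text[1:]):
        pair_counts[prev][cur] += 1
    return {
        prev: {cur: n / sum(cnt.values()) for cur, n in cnt.items()}
        for prev, cnt in pair_counts.items()
    }

model = conditional_probs("preceding letters sharpen the prediction of g")
# P(g | n) in this tiny sample is much higher than P(g) overall,
# which is what makes context-based coding effective.
print(model.get("n", {}))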
• Composite Source Model
• In many applications, it is not easy to use a single model to describe the
source.
• In such cases, we can define a composite source, which can be viewed as a
combination or composition of several sources, with only one source being
active at any given time.
Thank You