3-Discrete Cosine Transform PDF

The document discusses the discrete cosine transform (DCT), its characteristics and applications. It explains that DCT expresses data points as a sum of cosine functions, decorrelates input signals, and has fast implementations. DCT-II is most commonly used due to its strong energy compaction property and performance close to the optimal Karhunen-Loève transform. DCT-II forms the basis of many image and video compression standards like JPEG.

Uploaded by

Hitesh

By

Prof (Dr.) P P Ghadekar

OUTLINE
 On completion, students will be able to understand:
 Basic concepts of the DCT
 Different types of DCT
 JPEG coding
 Applications of the DCT

Transform Characteristic
 Transforms are useful entities that encapsulate these characteristics
 Data decorrelation
 Data-independent basis functions
 Fast implementation
 Linearity
 Orthogonality

Data decorrelation
 The ideal transform completely decorrelates the data in a
sequence/block; i.e., it packs the largest amount of energy into the
fewest coefficients.
 In this way, many coefficients can be discarded after quantization
and prior to encoding.
 It is important to note that the transform operation itself does not
achieve any compression.
 It aims at decorrelating the original data and compacting a large
fraction of the signal energy into relatively few transform
coefficients.

Fast Implementation
 The number of operations required for an n-point transform
is generally of the order O(n²), i.e. quadratic time complexity.
 Some transforms have fast implementations, which reduce
the number of operations to O(n log n).
 For a separable n × n 2-D transform, performing the row
and column 1-D transforms successively reduces the
number of operations from O(n⁴) to O(2n² log n).

Big-O notation expresses how the number of steps grows with n.

Linearity and Orthogonality
 Linearity
 The linearity principle allows a one-to-one mapping between
pixel values and transform coefficients.

 Orthogonality
 Orthogonal transforms have the feature of eliminating
redundancy in the transformed image.

Transform based Image Coding Scheme
[Figure: block diagram.
Source encoder: Original Image → Transformation → Quantiser → Entropy Encoder → Channel Encoder → Transmission Channel.
Source decoder: Transmission Channel → Channel Decoder → Entropy Decoder → Inverse Quantiser → Inverse Transformation → Reconstructed Image.]

Transform based Image Coding Scheme
 Quantisation
 It is the process of reducing the number of possible values of a
quantity, thereby reducing the number of bits needed to represent it.
 It is an irreversible process.
 Entropy Encoder
 The purpose of an entropy encoder is to reduce the number of
bits required to represent each symbol at the quantiser output.
 Commonly used entropy coding techniques are Huffman coding,
arithmetic coding, run-length coding, etc.
 It is a lossless coding scheme.

Introduction
 A discrete cosine transform (DCT) expresses a sequence
of finitely many data points in terms of a sum of cosine
functions oscillating at different frequencies.
 The discrete cosine transform (DCT) and discrete sine
transform (DST) are members of a family of sinusoidal
unitary transforms.
 They are real, orthogonal, and separable, with fast algorithms
for their computation.
 They have great relevance to data compression.
 The DCT is a Fourier-related
transform similar to the discrete Fourier transform (DFT),
but using only real numbers.
Introduction
 It is equivalent to a DFT of roughly twice the length,
operating on real data with even symmetry.
 There are eight standard variants, of which four are
common.

WHY TO USE DCT?
 We use DCT to decorrelate the input signal efficiently.

DFT Basis Images

Real (COS) Imaginary (SIN)

DCT Basis Images

Types of DCT
 The family of discrete trigonometric transforms consists of 8 versions of DCT.
 Each transform is identified as EVEN or ODD and of type I, II, III, and IV.
 All present digital signal and image processing applications (mainly transform
coding and digital filtering of signals) involve only even types of the DCT and
DST.
 Therefore, we consider these four even types of DCT.

Inverse transforms:
 The inverse of DCT-I is DCT-I multiplied by 2/(n-1).
 The inverse of DCT-IV is DCT-IV multiplied by 2/n.
 The inverse of DCT-II is DCT-III multiplied by 2/n (and vice
versa).
 As with the DFT, the normalization factor in front of
these transform definitions is merely a convention and
differs between treatments.
 For example, some authors multiply the transforms by
√(2/n) so that the inverse does not require any additional
multiplicative factor.

Why DCT-II is Common?
 The DCT, and in particular the DCT-II, is often used in
signal and image processing, especially for lossy data
compression, because it has a strong "energy compaction" property.
 Most of the signal information tends to be concentrated in
a few low-frequency components of the DCT, approaching
the performance of the Karhunen-Loève transform.

Why DCT-II is Common?
 Performance of DCT-II is closest to the statistically optimal KLT
based on a number of performance criteria.
o Variance distribution,
o Energy packing efficiency,
o Residual correlation,
o Rate distortion,
o maximum reducible bits …
 Exhibition of desirable characteristics for data compression
namely,
o Data decorrelation
o Data-independent basis functions
o Fast implementation

Why DCT-II is Common?
 The importance of DCT-II is further highlighted by its:
o Superiority in bandwidth compression (redundancy
reduction) of a wide range of signals.
o Powerful performance in bit-rate reduction.
o Existence of fast algorithms for its implementation.
 DCT-II and its inverse, DCT-III, have been employed in
the international image/video coding standards, e.g. JPEG,
MPEG, H.261, H.263, H.264…

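1-D DCT (DCT-II)

1-D DCT
$$X(k) = \sqrt{\frac{2}{N}}\, C(k) \sum_{i=0}^{N-1} x(i) \cos\left[\frac{(2i+1)k\pi}{2N}\right]$$

1-D IDCT
$$x(i) = \sqrt{\frac{2}{N}} \sum_{k=0}^{N-1} C(k)\, X(k) \cos\left[\frac{(2i+1)k\pi}{2N}\right]$$

where C(0) = 1/√2 and C(k) = 1 for k ≠ 0,
k = 0, 1, 2, …, N-1
and i = 0, 1, 2, …, N-1.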
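
The forward and inverse formulas can be checked with a direct, unoptimised implementation. This is only a sketch: it assumes the orthonormal scaling √(2/N) with C(0) = 1/√2 and C(k) = 1 otherwise, and the function names `dct_1d` and `idct_1d` are illustrative, not taken from the slides.

```python
import math

def dct_1d(x):
    """Orthonormal 1-D DCT-II, evaluated directly in O(N^2) operations."""
    n = len(x)
    out = []
    for k in range(n):
        c = 1 / math.sqrt(2) if k == 0 else 1.0  # C(k)
        s = sum(x[i] * math.cos((2 * i + 1) * k * math.pi / (2 * n)) for i in range(n))
        out.append(math.sqrt(2 / n) * c * s)
    return out

def idct_1d(X):
    """Inverse of dct_1d (a DCT-III with the same scaling)."""
    n = len(X)
    out = []
    for i in range(n):
        s = sum((1 / math.sqrt(2) if k == 0 else 1.0) * X[k]
                * math.cos((2 * i + 1) * k * math.pi / (2 * n)) for k in range(n))
        out.append(math.sqrt(2 / n) * s)
    return out

signal = [128, 127, 130, 128, 134, 130, 128, 128]  # one row of 8 pixels
coeffs = dct_1d(signal)
restored = idct_1d(coeffs)
```

With this scaling the round trip reproduces the input up to floating-point error, and most of the energy lands in `coeffs[0]`.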

Orthogonality
C is orthogonal, which implies C⁻¹ = Cᵀ, and that
entails:
CᵀC = CCᵀ = I
This property is used to solve matrix equations easily.

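1-D DCT
The discrete cosine transform, C, has one basic characteristic: it
is a real orthogonal matrix.

$$C = \sqrt{\frac{2}{n}}
\begin{bmatrix}
\frac{1}{\sqrt{2}} & \frac{1}{\sqrt{2}} & \cdots & \frac{1}{\sqrt{2}} \\
\cos\frac{\pi}{2n} & \cos\frac{3\pi}{2n} & \cdots & \cos\frac{(2n-1)\pi}{2n} \\
\vdots & \vdots & & \vdots \\
\cos\frac{(n-1)\pi}{2n} & \cos\frac{(n-1)3\pi}{2n} & \cdots & \cos\frac{(n-1)(2n-1)\pi}{2n}
\end{bmatrix}$$

$$C^{-1} = C^{T} = \sqrt{\frac{2}{n}}
\begin{bmatrix}
\frac{1}{\sqrt{2}} & \cos\frac{\pi}{2n} & \cdots & \cos\frac{(n-1)\pi}{2n} \\
\frac{1}{\sqrt{2}} & \cos\frac{3\pi}{2n} & \cdots & \cos\frac{(n-1)3\pi}{2n} \\
\vdots & \vdots & & \vdots \\
\frac{1}{\sqrt{2}} & \cos\frac{(2n-1)\pi}{2n} & \cdots & \cos\frac{(n-1)(2n-1)\pi}{2n}
\end{bmatrix}$$
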
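The orthogonality claim can be checked numerically. The sketch below builds the n × n matrix with rows scaled by √(2/n) and a first row of 1/√2 entries, then multiplies it by its transpose. The helper names (`dct_matrix`, `matmul`, `transpose`) are illustrative only.

```python
import math

def dct_matrix(n):
    """Row k, column i: sqrt(2/n) * C(k) * cos((2i+1) k pi / 2n)."""
    rows = []
    for k in range(n):
        # sqrt(2/n) * 1/sqrt(2) = sqrt(1/n) for the first row
        scale = math.sqrt(1 / n) if k == 0 else math.sqrt(2 / n)
        rows.append([scale * math.cos((2 * i + 1) * k * math.pi / (2 * n))
                     for i in range(n)])
    return rows

def transpose(m):
    return [list(col) for col in zip(*m)]

def matmul(a, b):
    return [[sum(a[r][t] * b[t][c] for t in range(len(b)))
             for c in range(len(b[0]))] for r in range(len(a))]

C = dct_matrix(4)
product = matmul(C, transpose(C))  # should be the 4x4 identity
```

Because the product is the identity, the inverse transform needs no matrix inversion: it is simply multiplication by the transpose.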
Interpolation with DCT

 What is Interpolation?
Interpolation is a method of constructing new data points
within the range of a discrete set of known data points.

 Why interpolate with DCT?

DCT interpolation gives terms already arranged in order of
importance to the human visual system.
The first terms are the most important; the last terms are the least important.

Discrete Cosine Transform (DCT)
The most important tool for overcoming drawbacks of
the 2D DFT is the 2D DCT.
There are several definitions of this transform in practice
and here we will use the following (for the 1D DCT):

Inverse DCT
 Inverse DCT is defined as:

 DCT and inverse DCT are mutually inverse.

2D DCT
 There is no unique form of the 2D DCT. The simplest
realization technique is to calculate the 1D DCT
along the rows and then along the columns of the newly
obtained matrix.
 However, there are alternative techniques for direct
evaluation of the 2D DCT. Again, there are several
definitions of the 2D DCT that can be used in practice,
but here we adopt:

Inverse 2D DCT

 The inverse 2D DCT (for our form of the 2D DCT) is:

Mathematical Properties-DCT
 DCT matrices are real and orthogonal.

 Unitary property: A⁻¹ = (A*)ᵀ, where A* is the conjugate of A. For a real DCT matrix this reduces to A⁻¹ = Aᵀ.

 Linearity property
M(αg + βf) = αMg + βMf
for a matrix M, constants α and β, and vectors g and f. All DCTs are linear
transforms.
 The convolution-multiplication property
Convolution in the spatial domain is equivalent to taking an inverse
transform of the product of forward transforms of two data sequences.
The convolution-multiplication property is a powerful tool for
performing digital filtering in the transform domain.

Mathematical Properties-DCT
 All DCTs are separable transforms: a multidimensional transform
can be decomposed into successive applications of one-dimensional
(1-D) transforms in the appropriate directions.
 The 2-D DCT is separable, meaning that it can be separated into a pair of 1-D
DCTs.
 To obtain the 2-D DCT of a block, a 1-D DCT is first performed
on the rows of the block, then a 1-D DCT is performed on the
columns of the resulting block.
 The same applies to the IDCT.
 This process is illustrated on the following slide.

[Figure: part of a picture with 256-pixel rows and columns, from which an 8x8 block is taken; the labels pixel[n], pixel[n+1] and pixel[256+n] mark neighbouring pixels.]

Raw Block of Pixels (an 8x8 block)

128 127 130 128 134 130 128 128
128 127 130 128 134 130 128 128
125 126 130 127 130 131 128 127
124 128 127 126 129 130 127 127
131 132 130 129 132 128 129 131
129 131 134 131 133 129 131 131
129 134 134 136 136 137 134 132
133 135 136 138 137 140 135 132

Result after DCT by rows

365 -1 -4 1 1 -3 1 3
365 -1 -4 1 1 -3 1 3
362 -3 -4 0 -3 -1 2 1
359 -3 -3 0 -3 -3 0 -1
368 1 1 -1 1 -3 -2 1
370 0 -3 -3 -1 -1 -1 2
378 -3 -6 0 -3 -2 -1 -2
383 -1 -6 2 -3 0 0 -3

Final result after DCT by columns

1024 -4 -11 0 -4 -6 0 1
-18 -1 1 0 2 -2 2 4
11 -1 -5 2 0 0 1 -1
0 2 2 -3 3 -2-1 -2 3
0 1 1 1 0 -1 -1 -2
-4 -3 -3 -1 -3 -1 0 -2
-2 0 -1 -2 -1 1 1 0
2 0 1 0 0 -1 -1 -1

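The row-then-column procedure can be sketched as follows. The code assumes the orthonormal 1-D DCT-II (√(2/N) scaling, C(0) = 1/√2); under that assumption the first-row DC term after the row pass comes out near 365, in line with the table. The function names are illustrative.

```python
import math

def dct_1d(x):
    n = len(x)
    return [math.sqrt(2 / n) * (1 / math.sqrt(2) if k == 0 else 1.0)
            * sum(x[i] * math.cos((2 * i + 1) * k * math.pi / (2 * n)) for i in range(n))
            for k in range(n)]

def transpose(m):
    return [list(col) for col in zip(*m)]

def dct_rows(block):
    """1-D DCT applied to every row."""
    return [dct_1d(row) for row in block]

def dct_2d(block):
    """Rows first, then columns of the intermediate result."""
    return transpose(dct_rows(transpose(dct_rows(block))))

def dct_2d_cols_first(block):
    """Columns first, then rows; separability says the result is the same."""
    return dct_rows(transpose(dct_rows(transpose(block))))

block = [
    [128, 127, 130, 128, 134, 130, 128, 128],
    [128, 127, 130, 128, 134, 130, 128, 128],
    [125, 126, 130, 127, 130, 131, 128, 127],
    [124, 128, 127, 126, 129, 130, 127, 127],
    [131, 132, 130, 129, 132, 128, 129, 131],
    [129, 131, 134, 131, 133, 129, 131, 131],
    [129, 134, 134, 136, 136, 137, 134, 132],
    [133, 135, 136, 138, 137, 140, 135, 132],
]
after_rows = dct_rows(block)
final = dct_2d(block)
final_alt = dct_2d_cols_first(block)
```

Each pass costs eight 8-point transforms, which is where the saving over a direct 2-D evaluation comes from.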
DCT
 For example, the DCT is used in JPEG image
compression, MJPEG video compression, and MPEG
video compression.
 There, the two-dimensional DCT-II of 8x8 blocks is
computed and the results are quantized and entropy coded.
 In this case, n is typically 8 and the DCT-II formula is
applied to each row and column of the block.
 The result is an 8x8 transform coefficient array in which
the (0,0) element is the DC (zero-frequency) component
and entries with increasing vertical and horizontal index
values represent higher vertical and horizontal spatial
frequencies.

DCT– Matrix Form
DCT-Matrix Form

IDCT-Matrix Form

DCT Result

Example of a 4x4 DCT Matrix
Example of a 4x4 DCT Matrix

Example of a 4x4 IDCT Matrix

Example of a 4x4 DCT Matrix
 An N = 4 point DCT matrix can be generated by
c[n,m] = a[n] cos((2m+1)nπ/8)

Assume the signal is X = [0, 1, 2, 3]ᵀ; then its DCT is

[Figure: DCT matrix × data vector = transform vector]

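Since the numeric matrices are not reproduced here, the example can be recomputed directly. The sketch assumes the orthonormal normalisation a[0] = √(1/4) and a[n] = √(2/4) for n > 0, which the text does not state explicitly.

```python
import math

N = 4

def a(n):
    # Assumed orthonormal scaling; the slide leaves a[n] undefined.
    return math.sqrt(1 / N) if n == 0 else math.sqrt(2 / N)

# c[n][m] = a[n] cos((2m+1) n pi / 8)
c = [[a(n) * math.cos((2 * m + 1) * n * math.pi / 8) for m in range(N)]
     for n in range(N)]

x = [0, 1, 2, 3]
# Forward transform: X = c x
X = [sum(c[n][m] * x[m] for m in range(N)) for n in range(N)]
# Inverse transform: x = c^T X
x_back = [sum(c[n][m] * X[n] for n in range(N)) for m in range(N)]
```

Under this assumption X is approximately [3, -2.2304, 0, -0.1585], and multiplying by the transpose recovers [0, 1, 2, 3].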
Example of a 4x4 DCT Matrix
 The inverse transform is:

[Figure labels: DCT matrix, transform vector, data vector]

Joint Photographic Expert Group

JPEG involves the following four distinct steps

 Block Preparation
 Discrete Cosine Transform
 Quantization
 Compression

Input → Block Preparation → Discrete Cosine Transform → Quantization → Data Compression → Output

JPEG Coding
To perform the JPEG coding, an image (in colour or
gray scales) is first subdivided into blocks of 8x8
pixels.
The Discrete Cosine Transform (DCT) is then
performed on each block.
This generates 64 coefficients which are then
quantized to reduce their magnitude.

JPEG Coding
The coefficients are then reordered into a one-
dimensional array in a zigzag manner before further
entropy encoding.
The compression is achieved in two stages; the first is
during quantization and the second during the entropy
coding process.
JPEG decoding is the reverse process of coding.

JPEG Coding Block Diagram

JPEG-Block Preparation
 Reasons for Block Processing
 Taking the DCT of an entire image requires a large amount of memory.
 Taking the DCT of an entire image is not a good idea for
compression because of spatially varying statistics within an image.

 Advantages of Block Processing

 The transform of small blocks is much easier to compute than
that of the complete image.
 Pixel correlation usually does not extend beyond 16 or 32 pixels.
 Correlation between neighbouring pixels is higher than between distant
pixels.

JPEG-Block Preparation

Luminance (Brightness)
Y = 0.30R + 0.59G + 0.11B
Chrominance (Colour), PAL
U = -0.15R - 0.29G + 0.44B
V = 0.62R - 0.52G - 0.10B

JPEG-DCT
Each block of 64 pixels goes through a transformation called DCT.
To understand the nature of this transformation, let us consider the result of the
transformations for three different cases-
1)Uniform Grayscale
2)Two sections
3)Gradient Grayscale.

Case-I Uniform Grayscale

JPEG-DCT
Case-II Two Section

Case-III Gradient Grayscale

Quantization

 After the T table (the table of DCT coefficients) is created, the values are quantized to reduce the
number of bits needed for encoding.
 Quantization divides each value by a constant and then
drops the fraction. This reduces the required number of bits even
more.
 In most implementations, a quantizing table (8 by 8) defines how
to quantize each value.
 The divisor depends on the position of the value in the T table.
 This is done to optimize the number of bits and the number of 0s
for each particular application.

JPEG-Quantization
Quantization

Compression

 After quantization the values are read from the table, and
redundant 0s are removed.
 However, to cluster the 0s together, the process reads the table
diagonally in a zigzag fashion rather than row by row or column
by column.
 The reason is that if the picture does not have fine changes, the
bottom right corner of the T table is all 0s.
 JPEG usually uses run-length encoding at the compression phase
to compress the bit pattern resulting from the zigzag linearization.

Run-length coding result: 20 15 15 12 17 12 A 0 58
JPEG-Run length Encoding
A zigzag scanning pattern is used to group the 0s together. A run of
0s can then be replaced by a single count (say, 38 0s).

Zigzag scanning

Run Length Coding

160 95 90 28 85 30 6 20 24 4 1 3 16 4 1 0 1 3 3 A 0 5 2 1 A 0 38
Another example:
2 1 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 3 → 2 1 A 0 26 3

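The notation in these examples appears to use `A` as a marker followed by the repeated value (0) and the run length. A minimal sketch of that scheme follows; the marker symbol, the choice to leave a single 0 uncoded, and the function name are assumptions read off the examples. The entropy coder in the JPEG standard itself is more elaborate than this.

```python
def run_length_zeros(values, marker="A", min_run=2):
    """Replace each run of at least `min_run` zeros by [marker, 0, run_length]."""
    out = []
    i = 0
    while i < len(values):
        if values[i] == 0:
            j = i
            while j < len(values) and values[j] == 0:
                j += 1          # j stops just past the end of the zero run
            run = j - i
            if run >= min_run:
                out.extend([marker, 0, run])
            else:
                out.extend([0] * run)  # a lone zero is kept as a literal
            i = j
        else:
            out.append(values[i])
            i += 1
    return out

encoded = run_length_zeros([2, 1] + [0] * 26 + [3])

zigzag = ([160, 95, 90, 28, 85, 30, 6, 20, 24, 4, 1, 3, 16, 4, 1, 0, 1, 3, 3]
          + [0] * 5 + [2, 1] + [0] * 38)
encoded_block = run_length_zeros(zigzag)
```

The second input has 64 entries, one per coefficient of an 8x8 block, and shrinks to 27 symbols.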
Bit Allocation
 There are two methods used for bit allocation:
 Zonal coding
 Threshold coding

Zonal coding
 It is based on the fact that the transform coefficients of
maximum variance carry the most picture information.
 The locations of the coefficients with the largest variances are
indicated by means of a zonal mask, which is the same for all
blocks.
 All transform coefficients in the zone are retained, while all
coefficients outside the zone are set to zero.
 To design a zonal mask, the variance of each coefficient can be
calculated based on a global image model, such as the Gauss-Markov
model.

Bit Allocation in Zonal Coding
 A simple bit allocation strategy is to choose the number of bits
in proportion to the variance of each coefficient over the blocks.
 If the number of retained coefficients is M, with variances
σ_i², i = 1, 2, 3, …, M, then the number of bits allocated to each of
these coefficients is given by

where B is the total number of bits available to represent a block.

Zonal coding
Information theory says that the coefficients with maximum variance carry
the most information.
The 15 transform coefficients (out of 64) with the largest variance are kept (1 = keep, 0 = discard):

1 1 1 1 1 0 0 0
1 1 1 1 0 0 0 0
1 1 1 0 0 0 0 0
1 1 0 0 0 0 0 0
1 0 0 0 0 0 0 0
0 0 0 0 0 0 0 0
0 0 0 0 0 0 0 0
0 0 0 0 0 0 0 0

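The triangular mask above keeps every position whose row and column indices sum to less than 5. A short sketch of building and applying such a mask; the `zone` parameter and function names are illustrative, and a real mask would be designed from measured coefficient variances rather than a fixed triangle.

```python
def zonal_mask(size=8, zone=5):
    """1 where row + column < zone (the low-frequency corner), else 0."""
    return [[1 if r + c < zone else 0 for c in range(size)] for r in range(size)]

def apply_mask(coeffs, mask):
    """Keep coefficients inside the zone, set the rest to zero."""
    return [[v * m for v, m in zip(row, mrow)] for row, mrow in zip(coeffs, mask)]

mask = zonal_mask()
kept = sum(sum(row) for row in mask)  # number of retained coefficients

demo = [[r * 8 + c + 1 for c in range(8)] for r in range(8)]  # dummy coefficients 1..64
masked = apply_mask(demo, mask)
```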
Quantization
 Retained coefficients are quantized and coded.
 Coefficients are normalized by their standard
deviation and uniformly quantized.

Bit allocation
 How many bits should be assigned to each coefficient?
 Information theory tells us that a Gaussian random variable with variance σ²,
subject to distortion D, cannot be represented by fewer than

$$\frac{1}{2}\log_2\left(\frac{\sigma^2}{D}\right) \text{ bits}$$

 Large-variance coefficients need more bits; small-variance
coefficients need fewer bits.

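As a small worked illustration of the bound, the helper below evaluates ½ log₂(σ²/D). Clamping to zero when the variance does not exceed the allowed distortion is the usual rate-distortion convention and is added here as an assumption; the function name is hypothetical.

```python
import math

def min_bits(variance, distortion):
    """Lower bound on bits for a Gaussian variable at mean-squared distortion D."""
    if variance <= distortion:
        return 0.0  # the coefficient can be dropped within the distortion budget
    return 0.5 * math.log2(variance / distortion)

# Variances 16, 4 and 0.5 at distortion D = 1
bits = [min_bits(v, 1.0) for v in (16.0, 4.0, 0.5)]
```

A coefficient with variance 16 needs at least 2 bits at D = 1, one with variance 4 needs 1 bit, and one with variance 0.5 needs none.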
Bit allocation table

8 7 6 4 3 2 1 0
7 6 5 4 3 2 1 0
6 5 4 3 2 1 1 0
4
3
2
1
0

Number of bits per coefficient

Problem with zonal coding
 If we consider a fixed area we run the risk of cutting off
occasional transform coefficients that are large enough to
be included

excluded

Threshold coding
 One solution is to threshold coefficients and possibly use
different thresholds for different subimages.
 This way we will end up keeping those coefficients that
exceed threshold, wherever they might be.

Threshold Coding
 In most images, different blocks have different spectral and
statistical characteristics.
 So it is necessary to use adaptive bit allocation methods.
 In threshold coding, each transform coefficient is compared
with a threshold.
 If its magnitude is smaller than the threshold, it is set to zero.
 If it is larger than the threshold, it is retained for
quantization and encoding.
 Thresholding is an adaptive method in which only
those coefficients whose magnitudes are above a threshold are
retained within each block.

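A minimal sketch of the thresholding step. The coefficient values and the threshold of 50 are illustrative only, and the function name is hypothetical.

```python
def threshold_block(coeffs, threshold):
    """Keep coefficients whose magnitude exceeds the threshold; zero the rest."""
    return [[v if abs(v) > threshold else 0 for v in row] for row in coeffs]

rows = [
    [-304, 210, 104, -69, 10, 20, -12, 7],
    [-327, -260, 67, 70, -10, -15, 21, 8],
]
kept = threshold_block(rows, 50)
```

Unlike a zonal mask, the positions that survive depend on the block itself, so a large coefficient is kept wherever it occurs.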
Quantiser
 The purpose of quantisation is to remove the components of the
transformed data that are unimportant to the visual appearance of
the image and to retain the visually important components.
 The quantiser utilizes the fact that the human eye is unable to
perceive some visual information in an image.
 Such information is deemed redundant and can be discarded
without noticeable visual artefacts.
 Such redundancy is referred to as psychovisual redundancy.
 The process of assigning a particular sample to a particular level is
called quantisation.
 It is non-linear and irreversible. It is a lossy compression scheme.
 Rounding a value to the nearest integer allows it to be represented
with fewer bits.
Picking the N-largest transform coefficients
 In a 2D array, how do we pick the N largest coefficients?
 Answer: a zig-zag scan. The coefficients are rearranged into a 1-D
sequence.

0 1 5 6 14 15 27 28
2 4 7 13 16 26 29
3 8 12 17 25 30
9 11 18 24 31
10 19 23 32
20 22 33
21 34
35
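
The scan order shown in the table can be generated by walking the anti-diagonals and alternating direction. The sketch below produces the full 8x8 index table, of which the table above shows the first 36 entries. Function names are illustrative.

```python
def zigzag_order(n=8):
    """List of (row, col) positions in zig-zag scan order."""
    cells = [(r, c) for r in range(n) for c in range(n)]
    # Sort by anti-diagonal r + c. Odd diagonals are walked with the row
    # increasing, even diagonals with the column increasing.
    return sorted(cells, key=lambda rc: (rc[0] + rc[1],
                                         rc[0] if (rc[0] + rc[1]) % 2 else rc[1]))

def zigzag_index_table(n=8):
    """table[r][c] = position of cell (r, c) in the scan."""
    table = [[0] * n for _ in range(n)]
    for idx, (r, c) in enumerate(zigzag_order(n)):
        table[r][c] = idx
    return table

def zigzag_scan(block):
    """Flatten a square block into a 1-D list in zig-zag order."""
    return [block[r][c] for r, c in zigzag_order(len(block))]

table = zigzag_index_table()
```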

Threshold details
 How do we threshold the transform coefficients?
 There are 3 ways
 Single global threshold for all subimages
 Different thresholds for different subimages
 Different threshold for different locations in the subimage

Image Compression
 Image compression is a method that reduces the amount of
memory it takes to store an image.

 We will exploit the fact that the DCT orders its coefficients by their importance to our
visual system.

 This means we can delete the least significant values without
our eyes noticing the difference.

Image Compression
 We have now found the matrix Y = C(CXᵀ)ᵀ = CXCᵀ.

 Using the DCT, the entries in Y are organized according to their importance to the
human visual system.

 The most important values to our eyes are placed in the
upper left corner of the matrix.

 The least important values are mostly in the lower
right corner of the matrix.

[Figure: matrix regions labelled Most Important (upper left), Semi-Important (middle), Least Important (lower right)]

Image Compression

8 x 8 Pixels Image

Image Compression
 Gray-Scale Example
 Value Range 0 (black) --- 255 (white)

63 33 36 28 63 81 86 98
27 18 17 11 22 48 104 108
72 52 28 15 17 16 47 77
132 100 56 19 10 9 21 55
187 186 166 88 13 34 43 51
184 203 199 177 82 44 97 73
211 214 208 198 134 52 78 83
211 210 203 191 133 79 74 86

X
Image Compression
 2D-DCT of the matrix X
The numbers are the coefficients of the
2-D cosine basis functions.

-304 210 104 -69 10 20 -12 7

-327 -260 67 70 -10 -15 21 8
93 -84 -66 16 24 -2 -5 9
89 33 -19 -20 -26 21 -3 0
-9 42 18 27 -7 -17 29 -7
-5 15 -10 17 32 -15 -4 7
10 3 -12 -1 2 3 -2 -3
12 30 0 -3 -3 -6 12 -1

Y
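
The matrix Y can be recomputed from the gray-scale block X as Y = CXCᵀ. One detail is inferred from the numbers rather than stated in the slides: the DC value of -304 is reproduced only if 128 is subtracted from every pixel before the transform. The sketch below makes that level shift explicit and assumes the orthonormal DCT matrix; helper names are illustrative.

```python
import math

X = [
    [63, 33, 36, 28, 63, 81, 86, 98],
    [27, 18, 17, 11, 22, 48, 104, 108],
    [72, 52, 28, 15, 17, 16, 47, 77],
    [132, 100, 56, 19, 10, 9, 21, 55],
    [187, 186, 166, 88, 13, 34, 43, 51],
    [184, 203, 199, 177, 82, 44, 97, 73],
    [211, 214, 208, 198, 134, 52, 78, 83],
    [211, 210, 203, 191, 133, 79, 74, 86],
]

def dct_matrix(n):
    return [[(math.sqrt(1 / n) if k == 0 else math.sqrt(2 / n))
             * math.cos((2 * i + 1) * k * math.pi / (2 * n)) for i in range(n)]
            for k in range(n)]

def transpose(m):
    return [list(col) for col in zip(*m)]

def matmul(a, b):
    return [[sum(a[r][t] * b[t][c] for t in range(len(b)))
             for c in range(len(b[0]))] for r in range(len(a))]

C = dct_matrix(8)
shifted = [[v - 128 for v in row] for row in X]   # assumed level shift of 128
Y = matmul(matmul(C, shifted), transpose(C))      # Y = C X C^T
Y_rounded = [[round(v) for v in row] for row in Y]
```

The level shift changes only the DC term Y[0][0]; every other coefficient is unaffected by adding a constant to the block.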
Image Compression
 Cut the least significant components

-304 210 104 -69 10 20 -12 0

-327 -260 67 70 -10 -15 0 0
93 -84 -66 16 24 0 0 0
89 33 -19 -20 0 0 0 0
-9 42 18 0 0 0 0 0
-5 15 0 0 0 0 0 0
10 0 0 0 0 0 0 0
0 0 0 0 0 0 0 0

As you can see, we save a little over half the original memory.

Reconstructing the Image
 New Matrix and Compressed Image

55 41 27 39 56 69 92 106
35 22 7 16 35 59 88 101
65 49 21 5 6 28 62 73
130 114 75 28 -7 -1 33 46
180 175 148 95 33 16 45 59
200 206 203 165 92 55 71 82
205 207 214 193 121 70 75 83
214 205 209 196 129 75 78 85

Can You Tell the Difference?
Original Compressed

Image Compression

Original Compressed

Linear Quantization
 We will not zero the bottom half of the matrix.
 The idea is to assign fewer bits of memory to store
information in the lower right corner of the DCT matrix.

Linear Quantization

Use Quantization Matrix (Q)

Q = p ×
8 16 24 32 40 48 56 64
16 24 32 40 48 56 64 72
24 32 40 48 56 64 72 80
32 40 48 56 64 72 80 88
40 48 56 64 72 80 88 96
48 56 64 72 80 88 96 104
56 64 72 80 88 96 104 112
64 72 80 88 96 104 112 120

Linear Quantization
p is called the loss parameter.
It acts like a "knob" to control compression.
The greater p is, the more the image is compressed.

Linear Quantization
We divide each entry in the DCT matrix by the corresponding entry of the
quantization matrix and round the result to an integer.

DCT matrix Y:

-304 210 104 -69 10 20 -12 7
-327 -260 67 70 -10 -15 21 8
93 -84 -66 16 24 -2 -5 9
89 33 -19 -20 -26 21 -3 0
-9 42 18 27 -7 -17 29 -7
-5 15 -10 17 32 -15 -4 7
10 3 -12 -1 2 3 -2 -3
12 30 0 -3 -3 -6 12 -1

Quantization matrix Q (p = 1):

8 16 24 32 40 48 56 64
16 24 32 40 48 56 64 72
24 32 40 48 56 64 72 80
32 40 48 56 64 72 80 88
40 48 56 64 72 80 88 96
48 56 64 72 80 88 96 104
56 64 72 80 88 96 104 112
64 72 80 88 96 104 112 120

Linear Quantization

p = 1 (new Y: 14 terms)

-38 13 4 -2 0 0 0 0
-20 -11 2 2 0 0 0 0
4 -3 -2 0 0 0 0 0
3 1 0 0 0 0 0 0
0 1 0 0 0 0 0 0
0 0 0 0 0 0 0 0
0 0 0 0 0 0 0 0
0 0 0 0 0 0 0 0

p = 4 (new Y: 10 terms)

-9 3 1 -1 0 0 0 0
-5 -3 1 0 0 0 0 0
1 -1 0 0 0 0 0 0
1 0 0 0 0 0 0 0
0 0 0 0 0 0 0 0
0 0 0 0 0 0 0 0
0 0 0 0 0 0 0 0
0 0 0 0 0 0 0 0

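The quantized matrices for p = 1 and p = 4 can be reproduced with a few lines. Two points are assumptions inferred from the numbers rather than stated in the slides: Q is generated as p · 8 · (row + column + 1), and quotients are rounded to the nearest integer with halves rounded up (-304/32 = -9.5 appears as -9).

```python
import math

Y = [
    [-304, 210, 104, -69, 10, 20, -12, 7],
    [-327, -260, 67, 70, -10, -15, 21, 8],
    [93, -84, -66, 16, 24, -2, -5, 9],
    [89, 33, -19, -20, -26, 21, -3, 0],
    [-9, 42, 18, 27, -7, -17, 29, -7],
    [-5, 15, -10, 17, 32, -15, -4, 7],
    [10, 3, -12, -1, 2, 3, -2, -3],
    [12, 30, 0, -3, -3, -6, 12, -1],
]

def quant_matrix(p, n=8):
    """Q[r][c] = p * 8 * (r + c + 1): 8, 16, 24, ... along each anti-diagonal."""
    return [[p * 8 * (r + c + 1) for c in range(n)] for r in range(n)]

def quantize(coeffs, q):
    """Divide element-wise and round to nearest (halves rounded up; assumed)."""
    return [[math.floor(v / d + 0.5) for v, d in zip(row, qrow)]
            for row, qrow in zip(coeffs, q)]

def dequantize(levels, q):
    """Decoder side: multiply back. The rounding loss is not recoverable."""
    return [[v * d for v, d in zip(row, qrow)] for row, qrow in zip(levels, q)]

def count_nonzero(m):
    return sum(1 for row in m for v in row if v != 0)

q1 = quantize(Y, quant_matrix(1))
q4 = quantize(Y, quant_matrix(4))
```

Raising p from 1 to 4 drops the number of non-zero terms from 14 to 10, which is the "knob" behaviour described for the loss parameter.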
Linear Quantization
p=1 p=4

Linear Quantization
p=1 p=4

Properties of DCT
 Decorrelation - The principal advantage of image
transformation is the removal of redundancy between
neighboring pixels. This leads to uncorrelated transform
coefficients, which can be encoded independently.
 Energy Compaction - The DCT exhibits excellent energy
compaction for highly correlated images. An uncorrelated
image has its energy spread out, whereas the energy of a
correlated image is packed into the low-frequency region.

 Orthogonality - The IDCT basis functions are orthogonal.
Thus, the inverse of the transformation matrix A is equal to its
transpose, i.e. A⁻¹ = Aᵀ.
 Separability - The DCT can be applied along either
direction first and then along the other; the resulting
coefficients are the same.

Applications
 The DCT is used in JPEG image compression, MJPEG
video compression, and MPEG video compression.
 A related transform, the modified discrete cosine
transform, or MDCT, is used in MP3 audio compression.
 DCTs are also widely employed in solving partial
differential equations by spectral methods, where the
different variants of the DCT correspond to slightly
different even/odd boundary conditions at the two ends of
the array.

Advantages of DCT
 The DCT does a better job of concentrating energy into
lower order coefficients than does the DFT for image data.
 The DCT is purely real, the DFT is complex.
 Assuming a periodic input, the magnitude of the DFT
coefficients is spatially invariant. This is not true for the
DCT.

Compared with DFT, DCT has two main advantages
 It is a real transform with better computational efficiency
than the DFT, which by definition is a complex transform.
 It does not introduce a discontinuity when imposing
periodicity on the time signal.
 In the DFT, as the time signal is truncated and assumed
periodic, a discontinuity is introduced in the time domain and
corresponding artifacts are introduced in the frequency
domain.
 But as even symmetry is assumed while truncating the time
signal, no discontinuity and related artifacts are introduced
in the DCT.

Drawbacks of DCT
 Blocking artefacts
 Graininess
 Blurring
Blocking artefacts arise from variations in overall
characteristics from one block to the next, e.g. neighbouring blocks
with different DC levels. This gives rise to a checkerboard effect.
Graininess is caused by truncation of low spectral
coefficients, or a lack of detail.
Blurring is due to truncation of high-frequency
components.

JPEG 2000-Block Diagram

Thanks

Prof (Dr) P P Ghadekar, VIT Pune

12 pages
Fourier Series
No ratings yet
Fourier Series
14 pages
DigitalSignalProcessing Course McGill
No ratings yet
DigitalSignalProcessing Course McGill
6 pages
Lab 6 - Matlab FDATools
No ratings yet
Lab 6 - Matlab FDATools
4 pages
Truncated Huffman
No ratings yet
Truncated Huffman
5 pages
Question Bank For Digital Signal Processing
No ratings yet
Question Bank For Digital Signal Processing
11 pages
4-PCA and SVD
No ratings yet
4-PCA and SVD
32 pages
Advanced Digital Signal Processing With Matlab (R)
No ratings yet
Advanced Digital Signal Processing With Matlab (R)
4 pages
Fast Algorithm For DCT
No ratings yet
Fast Algorithm For DCT
20 pages
V & Vi
No ratings yet
V & Vi
37 pages
Secure Image Transfer in The Domain Transform DFT
No ratings yet
Secure Image Transfer in The Domain Transform DFT
5 pages
42-Comparing MCSA With Vibration Analysis in Order To Detect Bearing Faults - A Case Study PDF
No ratings yet
42-Comparing MCSA With Vibration Analysis in Order To Detect Bearing Faults - A Case Study PDF
7 pages
VHDL Code For 8 Point FFT Algorithm
No ratings yet
VHDL Code For 8 Point FFT Algorithm
5 pages
Applied Mathematical in Electrical Engineering - Final Exam (Semester I 2018-2019)
No ratings yet
Applied Mathematical in Electrical Engineering - Final Exam (Semester I 2018-2019)
1 page
Analog Dialogue, Volume 48, Number 1: Analog Dialogue, #13
From Everand
Analog Dialogue, Volume 48, Number 1: Analog Dialogue, #13
Analog Dialogue
4/5 (1)



O: Big-O notation, used to represent efficiency (number of steps)

Inverse Inverse EntropyEn

Real (COS) Imaginary (SIN)

 Why interpolate with DCT?

 DCT and inverse DCT are mutually inverse.

 The inverse 2D DCT (for our form of the 2D DCT) is:

 Unitary property: $A^{-1} = (A^{*})^{T}$, where $A^{*}$ is the complex conjugate of $A$
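For a real matrix such as the DCT matrix, conjugation changes nothing, so the unitary property reduces to $A^{-1} = A^{T}$. The following is a minimal sketch that checks this numerically for a 4x4 matrix. It assumes the orthonormal DCT-II scaling, and the helper names `dct_matrix`, `transpose` and `matmul` are illustrative, not taken from the slides.

```python
import math

def dct_matrix(n):
    """Build an n x n DCT-II matrix with orthonormal scaling (assumed convention)."""
    rows = []
    for k in range(n):
        # Row 0 uses sqrt(1/n); all other rows use sqrt(2/n).
        scale = math.sqrt(1.0 / n) if k == 0 else math.sqrt(2.0 / n)
        rows.append([scale * math.cos(math.pi * (2 * j + 1) * k / (2 * n))
                     for j in range(n)])
    return rows

def transpose(m):
    """Return the transpose of a list-of-lists matrix."""
    return [list(col) for col in zip(*m)]

def matmul(a, b):
    """Multiply two list-of-lists matrices."""
    bt = transpose(b)
    return [[sum(x * y for x, y in zip(row, col)) for col in bt] for row in a]

A = dct_matrix(4)
# If A is unitary (orthogonal, since it is real), A * A^T is the identity.
product = matmul(A, transpose(A))
for row in product:
    print([round(v, 6) for v in row])
```

Because the product is the identity up to rounding error, multiplying by the transpose undoes the forward transform, which is why the DCT and the inverse DCT are mutually inverse.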

128 127 130 128 134 130 128 128

124 128 127 126 129 130 127 127

131 132 130 129 132 128 129 131

129 134 134 136 136 137 134 132

Final result after DCT by columns Result after DCT by rows

Example of a 4x4 IDCT Matrix

Assume the signal is X = [0, 1, 2, 3]^T; then its DCT is
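The transform of this signal can be sketched by evaluating the DCT sum directly. The scaling here is an assumption (the orthonormal DCT-II, with factor sqrt(1/N) for k = 0 and sqrt(2/N) otherwise); if the slides use a different normalization, the coefficients differ by constant factors. The function name `dct_ii` is illustrative.

```python
import math

def dct_ii(x):
    """Orthonormal DCT-II of a sequence, computed from the defining sum."""
    n = len(x)
    out = []
    for k in range(n):
        # Assumed normalization: sqrt(1/n) for the DC term, sqrt(2/n) otherwise.
        scale = math.sqrt(1.0 / n) if k == 0 else math.sqrt(2.0 / n)
        total = sum(x[j] * math.cos(math.pi * (2 * j + 1) * k / (2 * n))
                    for j in range(n))
        out.append(scale * total)
    return out

signal = [0, 1, 2, 3]
coeffs = dct_ii(signal)
print(coeffs)
```

Under this normalization the coefficients come out to approximately 3, -2.2304, 0 and -0.1585, so most of the energy sits in the first two terms.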

DCT Matrix Data Transform

DCT Matrix Transform Data

JPEG involves the following four distinct steps

Input Block Discrete Cosine Data Output

 Advantages of Block Processing

Case-I Uniform Grayscale

Case-III Gradient Grayscale

Run Length Coding
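As a rough illustration of the idea, the sketch below collapses consecutive repeated values into (value, run length) pairs. This is plain run-length coding, not the exact zero-run scheme used in JPEG entropy coding, and the input sequence is a hypothetical set of quantized coefficients chosen only to show long runs of zeros.

```python
def run_length_encode(values):
    """Collapse consecutive repeats into (value, run_length) pairs."""
    runs = []
    for v in values:
        if runs and runs[-1][0] == v:
            runs[-1][1] += 1      # extend the current run
        else:
            runs.append([v, 1])   # start a new run
    return [tuple(r) for r in runs]

def run_length_decode(pairs):
    """Expand (value, run_length) pairs back into the original sequence."""
    out = []
    for value, count in pairs:
        out.extend([value] * count)
    return out

# Hypothetical quantized coefficients (not from the slides).
coeffs = [12, 5, 0, 0, 0, -1, 0, 0, 0, 0]
encoded = run_length_encode(coeffs)
print(encoded)
```

The ten input values shrink to five pairs, and `run_length_decode(encoded)` returns the original sequence, so the step is lossless.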

Where B is the total number of bits available to represent a block

Number of bits per coefficient

 This means we can delete the least significant values without

 Using the DCT, the entries in Y will be organized based on the

 The most important values to

right corner of the matrix. Least

-304 210 104 -69 10 20 -12 7

-304 210 104 -69 10 20 -12 0

Use Quantization Matrix (Q)

It acts like a “knob” to control compression

The greater p is the more you compress the

-304 210 104 -69 10 20 -12 7 8 16 24 32 40 48 56 64
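The row above appears to pair the first row of Y with the first row of Q. Under that reading, and assuming the usual JPEG-style rule of dividing each coefficient by its quantization step and rounding to the nearest integer, the quantization of that one row can be sketched as follows. The variable names are illustrative.

```python
# First row of Y and first row of Q, read from the values above (assumed pairing).
y_row = [-304, 210, 104, -69, 10, 20, -12, 7]
q_row = [8, 16, 24, 32, 40, 48, 56, 64]

# Quantize: divide by the step size and round to the nearest integer.
quantized = [round(y / q) for y, q in zip(y_row, q_row)]

# Dequantize: multiply back by the step size; the rounding error is not recovered.
reconstructed = [c * q for c, q in zip(quantized, q_row)]

print(quantized)
print(reconstructed)
```

The small high-frequency coefficients round to zero, which is where the compression comes from, and the reconstructed row differs from the original because quantization is lossy.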

New Y: 14 terms New Y: 10 terms

Prof (Dr) P P Ghadekar, VIT Pune

