Vector Quantization
1 Introduction
Changing the quantization dimension from one (scalar) to many (vector) has several important implications. First of all, VQ no longer necessarily corresponds to rounding data values to coarse levels. The VQ stage produces indices that represent vectors formed by grouping samples. The output index, which is an integer, has little or no physical relation to the vector it represents, which is formed by grouping real- or complex-valued samples. The word "quantization" in VQ comes from the fact that similar vectors are represented by the same index. Therefore, many distinct vectors in the multi-dimensional space are quantized to a single vector, which is represented by the index. Each index corresponds to a previously decided vector.
In that respect, the number of distinct indices defines the number of quantization levels. It is reasonable to argue that the quantization index of a data vector should be selected according to the nearest vector in the set of previously decided vectors (which is called the VQ codebook). As an example, if the considered vector x is nearest to an element of the codebook, say vi, then the VQ output is simply i. In the de-quantization stage, the index i is reconstructed as the vector vi, so one can say that x is quantized to vi.
Assigning indices to a number of vectors has implications beyond coding [1]. Since vectors near vi are indexed as i and those near vj are indexed as j, the quantizer automatically provides clustering information around the codebook vectors. Clustering of vectors is commonly used in solving classification problems [2], and classification of data is a major element of pattern recognition. As a result, many VQ algorithms developed for signal coding have analogous counterparts in the pattern classification and recognition literature. ISODATA [3], k-Nearest Neighbor (k-NN) [4], and Self-Organizing feature Maps (SOM) [5] are popular clustering methods whose algorithms are very similar to those used for designing VQ codebooks, such as the Max-Lloyd [6],[7] and Linde-Buzo-Gray (LBG) [8] algorithms. Another common VQ application is "color reduction" for images. Many acquisition devices produce color images by allocating 8 bits to each of the red, green, and blue components of a pixel, for a total of 24 bits/pixel. Due to display buffer limitations or storage requirements, it is desirable to reduce the number of bits assigned to each pixel. This is done by vector quantizing the RGB components to a smaller number of indices [9],[10].
This chapter is organized as follows. First, the basic concepts of a vector quantizer are presented, together with the issue of distortion and several metrics used in the design and implementation of a VQ. Second, the properties of a minimum distortion VQ and the necessary equations for optimality are presented. In the third section, the basic iteration that optimizes the codebook for a given set of data is presented and several VQ codebook design techniques are introduced. Finally, some typical VQ applications are presented at the end of the chapter.
In this module, there are a number of distinct vectors (called the code vectors or codewords, vi) that form a set (called the codebook, C). The encoder module searches the codebook for the nearest match to the input vector. If the i-th code vector vi gives the nearest match, the encoder output is simply the index i.
Fig. 2. Comparing the input vector to the code vectors in the codebook.
Normally, the term “vector quantizer” (Q) is used for the combination of the
encoder and decoder modules. In terms of mathematical notations,
i = E(x),
vi = D(i), and (1)
vi = Q(x) = D(E(x)).
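To make this notation concrete, here is a minimal Python sketch (illustrative only, not from the chapter; the codebook values and function names are made up) of E, D, and Q = D(E(·)) under the Euclidean distance:

import numpy as np

def vq_encode(x, codebook):
    # E(x): index of the code vector nearest to x (Euclidean distance)
    dists = np.linalg.norm(codebook - x, axis=1)
    return int(np.argmin(dists))

def vq_decode(i, codebook):
    # D(i): reproduce the code vector addressed by the index
    return codebook[i]

def vq_quantize(x, codebook):
    # Q(x) = D(E(x))
    return vq_decode(vq_encode(x, codebook), codebook)

# Example with a hypothetical 2-D codebook of N = 4 code vectors
C = np.array([[0.0, 0.0], [1.0, 1.0], [2.0, 0.5], [0.5, 2.0]])
x = np.array([0.9, 1.2])
i = vq_encode(x, C)          # -> 1
print(i, vq_quantize(x, C))  # x is quantized to C[1] = [1.0, 1.0]

Only the integer i would be transmitted; the decoder recovers C[i] by a table look-up.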
Fig. 3. Generating a code vector according to the input index at the decoder.
A vector quantizer is characterized by two parameters:
• the vector dimension, k, and
• the codebook size, N.
Formally, the quantizer maps the k-dimensional Euclidean space onto the codebook:

  Q : R^k → C    (2)
For coding purposes, the codebook is usually known by both the transmitter
(encoder) and receiver (decoder) parts. Therefore, only the integer output (the
index) of the encoder is transmitted.
On the other hand, the codebook itself represents a useful partitioning of the k-dimensional Euclidean space R^k into N regions, Ri. Each region Ri is directly defined by the quantizer in such a way that, if the encoder produces index i for the input vector x, then x is in region Ri. These regions are also known as Voronoi cells. The i-th cluster is, therefore, determined as the set of all vectors in the data set closest to the vector vi. Mathematically,

  Ri = {x ∈ R^k | Q(x) = vi}    (3)

This means that Ri is the set of all points which are closer to vi than to all other code vectors. A region can be bounded (a granular cell) with finite k-dimensional volume, or unbounded (an overload cell).
In other words, if x is closer to vi than to any other code vector, then Q(x) = vi. Notice that this definition is very suitable when using VQ for grouping or clustering purposes [11]: the encoder index immediately specifies the cluster to which the input belongs.
We have been using the phrase "nearer to one of the code vectors than to the others" in the encoder stage since the beginning of the section, so what is meant by "nearer" should be clarified. For most practical purposes, the Euclidean distance between two vectors is used for measuring how near they are. On the other hand, we will see that several other distance measures can also be used (Chapter 2 of [12]). The only constraint on the definition of the distance is that it must be a proper metric 1. If the encoder and decoder use a proper metric for measuring the distance of the input vector to a code vector in C, then each region Ri must be convex 2.
Figure 4 shows a 2-D VQ region partitioning. The regions in the central por-
tions are bounded cells, and the ones that extend out of the center (dashed
line) are unbounded cells. Notice that this partitioning is proper, in the sense
that the regions are all convex.
The final remark about the general structure of VQ concerns its ability to optimize compression performance for inputs that are grouped to form vectors.
1 A proper metric is a distance measure D(·, ·) on a metric space which satisfies four properties for vectors a, b, and c:

  nonnegativity: D(a, b) ≥ 0
  reflexivity: D(a, b) = 0 ⇔ a = b
  symmetry: D(a, b) = D(b, a)
  triangle inequality: D(a, b) + D(b, c) ≥ D(a, c)    (4)

2 A Euclidean region is convex if the line segment connecting any two points in the region also lies entirely inside the region. A more general definition for convex sets is: if α and β are members of a convex set, then λα + (1 − λ)β is also a member of the set for 0 ≤ λ ≤ 1.
Shannon has shown that, if we have a coding system which maps input vectors into one of N indices with the best possible coding performance 3, a VQ can perform as well as this "best" encoder [13]. The way to reach this performance is through the optimization of the regions and code vectors, which will be described in the next section. Since the minimization is with respect to a distortion, several distortion metrics can be formulated, which all yield different optimization results [14],[15]. The most commonly used metrics are:
• Minkowski metric:

  d_L(x, v_i) = \left( \sum_{m=1}^{k} |x(m) - v_i(m)|^L \right)^{1/L}    (5)

• Manhattan (city-block) distance, the L = 1 special case:

  d_M(x, v_i) = \sum_{m=1}^{k} |x(m) - v_i(m)|    (7)

• Hamming distance:

  d_H(x, v_i) = \sum_{m=1}^{k} \left( 1 - \delta_{x(m), v_i(m)} \right),    (9)

  where

  \delta_{\alpha,\beta} = \begin{cases} 1, & \alpha = \beta \\ 0, & \alpha \neq \beta \end{cases}

• Mahalanobis distance:

  d_R(x, v_i) = (x - v_i)^T C_x^{-1} (x - v_i),    (10)

  where C_x denotes the covariance matrix of the input data.
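As a concrete illustration, the following Python functions (a minimal sketch with freely chosen names, not part of the chapter) compute the Minkowski, Manhattan, Hamming, and Mahalanobis distances between two vectors:

import numpy as np

def minkowski(x, v, L=2):
    # Eq. (5); L = 2 gives the Euclidean distance, L = 1 the Manhattan distance
    return np.sum(np.abs(x - v) ** L) ** (1.0 / L)

def manhattan(x, v):
    # Eq. (7)
    return np.sum(np.abs(x - v))

def hamming(x, v):
    # Eq. (9): number of positions in which the two vectors differ
    return int(np.sum(x != v))

def mahalanobis(x, v, Cx):
    # Eq. (10): Cx is the covariance matrix of the input data
    d = x - v
    return float(d @ np.linalg.inv(Cx) @ d)

x = np.array([1.0, 2.0, 3.0])
v = np.array([1.0, 0.0, 4.0])
print(minkowski(x, v), manhattan(x, v), hamming(x, v))
print(mahalanobis(x, v, np.eye(3)))   # equals the squared Euclidean distance here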
Many other distortion metrics can be developed depending on the application. Using a distortion metric d(·, ·), the overall VQ diagram can be re-illustrated as in Figure 5.
[Fig. 5: encoder and decoder modules, each holding the same codebook C = {v1, v2, ..., vN}.]
3 Minimum Distortion VQ
In Section 2, it was pointed out that the index of the output vector is determined according to the minimum distance rule. Vector quantizers of this kind are known as "nearest neighbor" quantizers. The "nearest neighbor" rule is required for the optimality of the encoder.
The justification of the nearest neighbor rule for encoder optimality is quite simple: if a vector is quantized to a code vector which is not the nearest to the input vector, then the distortion is increased. For that reason, the optimal encoder must search the whole codebook for the smallest distance d(x, vi):

  d(x, Q(x)) = \min_{i = 1, \dots, N} d(x, v_i)    (12)
This minimum distance rule also statistically minimizes the expected (average) distortion.
The second optimality criterion is about the decoder part, satisfied by finding the optimum codebook. In other words, if we are given the clustering regions, we must find the best representing code vector for each region. Statistically, the code vector vi in a region Ri must be selected in such a way that the expected distortion it makes with any input vector x that lies inside region Ri is minimized:

  v_i = \arg\min_{v} E\{ d(x, v) \mid x \in R_i \}    (14)
The overall distortion can then be written as

  D = \sum_{i=1}^{N} \int_{R_i} d(x, v_i) f_X(x)\, dx = \sum_{i=1}^{N} P_i \int_{R_i} d(x, v_i) f_{X|i}(x)\, dx,    (16)

where Pi is the probability of x being in region Ri, and f_{X|i}(x) is the conditional pdf of x given x ∈ Ri. From the definition of the expected value,

  E\{ d(x, v) \mid x \in R_i \} = \int_{R_i} d(x, v) f_{X|i}(x)\, dx.    (17)
Since the centroid minimizes the expected value on the left of Equation 17 (for the squared-error distortion), it also minimizes the integral on the right. Therefore, each term of the sum in Equation 16, and hence the distortion, is minimized. The optimum code vector is thus the centroid of its region:

  v_i = \int_{R_i} x f_{X|i}(x)\, dx = \frac{\int_{R_i} x f_X(x)\, dx}{\int_{R_i} f_X(x)\, dx}.    (18)
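For the squared-error distortion, a one-line decomposition (not spelled out in the chapter, but standard) shows why the conditional mean is the optimal code vector:

  E\{\|x - v\|^2 \mid x \in R_i\} = E\{\|x - \bar{x}_i\|^2 \mid x \in R_i\} + \|\bar{x}_i - v\|^2,
  \qquad \text{where } \bar{x}_i = E\{x \mid x \in R_i\},

since the cross term 2(\bar{x}_i - v)^T E\{x - \bar{x}_i \mid x \in R_i\} vanishes. The second term is nonnegative and equals zero only when v = \bar{x}_i, so the centroid of Equation 18 is the unique minimizer.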
For the optimum VQ, the encoder and decoder optimality criteria must be satisfied simultaneously. For a given number of code vectors, say N, the goal is to achieve the minimum distortion by selecting the code vectors, and hence the corresponding regions. The properties and equations were described in Section 3. There are a number of methods proposed for obtaining the optimal or a sub-optimal quantizer from the given data. One of the most commonly used techniques is the Generalized Lloyd Algorithm [6],[7], which improves the codebook iteratively starting from an initial codebook. A one-dimensional version of this algorithm is used for scalar quantizer design. Furthermore, this algorithm is commonly referred to as k-means [4] or ISODATA [3] in the clustering literature.
Each Lloyd iteration consists of two steps: given the current codebook, the data vectors are first partitioned into regions using the nearest neighbor rule; then each code vector is replaced by the centroid of its region. For the Euclidean distance, the centroid is simply the arithmetic average of the data vectors assigned to a cluster. For other distance measures, the centroid calculation differs 4.
These two steps are performed iteratively until the overall distortion no longer decreases, or the distortion improvement falls below a certain threshold after an iteration. Notice that each iteration step must reduce or keep the distortion level. In some cases, empty regions may occur; in that case, a new code vector is assigned, or the codebook size N is reduced.
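The iteration can be sketched in Python as follows (an illustrative implementation under the Euclidean distance; the function name lloyd and the handling of empty regions by keeping the old code vector are my own choices, not the chapter's):

import numpy as np

def lloyd(data, codebook, max_iter=100, tol=1e-6):
    """Generalized Lloyd iteration: alternately re-partition the data
    (nearest neighbor rule) and re-compute centroids, until the average
    distortion stops improving."""
    prev_distortion = np.inf
    for _ in range(max_iter):
        # Step 1: assign each data vector to its nearest code vector
        dists = np.linalg.norm(data[:, None, :] - codebook[None, :, :], axis=2)
        labels = np.argmin(dists, axis=1)
        distortion = np.mean(np.min(dists, axis=1) ** 2)
        # Step 2: replace each code vector by the centroid of its region;
        # an empty region keeps its old code vector (one possible remedy)
        for i in range(len(codebook)):
            members = data[labels == i]
            if len(members) > 0:
                codebook[i] = members.mean(axis=0)
        if prev_distortion - distortion < tol:
            break
        prev_distortion = distortion
    return codebook, labels, distortion

# Toy usage: 2-D training data, initial codebook of N = 4 randomly chosen data vectors
rng = np.random.default_rng(0)
data = rng.normal(size=(500, 2))
init = data[rng.choice(len(data), 4, replace=False)].copy()
codebook, labels, D = lloyd(data, init)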
The Lloyd iteration is a very general method for optimization. However, there
are several more VQ design techniques, some of them relying on the Lloyd
iteration as intermediate steps. We will name a few of these methods and
indicate their basic ideas here.
(a) Random Coding: Over the whole set of data vectors, one chooses N of the vectors randomly and assigns them as the code vectors. This is a very empirical technique; however, if the data is strongly correlated, it may yield acceptable results.
(b) Pruning: In this case, the data vectors are examined sequentially and appended to the codebook depending on their distance to the code vectors already in the codebook. If a new vector is far from every existing code vector, it is added to the codebook [16].
(c) Pairwise Nearest Neighbor: The algorithm combines the clusters which have the nearest centroids, and continues combining as long as the number of clusters is more than the desired codebook size. Initially each data vector forms its own cluster, and the clusters grow iteratively, collecting more data vectors [17]. The combination of two clusters produces a new centroid corresponding to the weighted average of the combined centroids. Therefore, unlike in the previous methods, the code vectors do not necessarily correspond to data vectors.
(d) Product Codes: If the codebook size is represented as N = 2^{kR}, then a Cartesian product of k scalar quantizers with 2^R levels each can be used as the vector quantizer [18].
4 For a cluster containing M data vectors x_1, ..., x_M, the centroid corresponding to each distance measure is different; for example:

  v_{Euc} = \frac{1}{M} \sum_{k=1}^{M} x_k

  v_{Man}(i) = \{\, x \mid P(x_j(i) > x(i)) = P(x_j(i) < x(i)) \,\} \quad \text{(the componentwise median)}

  v_{Che}(i) = \left( \min_{j=1,\dots,M} x_j(i) + \max_{j=1,\dots,M} x_j(i) \right) / 2 \quad \text{(the componentwise midrange)}

  v_{Ham}(i) = \{\, x_k(i) \mid P(x_k(i)) > P(x_l(i)) \;\; \forall l \,\} \quad \text{(the most frequent value)}
  v_{Mah} = \left( \sum_{j=1}^{M} C_{x_j}^{-1} \right)^{-1} \sum_{j=1}^{M} C_{x_j}^{-1} x_j
(e) Lloyd Iteration with Stochastic Relaxation: A zero-mean noise is added to the centroids generated by each iteration of the Lloyd algorithm, and the noise power is decreased as the iterations proceed [19]. If the random noise is generated according to a temperature parameter, Tm, which is decreased as the iteration number m increases, then this technique is also considered a form of Simulated Annealing.
(f) Simulated Annealing: As a special case of the stochastic relaxation approach, noise is added to the centroids (which is called a perturbation) and the perturbed centroid is accepted with probability P = e^{−ΔH/T}, where ΔH is the increase in cost caused by the perturbation and T is the temperature, which is decreased as the iterations proceed [20],[21].
(g) Fuzzy Clustering: The inclusion of a data vector in a cluster is not assigned the binary values 0 and 1. Instead, a fuzzy membership value between 0 and 1 (Sj(xi): the degree of membership of xi in region Rj) is assigned [22]. In this way, the membership of a data vector in the considered region is only partial, and a new fuzzy distortion definition is used:

  D_F = \frac{1}{M} \sum_{i=1}^{M} \sum_{j=1}^{N} d(x_i, v_j) \, [S_j(x_i)]^q .

In the Lloyd iteration, the parameter q is initially selected as a large number (indicating high fuzziness) and decreased gradually down to 1.
(h) Linde-Buzo-Gray (splitting) Algorithm: Probably the most conventional method that utilizes the Lloyd iteration is the Linde-Buzo-Gray (LBG) algorithm [8]. The algorithm starts with a single code vector (normally the average of the data vectors). This code vector is split into two by adding and subtracting a vector of small magnitude, along the direction of maximum variation in the vector space. With these two new vectors, the Lloyd iteration is applied and optimum code vectors for a codebook of size 2 are obtained. The LBG algorithm iteratively splits each code vector into two using the above perturbation, then applies the Lloyd iteration again, until the desired number of code vectors is obtained (a sketch of this splitting procedure is given below). This is a very convenient method to design the complete codebook from scratch without the risk of obtaining empty or unbalanced clusters.
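As an illustration of the splitting idea in item (h), here is a minimal Python sketch (my own simplified version; perturbing along the coordinate of largest variance is one simple assumption for the "direction of maximum variation"):

import numpy as np

def refine(data, codebook, n_iter=20):
    # A few Lloyd iterations: nearest-neighbor partition, then centroid update
    for _ in range(n_iter):
        d = np.linalg.norm(data[:, None, :] - codebook[None, :, :], axis=2)
        labels = np.argmin(d, axis=1)
        for i in range(len(codebook)):
            members = data[labels == i]
            if len(members) > 0:
                codebook[i] = members.mean(axis=0)
    return codebook

def lbg(data, target_size, eps=1e-3):
    # Start from the global mean, then repeatedly split every code vector and refine
    codebook = data.mean(axis=0, keepdims=True)
    while len(codebook) < target_size:
        direction = np.zeros(data.shape[1])
        direction[np.argmax(data.var(axis=0))] = eps   # small perturbation vector
        codebook = np.vstack([codebook + direction, codebook - direction])
        codebook = refine(data, codebook)
    return codebook

rng = np.random.default_rng(1)
data = rng.normal(size=(1000, 2))
codebook = lbg(data, target_size=8)   # codebook grows 1 -> 2 -> 4 -> 8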
There are other variations on the quantizer design technique, too. For instance, depending on the general structure of the input, it may be desirable to set up a quantizer with a fixed structure. Lattice vector quantizers are popular for this purpose; their clusters are selected according to a geometrical grid, most commonly hexagonal.
Quantizer improvements are also studied in the literature. The most commonly
used improvements can be listed as:
• Gain-Shape VQ: If the input data shows significant dynamic range variations, the codebook needs to be very large to achieve a fairly small distortion. To remedy this situation, the input vectors can first be normalized and then vector quantized. The normalization factor needs to be encoded separately [1].
• Mean-Removed VQ: In many images, the vector segments may contain similar shape characteristics but, because of intensity variations, be quite far from each other according to the distance metrics. Quantizing such vectors to the same code vector improves the efficiency. This is possible if the mean of each vector is subtracted from it and the resulting vectors are quantized. Similar to the situation above, the mean values must be encoded separately.
• Classified VQ: If the input data contains multiple patterns that exhibit large spatial differences from each other, while vectors generated from the same pattern are quite similar, then designing a separate quantizer for each pattern and applying the appropriate quantizer to each vector improves the efficiency of the quantizer [23]. Usually, there is an overhead of transmitting the information about which codebook the encoder uses.
• Multistage VQ: This method significantly reduces encoder complexity and memory requirements [24]. The idea is to quantize the input coarsely at the first stage, and then iteratively quantize the difference between the signal and its coarsely quantized version. As an example, if we have three quantizers Q1, Q2, and Q3, and an input x, then (see the sketch after this list)

  y1 = Q1(x)
  y2 = Q2(x − Q1(x)) = Q2(x − y1)
  y3 = Q3(x − Q1(x) − Q2(x − Q1(x))) = Q3(x − y1 − y2)
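To show this residual structure concretely, the following Python toy (an assumption-laden sketch: the stages are simple scalar uniform quantizers with made-up step sizes, not designed codebooks) computes y1, y2, y3 and the reconstruction y1 + y2 + y3:

import numpy as np

def make_uniform_quantizer(step):
    # A simple uniform quantizer used as one stage
    return lambda x: step * np.round(x / step)

# Three stages with successively finer step sizes (arbitrary choices)
Q1, Q2, Q3 = (make_uniform_quantizer(s) for s in (1.0, 0.25, 0.0625))

x = np.array([0.73, -1.42, 2.18])
y1 = Q1(x)                 # coarse approximation
y2 = Q2(x - y1)            # quantized first-stage residual
y3 = Q3(x - y1 - y2)       # quantized second-stage residual

x_hat = y1 + y2 + y3       # decoder: sum of the stage outputs
print(x_hat, np.abs(x - x_hat).max())   # error is at most half of the last step size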
5 VQ Applications and Examples
5.1 Compression
The most widely used application of VQ is data compression [27]. The input data can be compressed by a VQ at the expense of distortion. From an information theoretical perspective, distortion and rate are inversely related quantities: if the compression is high, the rate decreases but the distortion increases. For a given source signal, the rate/distortion curve typically has the shape shown with dashed lines in Figure 6. This curve is obtained by evaluating the minimum distortion achieved by the best encoder at a given rate. On the other hand, the characteristic of a vector quantizer usually looks like the solid, staircase-like line. Using the optimization techniques described in the previous section, it is desired that the solid line touch the minimum rate/distortion curve at the given rate.
[Fig. 6: rate versus distortion; the dashed line is the minimum rate/distortion curve and the solid staircase is the characteristic of a practical vector quantizer.]
Commonly used de-correlating transforms include the Discrete Cosine Transform (DCT, the transform that is used in JPEG), Wavelet transforms, and input-specific transforms such as the Karhunen-Loeve Transform and Singular Value Decomposition. The common point of most of these transforms is that they reduce the correlation between the samples of the input vector, and hence compact most of the energy of the vector into only a few of the output vector elements. The operation is reversible by the use of the inverse transforms. Another de-correlating method is called predictive coding, where each element in a sequence is first predicted from the previous elements of the same sequence, and only the prediction difference is generated as the output. Using the same prediction algorithm, and given the previously decoded elements, the decoder can regenerate the same prediction and add the prediction difference to reconstruct the signal.
After either of these methods, the signal samples mostly contain small values, which can be safely quantized to zero. As an example, in the JPEG image compression standard, an image is typically divided into 8 by 8 segments and the DCT of these segments is computed. This transform causes a majority of the transform coefficients to have values that will be quantized to zero. In order to better understand the efficiency of transformation followed by quantization, consider the example of an input vector x = [1.2, 1.1, 0.9, 0.8]. Assume that, in order to achieve some compression, we want to quantize its elements by truncating the samples to the greatest smaller integer (⌊·⌋). If we apply this quantization without any transformation, the output vector would be x̃ = [1.0, 1.0, 0.0, 0.0], and the distortion would be

  \left( \frac{1}{4} \sum_{i=1}^{4} (x(i) - \tilde{x}(i))^2 \right)^{1/2} = 0.6124.

Now, instead of direct quantization, let us first apply the DCT to the input signal and obtain a new vector c = DCT{x} = [2.0, 0.3, 0, 0]. Applying the quantization to c, we get c̃ = [2, 0, 0, 0]. Taking the inverse DCT, we obtain x̃ = [1.0, 1.0, 1.0, 1.0], and the distortion is only 0.1581.
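The numbers in this example can be reproduced with a few lines of Python (a sketch; np.trunc, truncation toward zero, coincides with the floor operation for the retained values here, all of which are non-negative):

import numpy as np
from scipy.fft import dct, idct

def rms(a, b):
    # RMS distortion between the original and the quantized vector
    return np.sqrt(np.mean((a - b) ** 2))

x = np.array([1.2, 1.1, 0.9, 0.8])

# Direct quantization by truncation
x_direct = np.trunc(x)                       # [1, 1, 0, 0]
print(rms(x, x_direct))                      # ~0.6124

# DCT, quantize the coefficients, then inverse DCT
c = dct(x, norm='ortho')                     # approximately [2.0, 0.32, 0.0, -0.02]
c_q = np.trunc(c)                            # [2, 0, 0, 0]
x_tilde = idct(c_q, norm='ortho')            # [1, 1, 1, 1]
print(rms(x, x_tilde))                       # ~0.1581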
Fig. 7. (a) An 8 × 8 segment of an image, and (b) its DCT.
Fig. 8. DPCM (a) encoder, and (b) decoder.
In the following example, an image is divided into square blocks, and the LBG and Random Coding algorithms (described in Section 4) are used as the design methods.
Case 1: 4 × 4 blocks: First, let us consider the sixteen 4 × 4 code vectors generated by the LBG algorithm (shown in Figure 10(a)). It is quite interesting to see that all of the 4 × 4 sub-blocks of the original image could be represented quite efficiently by one of the vectors in this codebook. Indeed, the total reconstruction distortion is only 17.83 (corresponding to a PSNR of 23.11 dB). The image quantized with this codebook is shown in Figure 11(a). Perhaps more interesting is that the code vectors generated by the Random Coding algorithm (shown in Figure 10(b)) could also produce an acceptable performance of 22.98 dB PSNR (shown in Figure 11(b)). Note that Random Coding has significantly lower computational complexity. For both cases, the compression ratio is CR = (4 × 4 × 8) : (log2 16) = 128 : 4 = 32 : 1.
Fig. 10. Sixteen 4×4 codevectors generated by (a) LBG algorithm, and (b) Random
Coding method.
Case 2: 8 × 8 blocks: Finally, the same compression ratio of 32:1 could also
Fig. 11. Quantized images with N = 16 and block size of 4 × 4 using (a) LBG
algorithm, and (b) Random Coding method.
Fig. 12. (a) 32 code vectors generated by the LBG algorithm, (b) Quantized image
using this codebook.
Fig. 13. Thirtytwo 8 × 8 codevectors generated by (a) LBG algorithm, and (b)
Random Coding method.
Fig. 14. Quantized images with N = 32 and block size of 8 × 8 using (a) LBG
algorithm, and (b) Random Coding method.
It can be seen that using a vector block size of 4 × 4 (Case 1) produces better results than using 8 × 8 blocks (Case 2) at the same compression ratio. The reason is that 4 × 4 blocks exhibit higher inter-pixel correlations than 8 × 8 blocks do; therefore, 4 × 4 code vectors represent the input vectors more efficiently.
Fig. 15. Three types of objects in an image.
Fig. 16. Area/Perimeter scatter of the objects, and their classification regions.
This example also indicates that VQ over higher dimensions provides more efficient feature clustering. If both the area (a) and the perimeter (p) are used together to form the input vector x = (a, p), then the scatter plot depicted in Figure 16 is obtained. Running a simple 3-level VQ design produces the clustering drawn with the dashed lines; hence, an efficient classification is obtained.
  x_{m,n} ∈ Ω,    (20)

where x_{m,n} is an image pixel at an arbitrary location (m, n), and Ω is the set Ω = {(r, g, b) | 0 ≤ r, g, b ≤ 255}. In this representation, each image pixel is represented by three primary colors, r, g, and b, each with 8 bits. In a normal color image, the human eye does not distinguish this many colors.
Therefore, there is a representation redundancy. Similarly, many computer monitors (with frame buffers) or printers are not capable of reproducing 24-bit colors. For these reasons, it is desirable to reduce the number of colors from 2^24 to, for instance, 2^8 = 256 [9],[10].
[Figure: a table enumerating the possible (r, g, b) triplets (0,0,0), (0,0,1), (0,0,2), ..., each assigned an index i.]
Color reduction need not be applied only to true color images; sometimes, even 256-level gray scale images are color-reduced to 16 levels. Because fewer colors are produced, color reduction must be performed carefully according to visual perception parameters. Usually, colormap images tend to exhibit contouring effects around smooth regions (Fig. 19(b)). To produce images with better perceptual characteristics, the customary practice is dithering, where a pseudo-random noise is added to the colormap image (Fig. 19(c)).
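A color reduction of this kind can be sketched with SciPy's vq module (illustrative only; the array img is assumed to be an H × W × 3 uint8 RGB image loaded elsewhere, and using k-means to design the 256-entry colormap is one common choice):

import numpy as np
from scipy.cluster.vq import kmeans2, vq

def reduce_colors(img, n_colors=256, seed=0):
    # Treat every pixel as a 3-D vector (r, g, b) and design the colormap by k-means
    pixels = img.reshape(-1, 3).astype(np.float64)
    colormap, _ = kmeans2(pixels, n_colors, seed=seed, minit='++')
    # Encode: each pixel is replaced by the index of its nearest colormap entry
    indices, _ = vq(pixels, colormap)
    # Decode: look the indices back up in the colormap
    reduced = colormap[indices].reshape(img.shape)
    return np.clip(np.round(reduced), 0, 255).astype(np.uint8), colormap, indices

# reduced, colormap, indices = reduce_colors(img, n_colors=256)

The indices array is what a colormap (palette) image actually stores; the colormap itself plays the role of the VQ codebook.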
[Figure: each pixel of the quantized image stores only an index i; the decoder looks up the corresponding (r, g, b) code vector in the colormap to obtain the color of pixel (m, n).]
References
[5] T. Kohonen, Self Organization and Associative Memory, 3rd Ed., Springer-
Verlag, Berlin, 1989.
[8] Y. Linde, A. Buzo, R. M. Gray, “An algorithm for vector quantizer design,”
IEEE Trans. on Communications, Vol. 28, pp.84-95, January 1980.
[11] A. K. Jain, R. C. Dubes, Algorithms for Clustering Data, Prentice-Hall Inc., NJ, 1988.
[12] R. M. Gray, Source Coding Theory, Kluwer Academic Press, Boston, MA,
USA, 1990.
[17] W. H. Equitz, “A new vector quantization clustering algorithm,” IEEE Trans.
on A.S.S.P., pp.1568-1575, October 1989.
[18] M. J. Sabin, R. M. Gray, “Product code vector quantizers for waveform and
voice coding,” IEEE Trans. on A.S.S.P., Vol. 32, pp.474-488, June 1984.
[32] J. D. Murray, W. vanRyper, Encyclopedia of Graphics File Formats, O’Reilly
and Associates, Inc., Sebastopol, CA, 1994.