Adama Science and Technology University: Advanced Digital Signal Processing Project Presentation
Adama Science and Technology University: Advanced Digital Signal Processing Project Presentation
UNIVERSITY
By removing redundancy between neighboring samples signal can be compressed. In this paper we
have implemented compression technique in two steps, in 1st step a transform function is applied
on speech signal to get result with a new set of data with smaller values and more repetition, 2nd
step is coding(compression) step, this step will represent the data set in its minimal form by using
encoding techniques such as Run Length encoding, Huffman encoding, run length encoding
followed by Huffman encoding. Performance measures compression factor (CF), signal to noise
ratio (SNR), peak signal to noise ratio (PSNR), normalized root mean square error (NRMSE),
retained signal energy (RSE) is measured for reconstructed speech obtained DCT based speech
compression techniques.
Objectives
Here are four specific objectives of speech compression using DCT:
Enhancing data storage efficiency by reducing the size of speech
signals
Minimizing bandwidth requirements for speech transmission
Mitigating storage and transmission costs
Preserving essential speech features while reducing redundancy
enabling efficient utilization of communication resources in various
applications.
Statement of the Problem:
In this paper we are implementing speech compression technique based on DCT transform
method. in case of DCT transform speech can be represented in terms of DCT coefficient. Thus,
data operation can be performed using just the corresponding DCT coefficients. Transform
techniques and thresholding does not actually compress a signal, it simply provides information
about the signal, which allows the data to be compressed by standard encoding techniques.
Speech compression is achieved by neglecting small coefficients as insignificant data and
discarding them and then applying quantization and encoding scheme on coefficients.
SYSTEM DESIGN
Methodology for compression of speech signal
Steps in Speech Compression using DCT:
• Segmentation: Divide the speech signal into small segments or frames. Each frame typically
consists of a few milliseconds of audio data.
• DCT Transformation: Apply DCT to each frame of the speech signal.
• Quantization: Quantize the DCT coefficients by rounding them to a smaller number of bits or
by using a quantization matrix. This step reduces the precision of the coefficients.
• Entropy Coding: Apply entropy coding techniques (e.g., Huffman coding) to further compress
the quantized coefficients.
• Transmission/Storage: Transmit or store the compressed coefficients along with necessary
side information (e.g., frame size, quantization parameters) to reconstruct the speech signal.
• Reconstruction: At the decoder side, inverse the compression process by applying the inverse
steps: entropy decoding, dequantization, inverse DCT, and frame concatenation.
System Block Diagram
.
MATHEMATICAL ANALYSIS
Mathematical model
METHODOLOGY
Mathematical model
RESULT AND DISCUSSION
Performance evaluation
Performance evaluation
Where σx2 is the mean square of the speech signal and σe 2 is the mean
square difference between the original and reconstructed speech signal.
RESULT AND DISCUSSION
Performance evaluation
Peak Signal to Noise Ratio (PSNR)
Performance evaluation
Normalized Root Mean Square Error (NRMSE)
Here, X(n) is the speech signal, x‟(n) is reconstructed speech signal and
μ x(n) is the mean of speech signal.
RESULT AND DISCUSSION
Results
The results for Compression factor, Signal to Noise ratio, PSNR & Mean
square error for the speech signal using the DCT based compression are
summarized in table 1.
Results
CONCLUSION
While numerous other transforms and techniques exist for speech signal
compression, the utilization of DCT stands out as the simplest and widely
adopted method. Its effectiveness lies in its ability to efficiently represent
the signal in the frequency domain, enabling significant reductions in data
size while preserving essential information within the speech signal.
Thank you