Adama Science and Technology University: Advanced Digital Signal Processing Project Presentation

ADAMA SCIENCE AND TECHNOLOGY UNIVERSITY
DEPARTMENT OF ELECTRONICS AND COMMUNICATION ENGINEERING

Advanced Digital Signal Processing
Project presentation

Speech Compression Using DCT

By Geleta Aman
ID No: PGR 35870/16
ABSTRACT
Speech compression is a fundamental part of modern communication systems, enabling efficient transmission and storage of audio data. The Discrete Cosine Transform (DCT) has emerged as a powerful tool for speech compression because it concentrates signal energy into a small set of coefficients. This paper presents an analysis of speech compression using the DCT, focusing on the mathematical underpinnings and practical implementation aspects. The trade-off between compression ratio and quality is examined with respect to parameters such as the threshold and the quantization step size.
Evaluation metrics including Signal-to-Noise Ratio (SNR) and Mean Squared Error (MSE) are used to assess the fidelity of the reconstructed speech signal. Through mathematical analysis and experimental validation, this study highlights the efficacy of DCT-based speech compression in achieving significant compression ratios while preserving perceptual quality. The findings contribute to the understanding and optimization of speech compression techniques, paving the way for enhanced audio communication systems in various domains.
INTRODUCTION

The objective of speech is communication, whether face to face or cell phone to cell phone. The large amount of data involved is a problem for both transmission and storage. Speech compression is the technology of converting human speech into an efficiently encoded representation that can later be decoded to produce a close approximation of the original signal. The major objective of speech compression is to represent speech with as few bits as possible while maintaining an acceptable level of quality.

A signal can be compressed by removing the redundancy between neighboring samples. In this paper the compression technique is implemented in two steps. In the first step, a transform is applied to the speech signal to obtain a new set of data with smaller values and more repetition. The second step is the coding (compression) step, which represents the data set in its minimal form using encoding techniques such as run-length encoding, Huffman encoding, or run-length encoding followed by Huffman encoding. The performance measures compression factor (CF), signal-to-noise ratio (SNR), peak signal-to-noise ratio (PSNR), normalized root mean square error (NRMSE), and retained signal energy (RSE) are computed for the reconstructed speech obtained with the DCT-based compression technique.
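
The coding step described above (run-length encoding followed by Huffman encoding) can be sketched as follows. This is a minimal illustration in Python, not the implementation used in the project: the function names and the sample list `quantized` are hypothetical, and the Huffman table is built directly over the run-length pairs.

```python
import heapq
from collections import Counter
from itertools import groupby


def run_length_encode(values):
    """Collapse runs of equal values into (value, run_length) pairs."""
    return [(value, len(list(run))) for value, run in groupby(values)]


def run_length_decode(pairs):
    """Expand (value, run_length) pairs back into the original sequence."""
    return [value for value, count in pairs for _ in range(count)]


def huffman_code(symbols):
    """Build a prefix code (symbol -> bit string) from symbol frequencies."""
    counts = Counter(symbols)
    if len(counts) == 1:  # a single distinct symbol still needs one bit
        return {next(iter(counts)): "0"}
    # Heap entries: (frequency, tie_breaker, {symbol: partial_code}).
    # The unique tie_breaker keeps the dicts from ever being compared.
    heap = [(freq, i, {sym: ""}) for i, (sym, freq) in enumerate(counts.items())]
    heapq.heapify(heap)
    tie = len(heap)
    while len(heap) > 1:
        f0, _, left = heapq.heappop(heap)
        f1, _, right = heapq.heappop(heap)
        merged = {s: "0" + c for s, c in left.items()}
        merged.update({s: "1" + c for s, c in right.items()})
        heapq.heappush(heap, (f0 + f1, tie, merged))
        tie += 1
    return heap[0][2]


# Hypothetical quantized coefficients: mostly zeros after thresholding.
quantized = [12, -3, 0, 0, 0, 0, 5, 0, 0, 0, 0, 0, 0, 1, 0, 0]
pairs = run_length_encode(quantized)          # e.g. (0, 4) for four zeros
code = huffman_code(pairs)                    # prefix code over the pairs
bitstream = "".join(code[p] for p in pairs)   # encoded representation
```

Run-length encoding pays off here because thresholding tends to leave long runs of zeros; the Huffman stage then gives shorter bit strings to the pairs that occur most often.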
Objectives
Here are four specific objectives of speech compression using DCT:
• Enhancing data storage efficiency by reducing the size of speech signals
• Minimizing bandwidth requirements for speech transmission
• Mitigating storage and transmission costs
• Preserving essential speech features while reducing redundancy, enabling efficient utilization of communication resources in various applications
Statement of the Problem:

Speech compression is a critical aspect of various applications including telecommunications, multimedia streaming, and storage systems. Efficient compression techniques are essential to reduce storage requirements and bandwidth usage while maintaining acceptable audio quality. In this context, the Discrete Cosine Transform (DCT) is a promising approach to speech compression.
SYSTEM DESIGN AND MATHEMATICAL ANALYSIS
Methodology for compression of speech signal

In this paper we implement a speech compression technique based on the DCT. With the DCT, a speech signal can be represented in terms of its DCT coefficients, so operations on the data can be performed using just those coefficients. The transform and the thresholding do not themselves compress the signal; they expose information about the signal that allows the data to be compressed by standard encoding techniques. Compression is achieved by treating small coefficients as insignificant and discarding them, and then applying quantization and an encoding scheme to the remaining coefficients.
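
A minimal sketch of the thresholding and quantization step, assuming a hard threshold and a uniform quantizer (the slides do not specify either choice). The coefficient values, the threshold of 1.0 and the step size of 0.5 are illustrative only.

```python
def hard_threshold(coeffs, threshold):
    """Zero every coefficient whose magnitude is below the threshold."""
    return [c if abs(c) >= threshold else 0.0 for c in coeffs]


def quantize(coeffs, step):
    """Uniform quantization: map each coefficient to an integer index."""
    return [int(round(c / step)) for c in coeffs]


def dequantize(indices, step):
    """Approximate the coefficients from their integer indices."""
    return [i * step for i in indices]


def retained_energy(original, kept):
    """Fraction of the original coefficient energy that survives."""
    total = sum(c * c for c in original)
    return sum(c * c for c in kept) / total if total else 1.0


# Hypothetical DCT coefficients of one frame (illustrative values only).
coeffs = [40.0, -12.5, 6.0, 0.8, -0.3, 0.1, 2.4, -0.05]
kept = hard_threshold(coeffs, threshold=1.0)   # four small values become 0
indices = quantize(kept, step=0.5)             # integers for the encoder
approx = dequantize(indices, step=0.5)         # what the decoder recovers
energy = retained_energy(coeffs, kept)         # close to 1 for this frame
```

Half of the coefficients are discarded in this example while almost all of the energy is kept, which is the property of the DCT that the method relies on. A larger threshold or step size gives more zeros and smaller indices at the cost of a larger reconstruction error.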
Steps in Speech Compression using DCT:
• Segmentation: Divide the speech signal into small segments or frames. Each frame typically consists of a few milliseconds of audio data.
• DCT Transformation: Apply the DCT to each frame of the speech signal.
• Quantization: Quantize the DCT coefficients by rounding them to a smaller number of bits or by using a quantization matrix. This step reduces the precision of the coefficients.
• Entropy Coding: Apply entropy coding techniques (e.g., Huffman coding) to further compress the quantized coefficients.
• Transmission/Storage: Transmit or store the compressed coefficients along with the side information (e.g., frame size, quantization parameters) needed to reconstruct the speech signal.
• Reconstruction: At the decoder side, reverse the compression process by applying the inverse steps: entropy decoding, dequantization, inverse DCT, and frame concatenation.
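
The steps above can be sketched end to end as follows. This is a pure-Python illustration under stated assumptions rather than the project's code: the orthonormal DCT-II is written out directly (in practice a library routine such as SciPy's `scipy.fft.dct` would normally be used), the entropy coding stage is omitted, and the test signal, frame size of 32 and step size of 0.05 are arbitrary choices.

```python
import math


def dct(frame):
    """Orthonormal DCT-II of one frame."""
    n = len(frame)
    out = []
    for k in range(n):
        scale = math.sqrt(1.0 / n) if k == 0 else math.sqrt(2.0 / n)
        out.append(scale * sum(
            x * math.cos(math.pi * (2 * i + 1) * k / (2 * n))
            for i, x in enumerate(frame)))
    return out


def idct(coeffs):
    """Inverse of dct() above (orthonormal DCT-III)."""
    n = len(coeffs)
    out = []
    for i in range(n):
        total = 0.0
        for k, c in enumerate(coeffs):
            scale = math.sqrt(1.0 / n) if k == 0 else math.sqrt(2.0 / n)
            total += scale * c * math.cos(math.pi * (2 * i + 1) * k / (2 * n))
        out.append(total)
    return out


def compress(signal, frame_size, step):
    """Segment, transform, and quantize; returns integer indices per frame."""
    padded = signal + [0.0] * (-len(signal) % frame_size)  # zero-pad last frame
    frames = [padded[i:i + frame_size] for i in range(0, len(padded), frame_size)]
    return [[round(c / step) for c in dct(f)] for f in frames]


def decompress(frames, step, length):
    """Dequantize, inverse-transform, and concatenate the frames."""
    samples = []
    for indices in frames:
        samples.extend(idct([i * step for i in indices]))
    return samples[:length]  # drop the padding added by compress()


# Synthetic stand-in for speech: a 200 Hz tone sampled at 8 kHz.
signal = [math.sin(2 * math.pi * 200 * t / 8000) for t in range(100)]
encoded = compress(signal, frame_size=32, step=0.05)
decoded = decompress(encoded, step=0.05, length=len(signal))
max_error = max(abs(a - b) for a, b in zip(signal, decoded))
```

The frame size and step size are the side information the decoder needs, which is why `decompress` takes them as arguments. The direct summation costs O(N²) per frame, which is acceptable for a sketch but not for real-time use.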
System Block Diagram
MATHEMATICAL ANALYSIS
Mathematical model
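
For reference, one commonly used form of the transform is the orthonormal DCT-II and its inverse, written out below. This is the textbook definition and is an assumption about what the mathematical model covers; the symbols C(k) and w(k) are notation introduced here, with x(n) a frame of N samples.

```latex
C(k) = w(k) \sum_{n=0}^{N-1} x(n)\,
       \cos\!\left(\frac{\pi (2n+1) k}{2N}\right),
       \qquad k = 0, 1, \dots, N-1

x(n) = \sum_{k=0}^{N-1} w(k)\, C(k)\,
       \cos\!\left(\frac{\pi (2n+1) k}{2N}\right),
       \qquad n = 0, 1, \dots, N-1

w(k) = \begin{cases}
         \sqrt{1/N}, & k = 0 \\
         \sqrt{2/N}, & 1 \le k \le N-1
       \end{cases}
```

With this scaling the transform preserves energy, so the energy of the discarded coefficients equals the energy of the reconstruction error they cause.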
RESULT AND DISCUSSION

Performance evaluation

To evaluate the overall performance of the proposed audio compression scheme, several objective tests were made. The quality of the reconstructed signal is measured using the compression factor, signal-to-noise ratio (SNR), peak signal-to-noise ratio (PSNR), and mean square error.
Signal to Noise Ratio (SNR)

$$\mathrm{SNR} = 10 \log_{10}\!\left(\frac{\sigma_x^2}{\sigma_e^2}\right)$$

where $\sigma_x^2$ is the mean square of the speech signal and $\sigma_e^2$ is the mean square difference between the original and reconstructed speech signals.
Peak Signal to Noise Ratio (PSNR)

$$\mathrm{PSNR} = 10 \log_{10}\!\left(\frac{N X}{\lVert x - x' \rVert^2}\right)$$

where $N$ is the length of the reconstructed signal, $X$ is the maximum absolute square value of the signal $x$, and $\lVert x - x' \rVert^2$ is the energy of the difference between the original and reconstructed signals.
Normalized Root Mean Square Error (NRMSE)

$$\mathrm{NRMSE} = \sqrt{\frac{\sum_n \left(x(n) - x'(n)\right)^2}{\sum_n \left(x(n) - \mu_x\right)^2}}$$

Here, $x(n)$ is the speech signal, $x'(n)$ is the reconstructed speech signal, and $\mu_x$ is the mean of the speech signal.
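
The measures defined above can be computed with a few lines of Python. This is a sketch under assumptions: the function names are hypothetical, PSNR is taken as the peak squared sample value divided by the mean squared error, and the compression factor is taken as the usual ratio of original size to compressed size, which the slides name but do not define. The sample values are illustrative.

```python
import math


def mse(x, y):
    """Mean squared error between original x and reconstruction y."""
    return sum((a - b) ** 2 for a, b in zip(x, y)) / len(x)


def snr_db(x, y):
    """Mean square of the signal over mean square of the error, in dB."""
    err = mse(x, y)
    if err == 0:
        return math.inf
    return 10 * math.log10(sum(a * a for a in x) / len(x) / err)


def psnr_db(x, y):
    """Peak squared sample value over the mean squared error, in dB."""
    err = mse(x, y)
    if err == 0:
        return math.inf
    return 10 * math.log10(max(a * a for a in x) / err)


def nrmse(x, y):
    """Error energy normalized by the signal's energy about its mean."""
    mean = sum(x) / len(x)
    num = sum((a - b) ** 2 for a, b in zip(x, y))
    den = sum((a - mean) ** 2 for a in x)
    return math.sqrt(num / den)


def compression_factor(original_size, compressed_size):
    """Assumed definition: original size divided by compressed size."""
    return original_size / compressed_size


# Illustrative values only: a short signal and a slightly off reconstruction.
x = [1.0, -2.0, 3.0, -4.0]
y = [1.1, -1.9, 2.9, -4.1]
results = (mse(x, y), snr_db(x, y), psnr_db(x, y), nrmse(x, y))
```

Higher SNR and PSNR and lower NRMSE indicate a reconstruction closer to the original; a perfect reconstruction is reported as infinite SNR and PSNR rather than raising a division error.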
RESULT AND DISCUSSION

Results

The results for compression factor, signal-to-noise ratio, PSNR, and mean square error for the speech signal using the DCT-based compression are summarized in Table 1.

Table 1
No   Error (MSE)   PSNR      RMSE       Size before compression   Size after decompression
1    3.0587e+04    21.8790   174.8914   110033                    110033
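
As a quick consistency check on the reported values, the square of the RMSE entry reproduces the Error entry, which suggests (the slides do not state it explicitly) that the Error column is the mean squared error:

```latex
\mathrm{RMSE}^2 = (174.8914)^2 \approx 30587.0 \approx 3.0587 \times 10^{4}
```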

CONCLUSION

In conclusion, speech signal compression can be achieved through various methods, and one of the simplest and most effective is the Discrete Cosine Transform (DCT). By applying the DCT, we can identify the coefficients of the speech signal that fall below a threshold and discard them, which reduces the size of the signal and allows efficient compression.

While numerous other transforms and techniques exist for speech signal compression, the DCT stands out as a simple and widely adopted method. Its effectiveness lies in its ability to represent the signal compactly in the frequency domain, enabling significant reductions in data size while preserving the essential information in the speech signal.
Thank you
