SlideShare a Scribd company logo
NOVATEUR PUBLICATIONS
INTERNATIONAL JOURNAL OF INNOVATIONS IN ENGINEERING RESEARCH AND TECHNOLOGY [IJIERT]
ISSN: 2394-3696
VOLUME 2, ISSUE 12, DEC.-2015
1 | P a g e
SPEECH COMPRESSION TECHNIQUES: A REVIEW
Pujari Bhavana C.
ME Student
Amrutvahini College of engineering,Sangamner
ABSTRACT
Speech is the vocalizer form of human communication,and based upon the syntactic
combination of lexical and vocabularies. The aim of speech coding is to compress the speech
signal to the highest possible compression ratio but maintaining user acceptability.There are
many methods for speech compression like Linear Predictive coding (LPC) , Code Excited
Linear Predictive coding (CELP) ,Sub-band coding ,Transform coding :- Fast Fourier
Transform (FFT) ,Discrete Cosine Transform (DCT) , Continuous Wavelet Transform
(CWT) , Discrete Wavelet Transform (DWT) ,Variance Fractal Compression (VFC) ,
Discrete Cosine Transform (DCT),Psychoacoustics andetc. Few of them are discus in this
paper.
KEYWORDS: Compression, LPC,DWT,DCT.
INTRODUCTION
Speech compression is nothing but reduction of number of bits needed to represent the signal
used for storage purpose and transmission. The ideal goal of speech compression is to contain
original information in as minimum bits as possible.The reasons for compressing the signal is
Cost of disk, Cost of data management,Memory,Bandwidth and transfer speed.There are two
basic types of compression lossy and lossless.
LOSSLESS COMPRESSION:-
In this type of compression signal after compression is same as before,no information has
been loosed i.e. the original signal can be perfectly recovered from the compressed signal. It
is mainly used in application where it is necessary that the original signal and the de-
compressed signal are almost same.
Examples: Entropy Encoding (Shannon-Fano Algorithm, Huffmann coding, Arithmetic
Coding) Run-length, Lempel Ziv Welch (LZW) Algorithm.
Lossy compression:-In this type of compression,some degree of information has been lossed.
The original signal cannot be perfectly recovered from the compressed signal,but it gives its
best possible quality for the given technique. Lossy compression typically attain far better
compression than lossless by discarding less-critical data. Theaim of this technique is to
minimize the amount of data that has to be transmitted.They are mostly used for multimedia
data compression.
Ex: FFT,DCT,DWT.
1.1 Linear Predictive coding (LPC):
LPC is most commonly used in speech coding due to effectiveness of LPC coefficient in
modelling vocal tract associated with speech production.LPC is used to estimate basic speech
NOVATEUR PUBLICATIONS
INTERNATIONAL JOURNAL OF INNOVATIONS IN ENGINEERING RESEARCH AND TECHNOLOGY [IJIERT]
ISSN: 2394-3696
VOLUME 2, ISSUE 12, DEC.-2015
2 | P a g e
parameters like pitch formant and spectra.The principle behind the use of LPC is to minimize
LPC coefficient .This LPC coefficient is estimated in energy frame size of 20ms long.
LPC analysis of each frame involves decision making process of concluding if sound is
voiced or unvoiced.If sound is decided to be voiced,on impulse train is used to represent it
with non zero taps occurring every pitch period .Autocorrelation function is one of technique
used to estimate pitch period.For unvoiced frame white noise is used to represent it and pitch
period of T=0 is transmitted.
1.2 Discrete Cosine Transform (DCT):
DCT forming a periodic,symmetric sequences from finite length sequence in such a way that
original finite length sequence can be uniquely recovered.It can be used for speech
compression because of high similarities in adjacent coefficient.DCT is similar to DFT but
containing only the real part of DFT.
In speech processing DCT
The 1D DCT is
Y(k)=w(k)∑ ‫ݔ‬ሺ݊ሻ ∗ cos ሺ
గ∗ሺଶ௡ିଵሻ∗ሺ௞ିଵሻ
ଶே
ሻே
௡ୀଵ
K=1,2,3......N
Where w(k)=
ଵ
√ே
k=1
=ට
ଶ
ே
2<=k<=N
N is the length of x
X and y are of same size
For reconstruction very few DCT coefficient are required.
x(n)=ට
ଶ
ே
*∑ ‫ݓ‬ሺ݇ሻ ∗ ‫ݔ‬ሺ݇ሻ ∗ ܿ‫ݏ݋‬
ሺଶ௡ାଵሻ௡గ
ଶே
ேିଵ
௞ୀ଴
1.3 Discrete Wavelet Transform (DWT):
DWT is special property of wavelet transform that provide a compact representation of
signal in time and frequency domain.DWT decomposes the signal into the too many function
by using property of translation and dilation of single function called as a mother wavelet.
߮s, ߬ =
ଵ
√௦
∗ ߮ሺ
௧ିఛ
௦
ሻ
Where s is scaling parameter
߬is translation parameter
DWT of signal s(k) is defined as
DWT(m,n)=2
ି௠
ଶൗ *∑ ‫ݏ‬ሺ݇ሻ ∗ ߮ሺ2ି௠
݇ − ݊ሻ௞
DWT is sub band coding based technique.
In DWT signal which is to be analysed is first passing through filter bank followed by
decimation operation .This filter bank consist of LPF and HPF at each decomposition stages.
LPF O/P is called approximate component
HPF O/P is called detail component
Working of DWT is as shown in figure 1.1
NOVATEUR PUBLICATIONS
INTERNATIONAL JOURNAL OF INNOVATIONS IN ENGINEERING RESEARCH AND TECHNOLOGY [IJIERT]
ISSN: 2394-3696
VOLUME 2, ISSUE 12, DEC.-2015
3 | P a g e
Figure 1.1 Three-level wavelet decomposition trees
1.4 Discrete Wavelet Packet Transform:
In this signal is split into approximate and detail coefficient then both the coefficient is then
itself split into second level approximate and detail coefficient and process is repeated ,as
shown in Figure 1.2
Figure 1.2 Level 3 Decomposition using Wavelet Packet Transform
It gives more than 22n-1
different ways to encode the signal.
The wavelet have several families, they are Haar, Daubechies, Symlet, Coiflet,
Biorthogonal, Reverse Biorthogonal, Meyer wavelet, Gaussian, complex Gaussian, Maxican
Hat, Morlet, Complex Morlet, Ballet Lamarie.
1.5 Psychoacoustic Model:
It is based on study of human perception.The average human hearing of all frequency is not
same. Psychoacoustic Model is made up of two principal human auditory system
properties,they are auditory masking and hearing absolute threshold.It uses the concept that
some informationin signal is not necessary for our interpretation of sound,thus they can be
removed.The speech signal contains lots of frequency many of whom the human ear can’t
hear .By removing these frequency from the signal,the information load gets reduced without
effecting our impression of signal.
1.5.1 Frequency Masking:
It occurs when frequency we able to hear normally is masked by nearby frequency. The ear is
unable to simply distinguish frequency close to each other .The masked frequency can be
removed.
NOVATEUR PUBLICATIONS
INTERNATIONAL JOURNAL OF INNOVATIONS IN ENGINEERING RESEARCH AND TECHNOLOGY [IJIERT]
ISSN: 2394-3696
VOLUME 2, ISSUE 12, DEC.-2015
4 | P a g e
1.5.2 Temporal Masking:
When weak frequency is preceded by a strong frequency in time domain,that is frequency
with low energy close to a frequency with high energy,the sound associated with weak
frequency is unable to hear if time interval between frequencies is short.This is called
temporal masking.By removing all frequency that are masked,the ones with low energy the
information amount is minimized.
CONCLUSION
From review of speech compression techniques, it is observed that, the greatest advantage of
wavelet over other techniques is that the compression factor is not constant and it can be
varied while most other techniques have fixed compression factor. DWT significantly
improves the reconstruction of the compressed speech signal and also yields higher
compression factor.
REFERENCES
[1]. Jing Pang, Shitalben Chauhan, Jay Mahesh kumar Bhlodia,” Speech Compression FPGA
Design By Using Different Discrete Wavelet TransformSchemes”, in Advances in Electrical
and Electronics Engineering - IAENG Special Edition of the World Congress on Engineering
and Computer Science 2008.
[2]. Shijo M Joseph , Babu Anto P,” SPEECH COMPRESSION USING WAVELET
TRANSFORM”, in IEEE-International Conference on Recent Trends in Information
Technology, ICRTIT 2011 ,MIT, Anna University, Chennai. June 3-5, 2011.
[3]. Firoz Shah A, Babu Anto P,” Spoken Digit Compression: A Comparative Study between
Discrete Wavelet Transforms and Linear Predictive Coding”, in International Journal of
Computer Applications (0975 – 8887) Volume 6– No.6, September 2010.
[4]. Jithin James,Vinod J Thomas,”A Comparative Study of Speech Compression using
Different Transform Techniques”, in International Journal of Computer Applications (0975 –
8887) Volume 97– No.2, July 2014.
[5]. Harmanpreet kaur , Ramanpreet kaur,” Speech compression and decompression using
DWT and DCT” in Harmanpreet Kaur et al ,Int.J.Computer Technology &Applications,Vol 3
(4), 1501-1503.

More Related Content

PPT
Digital Audio Broadcasting By SAIKIRAN PANJALA
PDF
Wireless communication theodore rappaport
PPTX
Audio compression
PPTX
Rise Time Budget Analysis and Design of Components
PPTX
Digital Hearing aid
PPTX
M ary psk modulation
PPTX
Spread spectrum
PPT
Earth Station Subsystem
Digital Audio Broadcasting By SAIKIRAN PANJALA
Wireless communication theodore rappaport
Audio compression
Rise Time Budget Analysis and Design of Components
Digital Hearing aid
M ary psk modulation
Spread spectrum
Earth Station Subsystem

What's hot (20)

PDF
PPTX
Adaptive equalization
PPTX
Jammers&amp;anti jammers
PDF
digital audio broadcasting
PPTX
Mac protocols for ad hoc wireless networks
PPTX
Concept of Diversity & Fading (wireless communication)
PPTX
Cognitive radio networks
PPT
Parameters of multipath channel
PPTX
PPTX
CELLULAR MOBILE RADIO SYSTEMS
PPTX
IS-95 Cdma
PPTX
Acoustic echo cancellation
PPTX
Mimo in Wireless Communication
PPTX
Multimedia communication networks
PPT
Speech encoding techniques
PPT
WSN IN IOT
PPT
Monopulse tracking radar
PDF
Multiple Access
PPTX
Chap 4 (large scale propagation)
Adaptive equalization
Jammers&amp;anti jammers
digital audio broadcasting
Mac protocols for ad hoc wireless networks
Concept of Diversity & Fading (wireless communication)
Cognitive radio networks
Parameters of multipath channel
CELLULAR MOBILE RADIO SYSTEMS
IS-95 Cdma
Acoustic echo cancellation
Mimo in Wireless Communication
Multimedia communication networks
Speech encoding techniques
WSN IN IOT
Monopulse tracking radar
Multiple Access
Chap 4 (large scale propagation)
Ad

Similar to SPEECH COMPRESSION TECHNIQUES: A REVIEW (20)

PDF
Speech Compression Using Wavelets
PPTX
slideshare.pptx
PPTX
slideshare.pptx
PDF
Analysis of PEAQ Model using Wavelet Decomposition Techniques
PDF
H010234144
PDF
Data Compression using Multiple Transformation Techniques for Audio Applicati...
PDF
N017657985
PDF
Speech compression analysis using matlab
PDF
Speech compression analysis using matlab
PPTX
discrete cosine transform presentation.pptx
PDF
Mk3422222228
PDF
H0144952
PDF
Bg044357364
PDF
40120140505005
PDF
40120140505005
PDF
40120140505005 2
PDF
Effect of Singular Value Decomposition Based Processing on Speech Perception
PDF
Effect of Singular Value Decomposition Based Processing on Speech Perception
PDF
Iaetsd wavelet transform based latency optimized image compression for
PDF
Compressive speech enhancement using semi-soft thresholding and improved thre...
Speech Compression Using Wavelets
slideshare.pptx
slideshare.pptx
Analysis of PEAQ Model using Wavelet Decomposition Techniques
H010234144
Data Compression using Multiple Transformation Techniques for Audio Applicati...
N017657985
Speech compression analysis using matlab
Speech compression analysis using matlab
discrete cosine transform presentation.pptx
Mk3422222228
H0144952
Bg044357364
40120140505005
40120140505005
40120140505005 2
Effect of Singular Value Decomposition Based Processing on Speech Perception
Effect of Singular Value Decomposition Based Processing on Speech Perception
Iaetsd wavelet transform based latency optimized image compression for
Compressive speech enhancement using semi-soft thresholding and improved thre...
Ad

More from ijiert bestjournal (20)

PDF
CRACKS IN STEEL CASTING FOR VOLUTE CASING OF A PUMP
PDF
A COMPARATIVE STUDY OF DESIGN OF SIMPLE SPUR GEAR TRAIN AND HELICAL GEAR TRAI...
PDF
COMPARATIVE ANALYSIS OF CONVENTIONAL LEAF SPRING AND COMPOSITE LEAF
PDF
POWER GENERATION BY DIFFUSER AUGMENTED WIND TURBINE
PDF
FINITE ELEMENT ANALYSIS OF CONNECTING ROD OF MG-ALLOY
PDF
REVIEW ON CRITICAL SPEED IMPROVEMENT IN SINGLE CYLINDER ENGINE VALVE TRAIN
PDF
ENERGY CONVERSION PHENOMENON IN IMPLEMENTATION OF WATER LIFTING BY USING PEND...
PDF
SCUDERI SPLIT CYCLE ENGINE: REVOLUTIONARY TECHNOLOGY & EVOLUTIONARY DESIGN RE...
PDF
EXPERIMENTAL EVALUATION OF TEMPERATURE DISTRIBUTION IN JOURNAL BEARING OPERAT...
PDF
STUDY OF SOLAR THERMAL CAVITY RECEIVER FOR PARABOLIC CONCENTRATING COLLECTOR
PDF
DESIGN, OPTIMIZATION AND FINITE ELEMENT ANALYSIS OF CRANKSHAFT
PDF
ELECTRO CHEMICAL MACHINING AND ELECTRICAL DISCHARGE MACHINING PROCESSES MICRO...
PDF
HEAT TRANSFER ENHANCEMENT BY USING NANOFLUID JET IMPINGEMENT
PDF
MODIFICATION AND OPTIMIZATION IN STEEL SANDWICH PANELS USING ANSYS WORKBENCH
PDF
IMPACT ANALYSIS OF ALUMINUM HONEYCOMB SANDWICH PANEL BUMPER BEAM: A REVIEW
PDF
DESIGN OF WELDING FIXTURES AND POSITIONERS
PDF
ADVANCED TRANSIENT THERMAL AND STRUCTURAL ANALYSIS OF DISC BRAKE BY USING ANS...
PDF
REVIEW ON MECHANICAL PROPERTIES OF NON-ASBESTOS COMPOSITE MATERIAL USED IN BR...
PDF
PERFORMANCE EVALUATION OF TRIBOLOGICAL PROPERTIES OF COTTON SEED OIL FOR MULT...
PDF
MAGNETIC ABRASIVE FINISHING
CRACKS IN STEEL CASTING FOR VOLUTE CASING OF A PUMP
A COMPARATIVE STUDY OF DESIGN OF SIMPLE SPUR GEAR TRAIN AND HELICAL GEAR TRAI...
COMPARATIVE ANALYSIS OF CONVENTIONAL LEAF SPRING AND COMPOSITE LEAF
POWER GENERATION BY DIFFUSER AUGMENTED WIND TURBINE
FINITE ELEMENT ANALYSIS OF CONNECTING ROD OF MG-ALLOY
REVIEW ON CRITICAL SPEED IMPROVEMENT IN SINGLE CYLINDER ENGINE VALVE TRAIN
ENERGY CONVERSION PHENOMENON IN IMPLEMENTATION OF WATER LIFTING BY USING PEND...
SCUDERI SPLIT CYCLE ENGINE: REVOLUTIONARY TECHNOLOGY & EVOLUTIONARY DESIGN RE...
EXPERIMENTAL EVALUATION OF TEMPERATURE DISTRIBUTION IN JOURNAL BEARING OPERAT...
STUDY OF SOLAR THERMAL CAVITY RECEIVER FOR PARABOLIC CONCENTRATING COLLECTOR
DESIGN, OPTIMIZATION AND FINITE ELEMENT ANALYSIS OF CRANKSHAFT
ELECTRO CHEMICAL MACHINING AND ELECTRICAL DISCHARGE MACHINING PROCESSES MICRO...
HEAT TRANSFER ENHANCEMENT BY USING NANOFLUID JET IMPINGEMENT
MODIFICATION AND OPTIMIZATION IN STEEL SANDWICH PANELS USING ANSYS WORKBENCH
IMPACT ANALYSIS OF ALUMINUM HONEYCOMB SANDWICH PANEL BUMPER BEAM: A REVIEW
DESIGN OF WELDING FIXTURES AND POSITIONERS
ADVANCED TRANSIENT THERMAL AND STRUCTURAL ANALYSIS OF DISC BRAKE BY USING ANS...
REVIEW ON MECHANICAL PROPERTIES OF NON-ASBESTOS COMPOSITE MATERIAL USED IN BR...
PERFORMANCE EVALUATION OF TRIBOLOGICAL PROPERTIES OF COTTON SEED OIL FOR MULT...
MAGNETIC ABRASIVE FINISHING

Recently uploaded (20)

PPTX
CYBER-CRIMES AND SECURITY A guide to understanding
PDF
TFEC-4-2020-Design-Guide-for-Timber-Roof-Trusses.pdf
PDF
Evaluating the Democratization of the Turkish Armed Forces from a Normative P...
PPTX
Safety Seminar civil to be ensured for safe working.
PPTX
FINAL REVIEW FOR COPD DIANOSIS FOR PULMONARY DISEASE.pptx
PPTX
Artificial Intelligence
PDF
Well-logging-methods_new................
PPT
Project quality management in manufacturing
PDF
The CXO Playbook 2025 – Future-Ready Strategies for C-Suite Leaders Cerebrai...
PDF
Mohammad Mahdi Farshadian CV - Prospective PhD Student 2026
PDF
Enhancing Cyber Defense Against Zero-Day Attacks using Ensemble Neural Networks
PDF
Human-AI Collaboration: Balancing Agentic AI and Autonomy in Hybrid Systems
PPTX
MET 305 2019 SCHEME MODULE 2 COMPLETE.pptx
PDF
BIO-INSPIRED HORMONAL MODULATION AND ADAPTIVE ORCHESTRATION IN S-AI-GPT
PDF
R24 SURVEYING LAB MANUAL for civil enggi
PPTX
OOP with Java - Java Introduction (Basics)
PPTX
Sustainable Sites - Green Building Construction
PPT
Mechanical Engineering MATERIALS Selection
PPTX
Foundation to blockchain - A guide to Blockchain Tech
PPTX
Fundamentals of safety and accident prevention -final (1).pptx
CYBER-CRIMES AND SECURITY A guide to understanding
TFEC-4-2020-Design-Guide-for-Timber-Roof-Trusses.pdf
Evaluating the Democratization of the Turkish Armed Forces from a Normative P...
Safety Seminar civil to be ensured for safe working.
FINAL REVIEW FOR COPD DIANOSIS FOR PULMONARY DISEASE.pptx
Artificial Intelligence
Well-logging-methods_new................
Project quality management in manufacturing
The CXO Playbook 2025 – Future-Ready Strategies for C-Suite Leaders Cerebrai...
Mohammad Mahdi Farshadian CV - Prospective PhD Student 2026
Enhancing Cyber Defense Against Zero-Day Attacks using Ensemble Neural Networks
Human-AI Collaboration: Balancing Agentic AI and Autonomy in Hybrid Systems
MET 305 2019 SCHEME MODULE 2 COMPLETE.pptx
BIO-INSPIRED HORMONAL MODULATION AND ADAPTIVE ORCHESTRATION IN S-AI-GPT
R24 SURVEYING LAB MANUAL for civil enggi
OOP with Java - Java Introduction (Basics)
Sustainable Sites - Green Building Construction
Mechanical Engineering MATERIALS Selection
Foundation to blockchain - A guide to Blockchain Tech
Fundamentals of safety and accident prevention -final (1).pptx

SPEECH COMPRESSION TECHNIQUES: A REVIEW

  • 1. NOVATEUR PUBLICATIONS INTERNATIONAL JOURNAL OF INNOVATIONS IN ENGINEERING RESEARCH AND TECHNOLOGY [IJIERT] ISSN: 2394-3696 VOLUME 2, ISSUE 12, DEC.-2015 1 | P a g e SPEECH COMPRESSION TECHNIQUES: A REVIEW Pujari Bhavana C. ME Student Amrutvahini College of engineering,Sangamner ABSTRACT Speech is the vocalizer form of human communication,and based upon the syntactic combination of lexical and vocabularies. The aim of speech coding is to compress the speech signal to the highest possible compression ratio but maintaining user acceptability.There are many methods for speech compression like Linear Predictive coding (LPC) , Code Excited Linear Predictive coding (CELP) ,Sub-band coding ,Transform coding :- Fast Fourier Transform (FFT) ,Discrete Cosine Transform (DCT) , Continuous Wavelet Transform (CWT) , Discrete Wavelet Transform (DWT) ,Variance Fractal Compression (VFC) , Discrete Cosine Transform (DCT),Psychoacoustics andetc. Few of them are discus in this paper. KEYWORDS: Compression, LPC,DWT,DCT. INTRODUCTION Speech compression is nothing but reduction of number of bits needed to represent the signal used for storage purpose and transmission. The ideal goal of speech compression is to contain original information in as minimum bits as possible.The reasons for compressing the signal is Cost of disk, Cost of data management,Memory,Bandwidth and transfer speed.There are two basic types of compression lossy and lossless. LOSSLESS COMPRESSION:- In this type of compression signal after compression is same as before,no information has been loosed i.e. the original signal can be perfectly recovered from the compressed signal. It is mainly used in application where it is necessary that the original signal and the de- compressed signal are almost same. Examples: Entropy Encoding (Shannon-Fano Algorithm, Huffmann coding, Arithmetic Coding) Run-length, Lempel Ziv Welch (LZW) Algorithm. Lossy compression:-In this type of compression,some degree of information has been lossed. The original signal cannot be perfectly recovered from the compressed signal,but it gives its best possible quality for the given technique. Lossy compression typically attain far better compression than lossless by discarding less-critical data. Theaim of this technique is to minimize the amount of data that has to be transmitted.They are mostly used for multimedia data compression. Ex: FFT,DCT,DWT. 1.1 Linear Predictive coding (LPC): LPC is most commonly used in speech coding due to effectiveness of LPC coefficient in modelling vocal tract associated with speech production.LPC is used to estimate basic speech
  • 2. NOVATEUR PUBLICATIONS INTERNATIONAL JOURNAL OF INNOVATIONS IN ENGINEERING RESEARCH AND TECHNOLOGY [IJIERT] ISSN: 2394-3696 VOLUME 2, ISSUE 12, DEC.-2015 2 | P a g e parameters like pitch formant and spectra.The principle behind the use of LPC is to minimize LPC coefficient .This LPC coefficient is estimated in energy frame size of 20ms long. LPC analysis of each frame involves decision making process of concluding if sound is voiced or unvoiced.If sound is decided to be voiced,on impulse train is used to represent it with non zero taps occurring every pitch period .Autocorrelation function is one of technique used to estimate pitch period.For unvoiced frame white noise is used to represent it and pitch period of T=0 is transmitted. 1.2 Discrete Cosine Transform (DCT): DCT forming a periodic,symmetric sequences from finite length sequence in such a way that original finite length sequence can be uniquely recovered.It can be used for speech compression because of high similarities in adjacent coefficient.DCT is similar to DFT but containing only the real part of DFT. In speech processing DCT The 1D DCT is Y(k)=w(k)∑ ‫ݔ‬ሺ݊ሻ ∗ cos ሺ గ∗ሺଶ௡ିଵሻ∗ሺ௞ିଵሻ ଶே ሻே ௡ୀଵ K=1,2,3......N Where w(k)= ଵ √ே k=1 =ට ଶ ே 2<=k<=N N is the length of x X and y are of same size For reconstruction very few DCT coefficient are required. x(n)=ට ଶ ே *∑ ‫ݓ‬ሺ݇ሻ ∗ ‫ݔ‬ሺ݇ሻ ∗ ܿ‫ݏ݋‬ ሺଶ௡ାଵሻ௡గ ଶே ேିଵ ௞ୀ଴ 1.3 Discrete Wavelet Transform (DWT): DWT is special property of wavelet transform that provide a compact representation of signal in time and frequency domain.DWT decomposes the signal into the too many function by using property of translation and dilation of single function called as a mother wavelet. ߮s, ߬ = ଵ √௦ ∗ ߮ሺ ௧ିఛ ௦ ሻ Where s is scaling parameter ߬is translation parameter DWT of signal s(k) is defined as DWT(m,n)=2 ି௠ ଶൗ *∑ ‫ݏ‬ሺ݇ሻ ∗ ߮ሺ2ି௠ ݇ − ݊ሻ௞ DWT is sub band coding based technique. In DWT signal which is to be analysed is first passing through filter bank followed by decimation operation .This filter bank consist of LPF and HPF at each decomposition stages. LPF O/P is called approximate component HPF O/P is called detail component Working of DWT is as shown in figure 1.1
  • 3. NOVATEUR PUBLICATIONS INTERNATIONAL JOURNAL OF INNOVATIONS IN ENGINEERING RESEARCH AND TECHNOLOGY [IJIERT] ISSN: 2394-3696 VOLUME 2, ISSUE 12, DEC.-2015 3 | P a g e Figure 1.1 Three-level wavelet decomposition trees 1.4 Discrete Wavelet Packet Transform: In this signal is split into approximate and detail coefficient then both the coefficient is then itself split into second level approximate and detail coefficient and process is repeated ,as shown in Figure 1.2 Figure 1.2 Level 3 Decomposition using Wavelet Packet Transform It gives more than 22n-1 different ways to encode the signal. The wavelet have several families, they are Haar, Daubechies, Symlet, Coiflet, Biorthogonal, Reverse Biorthogonal, Meyer wavelet, Gaussian, complex Gaussian, Maxican Hat, Morlet, Complex Morlet, Ballet Lamarie. 1.5 Psychoacoustic Model: It is based on study of human perception.The average human hearing of all frequency is not same. Psychoacoustic Model is made up of two principal human auditory system properties,they are auditory masking and hearing absolute threshold.It uses the concept that some informationin signal is not necessary for our interpretation of sound,thus they can be removed.The speech signal contains lots of frequency many of whom the human ear can’t hear .By removing these frequency from the signal,the information load gets reduced without effecting our impression of signal. 1.5.1 Frequency Masking: It occurs when frequency we able to hear normally is masked by nearby frequency. The ear is unable to simply distinguish frequency close to each other .The masked frequency can be removed.
  • 4. NOVATEUR PUBLICATIONS INTERNATIONAL JOURNAL OF INNOVATIONS IN ENGINEERING RESEARCH AND TECHNOLOGY [IJIERT] ISSN: 2394-3696 VOLUME 2, ISSUE 12, DEC.-2015 4 | P a g e 1.5.2 Temporal Masking: When weak frequency is preceded by a strong frequency in time domain,that is frequency with low energy close to a frequency with high energy,the sound associated with weak frequency is unable to hear if time interval between frequencies is short.This is called temporal masking.By removing all frequency that are masked,the ones with low energy the information amount is minimized. CONCLUSION From review of speech compression techniques, it is observed that, the greatest advantage of wavelet over other techniques is that the compression factor is not constant and it can be varied while most other techniques have fixed compression factor. DWT significantly improves the reconstruction of the compressed speech signal and also yields higher compression factor. REFERENCES [1]. Jing Pang, Shitalben Chauhan, Jay Mahesh kumar Bhlodia,” Speech Compression FPGA Design By Using Different Discrete Wavelet TransformSchemes”, in Advances in Electrical and Electronics Engineering - IAENG Special Edition of the World Congress on Engineering and Computer Science 2008. [2]. Shijo M Joseph , Babu Anto P,” SPEECH COMPRESSION USING WAVELET TRANSFORM”, in IEEE-International Conference on Recent Trends in Information Technology, ICRTIT 2011 ,MIT, Anna University, Chennai. June 3-5, 2011. [3]. Firoz Shah A, Babu Anto P,” Spoken Digit Compression: A Comparative Study between Discrete Wavelet Transforms and Linear Predictive Coding”, in International Journal of Computer Applications (0975 – 8887) Volume 6– No.6, September 2010. [4]. Jithin James,Vinod J Thomas,”A Comparative Study of Speech Compression using Different Transform Techniques”, in International Journal of Computer Applications (0975 – 8887) Volume 97– No.2, July 2014. [5]. Harmanpreet kaur , Ramanpreet kaur,” Speech compression and decompression using DWT and DCT” in Harmanpreet Kaur et al ,Int.J.Computer Technology &Applications,Vol 3 (4), 1501-1503.