Base paper: - http://www.mirlabs.org/nagpur/paper16.pdf
Speech Compression using Linear Predictive Coding
International Journal of Emerging Technology and Advanced
Engineering
Abstract:
The aim of the project is to develop a system for encoding good-quality speech at a low bit rate. To
implement this, we used a powerful speech analysis technique called Linear Predictive Coding
(LPC). It uses the 10th-order Levinson-Durbin recursion algorithm to compute the prediction coefficients. LPC provides
extremely accurate estimates of speech parameters and is relatively efficient to compute.
Speech signals from male and female speakers were coded. The trade-offs between bit rate, end-to-end delay,
speech quality, and complexity were analyzed. The results show that the project succeeded in coding
the speech signal at relatively low bit rates with good quality.
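The Levinson-Durbin recursion named in the abstract can be sketched as follows. This is a minimal illustration, not the authors' implementation: the function names, the 180-sample synthetic frame, and the sign convention (the predictor is x[n] ≈ a[1]·x[n-1] + ... + a[p]·x[n-p]) are all assumptions made for the example.

```python
import math
import random


def autocorrelation(frame, max_lag):
    """Return autocorrelation values r[0..max_lag] of one frame."""
    n = len(frame)
    return [
        sum(frame[i] * frame[i + lag] for i in range(n - lag))
        for lag in range(max_lag + 1)
    ]


def levinson_durbin(r, order):
    """Solve the Toeplitz normal equations for the predictor coefficients.

    Returns (a, reflection, error) where a[k-1] multiplies x[n-k],
    reflection holds the reflection coefficients k_1..k_order, and
    error is the final prediction error energy.
    """
    a = [0.0] * (order + 1)  # a[0] is unused padding
    reflection = []
    err = r[0]
    for i in range(1, order + 1):
        if err <= 0.0:
            break  # signal is perfectly predictable or silent
        acc = r[i] - sum(a[j] * r[i - j] for j in range(1, i))
        k = acc / err
        reflection.append(k)
        updated = a[:]
        updated[i] = k
        for j in range(1, i):
            updated[j] = a[j] - k * a[i - j]
        a = updated
        err *= 1.0 - k * k
    return a[1:], reflection, err


# Hypothetical frame: two sinusoids plus a little noise (not real speech).
rng = random.Random(0)
frame = [
    math.sin(2 * math.pi * i / 20)
    + 0.5 * math.sin(2 * math.pi * i / 7)
    + 0.01 * rng.gauss(0.0, 1.0)
    for i in range(180)
]
r = autocorrelation(frame, 10)
coeffs, refl, residual_energy = levinson_durbin(r, 10)
print("10th-order coefficients:", [round(c, 4) for c in coeffs])
```

The recursion needs on the order of p² operations for order p, rather than the p³ of a general linear solve, which is consistent with the abstract's remark that the method is relatively efficient.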
(a) Block Diagram of an LPC Vocoder
(b) Mathematical Model of Speech Production
(c) Human vs. Voice Coder Speech Production
A: - Waveforms
B: - Spectrograms
Conclusion:
Linear predictive coding is an analysis/synthesis technique for lossy speech compression that
models the human production of sound instead of transmitting an estimate of the sound wave. Linear
predictive coding achieves a bit rate of 2400 bits/second, which makes it well suited to secure
telephone systems. Such systems are more concerned with preserving the content and meaning of
speech than with preserving its quality. The trade-off for LPC's low bit rate is that it
has difficulty with certain sounds and produces speech that sounds synthetic.
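The 2400 bits/second figure can be checked with a short calculation. The frame layout used here (54 bits per 180-sample frame at 8 kHz sampling, as in the LPC-10 vocoder) and the 8-bit PCM telephone baseline are assumptions brought in for illustration; the slides themselves state only the final bit rate.

```python
def bit_rate(bits_per_frame, frame_samples, sample_rate_hz):
    """Bits per second for a coder that emits one fixed-size frame at a time."""
    return bits_per_frame * sample_rate_hz / frame_samples


# Assumed LPC-10 style parameters (not given in the slides).
SAMPLE_RATE_HZ = 8000
FRAME_SAMPLES = 180   # 22.5 ms of speech per frame
BITS_PER_FRAME = 54

lpc_bps = bit_rate(BITS_PER_FRAME, FRAME_SAMPLES, SAMPLE_RATE_HZ)
pcm_bps = SAMPLE_RATE_HZ * 8  # assumed baseline: 8 bits per sample PCM
compression_ratio = pcm_bps / lpc_bps

print("LPC bit rate (bits/s):", lpc_bps)
print("PCM bit rate (bits/s):", pcm_bps)
print("Compression ratio:", round(compression_ratio, 2))
```

Under these assumptions the coder sends a few dozen parameters per frame instead of 180 waveform samples, which is where the saving comes from.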
Linear predictive coding encoders break a sound signal into segments and send
information on each segment to the decoder. The encoder sends whether the segment is
voiced or unvoiced and, for voiced segments, the pitch period; the decoder uses these to create an excitation signal.
The encoder also sends information about the vocal tract, which the decoder uses to build a filter.
Driven by the excitation signal, this filter reproduces an approximation of the original speech.
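The decoder side described above can be sketched in a few lines. This is a simplified illustration under stated assumptions, not the authors' decoder: the function names, the unit impulse train for voiced segments, the uniform noise for unvoiced segments, and the example coefficients are all hypothetical.

```python
import random


def make_excitation(voiced, pitch_period, length, rng=None):
    """Build the excitation signal for one segment.

    Voiced: an impulse every pitch_period samples.
    Unvoiced: white noise drawn uniformly from [-1, 1].
    """
    if voiced:
        return [1.0 if i % pitch_period == 0 else 0.0 for i in range(length)]
    rng = rng or random.Random(0)
    return [rng.uniform(-1.0, 1.0) for _ in range(length)]


def synthesize(excitation, coeffs, gain=1.0):
    """All-pole vocal tract filter.

    y[n] = gain * e[n] + sum over k of coeffs[k-1] * y[n-k]
    """
    out = []
    for n, e in enumerate(excitation):
        y = gain * e
        for k, a_k in enumerate(coeffs, start=1):
            if n - k >= 0:
                y += a_k * out[n - k]
        out.append(y)
    return out


# Hypothetical segment parameters as they might arrive from the encoder.
segment = {"voiced": True, "pitch_period": 40, "coeffs": [1.2, -0.6], "gain": 0.5}
excitation = make_excitation(segment["voiced"], segment["pitch_period"], 180)
speech = synthesize(excitation, segment["coeffs"], segment["gain"])
print("first samples:", [round(s, 4) for s in speech[:5]])
```

Because only the voicing flag, pitch period, gain, and filter coefficients cross the channel, the decoder's output matches the original in spectral shape rather than sample by sample, which is one reason the speech can sound synthetic.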
