SlideShare a Scribd company logo
Speech /Audio Coding Standard
LPC-10
By
Sonawane Swapnil R
511006
Sub.: Speech
DEP-E&TC
VIIT Pune
1
What is Speech Coding?
• “Speech coding" = finding a representation of
speech which can be transmitted efficiently
through a digital channel.
• It is usually lossy coding, meaning that the
waveform can not be completely reproduced
by the decoder, instead, only the information
which is useful to a human listener is retained.
2
Coding Algorithm
• ADPCM: Adaptive Differential Pulse Code
Modulation
• LPC-10: LPC Vocoder with 10 coefficients
• CELP: Code Excited LPC
• RPE-LTP: Regular Pulse Excited LPC with Long
Term Prediction
• VSELP: Vector Sum Excited LPC
• IMBE: Improved Multi-Band Excitation
3
LPC – 10/ FS-1015 :-
• BRIEF HISTORY :-
– Is a secure telephony speech encoding standard developed by
the United States Department of Defense and later by NATO. The
standard was finished 1984.
– Algorithm uses linear predictive coding vocoder.
– The vocoder enables understandable speech, but the quality is
very unnatural and synthetic.
4
PROPERTIES :-
• 10 LP(linear predictor) coefficients are used.
• Bandwidth: 2.4kbps
• Samples/frame : 180 samples
• Bits/frame: 54 bits
• Frame Size: 22.5ms = 44.44 frames/sec
5
ANALYSIS :-
6
Conti..
• Analysis process extract from the speech signal the parameters
required to model it.
• First parameter :- Type of speech signal (voiced or unvoiced).
• The result is a voicing indicator,
• When voiced, its period has to be estimated in order to reflect its
height. This period, called as pitch .
7
1 Voiced segment
0 Unvoiced segment
Conti..
• The result of this analysis is a set of ten reflection coefficients
(hence the name LPC 10) which sufficiently & faithfully describe the
cross-sectional variations in the vocal tract.
• Finally, for each frame, the level of the speech signal is evaluated in
order to control the gain of the synthesizer on the synthesis side.
8
SYNTHESIS:-
9
Conti..
• The algorithms employed to synthesize the speech signal reflect the
assumed speech production model.
• They include, in succession:
– a noise generator, used for unvoiced sounds;
– a periodic signal generator, to which the pitch is provided, for voiced
sounds;
– a switch allowing selection of either generator according to the type of
speech signal to be produced in the current frame;
10
Conti..
– a filter of order 10, which filters the excitation selected; it is at this
level that the distinction between the different vowels and the
different consonants is made;
– a gain control system, which gives the synthetic signal the right
volume;
– optionally, a “post-filtering” system, designed to mask certain
imperfections in the synthesizer and to make the synthesized signal
more pleasant to the human ear.
11
VOCODER AT 2,400 BIT/S:-
12
LP Coefficients Pitch Voicing Energy
0 41 48 53
- The remaining 1 bit is for synchronization
SPEECH CODER COMPARISON:-
13
APPLICATIONS
• Digital telephony
• Satellite bradcasting
• Radio communications with secure voice
transmissions
14
THANKU
15

More Related Content

PPTX
Linear Predictive Coding
DOCX
Linear predictive coding documentation
PPTX
Linear Predictive Coding
PPT
Speech coding techniques
PDF
SPEECH CODING
PPTX
Speech coding standards2
PPT
Speech encoding techniques
PPT
Speech compression-using-gsm
Linear Predictive Coding
Linear predictive coding documentation
Linear Predictive Coding
Speech coding techniques
SPEECH CODING
Speech coding standards2
Speech encoding techniques
Speech compression-using-gsm

What's hot (20)

PPTX
lpc and horn noise detection
PDF
Speech Analysis and synthesis using Vocoder
PPT
Speech technology basics
PPTX
Speech coding techniques
PPT
Basics of speech coding
PPTX
adaptive multirate speech coding
PPT
3a. Speech Coders
PPTX
Speech Compression using LPC
PDF
Interactive voice conversion for augmented speech production
PDF
Speech Compression using LPC
PPTX
Digital speech processing lecture1
PPT
Multimedia Compression and Communication
PDF
Communication Networks II
PPTX
Applications of information theory in communication engineering
PDF
Loudness and Metadata and Codecs (c) DOLBY
PDF
DSP_FOEHU - Lec 13 - Digital Signal Processing Applications I
PDF
Introductory Lecture to Audio Signal Processing
PDF
[NUGU CONFERENCE 2019] 트랙 A-2 : NUGU call 적용 기술 및 서비스 소개
PDF
Finalreport
lpc and horn noise detection
Speech Analysis and synthesis using Vocoder
Speech technology basics
Speech coding techniques
Basics of speech coding
adaptive multirate speech coding
3a. Speech Coders
Speech Compression using LPC
Interactive voice conversion for augmented speech production
Speech Compression using LPC
Digital speech processing lecture1
Multimedia Compression and Communication
Communication Networks II
Applications of information theory in communication engineering
Loudness and Metadata and Codecs (c) DOLBY
DSP_FOEHU - Lec 13 - Digital Signal Processing Applications I
Introductory Lecture to Audio Signal Processing
[NUGU CONFERENCE 2019] 트랙 A-2 : NUGU call 적용 기술 및 서비스 소개
Finalreport
Ad

Viewers also liked (20)

PPT
PPT
Adaptive multi rate (amr) document
PDF
Bluetooth Summer Gift Guide
PDF
Bluetooth wireless technology basics
PPTX
PPTX
Bluetooth Wireless Technology
PPT
Code Division Multiple Access
PPTX
PPT
Wcdma channels
PPTX
Presentation on fhss
PPTX
Equalization
PPT
Spread spectrum modulation
PPTX
Frequency hopping spread spectrum
PDF
3 handoff management
PPTX
ALOHA Protocol (in detail)
PPTX
PDF
Handoff management
PDF
WLAN - IEEE 802.11
PPT
Aloha
Adaptive multi rate (amr) document
Bluetooth Summer Gift Guide
Bluetooth wireless technology basics
Bluetooth Wireless Technology
Code Division Multiple Access
Wcdma channels
Presentation on fhss
Equalization
Spread spectrum modulation
Frequency hopping spread spectrum
3 handoff management
ALOHA Protocol (in detail)
Handoff management
WLAN - IEEE 802.11
Aloha
Ad

Similar to Speech coding std (20)

PDF
DDSP_2018_FOEHU - Lec 10 - Digital Signal Processing Applications
PPTX
Wireless and mobile communication final year AKTU (KEC-076) Unit-2 Lecture-01...
PPT
Audio and video compression
PPT
Module-4.ppt of mmc which is multi media communication
PPTX
Harmonic speech coding
PDF
A Distributed System for Recognizing Home Automation Commands and Distress Ca...
PDF
G010424248
PPTX
COLEA : A MATLAB Tool for Speech Analysis
DOC
Lpc vocoder implemented by using matlab
PDF
Single Frequency Networks for FM Broadcast (SFNs)
PPTX
Introduction to the spectrum analyzer
PDF
IJCER (www.ijceronline.com) International Journal of computational Engineerin...
PDF
Audio Essentials for Broadcast and Multiscreen
PPTX
How to play audio from a microcontroller
PDF
Spherator FM VST VST3 Audio Unit: 4 Operator Frequency Modulation Synthesizer...
PPTX
Homomorphic speech processing
PDF
Mine detecting robot
PDF
H0814247
PPTX
spectrum analyzers ppt
PPTX
Final presentation
DDSP_2018_FOEHU - Lec 10 - Digital Signal Processing Applications
Wireless and mobile communication final year AKTU (KEC-076) Unit-2 Lecture-01...
Audio and video compression
Module-4.ppt of mmc which is multi media communication
Harmonic speech coding
A Distributed System for Recognizing Home Automation Commands and Distress Ca...
G010424248
COLEA : A MATLAB Tool for Speech Analysis
Lpc vocoder implemented by using matlab
Single Frequency Networks for FM Broadcast (SFNs)
Introduction to the spectrum analyzer
IJCER (www.ijceronline.com) International Journal of computational Engineerin...
Audio Essentials for Broadcast and Multiscreen
How to play audio from a microcontroller
Spherator FM VST VST3 Audio Unit: 4 Operator Frequency Modulation Synthesizer...
Homomorphic speech processing
Mine detecting robot
H0814247
spectrum analyzers ppt
Final presentation

Recently uploaded (20)

PDF
PRIZ Academy - 9 Windows Thinking Where to Invest Today to Win Tomorrow.pdf
PPTX
Recipes for Real Time Voice AI WebRTC, SLMs and Open Source Software.pptx
PDF
composite construction of structures.pdf
PDF
Evaluating the Democratization of the Turkish Armed Forces from a Normative P...
PPTX
CH1 Production IntroductoryConcepts.pptx
PPTX
Construction Project Organization Group 2.pptx
PPTX
Engineering Ethics, Safety and Environment [Autosaved] (1).pptx
PPT
Project quality management in manufacturing
PDF
TFEC-4-2020-Design-Guide-for-Timber-Roof-Trusses.pdf
DOCX
573137875-Attendance-Management-System-original
PPTX
Sustainable Sites - Green Building Construction
PPTX
MET 305 2019 SCHEME MODULE 2 COMPLETE.pptx
PPTX
IOT PPTs Week 10 Lecture Material.pptx of NPTEL Smart Cities contd
PDF
Model Code of Practice - Construction Work - 21102022 .pdf
PPTX
Internet of Things (IOT) - A guide to understanding
PPTX
CYBER-CRIMES AND SECURITY A guide to understanding
PDF
SM_6th-Sem__Cse_Internet-of-Things.pdf IOT
PDF
R24 SURVEYING LAB MANUAL for civil enggi
PPTX
UNIT 4 Total Quality Management .pptx
PPTX
web development for engineering and engineering
PRIZ Academy - 9 Windows Thinking Where to Invest Today to Win Tomorrow.pdf
Recipes for Real Time Voice AI WebRTC, SLMs and Open Source Software.pptx
composite construction of structures.pdf
Evaluating the Democratization of the Turkish Armed Forces from a Normative P...
CH1 Production IntroductoryConcepts.pptx
Construction Project Organization Group 2.pptx
Engineering Ethics, Safety and Environment [Autosaved] (1).pptx
Project quality management in manufacturing
TFEC-4-2020-Design-Guide-for-Timber-Roof-Trusses.pdf
573137875-Attendance-Management-System-original
Sustainable Sites - Green Building Construction
MET 305 2019 SCHEME MODULE 2 COMPLETE.pptx
IOT PPTs Week 10 Lecture Material.pptx of NPTEL Smart Cities contd
Model Code of Practice - Construction Work - 21102022 .pdf
Internet of Things (IOT) - A guide to understanding
CYBER-CRIMES AND SECURITY A guide to understanding
SM_6th-Sem__Cse_Internet-of-Things.pdf IOT
R24 SURVEYING LAB MANUAL for civil enggi
UNIT 4 Total Quality Management .pptx
web development for engineering and engineering

Speech coding std

  • 1. Speech /Audio Coding Standard LPC-10 By Sonawane Swapnil R 511006 Sub.: Speech DEP-E&TC VIIT Pune 1
  • 2. What is Speech Coding? • “Speech coding" = finding a representation of speech which can be transmitted efficiently through a digital channel. • It is usually lossy coding, meaning that the waveform can not be completely reproduced by the decoder, instead, only the information which is useful to a human listener is retained. 2
  • 3. Coding Algorithm • ADPCM: Adaptive Differential Pulse Code Modulation • LPC-10: LPC Vocoder with 10 coefficients • CELP: Code Excited LPC • RPE-LTP: Regular Pulse Excited LPC with Long Term Prediction • VSELP: Vector Sum Excited LPC • IMBE: Improved Multi-Band Excitation 3
  • 4. LPC – 10/ FS-1015 :- • BRIEF HISTORY :- – Is a secure telephony speech encoding standard developed by the United States Department of Defense and later by NATO. The standard was finished 1984. – Algorithm uses linear predictive coding vocoder. – The vocoder enables understandable speech, but the quality is very unnatural and synthetic. 4
  • 5. PROPERTIES :- • 10 LP(linear predictor) coefficients are used. • Bandwidth: 2.4kbps • Samples/frame : 180 samples • Bits/frame: 54 bits • Frame Size: 22.5ms = 44.44 frames/sec 5
  • 7. Conti.. • Analysis process extract from the speech signal the parameters required to model it. • First parameter :- Type of speech signal (voiced or unvoiced). • The result is a voicing indicator, • When voiced, its period has to be estimated in order to reflect its height. This period, called as pitch . 7 1 Voiced segment 0 Unvoiced segment
  • 8. Conti.. • The result of this analysis is a set of ten reflection coefficients (hence the name LPC 10) which sufficiently & faithfully describe the cross-sectional variations in the vocal tract. • Finally, for each frame, the level of the speech signal is evaluated in order to control the gain of the synthesizer on the synthesis side. 8
  • 10. Conti.. • The algorithms employed to synthesize the speech signal reflect the assumed speech production model. • They include, in succession: – a noise generator, used for unvoiced sounds; – a periodic signal generator, to which the pitch is provided, for voiced sounds; – a switch allowing selection of either generator according to the type of speech signal to be produced in the current frame; 10
  • 11. Conti.. – a filter of order 10, which filters the excitation selected; it is at this level that the distinction between the different vowels and the different consonants is made; – a gain control system, which gives the synthetic signal the right volume; – optionally, a “post-filtering” system, designed to mask certain imperfections in the synthesizer and to make the synthesized signal more pleasant to the human ear. 11
  • 12. VOCODER AT 2,400 BIT/S:- 12 LP Coefficients Pitch Voicing Energy 0 41 48 53 - The remaining 1 bit is for synchronization
  • 14. APPLICATIONS • Digital telephony • Satellite bradcasting • Radio communications with secure voice transmissions 14

Editor's Notes

  • #4: Waveform Compression Coding ,, Parametric Compression Coding ,, Hybrid Compression Coding—Analysis-by-Synthesis
  • #5: Linear predictive coding (LPC) is a tool used mostly in audio signal processing and speech processing for representing the spectral envelope of a digital signal of speech in compressed form, using the information of a linear predictive model.[1] It is one of the most powerful speech analysis techniques, and one of the most useful methods for encoding good quality speech at a low bit rate and provides extremely accurate estimates of speech parameters.
  • #14: MOS (Mean Opinion Score The most widely used measure of quality is the Mean Opinion Score (MOS), which is the result of averaging opinion scores for a set of between 20 and 60 untrained subjects. Standards Organization ISO: International Standards Organization (http://www.iso.ch) ITU: International Telecomm unication Union (formerly CCITT) (http://www.itu.ch)