“LPC Analysis of Kannada Syllables”
Under the Guidance of
Prof. K. Indira
Dept. of E&C, RIT, Bangalore
Presented by
Vinodkumar A G - 1MS20EC127
Sridhar B - 1MS20EC111
Vinayak M B - 1MS20EC125
Vinay Swastik H - 1MS20EC124
MAJOR PROJECT
Table of Contents
1. Introduction and overview of the project
2. Problem statement
3. Project objectives and scope
4. Literature survey
5. Methodology, Proposed Work and Preliminary Results
6. References
INTRODUCTION
• The project will involve implementing the LPC algorithm to model the vocal tract, extracting the
formant frequencies from the LPC model, and comparing the results with known formant frequencies.
• The estimation of formant frequencies is an important task in speech signal processing, as it provides
information about the spectral characteristics of the vocal tract. Linear Predictive Coding (LPC) is a
widely used technique for speech analysis and has been shown to be effective for estimating formant
frequencies.
• Overall, the goal of this project is to develop a practical understanding of LPC and its applications in
speech analysis and to gain insights into the spectral characteristics of the vocal tract.
PROBLEM STATEMENT
To estimate Formant frequencies of the Kannada
Vowels and Consonants using Linear Predictive
Coefficients and verify using Praat tool.
Project Objectives
 The main objective of this report is to estimate the formant frequencies of Kannada
vowels and consonants using Linear Predictive Coding (LPC) analysis.
 The formant frequencies of speech sounds provide information about the resonant
properties of the vocal tract, which are essential for understanding the acoustic properties
of speech sounds.
 The estimated formant frequencies can be used for further analysis and modeling of
Kannada speech signals.
 An additional objective may involve applying the estimated formant and pitch
frequencies in practical speech processing tasks. This could include applications such as
speech recognition, speech synthesis, or speaker identification specific to Kannada
language.
Literature Survey
1.Performance Analysis of Kannada Phonetics: Vowels, Fricatives and Stop
Consonants Using LP Spectrum.
-By Shivakumar M and Latha Mariswamy
• A dataset of Kannada speech samples is collected, including a variety of phonetic units such as vowels,
fricatives, and stop consonants, recorded from native Kannada speakers.
• LP spectrum analysis is performed on the collected speech samples to estimate the spectral envelope and
formant frequencies.
• The study presents the findings of the spectral analysis, highlighting the spectral characteristics, patterns,
and variations observed in Kannada vowels, fricatives, and stop consonants.
• The accuracy and effectiveness of the LP spectrum analysis in capturing the phonetic properties of Kannada
are evaluated by comparing the estimated spectral features with known phonetic characteristics.
2.Formants and LPC Analysis of Kannada Vowels Speech Signal
-By K. Indira, Sadashiva Chakrasali and Umesh Bilembagi
• The speech signal is down-sampled by a factor of 6 after passing through a low-pass filter, resulting
in a sampling frequency of 7350Hz.
• Pre-emphasis is applied to enhance the power of high frequency signals before LPC coefficients are
extracted using an autoregressive filter of varying order.
• The LPC filter is then used to obtain the LP residual, and frequency responses of LPC filters for
different orders are compared with the formants of corresponding vowels noted from a tool.
3.Extraction of Speech Pitch and Formant Frequencies using
Discrete Wavelet Transform.
-By Sajad Hamzenejadi, Seyed Amir Yousef Hosseini Goki and Mahdieh Ghazvini
• The paper proposes a method for estimating speech pitch and formant frequencies using Discrete
Wavelet
Transform.
• DWT is used to decompose the speech signal into sub-bands, and the pitch and formant frequencies are
estimated from each sub-band.
• The method is advantageous because it captures both time and frequency information, and is efficiently
implemented using filter banks.
• The proposed method is shown to outperform existing methods for pitch and formant frequency
estimation in terms of accuracy and robustness.
4.Formant Text to Speech Synthesis Using Artificial Neural Networks.
-By Gurinder Kaur and Parminder Singh
• The paper proposes a method for formant-based Text-to-Speech (TTS)
synthesis using Artificial Neural Networks (ANN).
• The method involves training an ANN on a set of formant frequency
parameters and their corresponding phonetic labels to generate synthetic
speech.
• The paper discusses the advantages of using formant-based synthesis over
concatenative TTS, including improved naturalness and flexibility.
• The proposed method is shown to achieve high-quality speech synthesis with
low computational complexity and outperform existing methods in terms of
naturalness and intelligibility.
Methodology:
• Block Diagram:
Steps:
1. Collection of Speech Samples.
2. Pre-processing
3. Frame the speech signal
4. Compute LPC coefficients
5. Compute formant frequencies
6. Inverse filtering
7. Comparing formants of MATLAB with Praat Tool formants
Letter: ಅ(Male(23))
Letter: ಆ (Male(23))
Letter : ಅ(Female(18))
Letter : ಆ (Female(18))
Formant Frequencies of Male person ( Age 20-25): Vowels
Formant Frequencies of Male person ( Age 20-25): Consonants
Conclusion:
• In this work Kannada vowels and consonants were recorded from different age groups.
• Formants frequencies of corresponding Vowels and Consonants were computed. The variation of
formant frequencies across different gender and different age groups are shown in tables.
• The analysis is carried out separately for male and female speakers. The preliminary analysis of
frequency domain characteristics of vowels shows significant variations across different gender
and age groups.
• The importance of F1, F2, F3, F4 (formants) and their impact on order of the LPC filter have been
studied thoroughly in great details. The results have indicated the significant dependency of
speech signal characteristics on gender and different age groups.
Future work:
1.LPC-based formant frequency estimation can be used for speech enhancement,
speech recognition, voice conversion, and speech pathology diagnosis.
2.In speech enhancement, LPC can help to remove noise and other unwanted
distortions from speech signals.
3.In speech recognition, LPC-based formant frequency estimation can improve
accuracy and mitigate the effects of noise and other distortions.
4.In voice conversion, LPC can be used to convert the formant frequencies of one
speaker's voice to those of another speaker.
5.In speech pathology diagnosis, LPC-based formant frequency estimation can be
used to identify deviations from normal speech patterns and assist in diagnosis.
REFERENCES
1. Latha, M., M. Shivakumar, and R. Manjula. "Performance Analysis of Kannada Phonetics: Vowels,
Fricatives and Stop Consonants Using LP Spectrum." SN Computer Science 1, no. 2 (2020): 84.
2. Chakrasali Sadashiva, Umesh Bilembagi, and K. Indira. "Formants and LPC analysis of
Kannada vowel speech signals." In 2018 3rd IEEE International Conference on Recent
Trends in Electronics,Information & Communication Technology (RTEICT), pp. 945-948.
IEEE, 2018.
3. Dhiman Chowdhury , Md. Raju Ahmed Ripan,Md. Mehedihasan “Speech Features:
Pitch and Formant Extraction of Vowel Sounds Using Autocorrelation and Frequency
Domain Spectral Analysis” International conference on Innovation in Engineering
and Technology (ICIET),27-29 Dec 2018.
4. Sajad Hamzenejad , Seyed Amir Yousef Hosseini Goki, Mahdieh Ghazvini “Extraction
of Speech Pitch and Formant Frequencies using Discrete Wavelet Transform”, 2019 7th
Iranian Joint Congress on Fuzzy and Intelligent Systems (CFIS).
5. U. Shrawankar and V. Thakare, “Feature Extraction for a Speech Recognition System in Noisy
Environment: A Study”, in Proc. Second Int. Conf. on Computer Engineering and Applications, 19-21
Mar. 2010.
6. EV Raghavendra, P. Vijay Aditya and K. Prahalad, "Speech synthesis using artificial neural networks
2010 National Conference On Communications (NCC) Chennai, India 2010, pp: 1-5, dor 10.1109/NCC
2010 5430190
7. Reddy MV, Hanumanthappa M. Kannada phonemes to speech dictionary: statistical approach. Int J Eng
Res Appl. 2017;7(1):77–80.
8. Sarika Hegde KK, Achary KK, Shetty S. Statistical analysis of features and classification of alpha
syllabary, sounds in Kannada language. New York: Springer; 2014.
9. Formant Text To Speech Synthesis Using Artificial Neural Networks, 2019 Second International
Conference on Advanced Computational and Communication Paradigms (ICACCP).
This Photo by Unknown author is licensed under CC BY-SA.

More Related Content

PDF
ANALYSIS OF SPEECH UNDER STRESS USING LINEAR TECHNIQUES AND NON-LINEAR TECHNI...
PDF
Analysis of speech signal mlbp features
PDF
Analysis of speech signal mlbp features
PDF
Analysis of speech signal mlbp features
PDF
Analysis of speech signal mlbp features
PDF
FORMANT ANALYSIS OF BANGLA VOWEL FOR AUTOMATIC SPEECH RECOGNITION
DOC
Assignment_DSP.doc
PDF
Arabic Phoneme Recognition using Hierarchical Neural Fuzzy Petri Net and LPC ...
ANALYSIS OF SPEECH UNDER STRESS USING LINEAR TECHNIQUES AND NON-LINEAR TECHNI...
Analysis of speech signal mlbp features
Analysis of speech signal mlbp features
Analysis of speech signal mlbp features
Analysis of speech signal mlbp features
FORMANT ANALYSIS OF BANGLA VOWEL FOR AUTOMATIC SPEECH RECOGNITION
Assignment_DSP.doc
Arabic Phoneme Recognition using Hierarchical Neural Fuzzy Petri Net and LPC ...

Similar to Powerpoint on Linear Predictive coding.pptx (20)

PDF
Audio/Speech Signal Analysis for Depression
PDF
Isolated English Word Recognition System: Appropriate for Bengali-accented En...
PDF
Feature Extraction Analysis for Hidden Markov Models in Sundanese Speech Reco...
PDF
Speaker Recognition System using MFCC and Vector Quantization Approach
PDF
Investigation of-combined-use-of-mfcc-and-lpc-features-in-speech-recognition-...
PDF
Detection of speech under stress using spectral analysis
PDF
Detection of speech under stress using spectral analysis
PDF
Speech Analysis and synthesis using Vocoder
PPTX
Speech Signal Processing
PDF
Text independent speaker identification system using average pitch and forman...
PDF
Voice Signal Synthesis using Non Negative Matrix Factorization
PDF
Automatic speech emotion and speaker recognition based on hybrid gmm and ffbnn
PDF
SEARCH TIME REDUCTION USING HIDDEN MARKOV MODELS FOR ISOLATED DIGIT RECOGNITION
PDF
Speech Feature Extraction and Data Visualisation
PDF
Artificial Intelligent Algorithm for the Analysis, Quality Speech & Different...
PDF
AN ANALYSIS OF SPEECH RECOGNITION PERFORMANCE BASED UPON NETWORK LAYERS AND T...
DOCX
Voice biometric recognition
PPTX
BSc 4th year project proposal final 16-5-22
PDF
DATABASES, FEATURES, CLASSIFIERS AND CHALLENGES IN AUTOMATIC SPEECH RECOGNITI...
PDF
Audio/Speech Signal Analysis for Depression
Isolated English Word Recognition System: Appropriate for Bengali-accented En...
Feature Extraction Analysis for Hidden Markov Models in Sundanese Speech Reco...
Speaker Recognition System using MFCC and Vector Quantization Approach
Investigation of-combined-use-of-mfcc-and-lpc-features-in-speech-recognition-...
Detection of speech under stress using spectral analysis
Detection of speech under stress using spectral analysis
Speech Analysis and synthesis using Vocoder
Speech Signal Processing
Text independent speaker identification system using average pitch and forman...
Voice Signal Synthesis using Non Negative Matrix Factorization
Automatic speech emotion and speaker recognition based on hybrid gmm and ffbnn
SEARCH TIME REDUCTION USING HIDDEN MARKOV MODELS FOR ISOLATED DIGIT RECOGNITION
Speech Feature Extraction and Data Visualisation
Artificial Intelligent Algorithm for the Analysis, Quality Speech & Different...
AN ANALYSIS OF SPEECH RECOGNITION PERFORMANCE BASED UPON NETWORK LAYERS AND T...
Voice biometric recognition
BSc 4th year project proposal final 16-5-22
DATABASES, FEATURES, CLASSIFIERS AND CHALLENGES IN AUTOMATIC SPEECH RECOGNITI...
Ad

Recently uploaded (20)

PPTX
Unit 4 Computer Architecture Multicore Processor.pptx
PDF
Τίμαιος είναι φιλοσοφικός διάλογος του Πλάτωνα
PDF
Environmental Education MCQ BD2EE - Share Source.pdf
PPTX
Computer Architecture Input Output Memory.pptx
PPTX
A powerpoint presentation on the Revised K-10 Science Shaping Paper
PPTX
Introduction to pro and eukaryotes and differences.pptx
PPTX
ELIAS-SEZIURE AND EPilepsy semmioan session.pptx
PDF
David L Page_DCI Research Study Journey_how Methodology can inform one's prac...
PDF
BP 505 T. PHARMACEUTICAL JURISPRUDENCE (UNIT 1).pdf
PDF
Race Reva University – Shaping Future Leaders in Artificial Intelligence
PDF
Journal of Dental Science - UDMY (2021).pdf
PDF
semiconductor packaging in vlsi design fab
PPTX
Climate Change and Its Global Impact.pptx
PPTX
Core Concepts of Personalized Learning and Virtual Learning Environments
PDF
LIFE & LIVING TRILOGY - PART (3) REALITY & MYSTERY.pdf
PDF
Complications of Minimal Access-Surgery.pdf
PDF
MBA _Common_ 2nd year Syllabus _2021-22_.pdf
PDF
LIFE & LIVING TRILOGY - PART - (2) THE PURPOSE OF LIFE.pdf
PDF
Literature_Review_methods_ BRACU_MKT426 course material
PDF
Journal of Dental Science - UDMY (2022).pdf
Unit 4 Computer Architecture Multicore Processor.pptx
Τίμαιος είναι φιλοσοφικός διάλογος του Πλάτωνα
Environmental Education MCQ BD2EE - Share Source.pdf
Computer Architecture Input Output Memory.pptx
A powerpoint presentation on the Revised K-10 Science Shaping Paper
Introduction to pro and eukaryotes and differences.pptx
ELIAS-SEZIURE AND EPilepsy semmioan session.pptx
David L Page_DCI Research Study Journey_how Methodology can inform one's prac...
BP 505 T. PHARMACEUTICAL JURISPRUDENCE (UNIT 1).pdf
Race Reva University – Shaping Future Leaders in Artificial Intelligence
Journal of Dental Science - UDMY (2021).pdf
semiconductor packaging in vlsi design fab
Climate Change and Its Global Impact.pptx
Core Concepts of Personalized Learning and Virtual Learning Environments
LIFE & LIVING TRILOGY - PART (3) REALITY & MYSTERY.pdf
Complications of Minimal Access-Surgery.pdf
MBA _Common_ 2nd year Syllabus _2021-22_.pdf
LIFE & LIVING TRILOGY - PART - (2) THE PURPOSE OF LIFE.pdf
Literature_Review_methods_ BRACU_MKT426 course material
Journal of Dental Science - UDMY (2022).pdf
Ad

Powerpoint on Linear Predictive coding.pptx

  • 1. “LPC Analysis of Kannada Syllables” Under the Guidance of Prof. K. Indira Dept. of E&C, RIT, Bangalore Presented by Vinodkumar A G - 1MS20EC127 Sridhar B - 1MS20EC111 Vinayak M B - 1MS20EC125 Vinay Swastik H - 1MS20EC124 MAJOR PROJECT
  • 2. Table of Contents 1. Introduction and overview of the project 2. Problem statement 3. Project objectives and scope 4. Literature survey 5. Methodology, Proposed Work and Preliminary Results 6. References
  • 3. INTRODUCTION • The project will involve implementing the LPC algorithm to model the vocal tract, extracting the formant frequencies from the LPC model, and comparing the results with known formant frequencies. • The estimation of formant frequencies is an important task in speech signal processing, as it provides information about the spectral characteristics of the vocal tract. Linear Predictive Coding (LPC) is a widely used technique for speech analysis and has been shown to be effective for estimating formant frequencies. • Overall, the goal of this project is to develop a practical understanding of LPC and its applications in speech analysis and to gain insights into the spectral characteristics of the vocal tract.
  • 4. PROBLEM STATEMENT To estimate Formant frequencies of the Kannada Vowels and Consonants using Linear Predictive Coefficients and verify using Praat tool.
  • 5. Project Objectives  The main objective of this report is to estimate the formant frequencies of Kannada vowels and consonants using Linear Predictive Coding (LPC) analysis.  The formant frequencies of speech sounds provide information about the resonant properties of the vocal tract, which are essential for understanding the acoustic properties of speech sounds.  The estimated formant frequencies can be used for further analysis and modeling of Kannada speech signals.  An additional objective may involve applying the estimated formant and pitch frequencies in practical speech processing tasks. This could include applications such as speech recognition, speech synthesis, or speaker identification specific to Kannada language.
  • 6. Literature Survey 1.Performance Analysis of Kannada Phonetics: Vowels, Fricatives and Stop Consonants Using LP Spectrum. -By Shivakumar M and Latha Mariswamy • A dataset of Kannada speech samples is collected, including a variety of phonetic units such as vowels, fricatives, and stop consonants, recorded from native Kannada speakers. • LP spectrum analysis is performed on the collected speech samples to estimate the spectral envelope and formant frequencies. • The study presents the findings of the spectral analysis, highlighting the spectral characteristics, patterns, and variations observed in Kannada vowels, fricatives, and stop consonants. • The accuracy and effectiveness of the LP spectrum analysis in capturing the phonetic properties of Kannada are evaluated by comparing the estimated spectral features with known phonetic characteristics.
  • 7. 2.Formants and LPC Analysis of Kannada Vowels Speech Signal -By K. Indira, Sadashiva Chakrasali and Umesh Bilembagi • The speech signal is down-sampled by a factor of 6 after passing through a low-pass filter, resulting in a sampling frequency of 7350Hz. • Pre-emphasis is applied to enhance the power of high frequency signals before LPC coefficients are extracted using an autoregressive filter of varying order. • The LPC filter is then used to obtain the LP residual, and frequency responses of LPC filters for different orders are compared with the formants of corresponding vowels noted from a tool.
  • 8. 3.Extraction of Speech Pitch and Formant Frequencies using Discrete Wavelet Transform. -By Sajad Hamzenejadi, Seyed Amir Yousef Hosseini Goki and Mahdieh Ghazvini • The paper proposes a method for estimating speech pitch and formant frequencies using Discrete Wavelet Transform. • DWT is used to decompose the speech signal into sub-bands, and the pitch and formant frequencies are estimated from each sub-band. • The method is advantageous because it captures both time and frequency information, and is efficiently implemented using filter banks. • The proposed method is shown to outperform existing methods for pitch and formant frequency estimation in terms of accuracy and robustness.
  • 9. 4.Formant Text to Speech Synthesis Using Artificial Neural Networks. -By Gurinder Kaur and Parminder Singh • The paper proposes a method for formant-based Text-to-Speech (TTS) synthesis using Artificial Neural Networks (ANN). • The method involves training an ANN on a set of formant frequency parameters and their corresponding phonetic labels to generate synthetic speech. • The paper discusses the advantages of using formant-based synthesis over concatenative TTS, including improved naturalness and flexibility. • The proposed method is shown to achieve high-quality speech synthesis with low computational complexity and outperform existing methods in terms of naturalness and intelligibility.
  • 11. Steps: 1. Collection of Speech Samples. 2. Pre-processing 3. Frame the speech signal 4. Compute LPC coefficients 5. Compute formant frequencies 6. Inverse filtering 7. Comparing formants of MATLAB with Praat Tool formants
  • 15. Letter : ಆ (Female(18))
  • 16. Formant Frequencies of Male person ( Age 20-25): Vowels
  • 17. Formant Frequencies of Male person ( Age 20-25): Consonants
  • 18. Conclusion: • In this work Kannada vowels and consonants were recorded from different age groups. • Formants frequencies of corresponding Vowels and Consonants were computed. The variation of formant frequencies across different gender and different age groups are shown in tables. • The analysis is carried out separately for male and female speakers. The preliminary analysis of frequency domain characteristics of vowels shows significant variations across different gender and age groups. • The importance of F1, F2, F3, F4 (formants) and their impact on order of the LPC filter have been studied thoroughly in great details. The results have indicated the significant dependency of speech signal characteristics on gender and different age groups.
  • 19. Future work: 1.LPC-based formant frequency estimation can be used for speech enhancement, speech recognition, voice conversion, and speech pathology diagnosis. 2.In speech enhancement, LPC can help to remove noise and other unwanted distortions from speech signals. 3.In speech recognition, LPC-based formant frequency estimation can improve accuracy and mitigate the effects of noise and other distortions. 4.In voice conversion, LPC can be used to convert the formant frequencies of one speaker's voice to those of another speaker. 5.In speech pathology diagnosis, LPC-based formant frequency estimation can be used to identify deviations from normal speech patterns and assist in diagnosis.
  • 20. REFERENCES 1. Latha, M., M. Shivakumar, and R. Manjula. "Performance Analysis of Kannada Phonetics: Vowels, Fricatives and Stop Consonants Using LP Spectrum." SN Computer Science 1, no. 2 (2020): 84. 2. Chakrasali Sadashiva, Umesh Bilembagi, and K. Indira. "Formants and LPC analysis of Kannada vowel speech signals." In 2018 3rd IEEE International Conference on Recent Trends in Electronics,Information & Communication Technology (RTEICT), pp. 945-948. IEEE, 2018. 3. Dhiman Chowdhury , Md. Raju Ahmed Ripan,Md. Mehedihasan “Speech Features: Pitch and Formant Extraction of Vowel Sounds Using Autocorrelation and Frequency Domain Spectral Analysis” International conference on Innovation in Engineering and Technology (ICIET),27-29 Dec 2018. 4. Sajad Hamzenejad , Seyed Amir Yousef Hosseini Goki, Mahdieh Ghazvini “Extraction of Speech Pitch and Formant Frequencies using Discrete Wavelet Transform”, 2019 7th Iranian Joint Congress on Fuzzy and Intelligent Systems (CFIS).
  • 21. 5. U. Shrawankar and V. Thakare, “Feature Extraction for a Speech Recognition System in Noisy Environment: A Study”, in Proc. Second Int. Conf. on Computer Engineering and Applications, 19-21 Mar. 2010. 6. EV Raghavendra, P. Vijay Aditya and K. Prahalad, "Speech synthesis using artificial neural networks 2010 National Conference On Communications (NCC) Chennai, India 2010, pp: 1-5, dor 10.1109/NCC 2010 5430190 7. Reddy MV, Hanumanthappa M. Kannada phonemes to speech dictionary: statistical approach. Int J Eng Res Appl. 2017;7(1):77–80. 8. Sarika Hegde KK, Achary KK, Shetty S. Statistical analysis of features and classification of alpha syllabary, sounds in Kannada language. New York: Springer; 2014. 9. Formant Text To Speech Synthesis Using Artificial Neural Networks, 2019 Second International Conference on Advanced Computational and Communication Paradigms (ICACCP).
  • 22. This Photo by Unknown author is licensed under CC BY-SA.