VOICE RECOGNITION SYSTEM

    Shailendra Singh Tiwari                                 SGSITS ENGINEERING COLLEGE
    Computer Science Engineering                          INDORE (M.P.)
    singhtiwari.shailendra@gmail.com
    AB-24108                                                 Introduction


   Abstract
R.N.:O801CS123D13                                            Speech recognition system performs two
                                                             fundamental operations: signal modeling and
   Automatic speech recognition (ASR) has made
SGSITS                                                       pattern matching Signal modeling represents
   great strides with the development of digital             process of converting speech signal into a set of
   signal processing hardware and software. But              parameters. Pattern matching is the task of finding
   despite of all these advances, machines cannot            parameter set from memory which closely
   match the performance of their human                      matches the parameter set obtained from the
   counterparts in terms of accuracy and speed,              input speech signal.
   specially in case of speaker independent speech
   recognition. So today significant portion of              Signal Modeling
   speech recognition research is focussed on
   speaker independent speech recognition problem.           To obtain the perceptually meaningful parameters
   The reasons are its wide range of applications,           i.e. parameters which are analogous to those used
   and limitations of available techniques of speech         behuman auditory system. To obtain the invariant
   recognition. In this report we briefly discuss the        parameters i.e. parameters which are robust to
   signal modeling approach for speech recognition.          variations in channel, speaker and transducer. To
   It is followed by overview of basic operations            obtain parameters that capture spectral dynamics,
   involved in signal modeling. Further commonly             or changes of spectrum with time. The signal
   used temporal and spectral analysis techniques of         modeling involves basic operationsSpectral
   feature extraction are discussed in detail.               shapin


   spectral shapingis the process of converting the          Analysis techniques for feature extraction have
   speech signal from sound pressure wave to a digital       been studied in detail and following conclusions
   signal; and emphasizing important frequency               are drawn
   components in the signal.
                                                             Temporal analysis techniques involve less
   Feature extraction                                        computation, ease of implementation. But they
                                                             are limited to determination simple speech
   Feature extraction is process of obtaining different      parameters like power, energy and periodicity of
   features such as power, pitch, and vocal tract            speech. For finding vocal tract parameters we
   configuration from the speech signal. Parameter           require spectral analysis techniques. Critical
   transformation is the process of converting
   these features into signal parameters through             band filter bank decomposes the speech signal
   process of differentiation and concatenation.             into discrete set of spectral samples containing
   Statistical modeling involves conversion of               information, which is similar to information,
   parameters in signal observation vectors.                 presented to higher levels processing in auditory
                                                             system. Cepstral analysis separates the speech
   Parametric transformation                                 signal into component representing excitation
                                                             source and a component representing vocal tract
                                                             impulse response.
   Feature Extraction
                                                             So it provides information about pitch and
   In speaker independent speech recogniton, a               vocal tract configuration. But it is computationally
   premium is placed on extracting features that             more intensive. Mel cepstral analysis has
   are somewhat invariant to changes in the speaker.         decorrelating property of cepstral analysis and
   So feture extraction involves analysis of speech          also includes some aspects of audition. LPC
   siganl. Broadly the feature extraction techniques         analysis provides compact representation of vocal
   are classified as temporal analysis and spectral          tract configuration by relatively simple
   analysis technique. In temporal analysis the              computation compared tocepstral analysis. To
   speech waveform itself is used for analysis. In           minimize analysis complexity it assumes all
   spectral analysis spectral representation of speech       pole model for speech production system. But
   signal is used for analysis.                              speech has zeros due to nasals so in these cases the
                                                             result are not as good as in case of vowels but still
   Conclusions                                               reasonably acceptable if order of model is
                                                             sufficiently high.
   The basic operations in speech recognition system
   have been discussed briefly. Different temporal
   and spectral
Acknowlegdement                                   References
I wish to express my sincere gratitude to Prof.   L. R. Rabiner and R. W. Schafer, Digital
puja gupta for her constant guidance throughout   Processing of Speech Signals. Englewood Cliffs,
the course of the computer workshop and many      New Jersey:
useful discussions which enabled me to know the
subtleties of the subject in proper way.          Prentice-Hall, 1978. D.O. Shaughnessy, Speech
                                                  Communication: Human and Machine.
                                                  India:University Press ,2001.

More Related Content

PDF
A GAUSSIAN MIXTURE MODEL BASED SPEECH RECOGNITION SYSTEM USING MATLAB
PDF
IRJET - Audio Emotion Analysis
PDF
Voice over IP (VoIP) Speech Quality Measurement with Open-Source Software Com...
PDF
[IJET-V1I6P21] Authors : Easwari.N , Ponmuthuramalingam.P
PDF
Interactive voice conversion for augmented speech production
DOC
Speaker recognition on matlab
PDF
Nd2421622165
PPTX
Voice Identification And Recognition System, Matlab
A GAUSSIAN MIXTURE MODEL BASED SPEECH RECOGNITION SYSTEM USING MATLAB
IRJET - Audio Emotion Analysis
Voice over IP (VoIP) Speech Quality Measurement with Open-Source Software Com...
[IJET-V1I6P21] Authors : Easwari.N , Ponmuthuramalingam.P
Interactive voice conversion for augmented speech production
Speaker recognition on matlab
Nd2421622165
Voice Identification And Recognition System, Matlab

What's hot (20)

PPTX
Speech Signal Analysis
PPTX
Speaker recognition system by abhishek mahajan
PPTX
Speaker recognition using MFCC
PDF
IJERD (www.ijerd.com) International Journal of Engineering Research and Devel...
DOC
Speaker recognition.
PDF
Speech Enhancement Using A Minimum Mean Square Error Short Time Spectral Ampl...
PDF
Adaptive noise estimation algorithm for speech enhancement
PPTX
Speech Signal Processing
PDF
A REVIEW OF LPC METHODS FOR ENHANCEMENT OF SPEECH SIGNALS
PPT
Speech technology basics
PPTX
SPEAKER VERIFICATION
PPT
multirate signal processing for speech
DOCX
speech enhancement
PPTX
COLEA : A MATLAB Tool for Speech Analysis
PDF
Dy36749754
PPT
Speech Recognition System By Matlab
PPT
Speaker identification system with voice controlled functionality
PDF
G010424248
Speech Signal Analysis
Speaker recognition system by abhishek mahajan
Speaker recognition using MFCC
IJERD (www.ijerd.com) International Journal of Engineering Research and Devel...
Speaker recognition.
Speech Enhancement Using A Minimum Mean Square Error Short Time Spectral Ampl...
Adaptive noise estimation algorithm for speech enhancement
Speech Signal Processing
A REVIEW OF LPC METHODS FOR ENHANCEMENT OF SPEECH SIGNALS
Speech technology basics
SPEAKER VERIFICATION
multirate signal processing for speech
speech enhancement
COLEA : A MATLAB Tool for Speech Analysis
Dy36749754
Speech Recognition System By Matlab
Speaker identification system with voice controlled functionality
G010424248
Ad

Viewers also liked (9)

PPTX
Blogging and research
DOCX
Bitácora 08 04-2013
PPTX
How we made our cd pane
PDF
Dengue
DOCX
Bitácora 01 04-2013
PPT
Price for buying pinterest followers
DOCX
3 essays
ODP
CCW Handguns - Semi Automatic vs Revolver
DOCX
Bitácora 09 04-2013
Blogging and research
Bitácora 08 04-2013
How we made our cd pane
Dengue
Bitácora 01 04-2013
Price for buying pinterest followers
3 essays
CCW Handguns - Semi Automatic vs Revolver
Bitácora 09 04-2013
Ad

Similar to Servey (20)

PDF
Utterance Based Speaker Identification Using ANN
PDF
Utterance Based Speaker Identification Using ANN
PDF
A Novel, Robust, Hierarchical, Text-Independent Speaker Recognition Technique
PDF
Anvita Eusipco 2004
PDF
Anvita Eusipco 2004
PDF
Ijetcas14 426
PDF
Utterance based speaker identification
PDF
Real Time Speaker Identification System – Design, Implementation and Validation
PPTX
voice morphing.pptx
PDF
ROBUST FEATURE EXTRACTION USING AUTOCORRELATION DOMAIN FOR NOISY SPEECH RECOG...
PDF
International Journal of Engineering and Science Invention (IJESI)
PDF
CURVELET BASED SPEECH RECOGNITION SYSTEM IN NOISY ENVIRONMENT: A STATISTICAL ...
PDF
CURVELET BASED SPEECH RECOGNITION SYSTEM IN NOISY ENVIRONMENT: A STATISTICAL ...
PDF
IRJET- A Survey on Sound Recognition
PPT
Speech Recognition
PDF
Adaptive wavelet thresholding with robust hybrid features for text-independe...
PDF
Bachelors project summary
PDF
E0502 01 2327
PDF
Speaker Identification
Utterance Based Speaker Identification Using ANN
Utterance Based Speaker Identification Using ANN
A Novel, Robust, Hierarchical, Text-Independent Speaker Recognition Technique
Anvita Eusipco 2004
Anvita Eusipco 2004
Ijetcas14 426
Utterance based speaker identification
Real Time Speaker Identification System – Design, Implementation and Validation
voice morphing.pptx
ROBUST FEATURE EXTRACTION USING AUTOCORRELATION DOMAIN FOR NOISY SPEECH RECOG...
International Journal of Engineering and Science Invention (IJESI)
CURVELET BASED SPEECH RECOGNITION SYSTEM IN NOISY ENVIRONMENT: A STATISTICAL ...
CURVELET BASED SPEECH RECOGNITION SYSTEM IN NOISY ENVIRONMENT: A STATISTICAL ...
IRJET- A Survey on Sound Recognition
Speech Recognition
Adaptive wavelet thresholding with robust hybrid features for text-independe...
Bachelors project summary
E0502 01 2327
Speaker Identification

Servey

  • 1. VOICE RECOGNITION SYSTEM Shailendra Singh Tiwari SGSITS ENGINEERING COLLEGE Computer Science Engineering INDORE (M.P.) singhtiwari.shailendra@gmail.com AB-24108 Introduction Abstract R.N.:O801CS123D13 Speech recognition system performs two fundamental operations: signal modeling and Automatic speech recognition (ASR) has made SGSITS pattern matching Signal modeling represents great strides with the development of digital process of converting speech signal into a set of signal processing hardware and software. But parameters. Pattern matching is the task of finding despite of all these advances, machines cannot parameter set from memory which closely match the performance of their human matches the parameter set obtained from the counterparts in terms of accuracy and speed, input speech signal. specially in case of speaker independent speech recognition. So today significant portion of Signal Modeling speech recognition research is focussed on speaker independent speech recognition problem. To obtain the perceptually meaningful parameters The reasons are its wide range of applications, i.e. parameters which are analogous to those used and limitations of available techniques of speech behuman auditory system. To obtain the invariant recognition. In this report we briefly discuss the parameters i.e. parameters which are robust to signal modeling approach for speech recognition. variations in channel, speaker and transducer. To It is followed by overview of basic operations obtain parameters that capture spectral dynamics, involved in signal modeling. Further commonly or changes of spectrum with time. The signal used temporal and spectral analysis techniques of modeling involves basic operationsSpectral feature extraction are discussed in detail. shapin spectral shapingis the process of converting the Analysis techniques for feature extraction have speech signal from sound pressure wave to a digital been studied in detail and following conclusions signal; and emphasizing important frequency are drawn components in the signal. Temporal analysis techniques involve less Feature extraction computation, ease of implementation. But they are limited to determination simple speech Feature extraction is process of obtaining different parameters like power, energy and periodicity of features such as power, pitch, and vocal tract speech. For finding vocal tract parameters we configuration from the speech signal. Parameter require spectral analysis techniques. Critical transformation is the process of converting these features into signal parameters through band filter bank decomposes the speech signal process of differentiation and concatenation. into discrete set of spectral samples containing Statistical modeling involves conversion of information, which is similar to information, parameters in signal observation vectors. presented to higher levels processing in auditory system. Cepstral analysis separates the speech Parametric transformation signal into component representing excitation source and a component representing vocal tract impulse response. Feature Extraction So it provides information about pitch and In speaker independent speech recogniton, a vocal tract configuration. But it is computationally premium is placed on extracting features that more intensive. Mel cepstral analysis has are somewhat invariant to changes in the speaker. decorrelating property of cepstral analysis and So feture extraction involves analysis of speech also includes some aspects of audition. LPC siganl. Broadly the feature extraction techniques analysis provides compact representation of vocal are classified as temporal analysis and spectral tract configuration by relatively simple analysis technique. In temporal analysis the computation compared tocepstral analysis. To speech waveform itself is used for analysis. In minimize analysis complexity it assumes all spectral analysis spectral representation of speech pole model for speech production system. But signal is used for analysis. speech has zeros due to nasals so in these cases the result are not as good as in case of vowels but still Conclusions reasonably acceptable if order of model is sufficiently high. The basic operations in speech recognition system have been discussed briefly. Different temporal and spectral
  • 2. Acknowlegdement References I wish to express my sincere gratitude to Prof. L. R. Rabiner and R. W. Schafer, Digital puja gupta for her constant guidance throughout Processing of Speech Signals. Englewood Cliffs, the course of the computer workshop and many New Jersey: useful discussions which enabled me to know the subtleties of the subject in proper way. Prentice-Hall, 1978. D.O. Shaughnessy, Speech Communication: Human and Machine. India:University Press ,2001.