SlideShare a Scribd company logo
Speech Recognition
Speech Recognition
Speech recognition (also known as automatic speech recognition or computer speech recognition) converts spoken words to text.The term "voice recognition" is sometimes used to refer to speech recognition where the recognition system is trained to a particular speaker.
Applications of Speech RecognitionSpeech recognition applications include Voice dialing (e.g., "Call home"),
Call routing (e.g., "I would like to make a collect call"),
Simple data entry (e.g., entering a credit card number),
Preparation of structured documents (e.g., A radiology report),
Speech-to-text processing (e.g., word processors or emails), and
In aircraft cockpits (usually termed Direct Voice Input).Cont.,Automatic translation; Automotive speech recognition (e.g., Ford Sync); Telematics (e.g. Vehicle Navigation Systems); Court reporting (Realtime Voice Writing); Hands-free computing: voice command recognition computer user interface; Home automation; Interactive voice response; Mobile telephony, including mobile email; Multimodal interaction; Pronunciation evaluation in computer-aided language learning applications; Robotics; Video games, with Tom Clancy's EndWar and Lifeline as working examples; Transcription(digital speech-to-text); Speech-to-text (transcription of speech into mobile text messages); Air Traffic Control Speech Recognition
Speech recognition techniquesAnalysis techniques are similar for speech and speaker recognition.The following are techniques in SR :Modal evaluationText dependenceStochastic modalsVector quantizationCepstral analysis (High Recognition Accuracy)Orthogonal LPC parametersNeural network approaches
Speech Recognition ArchitectureThe noisy channel model of individual wordsThe noisy model channel applied to entire sentence
Speech Recognition ArchitectureThe goal of the probabilistic noisy channel architecture for speech recognition can be summarized as follows :    What is the most likely sentence out of all sentences in the language L given some acoustic input O ?
Speech Recognition ArchitectureThree stage for speech recognition system    Signal processing or Feature extraction stage :Waveform is sliced up into frames.Waveform are transformed into spectral features.Subwordor Phone recognition stage :Recognize individual speech.    Decoding stage :Find the sequence of words that most probably generated the input
Overview of HMMsMarkov chains used “to model pronunciation”.Forward algorithm:Phonesequences likelihood.Real input is not symbolic: Spectral featuresInput symbols do not correspond to machine states.Note: Why HMMs are used in speech recognition is that a speech signal could be viewed as a piecewise stationary signal or a short-time stationary signal.
Speech Recognition RequirementsTo use speech recognition, you need the following:A high quality close-talk (headset) microphone with gain adjustment (gain adjustment: A microphone feature that allows your input to be amplified so that it is made louder for use by the system.) support (A universal serial bus (USB) microphone is recommended.)A 400 megahertz (MHz) or faster computer128 MB or more of memoryWindows 2000 with Service Pack 3 or Windows XP or laterMicrosoft Internet Explorer 5.01 or later
Speech Recognition
Automatic Speech Recognition System for Home Appliances Control     Abstract - In the present work we study the performance of a speech recognizer for the Greek language, in a smart-home environment. This recognizer operates in spoken interaction scenarios, where the users are able to control various home appliances. In contrast to command and control systems, in our application the users speak spontaneously, beyond the use of a standardized set of isolated commands. The operational performance was tested over various environmental conditions, for two different types of microphones.
Different Home Appliances Control Scenarios
Dialogue systemsDialogue systems play a key role in any kind of conversational spoken language interface.Intelligent interfaces of home appliances provide the means for facilitating the operation of these devices, within a dialogue system. Various systems for home appliance control have been reported in the literature, focusing on enhancing the performance of the speech recognition process

More Related Content

PPTX
Programming Fundamentals and Programming Languages Concepts
PPTX
Assembly language
PPTX
Assembly Language
PPTX
classification of computer language
DOCX
Assembly language
PPT
Assembly language
PPT
Chapt 01 Assembly Language
PPTX
Computer languages
Programming Fundamentals and Programming Languages Concepts
Assembly language
Assembly Language
classification of computer language
Assembly language
Assembly language
Chapt 01 Assembly Language
Computer languages

What's hot (20)

PPTX
Assembly language programming
PPTX
Introduction to Programming Languages
PPT
Computer Organization and Assembly Language
PPTX
Assembly language
PPTX
Programming Fundamental Slide No.1
PPTX
Programming Languages / Translators
PPTX
Algorithms - Introduction to computer programming
PPTX
Computer programming
PDF
Ch0 computer systems overview
PPTX
Presentation on computer language
PPT
Al2ed chapter1
PPT
Introduction To Computer and Java
PPT
VOICE BASED SECURITY SYSTEM
PPT
Introduction to programming principles languages
PPTX
Voice Browser
PDF
Assembly Language Programming By Ytha Yu, Charles Marut Chap 1(Microcomputer ...
PPTX
Intro to assembly language
PPTX
Introduction To Programming in Matlab
PPTX
Programming Fundamentals lecture 2
PDF
Assembly Langauge Chap 1
Assembly language programming
Introduction to Programming Languages
Computer Organization and Assembly Language
Assembly language
Programming Fundamental Slide No.1
Programming Languages / Translators
Algorithms - Introduction to computer programming
Computer programming
Ch0 computer systems overview
Presentation on computer language
Al2ed chapter1
Introduction To Computer and Java
VOICE BASED SECURITY SYSTEM
Introduction to programming principles languages
Voice Browser
Assembly Language Programming By Ytha Yu, Charles Marut Chap 1(Microcomputer ...
Intro to assembly language
Introduction To Programming in Matlab
Programming Fundamentals lecture 2
Assembly Langauge Chap 1
Ad

Viewers also liked (17)

PDF
Social mediametricsdefinitionsfinal
PDF
Yancey 2011, August
PPT
C:\Fakepath\Overzicht 2009 Ppp
PPTX
Commodity Culture
PPTX
Esitlus1[1]
 
PPTX
Estonia
 
PPSX
Happy Valentines Day Slide Show
PDF
Social advertising-best-practices-0509
ODP
Trabajo de la paz
PDF
Rock Music Research
PPT
Hüperrealism, neoekspressionism
 
DOCX
Bangladesh1
ODP
Interneteko Legedia
PDF
Rock Music Research
PPTX
VONQ presentation: past, present and future of recruitment (jan 2010)
PPTX
Bovine respiratory syncytial virus (brsv)
 
ODP
Semana santa 5º primaria
Social mediametricsdefinitionsfinal
Yancey 2011, August
C:\Fakepath\Overzicht 2009 Ppp
Commodity Culture
Esitlus1[1]
 
Estonia
 
Happy Valentines Day Slide Show
Social advertising-best-practices-0509
Trabajo de la paz
Rock Music Research
Hüperrealism, neoekspressionism
 
Bangladesh1
Interneteko Legedia
Rock Music Research
VONQ presentation: past, present and future of recruitment (jan 2010)
Bovine respiratory syncytial virus (brsv)
 
Semana santa 5º primaria
Ad

Similar to Speech Recognition (20)

PDF
Artificial Intelligence for Speech Recognition
PPTX
Speech Recognition
PPTX
Speech Recognition By Hardik Mistry(Laxmi Institute Of Technology)
PPTX
Speech Recognition Technology
PPTX
Dilpreetanshika major project
PPT
Asr
PDF
A survey on Enhancements in Speech Recognition
PDF
Speech recognition - how does it work?
PDF
Speech recognizers & generators
PPT
Speech recognition
PPTX
AI for voice recognition.pptx
PPTX
Artificial Intelligence - An Introduction
PPTX
Artificial Intelligence- An Introduction
PPT
Asr
PPT
Speechrecognition 100423091251-phpapp01
PPTX
Amadou
PPTX
Speech to text conversion
PPTX
Speech to text conversion
PDF
The role of speech technology in biometrics, forensics and man-machine interface
Artificial Intelligence for Speech Recognition
Speech Recognition
Speech Recognition By Hardik Mistry(Laxmi Institute Of Technology)
Speech Recognition Technology
Dilpreetanshika major project
Asr
A survey on Enhancements in Speech Recognition
Speech recognition - how does it work?
Speech recognizers & generators
Speech recognition
AI for voice recognition.pptx
Artificial Intelligence - An Introduction
Artificial Intelligence- An Introduction
Asr
Speechrecognition 100423091251-phpapp01
Amadou
Speech to text conversion
Speech to text conversion
The role of speech technology in biometrics, forensics and man-machine interface

Recently uploaded (20)

PPTX
Introduction to Building Materials
PDF
1_English_Language_Set_2.pdf probationary
PPTX
A powerpoint presentation on the Revised K-10 Science Shaping Paper
PDF
Chinmaya Tiranga quiz Grand Finale.pdf
PDF
Practical Manual AGRO-233 Principles and Practices of Natural Farming
PDF
GENETICS IN BIOLOGY IN SECONDARY LEVEL FORM 3
PPTX
Tissue processing ( HISTOPATHOLOGICAL TECHNIQUE
PPTX
Final Presentation General Medicine 03-08-2024.pptx
PPTX
Radiologic_Anatomy_of_the_Brachial_plexus [final].pptx
PPTX
202450812 BayCHI UCSC-SV 20250812 v17.pptx
PDF
medical_surgical_nursing_10th_edition_ignatavicius_TEST_BANK_pdf.pdf
PDF
RTP_AR_KS1_Tutor's Guide_English [FOR REPRODUCTION].pdf
PPTX
Digestion and Absorption of Carbohydrates, Proteina and Fats
PPTX
UV-Visible spectroscopy..pptx UV-Visible Spectroscopy – Electronic Transition...
PPTX
History, Philosophy and sociology of education (1).pptx
PPTX
1st Inaugural Professorial Lecture held on 19th February 2020 (Governance and...
PDF
Trump Administration's workforce development strategy
PDF
LDMMIA Reiki Yoga Finals Review Spring Summer
PDF
A GUIDE TO GENETICS FOR UNDERGRADUATE MEDICAL STUDENTS
PDF
Weekly quiz Compilation Jan -July 25.pdf
Introduction to Building Materials
1_English_Language_Set_2.pdf probationary
A powerpoint presentation on the Revised K-10 Science Shaping Paper
Chinmaya Tiranga quiz Grand Finale.pdf
Practical Manual AGRO-233 Principles and Practices of Natural Farming
GENETICS IN BIOLOGY IN SECONDARY LEVEL FORM 3
Tissue processing ( HISTOPATHOLOGICAL TECHNIQUE
Final Presentation General Medicine 03-08-2024.pptx
Radiologic_Anatomy_of_the_Brachial_plexus [final].pptx
202450812 BayCHI UCSC-SV 20250812 v17.pptx
medical_surgical_nursing_10th_edition_ignatavicius_TEST_BANK_pdf.pdf
RTP_AR_KS1_Tutor's Guide_English [FOR REPRODUCTION].pdf
Digestion and Absorption of Carbohydrates, Proteina and Fats
UV-Visible spectroscopy..pptx UV-Visible Spectroscopy – Electronic Transition...
History, Philosophy and sociology of education (1).pptx
1st Inaugural Professorial Lecture held on 19th February 2020 (Governance and...
Trump Administration's workforce development strategy
LDMMIA Reiki Yoga Finals Review Spring Summer
A GUIDE TO GENETICS FOR UNDERGRADUATE MEDICAL STUDENTS
Weekly quiz Compilation Jan -July 25.pdf

Speech Recognition

  • 3. Speech recognition (also known as automatic speech recognition or computer speech recognition) converts spoken words to text.The term "voice recognition" is sometimes used to refer to speech recognition where the recognition system is trained to a particular speaker.
  • 4. Applications of Speech RecognitionSpeech recognition applications include Voice dialing (e.g., "Call home"),
  • 5. Call routing (e.g., "I would like to make a collect call"),
  • 6. Simple data entry (e.g., entering a credit card number),
  • 7. Preparation of structured documents (e.g., A radiology report),
  • 8. Speech-to-text processing (e.g., word processors or emails), and
  • 9. In aircraft cockpits (usually termed Direct Voice Input).Cont.,Automatic translation; Automotive speech recognition (e.g., Ford Sync); Telematics (e.g. Vehicle Navigation Systems); Court reporting (Realtime Voice Writing); Hands-free computing: voice command recognition computer user interface; Home automation; Interactive voice response; Mobile telephony, including mobile email; Multimodal interaction; Pronunciation evaluation in computer-aided language learning applications; Robotics; Video games, with Tom Clancy's EndWar and Lifeline as working examples; Transcription(digital speech-to-text); Speech-to-text (transcription of speech into mobile text messages); Air Traffic Control Speech Recognition
  • 10. Speech recognition techniquesAnalysis techniques are similar for speech and speaker recognition.The following are techniques in SR :Modal evaluationText dependenceStochastic modalsVector quantizationCepstral analysis (High Recognition Accuracy)Orthogonal LPC parametersNeural network approaches
  • 11. Speech Recognition ArchitectureThe noisy channel model of individual wordsThe noisy model channel applied to entire sentence
  • 12. Speech Recognition ArchitectureThe goal of the probabilistic noisy channel architecture for speech recognition can be summarized as follows : What is the most likely sentence out of all sentences in the language L given some acoustic input O ?
  • 13. Speech Recognition ArchitectureThree stage for speech recognition system Signal processing or Feature extraction stage :Waveform is sliced up into frames.Waveform are transformed into spectral features.Subwordor Phone recognition stage :Recognize individual speech. Decoding stage :Find the sequence of words that most probably generated the input
  • 14. Overview of HMMsMarkov chains used “to model pronunciation”.Forward algorithm:Phonesequences likelihood.Real input is not symbolic: Spectral featuresInput symbols do not correspond to machine states.Note: Why HMMs are used in speech recognition is that a speech signal could be viewed as a piecewise stationary signal or a short-time stationary signal.
  • 15. Speech Recognition RequirementsTo use speech recognition, you need the following:A high quality close-talk (headset) microphone with gain adjustment (gain adjustment: A microphone feature that allows your input to be amplified so that it is made louder for use by the system.) support (A universal serial bus (USB) microphone is recommended.)A 400 megahertz (MHz) or faster computer128 MB or more of memoryWindows 2000 with Service Pack 3 or Windows XP or laterMicrosoft Internet Explorer 5.01 or later
  • 17. Automatic Speech Recognition System for Home Appliances Control Abstract - In the present work we study the performance of a speech recognizer for the Greek language, in a smart-home environment. This recognizer operates in spoken interaction scenarios, where the users are able to control various home appliances. In contrast to command and control systems, in our application the users speak spontaneously, beyond the use of a standardized set of isolated commands. The operational performance was tested over various environmental conditions, for two different types of microphones.
  • 18. Different Home Appliances Control Scenarios
  • 19. Dialogue systemsDialogue systems play a key role in any kind of conversational spoken language interface.Intelligent interfaces of home appliances provide the means for facilitating the operation of these devices, within a dialogue system. Various systems for home appliance control have been reported in the literature, focusing on enhancing the performance of the speech recognition process
  • 21. Architecture ExplanationThe audio signal from the user is captured and passed through a speech recognition module that produces a recognition hypothesis. This recognition hypothesis is then forwarded to a language understanding component that creates a corresponding semantic representation. This semantic input is then passed to the dialog manager, which, based on the current input and discourse context, produces the next system action (typically in the form of a semantic output). A language generation module then produces the corresponding surface (textual) form, which is subsequently passed to a speech synthesis module and rendered as audio back to the user.
  • 22. Function of dialog managerThe dialog manager therefore plays a key control role in any conversational spoken language interface: given the decoded semantic input corresponding to the current user utterance and the current discourse context, it determines the next system action. In essence, the dialog manager is responsible for planning and maintaining the coherence, over time of the conversation.
  • 23. Steps a dialogue manager doFirst, the dialog manager must maintain a history of the discourse and use it to interpret the perceived semantic inputs in the current context. Second, a representation – either explicit or implicit – of the system task is typically required. The current semantic input, together with the current dialog state and information about the task to be performed is then used to determine the next system action.