SlideShare a Scribd company logo
Field: Computational linguistics 
Topic :Speech Recognition 
how SR came to be and how it 
works? How beneficial is it for 
students? 
Presenter: 
AMADOU ADAMOU AMINOU
Have you ever talked to your 
computer or your phone? Where 
it actually recognized what you 
said and than did something as a 
result??
Contents 
• History 
• Introduction 
• Types of Speech Recognition 
• Components of SR 
• Applications 
• Examples of SR 
• Weakness and flaws 
• Conclusion 
• References
INTRODUCTION 
WHAT IS SPEECH RECOGNISTION? 
• Speech recognition is a process by which a 
computer takes a speech signal (recorded 
using a microphone) and converts it into 
words in real-time. 
• SR simply is the process of converting spoken 
input to text. 
PELLOM B. Sonic: The University of Colorado Continuous, Speech 
Recognition System.2001
History 
• The first speech recognition system was invented by Bell 
laboratories in 1952 in US. It could understand only digits 
spoken by a single voice. The system is called the “AUDREY” 
system. 
• Ten years late labs in the US, Japan, England developed 
hardware dedicated to recognize spoken sounds. 
• They now have large vocabularies and can recognize 
continuous speech. 
Eugene Weinstein - Speech Recognition p15, Prentice hall 1995
Why was SRS invented 
• Individuals With Disabilities – Assists those who have visual 
impairment, hand immobility, dyslexia, etc. 
• Medical Transcription – Reduces delays to write out 
medical transcriptions 
• Dictation - Converts words to text in emails or other word 
documents (also helpful for English Language Learners). 
• Access Menu Commands – Opens files using voice commands. 
Eugene Weinstein - Speech Recognition p40-50, Prentice hall 1995
Speech recognition system consists of: 
• A microphone. 
• A speech recognition software. 
• A computer to take and interpret the speech. 
• A good quality soundcard for input and output. 
• A proper and good pronunciation. 
BGROUV, B T. (1989) computational linguistics, London, Longman
Two types of SR.. 
• Speaker-dependent systems 
– Require “training” to “teach” the individual System 
– More robust 
– But less convenient 
– And obviously less portable 
• Speaker-independent systems 
– Language coverage is reduced to compensate need to be 
flexible in phoneme identification 
– Clever compromise is to learn on the fly 
Eugene Weinstein - Speech Recognition p40-45, Prentice hall 1995
Components 
• Audio input 
• Grammar 
• Speech Recognition Engine 
• Acoustic Model 
• Recognized text
What’s hard about that? 
• Digitization 
– Converting analogue signal into digital representation. 
• Signal processing 
– Separating speech from background noise. 
• Phonetics 
– Variability in human speech. 
• Phonology 
– Recognizing individual sound distinctions (similar phonemes.) 
• Lexicology and syntax 
– Disambiguating homophones. 
– Features of continuous speech. 
• Syntax and pragmatics 
– Interpreting features. 
– Filtering of performance errors (disfluencies). 
Eugene Weinstein - Speech Recognition p15-67, Prentice hall 1995.
Potential uses in education 
• Teaching students of foreign languages to 
pronounce vocabulary correctly 
• Enabling students who are physically 
handicapped who cant use keyboard. 
• Enabling student with textual interpretive 
problems e.g. Dyslexia to enter text verbally. 
• Restrictive access on high security computer, 
where a keyboard may be used by hackers. 
BGROUV, B T. (1989) computational linguistics, London, Longman
Applications of Speech Recognition 
• Speech recognition applications include 
 controlling any devices in your home (e.g. video), 
 Call routing (e.g., "I would like to make a collect call"), 
 Simple data entry (e.g., entering a credit card number), 
 Preparation of structured documents (e.g., A radiology 
report), 
 Speech-to-text processing (e.g., word processors or emails) 
 In aircraft cockpits (usually termed Direct Voice Input). 
BGROUV, B T. (1989) computational linguistics, London: LONMAN
Example: Microsoft Speech 
Recognition – Windows 7
SIRI & GOOGLE 
Intelligent Personal Assistant 
developed by Apple. 
Google Now is an intelligent personal 
assistant developed by Google. 
Both use a combination of speaker- dependent and speaker-independent 
SR systems
Weakness and Flaws 
• Low signal-to-noise ratio: the program needs to 
hear the words spoken distinctly. 
• Intensive use of device power. 
• Homonyms e.g. “there” and “their”, “be” and 
“bee” 
• Overlapping speech. 
• No program is 100% perfect 
• Problem of understanding dialects and accents 
Example: see video 
A practical Intro to the computer Analysis of Language, Geoff Barnbrook, 
1996, Edinburgh University press.
Conclusion 
• Revolutionize the way people conduct 
business over the Web and ,differentiate 
world-class e-businesses. 
• VoiceXML ties speech recognition and 
telephony together 
• At some point in the future, speech 
recognition may become speech 
understanding. 
• voice-enabled Web solutions TODAY!
References 
• PELLOM, B., Sonic: The University of Colorado 
Continuous Speech Recognition System, 2001 
• BOGUREV, B T. (1989) computational linguistics, 
London: LONGMAN 
• Eugene Weinstein - Speech Recognition, Prentice 
hall 1995 
• http://www.tldp.org/HOWTO/Speech- 
Recognition- 
• A practical Intro to the computer Analysis of 
Language, Geoff BRANKROOK, 1996, Edinburgh 
University press.

More Related Content

PPTX
Speech Recognition
PDF
Speech Recognition: Transcription and transformation of human speech
PPTX
PPTX
Voice input and speech recognition system in tourism/social media
PPT
Noise Adaptive Training for Robust Automatic Speech Recognition
PPTX
Automatic speech recognition
DOCX
Role of language engineering to preserve endangered languages
Speech Recognition
Speech Recognition: Transcription and transformation of human speech
Voice input and speech recognition system in tourism/social media
Noise Adaptive Training for Robust Automatic Speech Recognition
Automatic speech recognition
Role of language engineering to preserve endangered languages

What's hot (20)

PPT
Speech recognition system
PDF
American Standard Sign Language Representation Using Speech Recognition
PPTX
Speech Recognition
PPTX
What Is Speech Processing?
DOC
12EEE032- text 2 voice
PPTX
SPEECH RECOGNITION USING NEURAL NETWORK
PPTX
Role of Language Engineering to Preserve Endangered Language
PPTX
PPTX
Computational linguistics
PPTX
Computational linguistics
PPTX
Introduction to computational linguistics
PPTX
Natural language processing
PPTX
Computational linguistics
PPT
Re-speaking and localization
PPT
Natural language processing
PPTX
COMPUTATIONAL LINGUISTICS
PDF
Curriculum Vitae
DOCX
Com ling
PPTX
Natural language processing (NLP)
PPTX
screen translation
Speech recognition system
American Standard Sign Language Representation Using Speech Recognition
Speech Recognition
What Is Speech Processing?
12EEE032- text 2 voice
SPEECH RECOGNITION USING NEURAL NETWORK
Role of Language Engineering to Preserve Endangered Language
Computational linguistics
Computational linguistics
Introduction to computational linguistics
Natural language processing
Computational linguistics
Re-speaking and localization
Natural language processing
COMPUTATIONAL LINGUISTICS
Curriculum Vitae
Com ling
Natural language processing (NLP)
screen translation
Ad

Similar to Amadou (20)

PPT
Speechrecognition 100423091251-phpapp01
PPTX
Speech recognition An overview
PPTX
Speech Recognition in artificial intelligence
PPTX
BSc 4th year project proposal final 16-5-22
PPTX
550529842-SPEECH-RECOGNITION-PPT-BF.pptx
PPTX
Speech recognition final presentation
PDF
ch1.pdf
PPTX
Speech Recognition Technology
PPT
Chapter 9 Universal Design
PPTX
Chapter 10 - Universal Design and User Support.pptx
PPT
Automatic Speech Recognition.ppt
PPT
Automatic speech recognition
PPT
Speech recognition
PPTX
Speech Recognition Technology
PPTX
Speech to text conversion
PPTX
Speech to text conversion
PPTX
Natural Language Processing 20 March.pptx
PPTX
Speech-Recognition.pptx
PPTX
DTUI6_chap09_accessiblePPT.pptx
PPTX
Automatic speech recognition
Speechrecognition 100423091251-phpapp01
Speech recognition An overview
Speech Recognition in artificial intelligence
BSc 4th year project proposal final 16-5-22
550529842-SPEECH-RECOGNITION-PPT-BF.pptx
Speech recognition final presentation
ch1.pdf
Speech Recognition Technology
Chapter 9 Universal Design
Chapter 10 - Universal Design and User Support.pptx
Automatic Speech Recognition.ppt
Automatic speech recognition
Speech recognition
Speech Recognition Technology
Speech to text conversion
Speech to text conversion
Natural Language Processing 20 March.pptx
Speech-Recognition.pptx
DTUI6_chap09_accessiblePPT.pptx
Automatic speech recognition
Ad

Recently uploaded (20)

PDF
STATICS OF THE RIGID BODIES Hibbelers.pdf
PDF
102 student loan defaulters named and shamed – Is someone you know on the list?
PPTX
Renaissance Architecture: A Journey from Faith to Humanism
PPTX
The Healthy Child – Unit II | Child Health Nursing I | B.Sc Nursing 5th Semester
PDF
Microbial disease of the cardiovascular and lymphatic systems
PDF
Business Ethics Teaching Materials for college
PDF
O7-L3 Supply Chain Operations - ICLT Program
PDF
VCE English Exam - Section C Student Revision Booklet
PDF
Insiders guide to clinical Medicine.pdf
PPTX
IMMUNITY IMMUNITY refers to protection against infection, and the immune syst...
PPTX
Cell Structure & Organelles in detailed.
PDF
Chapter 2 Heredity, Prenatal Development, and Birth.pdf
PDF
RMMM.pdf make it easy to upload and study
PDF
BÀI TẬP BỔ TRỢ 4 KỸ NĂNG TIẾNG ANH 9 GLOBAL SUCCESS - CẢ NĂM - BÁM SÁT FORM Đ...
PPTX
master seminar digital applications in india
PDF
Abdominal Access Techniques with Prof. Dr. R K Mishra
PDF
ANTIBIOTICS.pptx.pdf………………… xxxxxxxxxxxxx
PPTX
PPT- ENG7_QUARTER1_LESSON1_WEEK1. IMAGERY -DESCRIPTIONS pptx.pptx
PPTX
human mycosis Human fungal infections are called human mycosis..pptx
PPTX
Introduction_to_Human_Anatomy_and_Physiology_for_B.Pharm.pptx
STATICS OF THE RIGID BODIES Hibbelers.pdf
102 student loan defaulters named and shamed – Is someone you know on the list?
Renaissance Architecture: A Journey from Faith to Humanism
The Healthy Child – Unit II | Child Health Nursing I | B.Sc Nursing 5th Semester
Microbial disease of the cardiovascular and lymphatic systems
Business Ethics Teaching Materials for college
O7-L3 Supply Chain Operations - ICLT Program
VCE English Exam - Section C Student Revision Booklet
Insiders guide to clinical Medicine.pdf
IMMUNITY IMMUNITY refers to protection against infection, and the immune syst...
Cell Structure & Organelles in detailed.
Chapter 2 Heredity, Prenatal Development, and Birth.pdf
RMMM.pdf make it easy to upload and study
BÀI TẬP BỔ TRỢ 4 KỸ NĂNG TIẾNG ANH 9 GLOBAL SUCCESS - CẢ NĂM - BÁM SÁT FORM Đ...
master seminar digital applications in india
Abdominal Access Techniques with Prof. Dr. R K Mishra
ANTIBIOTICS.pptx.pdf………………… xxxxxxxxxxxxx
PPT- ENG7_QUARTER1_LESSON1_WEEK1. IMAGERY -DESCRIPTIONS pptx.pptx
human mycosis Human fungal infections are called human mycosis..pptx
Introduction_to_Human_Anatomy_and_Physiology_for_B.Pharm.pptx

Amadou

  • 1. Field: Computational linguistics Topic :Speech Recognition how SR came to be and how it works? How beneficial is it for students? Presenter: AMADOU ADAMOU AMINOU
  • 2. Have you ever talked to your computer or your phone? Where it actually recognized what you said and than did something as a result??
  • 3. Contents • History • Introduction • Types of Speech Recognition • Components of SR • Applications • Examples of SR • Weakness and flaws • Conclusion • References
  • 4. INTRODUCTION WHAT IS SPEECH RECOGNISTION? • Speech recognition is a process by which a computer takes a speech signal (recorded using a microphone) and converts it into words in real-time. • SR simply is the process of converting spoken input to text. PELLOM B. Sonic: The University of Colorado Continuous, Speech Recognition System.2001
  • 5. History • The first speech recognition system was invented by Bell laboratories in 1952 in US. It could understand only digits spoken by a single voice. The system is called the “AUDREY” system. • Ten years late labs in the US, Japan, England developed hardware dedicated to recognize spoken sounds. • They now have large vocabularies and can recognize continuous speech. Eugene Weinstein - Speech Recognition p15, Prentice hall 1995
  • 6. Why was SRS invented • Individuals With Disabilities – Assists those who have visual impairment, hand immobility, dyslexia, etc. • Medical Transcription – Reduces delays to write out medical transcriptions • Dictation - Converts words to text in emails or other word documents (also helpful for English Language Learners). • Access Menu Commands – Opens files using voice commands. Eugene Weinstein - Speech Recognition p40-50, Prentice hall 1995
  • 7. Speech recognition system consists of: • A microphone. • A speech recognition software. • A computer to take and interpret the speech. • A good quality soundcard for input and output. • A proper and good pronunciation. BGROUV, B T. (1989) computational linguistics, London, Longman
  • 8. Two types of SR.. • Speaker-dependent systems – Require “training” to “teach” the individual System – More robust – But less convenient – And obviously less portable • Speaker-independent systems – Language coverage is reduced to compensate need to be flexible in phoneme identification – Clever compromise is to learn on the fly Eugene Weinstein - Speech Recognition p40-45, Prentice hall 1995
  • 9. Components • Audio input • Grammar • Speech Recognition Engine • Acoustic Model • Recognized text
  • 10. What’s hard about that? • Digitization – Converting analogue signal into digital representation. • Signal processing – Separating speech from background noise. • Phonetics – Variability in human speech. • Phonology – Recognizing individual sound distinctions (similar phonemes.) • Lexicology and syntax – Disambiguating homophones. – Features of continuous speech. • Syntax and pragmatics – Interpreting features. – Filtering of performance errors (disfluencies). Eugene Weinstein - Speech Recognition p15-67, Prentice hall 1995.
  • 11. Potential uses in education • Teaching students of foreign languages to pronounce vocabulary correctly • Enabling students who are physically handicapped who cant use keyboard. • Enabling student with textual interpretive problems e.g. Dyslexia to enter text verbally. • Restrictive access on high security computer, where a keyboard may be used by hackers. BGROUV, B T. (1989) computational linguistics, London, Longman
  • 12. Applications of Speech Recognition • Speech recognition applications include  controlling any devices in your home (e.g. video),  Call routing (e.g., "I would like to make a collect call"),  Simple data entry (e.g., entering a credit card number),  Preparation of structured documents (e.g., A radiology report),  Speech-to-text processing (e.g., word processors or emails)  In aircraft cockpits (usually termed Direct Voice Input). BGROUV, B T. (1989) computational linguistics, London: LONMAN
  • 13. Example: Microsoft Speech Recognition – Windows 7
  • 14. SIRI & GOOGLE Intelligent Personal Assistant developed by Apple. Google Now is an intelligent personal assistant developed by Google. Both use a combination of speaker- dependent and speaker-independent SR systems
  • 15. Weakness and Flaws • Low signal-to-noise ratio: the program needs to hear the words spoken distinctly. • Intensive use of device power. • Homonyms e.g. “there” and “their”, “be” and “bee” • Overlapping speech. • No program is 100% perfect • Problem of understanding dialects and accents Example: see video A practical Intro to the computer Analysis of Language, Geoff Barnbrook, 1996, Edinburgh University press.
  • 16. Conclusion • Revolutionize the way people conduct business over the Web and ,differentiate world-class e-businesses. • VoiceXML ties speech recognition and telephony together • At some point in the future, speech recognition may become speech understanding. • voice-enabled Web solutions TODAY!
  • 17. References • PELLOM, B., Sonic: The University of Colorado Continuous Speech Recognition System, 2001 • BOGUREV, B T. (1989) computational linguistics, London: LONGMAN • Eugene Weinstein - Speech Recognition, Prentice hall 1995 • http://www.tldp.org/HOWTO/Speech- Recognition- • A practical Intro to the computer Analysis of Language, Geoff BRANKROOK, 1996, Edinburgh University press.