SlideShare a Scribd company logo
2
Most read
3
Most read
4
Most read
Artificial intelligence Speech recognition system
   What is Speech Recognition?
    Also known as automatic speech
     recognition or computer speech
     recognition which means
     understanding voice by the computer
     and performing any required task.
   Where can it be used?
   System control/navigation
    e.g. GPS-connected digital maps: “How far is it
    to the motorway junction?”
   Commercial/Industrial applications
    in-car steering systems
    Voice dialing
    hands-free use of mobile in car e.g. “Dial
    office”
Voice Input     Analog to Digital      Acoustic Model



                                       Language Model




     Feedback      Display          Speech Engine
   Acoustic Model
       An acoustic model is created by taking audio recordings of
        speech, and their text transcriptions, and using software to
        create statistical representations of the sounds that make up
        each word. It is used by a speech recognition engine to
        recognize speech.

   Language Model
         Language model is used in many natural language
        processing applications such as speech recognition tries to
        capture the properties of a language, and to predict the next
        word in a speech sequence.
   Two types of speech recognition.

       Speaker-Dependent

              is commonly used for dictation software

     Speaker-Independent

               is more commonly found in telephone
        applications.
   Speaker-dependent software works by learning
    the unique characteristics of a single person’s
    voice, in a way similar to voice recognition.
    New users must first “train” the software by
    speaking to it, so the computer can analyze
    how the person talks.
    This often means users have to read a few
    pages of text to the computer before they can
    use the speech recognition software.
   Speaker-independent software is designed to recognize
    anyone’s voice, so no training is involved.
    This means it is the only real option for applications
    such as interactive voice response systems — where
    businesses can’t ask callers to read pages of text before
    using the system.
   The downside is that speaker-independent software is
    generally less accurate than speaker-dependent
    software.
   Speech recognition engines that are speaker
    independent generally deal with this fact by limiting
    the grammars they use. By using a smaller list of
    recognized words, the speech engine is more likely to
    correctly recognize what a speaker said.
   Articulation produces
   sound waves which
   the ear conveys to the brain
   for processing
Acoustic waveform      Acoustic signal




      Digitization
      Acoustic analysis of the
                                         Speech recognition
       speech signal
      Linguistic interpretation
Artificial intelligence Speech recognition system
• Digitization
   Analogue to digital conversion
   Sampling and quantizing
       Sampling is converting a continuous signal into a discrete signal
       Quantizing is the process of approximating a continuous range of
        values
• Signal processing
      – Separating speech from background noise
• Phonetics
      – Variability in human speech
• Phonology
      – Recognizing individual sound distinctions (similar
        phonemes)
      – is the systematic use of sound to encode meaning
        in any spoken human language
   Semantics and pragmatics
     Semantics tells about the meaning
     Pragmatics is concerned with bridging the
      explanatory gap between sentence meaning and
      speaker's meaning
   Lexicology and syntax
       Lexicology is that part of linguistics which
        studies words, their nature, and meaning.
     Syntax tell about the arrangement of words and
        phrases to create well-formed sentences.
S1




 Speaker       Speech
                             parsing      S2
                               and
Recognition   Recognition
                            arbitration


                                          SK



                                          SN
Switch on                                             S1
Channel 9




             Speaker       Speech
                                         parsing      S2
                                           and
            Recognition   Recognition
                                        arbitration


                                                      SK



                                                      SN
Who is                          S1
              speaking?




 Speaker            Speech
                                  parsing      S2
                                    and
Recognition        Recognition
                                 arbitration


                                               SK


                    Annie
                    David                      SN
                    Cathy


                  “Authentication”
What is he          S1
                        saying?




 Speaker       Speech
                              parsing      S2
                                and
Recognition   Recognition
                             arbitration


                                           SK



                              On,Off,TV    SN
                             Fridge,Door


              “Understanding”
What is he
                             talking              S1
                             about?




 Speaker       Speech
                               parsing            S2
                                 and
Recognition   Recognition
                              arbitration


                                                  SK


 “Switch”,”to”,”channel”,”nine”
                                    Channel->TV
                                                  SN
                                     Dim->Lamp
                                    On->TV,Lamp
    “Inferring and execution”
   http://www.slideshare.net/richiebmthimmaia
    h/automatic-speech-recognition-4721204
   http://www.scribd.com/doc/36605017/Voice
    -Recognition-System-PPT
   http://www.google.com.pk/url?
    sa=t&rct=j&q=speech%20recognization
    %20system%20.ppt
   http://www.wikipedia.com
Artificial intelligence Speech recognition system

More Related Content

PPTX
Artificial intelligence for speech recognition
PPT
Speech Recognition
PPT
Speech recognition
PPTX
Speech recognition final presentation
PPTX
Speech Recognition Technology
PPTX
automatic number plate recognition
PPTX
Speech recognition system seminar
Artificial intelligence for speech recognition
Speech Recognition
Speech recognition
Speech recognition final presentation
Speech Recognition Technology
automatic number plate recognition
Speech recognition system seminar

What's hot (20)

PPT
Speech Recognition in Artificail Inteligence
PPTX
Domain specific IoT
PPTX
Speech Recognition Technology
PPTX
Artificial intelligence in speech recognition
PPTX
Speech to text conversion
DOCX
Hand Written Character Recognition Using Neural Networks
PDF
Sensor Cloud
PPT
Pervasive Computing
PDF
Deep Learning For Speech Recognition
PPT
Natural language processing
PPTX
Computer Vision
PPTX
M2M systems layers and designs standardizations
PPTX
Virtual personal assistant
PDF
Introduction to IoT Architectures and Protocols
PDF
Introduction to AI & ML
DOCX
Computer science seminar topics
PPTX
Industrial Internet of things.pptx
PPTX
Data enrichment
Speech Recognition in Artificail Inteligence
Domain specific IoT
Speech Recognition Technology
Artificial intelligence in speech recognition
Speech to text conversion
Hand Written Character Recognition Using Neural Networks
Sensor Cloud
Pervasive Computing
Deep Learning For Speech Recognition
Natural language processing
Computer Vision
M2M systems layers and designs standardizations
Virtual personal assistant
Introduction to IoT Architectures and Protocols
Introduction to AI & ML
Computer science seminar topics
Industrial Internet of things.pptx
Data enrichment
Ad

More from REHMAT ULLAH (20)

PPTX
Poker Game
PPTX
Men's clothing at style war
PPTX
software project management Software development life cycle
PPTX
Software project management Improving Team Effectiveness
PPTX
software project management Software inspection
PPTX
Improving of software processes
PPT
software project management Elaboration phase
PPTX
software project management Improvement in size
PPTX
Software development life cycle Construction phase
PPTX
software project management Artifact set(spm)
PPTX
software project management Waterfall model
PPTX
Software project management Software economics
PPTX
Introduction of software project management
PPTX
software project management Cocomo model
PPTX
software project management Assumption about conventional model
PPT
Usability engineering Usability testing
PPTX
Usability engineering Usability issues(iphone)
PPTX
Usability engineering Usability issues in mobile web
PPTX
Usability engineering Usability issues in firefox
PPT
Software Quality Assurance(Sqa) automated software testing
Poker Game
Men's clothing at style war
software project management Software development life cycle
Software project management Improving Team Effectiveness
software project management Software inspection
Improving of software processes
software project management Elaboration phase
software project management Improvement in size
Software development life cycle Construction phase
software project management Artifact set(spm)
software project management Waterfall model
Software project management Software economics
Introduction of software project management
software project management Cocomo model
software project management Assumption about conventional model
Usability engineering Usability testing
Usability engineering Usability issues(iphone)
Usability engineering Usability issues in mobile web
Usability engineering Usability issues in firefox
Software Quality Assurance(Sqa) automated software testing
Ad

Recently uploaded (20)

PPTX
school management -TNTEU- B.Ed., Semester II Unit 1.pptx
PDF
Anesthesia in Laparoscopic Surgery in India
PDF
Insiders guide to clinical Medicine.pdf
PPTX
Cell Types and Its function , kingdom of life
PDF
01-Introduction-to-Information-Management.pdf
PDF
Microbial disease of the cardiovascular and lymphatic systems
PPTX
Pharmacology of Heart Failure /Pharmacotherapy of CHF
PDF
2.FourierTransform-ShortQuestionswithAnswers.pdf
PDF
Physiotherapy_for_Respiratory_and_Cardiac_Problems WEBBER.pdf
PDF
Mark Klimek Lecture Notes_240423 revision books _173037.pdf
PPTX
BOWEL ELIMINATION FACTORS AFFECTING AND TYPES
PPTX
Week 4 Term 3 Study Techniques revisited.pptx
PDF
Classroom Observation Tools for Teachers
PDF
ANTIBIOTICS.pptx.pdf………………… xxxxxxxxxxxxx
PDF
102 student loan defaulters named and shamed – Is someone you know on the list?
PPTX
Introduction_to_Human_Anatomy_and_Physiology_for_B.Pharm.pptx
PDF
O7-L3 Supply Chain Operations - ICLT Program
PPTX
The Healthy Child – Unit II | Child Health Nursing I | B.Sc Nursing 5th Semester
PPTX
Cell Structure & Organelles in detailed.
PDF
Module 4: Burden of Disease Tutorial Slides S2 2025
school management -TNTEU- B.Ed., Semester II Unit 1.pptx
Anesthesia in Laparoscopic Surgery in India
Insiders guide to clinical Medicine.pdf
Cell Types and Its function , kingdom of life
01-Introduction-to-Information-Management.pdf
Microbial disease of the cardiovascular and lymphatic systems
Pharmacology of Heart Failure /Pharmacotherapy of CHF
2.FourierTransform-ShortQuestionswithAnswers.pdf
Physiotherapy_for_Respiratory_and_Cardiac_Problems WEBBER.pdf
Mark Klimek Lecture Notes_240423 revision books _173037.pdf
BOWEL ELIMINATION FACTORS AFFECTING AND TYPES
Week 4 Term 3 Study Techniques revisited.pptx
Classroom Observation Tools for Teachers
ANTIBIOTICS.pptx.pdf………………… xxxxxxxxxxxxx
102 student loan defaulters named and shamed – Is someone you know on the list?
Introduction_to_Human_Anatomy_and_Physiology_for_B.Pharm.pptx
O7-L3 Supply Chain Operations - ICLT Program
The Healthy Child – Unit II | Child Health Nursing I | B.Sc Nursing 5th Semester
Cell Structure & Organelles in detailed.
Module 4: Burden of Disease Tutorial Slides S2 2025

Artificial intelligence Speech recognition system

  • 2. What is Speech Recognition? Also known as automatic speech recognition or computer speech recognition which means understanding voice by the computer and performing any required task.
  • 3. Where can it be used?  System control/navigation e.g. GPS-connected digital maps: “How far is it to the motorway junction?”  Commercial/Industrial applications in-car steering systems  Voice dialing hands-free use of mobile in car e.g. “Dial office”
  • 4. Voice Input Analog to Digital Acoustic Model Language Model Feedback Display Speech Engine
  • 5. Acoustic Model  An acoustic model is created by taking audio recordings of speech, and their text transcriptions, and using software to create statistical representations of the sounds that make up each word. It is used by a speech recognition engine to recognize speech.  Language Model  Language model is used in many natural language processing applications such as speech recognition tries to capture the properties of a language, and to predict the next word in a speech sequence.
  • 6. Two types of speech recognition.  Speaker-Dependent is commonly used for dictation software  Speaker-Independent is more commonly found in telephone applications.
  • 7. Speaker-dependent software works by learning the unique characteristics of a single person’s voice, in a way similar to voice recognition.  New users must first “train” the software by speaking to it, so the computer can analyze how the person talks.  This often means users have to read a few pages of text to the computer before they can use the speech recognition software.
  • 8. Speaker-independent software is designed to recognize anyone’s voice, so no training is involved.  This means it is the only real option for applications such as interactive voice response systems — where businesses can’t ask callers to read pages of text before using the system.  The downside is that speaker-independent software is generally less accurate than speaker-dependent software.  Speech recognition engines that are speaker independent generally deal with this fact by limiting the grammars they use. By using a smaller list of recognized words, the speech engine is more likely to correctly recognize what a speaker said.
  • 9. Articulation produces  sound waves which  the ear conveys to the brain  for processing
  • 10. Acoustic waveform Acoustic signal  Digitization  Acoustic analysis of the Speech recognition speech signal  Linguistic interpretation
  • 12. • Digitization  Analogue to digital conversion  Sampling and quantizing  Sampling is converting a continuous signal into a discrete signal  Quantizing is the process of approximating a continuous range of values • Signal processing – Separating speech from background noise • Phonetics – Variability in human speech • Phonology – Recognizing individual sound distinctions (similar phonemes) – is the systematic use of sound to encode meaning in any spoken human language
  • 13. Semantics and pragmatics  Semantics tells about the meaning  Pragmatics is concerned with bridging the explanatory gap between sentence meaning and speaker's meaning  Lexicology and syntax  Lexicology is that part of linguistics which studies words, their nature, and meaning.  Syntax tell about the arrangement of words and phrases to create well-formed sentences.
  • 14. S1 Speaker Speech parsing S2 and Recognition Recognition arbitration SK SN
  • 15. Switch on S1 Channel 9 Speaker Speech parsing S2 and Recognition Recognition arbitration SK SN
  • 16. Who is S1 speaking? Speaker Speech parsing S2 and Recognition Recognition arbitration SK Annie David SN Cathy “Authentication”
  • 17. What is he S1 saying? Speaker Speech parsing S2 and Recognition Recognition arbitration SK On,Off,TV SN Fridge,Door “Understanding”
  • 18. What is he talking S1 about? Speaker Speech parsing S2 and Recognition Recognition arbitration SK “Switch”,”to”,”channel”,”nine” Channel->TV SN Dim->Lamp On->TV,Lamp “Inferring and execution”
  • 19. http://www.slideshare.net/richiebmthimmaia h/automatic-speech-recognition-4721204  http://www.scribd.com/doc/36605017/Voice -Recognition-System-PPT  http://www.google.com.pk/url? sa=t&rct=j&q=speech%20recognization %20system%20.ppt  http://www.wikipedia.com