SlideShare a Scribd company logo
Text
to
Speech
synthesizer.
Mini Project Report
(Speech and Audio Processing ECT 359-1)
Contents :
 Introduction
 Objective
 Theoretical background
 Flowchart
 Code and Execution
 Result with Discussion
 Applications
 Advantages
 Limitations and Future scope
 References
2
Introduction :
 The text-to-speech (TTS) synthesis procedure
consists of two main phases.
 The first is text analysis, where the input text is
transcribed into a phonetic or some other linguistic
representation.
 And the second one is the generation of speech
waveforms, where the output is produced from this
phonetic and prosodic information.
3
Introduction :
 These two phases are usually called high and low-level
synthesis . A simplified version of this procedure is
presented in figure below.
4
5
Objectives :
 Text to speech synthesizer will be of great help to
people with visual impairment .
 Text to speech synthesizer will help a machine to
communicate with users.
Theoretical Background :
6
 Speech Synthesis is the artificial production of human
speech.
 A synthesizer can incorporate a model of the vocal tract and
other human voice characteristics to create a completely
"synthetic" voice output.
 A computer system used for this purpose is called a speech
computer or speech synthesizer.
 A text-to-speech (TTS) system converts normal language text
into speech; other systems render symbolic linguistic
representations like phonetic transcriptions into speech.
TTS overview :
7
The procedure consist of two main phases:-
 Text Analysis
 Speech waveforms
 TEXT ANALYSIS : The input text is transcribed into a phonetic or some other
linguistic representation
 SPEECH WAVEFORMS : The acoustic output is produced from the phonetic
and prosodic information
Front End and Back End in TTS
8
 A text-to-speech system (or "engine") is composed of two
parts: a front-end and a back-end.
 The front-end converts raw text containing symbols like
numbers and abbreviations into the equivalent of written
out words (tokenization), then assigns phonetic
transcriptions to each word, and divides and marks the
text into prosodic units, like phrases, clauses, and
sentences (grapheme-phoneme conversion).
 The back-end often referred to as the synthesizer— then
converts the symbolic linguistic representation into sound.
9
FrFront End and Back End in TTS :
TTS Technology :
10
11
Speech Synthesizer used :
Concatenative synthesis is based on the concatenation (or
stringing together) of segments of recorded speech. Generally,
concatenative synthesis produces the most natural-sounding
synthesized speech.
 Concatenate segments of pre-recorded natural human
speech.
 Requires database of previously recorded human speech
covering all the possible segments to be synthesized .
 Segment might be phoneme, syllable, word, phrase, or any
combination .
Detailed Architecture of TTS systems :
12
.NET Framework :
 .NET is a framework developed by Microsoft.
 It is a new programming methodology.
 .NET is platform independent/cross platform ‘
 .NET is language insensitive.
 It includes a large class library known as Framework
Class Library (FCL).
13
Continued …….
14
 Microsoft also produces an IDE largely for .NET software
called Visual Studio.
 It provides language interoperability (each language can
use code written in other language ) across several
programming languages.
.NET Architecture :
15
.NET Execution :
16
Flowchart :
17
18
Code
19
20
Execution
21
Result :
22
 In this way , our aim to convert text which we passed as argument in
function is converted into artificial human voice (speech) .
 With the help of this TTS synthesizer , a blind guy can even read a book
or novel which is not available in braille language .
 This TTS synthesizer can be used in medical store for proper
pronunciation of medicines on cover or boxes.
 It is mostly used in voice stick device and voice assistant like Siri, google
assistant , Cortana and Alexa etc.
Applications :
 Talking Calculator
 Computer generated instructions
 Aids for the blind
 Telephone inquiry services
 Teaching machices
 Usage in education and daily life .
23
Advantages :
 Able to read large paragraphs .
 It offers a range of different accents and voices .
 Provide significant help for people with eyes disabilities.
 More accuracy in medical systems.
 It can be adapted easily to say whatever users want them to say.
 It provides talking machines for vocally impaired or deaf people
and better aids for speech therapy.
24
Limitations :
 No explicit emotions
 Homographs (Pronunciation)
 Prosody
 Language specific problems
 Special characters and symbols
25
Future Scope :
 It can also work in different languages like Marathi ,
Hindi , Kannada , etc.
 Accuracy will become better and can able to
pronounce symbols and special characters.
 Increasing variety of voices .
26
References :
 www.google.com
 www.youtube.com
 www.shareslide.net
 www.mathworks.com
 www.microsoft.com
28
Thank
you !!!!!!!!!!!!
Team Presentation
29

More Related Content

PDF
Artificially Generatedof Concatenative Syllable based Text to Speech Synthesi...
PDF
Approach To Build A Marathi Text-To-Speech System Using Concatenative Synthes...
PDF
On Developing an Automatic Speech Recognition System for Commonly used Englis...
PDF
Natural language processing with python and amharic syntax parse tree by dani...
PPTX
Natural Language processing Parts of speech tagging, its classes, and how to ...
PDF
Implementation of English-Text to Marathi-Speech (ETMS) Synthesizer
PPTX
Speech and Language Processing
PDF
AN ADVANCED APPROACH FOR RULE BASED ENGLISH TO BENGALI MACHINE TRANSLATION
Artificially Generatedof Concatenative Syllable based Text to Speech Synthesi...
Approach To Build A Marathi Text-To-Speech System Using Concatenative Synthes...
On Developing an Automatic Speech Recognition System for Commonly used Englis...
Natural language processing with python and amharic syntax parse tree by dani...
Natural Language processing Parts of speech tagging, its classes, and how to ...
Implementation of English-Text to Marathi-Speech (ETMS) Synthesizer
Speech and Language Processing
AN ADVANCED APPROACH FOR RULE BASED ENGLISH TO BENGALI MACHINE TRANSLATION

What's hot (19)

PPTX
PPTX
Parts of Speect Tagging
PPTX
NLP pipeline in machine translation
PDF
PERFORMANCE ANALYSIS OF DIFFERENT ACOUSTIC FEATURES BASED ON LSTM FOR BANGLA ...
PDF
Intro to NLP. Lecture 2
PPT
NLP new words
DOC
12EEE032- text 2 voice
PPTX
Machine translation from English to Hindi
PPT
Speech Recognition
PPT
PDF
Segmentation Words for Speech Synthesis in Persian Language Based On Silence
PDF
Comparative study of Text-to-Speech Synthesis for Indian Languages by using S...
PDF
A Marathi Hidden-Markov Model Based Speech Synthesis System
PPT
Lec 15,16,17 NLP.machine translation
PPTX
PPT
Types of machine translation
ODT
A tutorial on Machine Translation
PPT
Natural language processing
PDF
Ijetcas14 458
Parts of Speect Tagging
NLP pipeline in machine translation
PERFORMANCE ANALYSIS OF DIFFERENT ACOUSTIC FEATURES BASED ON LSTM FOR BANGLA ...
Intro to NLP. Lecture 2
NLP new words
12EEE032- text 2 voice
Machine translation from English to Hindi
Speech Recognition
Segmentation Words for Speech Synthesis in Persian Language Based On Silence
Comparative study of Text-to-Speech Synthesis for Indian Languages by using S...
A Marathi Hidden-Markov Model Based Speech Synthesis System
Lec 15,16,17 NLP.machine translation
Types of machine translation
A tutorial on Machine Translation
Natural language processing
Ijetcas14 458
Ad

Similar to SAP (SPEECH AND AUDIO PROCESSING) (20)

PPTX
Speech Synthesis.pptx
PPTX
visH (fin).pptx
PDF
A Short Introduction To Text-To-Speech Synthesis
PDF
SMATalk: Standard Malay Text to Speech Talk System
PPTX
Introduction to myanmar Text-To-Speech
PPTX
Text-to-Speech-presentation2(punjabi).pptx
PPTX
Text-to-Speech-in-Computational-Linguistics (2)final.pptx
PDF
Tutorial - Speech Synthesis System
PDF
551 466-472
PDF
Speech to text conversion for visually impaired person using µ law companding
PDF
H010625862
PDF
F017163443
PDF
ACHIEVING SECURITY VIA SPEECH RECOGNITION
PDF
Voice based web browser
PDF
IRJET- Text to Speech Synthesis for Hindi Language using Festival Framework
PDF
Survey On Speech Synthesis
PDF
Ey4301913917
PDF
G1803013542
PDF
Paper on Speech Recognition
PDF
Direct Punjabi to English Speech Translation using Discrete Units
Speech Synthesis.pptx
visH (fin).pptx
A Short Introduction To Text-To-Speech Synthesis
SMATalk: Standard Malay Text to Speech Talk System
Introduction to myanmar Text-To-Speech
Text-to-Speech-presentation2(punjabi).pptx
Text-to-Speech-in-Computational-Linguistics (2)final.pptx
Tutorial - Speech Synthesis System
551 466-472
Speech to text conversion for visually impaired person using µ law companding
H010625862
F017163443
ACHIEVING SECURITY VIA SPEECH RECOGNITION
Voice based web browser
IRJET- Text to Speech Synthesis for Hindi Language using Festival Framework
Survey On Speech Synthesis
Ey4301913917
G1803013542
Paper on Speech Recognition
Direct Punjabi to English Speech Translation using Discrete Units
Ad

Recently uploaded (20)

PDF
PPT on Performance Review to get promotions
PDF
SM_6th-Sem__Cse_Internet-of-Things.pdf IOT
PDF
BIO-INSPIRED HORMONAL MODULATION AND ADAPTIVE ORCHESTRATION IN S-AI-GPT
PDF
Automation-in-Manufacturing-Chapter-Introduction.pdf
PDF
Unit I ESSENTIAL OF DIGITAL MARKETING.pdf
PPTX
Construction Project Organization Group 2.pptx
PPTX
Engineering Ethics, Safety and Environment [Autosaved] (1).pptx
PPTX
web development for engineering and engineering
PDF
Mitigating Risks through Effective Management for Enhancing Organizational Pe...
PDF
Evaluating the Democratization of the Turkish Armed Forces from a Normative P...
PPT
Project quality management in manufacturing
PPTX
bas. eng. economics group 4 presentation 1.pptx
DOCX
573137875-Attendance-Management-System-original
PDF
Model Code of Practice - Construction Work - 21102022 .pdf
PPTX
Geodesy 1.pptx...............................................
PPTX
additive manufacturing of ss316l using mig welding
PPTX
Internet of Things (IOT) - A guide to understanding
PPTX
Foundation to blockchain - A guide to Blockchain Tech
PDF
BMEC211 - INTRODUCTION TO MECHATRONICS-1.pdf
PPTX
UNIT-1 - COAL BASED THERMAL POWER PLANTS
PPT on Performance Review to get promotions
SM_6th-Sem__Cse_Internet-of-Things.pdf IOT
BIO-INSPIRED HORMONAL MODULATION AND ADAPTIVE ORCHESTRATION IN S-AI-GPT
Automation-in-Manufacturing-Chapter-Introduction.pdf
Unit I ESSENTIAL OF DIGITAL MARKETING.pdf
Construction Project Organization Group 2.pptx
Engineering Ethics, Safety and Environment [Autosaved] (1).pptx
web development for engineering and engineering
Mitigating Risks through Effective Management for Enhancing Organizational Pe...
Evaluating the Democratization of the Turkish Armed Forces from a Normative P...
Project quality management in manufacturing
bas. eng. economics group 4 presentation 1.pptx
573137875-Attendance-Management-System-original
Model Code of Practice - Construction Work - 21102022 .pdf
Geodesy 1.pptx...............................................
additive manufacturing of ss316l using mig welding
Internet of Things (IOT) - A guide to understanding
Foundation to blockchain - A guide to Blockchain Tech
BMEC211 - INTRODUCTION TO MECHATRONICS-1.pdf
UNIT-1 - COAL BASED THERMAL POWER PLANTS

SAP (SPEECH AND AUDIO PROCESSING)

  • 2. Contents :  Introduction  Objective  Theoretical background  Flowchart  Code and Execution  Result with Discussion  Applications  Advantages  Limitations and Future scope  References 2
  • 3. Introduction :  The text-to-speech (TTS) synthesis procedure consists of two main phases.  The first is text analysis, where the input text is transcribed into a phonetic or some other linguistic representation.  And the second one is the generation of speech waveforms, where the output is produced from this phonetic and prosodic information. 3
  • 4. Introduction :  These two phases are usually called high and low-level synthesis . A simplified version of this procedure is presented in figure below. 4
  • 5. 5 Objectives :  Text to speech synthesizer will be of great help to people with visual impairment .  Text to speech synthesizer will help a machine to communicate with users.
  • 6. Theoretical Background : 6  Speech Synthesis is the artificial production of human speech.  A synthesizer can incorporate a model of the vocal tract and other human voice characteristics to create a completely "synthetic" voice output.  A computer system used for this purpose is called a speech computer or speech synthesizer.  A text-to-speech (TTS) system converts normal language text into speech; other systems render symbolic linguistic representations like phonetic transcriptions into speech.
  • 7. TTS overview : 7 The procedure consist of two main phases:-  Text Analysis  Speech waveforms  TEXT ANALYSIS : The input text is transcribed into a phonetic or some other linguistic representation  SPEECH WAVEFORMS : The acoustic output is produced from the phonetic and prosodic information
  • 8. Front End and Back End in TTS 8  A text-to-speech system (or "engine") is composed of two parts: a front-end and a back-end.  The front-end converts raw text containing symbols like numbers and abbreviations into the equivalent of written out words (tokenization), then assigns phonetic transcriptions to each word, and divides and marks the text into prosodic units, like phrases, clauses, and sentences (grapheme-phoneme conversion).  The back-end often referred to as the synthesizer— then converts the symbolic linguistic representation into sound.
  • 9. 9 FrFront End and Back End in TTS :
  • 11. 11 Speech Synthesizer used : Concatenative synthesis is based on the concatenation (or stringing together) of segments of recorded speech. Generally, concatenative synthesis produces the most natural-sounding synthesized speech.  Concatenate segments of pre-recorded natural human speech.  Requires database of previously recorded human speech covering all the possible segments to be synthesized .  Segment might be phoneme, syllable, word, phrase, or any combination .
  • 12. Detailed Architecture of TTS systems : 12
  • 13. .NET Framework :  .NET is a framework developed by Microsoft.  It is a new programming methodology.  .NET is platform independent/cross platform ‘  .NET is language insensitive.  It includes a large class library known as Framework Class Library (FCL). 13
  • 14. Continued ……. 14  Microsoft also produces an IDE largely for .NET software called Visual Studio.  It provides language interoperability (each language can use code written in other language ) across several programming languages.
  • 19. 19
  • 21. 21
  • 22. Result : 22  In this way , our aim to convert text which we passed as argument in function is converted into artificial human voice (speech) .  With the help of this TTS synthesizer , a blind guy can even read a book or novel which is not available in braille language .  This TTS synthesizer can be used in medical store for proper pronunciation of medicines on cover or boxes.  It is mostly used in voice stick device and voice assistant like Siri, google assistant , Cortana and Alexa etc.
  • 23. Applications :  Talking Calculator  Computer generated instructions  Aids for the blind  Telephone inquiry services  Teaching machices  Usage in education and daily life . 23
  • 24. Advantages :  Able to read large paragraphs .  It offers a range of different accents and voices .  Provide significant help for people with eyes disabilities.  More accuracy in medical systems.  It can be adapted easily to say whatever users want them to say.  It provides talking machines for vocally impaired or deaf people and better aids for speech therapy. 24
  • 25. Limitations :  No explicit emotions  Homographs (Pronunciation)  Prosody  Language specific problems  Special characters and symbols 25
  • 26. Future Scope :  It can also work in different languages like Marathi , Hindi , Kannada , etc.  Accuracy will become better and can able to pronounce symbols and special characters.  Increasing variety of voices . 26
  • 27. References :  www.google.com  www.youtube.com  www.shareslide.net  www.mathworks.com  www.microsoft.com