PERSONAL VOICE ASSISTANT IN PYTHON
Submitted By:
Vikas Kumar
Utkarsh Sengar
Yash Jauhri
Yash Tripathi
Contents
• Introduction
• How does an intelligent personal assistant work?
• Existing system and proposed system
• Dataflow diagrams
• Automatic Speech Recognition System Model in Google API
• Use case diagrams
• Modules
• Screenshots
• Requirements
• Conclusion
Introduction
• A virtual assistant is software based on artificial intelligence. It receives voice requests through the device’s microphone and delivers its voice output through the speaker. The interesting work happens between these two steps.
• It combines several technologies: voice recognition, voice analysis, and language processing.
• This assistant is developed entirely in Python.
How does an intelligent personal assistant work?
• The user asks the personal assistant to perform a task.
• The natural-language audio signal is converted into digital data that the software can analyze.
• This data is compared with the software’s database, using a matching algorithm, to find a suitable answer.
• The database is located on distributed servers in cloud networks, so the assistant needs a reliable Internet connection.
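The "compare with a database to find a suitable answer" step can be sketched as follows. This is only an illustration under stated assumptions: the slides place the database on cloud servers and do not name the matching algorithm, so a small local dictionary (`KNOWN_REQUESTS`, hypothetical) and fuzzy string matching from the standard library stand in for both.

```python
import difflib

# Hypothetical stand-in for the assistant's answer database.
# The real database is described as living on cloud servers.
KNOWN_REQUESTS = {
    "what time is it": "tell_time",
    "send an email": "send_mail",
    "search wikipedia": "search_wikipedia",
    "turn on wifi": "wifi_on",
    "turn off wifi": "wifi_off",
}


def normalize(text):
    """Lowercase the request, replace punctuation with spaces, collapse whitespace."""
    kept = "".join(ch if ch.isalnum() or ch.isspace() else " " for ch in text.lower())
    return " ".join(kept.split())


def find_action(request, cutoff=0.6):
    """Return the action name of the closest known request, or None if nothing is close."""
    matches = difflib.get_close_matches(
        normalize(request), list(KNOWN_REQUESTS), n=1, cutoff=cutoff
    )
    return KNOWN_REQUESTS[matches[0]] if matches else None
```

For example, `find_action("Send an email!")` resolves to the `send_mail` action, while an unrelated request returns `None` so the assistant can ask the user to repeat.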
EXISTING SYSTEM VS. PROPOSED SYSTEM
Existing System | Proposed System
Usage statistics and user data are sent to the developer | Does not collect any user data
Installation required | Installation required
User cannot edit or change the modules | User can edit and add new modules
Not free software | Free software
The proposed system is also lightweight and has a simple user interface.
Dataflow Diagrams
DFD - 0: [Diagram. Entities: User, Personal voice assistant, Computer. Flows: Voice, Speech to text, Action, Response, Text to speech, Voice output.]
DFD - 1: [Diagram. Entities: User, Personal voice assistant, Database, Computer. Flows: Voice command, Voice to text, Action perform, Perform action.]
DFD - 2: [Diagram. Entities: User, Microphone (converts voice into binary), Google voice API (converts voice data into text form), Computer. Flows: Voice audio data, Flow sensor value, Perform action.]
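DFD - 2 has the microphone converting voice into binary data. A minimal sketch of that digitization step is shown here, assuming 16-bit PCM at 16 kHz (a common choice for speech, not stated in the slides). A synthetic tone stands in for real microphone input, and `synth_tone` and `to_pcm16` are hypothetical helper names.

```python
import math
import struct

SAMPLE_RATE = 16_000  # samples per second; an assumed, commonly used rate for speech


def synth_tone(freq_hz, duration_s, sample_rate=SAMPLE_RATE):
    """Stand-in for the microphone: a sine wave with amplitudes in [-1.0, 1.0]."""
    n = round(duration_s * sample_rate)
    return [math.sin(2 * math.pi * freq_hz * i / sample_rate) for i in range(n)]


def to_pcm16(samples):
    """Quantize float samples to signed 16-bit little-endian PCM bytes, clipping overflow."""
    ints = [max(-32768, min(32767, round(s * 32767))) for s in samples]
    return struct.pack(f"<{len(ints)}h", *ints)


# 10 ms of a 440 Hz tone becomes 160 samples, i.e. 320 bytes of binary audio data.
pcm = to_pcm16(synth_tone(440, 0.01))
```

Bytes in this form are what a speech-to-text service would typically receive, usually wrapped in a container such as WAV.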
Automatic Speech Recognition System Model in Google API
[Diagram: Speech signal → Feature Extraction → Decoder → Recognized words. The decoder draws on the Acoustic models, the Pronunciation Dictionary, and the Language Models.]
Feature Extraction
• Feature extraction is a common first step that derives a set of features from the speech signal.
• Classification is then carried out on this set of features instead of on the speech signal itself.
• The feature extraction stage seeks to provide a compact representation of the speech waveform. This form should minimize the loss of information that discriminates between words, and provide a good match with the distributional assumptions made by the acoustic models.
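To make "compact representation" concrete, the sketch below reduces each 25 ms frame of audio to two numbers: log energy and zero-crossing rate. These two features are a simplified stand-in chosen for illustration; the slides do not say which features the recognizer uses, and production systems typically rely on richer spectral features. The frame sizes assume 16 kHz audio, and all function names are hypothetical.

```python
import math


def frames(samples, frame_len, hop):
    """Split a signal into overlapping frames of exactly frame_len samples."""
    return [samples[i:i + frame_len] for i in range(0, len(samples) - frame_len + 1, hop)]


def log_energy(frame):
    """Log of the mean squared amplitude; the small constant avoids log(0) on silence."""
    return math.log(sum(x * x for x in frame) / len(frame) + 1e-10)


def zero_crossing_rate(frame):
    """Fraction of adjacent sample pairs whose signs differ."""
    crossings = sum(1 for a, b in zip(frame, frame[1:]) if (a >= 0) != (b >= 0))
    return crossings / max(len(frame) - 1, 1)


def extract_features(samples, frame_len=400, hop=160):
    """Return one (log_energy, zero_crossing_rate) pair per frame.

    At 16 kHz, 400 samples is a 25 ms window and 160 samples is a 10 ms hop.
    """
    return [(log_energy(f), zero_crossing_rate(f)) for f in frames(samples, frame_len, hop)]
```

One second of 16 kHz audio (16,000 samples) shrinks to 98 feature pairs, which is the kind of reduction the classifier benefits from.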
ACOUSTIC MODELS
• An acoustic model describes the relationship between the audio signal and phonemes.
• A phoneme is one of the smallest units of speech that distinguish one word from another.
PRONUNCIATION DICTIONARY
• Pronunciation is the act or result of producing the sounds of speech, including articulation, stress, and intonation.
• The dictionary gives a phonetic transcription of a given word, sound, etc.
• Each entry reflects an accepted standard of the sound and stress patterns of a word, phrase, etc.
LANGUAGE MODELS
• The language model provides context to distinguish between words and phrases that sound similar. For example, in American English the phrases “recognize speech” and “wreck a nice beach” sound similar but mean different things.
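The "recognize speech" versus "wreck a nice beach" example can be illustrated with a toy bigram language model. This is a sketch only: the corpus is made up for the example, add-one smoothing is an assumed choice, and a real decoder would combine this score with the acoustic model rather than use it alone.

```python
import math
from collections import Counter

# Tiny made-up corpus, for illustration only.
CORPUS = [
    "how to recognize speech",
    "machines recognize speech",
    "recognize speech with python",
    "a nice beach",
    "wreck a car",
]


def train_bigrams(sentences):
    """Count single words and adjacent word pairs, with <s> marking sentence start."""
    unigrams, bigrams = Counter(), Counter()
    for sentence in sentences:
        words = ["<s>"] + sentence.split()
        unigrams.update(words)
        bigrams.update(zip(words, words[1:]))
    return unigrams, bigrams


def log_prob(sentence, unigrams, bigrams):
    """Add-one smoothed bigram log-probability of a sentence."""
    vocab_size = len(unigrams)
    words = ["<s>"] + sentence.split()
    return sum(
        math.log((bigrams[(prev, cur)] + 1) / (unigrams[prev] + vocab_size))
        for prev, cur in zip(words, words[1:])
    )


UNIGRAMS, BIGRAMS = train_bigrams(CORPUS)
candidates = ["recognize speech", "wreck a nice beach"]
# The candidate with the higher language-model score wins.
best = max(candidates, key=lambda c: log_prob(c, UNIGRAMS, BIGRAMS))
```

With this corpus the model prefers "recognize speech", because that word pair was seen often. Note that shorter candidates are also favored by construction, which is one reason real systems normalize or weight these scores.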
Use case diagrams
[Use case diagram. Actor: User. Use cases: Input voice, Send mail, Turn Wi-Fi on/off, Wikipedia search, Read.]
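One way to wire these use cases to code is a keyword-to-handler table. The sketch below is an assumption about structure, not the authors' implementation: the handlers are placeholders that only return the reply text, and every name in it is hypothetical.

```python
def send_mail(command):
    """Placeholder handler: a real one would compose and send a message."""
    return "Sending mail."


def toggle_wifi(command):
    """Placeholder handler: reports the requested Wi-Fi state without changing it."""
    state = "off" if "off" in command.split() else "on"
    return f"Turning Wi-Fi {state}."


def search_wikipedia(command):
    """Placeholder handler: extracts the topic that follows the word 'wikipedia'."""
    topic = command.split("wikipedia", 1)[1].strip()
    if topic.startswith("for "):
        topic = topic[4:]
    return f"Searching Wikipedia for {topic!r}."


# Order matters: the first entry with a matching keyword wins.
HANDLERS = [
    (("mail",), send_mail),            # "mail" also matches "email"
    (("wifi", "wi-fi"), toggle_wifi),
    (("wikipedia",), search_wikipedia),
]


def dispatch(command):
    """Route a recognized command to the first handler whose keyword it contains."""
    text = command.lower().strip()
    for keywords, handler in HANDLERS:
        if any(keyword in text for keyword in keywords):
            return handler(text)
    return "Sorry, I did not understand that."
```

Adding a new use case then means writing one handler function and appending one entry to `HANDLERS`.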
[Sequence diagram. Participants: User, Microphone, Google API, Computer. Messages: Open Personal Assistant; Accessing G-API; Voice response; Start Mic; Wait until user speaks; Receive data; Convert audio to text; Match text with action; Perform action; Voice / Text Response.]
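The microphone and Google API steps of this sequence can be sketched with the third-party SpeechRecognition package (imported as `speech_recognition`), which appears to be the "Speech recognition" module the deck lists. The function below is an illustrative sketch, not the authors' code: `listen_once` is a hypothetical name, the 0.5 second calibration is an arbitrary choice, and the optional `sr` argument exists only so the function can be exercised without a microphone. Using `Microphone` also requires PyAudio to be installed.

```python
def listen_once(sr=None):
    """Capture one utterance and return its lowercase transcript, or None on failure."""
    if sr is None:
        import speech_recognition as sr  # third-party: pip install SpeechRecognition

    recognizer = sr.Recognizer()
    with sr.Microphone() as source:                 # Start mic
        # Brief calibration so background noise is not mistaken for speech.
        recognizer.adjust_for_ambient_noise(source, duration=0.5)
        audio = recognizer.listen(source)           # Wait until the user speaks, receive data

    try:
        # Convert audio to text using Google's web speech API (needs Internet access).
        return recognizer.recognize_google(audio).lower()
    except sr.UnknownValueError:
        return None   # speech was unintelligible
    except sr.RequestError:
        return None   # the API could not be reached
```

The returned text is what the "Match text with action" step would consume; a `None` result lets the assistant ask the user to repeat the command.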
MODULES
• Speech recognition
• Process and system utilities (psutil)
• PlaySound
• SMTP protocol client (smtplib)
• Google Text-to-Speech (gtts)
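A brief sketch of how three of these modules might be used together: `smtplib` for the mail feature, and `gtts` with `playsound` for spoken replies. Function names, the `reply.mp3` file name, and the use of STARTTLS are assumptions for illustration; server host, port, and credentials are placeholders the caller must supply. The third-party imports are deferred so the mail helpers work without them.

```python
import smtplib
from email.message import EmailMessage


def build_message(sender, recipient, subject, body):
    """Create a plain-text email message ready to be sent."""
    msg = EmailMessage()
    msg["From"] = sender
    msg["To"] = recipient
    msg["Subject"] = subject
    msg.set_content(body)
    return msg


def send_message(msg, host, port, username, password):
    """Send a message over SMTP, upgrading the connection with STARTTLS first."""
    with smtplib.SMTP(host, port) as server:
        server.starttls()
        server.login(username, password)
        server.send_message(msg)


def speak(text, path="reply.mp3"):
    """Convert text to speech with gTTS, save it as MP3, and play it."""
    from gtts import gTTS            # third-party: pip install gTTS
    from playsound import playsound  # third-party: pip install playsound

    gTTS(text=text, lang="en").save(path)  # requires Internet access
    playsound(path)
```

A call such as `speak("Mail sent")` after a successful `send_message` would give the user the voice response shown in the diagrams.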
Requirements
Software requirements
• PyCharm IDE or Visual Studio Code
• Inno Setup Compiler
• PyInstaller
• Python 3.8.2 and its submodules
Hardware requirements
• Intel Core i3
• 4 GB RAM
• 30 GB hard drive space
Conclusion
The Voice Controlled Personal Assistant System uses natural language processing and can be integrated with artificial intelligence techniques to create a smart assistant that controls the computer and its applications and even answers user queries through web searches. It can be designed to reduce the human effort needed to interact with many other subsystems, work that would otherwise have to be performed manually. In doing so, the system aims to make everyday life more comfortable.
THANK YOU