SlideShare a Scribd company logo
International Research Journal of Engineering and Technology (IRJET) e-ISSN: 2395-0056
Volume: 07 Issue: 02 | Feb 2020 www.irjet.net p-ISSN: 2395-0072
© 2020, IRJET | Impact Factor value: 7.34 | ISO 9001:2008 Certified Journal | Page 1301
Threat Prediction using Speech Analysis
Shreyas Kulkarni1, Nirantar Kulkarni2, Sushil Kukreja3, Kaustubh Kotkar4, Shambhavi
Kulkarni5, Jayanti Kamalasekaran6
1,2,3,4B.E. Student, Dept. of Computer Engineering, Sinhgad College of Engineering, Vadgaon, Pune- 411041,
Maharashtra, India
---------------------------------------------------------------------***----------------------------------------------------------------------
Abstract -Speech recognition technology is one of the fast
growing Engineering technologies. It has number of
applications in different areas and provides potential
benefits. Audio surveillance is getting more important and
requires large amount of man hours to determine threats
from audio. The aim of this system is to identify potential
cases of threats, and provide an early warning or alert to
such cases. This will be based on voice such as voice chat
over telecommunication networks or social media. The
intended result will be achieved in the three major steps.
This system will utilize latest machine learning algorithms
to identify phonetics from the audio and helps convert it to
equivalent transcript form. Natural language processing
will help in developing sentiment analysis model.
Key Words: Recurrent Neural Network, Sentimental
Analysis, Speech Recognition, Natural Language
Processing.
1. INTRODUCTION
There has been much advancement in the areas of
education, healthcare, etc. However, crime is one of the
major issues faced by societies all over the world. The
crime investigation team follows a systematic procedure
for solving a particular case. At first, there are multiple
suspects. Hence, to reach to a conclusion, the police might
have to listen to their call recordings. This procedure
consumes a lot of time as a single person is required to
listen to multiple recordings. This also has a scope for
human errors. These errors and time can be minimized
using our proposed system. In this system, the user needs
to provide an audio as an input and the system displays a
warning about threat present in the audio. There are 2
parts in this project. First part converts speech to text and
the second part applies sentimental analysis to the text.
The total percentage of threat is calculated and if it is
above a certain threshold, warning is provided. This
system can also be used separately, i.e. only the first part
can be used by the user if he/she doesn’t want sentimental
analysis on the text and the second part can be used be
directly providing text to the system to analyze the
sentiments in a particular text. Speech to text is converted
using signal processing and sentimental analysis is carried
out using Naïve Bayes algorithm and neural network. This
system is advantageous as it provides the results of the
recording directly without a person having to analyze it
for hours.
1.1 Literature Survey
1. English Language Speech Recognition using MFCC
and HMM:
This model implements and exploits Mel
Frequency Cepstral Coefficient and Hidden
Markov Model Techniques to evaluate the
competence of speech recognition. In this paper
they have used data from Google audioSet and
converted audio signal into a vector using MFCC.
The research is done using Python Programming
with the help of Librosa library to generate MFCC.
2. Speech Recognition with Deep Recurrent Neural
Network:
In this paper, they have implemented speech
recognition technique using Recurrent Neural
Network which will also take into consideration
context of the sentence. They have taken TIMIT
dataset to perform phoneme recognition. This
model combines deep, bidirectional Long Short-
term Memory RNNs with end-to-end training.
3. Speech Recognition using Deep Learning:
In this paper, audio files are taken as input and
Artificial Neural Network is used to predict
speech as an output. This research uses Deep
Learning for speech recognition using library
from Google which has 66.22% accuracy.
4. Sentimental Analysis and Prediction using Neural
Networks:
This paper classifies sentences into positive,
negative and neutral categories. It uses Artificial
Neural Network for sentimental analysis by
training and testing data. It also checks the
accuracy of ANN for large datasets.
2. Project Overview
Supervision and analysis of audio files manually takes a lot
of time. This procedure is very tedious and leaves scope
for a lot of human errors. These errors might manipulate
the results on a large scale. We have proposed a system
that eliminates the human work of supervising the audio
files. In sectors like crime investigation, this system might
prove to be advantageous as threat in a particular audio
file will be analyzed automatically and a warning will be
provided as output. This system can also be used
separately by the user or actor. The developed model will
help the will be of help to various organizations as they
can either use it for speech to text conversion or
International Research Journal of Engineering and Technology (IRJET) e-ISSN: 2395-0056
Volume: 07 Issue: 02 | Feb 2020 www.irjet.net p-ISSN: 2395-0072
© 2020, IRJET | Impact Factor value: 7.34 | ISO 9001:2008 Certified Journal | Page 1302
sentimental analysis or both. There is one user/actor that
can use the system by providing an audio file as input.
[A] Artificial Neural Network
Artificial Neural Network is used for processing the
information similar to how brain works. It has large
number of small processing elements connected to each
other, working together to solve a specific problem.
In the above figure, inputs are basically independent
variables which are multiplied by their weights and
summed up to get the activation function. This function
takes decision on whether neuron should be activated or
not.
For learning of neural network, it uses the cost function
which is the difference between the actual output and
expected output. This function is analyzed and proper
changes are made to the inputs accordingly. Cost function
is reduced to minimum value to get best learning results.
We use back propagation on the network continuously till
error becomes minimal.
[B] System Architecture Diagram
Large number of audio files are given as input to the
system. These audio files are converted to the digital data
using signal processing. Then feature extraction and
feature selection is performed on digital data. We will get
the text output by applying ANN algorithm on those
selected features. Text is classified into positive, negative
or neutral sentiment using Naïve Bayes classifier. The
polarity of the text input is calculated by using ANN on the
classified data.
[C] Sentimental Analysis
Sentiment analysis is used to detect threats and give the
warning. It is used to analyse the tone of the text and
classify it into positive or negative. This process is very
useful to detect the threat before so that it can be
eliminated or damage can be reduced. Textual data is
analysed and the polarity of the text is calculated. All the
data is scaled using the data mining methods like standard
scalar. Then the data is given to ANN to train the model.
This model is supervisedly trained to get the threat
warning for the input text.
[D] System Architecture Diagram
3. CONCLUSION
The developed model will help revolutionize audio
surveillance as time required to listen and then analyze
the audio is highly reduced. Human errors such as
skipping some part of audio will also be reduced. Available
API’s such as Google API may become paid in future and
hence our system provides a way to convert speech to text
without using Google API.
REFERENCES
[1] Phoemporn Lakkhanawannakun, Chaluemwut
Noyunsan, “Speech Recognition using Deep Learning,”
Rajamangala University of Technology, Thiland.
[2] Kanchan Naithani, V.M. Thakkar, Ashish Senwal,
“English Language Speech Recognition using MFCC
and HMM,” GBP Institute of Engineering Technology,
Uttarakhand, India.
[3] Sneh Paliwal, Sunil Khatri, Mayank Sharma,
“Sentiment Analysis and Prediction using Neural
Networks,” Amity University, Noida, Uttar Pradesh,
India.

More Related Content

PDF
IRJET- Voice Modulation and Verification for Smart Authentication System
PDF
Yolinda chiramba Survey Paper
PDF
Evaluation of network intrusion detection using markov chain
PDF
IRJET- Comparative Analysis of Emotion Recognition System
PDF
IRJET- Intrusion Detection using IP Binding in Real Network
PDF
IRJET- Study of Effect of PCA on Speech Emotion Recognition
DOCX
Raichlin et al_Mobile application for Speech Rate
PDF
Speech Recognized Automation System Using Speaker Identification through Wire...
IRJET- Voice Modulation and Verification for Smart Authentication System
Yolinda chiramba Survey Paper
Evaluation of network intrusion detection using markov chain
IRJET- Comparative Analysis of Emotion Recognition System
IRJET- Intrusion Detection using IP Binding in Real Network
IRJET- Study of Effect of PCA on Speech Emotion Recognition
Raichlin et al_Mobile application for Speech Rate
Speech Recognized Automation System Using Speaker Identification through Wire...

What's hot (19)

PDF
Ijetcas14 426
PDF
A survey of Network Intrusion Detection using soft computing Technique
PDF
Automatic Selection of Open Source Multimedia Softwares Using Error Back-Prop...
PPTX
Application of machine learning and cognitive computing in intrusion detectio...
PDF
Genetic algorithm based approach for
PDF
False positive reduction by combining svm and knn algo
PDF
Dy36749754
PDF
SPECIFICATION BASED TESTING OF ON ANDROID SYSTEMS
PDF
IJERD (www.ijerd.com) International Journal of Engineering Research and Devel...
PDF
A45010107
PDF
IRJET - Network Traffic Monitoring and Botnet Detection using K-ANN Algorithm
PDF
Ista presentation-android
PDF
19 ijcse-01227
PPTX
Using Brain Waves as New Biometric Feature for Authenticating a Computer User...
PDF
An approach for ids by combining svm and ant colony algorithm
PDF
An approach for ids by combining svm and ant colony algorithm
PDF
IRJET- A Secured Method of Data Aggregation for Wireless Sensor Networks in t...
PDF
IRJET- Embedding Randomness into Symmetric Key Encryption using Genetic Algor...
PDF
A novel algorithm to protect and manage memory locations
Ijetcas14 426
A survey of Network Intrusion Detection using soft computing Technique
Automatic Selection of Open Source Multimedia Softwares Using Error Back-Prop...
Application of machine learning and cognitive computing in intrusion detectio...
Genetic algorithm based approach for
False positive reduction by combining svm and knn algo
Dy36749754
SPECIFICATION BASED TESTING OF ON ANDROID SYSTEMS
IJERD (www.ijerd.com) International Journal of Engineering Research and Devel...
A45010107
IRJET - Network Traffic Monitoring and Botnet Detection using K-ANN Algorithm
Ista presentation-android
19 ijcse-01227
Using Brain Waves as New Biometric Feature for Authenticating a Computer User...
An approach for ids by combining svm and ant colony algorithm
An approach for ids by combining svm and ant colony algorithm
IRJET- A Secured Method of Data Aggregation for Wireless Sensor Networks in t...
IRJET- Embedding Randomness into Symmetric Key Encryption using Genetic Algor...
A novel algorithm to protect and manage memory locations
Ad

Similar to IRJET - Threat Prediction using Speech Analysis (20)

PDF
IRJET-speech emotion.pdf
PDF
A Deep Learning Model For Crime Surveillance In Phone Calls.
PDF
IRJET- Voice to Code Editor using Speech Recognition
PPTX
SPEECH RECOGNITION USING NEURAL NETWORK
PDF
IRJET- Voice Command Execution with Speech Recognition and Synthesizer
PDF
Sentimental analysis of audio based customer reviews without textual conversion
PDF
MULTILINGUAL SPEECH TO TEXT CONVERSION USING HUGGING FACE FOR DEAF PEOPLE
PDF
Text Summarization and Conversion of Speech to Text
PDF
Emotional telugu speech signals classification based on k nn classifier
PDF
Emotional telugu speech signals classification based on k nn classifier
PDF
NLP BASED INTERVIEW ASSESSMENT SYSTEM
PDF
QUALITATIVE ANALYSIS OF PLP IN LSTM FOR BANGLA SPEECH RECOGNITION
PDF
PDF
IRJET- Audio Data Summarization System using Natural Language Processing
PDF
QUALITATIVE ANALYSIS OF PLP IN LSTM FOR BANGLA SPEECH RECOGNITION
PDF
QUALITATIVE ANALYSIS OF PLP IN LSTM FOR BANGLA SPEECH RECOGNITION
PDF
IRJET- Emotion recognition using Speech Signal: A Review
PDF
Voice Recognition Based Automation System for Medical Applications and for Ph...
PDF
Voice Recognition Based Automation System for Medical Applications and for Ph...
PDF
IRJET - Speech to Speech Translation using Encoder Decoder Architecture
IRJET-speech emotion.pdf
A Deep Learning Model For Crime Surveillance In Phone Calls.
IRJET- Voice to Code Editor using Speech Recognition
SPEECH RECOGNITION USING NEURAL NETWORK
IRJET- Voice Command Execution with Speech Recognition and Synthesizer
Sentimental analysis of audio based customer reviews without textual conversion
MULTILINGUAL SPEECH TO TEXT CONVERSION USING HUGGING FACE FOR DEAF PEOPLE
Text Summarization and Conversion of Speech to Text
Emotional telugu speech signals classification based on k nn classifier
Emotional telugu speech signals classification based on k nn classifier
NLP BASED INTERVIEW ASSESSMENT SYSTEM
QUALITATIVE ANALYSIS OF PLP IN LSTM FOR BANGLA SPEECH RECOGNITION
IRJET- Audio Data Summarization System using Natural Language Processing
QUALITATIVE ANALYSIS OF PLP IN LSTM FOR BANGLA SPEECH RECOGNITION
QUALITATIVE ANALYSIS OF PLP IN LSTM FOR BANGLA SPEECH RECOGNITION
IRJET- Emotion recognition using Speech Signal: A Review
Voice Recognition Based Automation System for Medical Applications and for Ph...
Voice Recognition Based Automation System for Medical Applications and for Ph...
IRJET - Speech to Speech Translation using Encoder Decoder Architecture
Ad

More from IRJET Journal (20)

PDF
Enhanced heart disease prediction using SKNDGR ensemble Machine Learning Model
PDF
Utilizing Biomedical Waste for Sustainable Brick Manufacturing: A Novel Appro...
PDF
Kiona – A Smart Society Automation Project
PDF
DESIGN AND DEVELOPMENT OF BATTERY THERMAL MANAGEMENT SYSTEM USING PHASE CHANG...
PDF
Invest in Innovation: Empowering Ideas through Blockchain Based Crowdfunding
PDF
SPACE WATCH YOUR REAL-TIME SPACE INFORMATION HUB
PDF
A Review on Influence of Fluid Viscous Damper on The Behaviour of Multi-store...
PDF
Wireless Arduino Control via Mobile: Eliminating the Need for a Dedicated Wir...
PDF
Explainable AI(XAI) using LIME and Disease Detection in Mango Leaf by Transfe...
PDF
BRAIN TUMOUR DETECTION AND CLASSIFICATION
PDF
The Project Manager as an ambassador of the contract. The case of NEC4 ECC co...
PDF
"Enhanced Heat Transfer Performance in Shell and Tube Heat Exchangers: A CFD ...
PDF
Advancements in CFD Analysis of Shell and Tube Heat Exchangers with Nanofluid...
PDF
Breast Cancer Detection using Computer Vision
PDF
Auto-Charging E-Vehicle with its battery Management.
PDF
Analysis of high energy charge particle in the Heliosphere
PDF
A Novel System for Recommending Agricultural Crops Using Machine Learning App...
PDF
Auto-Charging E-Vehicle with its battery Management.
PDF
Analysis of high energy charge particle in the Heliosphere
PDF
Wireless Arduino Control via Mobile: Eliminating the Need for a Dedicated Wir...
Enhanced heart disease prediction using SKNDGR ensemble Machine Learning Model
Utilizing Biomedical Waste for Sustainable Brick Manufacturing: A Novel Appro...
Kiona – A Smart Society Automation Project
DESIGN AND DEVELOPMENT OF BATTERY THERMAL MANAGEMENT SYSTEM USING PHASE CHANG...
Invest in Innovation: Empowering Ideas through Blockchain Based Crowdfunding
SPACE WATCH YOUR REAL-TIME SPACE INFORMATION HUB
A Review on Influence of Fluid Viscous Damper on The Behaviour of Multi-store...
Wireless Arduino Control via Mobile: Eliminating the Need for a Dedicated Wir...
Explainable AI(XAI) using LIME and Disease Detection in Mango Leaf by Transfe...
BRAIN TUMOUR DETECTION AND CLASSIFICATION
The Project Manager as an ambassador of the contract. The case of NEC4 ECC co...
"Enhanced Heat Transfer Performance in Shell and Tube Heat Exchangers: A CFD ...
Advancements in CFD Analysis of Shell and Tube Heat Exchangers with Nanofluid...
Breast Cancer Detection using Computer Vision
Auto-Charging E-Vehicle with its battery Management.
Analysis of high energy charge particle in the Heliosphere
A Novel System for Recommending Agricultural Crops Using Machine Learning App...
Auto-Charging E-Vehicle with its battery Management.
Analysis of high energy charge particle in the Heliosphere
Wireless Arduino Control via Mobile: Eliminating the Need for a Dedicated Wir...

Recently uploaded (20)

PDF
composite construction of structures.pdf
PDF
Well-logging-methods_new................
PPTX
bas. eng. economics group 4 presentation 1.pptx
PDF
Embodied AI: Ushering in the Next Era of Intelligent Systems
PPTX
web development for engineering and engineering
PDF
Automation-in-Manufacturing-Chapter-Introduction.pdf
PPTX
Foundation to blockchain - A guide to Blockchain Tech
PPTX
Welding lecture in detail for understanding
PPTX
Engineering Ethics, Safety and Environment [Autosaved] (1).pptx
PDF
Operating System & Kernel Study Guide-1 - converted.pdf
PDF
PRIZ Academy - 9 Windows Thinking Where to Invest Today to Win Tomorrow.pdf
PPTX
Recipes for Real Time Voice AI WebRTC, SLMs and Open Source Software.pptx
PPTX
M Tech Sem 1 Civil Engineering Environmental Sciences.pptx
PPTX
UNIT-1 - COAL BASED THERMAL POWER PLANTS
PPTX
OOP with Java - Java Introduction (Basics)
PDF
Enhancing Cyber Defense Against Zero-Day Attacks using Ensemble Neural Networks
PPTX
KTU 2019 -S7-MCN 401 MODULE 2-VINAY.pptx
PDF
Model Code of Practice - Construction Work - 21102022 .pdf
PPTX
additive manufacturing of ss316l using mig welding
PDF
July 2025 - Top 10 Read Articles in International Journal of Software Enginee...
composite construction of structures.pdf
Well-logging-methods_new................
bas. eng. economics group 4 presentation 1.pptx
Embodied AI: Ushering in the Next Era of Intelligent Systems
web development for engineering and engineering
Automation-in-Manufacturing-Chapter-Introduction.pdf
Foundation to blockchain - A guide to Blockchain Tech
Welding lecture in detail for understanding
Engineering Ethics, Safety and Environment [Autosaved] (1).pptx
Operating System & Kernel Study Guide-1 - converted.pdf
PRIZ Academy - 9 Windows Thinking Where to Invest Today to Win Tomorrow.pdf
Recipes for Real Time Voice AI WebRTC, SLMs and Open Source Software.pptx
M Tech Sem 1 Civil Engineering Environmental Sciences.pptx
UNIT-1 - COAL BASED THERMAL POWER PLANTS
OOP with Java - Java Introduction (Basics)
Enhancing Cyber Defense Against Zero-Day Attacks using Ensemble Neural Networks
KTU 2019 -S7-MCN 401 MODULE 2-VINAY.pptx
Model Code of Practice - Construction Work - 21102022 .pdf
additive manufacturing of ss316l using mig welding
July 2025 - Top 10 Read Articles in International Journal of Software Enginee...

IRJET - Threat Prediction using Speech Analysis

  • 1. International Research Journal of Engineering and Technology (IRJET) e-ISSN: 2395-0056 Volume: 07 Issue: 02 | Feb 2020 www.irjet.net p-ISSN: 2395-0072 © 2020, IRJET | Impact Factor value: 7.34 | ISO 9001:2008 Certified Journal | Page 1301 Threat Prediction using Speech Analysis Shreyas Kulkarni1, Nirantar Kulkarni2, Sushil Kukreja3, Kaustubh Kotkar4, Shambhavi Kulkarni5, Jayanti Kamalasekaran6 1,2,3,4B.E. Student, Dept. of Computer Engineering, Sinhgad College of Engineering, Vadgaon, Pune- 411041, Maharashtra, India ---------------------------------------------------------------------***---------------------------------------------------------------------- Abstract -Speech recognition technology is one of the fast growing Engineering technologies. It has number of applications in different areas and provides potential benefits. Audio surveillance is getting more important and requires large amount of man hours to determine threats from audio. The aim of this system is to identify potential cases of threats, and provide an early warning or alert to such cases. This will be based on voice such as voice chat over telecommunication networks or social media. The intended result will be achieved in the three major steps. This system will utilize latest machine learning algorithms to identify phonetics from the audio and helps convert it to equivalent transcript form. Natural language processing will help in developing sentiment analysis model. Key Words: Recurrent Neural Network, Sentimental Analysis, Speech Recognition, Natural Language Processing. 1. INTRODUCTION There has been much advancement in the areas of education, healthcare, etc. However, crime is one of the major issues faced by societies all over the world. The crime investigation team follows a systematic procedure for solving a particular case. At first, there are multiple suspects. Hence, to reach to a conclusion, the police might have to listen to their call recordings. This procedure consumes a lot of time as a single person is required to listen to multiple recordings. This also has a scope for human errors. These errors and time can be minimized using our proposed system. In this system, the user needs to provide an audio as an input and the system displays a warning about threat present in the audio. There are 2 parts in this project. First part converts speech to text and the second part applies sentimental analysis to the text. The total percentage of threat is calculated and if it is above a certain threshold, warning is provided. This system can also be used separately, i.e. only the first part can be used by the user if he/she doesn’t want sentimental analysis on the text and the second part can be used be directly providing text to the system to analyze the sentiments in a particular text. Speech to text is converted using signal processing and sentimental analysis is carried out using Naïve Bayes algorithm and neural network. This system is advantageous as it provides the results of the recording directly without a person having to analyze it for hours. 1.1 Literature Survey 1. English Language Speech Recognition using MFCC and HMM: This model implements and exploits Mel Frequency Cepstral Coefficient and Hidden Markov Model Techniques to evaluate the competence of speech recognition. In this paper they have used data from Google audioSet and converted audio signal into a vector using MFCC. The research is done using Python Programming with the help of Librosa library to generate MFCC. 2. Speech Recognition with Deep Recurrent Neural Network: In this paper, they have implemented speech recognition technique using Recurrent Neural Network which will also take into consideration context of the sentence. They have taken TIMIT dataset to perform phoneme recognition. This model combines deep, bidirectional Long Short- term Memory RNNs with end-to-end training. 3. Speech Recognition using Deep Learning: In this paper, audio files are taken as input and Artificial Neural Network is used to predict speech as an output. This research uses Deep Learning for speech recognition using library from Google which has 66.22% accuracy. 4. Sentimental Analysis and Prediction using Neural Networks: This paper classifies sentences into positive, negative and neutral categories. It uses Artificial Neural Network for sentimental analysis by training and testing data. It also checks the accuracy of ANN for large datasets. 2. Project Overview Supervision and analysis of audio files manually takes a lot of time. This procedure is very tedious and leaves scope for a lot of human errors. These errors might manipulate the results on a large scale. We have proposed a system that eliminates the human work of supervising the audio files. In sectors like crime investigation, this system might prove to be advantageous as threat in a particular audio file will be analyzed automatically and a warning will be provided as output. This system can also be used separately by the user or actor. The developed model will help the will be of help to various organizations as they can either use it for speech to text conversion or
  • 2. International Research Journal of Engineering and Technology (IRJET) e-ISSN: 2395-0056 Volume: 07 Issue: 02 | Feb 2020 www.irjet.net p-ISSN: 2395-0072 © 2020, IRJET | Impact Factor value: 7.34 | ISO 9001:2008 Certified Journal | Page 1302 sentimental analysis or both. There is one user/actor that can use the system by providing an audio file as input. [A] Artificial Neural Network Artificial Neural Network is used for processing the information similar to how brain works. It has large number of small processing elements connected to each other, working together to solve a specific problem. In the above figure, inputs are basically independent variables which are multiplied by their weights and summed up to get the activation function. This function takes decision on whether neuron should be activated or not. For learning of neural network, it uses the cost function which is the difference between the actual output and expected output. This function is analyzed and proper changes are made to the inputs accordingly. Cost function is reduced to minimum value to get best learning results. We use back propagation on the network continuously till error becomes minimal. [B] System Architecture Diagram Large number of audio files are given as input to the system. These audio files are converted to the digital data using signal processing. Then feature extraction and feature selection is performed on digital data. We will get the text output by applying ANN algorithm on those selected features. Text is classified into positive, negative or neutral sentiment using Naïve Bayes classifier. The polarity of the text input is calculated by using ANN on the classified data. [C] Sentimental Analysis Sentiment analysis is used to detect threats and give the warning. It is used to analyse the tone of the text and classify it into positive or negative. This process is very useful to detect the threat before so that it can be eliminated or damage can be reduced. Textual data is analysed and the polarity of the text is calculated. All the data is scaled using the data mining methods like standard scalar. Then the data is given to ANN to train the model. This model is supervisedly trained to get the threat warning for the input text. [D] System Architecture Diagram 3. CONCLUSION The developed model will help revolutionize audio surveillance as time required to listen and then analyze the audio is highly reduced. Human errors such as skipping some part of audio will also be reduced. Available API’s such as Google API may become paid in future and hence our system provides a way to convert speech to text without using Google API. REFERENCES [1] Phoemporn Lakkhanawannakun, Chaluemwut Noyunsan, “Speech Recognition using Deep Learning,” Rajamangala University of Technology, Thiland. [2] Kanchan Naithani, V.M. Thakkar, Ashish Senwal, “English Language Speech Recognition using MFCC and HMM,” GBP Institute of Engineering Technology, Uttarakhand, India. [3] Sneh Paliwal, Sunil Khatri, Mayank Sharma, “Sentiment Analysis and Prediction using Neural Networks,” Amity University, Noida, Uttar Pradesh, India.