SlideShare a Scribd company logo
6
Most read
Speech Emotion
Recognition with CNN
Capstone Project
Diego Rios
11
Can we detect
emotion
from audio files?
2
How can would
this be applicable?
Recognizing when users
are upset over phone calls
with bots and then
speeding call to a human or
agent.
3
Data
◦ 2 data sets
◦ 8 emotions
◦ 24 actors
4
You can think of
sound as vibrations
that propagate as an
acoustic wave
5
6
Data Description
1. Sample the audio file at a
specific rate; resulting in an
array of numbers.
2. Converts audio into a time
series analysis
3. Plot the amplitude of time
4. Measure Mel-frequency
cepstral coefficients (MFCCs)
a. Assimilate to human hearing
b. Transforms linear
frequencies to
quasi-logarithmic
frequencies scaled
Image source: https://manual.audacityteam.org/man/digital_audio.html
Complete Clean-Up & Modelling
◦ MFCC become our input features
◦ Started with 1170 audio files and got 1170 rows of data
◦ There are 8 target values:
◦ Angry
◦ Calm
◦ Disgust
◦ Fear
◦ Happy
◦ Neutral
◦ Sad
◦ Surprised
40 features => Averages of frequency of audio target
feature
How accurate is the
model?
1 dimensional CNN
Random: 12.5%
8 emotions (1 divided by 8)
1D CNN: 75%
Much improved result!
Random Forest: 46%
Better but still low...
9
10
Confusion Matrix
& Next Steps
1. Fear is commonly misclassified
2. Surprised is also misclassified
3. Data imbalance with some
emotions having more audio
files than others.
4. Record audio files and used
trained CNN to validate results
and model.
11
Thanks!
Any questions?
You can find me at:
◦ https://medium.com/@Markeko/speech-emoti
on-recognition-with-convolutional-neural-netw
ork-ae5406a1c0f7
◦ diegoerios@gmail.com
◦ https://github.com/diegoerios

More Related Content

PDF
Emotion Recognition Based On Audio Speech
PDF
Speech emotion recognition
PPTX
SPEECH BASED EMOTION RECOGNITION USING VOICE
DOCX
Voice morphing document
PDF
Hand Gesture Recognition using Neural Network
PPTX
SPEECH RECOGNITION USING NEURAL NETWORK
PPT
Speech Recognition in Artificail Inteligence
PDF
Emotion based music player
Emotion Recognition Based On Audio Speech
Speech emotion recognition
SPEECH BASED EMOTION RECOGNITION USING VOICE
Voice morphing document
Hand Gesture Recognition using Neural Network
SPEECH RECOGNITION USING NEURAL NETWORK
Speech Recognition in Artificail Inteligence
Emotion based music player

What's hot (20)

PDF
Emotion detection using cnn.pptx
PPTX
Emotion recognition
PPTX
Speech Recognition Technology
PPTX
Predictive coding
PPT
Voice morphing-101113123852-phpapp01
PDF
EMOTION DETECTION USING AI
PPTX
Minor on Face Recognition System using Raspberry Pi
PPT
Data Redundacy
PPTX
Facial emotion recognition
PPTX
Number plate recognition using matlab
PPTX
Voice recognition system
PPTX
Voice Morping ppt
PDF
Facial emotion recognition
DOCX
Project synopsis on face recognition in e attendance
PPTX
Speech recognition final presentation
PPTX
Facial expression recognition projc 2 (3) (1)
PPTX
Emotion based music player
PPTX
Histogram Specification or Matching Problem
ODP
image compression ppt
PDF
Silent sound technology final report
Emotion detection using cnn.pptx
Emotion recognition
Speech Recognition Technology
Predictive coding
Voice morphing-101113123852-phpapp01
EMOTION DETECTION USING AI
Minor on Face Recognition System using Raspberry Pi
Data Redundacy
Facial emotion recognition
Number plate recognition using matlab
Voice recognition system
Voice Morping ppt
Facial emotion recognition
Project synopsis on face recognition in e attendance
Speech recognition final presentation
Facial expression recognition projc 2 (3) (1)
Emotion based music player
Histogram Specification or Matching Problem
image compression ppt
Silent sound technology final report
Ad

Similar to Emotion Speech Recognition - Convolutional Neural Network Capstone Project (20)

PDF
IRJET-speech emotion.pdf
PPTX
Advancing Sentiment Analysis in Audio: Deep Learning & NLP approaches for Emo...
PPTX
Advancing Sentiment Analysis in Audio: Deep Learning & NLP approaches for Emo...
PPTX
Advancing Sentiment Analysis in Audio: Deep Learning & NLP approaches for Emo...
PDF
Presentation Mini Project_Presentation Mini Project.pdf
PDF
IRJET - Audio Emotion Analysis
PDF
IRJET- Comparative Analysis of Emotion Recognition System
PPTX
Vocal Sentiments Transformers Based Speech Emotion Recognition Emotion Recogn...
PPTX
speech emirjopjkfsnakfnkjsdnsdjdnknfksdnknj
PDF
Speech emotion recognition using 2D-convolutional neural network
PDF
Human Emotion Recognition From Speech
PDF
Speech Emotion Recognition Using Machine Learning
PDF
Emotion Recognition through Speech Analysis using various Deep Learning Algor...
PPTX
Audio Visual Emotion Recognition Using Cross Correlation and Wavelet Packet D...
PPTX
Emotions detection voice using ai ml Project-PPT(1)12.pptx
PPTX
Speech based emotion recognition using artificial intelligence
PDF
76201926
PDF
IRJET- Emotion recognition using Speech Signal: A Review
PDF
A Review Paper on Speech Based Emotion Detection Using Deep Learning
PPTX
AI-Driven Emotion Recognition - Integrated Electronic Systems
IRJET-speech emotion.pdf
Advancing Sentiment Analysis in Audio: Deep Learning & NLP approaches for Emo...
Advancing Sentiment Analysis in Audio: Deep Learning & NLP approaches for Emo...
Advancing Sentiment Analysis in Audio: Deep Learning & NLP approaches for Emo...
Presentation Mini Project_Presentation Mini Project.pdf
IRJET - Audio Emotion Analysis
IRJET- Comparative Analysis of Emotion Recognition System
Vocal Sentiments Transformers Based Speech Emotion Recognition Emotion Recogn...
speech emirjopjkfsnakfnkjsdnsdjdnknfksdnknj
Speech emotion recognition using 2D-convolutional neural network
Human Emotion Recognition From Speech
Speech Emotion Recognition Using Machine Learning
Emotion Recognition through Speech Analysis using various Deep Learning Algor...
Audio Visual Emotion Recognition Using Cross Correlation and Wavelet Packet D...
Emotions detection voice using ai ml Project-PPT(1)12.pptx
Speech based emotion recognition using artificial intelligence
76201926
IRJET- Emotion recognition using Speech Signal: A Review
A Review Paper on Speech Based Emotion Detection Using Deep Learning
AI-Driven Emotion Recognition - Integrated Electronic Systems
Ad

Recently uploaded (20)

PPT
ISS -ESG Data flows What is ESG and HowHow
PDF
Mega Projects Data Mega Projects Data
PPTX
oil_refinery_comprehensive_20250804084928 (1).pptx
PPTX
advance b rammar.pptxfdgdfgdfsgdfgsdgfdfgdfgsdfgdfgdfg
PPT
Miokarditis (Inflamasi pada Otot Jantung)
PPTX
ALIMENTARY AND BILIARY CONDITIONS 3-1.pptx
PDF
“Getting Started with Data Analytics Using R – Concepts, Tools & Case Studies”
PPTX
IB Computer Science - Internal Assessment.pptx
PPTX
Introduction to Basics of Ethical Hacking and Penetration Testing -Unit No. 1...
PPTX
Microsoft-Fabric-Unifying-Analytics-for-the-Modern-Enterprise Solution.pptx
PDF
Fluorescence-microscope_Botany_detailed content
PDF
168300704-gasification-ppt.pdfhghhhsjsjhsuxush
PPTX
Introduction to machine learning and Linear Models
PPT
Reliability_Chapter_ presentation 1221.5784
PDF
Lecture1 pattern recognition............
PDF
Foundation of Data Science unit number two notes
PDF
BF and FI - Blockchain, fintech and Financial Innovation Lesson 2.pdf
PPTX
STUDY DESIGN details- Lt Col Maksud (21).pptx
PPTX
AI Strategy room jwfjksfksfjsjsjsjsjfsjfsj
PPT
Quality review (1)_presentation of this 21
ISS -ESG Data flows What is ESG and HowHow
Mega Projects Data Mega Projects Data
oil_refinery_comprehensive_20250804084928 (1).pptx
advance b rammar.pptxfdgdfgdfsgdfgsdgfdfgdfgsdfgdfgdfg
Miokarditis (Inflamasi pada Otot Jantung)
ALIMENTARY AND BILIARY CONDITIONS 3-1.pptx
“Getting Started with Data Analytics Using R – Concepts, Tools & Case Studies”
IB Computer Science - Internal Assessment.pptx
Introduction to Basics of Ethical Hacking and Penetration Testing -Unit No. 1...
Microsoft-Fabric-Unifying-Analytics-for-the-Modern-Enterprise Solution.pptx
Fluorescence-microscope_Botany_detailed content
168300704-gasification-ppt.pdfhghhhsjsjhsuxush
Introduction to machine learning and Linear Models
Reliability_Chapter_ presentation 1221.5784
Lecture1 pattern recognition............
Foundation of Data Science unit number two notes
BF and FI - Blockchain, fintech and Financial Innovation Lesson 2.pdf
STUDY DESIGN details- Lt Col Maksud (21).pptx
AI Strategy room jwfjksfksfjsjsjsjsjfsjfsj
Quality review (1)_presentation of this 21

Emotion Speech Recognition - Convolutional Neural Network Capstone Project

  • 1. Speech Emotion Recognition with CNN Capstone Project Diego Rios 11
  • 2. Can we detect emotion from audio files? 2
  • 3. How can would this be applicable? Recognizing when users are upset over phone calls with bots and then speeding call to a human or agent. 3
  • 4. Data ◦ 2 data sets ◦ 8 emotions ◦ 24 actors 4
  • 5. You can think of sound as vibrations that propagate as an acoustic wave 5
  • 6. 6 Data Description 1. Sample the audio file at a specific rate; resulting in an array of numbers. 2. Converts audio into a time series analysis 3. Plot the amplitude of time 4. Measure Mel-frequency cepstral coefficients (MFCCs) a. Assimilate to human hearing b. Transforms linear frequencies to quasi-logarithmic frequencies scaled Image source: https://manual.audacityteam.org/man/digital_audio.html
  • 7. Complete Clean-Up & Modelling ◦ MFCC become our input features ◦ Started with 1170 audio files and got 1170 rows of data ◦ There are 8 target values: ◦ Angry ◦ Calm ◦ Disgust ◦ Fear ◦ Happy ◦ Neutral ◦ Sad ◦ Surprised 40 features => Averages of frequency of audio target feature
  • 8. How accurate is the model? 1 dimensional CNN
  • 9. Random: 12.5% 8 emotions (1 divided by 8) 1D CNN: 75% Much improved result! Random Forest: 46% Better but still low... 9
  • 10. 10 Confusion Matrix & Next Steps 1. Fear is commonly misclassified 2. Surprised is also misclassified 3. Data imbalance with some emotions having more audio files than others. 4. Record audio files and used trained CNN to validate results and model.
  • 11. 11 Thanks! Any questions? You can find me at: ◦ https://medium.com/@Markeko/speech-emoti on-recognition-with-convolutional-neural-netw ork-ae5406a1c0f7 ◦ diegoerios@gmail.com ◦ https://github.com/diegoerios