SlideShare a Scribd company logo
Emotion and Theme recognition in music using Jamendo
A new MediaEval task
Alastair Porter, Dmitry Bogdanov, Philip Tovstogan
Music Technology Group, Universitat Pompeu Fabra
Emotion and mood in MIR
● A popular task in Music Information Retrieval
● Relevant for music search and recommendation
● MIREX (Music Information Retrieval Evaluation eXchange)
https://www.music-ir.org/mirex/wiki/MIREX_HOME
● Commercial music (cannot be shared outside of contest)
● 5 clusters of similar moods
Cluster_1: passionate, rousing, confident,boisterous, rowdy
Cluster_2: rollicking, cheerful, fun, sweet, amiable/good natured
Cluster_3: literate, poignant, wistful, bittersweet, autumnal, brooding
Cluster_4: humorous, silly, campy, quirky, whimsical, witty, wry
Cluster_5: aggressive, fiery, tense/anxious, intense, volatile,visceral
X. Hu and J. S. Downie, “Exploring mood metadata: Relationships with genre, artist and usage metadata,” in International
Conference on Music Information Retrieval, 2007.
Emotion and mood in MIR
● Presented in MediaEval 2015:
http://www.multimediaeval.org/mediaeval2015/emotioninmusic2015/
○ 1000 Songs, annotated by 10 subjects each
○ Audio samples from Free Music Archive
Soleymani, M., Caro, M. N., Schmidt, E. M., Sha, C. Y., & Yang, Y. H. (2013). 1000 songs for emotional analysis of music.
In Proceedings of the 2nd ACM international workshop on Crowdsourcing for multimedia (pp. 1-6). ACM.
Emotion and mood in MIR
● Mood dataset used in AcousticBrainz, prepared by Cyril
Laurier
http://acousticbrainz.org/datasets/accuracy#mood_acoustic
● acoustic, electronic, aggressive, happy, sad, relaxed, party
Laurier, C., Meyers, O., Serra, J., Blech, M., & Herrera, P. (2009). Music mood annotator design and integration. In 7th
International Workshop on Content-Based Multimedia Indexing (CBMI'09), pp. 156-161.
Jamendo
● https://jamendo.org
● Database of music released under creative commons
licenses, can be uploaded by anyone
● Metadata added by performers during upload, including
instrumentation, vocals, genre, moods, and themes
● Commercial product, “Jamendo Licensing”, to allow
musicians to earn from their works
Jamendo
Jamendo data
● 50k tracks with annotations, including mood and theme
○ 3.3k happy
○ 2.8k dark
○ 2.3k energetic
○ 2.2k relaxing
○ 2.2k epic
○ 2.0k melodic
○ 2.0k love
○ ...
Possible approaches
● Label classification (Autotagging)
● Label cluster classification (e.g. MIREX)
● Arousal / Valence regresion (Emotion In Music 2015)
● Arousal / Valence segment classification (ISMIR 2018)
Mapping tags to emotion space
● 267 mood and theme tags ⇒ 204 tags in Warriner’s list
○ 13,915 English words with Arousal and Valence ratings
according to Russell’s model
● Instead of asking for people to listen and annotate song,
use existing labels and map it to a space
Warriner, A.B., Kuperman, V., & Brysbaert, M. (2013). Norms of valence, arousal, and dominance for 13,915 English
lemmas. Behavior Research Methods, 45
Musical Texture and Expressivity Features for Music Emotion Recognition, Renato Panda, Ricardo Malheiro, and Rui
Pedro Paiva International Society on Music Information Retrieval Conference 2018
Mapping tags to emotion space
data I (22695)II (2610)
III (8443) IV (29301)
Jamendo data
● Some tracks have tags that
fall into multiple quadrants
● 28k out of 50k tracks fall in a
single quadrant
Our open questions
● How many of the possible approaches are a good idea for
subtasks?
○ Classification, Clustering, continuous
○ The "Emotion in Music Task" MediaEval task focused on
continuous annotations. Is it an issue if we don’t?
● How to deal with skewed data (“all music is happy”)
● In classification, what to do with neutral labels (close to 0)
● Evaluation metrics?
Thanks

More Related Content

PDF
Emotion and Theme Recognition in Music Using Jamendo
PDF
Exploring the Relationship Between Multi-Modal Emotion Semantics of Music
PDF
Emotion analysis of songs based on lyrical
PDF
MediaEval 2020: Emotion and Theme Recognition in Music Using Jamendo
PPTX
Developing music mood taxonomies
PDF
Dimensional Music Emotion Recognition
PDF
Emotion in Music Task at MediaEval 2014
PDF
IRJET - EMO-MUSIC(Emotion based Music Player)
Emotion and Theme Recognition in Music Using Jamendo
Exploring the Relationship Between Multi-Modal Emotion Semantics of Music
Emotion analysis of songs based on lyrical
MediaEval 2020: Emotion and Theme Recognition in Music Using Jamendo
Developing music mood taxonomies
Dimensional Music Emotion Recognition
Emotion in Music Task at MediaEval 2014
IRJET - EMO-MUSIC(Emotion based Music Player)

Similar to MediaEval 2018: Emotion and theme recognition in music using jamendo (20)

PDF
Emotion classification for musical data using deep learning techniques
PDF
Music Emotion Classification based on Lyrics-Audio using Corpus based Emotion...
PDF
Audio Features for Music Emotion Recognition A Survey.pdf
PDF
Mouawad
PDF
Mood Detection
PDF
MOODetector: Automatic Music Emotion Recognition
PDF
MediaEval 2015 - Emotion in Music: Task Overview
PPTX
Emotion based music player
PDF
MediaEval 2016 - Emotion in Music Task: Lessons Learned
PDF
From Content-based Music Emotion Recognition to Emotion Maps of Musical Piece...
PDF
Emotion Recognition in Classical Music
PDF
Mehfil : Song Recommendation System Using Sentiment Detected
PDF
SMART MUSIC PLAYER BASED ON EMOTION DETECTION
PPT
A Musical System For Emotional Expression
PPTX
What does music mood mean for real users?
PPTX
Interdisciplinary Perspectives on Emotion, Music and Technology
PDF
Gracenote: Music Navigation and Discovery
PDF
Emotion Based Music Player
PDF
A General Architecture for an Emotion-aware Content-based Recommender System
PDF
Emotion Recognition in Classical Music - Presentation
Emotion classification for musical data using deep learning techniques
Music Emotion Classification based on Lyrics-Audio using Corpus based Emotion...
Audio Features for Music Emotion Recognition A Survey.pdf
Mouawad
Mood Detection
MOODetector: Automatic Music Emotion Recognition
MediaEval 2015 - Emotion in Music: Task Overview
Emotion based music player
MediaEval 2016 - Emotion in Music Task: Lessons Learned
From Content-based Music Emotion Recognition to Emotion Maps of Musical Piece...
Emotion Recognition in Classical Music
Mehfil : Song Recommendation System Using Sentiment Detected
SMART MUSIC PLAYER BASED ON EMOTION DETECTION
A Musical System For Emotional Expression
What does music mood mean for real users?
Interdisciplinary Perspectives on Emotion, Music and Technology
Gracenote: Music Navigation and Discovery
Emotion Based Music Player
A General Architecture for an Emotion-aware Content-based Recommender System
Emotion Recognition in Classical Music - Presentation
Ad

More from multimediaeval (20)

PPTX
Classification of Strokes in Table Tennis with a Three Stream Spatio-Temporal...
PDF
HCMUS at MediaEval 2020: Ensembles of Temporal Deep Neural Networks for Table...
PDF
Sports Video Classification: Classification of Strokes in Table Tennis for Me...
PDF
Predicting Media Memorability from a Multimodal Late Fusion of Self-Attention...
PPTX
Essex-NLIP at MediaEval Predicting Media Memorability 2020 Task
PDF
Overview of MediaEval 2020 Predicting Media Memorability task: What Makes a V...
PDF
Fooling an Automatic Image Quality Estimator
PDF
Fooling Blind Image Quality Assessment by Optimizing a Human-Understandable C...
PDF
Pixel Privacy: Quality Camouflage for Social Images
PDF
HCMUS at MediaEval 2020:Image-Text Fusion for Automatic News-Images Re-Matching
PPTX
Efficient Supervision Net: Polyp Segmentation using EfficientNet and Attentio...
PDF
HCMUS at Medico Automatic Polyp Segmentation Task 2020: PraNet and ResUnet++ ...
PDF
Depth-wise Separable Atrous Convolution for Polyps Segmentation in Gastro-Int...
PPTX
Deep Conditional Adversarial learning for polyp Segmentation
PPTX
A Temporal-Spatial Attention Model for Medical Image Detection
PPTX
HCMUS-Juniors 2020 at Medico Task in MediaEval 2020: Refined Deep Neural Netw...
PDF
Fine-tuning for Polyp Segmentation with Attention
PPTX
Bigger Networks are not Always Better: Deep Convolutional Neural Networks for...
PPTX
Insights for wellbeing: Predicting Personal Air Quality Index using Regressio...
PDF
Use Visual Features From Surrounding Scenes to Improve Personal Air Quality ...
Classification of Strokes in Table Tennis with a Three Stream Spatio-Temporal...
HCMUS at MediaEval 2020: Ensembles of Temporal Deep Neural Networks for Table...
Sports Video Classification: Classification of Strokes in Table Tennis for Me...
Predicting Media Memorability from a Multimodal Late Fusion of Self-Attention...
Essex-NLIP at MediaEval Predicting Media Memorability 2020 Task
Overview of MediaEval 2020 Predicting Media Memorability task: What Makes a V...
Fooling an Automatic Image Quality Estimator
Fooling Blind Image Quality Assessment by Optimizing a Human-Understandable C...
Pixel Privacy: Quality Camouflage for Social Images
HCMUS at MediaEval 2020:Image-Text Fusion for Automatic News-Images Re-Matching
Efficient Supervision Net: Polyp Segmentation using EfficientNet and Attentio...
HCMUS at Medico Automatic Polyp Segmentation Task 2020: PraNet and ResUnet++ ...
Depth-wise Separable Atrous Convolution for Polyps Segmentation in Gastro-Int...
Deep Conditional Adversarial learning for polyp Segmentation
A Temporal-Spatial Attention Model for Medical Image Detection
HCMUS-Juniors 2020 at Medico Task in MediaEval 2020: Refined Deep Neural Netw...
Fine-tuning for Polyp Segmentation with Attention
Bigger Networks are not Always Better: Deep Convolutional Neural Networks for...
Insights for wellbeing: Predicting Personal Air Quality Index using Regressio...
Use Visual Features From Surrounding Scenes to Improve Personal Air Quality ...
Ad

Recently uploaded (20)

PPTX
Substance Disorders- part different drugs change body
PDF
Unit 5 Preparations, Reactions, Properties and Isomersim of Organic Compounds...
PPT
veterinary parasitology ````````````.ppt
PPT
THE CELL THEORY AND ITS FUNDAMENTALS AND USE
PPTX
Welcome-grrewfefweg-students-of-2024.pptx
PPTX
A powerpoint on colorectal cancer with brief background
PPTX
INTRODUCTION TO PAEDIATRICS AND PAEDIATRIC HISTORY TAKING-1.pptx
PPTX
Seminar Hypertension and Kidney diseases.pptx
PPTX
TORCH INFECTIONS in pregnancy with toxoplasma
PDF
Is Earendel a Star Cluster?: Metal-poor Globular Cluster Progenitors at z ∼ 6
PPTX
gene cloning powerpoint for general biology 2
PPTX
Lesson-1-Introduction-to-the-Study-of-Chemistry.pptx
PPTX
BIOMOLECULES PPT........................
PPT
6.1 High Risk New Born. Padetric health ppt
PPTX
Hypertension_Training_materials_English_2024[1] (1).pptx
PDF
lecture 2026 of Sjogren's syndrome l .pdf
PPT
1. INTRODUCTION TO EPIDEMIOLOGY.pptx for community medicine
PPTX
Introcution to Microbes Burton's Biology for the Health
PPTX
POULTRY PRODUCTION AND MANAGEMENTNNN.pptx
PPT
LEC Synthetic Biology and its application.ppt
Substance Disorders- part different drugs change body
Unit 5 Preparations, Reactions, Properties and Isomersim of Organic Compounds...
veterinary parasitology ````````````.ppt
THE CELL THEORY AND ITS FUNDAMENTALS AND USE
Welcome-grrewfefweg-students-of-2024.pptx
A powerpoint on colorectal cancer with brief background
INTRODUCTION TO PAEDIATRICS AND PAEDIATRIC HISTORY TAKING-1.pptx
Seminar Hypertension and Kidney diseases.pptx
TORCH INFECTIONS in pregnancy with toxoplasma
Is Earendel a Star Cluster?: Metal-poor Globular Cluster Progenitors at z ∼ 6
gene cloning powerpoint for general biology 2
Lesson-1-Introduction-to-the-Study-of-Chemistry.pptx
BIOMOLECULES PPT........................
6.1 High Risk New Born. Padetric health ppt
Hypertension_Training_materials_English_2024[1] (1).pptx
lecture 2026 of Sjogren's syndrome l .pdf
1. INTRODUCTION TO EPIDEMIOLOGY.pptx for community medicine
Introcution to Microbes Burton's Biology for the Health
POULTRY PRODUCTION AND MANAGEMENTNNN.pptx
LEC Synthetic Biology and its application.ppt

MediaEval 2018: Emotion and theme recognition in music using jamendo

  • 1. Emotion and Theme recognition in music using Jamendo A new MediaEval task Alastair Porter, Dmitry Bogdanov, Philip Tovstogan Music Technology Group, Universitat Pompeu Fabra
  • 2. Emotion and mood in MIR ● A popular task in Music Information Retrieval ● Relevant for music search and recommendation ● MIREX (Music Information Retrieval Evaluation eXchange) https://www.music-ir.org/mirex/wiki/MIREX_HOME ● Commercial music (cannot be shared outside of contest) ● 5 clusters of similar moods Cluster_1: passionate, rousing, confident,boisterous, rowdy Cluster_2: rollicking, cheerful, fun, sweet, amiable/good natured Cluster_3: literate, poignant, wistful, bittersweet, autumnal, brooding Cluster_4: humorous, silly, campy, quirky, whimsical, witty, wry Cluster_5: aggressive, fiery, tense/anxious, intense, volatile,visceral X. Hu and J. S. Downie, “Exploring mood metadata: Relationships with genre, artist and usage metadata,” in International Conference on Music Information Retrieval, 2007.
  • 3. Emotion and mood in MIR ● Presented in MediaEval 2015: http://www.multimediaeval.org/mediaeval2015/emotioninmusic2015/ ○ 1000 Songs, annotated by 10 subjects each ○ Audio samples from Free Music Archive Soleymani, M., Caro, M. N., Schmidt, E. M., Sha, C. Y., & Yang, Y. H. (2013). 1000 songs for emotional analysis of music. In Proceedings of the 2nd ACM international workshop on Crowdsourcing for multimedia (pp. 1-6). ACM.
  • 4. Emotion and mood in MIR ● Mood dataset used in AcousticBrainz, prepared by Cyril Laurier http://acousticbrainz.org/datasets/accuracy#mood_acoustic ● acoustic, electronic, aggressive, happy, sad, relaxed, party Laurier, C., Meyers, O., Serra, J., Blech, M., & Herrera, P. (2009). Music mood annotator design and integration. In 7th International Workshop on Content-Based Multimedia Indexing (CBMI'09), pp. 156-161.
  • 5. Jamendo ● https://jamendo.org ● Database of music released under creative commons licenses, can be uploaded by anyone ● Metadata added by performers during upload, including instrumentation, vocals, genre, moods, and themes ● Commercial product, “Jamendo Licensing”, to allow musicians to earn from their works
  • 7. Jamendo data ● 50k tracks with annotations, including mood and theme ○ 3.3k happy ○ 2.8k dark ○ 2.3k energetic ○ 2.2k relaxing ○ 2.2k epic ○ 2.0k melodic ○ 2.0k love ○ ...
  • 8. Possible approaches ● Label classification (Autotagging) ● Label cluster classification (e.g. MIREX) ● Arousal / Valence regresion (Emotion In Music 2015) ● Arousal / Valence segment classification (ISMIR 2018)
  • 9. Mapping tags to emotion space ● 267 mood and theme tags ⇒ 204 tags in Warriner’s list ○ 13,915 English words with Arousal and Valence ratings according to Russell’s model ● Instead of asking for people to listen and annotate song, use existing labels and map it to a space Warriner, A.B., Kuperman, V., & Brysbaert, M. (2013). Norms of valence, arousal, and dominance for 13,915 English lemmas. Behavior Research Methods, 45 Musical Texture and Expressivity Features for Music Emotion Recognition, Renato Panda, Ricardo Malheiro, and Rui Pedro Paiva International Society on Music Information Retrieval Conference 2018
  • 10. Mapping tags to emotion space data I (22695)II (2610) III (8443) IV (29301)
  • 11. Jamendo data ● Some tracks have tags that fall into multiple quadrants ● 28k out of 50k tracks fall in a single quadrant
  • 12. Our open questions ● How many of the possible approaches are a good idea for subtasks? ○ Classification, Clustering, continuous ○ The "Emotion in Music Task" MediaEval task focused on continuous annotations. Is it an issue if we don’t? ● How to deal with skewed data (“all music is happy”) ● In classification, what to do with neutral labels (close to 0) ● Evaluation metrics?