SlideShare a Scribd company logo
Machine
Learning & Big
Data for Music
Discovery
Galvanize NYC, Mar 9th, 2017
Vidhya Murali @vid052
Ching-Wei Chen @cweichen
100M users in 60 markets
50M subscribers
Over 30M songs, and 2B playlists
$5B paid to rightsholders
Spotify
Music for everyone.
30 Million Songs...
What to recommend?
What to recommend?
Discover
Discover
Discover Weekly
How What to recommend?
Many flavors of recommendations
Radio
Many flavors of recommendations
Daily MixRadio
Many flavors of recommendations
This Is:Daily MixRadio
Many flavors of recommendations
Recommended SongsThis Is:Daily MixRadio
‣ Editorial
Recommendation approaches
‣ Editorial
‣ Algorithmic
○Content-based
■ Metadata
■ Audio Signals
○Collaborative
■ Usage based
Recommendation approaches
‣ Editorial
‣ Algorithmic
○Content-based
■ Metadata
■ Audio Signals
○Collaborative
■ Usage based
‣ Algotorial
Recommendation approaches
‣ Editorial
‣ Algorithmic
○Content-based
■ Metadata
■ Audio Signals
○Collaborative
■ Usage based
‣ Algotorial
Recommendation approaches
‣ Find patterns from user’s past behavior to
generate recommendations.
‣ Domain independent
‣ Scalable
Collaborative Filtering
Compact representation for each user and item
(song): f-dimensional vectors
Latent Factor Models
NLP Models
Context & Co-occurrence is key!
Document : Playlist
Word : Song
NLP Models work great on playlists!
Generating Song Vectors
w1
w2
w3
w4
w5
w6
w7
wn....………..
?
Music in Latent Space
Semantic Regularities
Music + Math = Epic
Songs as vectors
Recommendations
User Profile:
● Aggregation over user
interactions on Spotify
● Clustering to capture
distinct user tastes/
contexts
● Time Sensitive profiling
‣ Scale of catalog
● 30M tracks; 2B playlists
● Training
○ 25B data points
○ 100M users
○ 60 countries represented
Challenges unique to spotify
Data Pipelines
Data Pipelines
Big Table
Big Table
GCS
D
A
T
A
F
L
O
W
Pub Sub
Scio
‣ Scale of catalog
● 30M tracks; 2B playlists
● Training
○ 25B data points
○ 100M users
○ 60 countries represented
‣ Cold-Start
○ New Users
○ New Music
Challenges unique to spotify
Learning
from sound
What’s in a sound?
What’s in a sound?
AmplitudeTime
Frequencies
Loudness
What’s in a sound?
Melody
Beats
Chords
Voices
Instruments
Lyrics
AmplitudeTime
Frequencies
Loudness
What’s in a sound?
Melody
Beats
Chords
Voices
Instruments
Lyrics
AmplitudeTime
Frequencies
Loudness
Popularity
Era
Region
Genre
Mood
Purpose
Supervised Machine Learning
http://www.nltk.org/
Deep Learning
Deep Learning
1. No feature extraction necessary
2. LOTS of simple learning nodes in many layers
3. Propogate errors backwards to learn optimal
weights
4. Needs LOTS of data
Convolutional Neural Networks
Typical Convolutional Neural Network
Deep Learning on Audio at Spotify
Sander Dieleman: http://benanne.github.io/2014/08/05/spotify-cnns.html
Input: Audio
spectrogram
Output: Latent
Space Vector
Audio vector space
Cold Start? Problem solved! *
* Not completely, of course!
Recommending new music
Release Radar Fresh Finds
Recommendations at Spotify
Recommended
SongsThis Is:Daily MixRadio
Discover
Weekly
Release
Radar
What’s next?
?
Join the band!
www.spotify.com/jobs
Ching-Wei (@cweichen): cw@spotify.com
Vidhya (@vid052): vidhya@spotify.com

More Related Content

PDF
Music Personalization At Spotify
PDF
Personalized Playlists at Spotify
PDF
Recommending and searching @ Spotify
PDF
Algorithmic Music Recommendations at Spotify
PPTX
Collaborative Filtering at Spotify
PDF
From Idea to Execution: Spotify's Discover Weekly
PDF
Music recommendations @ MLConf 2014
PDF
Homepage Personalization at Spotify
Music Personalization At Spotify
Personalized Playlists at Spotify
Recommending and searching @ Spotify
Algorithmic Music Recommendations at Spotify
Collaborative Filtering at Spotify
From Idea to Execution: Spotify's Discover Weekly
Music recommendations @ MLConf 2014
Homepage Personalization at Spotify

What's hot (20)

PDF
Big data and machine learning @ Spotify
PDF
The Evolution of Hadoop at Spotify - Through Failures and Pain
PDF
Data at Spotify
PDF
Big Data At Spotify
PDF
Building Data Pipelines for Music Recommendations at Spotify
PDF
Scala Data Pipelines for Music Recommendations
PPTX
Spotify Discover Weekly: The machine learning behind your music recommendations
PDF
The Evolution of Big Data at Spotify
PDF
Scala Data Pipelines @ Spotify
PDF
How Apache Drives Music Recommendations At Spotify
PDF
Personalizing the listening experience
PDF
Music Personalization : Real time Platforms.
PDF
CF Models for Music Recommendations At Spotify
PDF
Playlist Recommendations @ Spotify
PDF
Interactive Recommender Systems with Netflix and Spotify
PDF
DataEngConf: Building a Music Recommender System from Scratch with Spotify Da...
PDF
How data drives spotify
PDF
The Spotify Brand
PPTX
Product School - Spotify presentation
PPTX
Social Media Monitoring: The Case of Spotify
Big data and machine learning @ Spotify
The Evolution of Hadoop at Spotify - Through Failures and Pain
Data at Spotify
Big Data At Spotify
Building Data Pipelines for Music Recommendations at Spotify
Scala Data Pipelines for Music Recommendations
Spotify Discover Weekly: The machine learning behind your music recommendations
The Evolution of Big Data at Spotify
Scala Data Pipelines @ Spotify
How Apache Drives Music Recommendations At Spotify
Personalizing the listening experience
Music Personalization : Real time Platforms.
CF Models for Music Recommendations At Spotify
Playlist Recommendations @ Spotify
Interactive Recommender Systems with Netflix and Spotify
DataEngConf: Building a Music Recommender System from Scratch with Spotify Da...
How data drives spotify
The Spotify Brand
Product School - Spotify presentation
Social Media Monitoring: The Case of Spotify
Ad

Viewers also liked (20)

PPTX
Gracenote Music Recognition, Metadata, and Discovery APIs
PDF
Qcon London 2017 - Architecture overhaul - Ad serving @ Spotify scale
PPTX
Spotify - Pesquisa de Marketing
PPTX
Spotify
PDF
Scaling Operations At Spotify
PDF
A Spotify Presentation - Case studies
PDF
Design in Tech Report 2017
PDF
Growing up with agile - how the Spotify 'model' has evolved
PDF
Office365のID基盤活用とセキュリティ上の注意点
PDF
Testing at Spotify
PPTX
Hackathons Are Eating The World
PDF
React Nativeはクロスプラットフォームモバイルアプリ開発の夢を見るか #DroidKaigi
PDF
Playlists at Spotify - Using Cassandra to store version controlled objects
PDF
Docker at Spotify - Dockercon14
PPTX
The Next Tsunami AI Blockchain IOT and Our Swarm Evolutionary Singularity
PDF
Empowering Engineering Talent - an update from Spotify
PDF
How spotify makes product
PDF
Activation: From thinking to tweaking it, how we do it at Spotify
PDF
Quality Built In @ Spotify
PDF
The Spotify Playbook
Gracenote Music Recognition, Metadata, and Discovery APIs
Qcon London 2017 - Architecture overhaul - Ad serving @ Spotify scale
Spotify - Pesquisa de Marketing
Spotify
Scaling Operations At Spotify
A Spotify Presentation - Case studies
Design in Tech Report 2017
Growing up with agile - how the Spotify 'model' has evolved
Office365のID基盤活用とセキュリティ上の注意点
Testing at Spotify
Hackathons Are Eating The World
React Nativeはクロスプラットフォームモバイルアプリ開発の夢を見るか #DroidKaigi
Playlists at Spotify - Using Cassandra to store version controlled objects
Docker at Spotify - Dockercon14
The Next Tsunami AI Blockchain IOT and Our Swarm Evolutionary Singularity
Empowering Engineering Talent - an update from Spotify
How spotify makes product
Activation: From thinking to tweaking it, how we do it at Spotify
Quality Built In @ Spotify
The Spotify Playbook
Ad

Similar to Machine Learning and Big Data for Music Discovery at Spotify (20)

PDF
Deep Learning Meetup #5
PDF
Literature Survey for Music Genre Classification Using Neural Network
PDF
Understanding Music Playlists
PDF
Igor Kostiuk “Как приручить музыкальную рекомендательную систему”
PDF
AI&BigData Lab 2016. Игорь Костюк: Как приручить музыкальную рекомендательную...
PDF
IRJET- Machine Learning and Noise Reduction Techniques for Music Genre Classi...
PDF
IRJET- A Personalized Music Recommendation System
PDF
Btp 1st
PPTX
PPTX
Understanding ai music discovery and recommendation systems
PDF
Music Genre Classification using Machine Learning
PDF
IRJET- Music Genre Recognition using Convolution Neural Network
PDF
Hsjs.pdf
PDF
IRJET- Music Genre Classification using Machine Learning Algorithms: A Compar...
PDF
DHRUV_rawat_21scse1011607_project_report.pdf
PDF
Bangla song genre recognition using artificial neural network
PPTX
Music recommendations model using natural language processing
PDF
Aiml ppt pdf.pdf on music recommendation system
PDF
FORECASTING MUSIC GENRE (RNN - LSTM)
PDF
IRJET- Implementing Musical Instrument Recognition using CNN and SVM
Deep Learning Meetup #5
Literature Survey for Music Genre Classification Using Neural Network
Understanding Music Playlists
Igor Kostiuk “Как приручить музыкальную рекомендательную систему”
AI&BigData Lab 2016. Игорь Костюк: Как приручить музыкальную рекомендательную...
IRJET- Machine Learning and Noise Reduction Techniques for Music Genre Classi...
IRJET- A Personalized Music Recommendation System
Btp 1st
Understanding ai music discovery and recommendation systems
Music Genre Classification using Machine Learning
IRJET- Music Genre Recognition using Convolution Neural Network
Hsjs.pdf
IRJET- Music Genre Classification using Machine Learning Algorithms: A Compar...
DHRUV_rawat_21scse1011607_project_report.pdf
Bangla song genre recognition using artificial neural network
Music recommendations model using natural language processing
Aiml ppt pdf.pdf on music recommendation system
FORECASTING MUSIC GENRE (RNN - LSTM)
IRJET- Implementing Musical Instrument Recognition using CNN and SVM

Recently uploaded (20)

PPTX
Understanding_Digital_Forensics_Presentation.pptx
PDF
Encapsulation_ Review paper, used for researhc scholars
PDF
Network Security Unit 5.pdf for BCA BBA.
PDF
Unlocking AI with Model Context Protocol (MCP)
PDF
NewMind AI Monthly Chronicles - July 2025
PDF
Spectral efficient network and resource selection model in 5G networks
PDF
Build a system with the filesystem maintained by OSTree @ COSCUP 2025
PDF
Empathic Computing: Creating Shared Understanding
PDF
Building Integrated photovoltaic BIPV_UPV.pdf
PDF
Shreyas Phanse Resume: Experienced Backend Engineer | Java • Spring Boot • Ka...
PPTX
KOM of Painting work and Equipment Insulation REV00 update 25-dec.pptx
PPTX
Digital-Transformation-Roadmap-for-Companies.pptx
PPTX
20250228 LYD VKU AI Blended-Learning.pptx
PPTX
Effective Security Operations Center (SOC) A Modern, Strategic, and Threat-In...
PDF
Mobile App Security Testing_ A Comprehensive Guide.pdf
PDF
The Rise and Fall of 3GPP – Time for a Sabbatical?
PDF
Blue Purple Modern Animated Computer Science Presentation.pdf.pdf
PPTX
VMware vSphere Foundation How to Sell Presentation-Ver1.4-2-14-2024.pptx
PPTX
PA Analog/Digital System: The Backbone of Modern Surveillance and Communication
DOCX
The AUB Centre for AI in Media Proposal.docx
Understanding_Digital_Forensics_Presentation.pptx
Encapsulation_ Review paper, used for researhc scholars
Network Security Unit 5.pdf for BCA BBA.
Unlocking AI with Model Context Protocol (MCP)
NewMind AI Monthly Chronicles - July 2025
Spectral efficient network and resource selection model in 5G networks
Build a system with the filesystem maintained by OSTree @ COSCUP 2025
Empathic Computing: Creating Shared Understanding
Building Integrated photovoltaic BIPV_UPV.pdf
Shreyas Phanse Resume: Experienced Backend Engineer | Java • Spring Boot • Ka...
KOM of Painting work and Equipment Insulation REV00 update 25-dec.pptx
Digital-Transformation-Roadmap-for-Companies.pptx
20250228 LYD VKU AI Blended-Learning.pptx
Effective Security Operations Center (SOC) A Modern, Strategic, and Threat-In...
Mobile App Security Testing_ A Comprehensive Guide.pdf
The Rise and Fall of 3GPP – Time for a Sabbatical?
Blue Purple Modern Animated Computer Science Presentation.pdf.pdf
VMware vSphere Foundation How to Sell Presentation-Ver1.4-2-14-2024.pptx
PA Analog/Digital System: The Backbone of Modern Surveillance and Communication
The AUB Centre for AI in Media Proposal.docx

Machine Learning and Big Data for Music Discovery at Spotify