SlideShare a Scribd company logo
Who’s to say what’s funny?
A computer using Language Models and Deep Learning,
That’s Who!
Xinru Yan & Ted Pedersen
{yanxx418,tpederse}@d.umn.edu
Department of Computer Science University of Minnesota Duluth
The Problem
• Traditional humor detection: binary classification
Our focus: learn a continuous and subjective sense of
humor from tweets submitted to the @midnight show in
response to hashtags by using Ngram Language
Models (LMs) and Deep Learning (DL) methods
• We participated in SemEval-2017 Task 6
#HashtagWars: Learning a Sense of Humor [1]:
• Tweets are in three baskets: top most funny tweet, next
nine most funny tweets and all remaining
• Two Subtasks
• A: Pairwise Comparison – a system should choose a
funnier of two tweets given a hashtag file
• B: Semi-Ranking – a system should categorize tweets
into the right baskets given a hashtag file
• Dataset
• Tweet Data: provided by the task, 106 hashtag files,
about 21,580 tokens
• News Data: We used 6.2 GB English news, about 2
million tokens [2]
Examples from #BreakUpIn5Words (Trigram LM, News Data)
Tweet @midnight LM DL
It’s not you, it’s meth. funniest funny
?
Hey, can we NOT talk? funny funny
You need your own Netflix funny not funny
Figured I’d try being happy. not funny funny
You’re a Mac, I’m PC not funny not funny
Language Models
Ngram LMs learn humor from training data and allow rank-
ing by assigning probability for each statement [3][4]
Language Model Results
We seek high accuracy for A and low distance for B.
Dataset Ngram Accuracy (A) Distance (B)
news 3 0.627 (4th) 0.872 (1st)
news 2 0.624 0.853
tweet 3 0.397 (8th) 0.967 (8th)
tweet 2 0.406 0.944
• The type and the quantity of the corpora is what
really matters –> more tweet data, less news data
• Bigram LMs performed slightly better than trigram LMs
–> Unigram and character level LMs
Deep Learning
• Humor relies on creative use of language which causes
too many OOV
• Jokes often include puns based on invented words
Barktender #DogJobs
Tinderella #UpdateAFairyTale
• Token-level LMs can not understand such puns
• Character-based CNNs (CharCNN) are not dependent
on observing tokens in training data
• Bigram and trigram LMs only use two or three preceding
words to predict the next word –> LSTMs are good at
making use of sequantial data such as text and are
designed for long-term dependencies
• Some hashtags require tweets to have more than three
words and some funny tweets are mostly made up of
common bigrams or trigrams
Complaining makes it better #AmericaIn4Words
Romantic dinners with the cats #BestWeekendIn5Words
• Ngram LMs do not include external knowledge such as
movie titles and song lyrics –> Create word embeddings
from domain specific materials
• Our plan: use Keras library to train CharCNN + LSTM
LMs on both datasets and investigate ways to include
domain knowledge word embeddings in the CharCNN +
LSTM LM
References
[1] Peter Potash, Alexey Romanov, and Anna Rumshisky.
SemEval-2017 Task 6: #HashtagWars: learning a sense of humor.
In Proceedings of the 11th International Workshop on Semantic Evaluation (SemEval-2017), Vancouver, BC, August 2017.
[2] EMNLP 2011 SIXTH WORKSHOP ON STATISTICAL MACHINE TRANSLATION.
http://www.statmt.org/wmt11/translation-task.html.
[3] Kenneth Heafield, Ivan Pouzyrevsky, Jonathan H. Clark, and Philipp Koehn.
Scalable modified Kneser-Ney language model estimation.
In Proceedings of the 51st Annual Meeting of the Association for Computational Linguistics, Sofia, Bulgaria, August 2013.
[4] Cristian Danescu-Niculescu-Mizil, Justin Cheng, Jon Kleinberg, and Lillian Lee.
You had me at hello: How phrasing affects memorability.
In Proceedings of the 50th Annual Meeting of the Association for Computational Linguistics, Stroudsburg, PA, USA, 2012.

More Related Content

PPTX
Linguistic Cues to Deception: Identifying Political Trolls on Social Media
PPTX
Insights into the Twitterverse: Benchmarking and analysis twitter content
PPTX
Pydata Taipei 2020
PDF
Spotle AI-thon Top 10 Showcase - Analysing Mental Health Of India - Cyber Pun...
PDF
Spotle AI-thon Top 10 Showcase - Analysing Mental Health Of India - Team Zer...
ODP
Detecting Trends Through Twitter Stream v2
PDF
Twitter Sentiment and Network Analysis
PPTX
Sentiment Analysis of Film-Related Messages on Social Media
Linguistic Cues to Deception: Identifying Political Trolls on Social Media
Insights into the Twitterverse: Benchmarking and analysis twitter content
Pydata Taipei 2020
Spotle AI-thon Top 10 Showcase - Analysing Mental Health Of India - Cyber Pun...
Spotle AI-thon Top 10 Showcase - Analysing Mental Health Of India - Team Zer...
Detecting Trends Through Twitter Stream v2
Twitter Sentiment and Network Analysis
Sentiment Analysis of Film-Related Messages on Social Media

Similar to Who's to say what's funny? A computer using Language Models and Deep Learning, That's Who! (20)

PDF
Duluth at Semeval 2017 Task 6 - Language Models in Humor Detection
PPTX
Computational Humor: Can a machine have a sense of humor (December 2022)
PPTX
Computational Humor: Can a machine have a sense of humor (2022)
PPTX
Computational Humor: Can a machine have a sense of humor? (2020)
PDF
What if a Robot Could Laugh at Your Jokes? (Laughter Prediction with Natural ...
PPTX
Neuro-Symbolic Creative Artificial Intelligence for Humor (PhD Defense)
PPTX
How do you teach computers humor + Text Generators as Creative Partners (May ...
PPTX
NLP Introduction and basics of natural language processing
PDF
Representation Learning of Text for NLP
PDF
Anthiil Inside workshop on NLP
PDF
MACHINE-DRIVEN TEXT ANALYSIS
PPTX
A Panorama of Natural Language Processing
PDF
Word2vec and Friends
PPTX
NLP Bootcamp
PDF
Deep learning for nlp
PDF
Natural Language Processing
PPTX
LLMSDFSDFSDFSDFDFSDFSDFDSFSDFSDFDS24aug.pptx
PPTX
LLLLLLLLLLLLLLLLLLLLLLLLLLLLLLLLLLLLLLLLM24aug.pptx
PPTX
Recent Advances in Natural Language Processing
PDF
Castro - 2018 - A Crowd-Annotated Spanish Corpus for Humour Analysis
Duluth at Semeval 2017 Task 6 - Language Models in Humor Detection
Computational Humor: Can a machine have a sense of humor (December 2022)
Computational Humor: Can a machine have a sense of humor (2022)
Computational Humor: Can a machine have a sense of humor? (2020)
What if a Robot Could Laugh at Your Jokes? (Laughter Prediction with Natural ...
Neuro-Symbolic Creative Artificial Intelligence for Humor (PhD Defense)
How do you teach computers humor + Text Generators as Creative Partners (May ...
NLP Introduction and basics of natural language processing
Representation Learning of Text for NLP
Anthiil Inside workshop on NLP
MACHINE-DRIVEN TEXT ANALYSIS
A Panorama of Natural Language Processing
Word2vec and Friends
NLP Bootcamp
Deep learning for nlp
Natural Language Processing
LLMSDFSDFSDFSDFDFSDFSDFDSFSDFSDFDS24aug.pptx
LLLLLLLLLLLLLLLLLLLLLLLLLLLLLLLLLLLLLLLLM24aug.pptx
Recent Advances in Natural Language Processing
Castro - 2018 - A Crowd-Annotated Spanish Corpus for Humour Analysis
Ad

More from University of Minnesota, Duluth (20)

PPTX
Muslims in Machine Learning workshop (NeurlPS 2021) - Automatically Identifyi...
PDF
Automatically Identifying Islamophobia in Social Media
PPTX
What Makes Hate Speech : an interactive workshop
PDF
Algorithmic Bias - What is it? Why should we care? What can we do about it?
PDF
Algorithmic Bias : What is it? Why should we care? What can we do about it?
PDF
Duluth at Semeval 2017 Task 7 - Puns upon a Midnight Dreary, Lexical Semantic...
PDF
Puns upon a midnight dreary, lexical semantics for the weak and weary
PDF
The horizon isn't found in a dictionary : Identifying emerging word senses a...
PDF
Screening Twitter Users for Depression and PTSD
PDF
Duluth : Word Sense Discrimination in the Service of Lexicography
PDF
Pedersen masters-thesis-oct-10-2014
PDF
MICAI 2013 Tutorial Slides - Measuring the Similarity and Relatedness of Conc...
PDF
What it's like to do a Master's thesis with me (Ted Pedersen)
PDF
Pedersen naacl-2013-demo-poster-may25
PDF
Pedersen semeval-2013-poster-may24
ODP
Talk at UAB, April 12, 2013
ODP
Feb20 mayo-webinar-21feb2012
ODP
Ihi2012 semantic-similarity-tutorial-part1
ODP
Pedersen ACL Disco-2011 workshop
PPT
Pedersen acl2011-business-meeting
Muslims in Machine Learning workshop (NeurlPS 2021) - Automatically Identifyi...
Automatically Identifying Islamophobia in Social Media
What Makes Hate Speech : an interactive workshop
Algorithmic Bias - What is it? Why should we care? What can we do about it?
Algorithmic Bias : What is it? Why should we care? What can we do about it?
Duluth at Semeval 2017 Task 7 - Puns upon a Midnight Dreary, Lexical Semantic...
Puns upon a midnight dreary, lexical semantics for the weak and weary
The horizon isn't found in a dictionary : Identifying emerging word senses a...
Screening Twitter Users for Depression and PTSD
Duluth : Word Sense Discrimination in the Service of Lexicography
Pedersen masters-thesis-oct-10-2014
MICAI 2013 Tutorial Slides - Measuring the Similarity and Relatedness of Conc...
What it's like to do a Master's thesis with me (Ted Pedersen)
Pedersen naacl-2013-demo-poster-may25
Pedersen semeval-2013-poster-may24
Talk at UAB, April 12, 2013
Feb20 mayo-webinar-21feb2012
Ihi2012 semantic-similarity-tutorial-part1
Pedersen ACL Disco-2011 workshop
Pedersen acl2011-business-meeting
Ad

Recently uploaded (20)

PPTX
Lesson notes of climatology university.
PPTX
1st Inaugural Professorial Lecture held on 19th February 2020 (Governance and...
PDF
VCE English Exam - Section C Student Revision Booklet
PPTX
Final Presentation General Medicine 03-08-2024.pptx
PDF
Supply Chain Operations Speaking Notes -ICLT Program
PDF
Physiotherapy_for_Respiratory_and_Cardiac_Problems WEBBER.pdf
PDF
Sports Quiz easy sports quiz sports quiz
PDF
ANTIBIOTICS.pptx.pdf………………… xxxxxxxxxxxxx
PPTX
human mycosis Human fungal infections are called human mycosis..pptx
PPTX
Pharmacology of Heart Failure /Pharmacotherapy of CHF
PPTX
Renaissance Architecture: A Journey from Faith to Humanism
PDF
grade 11-chemistry_fetena_net_5883.pdf teacher guide for all student
PDF
O7-L3 Supply Chain Operations - ICLT Program
PDF
Insiders guide to clinical Medicine.pdf
PDF
Saundersa Comprehensive Review for the NCLEX-RN Examination.pdf
PDF
Module 4: Burden of Disease Tutorial Slides S2 2025
PDF
FourierSeries-QuestionsWithAnswers(Part-A).pdf
PPTX
Cell Types and Its function , kingdom of life
PPTX
Institutional Correction lecture only . . .
PPTX
Pharma ospi slides which help in ospi learning
Lesson notes of climatology university.
1st Inaugural Professorial Lecture held on 19th February 2020 (Governance and...
VCE English Exam - Section C Student Revision Booklet
Final Presentation General Medicine 03-08-2024.pptx
Supply Chain Operations Speaking Notes -ICLT Program
Physiotherapy_for_Respiratory_and_Cardiac_Problems WEBBER.pdf
Sports Quiz easy sports quiz sports quiz
ANTIBIOTICS.pptx.pdf………………… xxxxxxxxxxxxx
human mycosis Human fungal infections are called human mycosis..pptx
Pharmacology of Heart Failure /Pharmacotherapy of CHF
Renaissance Architecture: A Journey from Faith to Humanism
grade 11-chemistry_fetena_net_5883.pdf teacher guide for all student
O7-L3 Supply Chain Operations - ICLT Program
Insiders guide to clinical Medicine.pdf
Saundersa Comprehensive Review for the NCLEX-RN Examination.pdf
Module 4: Burden of Disease Tutorial Slides S2 2025
FourierSeries-QuestionsWithAnswers(Part-A).pdf
Cell Types and Its function , kingdom of life
Institutional Correction lecture only . . .
Pharma ospi slides which help in ospi learning

Who's to say what's funny? A computer using Language Models and Deep Learning, That's Who!

  • 1. Who’s to say what’s funny? A computer using Language Models and Deep Learning, That’s Who! Xinru Yan & Ted Pedersen {yanxx418,tpederse}@d.umn.edu Department of Computer Science University of Minnesota Duluth The Problem • Traditional humor detection: binary classification Our focus: learn a continuous and subjective sense of humor from tweets submitted to the @midnight show in response to hashtags by using Ngram Language Models (LMs) and Deep Learning (DL) methods • We participated in SemEval-2017 Task 6 #HashtagWars: Learning a Sense of Humor [1]: • Tweets are in three baskets: top most funny tweet, next nine most funny tweets and all remaining • Two Subtasks • A: Pairwise Comparison – a system should choose a funnier of two tweets given a hashtag file • B: Semi-Ranking – a system should categorize tweets into the right baskets given a hashtag file • Dataset • Tweet Data: provided by the task, 106 hashtag files, about 21,580 tokens • News Data: We used 6.2 GB English news, about 2 million tokens [2] Examples from #BreakUpIn5Words (Trigram LM, News Data) Tweet @midnight LM DL It’s not you, it’s meth. funniest funny ? Hey, can we NOT talk? funny funny You need your own Netflix funny not funny Figured I’d try being happy. not funny funny You’re a Mac, I’m PC not funny not funny Language Models Ngram LMs learn humor from training data and allow rank- ing by assigning probability for each statement [3][4] Language Model Results We seek high accuracy for A and low distance for B. Dataset Ngram Accuracy (A) Distance (B) news 3 0.627 (4th) 0.872 (1st) news 2 0.624 0.853 tweet 3 0.397 (8th) 0.967 (8th) tweet 2 0.406 0.944 • The type and the quantity of the corpora is what really matters –> more tweet data, less news data • Bigram LMs performed slightly better than trigram LMs –> Unigram and character level LMs Deep Learning • Humor relies on creative use of language which causes too many OOV • Jokes often include puns based on invented words Barktender #DogJobs Tinderella #UpdateAFairyTale • Token-level LMs can not understand such puns • Character-based CNNs (CharCNN) are not dependent on observing tokens in training data • Bigram and trigram LMs only use two or three preceding words to predict the next word –> LSTMs are good at making use of sequantial data such as text and are designed for long-term dependencies • Some hashtags require tweets to have more than three words and some funny tweets are mostly made up of common bigrams or trigrams Complaining makes it better #AmericaIn4Words Romantic dinners with the cats #BestWeekendIn5Words • Ngram LMs do not include external knowledge such as movie titles and song lyrics –> Create word embeddings from domain specific materials • Our plan: use Keras library to train CharCNN + LSTM LMs on both datasets and investigate ways to include domain knowledge word embeddings in the CharCNN + LSTM LM References [1] Peter Potash, Alexey Romanov, and Anna Rumshisky. SemEval-2017 Task 6: #HashtagWars: learning a sense of humor. In Proceedings of the 11th International Workshop on Semantic Evaluation (SemEval-2017), Vancouver, BC, August 2017. [2] EMNLP 2011 SIXTH WORKSHOP ON STATISTICAL MACHINE TRANSLATION. http://www.statmt.org/wmt11/translation-task.html. [3] Kenneth Heafield, Ivan Pouzyrevsky, Jonathan H. Clark, and Philipp Koehn. Scalable modified Kneser-Ney language model estimation. In Proceedings of the 51st Annual Meeting of the Association for Computational Linguistics, Sofia, Bulgaria, August 2013. [4] Cristian Danescu-Niculescu-Mizil, Justin Cheng, Jon Kleinberg, and Lillian Lee. You had me at hello: How phrasing affects memorability. In Proceedings of the 50th Annual Meeting of the Association for Computational Linguistics, Stroudsburg, PA, USA, 2012.