A	HYBRID	APPROACH	FOR	VERIFYING	MULTIMEDIA	USE	ON	TWITTER
Quoc-Tin	Phan,	Alessandro	Budroni,	Cecilia	Pasquini,	Francesco	G.	B.	De	Natale
Department	of	Information	Engineering	and	Computer	Science	– University	of	Trento,	Italy
INTRODUCTION
Can	you	recognize?	They	are	“FAKE”.
EXISTING	APPROACHES
THE	PROPOSED	METHOD
Schema	of	the	proposed	method.
MULTIMEDIA	ASSESSMENT
1. Search by keywords: online web search using relevant keywords associated to the
event.
2. Search by image/video: Google reverse image search and comment retrieval from
YouTube.
3. Forensic feature extraction: Non-Aligned Double JPEG Compression, Block Artifact
Grid, and Error Level Analysis. We seek for highest-probable blocks which may
undergo modifications and extract statistical features as min, max, mean, and
variance.
4. Textual feature extraction:
4.1 Extract most relevant terms from results of 1. as bag-of-words.
4.2 From results of 2., calculate term frequency of bag-of-words from 4.1.
4.3 Calculate term frequency of bag of negative, positive and “fake” words.
4.4 Concatenate features from 4.2 and 4.3 to form textual features.
5. Textual features together with forensic features are fed to Classifier 1.
RESULTS	AND	DISCUSSIONS
Multimedia	Signal	Processing	and	Understanding	Lab,	University	of	Trento,	Italy
✘Not	useful	with	short	text	
and multiple	languages.
✘Not	take	into	account	
multimedia	content.
Hurricane	
Sandy	
sharing
sharing
fake	topic
real	topic
Unreliable	information	about	events	and	news	sharing	over	Online	
Social	Networks	might	cause	negative	consequences	on	community
GIVEN:	A	TWEET	comprising	
<text,	images	/	video>
REAL	/	FAKE
INPUT
OUTPUT
SYNTHETIC MANIPULATION
Text-based
Multimedia-Forensic-
based
User-based
✘Sensitive	to	subsequent
modifications and	compression.
Multimedia
Event
Post
User
Forensic	feature	
extraction
Search	by	
image/video
Search	by	keywords
Forensic	
features
Textual	
features
Textual	feature	
extraction
Classifier	 1
Post-based	
features
User-based	
features
Classifier	 2Concatenate
Score	fusion
Final	
decision
Post-based	 feature	
extraction
User-based	 feature	
extraction
Concatenate
Multimedia	
assessment
Tweet	credibility	
assessment
TWEET	CREDIBILITY	ASSESSMENT
1. Post-based feature extraction: useful features reflecting the credibility of a tweet
post are extracted, i.e. whether the tweet contains “?” or “!”, number of negative
sentiment words.
2. User-based feature extraction: useful features reflecting the credibility of a user are
extracted, i.e. number of followers the user has, whether the user is verified by
Twitter.
3. Post-based features together with user-based features are fed to Classifier 2.
WRONG	CONTEXT
SCORE	FUSION
With the assumption that a tweet sharing fake images or videos is likely to be fake,
higher weight is assigned to the output from Classifier 1, lower weight is assigned to the
output from Classifier 2.
In the sub-task, we submitted RUN 1 applying only forensic features,
and RUN 2 applying both textual features and forensic features.
In the main task, we submitted three RUNs: i) RUN 1: applied only the second
classification tier, ii) RUN 2: applied two-tier classification and 0.8 : 0.2 fusion strategy,
answered UNKNOWN to cases suffered from online searching errors, iii) RUN 3: same as
RUN 2, considered the output of classification tier 2 instead of UNKNOWN.
The proposed method is subject to online search errors, which happen to videos NOT
hosted by Youtube.
Recall Precision F1-score
RUN	1 0.5 0.48 0.49
RUN	2 0.93 0.49 0.64
Our method gains recall if we take into account textual features
acquired from online text search and image reverse search. This
approach effectively reduces false negative rate.
Recall Precision F1-score
RUN	1 0.55 0.71 0.62
RUN	2 0.94 0.81 0.87
RUN 3 0.94 0.74 0.83

More Related Content

PPTX
Pydata Taipei 2020
PDF
IRJET - Suicidal Text Detection using Machine Learning
PPTX
Link prediction with the linkpred tool
DOCX
Sentiment analysis using machine learning
PDF
Social Sensor for Real Time Event Detection
PPTX
CML's Presentation at FengChia University
PDF
H017445260
PDF
B017441015
Pydata Taipei 2020
IRJET - Suicidal Text Detection using Machine Learning
Link prediction with the linkpred tool
Sentiment analysis using machine learning
Social Sensor for Real Time Event Detection
CML's Presentation at FengChia University
H017445260
B017441015

Viewers also liked (7)

PDF
MediaEval 2016: A Multimodal System for the Verifying Multimedia Use Task
PDF
MediaEval 2015 - MediaEval 2015 Drone Protect Task: Privacy Protection in Sur...
PDF
MediaEval 2015 - Privacy Protection Filter Using StegoScrambling in Video Sur...
PDF
MediaEval 2015 - UNED-UV @ Retrieving Diverse Social Images Task - Poster
PDF
MediaEval 2016 - TUD-MMC Predicting media Interestingness Task
PDF
MediaEval 2016 - Emotional Impact of Movies Task
PDF
MediaEval 2016 - Emotion in Music Task: Lessons Learned
MediaEval 2016: A Multimodal System for the Verifying Multimedia Use Task
MediaEval 2015 - MediaEval 2015 Drone Protect Task: Privacy Protection in Sur...
MediaEval 2015 - Privacy Protection Filter Using StegoScrambling in Video Sur...
MediaEval 2015 - UNED-UV @ Retrieving Diverse Social Images Task - Poster
MediaEval 2016 - TUD-MMC Predicting media Interestingness Task
MediaEval 2016 - Emotional Impact of Movies Task
MediaEval 2016 - Emotion in Music Task: Lessons Learned
Ad

Similar to MediaEval 2016: A Hybrid Approach for Verifying Multimedia Use on Twitter (16)

PDF
MediaEval 2016 - Verifying Multimedia Use Task Overview
PDF
MediaEval 2016: A Multimodal System for the Verifying Multimedia Use Task
PPTX
Learning to detect Misleading Content on Twitter
PPTX
Verifying Multimedia Use at MediaEval 2016
PDF
News Reliability Evaluation using Latent Semantic Analysis
PPTX
Verifying Multimedia Use at MediaEval 2015
PPTX
Verifying Multimedia Use at MediaEval 2015
PDF
MediaEval 2015 - Verifying Multimedia Use at MediaEval 2015
PDF
20574-38941-1-PB.pdf
PPTX
Fake news detection using machine learning
PDF
IRJET- Fake News Detection using Logistic Regression
PDF
Fake News Detection
PDF
Fake news Detection using Machine Learning
PPTX
Computational Verification Challenges in Social Media
PDF
IRJET- Authentic News Summarization
PDF
Presentation of the InVID tool for social media verification
MediaEval 2016 - Verifying Multimedia Use Task Overview
MediaEval 2016: A Multimodal System for the Verifying Multimedia Use Task
Learning to detect Misleading Content on Twitter
Verifying Multimedia Use at MediaEval 2016
News Reliability Evaluation using Latent Semantic Analysis
Verifying Multimedia Use at MediaEval 2015
Verifying Multimedia Use at MediaEval 2015
MediaEval 2015 - Verifying Multimedia Use at MediaEval 2015
20574-38941-1-PB.pdf
Fake news detection using machine learning
IRJET- Fake News Detection using Logistic Regression
Fake News Detection
Fake news Detection using Machine Learning
Computational Verification Challenges in Social Media
IRJET- Authentic News Summarization
Presentation of the InVID tool for social media verification
Ad

More from multimediaeval (20)

PPTX
Classification of Strokes in Table Tennis with a Three Stream Spatio-Temporal...
PDF
HCMUS at MediaEval 2020: Ensembles of Temporal Deep Neural Networks for Table...
PDF
Sports Video Classification: Classification of Strokes in Table Tennis for Me...
PDF
Predicting Media Memorability from a Multimodal Late Fusion of Self-Attention...
PPTX
Essex-NLIP at MediaEval Predicting Media Memorability 2020 Task
PDF
Overview of MediaEval 2020 Predicting Media Memorability task: What Makes a V...
PDF
Fooling an Automatic Image Quality Estimator
PDF
Fooling Blind Image Quality Assessment by Optimizing a Human-Understandable C...
PDF
Pixel Privacy: Quality Camouflage for Social Images
PDF
HCMUS at MediaEval 2020:Image-Text Fusion for Automatic News-Images Re-Matching
PPTX
Efficient Supervision Net: Polyp Segmentation using EfficientNet and Attentio...
PDF
HCMUS at Medico Automatic Polyp Segmentation Task 2020: PraNet and ResUnet++ ...
PDF
Depth-wise Separable Atrous Convolution for Polyps Segmentation in Gastro-Int...
PPTX
Deep Conditional Adversarial learning for polyp Segmentation
PPTX
A Temporal-Spatial Attention Model for Medical Image Detection
PPTX
HCMUS-Juniors 2020 at Medico Task in MediaEval 2020: Refined Deep Neural Netw...
PDF
Fine-tuning for Polyp Segmentation with Attention
PPTX
Bigger Networks are not Always Better: Deep Convolutional Neural Networks for...
PPTX
Insights for wellbeing: Predicting Personal Air Quality Index using Regressio...
PDF
Use Visual Features From Surrounding Scenes to Improve Personal Air Quality ...
Classification of Strokes in Table Tennis with a Three Stream Spatio-Temporal...
HCMUS at MediaEval 2020: Ensembles of Temporal Deep Neural Networks for Table...
Sports Video Classification: Classification of Strokes in Table Tennis for Me...
Predicting Media Memorability from a Multimodal Late Fusion of Self-Attention...
Essex-NLIP at MediaEval Predicting Media Memorability 2020 Task
Overview of MediaEval 2020 Predicting Media Memorability task: What Makes a V...
Fooling an Automatic Image Quality Estimator
Fooling Blind Image Quality Assessment by Optimizing a Human-Understandable C...
Pixel Privacy: Quality Camouflage for Social Images
HCMUS at MediaEval 2020:Image-Text Fusion for Automatic News-Images Re-Matching
Efficient Supervision Net: Polyp Segmentation using EfficientNet and Attentio...
HCMUS at Medico Automatic Polyp Segmentation Task 2020: PraNet and ResUnet++ ...
Depth-wise Separable Atrous Convolution for Polyps Segmentation in Gastro-Int...
Deep Conditional Adversarial learning for polyp Segmentation
A Temporal-Spatial Attention Model for Medical Image Detection
HCMUS-Juniors 2020 at Medico Task in MediaEval 2020: Refined Deep Neural Netw...
Fine-tuning for Polyp Segmentation with Attention
Bigger Networks are not Always Better: Deep Convolutional Neural Networks for...
Insights for wellbeing: Predicting Personal Air Quality Index using Regressio...
Use Visual Features From Surrounding Scenes to Improve Personal Air Quality ...

Recently uploaded (20)

PPT
Computional quantum chemistry study .ppt
PPTX
Understanding the Circulatory System……..
PPT
Heredity-grade-9 Heredity-grade-9. Heredity-grade-9.
PPTX
POULTRY PRODUCTION AND MANAGEMENTNNN.pptx
PPTX
Presentation1 INTRODUCTION TO ENZYMES.pptx
PPTX
PMR- PPT.pptx for students and doctors tt
PPT
LEC Synthetic Biology and its application.ppt
PPTX
GREEN FIELDS SCHOOL PPT ON HOLIDAY HOMEWORK
PPTX
A powerpoint on colorectal cancer with brief background
PDF
Communicating Health Policies to Diverse Populations (www.kiu.ac.ug)
PPTX
Introcution to Microbes Burton's Biology for the Health
PDF
Looking into the jet cone of the neutrino-associated very high-energy blazar ...
PPTX
ap-psych-ch-1-introduction-to-psychology-presentation.pptx
PPT
1. INTRODUCTION TO EPIDEMIOLOGY.pptx for community medicine
PPTX
Microbes in human welfare class 12 .pptx
PDF
Is Earendel a Star Cluster?: Metal-poor Globular Cluster Progenitors at z ∼ 6
PDF
Social preventive and pharmacy. Pdf
PDF
Worlds Next Door: A Candidate Giant Planet Imaged in the Habitable Zone of ↵ ...
PPT
veterinary parasitology ````````````.ppt
PPTX
SCIENCE 4 Q2W5 PPT.pptx Lesson About Plnts and animals and their habitat
Computional quantum chemistry study .ppt
Understanding the Circulatory System……..
Heredity-grade-9 Heredity-grade-9. Heredity-grade-9.
POULTRY PRODUCTION AND MANAGEMENTNNN.pptx
Presentation1 INTRODUCTION TO ENZYMES.pptx
PMR- PPT.pptx for students and doctors tt
LEC Synthetic Biology and its application.ppt
GREEN FIELDS SCHOOL PPT ON HOLIDAY HOMEWORK
A powerpoint on colorectal cancer with brief background
Communicating Health Policies to Diverse Populations (www.kiu.ac.ug)
Introcution to Microbes Burton's Biology for the Health
Looking into the jet cone of the neutrino-associated very high-energy blazar ...
ap-psych-ch-1-introduction-to-psychology-presentation.pptx
1. INTRODUCTION TO EPIDEMIOLOGY.pptx for community medicine
Microbes in human welfare class 12 .pptx
Is Earendel a Star Cluster?: Metal-poor Globular Cluster Progenitors at z ∼ 6
Social preventive and pharmacy. Pdf
Worlds Next Door: A Candidate Giant Planet Imaged in the Habitable Zone of ↵ ...
veterinary parasitology ````````````.ppt
SCIENCE 4 Q2W5 PPT.pptx Lesson About Plnts and animals and their habitat

MediaEval 2016: A Hybrid Approach for Verifying Multimedia Use on Twitter

  • 1. A HYBRID APPROACH FOR VERIFYING MULTIMEDIA USE ON TWITTER Quoc-Tin Phan, Alessandro Budroni, Cecilia Pasquini, Francesco G. B. De Natale Department of Information Engineering and Computer Science – University of Trento, Italy INTRODUCTION Can you recognize? They are “FAKE”. EXISTING APPROACHES THE PROPOSED METHOD Schema of the proposed method. MULTIMEDIA ASSESSMENT 1. Search by keywords: online web search using relevant keywords associated to the event. 2. Search by image/video: Google reverse image search and comment retrieval from YouTube. 3. Forensic feature extraction: Non-Aligned Double JPEG Compression, Block Artifact Grid, and Error Level Analysis. We seek for highest-probable blocks which may undergo modifications and extract statistical features as min, max, mean, and variance. 4. Textual feature extraction: 4.1 Extract most relevant terms from results of 1. as bag-of-words. 4.2 From results of 2., calculate term frequency of bag-of-words from 4.1. 4.3 Calculate term frequency of bag of negative, positive and “fake” words. 4.4 Concatenate features from 4.2 and 4.3 to form textual features. 5. Textual features together with forensic features are fed to Classifier 1. RESULTS AND DISCUSSIONS Multimedia Signal Processing and Understanding Lab, University of Trento, Italy ✘Not useful with short text and multiple languages. ✘Not take into account multimedia content. Hurricane Sandy sharing sharing fake topic real topic Unreliable information about events and news sharing over Online Social Networks might cause negative consequences on community GIVEN: A TWEET comprising <text, images / video> REAL / FAKE INPUT OUTPUT SYNTHETIC MANIPULATION Text-based Multimedia-Forensic- based User-based ✘Sensitive to subsequent modifications and compression. Multimedia Event Post User Forensic feature extraction Search by image/video Search by keywords Forensic features Textual features Textual feature extraction Classifier 1 Post-based features User-based features Classifier 2Concatenate Score fusion Final decision Post-based feature extraction User-based feature extraction Concatenate Multimedia assessment Tweet credibility assessment TWEET CREDIBILITY ASSESSMENT 1. Post-based feature extraction: useful features reflecting the credibility of a tweet post are extracted, i.e. whether the tweet contains “?” or “!”, number of negative sentiment words. 2. User-based feature extraction: useful features reflecting the credibility of a user are extracted, i.e. number of followers the user has, whether the user is verified by Twitter. 3. Post-based features together with user-based features are fed to Classifier 2. WRONG CONTEXT SCORE FUSION With the assumption that a tweet sharing fake images or videos is likely to be fake, higher weight is assigned to the output from Classifier 1, lower weight is assigned to the output from Classifier 2. In the sub-task, we submitted RUN 1 applying only forensic features, and RUN 2 applying both textual features and forensic features. In the main task, we submitted three RUNs: i) RUN 1: applied only the second classification tier, ii) RUN 2: applied two-tier classification and 0.8 : 0.2 fusion strategy, answered UNKNOWN to cases suffered from online searching errors, iii) RUN 3: same as RUN 2, considered the output of classification tier 2 instead of UNKNOWN. The proposed method is subject to online search errors, which happen to videos NOT hosted by Youtube. Recall Precision F1-score RUN 1 0.5 0.48 0.49 RUN 2 0.93 0.49 0.64 Our method gains recall if we take into account textual features acquired from online text search and image reverse search. This approach effectively reduces false negative rate. Recall Precision F1-score RUN 1 0.55 0.71 0.62 RUN 2 0.94 0.81 0.87 RUN 3 0.94 0.74 0.83