SlideShare a Scribd company logo
Social Web: Where are the Semantics?
ESWC 2014
Miriam Fernández, Victor Rodríguez,
Andrés García-Silva, Oscar Corcho
Ontology Engineering Group, UPM, Spain
Knowledge Media Institute, The Open University
Outline
2
•  Part 1: Understanding Social Media
–  Theory: background & applications described in this tutorial
–  Hands on: data extraction from Twitter and Facebook
•  Part 2: Using semantics to represent data from SNS
–  Theory: Using SW to represent content, users and relations
–  Hands on: applying and extending SIOC
•  Part 3: Using semantics to understand social media conversations
–  Theory: Using semantics to understand topics in social media
–  Hands on: using LDA to extract topics from social media
•  Part 4: Using semantics to understand user behaviour
Implicit vs. Explicit Semantics
•  Implicit Semantics
–  Implicit, also called statistical semantics, focus on extracting word
sense by studying the patterns of human word usage in massive
collections of text or other human generated data
–  It does not rely on an explicit formalisation/conceptualisation of
knowledge
•  Explicit Semantics
–  Explicit semantics focus on the analysis of content by using the
support of explicit conceptualisations in the form of ontologies and
knowledge bases
ESWC 2014 Social Web: Where are the Semantics? 3
Explicit Semantics
Structured
Unstructured
From the Web of human generated content
The Web of unstructured text (Posts / Documents)
and Links
To the Web of machine understandable content
The Web of Objects and Relations
•  The annotators extract entities (classes / individuals) and relations
from the text and link them to object URIs
Obtaining explicit semantics from social media content
Using Semantics to Analyse Topic Evolution
•  LDA topics are identified by a set of keywords
–  Difficult to assess their meaning and evolution
•  Use explicit semantics to characterise topics as concrete entities
6
!
!
Using Semantics to Analyse Topic Evolution
ESWC 2014 Social Web: Where are the Semantics? 7
•  Analyse concepts appearance
–  Within a group
–  Across groups
–  Over time
•  Type filtering
•  Interlinking with other datasets
(data.open.ac.uk)
Using Semantics To Analyse Sentiment
•  Sentiment analysis on social media
–  Offers a fast and cheap access to publics’ feelings towards brands,
business, people, etc.
–  Comes with additional challenges
–  Current approaches
•  Lexical-based
•  Machine Learning
–  Explicit semantics are often neglected
ESWC 2014 Social Web: Where are the Semantics? 8
Using Semantics to Analyse Sentiment
•  Add semantics as additional features into the training set
•  Results
–  Incorporating semantics increases accuracy by 6.5% for negative
sentiment and by 4.8% for positive sentiment
–  The use of explicit semantics is more appropriate when the datasets
being analysed are large and cover a wide range of topics
Saif, Hassan, He, Yulan, Alani, Harith (2012). Semantic sentiment analysis of twitter. In: 11th
International Semantic Web Conference (ISWC 2012)
“Words that occur in
similar context tend
to have similar
meaning”
Wittgenstein (1953)
Using Semantics To Analyse Sentiment
•  SentiCircles
–  Integrates implicit and explicit semantics to analyse sentiment
–  Outperforms other lexicon labeling methods and overtakes the state-of-the-
art SentiStrength approach in accuracy, with a marginal drop in F-measure
ESWC 2014 Social Web: Where are the Semantics? 10
Saif, Hassan, Fernandez, Miriam, He, Yulan, Alani, Harith (2014). SentiCircles for Tweet-level
Sentiment Analysis (ESWC 2014) -> conference presentation on the 27, 14:00!!
Using Semantics To Analyse Sentiment
ESWC 2014 Social Web: Where are the Semantics? 11
Using Semantics to Analyse User Behaviour
•  Goal
–  Monitor and capture member activities
–  Analyse emerging behaviour over time
–  Understand the correlation of behaviour with community evolution
•  Approach
–  Identify behavioural features and behaviour roles
–  Create an ontology to model behavioural roles and behaviour
features
–  Use semantic rules to infer user roles in online communities
–  Study role composition patterns
ESWC 2014 Social Web: Where are the Semantics? 12
Angeletou, S., Rowe, M. and Alani, H. (2011) Modelling and Analysis of User Behaviour in Online
Communities, 10th International Semantic Web Conference (ISWC 2011), Bonn, Germany
Rowe, Matthew; Fernandez, Miriam; Angeletou, Sofia and Alani, Harith (2013). Community analysis through
semantic rules and role composition derivation. Journal of Web Semantics: Science, Services and Agents on
the World Wide Web, 18(1) pp. 31–47
Behavioural roles and features
ESWC 2014 Social Web: Where are the Semantics? 13
Table 1. Roles and the feature-to-level mappings
Role Feature Level
Elitist In-Degree Ratio low
Bi-directional Threads Ratio high
Bi-directional Neighbours Ratio low
Grunt Bi-directional Threads Ratio med
Bi-directional Neighbours Ratio med
Average Posts per Thread low
STD of Posts per Thread low
Joining Conversationalist Thread Initiation Ratio low
Average Posts per Thread high
STD of Posts per Thread high
Popular Initiator In-Degree Ratio high
Thread Initiation Ratio high
Popular Participants In-Degree Ratio high
Thread Initiation Ratio low
Average Posts per Thread med
STD of Posts per Thread med
Supporter In-Degree Ratio med
Bi-directional Threads Ratio med
Bi-directional Neighbours Ratio med
Taciturn Bi-directional Threads Ratio low
Bi-directional Neighbours Ratio low
Average Posts per Thread low
STD of Posts per Thread low
Ignored Posts Replied Ratio low
Jeffrey Chan, Conor Hayes, and Elizabeth Daly. Decomposing discussion forums using
common user roles. In Proc. Web Science Conf. (WebSci10), Raleigh, NC: US, 2010.
Modelling user features and interactions
ESWC 2014 Social Web: Where are the Semantics? 14
http://purl.org/net/oubo/0.3•  OUBO: The OU Behaviour Ontology
Encoding Rules in Ontologies with SPIN
ESWC 2014 Social Web: Where are the Semantics? 15
Apply rules to infer user roles over time
ESWC 2014 Social Web: Where are the Semantics? 16
1.- Construct features for community
users at a given time step
2.- Derive bings using equal
frequency binning
Popularity-low cutoff = 0.5
Initiation-high cutoff = 0.4
3.- Use skeleton rule base to construct
rules using bin levels
Popularity=low, Initiation=high ->roleA
Popularity<0.5, Initiation > 0.4 -> roleA
4.- Apply rules to infer user roles and
community composition
5.- Repeat 1-4 following time steps
Analyse the role composition of the community
ESWC 2014 Social Web: Where are the Semantics? 17
•  Investigate the correlation between the role composition and the
students’ performance
Analyse the role composition of the community
•  Allow Policy Makers to focus on a smaller set of users, with whom
they may want to engage more closely
ESWC 2014 Social Web: Where are the Semantics? 18
Analyse the role composition of the community
•  Development of models to predict community health based on role
compositions and evolution of user behaviour
–  Health Indicators
•  Churn Rate: proportion of users who leave the network in a given time segment
•  User Count: number of users who posted at least once
•  Seeds / Non seeds: proportion of posts that get responses vs. those that don’t
•  Clustering coefficient: measures the cohesion within the network
–  Results
•  Accurate detection of community health is possible using role composition
information
•  There is no “one size fits all” model
ESWC 2014 Social Web: Where are the Semantics? 19
Rowe, M. and Alani, H. (2012) What Makes Communities Tick? Community Health Analysis using Role
compositions. International Conference on Social Computing, 2012
0.0 0.2 0.4 0.6 0.8 1.0
0.00.20.40.60.81.0
Churn Rate
FPR
TPR
0.0 0.2 0.4 0.6 0.8 1.0
0.00.20.40.60.81.0
User Count
FPR
TPR
0.0 0.2 0.4 0.6 0.8 1.0
0.00.20.40.60.81.0
Seeds / Non−seeds Prop
FPR
TPR
0.0 0.2 0.4 0.6 0.8 1.0
0.00.20.40.60.81.0
Clustering Coefficient
FPR
TPR
Challenges: How would you address them?
•  Scalability
–  Communities exceed millions of users
–  Infrastructures must support hundreds of millions discussion threads
•  Growth (real-time analysis)
–  Speed of new incoming data / stream processing
•  Concept vs. keyword based data acquisition/pre-processing
–  How to filter certain tags?
–  Which new topics emerge?
–  How topics evolve over time?
–  Authorship in social media, who copies who?
•  Multilingualism
–  We all speak different languages
•  Understanding the user and acting accordingly
–  We all have different personalities, behaviours and preferences
ESWC 2014 Social Web: Where are the Semantics? 20

More Related Content

PPT
Social Web 2.0 Class Week 4: Social Networks, Privacy
PPT
Lupien Oldham Presentation Cil2008
PDF
Lecture 6: How can we STUDY the (Social) Web? (VU Amsterdam Social Web Course)
PPTX
Harnessing the Social Web: The Science of Identity Disambiguation
PPTX
A systematic literature review of academic cyberbullying 2021
PPT
01 Introduction to Networks Methods and Measures
PPTX
A Community of Quality: Using Social Network Analysis to Study University-Wid...
PDF
13 An Introduction to Stochastic Actor-Oriented Models (aka SIENA)
Social Web 2.0 Class Week 4: Social Networks, Privacy
Lupien Oldham Presentation Cil2008
Lecture 6: How can we STUDY the (Social) Web? (VU Amsterdam Social Web Course)
Harnessing the Social Web: The Science of Identity Disambiguation
A systematic literature review of academic cyberbullying 2021
01 Introduction to Networks Methods and Measures
A Community of Quality: Using Social Network Analysis to Study University-Wid...
13 An Introduction to Stochastic Actor-Oriented Models (aka SIENA)

What's hot (20)

DOC
Social Media Use by Canadian Academic Librarians
PPTX
04 Diffusion and Peer Influence
PPTX
11 Network Experiments and Interventions
PDF
Categorize balanced dataset for troll detection
PDF
05 Communities in Network
PDF
Predicting cyber bullying on t witter using machine learning
PPTX
WSDM'16 Relational Learning with Social Status Analysis
PDF
Published Paper
DOC
CARL ABRC social media environmental scan 2011
PPTX
Social Network Analysis (SNA) 2018
PPTX
Social Network Analysis: applications for education research
PPTX
10 More than a Pretty Picture: Visual Thinking in Network Studies
PPTX
Learning Analytics - CET Seminar 2012
DOC
CARL ABRC Survey Results april 2011
PPTX
Social Network Analysis (Part 1)
PPTX
Political prediction analysis using text mining and deep learning
PPTX
Social media for libraries
PDF
Taylor & Francis: Use of Social Media by the Library
PPTX
Who creates trends in online social media
PPT
Social computing meet & greet
Social Media Use by Canadian Academic Librarians
04 Diffusion and Peer Influence
11 Network Experiments and Interventions
Categorize balanced dataset for troll detection
05 Communities in Network
Predicting cyber bullying on t witter using machine learning
WSDM'16 Relational Learning with Social Status Analysis
Published Paper
CARL ABRC social media environmental scan 2011
Social Network Analysis (SNA) 2018
Social Network Analysis: applications for education research
10 More than a Pretty Picture: Visual Thinking in Network Studies
Learning Analytics - CET Seminar 2012
CARL ABRC Survey Results april 2011
Social Network Analysis (Part 1)
Political prediction analysis using text mining and deep learning
Social media for libraries
Taylor & Francis: Use of Social Media by the Library
Who creates trends in online social media
Social computing meet & greet
Ad

Viewers also liked (8)

PPTX
SocInfo2014 CityLabs Workshop
PDF
ESWC 2014 Tutorial part 1
PPTX
Dealers Program
PPTX
DealersProgram
PDF
ESWC 2014 Tutorial part 2
PDF
CAEPIA 2011
PPT
Toyota slides candi's revision
PDF
ESWC 2014 Tutorial part 3
SocInfo2014 CityLabs Workshop
ESWC 2014 Tutorial part 1
Dealers Program
DealersProgram
ESWC 2014 Tutorial part 2
CAEPIA 2011
Toyota slides candi's revision
ESWC 2014 Tutorial part 3
Ad

Similar to ESWC 2014 Tutorial Part 4 (20)

PPTX
Ifip wg-galway-
PPTX
2015 03 19 (EDUCON2015) eMadrid UPM Towards a Learning Analytics Approach for...
PPT
Network Awareness Tool - Learning Analytics in the workplace: 
Detecting and ...
PPTX
Click and Connect: Social Media Affordances & their influence on user partici...
PDF
Social Network Analysis based on MOOC's (Massive Open Online Classes)
PPTX
Working with Social Media Data: Ethics & good practice around collecting, usi...
PPTX
CIC Networked Learning Practices Workshop - Caroline Haythornthwaite
PDF
Making More Sense Out of Social Data
PPTX
The Future of Research Communications and e-Scholarship: Are we there yet?
PPT
Social information Access2012
PPTX
SampleLiteratureReviewTemplate_IVBTechIISEM_MajorProject.pptx
PDF
Empirical user studies in Semantic Web contexts
PPTX
Influencing the MOOC agenda - analysis of #MOOC Twitter Data
PDF
Social Multimedia as Sensors
PPTX
Social media and researchers: Josipa Crnic Deakin University
PDF
Global Redirective Practices: an online workshop for a client
PDF
Stabilization of Black Cotton Soil with Red Mud and Formulation of Linear Reg...
PPT
An Ensemble Model for Cross-Domain Polarity Classification on Twitter
PDF
Social Media Analytics with a pinch of semantics
Ifip wg-galway-
2015 03 19 (EDUCON2015) eMadrid UPM Towards a Learning Analytics Approach for...
Network Awareness Tool - Learning Analytics in the workplace: 
Detecting and ...
Click and Connect: Social Media Affordances & their influence on user partici...
Social Network Analysis based on MOOC's (Massive Open Online Classes)
Working with Social Media Data: Ethics & good practice around collecting, usi...
CIC Networked Learning Practices Workshop - Caroline Haythornthwaite
Making More Sense Out of Social Data
The Future of Research Communications and e-Scholarship: Are we there yet?
Social information Access2012
SampleLiteratureReviewTemplate_IVBTechIISEM_MajorProject.pptx
Empirical user studies in Semantic Web contexts
Influencing the MOOC agenda - analysis of #MOOC Twitter Data
Social Multimedia as Sensors
Social media and researchers: Josipa Crnic Deakin University
Global Redirective Practices: an online workshop for a client
Stabilization of Black Cotton Soil with Red Mud and Formulation of Linear Reg...
An Ensemble Model for Cross-Domain Polarity Classification on Twitter
Social Media Analytics with a pinch of semantics

More from Miriam Fernandez (16)

PDF
Biases in Social Media Research (NoBias EU project)
PPTX
Research seminar Queen Mary University of London (CogSci)
PDF
Vision track october_2020_fernandez_v5
PDF
On the Application of Social Data Science to Address Societal Challenges
PPTX
Online radicalisation: work, challenges and future directions
PDF
Mining Social Media Data For Policing
PDF
Introduction to Mining Social Media Data
PDF
Online Misinformation: Challenges and Future Directions
PDF
Slides 28-feb-2018-v2.pptx
PDF
Artificial Intelligence for Policing
PPTX
OUSocial OUSocMed conference
PDF
On the use of social media for evidence-based policing
PDF
ECSM2014: Using Social Media To Inform Policy Making: To whom are we listenin...
PPTX
ESWC 2014 Tutorial Handson 1: Collect Data from Facebook
PDF
Wm unit1.6-slides-semantic web-final
PPT
Iswc 2011: Linking Data Across Universities: An Integrated Video Lectures Dat...
Biases in Social Media Research (NoBias EU project)
Research seminar Queen Mary University of London (CogSci)
Vision track october_2020_fernandez_v5
On the Application of Social Data Science to Address Societal Challenges
Online radicalisation: work, challenges and future directions
Mining Social Media Data For Policing
Introduction to Mining Social Media Data
Online Misinformation: Challenges and Future Directions
Slides 28-feb-2018-v2.pptx
Artificial Intelligence for Policing
OUSocial OUSocMed conference
On the use of social media for evidence-based policing
ECSM2014: Using Social Media To Inform Policy Making: To whom are we listenin...
ESWC 2014 Tutorial Handson 1: Collect Data from Facebook
Wm unit1.6-slides-semantic web-final
Iswc 2011: Linking Data Across Universities: An Integrated Video Lectures Dat...

Recently uploaded (20)

PDF
Encapsulation theory and applications.pdf
PDF
7 ChatGPT Prompts to Help You Define Your Ideal Customer Profile.pdf
PDF
cuic standard and advanced reporting.pdf
PPTX
Effective Security Operations Center (SOC) A Modern, Strategic, and Threat-In...
PDF
Unlocking AI with Model Context Protocol (MCP)
PDF
Chapter 3 Spatial Domain Image Processing.pdf
PDF
The Rise and Fall of 3GPP – Time for a Sabbatical?
PDF
TokAI - TikTok AI Agent : The First AI Application That Analyzes 10,000+ Vira...
PDF
Peak of Data & AI Encore- AI for Metadata and Smarter Workflows
PDF
MIND Revenue Release Quarter 2 2025 Press Release
PDF
Diabetes mellitus diagnosis method based random forest with bat algorithm
PDF
Machine learning based COVID-19 study performance prediction
PDF
Build a system with the filesystem maintained by OSTree @ COSCUP 2025
PPTX
Understanding_Digital_Forensics_Presentation.pptx
PPT
Teaching material agriculture food technology
PDF
Approach and Philosophy of On baking technology
PDF
Encapsulation_ Review paper, used for researhc scholars
DOCX
The AUB Centre for AI in Media Proposal.docx
PDF
Review of recent advances in non-invasive hemoglobin estimation
PPTX
VMware vSphere Foundation How to Sell Presentation-Ver1.4-2-14-2024.pptx
Encapsulation theory and applications.pdf
7 ChatGPT Prompts to Help You Define Your Ideal Customer Profile.pdf
cuic standard and advanced reporting.pdf
Effective Security Operations Center (SOC) A Modern, Strategic, and Threat-In...
Unlocking AI with Model Context Protocol (MCP)
Chapter 3 Spatial Domain Image Processing.pdf
The Rise and Fall of 3GPP – Time for a Sabbatical?
TokAI - TikTok AI Agent : The First AI Application That Analyzes 10,000+ Vira...
Peak of Data & AI Encore- AI for Metadata and Smarter Workflows
MIND Revenue Release Quarter 2 2025 Press Release
Diabetes mellitus diagnosis method based random forest with bat algorithm
Machine learning based COVID-19 study performance prediction
Build a system with the filesystem maintained by OSTree @ COSCUP 2025
Understanding_Digital_Forensics_Presentation.pptx
Teaching material agriculture food technology
Approach and Philosophy of On baking technology
Encapsulation_ Review paper, used for researhc scholars
The AUB Centre for AI in Media Proposal.docx
Review of recent advances in non-invasive hemoglobin estimation
VMware vSphere Foundation How to Sell Presentation-Ver1.4-2-14-2024.pptx

ESWC 2014 Tutorial Part 4

  • 1. Social Web: Where are the Semantics? ESWC 2014 Miriam Fernández, Victor Rodríguez, Andrés García-Silva, Oscar Corcho Ontology Engineering Group, UPM, Spain Knowledge Media Institute, The Open University
  • 2. Outline 2 •  Part 1: Understanding Social Media –  Theory: background & applications described in this tutorial –  Hands on: data extraction from Twitter and Facebook •  Part 2: Using semantics to represent data from SNS –  Theory: Using SW to represent content, users and relations –  Hands on: applying and extending SIOC •  Part 3: Using semantics to understand social media conversations –  Theory: Using semantics to understand topics in social media –  Hands on: using LDA to extract topics from social media •  Part 4: Using semantics to understand user behaviour
  • 3. Implicit vs. Explicit Semantics •  Implicit Semantics –  Implicit, also called statistical semantics, focus on extracting word sense by studying the patterns of human word usage in massive collections of text or other human generated data –  It does not rely on an explicit formalisation/conceptualisation of knowledge •  Explicit Semantics –  Explicit semantics focus on the analysis of content by using the support of explicit conceptualisations in the form of ontologies and knowledge bases ESWC 2014 Social Web: Where are the Semantics? 3
  • 4. Explicit Semantics Structured Unstructured From the Web of human generated content The Web of unstructured text (Posts / Documents) and Links To the Web of machine understandable content The Web of Objects and Relations
  • 5. •  The annotators extract entities (classes / individuals) and relations from the text and link them to object URIs Obtaining explicit semantics from social media content
  • 6. Using Semantics to Analyse Topic Evolution •  LDA topics are identified by a set of keywords –  Difficult to assess their meaning and evolution •  Use explicit semantics to characterise topics as concrete entities 6
  • 7. ! ! Using Semantics to Analyse Topic Evolution ESWC 2014 Social Web: Where are the Semantics? 7 •  Analyse concepts appearance –  Within a group –  Across groups –  Over time •  Type filtering •  Interlinking with other datasets (data.open.ac.uk)
  • 8. Using Semantics To Analyse Sentiment •  Sentiment analysis on social media –  Offers a fast and cheap access to publics’ feelings towards brands, business, people, etc. –  Comes with additional challenges –  Current approaches •  Lexical-based •  Machine Learning –  Explicit semantics are often neglected ESWC 2014 Social Web: Where are the Semantics? 8
  • 9. Using Semantics to Analyse Sentiment •  Add semantics as additional features into the training set •  Results –  Incorporating semantics increases accuracy by 6.5% for negative sentiment and by 4.8% for positive sentiment –  The use of explicit semantics is more appropriate when the datasets being analysed are large and cover a wide range of topics Saif, Hassan, He, Yulan, Alani, Harith (2012). Semantic sentiment analysis of twitter. In: 11th International Semantic Web Conference (ISWC 2012)
  • 10. “Words that occur in similar context tend to have similar meaning” Wittgenstein (1953) Using Semantics To Analyse Sentiment •  SentiCircles –  Integrates implicit and explicit semantics to analyse sentiment –  Outperforms other lexicon labeling methods and overtakes the state-of-the- art SentiStrength approach in accuracy, with a marginal drop in F-measure ESWC 2014 Social Web: Where are the Semantics? 10 Saif, Hassan, Fernandez, Miriam, He, Yulan, Alani, Harith (2014). SentiCircles for Tweet-level Sentiment Analysis (ESWC 2014) -> conference presentation on the 27, 14:00!!
  • 11. Using Semantics To Analyse Sentiment ESWC 2014 Social Web: Where are the Semantics? 11
  • 12. Using Semantics to Analyse User Behaviour •  Goal –  Monitor and capture member activities –  Analyse emerging behaviour over time –  Understand the correlation of behaviour with community evolution •  Approach –  Identify behavioural features and behaviour roles –  Create an ontology to model behavioural roles and behaviour features –  Use semantic rules to infer user roles in online communities –  Study role composition patterns ESWC 2014 Social Web: Where are the Semantics? 12 Angeletou, S., Rowe, M. and Alani, H. (2011) Modelling and Analysis of User Behaviour in Online Communities, 10th International Semantic Web Conference (ISWC 2011), Bonn, Germany Rowe, Matthew; Fernandez, Miriam; Angeletou, Sofia and Alani, Harith (2013). Community analysis through semantic rules and role composition derivation. Journal of Web Semantics: Science, Services and Agents on the World Wide Web, 18(1) pp. 31–47
  • 13. Behavioural roles and features ESWC 2014 Social Web: Where are the Semantics? 13 Table 1. Roles and the feature-to-level mappings Role Feature Level Elitist In-Degree Ratio low Bi-directional Threads Ratio high Bi-directional Neighbours Ratio low Grunt Bi-directional Threads Ratio med Bi-directional Neighbours Ratio med Average Posts per Thread low STD of Posts per Thread low Joining Conversationalist Thread Initiation Ratio low Average Posts per Thread high STD of Posts per Thread high Popular Initiator In-Degree Ratio high Thread Initiation Ratio high Popular Participants In-Degree Ratio high Thread Initiation Ratio low Average Posts per Thread med STD of Posts per Thread med Supporter In-Degree Ratio med Bi-directional Threads Ratio med Bi-directional Neighbours Ratio med Taciturn Bi-directional Threads Ratio low Bi-directional Neighbours Ratio low Average Posts per Thread low STD of Posts per Thread low Ignored Posts Replied Ratio low Jeffrey Chan, Conor Hayes, and Elizabeth Daly. Decomposing discussion forums using common user roles. In Proc. Web Science Conf. (WebSci10), Raleigh, NC: US, 2010.
  • 14. Modelling user features and interactions ESWC 2014 Social Web: Where are the Semantics? 14 http://purl.org/net/oubo/0.3•  OUBO: The OU Behaviour Ontology
  • 15. Encoding Rules in Ontologies with SPIN ESWC 2014 Social Web: Where are the Semantics? 15
  • 16. Apply rules to infer user roles over time ESWC 2014 Social Web: Where are the Semantics? 16 1.- Construct features for community users at a given time step 2.- Derive bings using equal frequency binning Popularity-low cutoff = 0.5 Initiation-high cutoff = 0.4 3.- Use skeleton rule base to construct rules using bin levels Popularity=low, Initiation=high ->roleA Popularity<0.5, Initiation > 0.4 -> roleA 4.- Apply rules to infer user roles and community composition 5.- Repeat 1-4 following time steps
  • 17. Analyse the role composition of the community ESWC 2014 Social Web: Where are the Semantics? 17 •  Investigate the correlation between the role composition and the students’ performance
  • 18. Analyse the role composition of the community •  Allow Policy Makers to focus on a smaller set of users, with whom they may want to engage more closely ESWC 2014 Social Web: Where are the Semantics? 18
  • 19. Analyse the role composition of the community •  Development of models to predict community health based on role compositions and evolution of user behaviour –  Health Indicators •  Churn Rate: proportion of users who leave the network in a given time segment •  User Count: number of users who posted at least once •  Seeds / Non seeds: proportion of posts that get responses vs. those that don’t •  Clustering coefficient: measures the cohesion within the network –  Results •  Accurate detection of community health is possible using role composition information •  There is no “one size fits all” model ESWC 2014 Social Web: Where are the Semantics? 19 Rowe, M. and Alani, H. (2012) What Makes Communities Tick? Community Health Analysis using Role compositions. International Conference on Social Computing, 2012 0.0 0.2 0.4 0.6 0.8 1.0 0.00.20.40.60.81.0 Churn Rate FPR TPR 0.0 0.2 0.4 0.6 0.8 1.0 0.00.20.40.60.81.0 User Count FPR TPR 0.0 0.2 0.4 0.6 0.8 1.0 0.00.20.40.60.81.0 Seeds / Non−seeds Prop FPR TPR 0.0 0.2 0.4 0.6 0.8 1.0 0.00.20.40.60.81.0 Clustering Coefficient FPR TPR
  • 20. Challenges: How would you address them? •  Scalability –  Communities exceed millions of users –  Infrastructures must support hundreds of millions discussion threads •  Growth (real-time analysis) –  Speed of new incoming data / stream processing •  Concept vs. keyword based data acquisition/pre-processing –  How to filter certain tags? –  Which new topics emerge? –  How topics evolve over time? –  Authorship in social media, who copies who? •  Multilingualism –  We all speak different languages •  Understanding the user and acting accordingly –  We all have different personalities, behaviours and preferences ESWC 2014 Social Web: Where are the Semantics? 20