SlideShare a Scribd company logo
Authors
University
Politehnica
of Bucharest
How Useful are Semantic Links for the
Detection of Implicit References in CSCL Chats?
Traian Rebedea traian.rebedea@cs.pub.ro
Costin-Gabriel Chiru
Gabriel-Marius Guțu
Overview
• Introduction
• Background
• Research Question
• Semantic Similarity
• Macro-level: Are Semantic Links Useful?
• Micro-level: Are Semantic Links Useful?
• Conclusions
19.08.15 RoEduNet Conference 2014 – Chișinău, R. Moldova 2
Introduction
• Chat conversations are used in a large range
of Computer-Supported Collaborative
Learning (CSCL) activities
– Debating and solving difficult problems in small
groups
– Multi-party conversations
– Several discussion threads running in parallel
• Discourse analysis does not provide a theory
suitable for processing multi-party chat
conversations
19.08.15 RoEduNet Conference 2014 – Chi inău, R. Moldovaș 3
Background
• New theories propose the use of conversation (or
coherence) graphs for chat analysis
• There is a existence of a multitude of links – explicit or
implicit – between utterances that might explain the
evolution of discussion threads
• Each utterance has an inner (or specific) voice of the
participant which uttered it, but also complex echoes
from previous voices
• Determining and analyzing this linkage between voices
would provide a powerful method for analyzing
learning and knowledge building
– Individual level
– Group level
19.08.15 RoEduNet Conference 2014 – Chișinău, R. Moldova 4
Background
• The quality of individual thinking can be improved by
improving the quality of dialogue
– “individual thinking skills originate in conversations, where
we learn to reason, to evaluate, to join in creative play and
to provide relevant information” (Wegerif, 2006)
• Difficult to determine the quality of a conversation,
especially in online multi-party discussions
– Proposed that the degree of inter-animation in a
conversation can be used to assess its quality
– Inter-animation assumes that meaning arises not from a
single utterance, but rather from the interactions between
utterances
19.08.15 RoEduNet Conference 2014 – Chișinău, R. Moldova 5
Background
• Inter-animation of voices in a conversation may be
represented through the links between the utterances
– Explicit links
– Many links remain implicit
• The discovery of these implicit links is a difficult task,
mainly because of the multitude distinct types of links
that may arise
– Lexical links
– Continuation of utterances (which may be considered
broken utterances or syntactic links)
– Semantic links
– Pragmatic and conversation specific references
19.08.15 RoEduNet Conference 2014 – Chișinău, R. Moldova 6
Research Question
• How useful is semantic similarity alone for
detecting links arising in CSCL chat
conversations?
• Many CSCL applications that are processing
either conversations or written texts are mainly
using (lexical, semantic) similarity relatedness
methods for assessing the cohesion of the
analyzed texts
• They may miss important links that are not
detectable using semantic similarity alone
• Cohesion vs. Coherence19.08.15 RoEduNet Conference 2014 – Chișinău, R. Moldova 7
Semantic Similarity
• Semantic similarity (or relatedness) between
proximal words is one of the characteristics of
the context in most coherent discourses
• Knowledge-based methods (“strong” semantics)
– Rely on general, domain and discourse ontologies
• Corpus-based methods (“weak” semantics)
– Do not define any underlying semantic model
between the words or concepts, but rather exploit
the probability distribution and the statistics of two
words co-appearing together in a given discourse unit
(e.g. paragraph)
19.08.15 RoEduNet Conference 2014 – Chișinău, R. Moldova 8
Knowledge-based Methods
• Primarily use lexical resources built by linguists,
for example dictionaries, thesauri and lexical
ontologies
• The most popular methods for computing the
semantic similarity or relatedness between two
words or concepts are defined for ontologies =>
semantic distances
• Most of them have been especially constructed
for the linguistic ontology (or lexical database)
WordNet
– synonyms, antonyms, related words, hypernyms,
hyponyms, meronyms and holonyms
19.08.15 RoEduNet Conference 2014 – Chișinău, R. Moldova 9
Corpus-Based Methods
• Instead of using human-assembled linguistic
knowledge, these techniques process large amounts of
text corpora and then use the statistics of words co-
appearance
• Latent Semantic Analysis (LSA)
– Perform SVD to reduce the dimensionality of the term-
document matrix
– Keep only the largest singular values (usually k = 100..300)
– This reduced dimensionality space is also called the latent
semantic space and may be used to compute the similarity
between words, word sets and texts by using cosine
similarity for the document vectors in this reduced space
• Other techniques have been proposed: Latent Dirichlet
Allocation (LDA), Explicit Semantic Analysis (ESA), etc.
19.08.15 RoEduNet Conference 2014 – Chișinău, R. Moldova 10
Distribution of Explicit Links in Chats
• Corpus of 8 chat conversations analyzed to compute statistics
with regard to the usage of explicit links
– Multi-party discussions in groups of 4-5 students
– Using Concert Chat software (allows explicit referencing)
19.08.15 RoEduNet Conference 2014 – Chișinău, R. Moldova 11
Distribution of Explicit Links in Chats
• Complete distribution of explicit links given
the distance between the two utterances
19.08.15 RoEduNet Conference 2014 – Chișinău, R. Moldova 12
Are Semantic Links Useful?
• Macro-level analysis: Compare the distribution of
the explicit links to the semantic similarity between
the current utterance and utterances that are at a
certain distance
• Computing the semantic similarity between two
utterances uses the similarities between all the pairs
of words in each utterance
19.08.15 RoEduNet Conference 2014 – Chișinău, R. Moldova 13
)
)(
))(*)1,((max
)(
))(*)2,((max
(*
2
1
)2,1(
2
2
1
1
∑
∑
∑
∑
∈
∈
∈
∈
+=
Tw
Tw
Tw
Tw
widf
widfTwSim
widf
widfTwSim
TTsim
Are Semantic Links Useful?
LSA
WordNet – Resnik distance
19.08.15 RoEduNet Conference 2014 – Chișinău, R. Moldova 14
Are Semantic Links Useful?
• Micro-Level Analysis
• How effective is LSA-based semantic similarity
for detecting links between utterances?
• Simple greedy method for choosing the link of
the current utterance:
– Compute the semantic similarity between the
utterance and the previous 20 utterances
– Choose as a link the pair which has the best
(largest) similarity score
19.08.15 RoEduNet Conference 2014 – Chișinău, R. Moldova 15
Are Semantic Links Useful?
ID Utterance User Link Sematic score
148 ok.... so what we were talking about before the
connection issue?
Mona
149 as long as it depends on an internet connection... Mona LINK TO 147
150 meeting board Cristi LINK TO 148 score(150, 148) = 0.18 =
max
151 about different stages of a project... a client must
know about them..so wiki is a good solution
Corina LINK TO 148
152 meeting? Corina LINK TO 150 score(152, 150) = 1.00 =
max
153 ok Stefan LINK TO 151
154 so we agree that wiki is a good solution when we
want to present a product/the evolution of a
project to a client ?
Corina score(154, 151) = 0.5 =
max
155 what about selling our products..what
technologies we should use for this?
Diana
…
166 to sell our products blog is the best solution Corina LINK TO 155 score(166, 155) = 0.38 =
max
…
169 A blog would be a good way to advertise our
products
Mona LINK TO 166 score(169, 166) = 0.48 =
max
…
180 A forum would be useful for offering solutions to
some problems that our customers have
Mona
181 I agree...also other people can offer solutions, not
only us
Corina LINK TO 180 score(181, 180) = 0.27 =
max
19.08.15 RoEduNet Conference 2014 – Chișinău, R. Moldova 16
Are Semantic Links Useful?
• Can explicit links be identified using only the
LSA semantic similarity between utterances?
19.08.15 RoEduNet Conference 2014 – Chișinău, R. Moldova 17
Conclusions
• Two important conclusions arise from this study
• The distribution of explicit links is very similar to that
of the semantic similarity scores, thus proving that, on
a macro-level, both links and semantic similarity
measures have a similar behavior
• This is not always true when analyzing local elements:
an individual link between two utterances is not
always explained by semantic similarity
• Only around 40% of the explicit links in our conversations
could be explained based mainly on semantic similarity
measures
19.08.15 RoEduNet Conference 2014 – Chi inău, R. Moldovaș 18
Thank you!
Questions?
Discussion
19.08.15 RoEduNet Conference 2014 – Chi inău, R. Moldovaș 19
This work has been funded by the
Sectorial Operational Programme
Human Resources Development
2007-2013 of the Romanian Ministry
of European Funds through the
Financial Agreement
POSDRU/159/1.5/S/132397 and by the
FP7 LTfLL project.

More Related Content

PPT
Relevance based ranking of video comments on YouTube
PPT
PhD Defense: Computer-Based Support and Feedback for Collaborative Chat Conve...
PPTX
Web services for supporting the interactions of learners in the social web - ...
PPT
Opinion mining for social media and news items in Romanian
PPTX
Detecting and Describing Historical Periods in a Large Corpora
PPTX
An Evolution of Deep Learning Models for AI2 Reasoning Challenge
PDF
Practical machine learning - Part 1
PPTX
Analysis of Metadata and Topic Modeling for
Relevance based ranking of video comments on YouTube
PhD Defense: Computer-Based Support and Feedback for Collaborative Chat Conve...
Web services for supporting the interactions of learners in the social web - ...
Opinion mining for social media and news items in Romanian
Detecting and Describing Historical Periods in a Large Corpora
An Evolution of Deep Learning Models for AI2 Reasoning Challenge
Practical machine learning - Part 1
Analysis of Metadata and Topic Modeling for

What's hot (20)

PPTX
Question answering
PDF
Frontiers of Natural Language Processing
PDF
Presentation of Domain Specific Question Answering System Using N-gram Approach.
PDF
French machine reading for question answering
PDF
Question Answering - Application and Challenges
PPT
Using lexical chains for text summarization
PDF
Lecture 2: Computational Semantics
POTX
LDA Beginner's Tutorial
PDF
Open domain Question Answering System - Research project in NLP
PPTX
NLP Project Presentation
PDF
Arabic Question Answering: Challenges, Tasks, Approaches, Test-sets, Tools, A...
PDF
Chinese Character Decomposition for Neural MT with Multi-Word Expressions
PPTX
2010 PACLIC - pay attention to categories
PDF
Nlp presentation
PDF
ADAPT Centre and My NLP journey: MT, MTE, QE, MWE, NER, Treebanks, Parsing.
PDF
Meta-evaluation of machine translation evaluation methods
PPTX
Word Tagging with Foundational Ontology Classes
PPTX
Arabic question answering ‫‬
PPTX
1909 paclic
Question answering
Frontiers of Natural Language Processing
Presentation of Domain Specific Question Answering System Using N-gram Approach.
French machine reading for question answering
Question Answering - Application and Challenges
Using lexical chains for text summarization
Lecture 2: Computational Semantics
LDA Beginner's Tutorial
Open domain Question Answering System - Research project in NLP
NLP Project Presentation
Arabic Question Answering: Challenges, Tasks, Approaches, Test-sets, Tools, A...
Chinese Character Decomposition for Neural MT with Multi-Word Expressions
2010 PACLIC - pay attention to categories
Nlp presentation
ADAPT Centre and My NLP journey: MT, MTE, QE, MWE, NER, Treebanks, Parsing.
Meta-evaluation of machine translation evaluation methods
Word Tagging with Foundational Ontology Classes
Arabic question answering ‫‬
1909 paclic
Ad

Viewers also liked (20)

PPS
Oooppps
PDF
Thomas
PPTX
REFORMA EDUCATIVA
PPT
Mirandodondepisamos
PDF
Jansamachar June 24, 2009 Sensex Closes Flat, Most Of Day’S Losses Recouped (...
PPTX
Ancient civilizations
PDF
Ideg publication this week newsletter issue 3
PPT
презентация Microsoft power point
PPTX
Prácticas de clasificación arancelaria
PDF
Projektledelse i kreative virksomheder
DOCX
Ada (ad) (1)
PPT
Number system 1
PDF
Conceptos Básicos
PDF
Ict question
DOCX
Educación pública
PPS
Brilliant Designers from India come together at Design Emporia
PDF
Number system utm notes
PPTX
Indifference Curves
Oooppps
Thomas
REFORMA EDUCATIVA
Mirandodondepisamos
Jansamachar June 24, 2009 Sensex Closes Flat, Most Of Day’S Losses Recouped (...
Ancient civilizations
Ideg publication this week newsletter issue 3
презентация Microsoft power point
Prácticas de clasificación arancelaria
Projektledelse i kreative virksomheder
Ada (ad) (1)
Number system 1
Conceptos Básicos
Ict question
Educación pública
Brilliant Designers from India come together at Design Emporia
Number system utm notes
Indifference Curves
Ad

Similar to How useful are semantic links for the detection of implicit references in cscl chats (20)

PDF
ESWC 2014 Tutorial Part 4
PPTX
Discourse-Centric Learning Analytics
PPT
Nlp based heuristics for assessing participants in cscl chats
PPTX
Assessing critical thinking in moo cs preliminary proposal 201309
PPT
From Open Content To Open Thinking
PPT
Ed-Media2010- De Liddo
ODP
KnowledgeNET conference 2010
PPTX
Automated interpretability of linked data ontologies: an evaluation within th...
PPT
Automatic assessment of collaborative chat conversations with PolyCAFe - EC-T...
PPTX
AIM Analytics: U-M Community Presentations
PDF
ESWC 2014 Tutorial part 3
PDF
DeLiddo&BuckinghamShum-e-Part2014
PPT
Detecting discourse creativity in chat conversations
PDF
Linking Heterogeneous Scholarly Data Sources in an Interoperable Setting: the...
PPTX
MOOC Tales and Trends
PPT
PhD Dissertation
PPTX
session-based recommendations with the session style of Explanation
PDF
Exploring Generative Models of Tripartite Graphs for Recommendation in Social...
PDF
Empirical user studies in Semantic Web contexts
PDF
Interaction Design Patterns in Recommender Systems
ESWC 2014 Tutorial Part 4
Discourse-Centric Learning Analytics
Nlp based heuristics for assessing participants in cscl chats
Assessing critical thinking in moo cs preliminary proposal 201309
From Open Content To Open Thinking
Ed-Media2010- De Liddo
KnowledgeNET conference 2010
Automated interpretability of linked data ontologies: an evaluation within th...
Automatic assessment of collaborative chat conversations with PolyCAFe - EC-T...
AIM Analytics: U-M Community Presentations
ESWC 2014 Tutorial part 3
DeLiddo&BuckinghamShum-e-Part2014
Detecting discourse creativity in chat conversations
Linking Heterogeneous Scholarly Data Sources in an Interoperable Setting: the...
MOOC Tales and Trends
PhD Dissertation
session-based recommendations with the session style of Explanation
Exploring Generative Models of Tripartite Graphs for Recommendation in Social...
Empirical user studies in Semantic Web contexts
Interaction Design Patterns in Recommender Systems

More from Traian Rebedea (20)

PDF
AI @ Wholi - Bucharest.AI Meetup #5
PDF
Deep neural networks for matching online social networking profiles
PDF
Intro to Deep Learning for Question Answering
PPTX
What is word2vec?
PPT
A focused crawler for romanian words discovery
PPT
Propunere de dezvoltare a carierei universitare
PPT
Automatic plagiarism detection system for specialized corpora
PPT
Importanța algoritmilor pentru problemele de la interviuri
PPT
Conclusions and Recommendations of the Romanian ICT RTD Survey
PPT
Istoria Web-ului - part 2 - tentativ How to Web 2009
PPT
Istoria Web-ului - part 1 (2) - tentativ How to Web 2009
PPT
Istoria Web-ului - part 1 - tentativ How to Web 2009
PDF
Algorithm Design and Complexity - Course 12
PDF
Algorithm Design and Complexity - Course 11
PPT
Algorithm Design and Complexity - Course 10
PPT
Algorithm Design and Complexity - Course 9
PPT
Algorithm Design and Complexity - Course 8
PPTX
Algorithm Design and Complexity - Course 7
PPTX
Algorithm Design and Complexity - Course 6
PPTX
Algorithm Design and Complexity - Course 5
AI @ Wholi - Bucharest.AI Meetup #5
Deep neural networks for matching online social networking profiles
Intro to Deep Learning for Question Answering
What is word2vec?
A focused crawler for romanian words discovery
Propunere de dezvoltare a carierei universitare
Automatic plagiarism detection system for specialized corpora
Importanța algoritmilor pentru problemele de la interviuri
Conclusions and Recommendations of the Romanian ICT RTD Survey
Istoria Web-ului - part 2 - tentativ How to Web 2009
Istoria Web-ului - part 1 (2) - tentativ How to Web 2009
Istoria Web-ului - part 1 - tentativ How to Web 2009
Algorithm Design and Complexity - Course 12
Algorithm Design and Complexity - Course 11
Algorithm Design and Complexity - Course 10
Algorithm Design and Complexity - Course 9
Algorithm Design and Complexity - Course 8
Algorithm Design and Complexity - Course 7
Algorithm Design and Complexity - Course 6
Algorithm Design and Complexity - Course 5

Recently uploaded (20)

PDF
RMMM.pdf make it easy to upload and study
PDF
Basic Mud Logging Guide for educational purpose
PDF
grade 11-chemistry_fetena_net_5883.pdf teacher guide for all student
PDF
Saundersa Comprehensive Review for the NCLEX-RN Examination.pdf
PDF
TR - Agricultural Crops Production NC III.pdf
PDF
102 student loan defaulters named and shamed – Is someone you know on the list?
PDF
BÀI TẬP BỔ TRỢ 4 KỸ NĂNG TIẾNG ANH 9 GLOBAL SUCCESS - CẢ NĂM - BÁM SÁT FORM Đ...
PDF
2.FourierTransform-ShortQuestionswithAnswers.pdf
PPTX
1st Inaugural Professorial Lecture held on 19th February 2020 (Governance and...
PPTX
PPH.pptx obstetrics and gynecology in nursing
PPTX
Final Presentation General Medicine 03-08-2024.pptx
PPTX
Pharma ospi slides which help in ospi learning
PPTX
Lesson notes of climatology university.
PPTX
Introduction_to_Human_Anatomy_and_Physiology_for_B.Pharm.pptx
PDF
Supply Chain Operations Speaking Notes -ICLT Program
PDF
Black Hat USA 2025 - Micro ICS Summit - ICS/OT Threat Landscape
PDF
Classroom Observation Tools for Teachers
PPTX
Renaissance Architecture: A Journey from Faith to Humanism
PPTX
master seminar digital applications in india
PDF
Microbial disease of the cardiovascular and lymphatic systems
RMMM.pdf make it easy to upload and study
Basic Mud Logging Guide for educational purpose
grade 11-chemistry_fetena_net_5883.pdf teacher guide for all student
Saundersa Comprehensive Review for the NCLEX-RN Examination.pdf
TR - Agricultural Crops Production NC III.pdf
102 student loan defaulters named and shamed – Is someone you know on the list?
BÀI TẬP BỔ TRỢ 4 KỸ NĂNG TIẾNG ANH 9 GLOBAL SUCCESS - CẢ NĂM - BÁM SÁT FORM Đ...
2.FourierTransform-ShortQuestionswithAnswers.pdf
1st Inaugural Professorial Lecture held on 19th February 2020 (Governance and...
PPH.pptx obstetrics and gynecology in nursing
Final Presentation General Medicine 03-08-2024.pptx
Pharma ospi slides which help in ospi learning
Lesson notes of climatology university.
Introduction_to_Human_Anatomy_and_Physiology_for_B.Pharm.pptx
Supply Chain Operations Speaking Notes -ICLT Program
Black Hat USA 2025 - Micro ICS Summit - ICS/OT Threat Landscape
Classroom Observation Tools for Teachers
Renaissance Architecture: A Journey from Faith to Humanism
master seminar digital applications in india
Microbial disease of the cardiovascular and lymphatic systems

How useful are semantic links for the detection of implicit references in cscl chats

  • 1. Authors University Politehnica of Bucharest How Useful are Semantic Links for the Detection of Implicit References in CSCL Chats? Traian Rebedea traian.rebedea@cs.pub.ro Costin-Gabriel Chiru Gabriel-Marius Guțu
  • 2. Overview • Introduction • Background • Research Question • Semantic Similarity • Macro-level: Are Semantic Links Useful? • Micro-level: Are Semantic Links Useful? • Conclusions 19.08.15 RoEduNet Conference 2014 – Chișinău, R. Moldova 2
  • 3. Introduction • Chat conversations are used in a large range of Computer-Supported Collaborative Learning (CSCL) activities – Debating and solving difficult problems in small groups – Multi-party conversations – Several discussion threads running in parallel • Discourse analysis does not provide a theory suitable for processing multi-party chat conversations 19.08.15 RoEduNet Conference 2014 – Chi inău, R. Moldovaș 3
  • 4. Background • New theories propose the use of conversation (or coherence) graphs for chat analysis • There is a existence of a multitude of links – explicit or implicit – between utterances that might explain the evolution of discussion threads • Each utterance has an inner (or specific) voice of the participant which uttered it, but also complex echoes from previous voices • Determining and analyzing this linkage between voices would provide a powerful method for analyzing learning and knowledge building – Individual level – Group level 19.08.15 RoEduNet Conference 2014 – Chișinău, R. Moldova 4
  • 5. Background • The quality of individual thinking can be improved by improving the quality of dialogue – “individual thinking skills originate in conversations, where we learn to reason, to evaluate, to join in creative play and to provide relevant information” (Wegerif, 2006) • Difficult to determine the quality of a conversation, especially in online multi-party discussions – Proposed that the degree of inter-animation in a conversation can be used to assess its quality – Inter-animation assumes that meaning arises not from a single utterance, but rather from the interactions between utterances 19.08.15 RoEduNet Conference 2014 – Chișinău, R. Moldova 5
  • 6. Background • Inter-animation of voices in a conversation may be represented through the links between the utterances – Explicit links – Many links remain implicit • The discovery of these implicit links is a difficult task, mainly because of the multitude distinct types of links that may arise – Lexical links – Continuation of utterances (which may be considered broken utterances or syntactic links) – Semantic links – Pragmatic and conversation specific references 19.08.15 RoEduNet Conference 2014 – Chișinău, R. Moldova 6
  • 7. Research Question • How useful is semantic similarity alone for detecting links arising in CSCL chat conversations? • Many CSCL applications that are processing either conversations or written texts are mainly using (lexical, semantic) similarity relatedness methods for assessing the cohesion of the analyzed texts • They may miss important links that are not detectable using semantic similarity alone • Cohesion vs. Coherence19.08.15 RoEduNet Conference 2014 – Chișinău, R. Moldova 7
  • 8. Semantic Similarity • Semantic similarity (or relatedness) between proximal words is one of the characteristics of the context in most coherent discourses • Knowledge-based methods (“strong” semantics) – Rely on general, domain and discourse ontologies • Corpus-based methods (“weak” semantics) – Do not define any underlying semantic model between the words or concepts, but rather exploit the probability distribution and the statistics of two words co-appearing together in a given discourse unit (e.g. paragraph) 19.08.15 RoEduNet Conference 2014 – Chișinău, R. Moldova 8
  • 9. Knowledge-based Methods • Primarily use lexical resources built by linguists, for example dictionaries, thesauri and lexical ontologies • The most popular methods for computing the semantic similarity or relatedness between two words or concepts are defined for ontologies => semantic distances • Most of them have been especially constructed for the linguistic ontology (or lexical database) WordNet – synonyms, antonyms, related words, hypernyms, hyponyms, meronyms and holonyms 19.08.15 RoEduNet Conference 2014 – Chișinău, R. Moldova 9
  • 10. Corpus-Based Methods • Instead of using human-assembled linguistic knowledge, these techniques process large amounts of text corpora and then use the statistics of words co- appearance • Latent Semantic Analysis (LSA) – Perform SVD to reduce the dimensionality of the term- document matrix – Keep only the largest singular values (usually k = 100..300) – This reduced dimensionality space is also called the latent semantic space and may be used to compute the similarity between words, word sets and texts by using cosine similarity for the document vectors in this reduced space • Other techniques have been proposed: Latent Dirichlet Allocation (LDA), Explicit Semantic Analysis (ESA), etc. 19.08.15 RoEduNet Conference 2014 – Chișinău, R. Moldova 10
  • 11. Distribution of Explicit Links in Chats • Corpus of 8 chat conversations analyzed to compute statistics with regard to the usage of explicit links – Multi-party discussions in groups of 4-5 students – Using Concert Chat software (allows explicit referencing) 19.08.15 RoEduNet Conference 2014 – Chișinău, R. Moldova 11
  • 12. Distribution of Explicit Links in Chats • Complete distribution of explicit links given the distance between the two utterances 19.08.15 RoEduNet Conference 2014 – Chișinău, R. Moldova 12
  • 13. Are Semantic Links Useful? • Macro-level analysis: Compare the distribution of the explicit links to the semantic similarity between the current utterance and utterances that are at a certain distance • Computing the semantic similarity between two utterances uses the similarities between all the pairs of words in each utterance 19.08.15 RoEduNet Conference 2014 – Chișinău, R. Moldova 13 ) )( ))(*)1,((max )( ))(*)2,((max (* 2 1 )2,1( 2 2 1 1 ∑ ∑ ∑ ∑ ∈ ∈ ∈ ∈ += Tw Tw Tw Tw widf widfTwSim widf widfTwSim TTsim
  • 14. Are Semantic Links Useful? LSA WordNet – Resnik distance 19.08.15 RoEduNet Conference 2014 – Chișinău, R. Moldova 14
  • 15. Are Semantic Links Useful? • Micro-Level Analysis • How effective is LSA-based semantic similarity for detecting links between utterances? • Simple greedy method for choosing the link of the current utterance: – Compute the semantic similarity between the utterance and the previous 20 utterances – Choose as a link the pair which has the best (largest) similarity score 19.08.15 RoEduNet Conference 2014 – Chișinău, R. Moldova 15
  • 16. Are Semantic Links Useful? ID Utterance User Link Sematic score 148 ok.... so what we were talking about before the connection issue? Mona 149 as long as it depends on an internet connection... Mona LINK TO 147 150 meeting board Cristi LINK TO 148 score(150, 148) = 0.18 = max 151 about different stages of a project... a client must know about them..so wiki is a good solution Corina LINK TO 148 152 meeting? Corina LINK TO 150 score(152, 150) = 1.00 = max 153 ok Stefan LINK TO 151 154 so we agree that wiki is a good solution when we want to present a product/the evolution of a project to a client ? Corina score(154, 151) = 0.5 = max 155 what about selling our products..what technologies we should use for this? Diana … 166 to sell our products blog is the best solution Corina LINK TO 155 score(166, 155) = 0.38 = max … 169 A blog would be a good way to advertise our products Mona LINK TO 166 score(169, 166) = 0.48 = max … 180 A forum would be useful for offering solutions to some problems that our customers have Mona 181 I agree...also other people can offer solutions, not only us Corina LINK TO 180 score(181, 180) = 0.27 = max 19.08.15 RoEduNet Conference 2014 – Chișinău, R. Moldova 16
  • 17. Are Semantic Links Useful? • Can explicit links be identified using only the LSA semantic similarity between utterances? 19.08.15 RoEduNet Conference 2014 – Chișinău, R. Moldova 17
  • 18. Conclusions • Two important conclusions arise from this study • The distribution of explicit links is very similar to that of the semantic similarity scores, thus proving that, on a macro-level, both links and semantic similarity measures have a similar behavior • This is not always true when analyzing local elements: an individual link between two utterances is not always explained by semantic similarity • Only around 40% of the explicit links in our conversations could be explained based mainly on semantic similarity measures 19.08.15 RoEduNet Conference 2014 – Chi inău, R. Moldovaș 18
  • 19. Thank you! Questions? Discussion 19.08.15 RoEduNet Conference 2014 – Chi inău, R. Moldovaș 19 This work has been funded by the Sectorial Operational Programme Human Resources Development 2007-2013 of the Romanian Ministry of European Funds through the Financial Agreement POSDRU/159/1.5/S/132397 and by the FP7 LTfLL project.