SlideShare a Scribd company logo
A Vague Sense Classifier for Detecting Vague
Definitions in Ontologies
Panos Alexopoulos, John Pavlopoulos
14th Conference of the European Chapter of the Association for Computational
Linguistics
Gothenburg, Sweden, 26–30 April 2014
2
Vagueness
Introduction
●Vagueness is a semantic phenomenon where predicates admit
borderline cases, i.e. cases where it is not determinately true that the
predicate applies or not (Shapiro 2006).
●This happens when predicates have blurred boundaries:
● What’s the threshold number of years separating old and not old
films?
● What are the exact criteria that distinguish modern restaurants
from non-modern?
3
Vagueness Consequences
Introduction
●The problem with vague terms in semantic data is the possibility of
disagreements!
●E.g., when we asked domain experts to provide instances of the
concept Critical Business Process, there were certain processes for
which there was a dispute among them about whether they should be
regarded as critical or not.
●The problem was that different experts had different criteria of
process criticality and could not decide which of these were
sufficient to classify a process as critical.
4
Problematic Scenarios
Introduction
1. Structuring Data with a Vague Ontology: Possible
disagreement among experts when defining class and relation
instances.
2. Utilizing Vague Facts in Ontology-Based Systems:
Reasoning results might not meet users’ expectations
3. Integrating Vague Semantic Information: The merging of
particular vague elements can lead to data that will not be
valid for all its users.
5
Problem Definition & Approach
Automatic Vagueness Detection
●Can we automatically determine whether an ontology entity (class, relation etc.)
is vague or not?
● “StrategicClient” as “A client that has a high value for the company” is
vague!
● “AmericanCompany” as “A company that has legal status in the
Unites States” is not!
Problem Definition
●We train a binary classifier that may distinguish between vague and non-vague
term word senses.
●Training is supervised, using examples from Wordnet.
●We use this classifier to determine whether a given ontology element definition
is vague or not.
Approach
6
Data
Automatic Vagueness Detection
●2,000 adjective senses from WordNet.
● 1,000 vague
● 1,000 non-vague
●Inter-agreement of vague/non-vague annotation among 3 human
judges was 0.64 (Cohen’s Kappa)
Vague Senses Non Vague Senses
• Abnormal: not normal, not typical or usual
or regularor conforming to a norm
• Compound: composed of more than one
part
• Impenitent: impervious to moral persuasion • Biweekly: occurring every two weeks.
• Notorious: known widely and usually
unfavorably
• Irregular: falling below the manufacturer's
standard
• Aroused: emotionally aroused • Outermost: situated at the farthest possible
point from a center.
7
Training and Evaluation
Automatic Vagueness Detection
●80% of the data used to train a multinomial Naive Bayes classifier.
●We removed stop words and we used the bag of words assumption to
represent each instance.
●The remaining 20% of the data was used as a test set.
●Classification accuracy was 84%!
8
Comparison with Subjectivity Analyzer
Automatic Vagueness Detection
●We also used a subjective sense classifier to classify our dataset’s
senses as subjective or objective.
●From the 1000 vague senses, only 167 were classified as subjective
while from the 1000 non-vague ones 993.
●This shows that treating vagueness in the same way as
subjectiveness is not really effective.
9
Use Case: Detecting Vagueness in CiTO Ontology
Automatic Vagueness Detection
●As an ontology use case we considered CiTO, an ontology that
enables characterization of the nature or type of citations.
●CiTO consists primarily of relations, many of which are vague (e.g.
plagiarizes).
●We selected 44 relations and we had 3 human judges manually
classify them as vague or not.
●Then we applied our Wordnet-trained vagueness classifier on the
textual definitions of the same relations.
10
Use Case: Detecting Vagueness in CiTO Ontology
Automatic Vagueness Detection
Vague Relations Non Vague Relations
• plagiarizes: A property indicating that
the author of the citing entity
plagiarizes the cited entity, by
including textual or other elements
from the cited entity without formal
acknowledgement of their source
• sharesAuthorInstitutionWith: Each
entity has at least one author that
shares a common institutional
affiliation with an author of the other
entity
• citesAsAuthority: The citing entity
cites the cited entity as one that
provides an authoritative description
or definition of the subject under
discussion.
• providesDataFor: The cited entity
presents data that are used in work
described in the citing entity.
11
Use Case: Detecting Vagueness in CiTO Ontology
Automatic Vagueness Detection
●Classification Results:
● 82% of relations were correctly classified as vague/non-vague
● 94% accuracy for non-vague relations.
● 74% accuracy for vague relations.
●Again, we classified the same relations with the subjectivity classifier:
● 40% of vague/non-vague relations were classified as
subjective/objective respectively.
● 94% of non-vague were classified as objective.
● 7% of vague relations were classified as subjective.
12
Future Work
Vagueness-Aware Semantic Data
●Incorporate the current classifier into an ontology analysis tool
●Improve the classifier by contemplating new features
●See whether it is possible to build a vague sense lexicon.
13
Questions?
Thank you!
iSOCO Madrid
Av. del Partenón, 16-18, 1º7ª
Campo de las Naciones
28042 Madrid
España
(t) +34 913 349 797
iSOCO Pamplona
Parque Tomás
Caballero, 2, 6º4ª
31006 Pamplona
España
(t) +34 948 102 408
iSOCO Valencia
C/ Prof. Beltrán Báguena, 4
Oficina 107
46009 Valencia
España
(t) +34 963 467 143
iSOCO Barcelona
Av. Torre Blanca, 57
Edificio ESADE CREAPOLIS
Oficina 3C 15
08172 Sant Cugat del Vallès
Barcelona, España
(t) +34 935 677 200
iSOCO Colombia
Complejo Ruta N
Calle 67, 52-20
Piso 3, Torre A
Medellín
Colombia
(t) +57 516 7770 ext. 1132
Key Vendor
Virtual Assistant 2013
Quieres
innovar?
Dr. Panos Alexopoulos
Semantic Applications Research
Manager
palexopoulos@isoco.com
(t) +34 913 349 797

More Related Content

PPT
Learning Vague Knowledge From Socially Generated Content in an Enterprise Fra...
PPT
Troubleshooting and Optimizing Named Entity Resolution Systems in the Industry
PPT
Detecting, Measuring and Representing Vagueness in Ontologies
PPT
Vagueness in Semantic Information Management
PPTX
Towards Vagueness-Aware Semantic Data
PPTX
Towards Purposeful Reuse of Semantic Datasets Through Goal-Driven Summarization
PDF
SENTIMENT ANALYSIS-AN OBJECTIVE VIEW
PDF
Sentiment Analysis of Feedback Data
Learning Vague Knowledge From Socially Generated Content in an Enterprise Fra...
Troubleshooting and Optimizing Named Entity Resolution Systems in the Industry
Detecting, Measuring and Representing Vagueness in Ontologies
Vagueness in Semantic Information Management
Towards Vagueness-Aware Semantic Data
Towards Purposeful Reuse of Semantic Datasets Through Goal-Driven Summarization
SENTIMENT ANALYSIS-AN OBJECTIVE VIEW
Sentiment Analysis of Feedback Data

What's hot (20)

PPTX
Sentiment analysis
PDF
A Framework for Arabic Concept-Level Sentiment Analysis using SenticNet
PDF
SemEval - Aspect Based Sentiment Analysis
PDF
Project sentiment analysis
DOCX
295B_Report_Sentiment_analysis
PPTX
Amazon Product Sentiment review
PPTX
A review of sentiment analysis approaches in big
PDF
CrowdTruth for medical relation extraction - WAI talk
PDF
Project report
PDF
IRJET- Survey of Classification of Business Reviews using Sentiment Analysis
PDF
An overview of text mining and sentiment analysis for Decision Support System
PPTX
Sentiment analysis
ODP
Query recommendation papers
PPTX
Supervised Learning Based Approach to Aspect Based Sentiment Analysis
PPTX
PPTX
Sentiment Analysis
PDF
PDF
project sentiment analysis
PPTX
Sentiment analysis
Sentiment analysis
A Framework for Arabic Concept-Level Sentiment Analysis using SenticNet
SemEval - Aspect Based Sentiment Analysis
Project sentiment analysis
295B_Report_Sentiment_analysis
Amazon Product Sentiment review
A review of sentiment analysis approaches in big
CrowdTruth for medical relation extraction - WAI talk
Project report
IRJET- Survey of Classification of Business Reviews using Sentiment Analysis
An overview of text mining and sentiment analysis for Decision Support System
Sentiment analysis
Query recommendation papers
Supervised Learning Based Approach to Aspect Based Sentiment Analysis
Sentiment Analysis
project sentiment analysis
Sentiment analysis
Ad

Viewers also liked (20)

ODP
DBpedia: A Public Data Infrastructure for the Web of Data
PDF
DBpedia InsideOut
PPTX
NLP todo
PDF
Linked Data Fragments
PDF
Evaluating Named Entity Recognition and Disambiguation in News and Tweets
PPTX
Introduction to the Data Web, DBpedia and the Life-cycle of Linked Data
PDF
Open Education Challenge 2014: exploiting Linked Data in Educational Applicat...
PPT
Gathering Alternative Surface Forms for DBpedia Entities
PPTX
Federated SPARQL query processing over the Web of Data
ODP
Fast Approximate A-box Consistency Checking using Machine Learning
PDF
LDQL: A Query Language for the Web of Linked Data
PDF
Applying Linked Open Data to Public Procurement
PDF
Exploiting the query structure for efficient join ordering in SPARQL queries
PPT
Automatic Term Ambiguity Detection
PDF
Exploring Linked Data content through network analysis
PPT
Linked Data: What’s the Story?
PDF
A Comparison of NER Tools w.r.t. a Domain-Specific Vocabulary
ODP
Data Mining with Background Knowledge from the Web - Introducing the RapidMin...
PPTX
A Provenance assisted Roadmap for Life Sciences Linked Open Data Cloud
DBpedia: A Public Data Infrastructure for the Web of Data
DBpedia InsideOut
NLP todo
Linked Data Fragments
Evaluating Named Entity Recognition and Disambiguation in News and Tweets
Introduction to the Data Web, DBpedia and the Life-cycle of Linked Data
Open Education Challenge 2014: exploiting Linked Data in Educational Applicat...
Gathering Alternative Surface Forms for DBpedia Entities
Federated SPARQL query processing over the Web of Data
Fast Approximate A-box Consistency Checking using Machine Learning
LDQL: A Query Language for the Web of Linked Data
Applying Linked Open Data to Public Procurement
Exploiting the query structure for efficient join ordering in SPARQL queries
Automatic Term Ambiguity Detection
Exploring Linked Data content through network analysis
Linked Data: What’s the Story?
A Comparison of NER Tools w.r.t. a Domain-Specific Vocabulary
Data Mining with Background Knowledge from the Web - Introducing the RapidMin...
A Provenance assisted Roadmap for Life Sciences Linked Open Data Cloud
Ad

Similar to A Vague Sense Classifier for Detecting Vague Definitions in Ontologies (20)

PPTX
How many truths can you handle?
DOCX
PSY 540 Short Presentation Guidelines and Rubric Overvi.docx
PDF
Test bank for burns and groves the practice of nursing research-1-55.pdf
PPTX
Of Unicorns, Yetis, and Error-Free Datasets (or what is data quality?)
PDF
Fake Product Review Monitoring & Removal and Sentiment Analysis of Genuine Re...
PPTX
Discriminant Analysis.pptx
DOCX
1) A cyber crime is a crime that involves a computer and the Inter.docx
PPT
Chap008
PPTX
Fore FAIR ISMB 2019
PPTX
Human Assessment of Ontologies
PDF
How Did I Miss That Bug? Managing Cognitive Bias in Testing
PDF
Primary Printable Paper. Online assignment writing service.
PPT
Mann core study
PDF
Validity and Reliability of the Research Instrument; How to Test the Validati...
PPTX
Class Delivery Final.pptx
PPTX
Analyzing Qualitative Data for_ Research
DOC
Unit2 studyguide302
PPTX
PRACTICAL--RESEARCH--II--MODULE--1-.pptx
PPT
Study design2 6_07
PPTX
Identification of Research Problem
How many truths can you handle?
PSY 540 Short Presentation Guidelines and Rubric Overvi.docx
Test bank for burns and groves the practice of nursing research-1-55.pdf
Of Unicorns, Yetis, and Error-Free Datasets (or what is data quality?)
Fake Product Review Monitoring & Removal and Sentiment Analysis of Genuine Re...
Discriminant Analysis.pptx
1) A cyber crime is a crime that involves a computer and the Inter.docx
Chap008
Fore FAIR ISMB 2019
Human Assessment of Ontologies
How Did I Miss That Bug? Managing Cognitive Bias in Testing
Primary Printable Paper. Online assignment writing service.
Mann core study
Validity and Reliability of the Research Instrument; How to Test the Validati...
Class Delivery Final.pptx
Analyzing Qualitative Data for_ Research
Unit2 studyguide302
PRACTICAL--RESEARCH--II--MODULE--1-.pptx
Study design2 6_07
Identification of Research Problem

Recently uploaded (20)

PDF
7 ChatGPT Prompts to Help You Define Your Ideal Customer Profile.pdf
PDF
Agricultural_Statistics_at_a_Glance_2022_0.pdf
PDF
Unlocking AI with Model Context Protocol (MCP)
PPT
Teaching material agriculture food technology
PDF
cuic standard and advanced reporting.pdf
PDF
TokAI - TikTok AI Agent : The First AI Application That Analyzes 10,000+ Vira...
PDF
Blue Purple Modern Animated Computer Science Presentation.pdf.pdf
PDF
NewMind AI Weekly Chronicles - August'25 Week I
PDF
Building Integrated photovoltaic BIPV_UPV.pdf
PDF
Approach and Philosophy of On baking technology
PPTX
Digital-Transformation-Roadmap-for-Companies.pptx
PDF
Build a system with the filesystem maintained by OSTree @ COSCUP 2025
PDF
Empathic Computing: Creating Shared Understanding
PPTX
KOM of Painting work and Equipment Insulation REV00 update 25-dec.pptx
PPTX
VMware vSphere Foundation How to Sell Presentation-Ver1.4-2-14-2024.pptx
PDF
Encapsulation_ Review paper, used for researhc scholars
PDF
Optimiser vos workloads AI/ML sur Amazon EC2 et AWS Graviton
PDF
Per capita expenditure prediction using model stacking based on satellite ima...
PDF
KodekX | Application Modernization Development
PDF
How UI/UX Design Impacts User Retention in Mobile Apps.pdf
7 ChatGPT Prompts to Help You Define Your Ideal Customer Profile.pdf
Agricultural_Statistics_at_a_Glance_2022_0.pdf
Unlocking AI with Model Context Protocol (MCP)
Teaching material agriculture food technology
cuic standard and advanced reporting.pdf
TokAI - TikTok AI Agent : The First AI Application That Analyzes 10,000+ Vira...
Blue Purple Modern Animated Computer Science Presentation.pdf.pdf
NewMind AI Weekly Chronicles - August'25 Week I
Building Integrated photovoltaic BIPV_UPV.pdf
Approach and Philosophy of On baking technology
Digital-Transformation-Roadmap-for-Companies.pptx
Build a system with the filesystem maintained by OSTree @ COSCUP 2025
Empathic Computing: Creating Shared Understanding
KOM of Painting work and Equipment Insulation REV00 update 25-dec.pptx
VMware vSphere Foundation How to Sell Presentation-Ver1.4-2-14-2024.pptx
Encapsulation_ Review paper, used for researhc scholars
Optimiser vos workloads AI/ML sur Amazon EC2 et AWS Graviton
Per capita expenditure prediction using model stacking based on satellite ima...
KodekX | Application Modernization Development
How UI/UX Design Impacts User Retention in Mobile Apps.pdf

A Vague Sense Classifier for Detecting Vague Definitions in Ontologies

  • 1. A Vague Sense Classifier for Detecting Vague Definitions in Ontologies Panos Alexopoulos, John Pavlopoulos 14th Conference of the European Chapter of the Association for Computational Linguistics Gothenburg, Sweden, 26–30 April 2014
  • 2. 2 Vagueness Introduction ●Vagueness is a semantic phenomenon where predicates admit borderline cases, i.e. cases where it is not determinately true that the predicate applies or not (Shapiro 2006). ●This happens when predicates have blurred boundaries: ● What’s the threshold number of years separating old and not old films? ● What are the exact criteria that distinguish modern restaurants from non-modern?
  • 3. 3 Vagueness Consequences Introduction ●The problem with vague terms in semantic data is the possibility of disagreements! ●E.g., when we asked domain experts to provide instances of the concept Critical Business Process, there were certain processes for which there was a dispute among them about whether they should be regarded as critical or not. ●The problem was that different experts had different criteria of process criticality and could not decide which of these were sufficient to classify a process as critical.
  • 4. 4 Problematic Scenarios Introduction 1. Structuring Data with a Vague Ontology: Possible disagreement among experts when defining class and relation instances. 2. Utilizing Vague Facts in Ontology-Based Systems: Reasoning results might not meet users’ expectations 3. Integrating Vague Semantic Information: The merging of particular vague elements can lead to data that will not be valid for all its users.
  • 5. 5 Problem Definition & Approach Automatic Vagueness Detection ●Can we automatically determine whether an ontology entity (class, relation etc.) is vague or not? ● “StrategicClient” as “A client that has a high value for the company” is vague! ● “AmericanCompany” as “A company that has legal status in the Unites States” is not! Problem Definition ●We train a binary classifier that may distinguish between vague and non-vague term word senses. ●Training is supervised, using examples from Wordnet. ●We use this classifier to determine whether a given ontology element definition is vague or not. Approach
  • 6. 6 Data Automatic Vagueness Detection ●2,000 adjective senses from WordNet. ● 1,000 vague ● 1,000 non-vague ●Inter-agreement of vague/non-vague annotation among 3 human judges was 0.64 (Cohen’s Kappa) Vague Senses Non Vague Senses • Abnormal: not normal, not typical or usual or regularor conforming to a norm • Compound: composed of more than one part • Impenitent: impervious to moral persuasion • Biweekly: occurring every two weeks. • Notorious: known widely and usually unfavorably • Irregular: falling below the manufacturer's standard • Aroused: emotionally aroused • Outermost: situated at the farthest possible point from a center.
  • 7. 7 Training and Evaluation Automatic Vagueness Detection ●80% of the data used to train a multinomial Naive Bayes classifier. ●We removed stop words and we used the bag of words assumption to represent each instance. ●The remaining 20% of the data was used as a test set. ●Classification accuracy was 84%!
  • 8. 8 Comparison with Subjectivity Analyzer Automatic Vagueness Detection ●We also used a subjective sense classifier to classify our dataset’s senses as subjective or objective. ●From the 1000 vague senses, only 167 were classified as subjective while from the 1000 non-vague ones 993. ●This shows that treating vagueness in the same way as subjectiveness is not really effective.
  • 9. 9 Use Case: Detecting Vagueness in CiTO Ontology Automatic Vagueness Detection ●As an ontology use case we considered CiTO, an ontology that enables characterization of the nature or type of citations. ●CiTO consists primarily of relations, many of which are vague (e.g. plagiarizes). ●We selected 44 relations and we had 3 human judges manually classify them as vague or not. ●Then we applied our Wordnet-trained vagueness classifier on the textual definitions of the same relations.
  • 10. 10 Use Case: Detecting Vagueness in CiTO Ontology Automatic Vagueness Detection Vague Relations Non Vague Relations • plagiarizes: A property indicating that the author of the citing entity plagiarizes the cited entity, by including textual or other elements from the cited entity without formal acknowledgement of their source • sharesAuthorInstitutionWith: Each entity has at least one author that shares a common institutional affiliation with an author of the other entity • citesAsAuthority: The citing entity cites the cited entity as one that provides an authoritative description or definition of the subject under discussion. • providesDataFor: The cited entity presents data that are used in work described in the citing entity.
  • 11. 11 Use Case: Detecting Vagueness in CiTO Ontology Automatic Vagueness Detection ●Classification Results: ● 82% of relations were correctly classified as vague/non-vague ● 94% accuracy for non-vague relations. ● 74% accuracy for vague relations. ●Again, we classified the same relations with the subjectivity classifier: ● 40% of vague/non-vague relations were classified as subjective/objective respectively. ● 94% of non-vague were classified as objective. ● 7% of vague relations were classified as subjective.
  • 12. 12 Future Work Vagueness-Aware Semantic Data ●Incorporate the current classifier into an ontology analysis tool ●Improve the classifier by contemplating new features ●See whether it is possible to build a vague sense lexicon.
  • 13. 13 Questions? Thank you! iSOCO Madrid Av. del Partenón, 16-18, 1º7ª Campo de las Naciones 28042 Madrid España (t) +34 913 349 797 iSOCO Pamplona Parque Tomás Caballero, 2, 6º4ª 31006 Pamplona España (t) +34 948 102 408 iSOCO Valencia C/ Prof. Beltrán Báguena, 4 Oficina 107 46009 Valencia España (t) +34 963 467 143 iSOCO Barcelona Av. Torre Blanca, 57 Edificio ESADE CREAPOLIS Oficina 3C 15 08172 Sant Cugat del Vallès Barcelona, España (t) +34 935 677 200 iSOCO Colombia Complejo Ruta N Calle 67, 52-20 Piso 3, Torre A Medellín Colombia (t) +57 516 7770 ext. 1132 Key Vendor Virtual Assistant 2013 Quieres innovar? Dr. Panos Alexopoulos Semantic Applications Research Manager palexopoulos@isoco.com (t) +34 913 349 797