SlideShare a Scribd company logo
Semantic Intensity Spectrum At least 25 systems have been developed for ontology alignment, matching.  A classification technique for ontology alignment approaches Based on semantic intensity Semantics: the intended meanings of ontological entities Some methods consider only syntactical features (semantic poor) E.g. String distance, String equality Semantics are added via: meanings of words provided by external lexicons Positions in taxonomies Relations with other ontological entities Logic entailment  Classified instance data Project aims & targets Timeline and deliverables Semantic Intensity Spectrum Modular architecture Algorithms CMS Evaluation Lessons learnt & future work
Semantic Intensity Spectrum   SIS Diagram http://www.aktors.org/crosi/si-spectrum/ Project aims & targets Timeline and deliverables Semantic Intensity Spectrum Modular architecture Algorithms CMS Evaluation Lessons learnt & future work
Semantic Intensity Spectrum   Existing systems Duplicate efforts in developing largely overlapped algorithms Re-implement algorithms e.g. string distance Similar heuristic rules Different performance and different results w.r.t. the same test sets Project aims & targets Timeline and deliverables Semantic Intensity Spectrum Modular architecture Algorithms CMS Evaluation Lessons learnt & future work
Semantic Intensity Spectrum   Diversity of existing systems Project aims & targets Timeline and deliverables Semantic Intensity Spectrum Modular architecture Algorithms CMS Evaluation Lessons learnt & future work
Semantic Intensity Spectrum   A possible solution Combine existing systems to minimise development efforts Possibility of combining in a meaningful way Many systems output in compatible format  Heterogeneous outputs need to be normalised using heuristic rules, e.g. convert “more general than” into numeric values Reuse available packages E.g. SecondString for computing string distance Project aims & targets Timeline and deliverables Semantic Intensity Spectrum Modular architecture Algorithms CMS Evaluation Lessons learnt & future work
A principled architecture  Project aims & targets Timeline and deliverables Semantic Intensity Spectrum Modular architecture Algorithms CMS Evaluation Lessons learnt & future work
A principled architecture:   Signature Extraction Ontologies can be captured with a set of ontological signatures Local signatures: Labels, IDs, and URIs Declaimed properties, property domains and ranges Equivalent and complement classes, inverse and functional properties Instantiated classes Global signatures: Super-, sub-classes, properties Disjoint classes Sibling classes Comments, version information Project aims & targets Timeline and deliverables Semantic Intensity Spectrum Modular architecture Algorithms CMS Evaluation Lessons learnt & future work
A principled architecture:   Multiple matchers Specialised internal matchers targeting at particular signatures Name matchers String distance based matchers WordNet based matchers Class matchers Taxonomy based matchers Definition based matchers Invoking existing ontology matching/alignment systems as external matchers FOAM API INRIA Alignment API Project aims & targets Timeline and deliverables Semantic Intensity Spectrum Modular architecture Algorithms CMS Evaluation Lessons learnt & future work
CMS design commitments Avoid  reinventing the wheel Use existing packages to enhance internal matchers Use existing mapping/alignment systems as external matchers Semantically enriched matchers based on the definition of concepts Propagate similarity along concept hierarchies Refine concept similarity by taking into account the names, domains and ranges of declared properties Compute similarity using WordNet hierarchies Project aims & targets Timeline and deliverables Semantic Intensity Spectrum Modular architecture Algorithms CMS Evaluation Lessons learnt & future work
String distance  Reuse of existing packages SecondString Metrics Jaro, MongeElkan, NeedlemanWunsh, etc. Soundex Metrics Consider only the local names of ontological entities  Namespace is ignored Names of super(sub)classes are ignored Project aims & targets Timeline and deliverables Semantic Intensity Spectrum Modular architecture Algorithms CMS Evaluation Lessons learnt & future work
WordNet-based algorithms Use JWNL WordNet Java Lib Names only Synonyms are retrieved and compared with string equality or string distance Composite names are split and stop words are removed E.g. “has_name” => “name” WordNet hierarchy Calculate distance between two Words Project aims & targets Timeline and deliverables Semantic Intensity Spectrum Modular architecture Algorithms CMS Evaluation Lessons learnt & future work
WordNet-based algorithms   WordNet hierarchy  h  the distance between “Word” and Root h’  the distance between “Word’” and Root H  the distance between common subsumer of “Word” and “Word’” and Root Similarity between “Word” and “Word’” is computed as  2H/(h+h’) Root Common  Subsumer Word’ Word Project aims & targets Timeline and deliverables Semantic Intensity Spectrum Modular architecture Algorithms CMS Evaluation Lessons learnt & future work H h’ h
Canonical Name C=> A.B.D.C C’=> A’.B’.D’.E’.C’ Compute the similarity between C and C’ as well as the respective similarity between every pair of super classes of C and C’ Penalise the similarity between C and C’ with those of their super classes C’ A’ B’ D’ E’ A B C D Project aims & targets Timeline and deliverables Semantic Intensity Spectrum Modular architecture Algorithms CMS Evaluation Lessons learnt & future work
Structure algorithm f  (name similarity, domain similarity, range similarity ) Project aims & targets Timeline and deliverables Semantic Intensity Spectrum Modular architecture Algorithms CMS Evaluation Lessons learnt & future work P1’(C’, B’) P2’ P3’ P1(C, B) P2 P3 H G Domain of P1 H’ G’ I’ Domain of P1’ A B D Range of P1 E A’ B’ D’ E’ Range of P1’ F’
Structure algorithm cnt’d Structure Retrieve declaimed properties For each property, retrieve its domains and ranges Compare property’s name, domain and range  StructurePlus Compare also the super and sub classes of property’s domain and range When compare domains and ranges, using existing name matching techniques Project aims & targets Timeline and deliverables Semantic Intensity Spectrum Modular architecture Algorithms CMS Evaluation Lessons learnt & future work
Implemented Matchers powered by existing java libs Project aims & targets Timeline and deliverables Semantic Intensity Spectrum Modular architecture Algorithms CMS Evaluation Lessons learnt & future work
Post alignment  Aggregator Weighted average based aggregation  Weights are manually set by users Evaluator Nothing is more qualified than a human inspector with domain knowledge and experiences Project aims & targets Timeline and deliverables Semantic Intensity Spectrum Modular architecture Algorithms CMS Evaluation Lessons learnt & future work
CMS: deployment options Run CMS from command line A batch file is provided Invoke CMS as an API Run CMS as a service  via JSP interface Project aims & targets Timeline and deliverables Semantic Intensity Spectrum Modular architecture Algorithms CMS Evaluation Lessons learnt & future work
Demo Ontologies: web directory, small size and simple structure Run with different weights Output to different formats OWL, SKOS, HTML, XML(OAEI) Project aims & targets Timeline and deliverables Semantic Intensity Spectrum Modular architecture Algorithms CMS Evaluation Lessons learnt & future work
Use of SIS Declaimed functionalities of existing systems can be justified against this spectrum A reference for selecting the right mapping techniques for a particular problem A designer’s aid for navigating through different mapping approaches with emphasis on the use of semantics Project aims & targets Timeline and deliverables Semantic Intensity Spectrum Modular architecture Algorithms CMS Evaluation Lessons learnt & future work

More Related Content

PPT
Viva slides_secured objective programming
PPT
Initiating Organisational Memories using Ontology Network Analysis - 2002
PPT
Reasoning on the Semantic Web
PPT
Using Ontologies to Support and Critique Decisions - 2004
PPT
Semantic technologies at work
PDF
Lucania stretta tra ENI e TOTAL: petrolio la nostra salvezza o la nostra rov...
PPTX
From information to intelligence
PPT
On the Emergent Semantic Web and Overlooked Issues - 2004
Viva slides_secured objective programming
Initiating Organisational Memories using Ontology Network Analysis - 2002
Reasoning on the Semantic Web
Using Ontologies to Support and Critique Decisions - 2004
Semantic technologies at work
Lucania stretta tra ENI e TOTAL: petrolio la nostra salvezza o la nostra rov...
From information to intelligence
On the Emergent Semantic Web and Overlooked Issues - 2004

Viewers also liked (11)

PPT
A Channel Theoretic Foundation for Ontology Coordination - 2004
PPT
Advanced Knowledge Technologies (AKT) -highlights 2006
PPT
Portable Ontology Alignment Fragments - 2008
PPT
E Res Akt Finalreview
PPT
Semantic technologies as an investment opportunity
PPT
Semantic technologies at work - 2007
PPT
SPARQL and SQL: technical aspects and synergy
PPT
Web 2.0 and mobile web
PPT
Semantic Technologies - 2007
PPT
Information Flow based Ontology Mapping - 2002
PPT
Artificial Intelligence
A Channel Theoretic Foundation for Ontology Coordination - 2004
Advanced Knowledge Technologies (AKT) -highlights 2006
Portable Ontology Alignment Fragments - 2008
E Res Akt Finalreview
Semantic technologies as an investment opportunity
Semantic technologies at work - 2007
SPARQL and SQL: technical aspects and synergy
Web 2.0 and mobile web
Semantic Technologies - 2007
Information Flow based Ontology Mapping - 2002
Artificial Intelligence
Ad

Similar to Semantic Intensity Spectrum and Semantic Integration Algorithms (20)

PPT
Searching Repositories of Web Application Models
PPT
A Research Literature Search Engine With Abbreviation Recognition
PPT
STAT Requirement Analysis
PDF
Aws certified solutions_architect_associate_blueprint
PPTX
슬라이드 1
PPTX
G2g offerings-online campus drives
ODP
Software Patterns
PPT
Modeling Search Computing Applications
PPT
Rhapsody Systems Software
PPT
Sedna XML Database: Query Parser & Optimizing Rewriter
PPT
Final Jspring2009 Mda Slimmer Ontwikkelen Van Java Ee Applicaties
PPTX
Agentic RAG and Small & Specialized Models v1.6.pptx
PPT
Extending Rotor with Structural Reflection to support Reflective Languages
PDF
Large-Scale Text Processing Pipeline with Spark ML and GraphFrames: Spark Sum...
PPT
PDF
Query optimization to improve performance of the code execution
PDF
11.query optimization to improve performance of the code execution
PPT
Sw Software Design
PDF
LONG_Dong_CV
PDF
Introduction to Machine Learning with SciKit-Learn
Searching Repositories of Web Application Models
A Research Literature Search Engine With Abbreviation Recognition
STAT Requirement Analysis
Aws certified solutions_architect_associate_blueprint
슬라이드 1
G2g offerings-online campus drives
Software Patterns
Modeling Search Computing Applications
Rhapsody Systems Software
Sedna XML Database: Query Parser & Optimizing Rewriter
Final Jspring2009 Mda Slimmer Ontwikkelen Van Java Ee Applicaties
Agentic RAG and Small & Specialized Models v1.6.pptx
Extending Rotor with Structural Reflection to support Reflective Languages
Large-Scale Text Processing Pipeline with Spark ML and GraphFrames: Spark Sum...
Query optimization to improve performance of the code execution
11.query optimization to improve performance of the code execution
Sw Software Design
LONG_Dong_CV
Introduction to Machine Learning with SciKit-Learn
Ad

Recently uploaded (20)

PPTX
Cloud computing and distributed systems.
PPTX
Digital-Transformation-Roadmap-for-Companies.pptx
PDF
Approach and Philosophy of On baking technology
PPTX
Programs and apps: productivity, graphics, security and other tools
PDF
Per capita expenditure prediction using model stacking based on satellite ima...
PDF
Advanced methodologies resolving dimensionality complications for autism neur...
PDF
Network Security Unit 5.pdf for BCA BBA.
PPTX
Machine Learning_overview_presentation.pptx
PPTX
Spectroscopy.pptx food analysis technology
PDF
Dropbox Q2 2025 Financial Results & Investor Presentation
PDF
Assigned Numbers - 2025 - Bluetooth® Document
PPTX
Big Data Technologies - Introduction.pptx
PDF
Profit Center Accounting in SAP S/4HANA, S4F28 Col11
PDF
Review of recent advances in non-invasive hemoglobin estimation
PDF
NewMind AI Weekly Chronicles - August'25-Week II
PPT
“AI and Expert System Decision Support & Business Intelligence Systems”
PPTX
sap open course for s4hana steps from ECC to s4
PDF
Optimiser vos workloads AI/ML sur Amazon EC2 et AWS Graviton
PDF
A comparative analysis of optical character recognition models for extracting...
PPTX
KOM of Painting work and Equipment Insulation REV00 update 25-dec.pptx
Cloud computing and distributed systems.
Digital-Transformation-Roadmap-for-Companies.pptx
Approach and Philosophy of On baking technology
Programs and apps: productivity, graphics, security and other tools
Per capita expenditure prediction using model stacking based on satellite ima...
Advanced methodologies resolving dimensionality complications for autism neur...
Network Security Unit 5.pdf for BCA BBA.
Machine Learning_overview_presentation.pptx
Spectroscopy.pptx food analysis technology
Dropbox Q2 2025 Financial Results & Investor Presentation
Assigned Numbers - 2025 - Bluetooth® Document
Big Data Technologies - Introduction.pptx
Profit Center Accounting in SAP S/4HANA, S4F28 Col11
Review of recent advances in non-invasive hemoglobin estimation
NewMind AI Weekly Chronicles - August'25-Week II
“AI and Expert System Decision Support & Business Intelligence Systems”
sap open course for s4hana steps from ECC to s4
Optimiser vos workloads AI/ML sur Amazon EC2 et AWS Graviton
A comparative analysis of optical character recognition models for extracting...
KOM of Painting work and Equipment Insulation REV00 update 25-dec.pptx

Semantic Intensity Spectrum and Semantic Integration Algorithms

  • 1. Semantic Intensity Spectrum At least 25 systems have been developed for ontology alignment, matching. A classification technique for ontology alignment approaches Based on semantic intensity Semantics: the intended meanings of ontological entities Some methods consider only syntactical features (semantic poor) E.g. String distance, String equality Semantics are added via: meanings of words provided by external lexicons Positions in taxonomies Relations with other ontological entities Logic entailment Classified instance data Project aims & targets Timeline and deliverables Semantic Intensity Spectrum Modular architecture Algorithms CMS Evaluation Lessons learnt & future work
  • 2. Semantic Intensity Spectrum SIS Diagram http://www.aktors.org/crosi/si-spectrum/ Project aims & targets Timeline and deliverables Semantic Intensity Spectrum Modular architecture Algorithms CMS Evaluation Lessons learnt & future work
  • 3. Semantic Intensity Spectrum Existing systems Duplicate efforts in developing largely overlapped algorithms Re-implement algorithms e.g. string distance Similar heuristic rules Different performance and different results w.r.t. the same test sets Project aims & targets Timeline and deliverables Semantic Intensity Spectrum Modular architecture Algorithms CMS Evaluation Lessons learnt & future work
  • 4. Semantic Intensity Spectrum Diversity of existing systems Project aims & targets Timeline and deliverables Semantic Intensity Spectrum Modular architecture Algorithms CMS Evaluation Lessons learnt & future work
  • 5. Semantic Intensity Spectrum A possible solution Combine existing systems to minimise development efforts Possibility of combining in a meaningful way Many systems output in compatible format Heterogeneous outputs need to be normalised using heuristic rules, e.g. convert “more general than” into numeric values Reuse available packages E.g. SecondString for computing string distance Project aims & targets Timeline and deliverables Semantic Intensity Spectrum Modular architecture Algorithms CMS Evaluation Lessons learnt & future work
  • 6. A principled architecture Project aims & targets Timeline and deliverables Semantic Intensity Spectrum Modular architecture Algorithms CMS Evaluation Lessons learnt & future work
  • 7. A principled architecture: Signature Extraction Ontologies can be captured with a set of ontological signatures Local signatures: Labels, IDs, and URIs Declaimed properties, property domains and ranges Equivalent and complement classes, inverse and functional properties Instantiated classes Global signatures: Super-, sub-classes, properties Disjoint classes Sibling classes Comments, version information Project aims & targets Timeline and deliverables Semantic Intensity Spectrum Modular architecture Algorithms CMS Evaluation Lessons learnt & future work
  • 8. A principled architecture: Multiple matchers Specialised internal matchers targeting at particular signatures Name matchers String distance based matchers WordNet based matchers Class matchers Taxonomy based matchers Definition based matchers Invoking existing ontology matching/alignment systems as external matchers FOAM API INRIA Alignment API Project aims & targets Timeline and deliverables Semantic Intensity Spectrum Modular architecture Algorithms CMS Evaluation Lessons learnt & future work
  • 9. CMS design commitments Avoid reinventing the wheel Use existing packages to enhance internal matchers Use existing mapping/alignment systems as external matchers Semantically enriched matchers based on the definition of concepts Propagate similarity along concept hierarchies Refine concept similarity by taking into account the names, domains and ranges of declared properties Compute similarity using WordNet hierarchies Project aims & targets Timeline and deliverables Semantic Intensity Spectrum Modular architecture Algorithms CMS Evaluation Lessons learnt & future work
  • 10. String distance Reuse of existing packages SecondString Metrics Jaro, MongeElkan, NeedlemanWunsh, etc. Soundex Metrics Consider only the local names of ontological entities Namespace is ignored Names of super(sub)classes are ignored Project aims & targets Timeline and deliverables Semantic Intensity Spectrum Modular architecture Algorithms CMS Evaluation Lessons learnt & future work
  • 11. WordNet-based algorithms Use JWNL WordNet Java Lib Names only Synonyms are retrieved and compared with string equality or string distance Composite names are split and stop words are removed E.g. “has_name” => “name” WordNet hierarchy Calculate distance between two Words Project aims & targets Timeline and deliverables Semantic Intensity Spectrum Modular architecture Algorithms CMS Evaluation Lessons learnt & future work
  • 12. WordNet-based algorithms WordNet hierarchy h the distance between “Word” and Root h’ the distance between “Word’” and Root H the distance between common subsumer of “Word” and “Word’” and Root Similarity between “Word” and “Word’” is computed as 2H/(h+h’) Root Common Subsumer Word’ Word Project aims & targets Timeline and deliverables Semantic Intensity Spectrum Modular architecture Algorithms CMS Evaluation Lessons learnt & future work H h’ h
  • 13. Canonical Name C=> A.B.D.C C’=> A’.B’.D’.E’.C’ Compute the similarity between C and C’ as well as the respective similarity between every pair of super classes of C and C’ Penalise the similarity between C and C’ with those of their super classes C’ A’ B’ D’ E’ A B C D Project aims & targets Timeline and deliverables Semantic Intensity Spectrum Modular architecture Algorithms CMS Evaluation Lessons learnt & future work
  • 14. Structure algorithm f (name similarity, domain similarity, range similarity ) Project aims & targets Timeline and deliverables Semantic Intensity Spectrum Modular architecture Algorithms CMS Evaluation Lessons learnt & future work P1’(C’, B’) P2’ P3’ P1(C, B) P2 P3 H G Domain of P1 H’ G’ I’ Domain of P1’ A B D Range of P1 E A’ B’ D’ E’ Range of P1’ F’
  • 15. Structure algorithm cnt’d Structure Retrieve declaimed properties For each property, retrieve its domains and ranges Compare property’s name, domain and range StructurePlus Compare also the super and sub classes of property’s domain and range When compare domains and ranges, using existing name matching techniques Project aims & targets Timeline and deliverables Semantic Intensity Spectrum Modular architecture Algorithms CMS Evaluation Lessons learnt & future work
  • 16. Implemented Matchers powered by existing java libs Project aims & targets Timeline and deliverables Semantic Intensity Spectrum Modular architecture Algorithms CMS Evaluation Lessons learnt & future work
  • 17. Post alignment Aggregator Weighted average based aggregation Weights are manually set by users Evaluator Nothing is more qualified than a human inspector with domain knowledge and experiences Project aims & targets Timeline and deliverables Semantic Intensity Spectrum Modular architecture Algorithms CMS Evaluation Lessons learnt & future work
  • 18. CMS: deployment options Run CMS from command line A batch file is provided Invoke CMS as an API Run CMS as a service via JSP interface Project aims & targets Timeline and deliverables Semantic Intensity Spectrum Modular architecture Algorithms CMS Evaluation Lessons learnt & future work
  • 19. Demo Ontologies: web directory, small size and simple structure Run with different weights Output to different formats OWL, SKOS, HTML, XML(OAEI) Project aims & targets Timeline and deliverables Semantic Intensity Spectrum Modular architecture Algorithms CMS Evaluation Lessons learnt & future work
  • 20. Use of SIS Declaimed functionalities of existing systems can be justified against this spectrum A reference for selecting the right mapping techniques for a particular problem A designer’s aid for navigating through different mapping approaches with emphasis on the use of semantics Project aims & targets Timeline and deliverables Semantic Intensity Spectrum Modular architecture Algorithms CMS Evaluation Lessons learnt & future work