SlideShare a Scribd company logo
Work Together Effectively
Cross Media Concept and
Entity Driven Search for
Enterprise
Chalitha Perera and Dileepa Jayakody
R&D Engineers
Work Together Effectively
•  Headquartered in London with office in Colombo, Sri Lanka
•  Focused on delivering enterprise content management solutions
•  Our Skills
Work Together Effectively
Zaizi R&D Department
•  Giving sense to the content  
–  Enriching it semantically
•  Adding value to ECM/CMS
–  More structured content, easy to manage, link and search
•  Improving search
–  Across different domains, data sources, User Experience
•  Machine Learning applied research
Work Together Effectively
Agenda
•  Problem
•  Solution
•  Sensefy and MICO
•  Demo
•  Q&A
Work Together Effectively
Problem
•  Unstructured Text Content
–  Text documents, PDFs, Word …
•  Rapid growth in multimedia content
•  Heterogeneous Data Sources
–  ECMs (Alfresco, Sharepoint), File System,
Confluence, JIRA …
•  Data is not useful without effective methods for
–  Knowledge Extraction
–  Information Retrieval
Work Together Effectively
Current Enterprise Search
Limitations
•  Limited to keyword based search
•  Search context is not considered
•  Ambiguity of terms
•  Low precision
•  Inability to properly handle multimedia files
Work Together Effectively
Desired traits of Solution
•  Semantically Enhance documents
–  Unstructured text
–  Multimedia documents
•  Cross media search
•  Search with semantic concepts and entities
•  Federated Search
–  Search across different content repositories
–  User permissions
Work Together Effectively
Sensefy
•  Semantic Enterprise Search Engine
•  Cross Media Search
•  Federated Search
•  Smart Search Assistance
•  Open Source
Work Together Effectively
Sensefy Architecture
Work Together Effectively
Repository Crawler
•  Four types of connectors
–  Repository Connectors
–  Authority Connectors
–  Transformation Connectors
–  Output Connectors
•  Connect different source repositories with different target indexes
–  Source repositories (Alfresco, Sharepoint, Confluence etc)
–  Target Indexes (Solr, ElasticSearch, Amazon CloudSearch)
•  Security Model to enforce source repository security policies
Work Together Effectively
Media In Context (MICO)
Platform
•  MICO provides an integrated platform for
–  Cross media analysis
–  Metadata publishing
–  Metadata querying
•  Sensefy uses MICO as the cross media analysis engine to extract entities and concepts
from multimedia
Work Together Effectively
Cross Media Extraction Pipeline
Work Together Effectively
Semantic Content Enrichment
•  Named Entity Recognition
–  People, places, organizations and concepts
•  Entity Linking
–  DBpedia, Yago, Custom Enterprise knowledge bases
•  Entity Disambiguation
Work Together Effectively
Entity Search with Suggestions
•  Named Entity Suggestions
•  Ability to query with disambiguated entities
•  Search results with high precision
–  Keyword search results for “ronaldo”-  “Cristiano Ronaldo” and “Ronaldo”
–  Entity Search - will contain only the documents related to selected entity
Work Together Effectively
Entity Search with Suggestions
•  Combine entities and concepts for more complex queries
Work Together Effectively
DEMO
Work Together Effectively
Q&A
Work Together Effectively
Thank you.

More Related Content

PDF
Chalitha Perera | Cross Media Concept and Entity Driven Search for Enterprise
PPTX
Using your mlis skills in information architecture
PPTX
Planning for Project Cortex
PPTX
Big data analytics data structure
PDF
Evaluating Search in Digital Cultural Heritage: Thinking Outside the (Search)...
PPT
When Is A Digital Library Not A Digital
PPTX
Metadata primer for technical communicators
PPT
Zuse digital search engine optimization rankings, tactics & trends
Chalitha Perera | Cross Media Concept and Entity Driven Search for Enterprise
Using your mlis skills in information architecture
Planning for Project Cortex
Big data analytics data structure
Evaluating Search in Digital Cultural Heritage: Thinking Outside the (Search)...
When Is A Digital Library Not A Digital
Metadata primer for technical communicators
Zuse digital search engine optimization rankings, tactics & trends

What's hot (9)

KEY
Intro to Info Arch
PPTX
DRI Community Forum - Repository Reports
PPTX
AIIM Virtual event Feb 21 2019_Aria Consulting
PPTX
SHMcloud vision
PPT
Developments in Access to Art Information: EnCompass Digital Portal. 2003
PDF
The Enterprise Search Market in a Nutshell
PDF
Linked Open Data in the World of Patents
DOCX
Project criteria for self directed
PPTX
SPC.Org - SharePoint 2013 Search
Intro to Info Arch
DRI Community Forum - Repository Reports
AIIM Virtual event Feb 21 2019_Aria Consulting
SHMcloud vision
Developments in Access to Art Information: EnCompass Digital Portal. 2003
The Enterprise Search Market in a Nutshell
Linked Open Data in the World of Patents
Project criteria for self directed
SPC.Org - SharePoint 2013 Search
Ad

Similar to cross media concept and entity driven search for enterprise (20)

PDF
ECIR-2014: Multilanguage Content Discovery Through Entity Driven Search
PDF
Content Discovery Through Entity Driven Search
PPTX
European SharePoint Conference Automated Tagging and Metadata Management w...
PDF
A Test-Bed For The Correlation Center Of Digital Services
PPTX
#SEASPC: Information Architecture and Enterprise Search - Better Together
PPTX
Share Point2007 Best Practices Final
PPT
EMC Documentum Product Line Overview
PPTX
InfoFusion Overview And Roadmap
PPT
Content Management, Metadata and Semantic Web
PPT
Content Management, Metadata and Semantic Web
PDF
Aiim Webinar Helen Mitchell Unified Search Final 7 21 2010
PDF
FAST for SharePoint 2010: How and Why?
PPTX
Taxonomies for Publishing
PPTX
Webinar: Enterprise Search in 2025
PDF
KMWorld Martin Briefing
PDF
Coexist or Integrate? Manage Unstructured Content from Diverse Repositories a...
PPT
Implementing Semantic Search
PDF
Semantic Web For Dummies
PPTX
SPConnections Amsterdam: Beyond the Search Center - Application or Solution? ...
PDF
AI in multi billion search engines. Building AI and Search teams
ECIR-2014: Multilanguage Content Discovery Through Entity Driven Search
Content Discovery Through Entity Driven Search
European SharePoint Conference Automated Tagging and Metadata Management w...
A Test-Bed For The Correlation Center Of Digital Services
#SEASPC: Information Architecture and Enterprise Search - Better Together
Share Point2007 Best Practices Final
EMC Documentum Product Line Overview
InfoFusion Overview And Roadmap
Content Management, Metadata and Semantic Web
Content Management, Metadata and Semantic Web
Aiim Webinar Helen Mitchell Unified Search Final 7 21 2010
FAST for SharePoint 2010: How and Why?
Taxonomies for Publishing
Webinar: Enterprise Search in 2025
KMWorld Martin Briefing
Coexist or Integrate? Manage Unstructured Content from Diverse Repositories a...
Implementing Semantic Search
Semantic Web For Dummies
SPConnections Amsterdam: Beyond the Search Center - Application or Solution? ...
AI in multi billion search engines. Building AI and Search teams
Ad

Recently uploaded (20)

PDF
Electronic commerce courselecture one. Pdf
PDF
Per capita expenditure prediction using model stacking based on satellite ima...
PDF
Architecting across the Boundaries of two Complex Domains - Healthcare & Tech...
PPTX
Understanding_Digital_Forensics_Presentation.pptx
PDF
How UI/UX Design Impacts User Retention in Mobile Apps.pdf
PPTX
Digital-Transformation-Roadmap-for-Companies.pptx
PDF
Review of recent advances in non-invasive hemoglobin estimation
PDF
Agricultural_Statistics_at_a_Glance_2022_0.pdf
PPTX
Effective Security Operations Center (SOC) A Modern, Strategic, and Threat-In...
PDF
Chapter 3 Spatial Domain Image Processing.pdf
PDF
Building Integrated photovoltaic BIPV_UPV.pdf
PDF
TokAI - TikTok AI Agent : The First AI Application That Analyzes 10,000+ Vira...
PDF
KodekX | Application Modernization Development
PDF
NewMind AI Weekly Chronicles - August'25 Week I
PPTX
sap open course for s4hana steps from ECC to s4
PDF
MIND Revenue Release Quarter 2 2025 Press Release
PPTX
MYSQL Presentation for SQL database connectivity
PDF
cuic standard and advanced reporting.pdf
PPTX
Programs and apps: productivity, graphics, security and other tools
PDF
Build a system with the filesystem maintained by OSTree @ COSCUP 2025
Electronic commerce courselecture one. Pdf
Per capita expenditure prediction using model stacking based on satellite ima...
Architecting across the Boundaries of two Complex Domains - Healthcare & Tech...
Understanding_Digital_Forensics_Presentation.pptx
How UI/UX Design Impacts User Retention in Mobile Apps.pdf
Digital-Transformation-Roadmap-for-Companies.pptx
Review of recent advances in non-invasive hemoglobin estimation
Agricultural_Statistics_at_a_Glance_2022_0.pdf
Effective Security Operations Center (SOC) A Modern, Strategic, and Threat-In...
Chapter 3 Spatial Domain Image Processing.pdf
Building Integrated photovoltaic BIPV_UPV.pdf
TokAI - TikTok AI Agent : The First AI Application That Analyzes 10,000+ Vira...
KodekX | Application Modernization Development
NewMind AI Weekly Chronicles - August'25 Week I
sap open course for s4hana steps from ECC to s4
MIND Revenue Release Quarter 2 2025 Press Release
MYSQL Presentation for SQL database connectivity
cuic standard and advanced reporting.pdf
Programs and apps: productivity, graphics, security and other tools
Build a system with the filesystem maintained by OSTree @ COSCUP 2025

cross media concept and entity driven search for enterprise

  • 1. Work Together Effectively Cross Media Concept and Entity Driven Search for Enterprise Chalitha Perera and Dileepa Jayakody R&D Engineers
  • 2. Work Together Effectively •  Headquartered in London with office in Colombo, Sri Lanka •  Focused on delivering enterprise content management solutions •  Our Skills
  • 3. Work Together Effectively Zaizi R&D Department •  Giving sense to the content   –  Enriching it semantically •  Adding value to ECM/CMS –  More structured content, easy to manage, link and search •  Improving search –  Across different domains, data sources, User Experience •  Machine Learning applied research
  • 4. Work Together Effectively Agenda •  Problem •  Solution •  Sensefy and MICO •  Demo •  Q&A
  • 5. Work Together Effectively Problem •  Unstructured Text Content –  Text documents, PDFs, Word … •  Rapid growth in multimedia content •  Heterogeneous Data Sources –  ECMs (Alfresco, Sharepoint), File System, Confluence, JIRA … •  Data is not useful without effective methods for –  Knowledge Extraction –  Information Retrieval
  • 6. Work Together Effectively Current Enterprise Search Limitations •  Limited to keyword based search •  Search context is not considered •  Ambiguity of terms •  Low precision •  Inability to properly handle multimedia files
  • 7. Work Together Effectively Desired traits of Solution •  Semantically Enhance documents –  Unstructured text –  Multimedia documents •  Cross media search •  Search with semantic concepts and entities •  Federated Search –  Search across different content repositories –  User permissions
  • 8. Work Together Effectively Sensefy •  Semantic Enterprise Search Engine •  Cross Media Search •  Federated Search •  Smart Search Assistance •  Open Source
  • 10. Work Together Effectively Repository Crawler •  Four types of connectors –  Repository Connectors –  Authority Connectors –  Transformation Connectors –  Output Connectors •  Connect different source repositories with different target indexes –  Source repositories (Alfresco, Sharepoint, Confluence etc) –  Target Indexes (Solr, ElasticSearch, Amazon CloudSearch) •  Security Model to enforce source repository security policies
  • 11. Work Together Effectively Media In Context (MICO) Platform •  MICO provides an integrated platform for –  Cross media analysis –  Metadata publishing –  Metadata querying •  Sensefy uses MICO as the cross media analysis engine to extract entities and concepts from multimedia
  • 12. Work Together Effectively Cross Media Extraction Pipeline
  • 13. Work Together Effectively Semantic Content Enrichment •  Named Entity Recognition –  People, places, organizations and concepts •  Entity Linking –  DBpedia, Yago, Custom Enterprise knowledge bases •  Entity Disambiguation
  • 14. Work Together Effectively Entity Search with Suggestions •  Named Entity Suggestions •  Ability to query with disambiguated entities •  Search results with high precision –  Keyword search results for “ronaldo”-  “Cristiano Ronaldo” and “Ronaldo” –  Entity Search - will contain only the documents related to selected entity
  • 15. Work Together Effectively Entity Search with Suggestions •  Combine entities and concepts for more complex queries