SlideShare a Scribd company logo
ENTERPRISE SEARCH SOLUTION
Ranjan Baisak(RB)
rb@ezmap.in
blog.ezmap.in
2 Copyright © 2013 Juniper Networks, Inc. www.juniper.net
SCENARIOS
I want to see facts
from different
sources describing
EX4200
I want to more
about EX4200
about its chassis
specifications
I want to know
power supply
details of EX4200
switches
I want to know
more about
Juniper’s Security
solutions
I want to know
Juniper’s Data
center offerings
I want to see
customer’s
feedback about
Junos
3 Copyright © 2013 Juniper Networks, Inc. www.juniper.net
1. Find Information faster
 Provide search assistants
2. Reveal hidden information
 Enrich the search index with background
knowledge
3. Find more specific information
 Query the semantic web
4. Find linked information
 Integrate data sources
4 Copyright © 2013 Juniper Networks, Inc. www.juniper.net
FIND INFORMATION FASTER
To Provide a powerful auto-complete based on past searches
To have a enterprise vocabulary to assist in auto-complete
I can’t
remember
how to spell
the search
term
5 Copyright © 2013 Juniper Networks, Inc. www.juniper.net
FIND INFORMATION FASTER
High Performance Networking Search
I can’t
remember
exactly what I
was looking
for
6 Copyright © 2013 Juniper Networks, Inc. www.juniper.net
FIND INFORMATION FASTER WITH RELATED SEARCH
TERM
EX4200 Search
7 Copyright © 2013 Juniper Networks, Inc. www.juniper.net
REVEAL HIDDEN INFORMATION WITH QUERY
EXPANSION
VLAN SearchOR ”VLAN Configuration"
Ontology
VLAN
VLAN Configuration
alternative Label
preferred Label
8 Copyright © 2013 Juniper Networks, Inc. www.juniper.net
FIND LINKED INFORMATION
IX Content
KB
TechNotes
J-Net
Technical
Bulletin
EX4200
9 Copyright © 2013 Juniper Networks, Inc. www.juniper.net
WHY ENTITIES IMPORTANT
Entities are new driver.
Entities are generated relational mapping that uncovers the
association between different data points.
Entities becomes trusted points around which other data
revolves.
10 Copyright © 2013 Juniper Networks, Inc. www.juniper.net
THE ROLE OF ONTOLOGY IN SEMANTIC SEARCH
Junos
12.2 R1
Junos 12.2
JunosSoftware
Sep 05
2012
ACX
1000
ACX
2000
M7i
M10i
MX5
MX10
T320
T640
EX2200
EX4200
ACX
M
Series
MX
T
Series
EX
Series
Routing
Switching
Hardware
Released on
Belongs to
Supported on
Belongs To
Belongs To
Belongs To
11 Copyright © 2013 Juniper Networks, Inc. www.juniper.net
HOW TO EXTRACT ENTITIES
Top level view, how entities are defined.
 We have data sources from iX contents, KB, Technets, J-Net
forums, Technical Bulletins etc…
 Through entity detection and raw relation detection raw text
extracted from web pages in unstructured data format at one end
becomes a responsive, refined entity that can provide an intelligent
answer in semantic search.
12 Copyright © 2013 Juniper Networks, Inc. www.juniper.net
ONTOLOGY A CENTRAL POINT TO CONTROL
Labels and Query Expansion
Faceted Search
Refine Search Mechanism
Metadata Integration
Search Services
Search Application
Collector
Semantic
Indexer
Document
Index
Ontology
Manager
Extractor
HTML
Ontology
DB
13 Copyright © 2013 Juniper Networks, Inc. www.juniper.net
PROBLEM WITH CURRENT SEARCH
Ambiguity and association in natural language
 Search is based on text
 Polysemy: Words often have a multitude of meanings and different
types of usage (more urgent for very heterogeneous collections).
 Synonymy: Different terms may have an identical or a similar
meaning (weaker: words indicating the same topic).
14 Copyright © 2013 Juniper Networks, Inc. www.juniper.net
LATENT SEMANTIC INDEXING
Latent Semantic Indexing (LSI) means analyzing documents to find the
underlying/latent meaning/semantics or concepts of those documents.
The fundamental difficulty in finding relevant documents from search words is that
what we really want is to compare the meanings or concepts behind the words.
LSA attempts to solve this problem by mapping both words and documents into
a "concept" space and doing the comparison in this space.
LSI overcomes two of the most problematic constraints of Boolean keyword
queries:
 multiple words that have similar meanings (synonymy)
 words that have more than one meaning (polysemy).
Text does not need to be in sentence form for LSI to be effective. It can work
with lists, free-form notes, email, web content, etc.
LSI is also used to perform automated document categorization and clustering. In
fact, several experiments have demonstrated that there are a number of
correlations between the way LSI and humans process and categorize text.
Semantic based Enterprise Search Solution in Networking Domain

More Related Content

PDF
B07040308
PPTX
Using mel cat opposing views
PPTX
Components of a search engine
PDF
HeirList Knowledge Base Access
PDF
AELA: An Adaptive Entity Linking Approach
PPTX
Share point metadata
DOC
Search engine
PPTX
Introduction to ActiveReports Server by GrapeCity
B07040308
Using mel cat opposing views
Components of a search engine
HeirList Knowledge Base Access
AELA: An Adaptive Entity Linking Approach
Share point metadata
Search engine
Introduction to ActiveReports Server by GrapeCity

Similar to Semantic based Enterprise Search Solution in Networking Domain (20)

PDF
ECIR-2014: Multilanguage Content Discovery Through Entity Driven Search
PDF
Content Discovery Through Entity Driven Search
ODP
The search engine index
PPTX
Share Point2007 Best Practices Final
PPTX
How search engines work Anand Saini
PDF
II-SDV 2012 Patent Prior-Art Searching with Latent Semantic Analysis
PDF
Sql Saturday 111 Atlanta applied enterprise semantic mining
PPT
Search overview
PPTX
Eureka, I found it! - Special Libraries Association 2021 Presentation
PDF
Crowdsourced query augmentation through the semantic discovery of domain spec...
PPTX
Discovery: Beyond Initial Implementation & Participation - and into Collabora...
PPTX
Intro to elasticsearch
PDF
The Enterprise Search Market in a Nutshell
PDF
Mastering Elasticsearch 2nd Edition Edition Rafal Kuc
PPTX
Info 2402 irt-chapter_2
PPTX
Indexing Techniques: Their Usage in Search Engines for Information Retrieval
PPT
Building Search Systems for the Enterprise
PPT
2_Capability.ppt
PDF
Krellenstein lucene revolution_2011_keynote_once_future_history_enterprise se...
PDF
US20110191333
ECIR-2014: Multilanguage Content Discovery Through Entity Driven Search
Content Discovery Through Entity Driven Search
The search engine index
Share Point2007 Best Practices Final
How search engines work Anand Saini
II-SDV 2012 Patent Prior-Art Searching with Latent Semantic Analysis
Sql Saturday 111 Atlanta applied enterprise semantic mining
Search overview
Eureka, I found it! - Special Libraries Association 2021 Presentation
Crowdsourced query augmentation through the semantic discovery of domain spec...
Discovery: Beyond Initial Implementation & Participation - and into Collabora...
Intro to elasticsearch
The Enterprise Search Market in a Nutshell
Mastering Elasticsearch 2nd Edition Edition Rafal Kuc
Info 2402 irt-chapter_2
Indexing Techniques: Their Usage in Search Engines for Information Retrieval
Building Search Systems for the Enterprise
2_Capability.ppt
Krellenstein lucene revolution_2011_keynote_once_future_history_enterprise se...
US20110191333
Ad

More from Ranjan Baisak (6)

PDF
Proactive Vulnerability Detection in Source Code Using Graph Neural Networks:...
PPTX
Cloud Native Migration Steps
PPTX
PR agency - a personalized marketing analysis platform
PPTX
CabXury - a social cab sharing service
PPTX
Micro Services Architecture
PPTX
Docker : Container Virtualization
Proactive Vulnerability Detection in Source Code Using Graph Neural Networks:...
Cloud Native Migration Steps
PR agency - a personalized marketing analysis platform
CabXury - a social cab sharing service
Micro Services Architecture
Docker : Container Virtualization
Ad

Recently uploaded (20)

PPTX
IBA_Chapter_11_Slides_Final_Accessible.pptx
PPT
Quality review (1)_presentation of this 21
PPTX
IB Computer Science - Internal Assessment.pptx
PPTX
Qualitative Qantitative and Mixed Methods.pptx
PDF
annual-report-2024-2025 original latest.
PPTX
Introduction to Firewall Analytics - Interfirewall and Transfirewall.pptx
PPTX
Microsoft-Fabric-Unifying-Analytics-for-the-Modern-Enterprise Solution.pptx
PDF
Fluorescence-microscope_Botany_detailed content
PDF
[EN] Industrial Machine Downtime Prediction
PPTX
Business Ppt On Nestle.pptx huunnnhhgfvu
PPTX
Computer network topology notes for revision
PPTX
iec ppt-1 pptx icmr ppt on rehabilitation.pptx
PDF
Recruitment and Placement PPT.pdfbjfibjdfbjfobj
PPTX
The THESIS FINAL-DEFENSE-PRESENTATION.pptx
PPTX
Data_Analytics_and_PowerBI_Presentation.pptx
PDF
Business Analytics and business intelligence.pdf
PPTX
AI Strategy room jwfjksfksfjsjsjsjsjfsjfsj
PDF
Introduction to the R Programming Language
PPTX
Introduction to Basics of Ethical Hacking and Penetration Testing -Unit No. 1...
PPTX
SAP 2 completion done . PRESENTATION.pptx
IBA_Chapter_11_Slides_Final_Accessible.pptx
Quality review (1)_presentation of this 21
IB Computer Science - Internal Assessment.pptx
Qualitative Qantitative and Mixed Methods.pptx
annual-report-2024-2025 original latest.
Introduction to Firewall Analytics - Interfirewall and Transfirewall.pptx
Microsoft-Fabric-Unifying-Analytics-for-the-Modern-Enterprise Solution.pptx
Fluorescence-microscope_Botany_detailed content
[EN] Industrial Machine Downtime Prediction
Business Ppt On Nestle.pptx huunnnhhgfvu
Computer network topology notes for revision
iec ppt-1 pptx icmr ppt on rehabilitation.pptx
Recruitment and Placement PPT.pdfbjfibjdfbjfobj
The THESIS FINAL-DEFENSE-PRESENTATION.pptx
Data_Analytics_and_PowerBI_Presentation.pptx
Business Analytics and business intelligence.pdf
AI Strategy room jwfjksfksfjsjsjsjsjfsjfsj
Introduction to the R Programming Language
Introduction to Basics of Ethical Hacking and Penetration Testing -Unit No. 1...
SAP 2 completion done . PRESENTATION.pptx

Semantic based Enterprise Search Solution in Networking Domain

  • 1. ENTERPRISE SEARCH SOLUTION Ranjan Baisak(RB) rb@ezmap.in blog.ezmap.in
  • 2. 2 Copyright © 2013 Juniper Networks, Inc. www.juniper.net SCENARIOS I want to see facts from different sources describing EX4200 I want to more about EX4200 about its chassis specifications I want to know power supply details of EX4200 switches I want to know more about Juniper’s Security solutions I want to know Juniper’s Data center offerings I want to see customer’s feedback about Junos
  • 3. 3 Copyright © 2013 Juniper Networks, Inc. www.juniper.net 1. Find Information faster  Provide search assistants 2. Reveal hidden information  Enrich the search index with background knowledge 3. Find more specific information  Query the semantic web 4. Find linked information  Integrate data sources
  • 4. 4 Copyright © 2013 Juniper Networks, Inc. www.juniper.net FIND INFORMATION FASTER To Provide a powerful auto-complete based on past searches To have a enterprise vocabulary to assist in auto-complete I can’t remember how to spell the search term
  • 5. 5 Copyright © 2013 Juniper Networks, Inc. www.juniper.net FIND INFORMATION FASTER High Performance Networking Search I can’t remember exactly what I was looking for
  • 6. 6 Copyright © 2013 Juniper Networks, Inc. www.juniper.net FIND INFORMATION FASTER WITH RELATED SEARCH TERM EX4200 Search
  • 7. 7 Copyright © 2013 Juniper Networks, Inc. www.juniper.net REVEAL HIDDEN INFORMATION WITH QUERY EXPANSION VLAN SearchOR ”VLAN Configuration" Ontology VLAN VLAN Configuration alternative Label preferred Label
  • 8. 8 Copyright © 2013 Juniper Networks, Inc. www.juniper.net FIND LINKED INFORMATION IX Content KB TechNotes J-Net Technical Bulletin EX4200
  • 9. 9 Copyright © 2013 Juniper Networks, Inc. www.juniper.net WHY ENTITIES IMPORTANT Entities are new driver. Entities are generated relational mapping that uncovers the association between different data points. Entities becomes trusted points around which other data revolves.
  • 10. 10 Copyright © 2013 Juniper Networks, Inc. www.juniper.net THE ROLE OF ONTOLOGY IN SEMANTIC SEARCH Junos 12.2 R1 Junos 12.2 JunosSoftware Sep 05 2012 ACX 1000 ACX 2000 M7i M10i MX5 MX10 T320 T640 EX2200 EX4200 ACX M Series MX T Series EX Series Routing Switching Hardware Released on Belongs to Supported on Belongs To Belongs To Belongs To
  • 11. 11 Copyright © 2013 Juniper Networks, Inc. www.juniper.net HOW TO EXTRACT ENTITIES Top level view, how entities are defined.  We have data sources from iX contents, KB, Technets, J-Net forums, Technical Bulletins etc…  Through entity detection and raw relation detection raw text extracted from web pages in unstructured data format at one end becomes a responsive, refined entity that can provide an intelligent answer in semantic search.
  • 12. 12 Copyright © 2013 Juniper Networks, Inc. www.juniper.net ONTOLOGY A CENTRAL POINT TO CONTROL Labels and Query Expansion Faceted Search Refine Search Mechanism Metadata Integration Search Services Search Application Collector Semantic Indexer Document Index Ontology Manager Extractor HTML Ontology DB
  • 13. 13 Copyright © 2013 Juniper Networks, Inc. www.juniper.net PROBLEM WITH CURRENT SEARCH Ambiguity and association in natural language  Search is based on text  Polysemy: Words often have a multitude of meanings and different types of usage (more urgent for very heterogeneous collections).  Synonymy: Different terms may have an identical or a similar meaning (weaker: words indicating the same topic).
  • 14. 14 Copyright © 2013 Juniper Networks, Inc. www.juniper.net LATENT SEMANTIC INDEXING Latent Semantic Indexing (LSI) means analyzing documents to find the underlying/latent meaning/semantics or concepts of those documents. The fundamental difficulty in finding relevant documents from search words is that what we really want is to compare the meanings or concepts behind the words. LSA attempts to solve this problem by mapping both words and documents into a "concept" space and doing the comparison in this space. LSI overcomes two of the most problematic constraints of Boolean keyword queries:  multiple words that have similar meanings (synonymy)  words that have more than one meaning (polysemy). Text does not need to be in sentence form for LSI to be effective. It can work with lists, free-form notes, email, web content, etc. LSI is also used to perform automated document categorization and clustering. In fact, several experiments have demonstrated that there are a number of correlations between the way LSI and humans process and categorize text.