SlideShare a Scribd company logo
Enterprise Knowledge Graph
(EKG)
Mining an Enterprise’s Systems of Engagements

                 Sumeet Vij
              Senior Associate
             Booz Allen Hamilton
Can you make your decisions on
 just 20% of your data?
◦ According to IDC Research, less than 20% percent
  of an enterprise’s information is in the form of
  structured data which can reside neatly in traditional
  columnar relational databases
◦ 80% of information is unstructured and semi-
  structured in the form of documents, web-
  pages, emails, images and videos which are growing
  at a tremendous rate
◦ Current Enterprise Systems of Record
  (ERPs, CRMs) capture a miniscule amount of
  information generated within an Enterprise
◦ However the Systems of Record remain the main
  focus of the IT team and the main source of
  information for the enterprise leadership
Unstructured Data creates
 Enterprise Information
 Management Challenges
• Information is scattered and inaccessible
    • Spread across documents, spreadsheet, emails
• Data is stored in multiple, often incompatible
  formats
• Data sources are not linked
    • No documented relationships between pieces of
      information
    • No easy way to harness data from external sources
      including social networks
•   Information is hard to understand
    • Different terminology and vocabularies
How do employees create and
share information? Through
Systems of Engagement
   These systems are the primary way
    employees in an Enterprise
    communicate and share information,
    namely
    ◦ Email
    ◦ IM (Lync)
    ◦ Social collaboration tools like Yammer, Tibbr,
      Jive
   Not surprisingly, these systems
    generate unstructured data at high
Systems of Engagement
 Loosely structured knowledge flows
 Conversational
 Dynamic and in flux
How does the industry extract information
from unstructured text? Google
Knowledge Graph
The Google Knowledge Graph provides “Things not just
Strings”, that is, it enhances its search results with semantic
information gathered from multiple sources. It provides
structured information about entities and links to other related
entities. Its goal is to help people
 • Find the right thing: Find the right entity, understand the
   difference between Taj Mahal the monument and Taj Mahal
   the musician
•   Get the best summary: Summarize relevant content
    related to the entity, key facts and other related entities
•   Go deep and broader: Help make unexpected discoveries
    and relationships
How does an Enterprise extract
information from these Systems of
Engagement? Enter the Enterprise
Knowledge Graph (EKG)
Along the lines of the Google Knowledge Graph, the EKG aims to help
enterprises extract and explore information created by systems of
engagement. Core EKG concepts are:

•   Knowledge Capture: Extract key concepts and relationships from
    unstructured documents using an Enterprise Ontology. This allows
    concept based indexing of content
    •   Example: An employee submits a trip report in the form of an email. EKG automatically extracts
        the Who, What, When and Where information and links it to other relevant resources.

•   Knowledge Discovery: Search multiple data sources for information
    using a relevant Enterprise Ontology
    • Example: A proposal manager can ask, “Who has background information about
      the Army CIO/G6?”.

•   Knowledge Exploration: Expose information to a host of graphical tools
    to visualize and further analyze relationships between data
How is the EKG seeded?
 Crowd-source the creation
 The major source of information generation in an enterprise
  is email. The process to seed the EKG with email would
  be:
   ◦ The sender copies their email to a monitored EKG email
     mailbox
   ◦ The EKG parses, analyzes and adds the extracted facts
     to the Knowledge Graph
   ◦ The EKG then sends an automated email back to the
     sender, describing the facts and a link to correct the
     extracted information
 Start with a specific Ontology geared towards a high value
  use case and then build out the entities and their
  relationships
Benefits of adding email to the
EKG
◦ Bigger insights as we can leverage the
  collective interactions of all the employees
  (not just the respondent) and the
  subsequent interactions enrich the
  EKG, allowing even more questions to be
  answered
◦ Liberate employee knowledge, expertise
  and interactions from the mailbox and
  make it available for the enterprise to
  leverage.
EKG Benefits
•   Utilize all available knowledge sources
    • Allows documents, spreadsheets and emails to serve as “top
      level” information sources
•   Integration
    • Ties disconnected pieces of data together into meaningful
      wholes that provide a basis for planning and decision making
•   Meaning-Centric
    • Facts around an object or an entity can be easily explored
    • Search phrases are better “understood” as they are based
      upon concepts and not literals
•   Serendipity
    • Related searches allow the formerly “unknown” to be
      discovered
                                       SLIDE 10
How we discover information within an
                 Enterprise today
                                                                         Sumeet Vij
   Proposal Manager                  Resume                                                                      Facts
                                      System
                      Search
                                                                                 Presented at
                                                                                                    Cliff Daus
                                                   Attended
                                                                        DoD SOA &
                                                                         Semantic
                                                                        Technology        Attended                             Trip Report
                                                                        Symposium
                      Search      Opportunity
Who has                          Management
information                          System                           Follow on Meeting
about the army
                                                                                                                 Employee of
CIO/G6 ?
                                                             Demonstration
                                                                                          Attended
                                                                                                                           Social Network
                                                                 at
                                                               CIO/G6
                       Search                                                                   CIO/G6
                                                                        Customer
                                                Topic
                                        CRM

                                                                           Attended
                                                         Semantic
                                                        Technologie
                                                            s


                                                                                                                                  Web
                                                                             A        B         C        D


                      Systems of Record                      Systems of Engagement

                                                                                 SLIDE 11
Knowledge Discovery using EKG
Proposal Manager


                                      Knowledge Discovery      Knowledge Capture                          Web Submission

 Who has information about the Army CIO/G6?




                                                                           Entity Extraction
            Parse                                                                                                                Trip Reports
                                                                                                                                 Meeting Minutes
                                                                                                          Email Submission       Etc.
            Determine Sources for Information


            Query



              Resume                                 Knowledgebase
               System


          Opportunity
         Management                                                                                                          Submit
             System

                   CRM                                                                                           Update


                                                                                                                                       Sumeet Vij

                                                                                               SLIDE 12
Conceptual EKG Architecture
•   An open architecture composed of re-
    useable open source components
                                                                                  User Interface Layer


                                                    Document               Knowledge
                            Query UI
                                                     Upload                 Browser


                                                                            Semantic Processing Layer

                                                                        Data Source
                        Entity Extraction     Concept Catalog
                                                                          Catalog


        Integration Layer                                                       Persistence Layer


                  E-Mail                Database         Web Services                  NoSQL
                 Connector              Connector           Client                      Store


                                                                SLIDE 13
Questions?

More Related Content

PDF
What is an information professional?
PDF
In the social, mobile and cloud era, what does it take to be an Information P...
PDF
Visual Analytics: Revealing Corruption, Fraud, Waste, and Abuse
PDF
BI Forum 2009 - BI Mega Trends
PPTX
Big data and the challenge of extreme information
PDF
Collaboration & Social Media New Challenges For Records Management
PDF
Simplified Business Event Processing
PDF
EDF2012 Wolfgang Nimfuehr - Bringing Big Data to the Enterprise
What is an information professional?
In the social, mobile and cloud era, what does it take to be an Information P...
Visual Analytics: Revealing Corruption, Fraud, Waste, and Abuse
BI Forum 2009 - BI Mega Trends
Big data and the challenge of extreme information
Collaboration & Social Media New Challenges For Records Management
Simplified Business Event Processing
EDF2012 Wolfgang Nimfuehr - Bringing Big Data to the Enterprise

Viewers also liked (20)

ODP
Freebase and the semantic web
PDF
Bekas for cognitive_speaker_series
PDF
PDF
TAO: Facebook's Distributed Data Store for the Social Graph
PPT
Overview of an Efficient Knowledge Management Model
PPTX
#Espc15: Build a knowledge social network with o365, yammer and office graph
PDF
Applications and Analytics players and positioning
PDF
Enterprise Knowledge Graph
PDF
Enterprise Applications, Analytics and Knowledge Products Positionings in Isr...
PPTX
Enterprise knowledge graphs
PDF
The Algorithm of Magical Customer Experiences
PPTX
The Semantic Knowledge Graph
PDF
Relational to Big Graph
PDF
Enterprise Knowledge - Taxonomy Design Best Practices and Methodology
PDF
Enterprise Knowledge Graph
PDF
Knowledge Graphs - Journey to the Connected Enterprise - Data Strategy and An...
PDF
Knowledge Graphs for a Connected World - AI, Deep & Machine Learning Meetup
PDF
Venture Scanner Artificial Intelligence 2016 Q4
PPTX
Knowledge Graphs at Elsevier
PDF
LinkedIn Graph Presentation
Freebase and the semantic web
Bekas for cognitive_speaker_series
TAO: Facebook's Distributed Data Store for the Social Graph
Overview of an Efficient Knowledge Management Model
#Espc15: Build a knowledge social network with o365, yammer and office graph
Applications and Analytics players and positioning
Enterprise Knowledge Graph
Enterprise Applications, Analytics and Knowledge Products Positionings in Isr...
Enterprise knowledge graphs
The Algorithm of Magical Customer Experiences
The Semantic Knowledge Graph
Relational to Big Graph
Enterprise Knowledge - Taxonomy Design Best Practices and Methodology
Enterprise Knowledge Graph
Knowledge Graphs - Journey to the Connected Enterprise - Data Strategy and An...
Knowledge Graphs for a Connected World - AI, Deep & Machine Learning Meetup
Venture Scanner Artificial Intelligence 2016 Q4
Knowledge Graphs at Elsevier
LinkedIn Graph Presentation
Ad

Similar to Sumeet vij enterprise_knowledge_graph (20)

PPTX
Law Firm Knowledge Management, An Introduction
PPTX
NGC records management - SP2010 RM Features
PDF
Improving Findability: The Role of Information Architecture in Effective Search
PDF
Aligning people process and technology in km kwt presentation
PDF
Doculabs E Discovery 051710
PDF
Improving Findability Inside the Firewall
PDF
Booz Allen Hamilton - KM presentation to UNDP
PPT
Conducting a Knowledge - Business workshop
PPTX
Aligning people process and technology in km arma metro ny presentation
PPTX
Architecting the Building Blocks of Enterprise Social Networking
PPT
Albert Simard - Mobilizing Knowledge: Acquisition, Analysis, and Action
PPT
Knowledge mobilization
PDF
Law firm knowledge management, an introduction: LawTech Camp 2012
PPTX
Km2003cope
PPT
LIS3353 SP12 Week 10
PPT
LIS3353 SP12 Week 10
PPTX
20111031 KMWorld 2011 Applying the Social Business Roadmap to Your Organization
PPT
Metadata in general and Dublin Core in specific; some experiences
PPTX
Design Considerations For Enterprise Social Networks: Identity, Graphs, Strea...
PDF
Enterprise Content Management and the Librarian
Law Firm Knowledge Management, An Introduction
NGC records management - SP2010 RM Features
Improving Findability: The Role of Information Architecture in Effective Search
Aligning people process and technology in km kwt presentation
Doculabs E Discovery 051710
Improving Findability Inside the Firewall
Booz Allen Hamilton - KM presentation to UNDP
Conducting a Knowledge - Business workshop
Aligning people process and technology in km arma metro ny presentation
Architecting the Building Blocks of Enterprise Social Networking
Albert Simard - Mobilizing Knowledge: Acquisition, Analysis, and Action
Knowledge mobilization
Law firm knowledge management, an introduction: LawTech Camp 2012
Km2003cope
LIS3353 SP12 Week 10
LIS3353 SP12 Week 10
20111031 KMWorld 2011 Applying the Social Business Roadmap to Your Organization
Metadata in general and Dublin Core in specific; some experiences
Design Considerations For Enterprise Social Networks: Identity, Graphs, Strea...
Enterprise Content Management and the Librarian
Ad

More from Open Analytics (20)

PDF
Cyber after Snowden (OA Cyber Summit)
PPTX
Utilizing cyber intelligence to combat cyber adversaries (OA Cyber Summit)
PPT
CDM….Where do you start? (OA Cyber Summit)
PPTX
An Immigrant’s view of Cyberspace (OA Cyber Summit)
PPTX
MOLOCH: Search for Full Packet Capture (OA Cyber Summit)
PPTX
Observations on CFR.org Website Traffic Surge Due to Chechnya Terrorism Scare...
PPTX
Using Real-Time Data to Drive Optimization & Personalization
PPTX
M&A Trends in Telco Analytics
PPTX
Competing in the Digital Economy
PPTX
Piwik: An Analytics Alternative (Chicago Summit)
PDF
Social Media, Cloud Computing, Machine Learning, Open Source, and Big Data An...
PDF
Crossing the Chasm (Ikanow - Chicago Summit)
PPTX
On the “Moneyball” – Building the Team, Product, and Service to Rival (Pegged...
PDF
Data evolutions in media, marketing, and retail (Business Adv Group - Chicago...
PDF
Characterizing Risk in your Supply Chain (nContext - Chicago Summit)
PDF
From Insight to Impact (Chicago Summit - Keynote)
PPT
Easybib Open Analytics NYC
PPTX
MarkLogic - Open Analytics Meetup
PPTX
The caprate presentation_july2013_open analytics dc meetup
PPTX
Verifeed open analytics_3min deck_071713_final
Cyber after Snowden (OA Cyber Summit)
Utilizing cyber intelligence to combat cyber adversaries (OA Cyber Summit)
CDM….Where do you start? (OA Cyber Summit)
An Immigrant’s view of Cyberspace (OA Cyber Summit)
MOLOCH: Search for Full Packet Capture (OA Cyber Summit)
Observations on CFR.org Website Traffic Surge Due to Chechnya Terrorism Scare...
Using Real-Time Data to Drive Optimization & Personalization
M&A Trends in Telco Analytics
Competing in the Digital Economy
Piwik: An Analytics Alternative (Chicago Summit)
Social Media, Cloud Computing, Machine Learning, Open Source, and Big Data An...
Crossing the Chasm (Ikanow - Chicago Summit)
On the “Moneyball” – Building the Team, Product, and Service to Rival (Pegged...
Data evolutions in media, marketing, and retail (Business Adv Group - Chicago...
Characterizing Risk in your Supply Chain (nContext - Chicago Summit)
From Insight to Impact (Chicago Summit - Keynote)
Easybib Open Analytics NYC
MarkLogic - Open Analytics Meetup
The caprate presentation_july2013_open analytics dc meetup
Verifeed open analytics_3min deck_071713_final

Recently uploaded (20)

PDF
Chapter 3 Spatial Domain Image Processing.pdf
PDF
Spectral efficient network and resource selection model in 5G networks
PDF
Shreyas Phanse Resume: Experienced Backend Engineer | Java • Spring Boot • Ka...
PDF
GamePlan Trading System Review: Professional Trader's Honest Take
PDF
Peak of Data & AI Encore- AI for Metadata and Smarter Workflows
PDF
Machine learning based COVID-19 study performance prediction
PDF
Review of recent advances in non-invasive hemoglobin estimation
PDF
Per capita expenditure prediction using model stacking based on satellite ima...
PPTX
breach-and-attack-simulation-cybersecurity-india-chennai-defenderrabbit-2025....
PDF
Advanced Soft Computing BINUS July 2025.pdf
PDF
TokAI - TikTok AI Agent : The First AI Application That Analyzes 10,000+ Vira...
PPTX
Cloud computing and distributed systems.
PDF
Dropbox Q2 2025 Financial Results & Investor Presentation
PPT
“AI and Expert System Decision Support & Business Intelligence Systems”
PDF
Network Security Unit 5.pdf for BCA BBA.
PPTX
PA Analog/Digital System: The Backbone of Modern Surveillance and Communication
PDF
Modernizing your data center with Dell and AMD
PPTX
VMware vSphere Foundation How to Sell Presentation-Ver1.4-2-14-2024.pptx
PDF
Reach Out and Touch Someone: Haptics and Empathic Computing
PDF
Electronic commerce courselecture one. Pdf
Chapter 3 Spatial Domain Image Processing.pdf
Spectral efficient network and resource selection model in 5G networks
Shreyas Phanse Resume: Experienced Backend Engineer | Java • Spring Boot • Ka...
GamePlan Trading System Review: Professional Trader's Honest Take
Peak of Data & AI Encore- AI for Metadata and Smarter Workflows
Machine learning based COVID-19 study performance prediction
Review of recent advances in non-invasive hemoglobin estimation
Per capita expenditure prediction using model stacking based on satellite ima...
breach-and-attack-simulation-cybersecurity-india-chennai-defenderrabbit-2025....
Advanced Soft Computing BINUS July 2025.pdf
TokAI - TikTok AI Agent : The First AI Application That Analyzes 10,000+ Vira...
Cloud computing and distributed systems.
Dropbox Q2 2025 Financial Results & Investor Presentation
“AI and Expert System Decision Support & Business Intelligence Systems”
Network Security Unit 5.pdf for BCA BBA.
PA Analog/Digital System: The Backbone of Modern Surveillance and Communication
Modernizing your data center with Dell and AMD
VMware vSphere Foundation How to Sell Presentation-Ver1.4-2-14-2024.pptx
Reach Out and Touch Someone: Haptics and Empathic Computing
Electronic commerce courselecture one. Pdf

Sumeet vij enterprise_knowledge_graph

  • 1. Enterprise Knowledge Graph (EKG) Mining an Enterprise’s Systems of Engagements Sumeet Vij Senior Associate Booz Allen Hamilton
  • 2. Can you make your decisions on just 20% of your data? ◦ According to IDC Research, less than 20% percent of an enterprise’s information is in the form of structured data which can reside neatly in traditional columnar relational databases ◦ 80% of information is unstructured and semi- structured in the form of documents, web- pages, emails, images and videos which are growing at a tremendous rate ◦ Current Enterprise Systems of Record (ERPs, CRMs) capture a miniscule amount of information generated within an Enterprise ◦ However the Systems of Record remain the main focus of the IT team and the main source of information for the enterprise leadership
  • 3. Unstructured Data creates Enterprise Information Management Challenges • Information is scattered and inaccessible • Spread across documents, spreadsheet, emails • Data is stored in multiple, often incompatible formats • Data sources are not linked • No documented relationships between pieces of information • No easy way to harness data from external sources including social networks • Information is hard to understand • Different terminology and vocabularies
  • 4. How do employees create and share information? Through Systems of Engagement  These systems are the primary way employees in an Enterprise communicate and share information, namely ◦ Email ◦ IM (Lync) ◦ Social collaboration tools like Yammer, Tibbr, Jive  Not surprisingly, these systems generate unstructured data at high
  • 5. Systems of Engagement  Loosely structured knowledge flows  Conversational  Dynamic and in flux
  • 6. How does the industry extract information from unstructured text? Google Knowledge Graph The Google Knowledge Graph provides “Things not just Strings”, that is, it enhances its search results with semantic information gathered from multiple sources. It provides structured information about entities and links to other related entities. Its goal is to help people • Find the right thing: Find the right entity, understand the difference between Taj Mahal the monument and Taj Mahal the musician • Get the best summary: Summarize relevant content related to the entity, key facts and other related entities • Go deep and broader: Help make unexpected discoveries and relationships
  • 7. How does an Enterprise extract information from these Systems of Engagement? Enter the Enterprise Knowledge Graph (EKG) Along the lines of the Google Knowledge Graph, the EKG aims to help enterprises extract and explore information created by systems of engagement. Core EKG concepts are: • Knowledge Capture: Extract key concepts and relationships from unstructured documents using an Enterprise Ontology. This allows concept based indexing of content • Example: An employee submits a trip report in the form of an email. EKG automatically extracts the Who, What, When and Where information and links it to other relevant resources. • Knowledge Discovery: Search multiple data sources for information using a relevant Enterprise Ontology • Example: A proposal manager can ask, “Who has background information about the Army CIO/G6?”. • Knowledge Exploration: Expose information to a host of graphical tools to visualize and further analyze relationships between data
  • 8. How is the EKG seeded?  Crowd-source the creation  The major source of information generation in an enterprise is email. The process to seed the EKG with email would be: ◦ The sender copies their email to a monitored EKG email mailbox ◦ The EKG parses, analyzes and adds the extracted facts to the Knowledge Graph ◦ The EKG then sends an automated email back to the sender, describing the facts and a link to correct the extracted information  Start with a specific Ontology geared towards a high value use case and then build out the entities and their relationships
  • 9. Benefits of adding email to the EKG ◦ Bigger insights as we can leverage the collective interactions of all the employees (not just the respondent) and the subsequent interactions enrich the EKG, allowing even more questions to be answered ◦ Liberate employee knowledge, expertise and interactions from the mailbox and make it available for the enterprise to leverage.
  • 10. EKG Benefits • Utilize all available knowledge sources • Allows documents, spreadsheets and emails to serve as “top level” information sources • Integration • Ties disconnected pieces of data together into meaningful wholes that provide a basis for planning and decision making • Meaning-Centric • Facts around an object or an entity can be easily explored • Search phrases are better “understood” as they are based upon concepts and not literals • Serendipity • Related searches allow the formerly “unknown” to be discovered SLIDE 10
  • 11. How we discover information within an Enterprise today Sumeet Vij Proposal Manager Resume Facts System Search Presented at Cliff Daus Attended DoD SOA & Semantic Technology Attended Trip Report Symposium Search Opportunity Who has Management information System Follow on Meeting about the army Employee of CIO/G6 ? Demonstration Attended Social Network at CIO/G6 Search CIO/G6 Customer Topic CRM Attended Semantic Technologie s Web A B C D Systems of Record Systems of Engagement SLIDE 11
  • 12. Knowledge Discovery using EKG Proposal Manager Knowledge Discovery Knowledge Capture Web Submission Who has information about the Army CIO/G6? Entity Extraction Parse Trip Reports Meeting Minutes Email Submission Etc. Determine Sources for Information Query Resume Knowledgebase System Opportunity Management Submit System CRM Update Sumeet Vij SLIDE 12
  • 13. Conceptual EKG Architecture • An open architecture composed of re- useable open source components User Interface Layer Document Knowledge Query UI Upload Browser Semantic Processing Layer Data Source Entity Extraction Concept Catalog Catalog Integration Layer Persistence Layer E-Mail Database Web Services NoSQL Connector Connector Client Store SLIDE 13