SlideShare a Scribd company logo
APRIL 12, 2018
Knowledge as a Service
An Introduction to the Emerging Pre-Built Knowledge Market
Adrian J Bowles, PhD
Founder, STORM Insights, Inc.
Lead Analyst, AI, Aragon Research
info@storminsights.com
AGENDA
What Problem Are We Solving?
Designing For Change
Procuring Data
Watch Out For…
Copyright (c) 2018 by STORM Insights Inc. All Rights Reserved.
WHY IS THE AVAILABILITY OF KNOWLEDGE & DATA SUCH AN ISSUE TODAY?
PERCEPTION
UNDERSTANDING
LEARNING
Big
Data
Classic
AI
Deep
Learning
Copyright (c) 2018 by STORM Insights Inc. All Rights Reserved.
Systems
Controls
Model
Data Mgmt
Human
Machine
Input Output
Gestures
Emotions
Language
Narrative Generation
Visualization
Reports
Haptics
Sensors
(IOT)
Systems
Controls
DATA IN THE MODERN AI LANDSCAPE
Learn
Reason
Understand
Emotions Meaning
Concepts Intent
Context
Copyright (c) 2018 by STORM Insights Inc. All Rights Reserved.
Systems
Controls
Model
Data Mgmt
Human
Machine
Input Output
Gestures
Emotions
Language
Narrative Generation
Visualization
Reports
Haptics
Sensors
(IOT)
Systems
Controls
DATA MANAGEMENT IN THE MODERN AI LANDSCAPE
Emotions Meaning
Concepts Intent
Context
IDENTIFYING THE RIGHT DATA SOURCES IS INCREASING IN IMPORTANCE
Copyright (c) 2018 by STORM Insights Inc. All Rights Reserved.
DATA
More Data + Faster HW make
Deep Learning Practical
Deep Learning Success With Recognition
Spurs Investment
ALGORITHMS
&
RULES
Caution for Applications Where
Transparency is Critical
Investment Leads to Investigation
Broaden the Scope of Applications
New “Explainability” Research Emerges
Hybrid Solutions to Augment Intelligence
Will Thrive for Critical Applications
DATA REQUIREMENTS VARY WITH DOMAIN/TASK REQUIREMENTS
Domain
Task
General
General
Intelligent
Apps
Healthcare
Customer Service
Reorder Rx
Speak to a
Pharmacist
Pharma
Artificial
General
Intelligence
Chatbot
THE SEMANTIC WEB:
ALL DATA SHOULD BE ASSOCIATED WITH SEMANTIC ATTRIBUTES (MEANING)
Copyright (c) 2018 by STORM Insights Inc. All Rights Reserved.
RDF - Resource Description Framework - A directed, labeled graph.
DFS - RDF Specifications Suite Recommendations (Language for representing RDF vocabularies)
SPARQL - A Semantic Protocol & Query Language for RDF Data
OWL - The Web Ontology Language is a Semantic We
language designed to represent knowledge about things
and relationships between things on the Web.
An OWL Document is an Ontology.
https://www.w3.org/2013/data/
BASICS OF THE W3C SEMANTIC WEB ONTOLOGY STACK
DEEP STRUCTURE REQUIRES STRONGER METHODS FOR ANALYSIS TO FIND CONCEPTS
Copyright (c) 2018 by STORM Insights Inc. All Rights Reserved.
Perception: obvious
structure is easy to
process…
but most of the
interesting stuff isn’t
obvious to a
computer.
START WITH A TAXONOMY
Copyright (c) 2018 by STORM Insights Inc. All Rights Reserved.
A taxonomy represents the formal structure of classes or types of objects within a domain.
•Generally hierarchical and provide names for each class in the domain.
•May also capture the membership properties of each object in relation to the other objects.
•The rules of a specific taxonomy are used to classify or categorize any object in the domain, so they
must be complete, consistent, and unambiguous. This rigor in specification should ensure that any
newly discovered object must fit into one, and only one, category or object class.
ONTOLOGIES
Copyright (c) 2018 by STORM Insights Inc. All Rights Reserved.
An ontology formalizes and specifies the names, definitions,
and attributes of entities within a domain.
An accepted ontology may define the domain.
ONTOLOGIES EVOLVE - SYSTEMS MUST BE FLEXIBLE
Copyright (c) 2018 by STORM Insights Inc. All Rights Reserved.
TRUTH VS BELIEF - DESIGN ACCORDINGLY
DATA SOURCE INTEGRATION IS A DESIGN CONSIDERATION
Copyright (c) 2018 by STORM Insights Inc. All Rights Reserved.
DataSources
Integrate
CRM
ERP
Enterprise Apps
Streaming
Historical/
Static/Batch
Required
Optional
IoT Sensors
Social Media Streams
Log Data
Other Streams
Deliv
er
Visualize
Analyze
DETERMINE YOUR NEED OR IDENTIFY RESOURCES FIRST?
Domain
Rate of Change
General
Streaming
Specific
Static
Natural
Language
Traffic
Stock Prices
Weather
APA
Diagnostics
Disaster/Battlefield
Monitoring
Twitterverse
USE PRE-BUILT KNOWLEDGE RESOURCES, SAVE TIME (30 YEARS?)
Copyright (c) 2018 by STORM Insights Inc. All Rights Reserved.
OPENCYC
Copyright (c) 2018 by STORM Insights Inc. All Rights Reserved.
PREBUILT CONTENT FOR FASTER DEPLOYMENT
Copyright (c) 2018 by STORM Insights Inc. All Rights Reserved.
Watson Conversation
and
Virtual Agent
Source: IBM
PROCURING DATA
Commercial Providers
Open Source
Customers
Public/Government Open Data
YAGO - YET ANOTHER GREAT ONTOLOGY
Semantic knowledge base derived from Wikipedia, WordNet, and GeoNames
s://www.mpi-inf.mpg.de/departments/databases-and-information-systems/research/yago-naga/yag
Joint project of the Man Planck Institute for Informatics
and the
Telecom Paris Tech University
> 10M entities > 120M facts
Facts & Entities may have Temporal and Spacial Dimensions
Open Source: Available on Github (8/31/2017)
Graph Browser
Extracting “structured data” from Wikipedia.
The DBpedia data set describes > 4.58 million entities
>4 million are classified in a consistent ontology, including
1,445,000 persons, 735,000 places, 123,000 music albums, 87,000 films, 19,000 video games,
241,000 organizations, 251,000 species and 6,000 diseases.
~50 million links to other RDF datasets, 80.9 million links to Wikipedia categories, and 41.2 million
YAGO2 categories.
DBpedia uses the Resource Description Framework (RDF) to represent extracted information and
consists of 3 billion RDF triples, of which 580 million were extracted from the English edition of
Wikipedia and 2.46 billion from other language editions.
Derived from "DBpedia." Wikipedia, The Free Encyclopedia. Wikipedia, The Free Encyclopedia, 17 Nov. 2017. Web. 12 Apr.
2018.
NEED TO ASSOCIATE/RECOGNIZE/UNDERSTAND TO ORGANIZE/REPRESENT
Copyright (c) 2018 by STORM Insights Inc. All Rights Reserved.
Wordnet(R) Princeton
University "About WordNet."
Princeton University. 2010.
<http://wordnet.princeton.edu>
Source: https://aws.amazon.com/public-datasets/
REPRESENTATIVE DATA SETS
Source: https://cloud.google.com/public-datasets/
REPRESENTATIVE DATA SETS
Source: https://docs.microsoft.com/en-us/azure/sql-database/sql-database-public-data-sets
REPRESENTATIVE DATA SETS
CUSTOMERS CAN BE RICH DATA SOURCES
Copyright (c) 2018 by STORM Insights Inc. All Rights Reserved.
Type Typical Current Use Potential Use
Accelerometer/motion
Rotate screen, Switch screen to
landscape/portrait
Ambient Light Adjust screen brightness
Barometer Measure altitude
Geo-Location (wifi/cellular) Location/Alerts
3-Axis Gyroscope Rotation rate for games, VR…
Proximity
Turn off screen when phone is by
your head
Touch ID fingerprint, Facial
Recognition
Security
CUSTOMERS CAN BE RICH DATA SOURCES
Copyright (c) 2012-6 by Dark Sky Company LLC. All Rights Reserved.
OPEN DATA
Copyright (c) 2018 by STORM Insights Inc. All Rights Reserved.
PUBLIC USE OF OPT-IN DATA
https://www.boston.gov/departments/new-urban-mechanics/street-bump
Smarter Cities
Collaborative Intelligence
The Borg Lives!
DISTRIBUTED DATA AND INTELLIGENCE
Copyright (c) 2018 by STORM Insights Inc. All Rights Reserved.
Intelligence can be
Local to the device
Distributed
Aggregated
OPT-IN FREE DATA
Copyright (c) 2018 by STORM Insights Inc. All Rights Reserved.
LOCATION AND PROXIMITY DATA
Copyright (c) by Qualcomm. All Rights reserved.
A WORD OF CAUTION
Copyright (c) 2018 by STORM Insights Inc. All Rights Reserved.
Sometimes the wisdom of crowds
leads to
Unintended Consequences
BE CAREFUL USING COMMERCIALLY ACQUIRED DATA
Copyright (c) 2018 by STORM Insights Inc. All Rights Reserved.
According to MaxMind…
This farm is home to
600,000,000 IP Addresses
Watch Out for Unintended Consequences,
Especially With Big Data
“THAT’S A BIG HOUSE” - MEANING MAY BE DIFFERENT FOR DIFFERENT SPEAKERS
Copyright (c) 2018 by STORM Insights Inc. All Rights Reserved.
Bob Mary
Al (Capone)
Wikipedia contributors. "Alcatraz Federal Penitentiary." Wikipedia, The Free Encyclopedia. Wikipedia, The Free Encyclopedia, 30 Oct. 2017. Web. 8 Nov. 2017.
DRAW A QUARTER, TO SCALE - RESULTS DIFFER ACCORDING TO HIDDEN CONTEXT
Copyright (c) 2018 by STORM Insights Inc. All Rights Reserved.
IT’S (ALMOST) ALL OUT THERE
Domain
Rate of Change
General
Streaming
Specific
Static
Wikipedia
OpenCyc
Public
OpenData
NOAA
Data
FIND NEW USES FOR EXISTING DATA SOURCES
Copyright (c) 2018 by STORM Insights Inc. All Rights Reserved.
Copyright (c) 2014 by Umbrellium Ltd.
No Shortage of Data
How will you
create value?
adrian@storminsights.com
Twitter @ajbowles
Skype ajbowles
KEEP IN TOUCH
Upcoming SmartData Webinar Dates & Topics
May 10 Case Studies: Transforming Industries with AI
(Manufacturing & Retail)
June 14 Natural Language Processing:
From Chatbots to Artificial Understanding with Affective I/O
COMING SOON…
AGEOFREASONING.COM
BOOK, VIDEOS, PROFESSIONAL SERVICES
WWW.AGEOFREASONING.COM

More Related Content

PDF
Smart Data Webinar: Transforming Industries with Artificial Intelligence (AI)...
PDF
How to Consume Your Data for AI
PDF
Graph Database
PDF
Smart Data Slides: Leverage the IOT to Build a Smart Data Ecosystem
PDF
Data Catalog as the Platform for Data Intelligence
PDF
Smart Data Webinar: Choosing the Right Data Management Architecture for Cogni...
PPTX
Ai presentatie
PDF
ADV Slides: The World in 2045 – What Has Artificial Intelligence Created?
Smart Data Webinar: Transforming Industries with Artificial Intelligence (AI)...
How to Consume Your Data for AI
Graph Database
Smart Data Slides: Leverage the IOT to Build a Smart Data Ecosystem
Data Catalog as the Platform for Data Intelligence
Smart Data Webinar: Choosing the Right Data Management Architecture for Cogni...
Ai presentatie
ADV Slides: The World in 2045 – What Has Artificial Intelligence Created?

What's hot (20)

PPTX
Using Machine Learning & Spark to Power Data-Driven Marketing
PPTX
How Data is Driving AI Innovation
PPTX
Big data analytics
PDF
Building a New Platform for Customer Analytics
PDF
Getting down to business on Big Data analytics
PPTX
Data Strategy in 2016
PDF
Tamr | Making enterprise elephants dance @ boston data festival
PDF
Maximize the Value of Your Data: Neo4j Graph Data Platform
PPTX
What are the 6 elements of a project
PDF
Applications of AI in Supply Chain Management: Hype versus Reality
PPTX
Importance of Big data for your Business
PPTX
Digital Transformation: How to Build an Analytics-Driven Culture
PDF
Big Data & Analytics perspectives in Banking
PPTX
Big Data & Business Analytics: Understanding the Marketspace
PPTX
BigData in Banking
PPTX
Big data analytics in banking sector
PPTX
Tamr Gartner BI and Analytics Summit
PDF
Fraud Detection with Graphs at the Danish Business Authority
PDF
Top Data Analytics Trends for 2019
PPTX
SKOS as the focal point of linked data strategies
Using Machine Learning & Spark to Power Data-Driven Marketing
How Data is Driving AI Innovation
Big data analytics
Building a New Platform for Customer Analytics
Getting down to business on Big Data analytics
Data Strategy in 2016
Tamr | Making enterprise elephants dance @ boston data festival
Maximize the Value of Your Data: Neo4j Graph Data Platform
What are the 6 elements of a project
Applications of AI in Supply Chain Management: Hype versus Reality
Importance of Big data for your Business
Digital Transformation: How to Build an Analytics-Driven Culture
Big Data & Analytics perspectives in Banking
Big Data & Business Analytics: Understanding the Marketspace
BigData in Banking
Big data analytics in banking sector
Tamr Gartner BI and Analytics Summit
Fraud Detection with Graphs at the Danish Business Authority
Top Data Analytics Trends for 2019
SKOS as the focal point of linked data strategies
Ad

Similar to Smart Data Webinar: Knowledge as a Service (20)

PDF
Smart Data Webinar: Organizing Data and Knowledge - The Role of Taxonomies an...
PPT
Spivack Blogtalk 2008
PPTX
BrightTALK - Semantic AI
PPT
Applications of Semantic Technology in the Real World Today
PPT
SEMANTIC CONTENT MANAGEMENT FOR ENTERPRISES AND NATIONAL SECURITY
PPT
Introduction to Semantic Web for GIS Practitioners
PDF
Session 0.0 poster minutes madness
PDF
PPTX
The Information Workbench -
PDF
CAEPIA 2011
PDF
The Future of Semantics on the Web
PDF
How Semantics Solves Big Data Challenges
PPTX
Big Data Content Organization, Discovery, and Management
PDF
Smart Data - The Foundation for Better Business Outcomes
PDF
AI at the Edge
PDF
Quality, quantity, web and semantics
PDF
Quality, Quantity, Web and Semantics
PDF
Course 3 : Types of data and opportunities by Nikolaos Deligiannis
ODT
Riding The Semantic Wave
PDF
API's, Freebase, and the Collaborative Semantic web
Smart Data Webinar: Organizing Data and Knowledge - The Role of Taxonomies an...
Spivack Blogtalk 2008
BrightTALK - Semantic AI
Applications of Semantic Technology in the Real World Today
SEMANTIC CONTENT MANAGEMENT FOR ENTERPRISES AND NATIONAL SECURITY
Introduction to Semantic Web for GIS Practitioners
Session 0.0 poster minutes madness
The Information Workbench -
CAEPIA 2011
The Future of Semantics on the Web
How Semantics Solves Big Data Challenges
Big Data Content Organization, Discovery, and Management
Smart Data - The Foundation for Better Business Outcomes
AI at the Edge
Quality, quantity, web and semantics
Quality, Quantity, Web and Semantics
Course 3 : Types of data and opportunities by Nikolaos Deligiannis
Riding The Semantic Wave
API's, Freebase, and the Collaborative Semantic web
Ad

More from DATAVERSITY (20)

PDF
Architecture, Products, and Total Cost of Ownership of the Leading Machine Le...
PDF
Data at the Speed of Business with Data Mastering and Governance
PDF
Exploring Levels of Data Literacy
PDF
Building a Data Strategy – Practical Steps for Aligning with Business Goals
PDF
Make Data Work for You
PDF
Data Catalogs Are the Answer – What is the Question?
PDF
Data Catalogs Are the Answer – What Is the Question?
PDF
Data Modeling Fundamentals
PDF
Showing ROI for Your Analytic Project
PDF
How a Semantic Layer Makes Data Mesh Work at Scale
PDF
Is Enterprise Data Literacy Possible?
PDF
The Data Trifecta – Privacy, Security & Governance Race from Reactivity to Re...
PDF
Emerging Trends in Data Architecture – What’s the Next Big Thing?
PDF
Data Governance Trends - A Look Backwards and Forwards
PDF
Data Governance Trends and Best Practices To Implement Today
PDF
2023 Trends in Enterprise Analytics
PDF
Data Strategy Best Practices
PDF
Who Should Own Data Governance – IT or Business?
PDF
Data Management Best Practices
PDF
MLOps – Applying DevOps to Competitive Advantage
Architecture, Products, and Total Cost of Ownership of the Leading Machine Le...
Data at the Speed of Business with Data Mastering and Governance
Exploring Levels of Data Literacy
Building a Data Strategy – Practical Steps for Aligning with Business Goals
Make Data Work for You
Data Catalogs Are the Answer – What is the Question?
Data Catalogs Are the Answer – What Is the Question?
Data Modeling Fundamentals
Showing ROI for Your Analytic Project
How a Semantic Layer Makes Data Mesh Work at Scale
Is Enterprise Data Literacy Possible?
The Data Trifecta – Privacy, Security & Governance Race from Reactivity to Re...
Emerging Trends in Data Architecture – What’s the Next Big Thing?
Data Governance Trends - A Look Backwards and Forwards
Data Governance Trends and Best Practices To Implement Today
2023 Trends in Enterprise Analytics
Data Strategy Best Practices
Who Should Own Data Governance – IT or Business?
Data Management Best Practices
MLOps – Applying DevOps to Competitive Advantage

Recently uploaded (20)

PPTX
Big Data Technologies - Introduction.pptx
PDF
Architecting across the Boundaries of two Complex Domains - Healthcare & Tech...
PDF
Agricultural_Statistics_at_a_Glance_2022_0.pdf
PDF
TokAI - TikTok AI Agent : The First AI Application That Analyzes 10,000+ Vira...
PDF
How UI/UX Design Impacts User Retention in Mobile Apps.pdf
PDF
Machine learning based COVID-19 study performance prediction
PDF
Per capita expenditure prediction using model stacking based on satellite ima...
PPT
“AI and Expert System Decision Support & Business Intelligence Systems”
PDF
NewMind AI Weekly Chronicles - August'25 Week I
DOCX
The AUB Centre for AI in Media Proposal.docx
PDF
Building Integrated photovoltaic BIPV_UPV.pdf
PDF
Encapsulation theory and applications.pdf
PDF
Build a system with the filesystem maintained by OSTree @ COSCUP 2025
PDF
Review of recent advances in non-invasive hemoglobin estimation
PPTX
sap open course for s4hana steps from ECC to s4
PDF
MIND Revenue Release Quarter 2 2025 Press Release
PPTX
MYSQL Presentation for SQL database connectivity
PPTX
Spectroscopy.pptx food analysis technology
PPTX
ACSFv1EN-58255 AWS Academy Cloud Security Foundations.pptx
PPTX
Cloud computing and distributed systems.
Big Data Technologies - Introduction.pptx
Architecting across the Boundaries of two Complex Domains - Healthcare & Tech...
Agricultural_Statistics_at_a_Glance_2022_0.pdf
TokAI - TikTok AI Agent : The First AI Application That Analyzes 10,000+ Vira...
How UI/UX Design Impacts User Retention in Mobile Apps.pdf
Machine learning based COVID-19 study performance prediction
Per capita expenditure prediction using model stacking based on satellite ima...
“AI and Expert System Decision Support & Business Intelligence Systems”
NewMind AI Weekly Chronicles - August'25 Week I
The AUB Centre for AI in Media Proposal.docx
Building Integrated photovoltaic BIPV_UPV.pdf
Encapsulation theory and applications.pdf
Build a system with the filesystem maintained by OSTree @ COSCUP 2025
Review of recent advances in non-invasive hemoglobin estimation
sap open course for s4hana steps from ECC to s4
MIND Revenue Release Quarter 2 2025 Press Release
MYSQL Presentation for SQL database connectivity
Spectroscopy.pptx food analysis technology
ACSFv1EN-58255 AWS Academy Cloud Security Foundations.pptx
Cloud computing and distributed systems.

Smart Data Webinar: Knowledge as a Service

  • 1. APRIL 12, 2018 Knowledge as a Service An Introduction to the Emerging Pre-Built Knowledge Market Adrian J Bowles, PhD Founder, STORM Insights, Inc. Lead Analyst, AI, Aragon Research info@storminsights.com
  • 2. AGENDA What Problem Are We Solving? Designing For Change Procuring Data Watch Out For…
  • 3. Copyright (c) 2018 by STORM Insights Inc. All Rights Reserved. WHY IS THE AVAILABILITY OF KNOWLEDGE & DATA SUCH AN ISSUE TODAY? PERCEPTION UNDERSTANDING LEARNING Big Data Classic AI Deep Learning
  • 4. Copyright (c) 2018 by STORM Insights Inc. All Rights Reserved. Systems Controls Model Data Mgmt Human Machine Input Output Gestures Emotions Language Narrative Generation Visualization Reports Haptics Sensors (IOT) Systems Controls DATA IN THE MODERN AI LANDSCAPE Learn Reason Understand Emotions Meaning Concepts Intent Context
  • 5. Copyright (c) 2018 by STORM Insights Inc. All Rights Reserved. Systems Controls Model Data Mgmt Human Machine Input Output Gestures Emotions Language Narrative Generation Visualization Reports Haptics Sensors (IOT) Systems Controls DATA MANAGEMENT IN THE MODERN AI LANDSCAPE Emotions Meaning Concepts Intent Context
  • 6. IDENTIFYING THE RIGHT DATA SOURCES IS INCREASING IN IMPORTANCE Copyright (c) 2018 by STORM Insights Inc. All Rights Reserved. DATA More Data + Faster HW make Deep Learning Practical Deep Learning Success With Recognition Spurs Investment ALGORITHMS & RULES Caution for Applications Where Transparency is Critical Investment Leads to Investigation Broaden the Scope of Applications New “Explainability” Research Emerges Hybrid Solutions to Augment Intelligence Will Thrive for Critical Applications
  • 7. DATA REQUIREMENTS VARY WITH DOMAIN/TASK REQUIREMENTS Domain Task General General Intelligent Apps Healthcare Customer Service Reorder Rx Speak to a Pharmacist Pharma Artificial General Intelligence Chatbot
  • 8. THE SEMANTIC WEB: ALL DATA SHOULD BE ASSOCIATED WITH SEMANTIC ATTRIBUTES (MEANING) Copyright (c) 2018 by STORM Insights Inc. All Rights Reserved. RDF - Resource Description Framework - A directed, labeled graph. DFS - RDF Specifications Suite Recommendations (Language for representing RDF vocabularies) SPARQL - A Semantic Protocol & Query Language for RDF Data OWL - The Web Ontology Language is a Semantic We language designed to represent knowledge about things and relationships between things on the Web. An OWL Document is an Ontology. https://www.w3.org/2013/data/ BASICS OF THE W3C SEMANTIC WEB ONTOLOGY STACK
  • 9. DEEP STRUCTURE REQUIRES STRONGER METHODS FOR ANALYSIS TO FIND CONCEPTS Copyright (c) 2018 by STORM Insights Inc. All Rights Reserved. Perception: obvious structure is easy to process… but most of the interesting stuff isn’t obvious to a computer.
  • 10. START WITH A TAXONOMY Copyright (c) 2018 by STORM Insights Inc. All Rights Reserved. A taxonomy represents the formal structure of classes or types of objects within a domain. •Generally hierarchical and provide names for each class in the domain. •May also capture the membership properties of each object in relation to the other objects. •The rules of a specific taxonomy are used to classify or categorize any object in the domain, so they must be complete, consistent, and unambiguous. This rigor in specification should ensure that any newly discovered object must fit into one, and only one, category or object class.
  • 11. ONTOLOGIES Copyright (c) 2018 by STORM Insights Inc. All Rights Reserved. An ontology formalizes and specifies the names, definitions, and attributes of entities within a domain. An accepted ontology may define the domain.
  • 12. ONTOLOGIES EVOLVE - SYSTEMS MUST BE FLEXIBLE Copyright (c) 2018 by STORM Insights Inc. All Rights Reserved. TRUTH VS BELIEF - DESIGN ACCORDINGLY
  • 13. DATA SOURCE INTEGRATION IS A DESIGN CONSIDERATION Copyright (c) 2018 by STORM Insights Inc. All Rights Reserved. DataSources Integrate CRM ERP Enterprise Apps Streaming Historical/ Static/Batch Required Optional IoT Sensors Social Media Streams Log Data Other Streams Deliv er Visualize Analyze
  • 14. DETERMINE YOUR NEED OR IDENTIFY RESOURCES FIRST? Domain Rate of Change General Streaming Specific Static Natural Language Traffic Stock Prices Weather APA Diagnostics Disaster/Battlefield Monitoring Twitterverse
  • 15. USE PRE-BUILT KNOWLEDGE RESOURCES, SAVE TIME (30 YEARS?) Copyright (c) 2018 by STORM Insights Inc. All Rights Reserved.
  • 16. OPENCYC Copyright (c) 2018 by STORM Insights Inc. All Rights Reserved.
  • 17. PREBUILT CONTENT FOR FASTER DEPLOYMENT Copyright (c) 2018 by STORM Insights Inc. All Rights Reserved. Watson Conversation and Virtual Agent Source: IBM
  • 18. PROCURING DATA Commercial Providers Open Source Customers Public/Government Open Data
  • 19. YAGO - YET ANOTHER GREAT ONTOLOGY Semantic knowledge base derived from Wikipedia, WordNet, and GeoNames s://www.mpi-inf.mpg.de/departments/databases-and-information-systems/research/yago-naga/yag Joint project of the Man Planck Institute for Informatics and the Telecom Paris Tech University > 10M entities > 120M facts Facts & Entities may have Temporal and Spacial Dimensions Open Source: Available on Github (8/31/2017) Graph Browser
  • 20. Extracting “structured data” from Wikipedia. The DBpedia data set describes > 4.58 million entities >4 million are classified in a consistent ontology, including 1,445,000 persons, 735,000 places, 123,000 music albums, 87,000 films, 19,000 video games, 241,000 organizations, 251,000 species and 6,000 diseases. ~50 million links to other RDF datasets, 80.9 million links to Wikipedia categories, and 41.2 million YAGO2 categories. DBpedia uses the Resource Description Framework (RDF) to represent extracted information and consists of 3 billion RDF triples, of which 580 million were extracted from the English edition of Wikipedia and 2.46 billion from other language editions. Derived from "DBpedia." Wikipedia, The Free Encyclopedia. Wikipedia, The Free Encyclopedia, 17 Nov. 2017. Web. 12 Apr. 2018.
  • 21. NEED TO ASSOCIATE/RECOGNIZE/UNDERSTAND TO ORGANIZE/REPRESENT Copyright (c) 2018 by STORM Insights Inc. All Rights Reserved. Wordnet(R) Princeton University "About WordNet." Princeton University. 2010. <http://wordnet.princeton.edu>
  • 25. CUSTOMERS CAN BE RICH DATA SOURCES Copyright (c) 2018 by STORM Insights Inc. All Rights Reserved. Type Typical Current Use Potential Use Accelerometer/motion Rotate screen, Switch screen to landscape/portrait Ambient Light Adjust screen brightness Barometer Measure altitude Geo-Location (wifi/cellular) Location/Alerts 3-Axis Gyroscope Rotation rate for games, VR… Proximity Turn off screen when phone is by your head Touch ID fingerprint, Facial Recognition Security
  • 26. CUSTOMERS CAN BE RICH DATA SOURCES Copyright (c) 2012-6 by Dark Sky Company LLC. All Rights Reserved.
  • 27. OPEN DATA Copyright (c) 2018 by STORM Insights Inc. All Rights Reserved.
  • 28. PUBLIC USE OF OPT-IN DATA https://www.boston.gov/departments/new-urban-mechanics/street-bump Smarter Cities Collaborative Intelligence The Borg Lives!
  • 29. DISTRIBUTED DATA AND INTELLIGENCE Copyright (c) 2018 by STORM Insights Inc. All Rights Reserved. Intelligence can be Local to the device Distributed Aggregated
  • 30. OPT-IN FREE DATA Copyright (c) 2018 by STORM Insights Inc. All Rights Reserved.
  • 31. LOCATION AND PROXIMITY DATA Copyright (c) by Qualcomm. All Rights reserved.
  • 32. A WORD OF CAUTION Copyright (c) 2018 by STORM Insights Inc. All Rights Reserved. Sometimes the wisdom of crowds leads to Unintended Consequences
  • 33. BE CAREFUL USING COMMERCIALLY ACQUIRED DATA Copyright (c) 2018 by STORM Insights Inc. All Rights Reserved. According to MaxMind… This farm is home to 600,000,000 IP Addresses Watch Out for Unintended Consequences, Especially With Big Data
  • 34. “THAT’S A BIG HOUSE” - MEANING MAY BE DIFFERENT FOR DIFFERENT SPEAKERS Copyright (c) 2018 by STORM Insights Inc. All Rights Reserved. Bob Mary Al (Capone) Wikipedia contributors. "Alcatraz Federal Penitentiary." Wikipedia, The Free Encyclopedia. Wikipedia, The Free Encyclopedia, 30 Oct. 2017. Web. 8 Nov. 2017.
  • 35. DRAW A QUARTER, TO SCALE - RESULTS DIFFER ACCORDING TO HIDDEN CONTEXT Copyright (c) 2018 by STORM Insights Inc. All Rights Reserved.
  • 36. IT’S (ALMOST) ALL OUT THERE Domain Rate of Change General Streaming Specific Static Wikipedia OpenCyc Public OpenData NOAA Data
  • 37. FIND NEW USES FOR EXISTING DATA SOURCES Copyright (c) 2018 by STORM Insights Inc. All Rights Reserved. Copyright (c) 2014 by Umbrellium Ltd. No Shortage of Data How will you create value?
  • 38. adrian@storminsights.com Twitter @ajbowles Skype ajbowles KEEP IN TOUCH Upcoming SmartData Webinar Dates & Topics May 10 Case Studies: Transforming Industries with AI (Manufacturing & Retail) June 14 Natural Language Processing: From Chatbots to Artificial Understanding with Affective I/O COMING SOON… AGEOFREASONING.COM BOOK, VIDEOS, PROFESSIONAL SERVICES WWW.AGEOFREASONING.COM