Text, Content, and Social Analytics: BI for the New WorldSeth GrimesAlta Plana Corporation@sethgrimesTDWI – Washington DCJuly 15, 2011
Table of Content:Principles.Perspectives.Semantics.Text/content analytics.Social.BI for the New World.
Imperatives for the 2010s:Do more with more.“It’s Not Information Overload. It’s Filter Failure”: Clay Shirky, 2008.More sources & types of data.
Greater data volumes.
New hardware and methods.Automate more, more intelligently.Analytics.
Semantics.Engage.  Socialize.
I see three categories of data:Quantities, whether measured, observed, or computed.Content, which I’ll characterize as non-quantitative information.Metadata (semantic & structural) describing quantities and content.Our concern is content, analytics & fusion.
Structured/unstructured is a false dichotomy.
Where do relationships fit?DW & BI relate numbers......but by-the-numbers BI lacks doesn’t explain.
Questions for business (& government):What are people saying?  What’s hot/trending?What are they saying about {topic|person|product} X?... about X versus {topic|person|product} Y?How has opinion about X and Y evolved?How has opinion correlated with {our|competitors’|general} {news|marketing|sales|events}?What’s behind opinion, the root causes?Who are opinion leaders?How does sentiment propagate across multiple channels?
The answers are here...But how do you get at them?
“In this example, you can quickly see that the Drooling Dog Bar B Q has gotten lots of positive reviews, and if you want to see what other people have said about the restaurant, clicking this result is a good choice.”-- http://googleblog.blogspot.com/2009/05/more-search-options-and-other-updates.html“In the recap of [Searchology] from Google’s Matt Cutts, he tells us that: ‘If you sort by reviews, Google will perform sentiment analysis and highlight interesting comments.’-- Bill Slawski, “Google's New Review Search Option and Sentiment Analysis,” http://www.seobythesea.com/?p=1488
Text Analytics!More generally...
Analytics is a collection of tools and techniques that extract insights from data.Apply or embed analytics within business contexts – collect data and information about customers, markets, suppliers, and business processes – use results to inform, drive, and optimize business decision making – and you harness analytics as a core BI asset.
Analytics seeks structure in “unstructured” sources.x(t)	=	t	y(t)	=	½ a (et/a + e-t/a)	=	acosh(t/a)http://www.tropicalisland.de/NYC_New_York_Brooklyn_Bridge_from_World_Trade_Center_b.jpghttp://en.wikipedia.org/wiki/Seven_Bridges_of_K%C3%B6nigsberg
Text analytics models text.“Statistical information derived from word frequency and distribution is used by the machine to compute a relative measure of significance, first for individual words and then for sentences.”-- H.P. Luhn, The Automatic Creation of Literature Abstracts, IBM Journal, 1958.http://wordle.net
Document input and processingKnowledge handling is keyDesk Set (1957): Computer engineer Richard Sumner (Spencer Tracy) and television network librarian Bunny Watson (Katherine Hepburn)  and the "electronic brain" EMERAC.Hans Peter Luhn“A Business Intelligence System”IBM Journal, October 1958
“This rather unsophisticated argument on ‘significance’ avoids such linguistic implications as grammar and syntax... No attention is paid to the logical and semantic relationships the author has established.” 	-- H.P. Luhn
My 2009 text-analytics market survey asked, [What information] do you need (or expect to need) to extract or analyze:Text Analytics 2009: User Perspectives on Solutions and Providers
Text, Content, and Social Analytics: BI for the New World
From document to DB; an IBM example: “The standard features are stored in the STANDARD_KW table, keywords with their occurrences in the KEYWORD_KW_OCC table, and the text list features in the TEXTLIST_TEXT table. Every feature table contains the DOC_ID as a reference to the DOCUMENT table.”
Welcome to the New World.The Far Side by Gary LarsonKen Jennings, IBM Watson, and Brad Rutter play Jeopardy!https://secure.wikimedia.org/wikipedia/en/wiki/File:Watson_Jeopardy.jpg
In a sense, text analytics, by generating semantics, bridges search and BI to turn Information Retrieval into Information Access.Information AccessSearchBIText AnalyticsIntegrated analyticsSemantic search
Have we arrived?2001: A Space Odyssey, Stanley Kubrick
En route.http://www.businessweek.com/magazine/content/04_19/b3882029_mz072.htm
Intelligent computing involves:Big (and little) Data.Quantities.
Content.
Metadata.Analytics.Semantics.Integration.Inference
Semantics enables better content production, management & use.  Semantics captures –MeaningRelationshipsContext Understanding–the sense of “unstructured” online, social, and enterprise information, for content consumers and publishers.Semantics unites data of all types.

More Related Content

PPT
Reports vs analysis
PPTX
Introduction to Data Analytics
PPTX
PPT
Analysis of ‘Unstructured’ Data
PPTX
Data analytics
PPTX
Big data business analytics | Introduction to Business Analytics
DOCX
BIAM 410 Final Paper - Beyond the Buzzwords: Big Data, Machine Learning, What...
PPTX
Classification of data
Reports vs analysis
Introduction to Data Analytics
Analysis of ‘Unstructured’ Data
Data analytics
Big data business analytics | Introduction to Business Analytics
BIAM 410 Final Paper - Beyond the Buzzwords: Big Data, Machine Learning, What...
Classification of data

What's hot (20)

PPTX
Analytics 2
PPTX
Data analytics
PDF
Knowledge Graphs - The Power of Graph-Based Search
PPT
Data Modeling Presentations I
PPTX
Data analytics
PPTX
Big data and data science overview
PPT
Data mining
PPT
Data analytics & its Trends
PPT
Data Mining
PDF
Interfacing Chatbot with Data Retrieval and Analytics Queries for Decision Ma...
PDF
Introduction to Data Science (Data Summit, 2017)
PPTX
Analytics Organizations & The New Emerging Roles
PDF
Data Visualization
PDF
Project: Interfacing Chatbot with Data Retrieval and Analytics Queries for De...
PPTX
Data Warehousing AWS 12345
PPTX
Top Data Mining Techniques and Their Applications
PDF
Enterprise Knowledge Graph
PPTX
Predictive Analytics: Business Perspective & Use Cases
PPTX
Big data visualization
PDF
Big data analytics, research report
Analytics 2
Data analytics
Knowledge Graphs - The Power of Graph-Based Search
Data Modeling Presentations I
Data analytics
Big data and data science overview
Data mining
Data analytics & its Trends
Data Mining
Interfacing Chatbot with Data Retrieval and Analytics Queries for Decision Ma...
Introduction to Data Science (Data Summit, 2017)
Analytics Organizations & The New Emerging Roles
Data Visualization
Project: Interfacing Chatbot with Data Retrieval and Analytics Queries for De...
Data Warehousing AWS 12345
Top Data Mining Techniques and Their Applications
Enterprise Knowledge Graph
Predictive Analytics: Business Perspective & Use Cases
Big data visualization
Big data analytics, research report
Ad

Viewers also liked (19)

PPTX
The State of Semantics
PPTX
An Introduction to Text Analytics: 2013 Workshop presentation
PPTX
Hacking Data and Predicting the Future
PPTX
Smart Content = Smart Business
PPTX
Technology Frontiers: Text, Sentiment, and Sense
PDF
Social Media AND THE Enterprise Business Intelligence/Analytics Connection
PPTX
Sentiment Analysis: The Marketplace and Providers
PPTX
An Industry Perspective on Subjectivity, Sentiment, and Social
PPTX
The Insight Value of Social Sentiment
PDF
Text Analytics 2014: User Perspectives on Solutions and Providers
PDF
12 Things the Semantic Web Should Know about Content Analytics
PDF
Text/Content Analytics 2011: User Perspectives on Solutions and Providers
PDF
Social Data Sentiment Analysis
PPTX
Text Analytics Today
PPTX
Text Analytics Overview, 2011
PDF
From Business Intelligence to Predictive Analytics
PPTX
Introduction to Machine Learning
PPTX
Introduction to Machine Learning
PPTX
Business intelligence ppt
The State of Semantics
An Introduction to Text Analytics: 2013 Workshop presentation
Hacking Data and Predicting the Future
Smart Content = Smart Business
Technology Frontiers: Text, Sentiment, and Sense
Social Media AND THE Enterprise Business Intelligence/Analytics Connection
Sentiment Analysis: The Marketplace and Providers
An Industry Perspective on Subjectivity, Sentiment, and Social
The Insight Value of Social Sentiment
Text Analytics 2014: User Perspectives on Solutions and Providers
12 Things the Semantic Web Should Know about Content Analytics
Text/Content Analytics 2011: User Perspectives on Solutions and Providers
Social Data Sentiment Analysis
Text Analytics Today
Text Analytics Overview, 2011
From Business Intelligence to Predictive Analytics
Introduction to Machine Learning
Introduction to Machine Learning
Business intelligence ppt
Ad

Similar to Text, Content, and Social Analytics: BI for the New World (20)

PPTX
BrightTALK - Semantic AI
PDF
Big Data Analytics Research Report
PDF
Data Science - An emerging Stream of Science with its Spreading Reach & Impact
PDF
An Comprehensive Study of Big Data Environment and its Challenges.
DOCX
PDF
Different Types Of Fact Tables
PPTX
727325165-Unit-1-Data-Analytics-PPT-1.pptx
DOCX
Business Intelligence
PDF
Big data analytics, research report
PDF
Introduction to Knowledge Graphs: Data Summit 2020
PDF
From Rocket Science to Data Science
PDF
Big Data analytics usage
PDF
Keynote@CADE2018_HalukDemirkan
PPTX
Mtech First_Year Data Analytics in Industry with power bI
PPTX
Coping with Data Variety in the Big Data Era: The Semantic Computing Approach
PPT
lawTechCamp - Knowledge Management Panel
PPTX
Introduction to data science
PDF
Agile Testing Days 2017 Intoducing AgileBI Sustainably - Excercises
PDF
KIT-601 Lecture Notes-UNIT-1.pdf
PDF
Synthesys Technical Overview
BrightTALK - Semantic AI
Big Data Analytics Research Report
Data Science - An emerging Stream of Science with its Spreading Reach & Impact
An Comprehensive Study of Big Data Environment and its Challenges.
Different Types Of Fact Tables
727325165-Unit-1-Data-Analytics-PPT-1.pptx
Business Intelligence
Big data analytics, research report
Introduction to Knowledge Graphs: Data Summit 2020
From Rocket Science to Data Science
Big Data analytics usage
Keynote@CADE2018_HalukDemirkan
Mtech First_Year Data Analytics in Industry with power bI
Coping with Data Variety in the Big Data Era: The Semantic Computing Approach
lawTechCamp - Knowledge Management Panel
Introduction to data science
Agile Testing Days 2017 Intoducing AgileBI Sustainably - Excercises
KIT-601 Lecture Notes-UNIT-1.pdf
Synthesys Technical Overview

More from Seth Grimes (20)

PPT
Recent Advances in Natural Language Processing
PPTX
Creating an AI Startup: What You Need to Know
PPT
NLP 2020: What Works and What's Next
PDF
Efficient Deep Learning in Natural Language Processing Production, with Moshe...
PDF
From Customer Emotions to Actionable Insights, with Peter Dorrington
PDF
Intro to Deep Learning for Medical Image Analysis, with Dan Lee from Dentuit AI
PPT
Emotion AI
PPT
Text Analytics Market Trends
PPTX
Text Analytics for NLPers
PPTX
Our FinTech Future – AI’s Opportunities and Challenges?
PDF
Preposition Semantics: Challenges in Comprehensive Corpus Annotation and Auto...
PDF
The Ins and Outs of Preposition Semantics:
 Challenges in Comprehensive Corpu...
PDF
Fairness in Machine Learning and AI
PDF
Classification with Memes–Uber case study
PDF
Aspect Detection for Sentiment / Emotion Analysis
PPTX
Content AI: From Potential to Practice
PPT
Text Analytics Market Insights: What's Working and What's Next
PPTX
Global Analytics: Text, Speech, Sentiment, and Sense
PPTX
Text Analytics Past, Present & Future: An Industry View
PPTX
Sentiment, Opinion & Emotion on the Multilingual Web
Recent Advances in Natural Language Processing
Creating an AI Startup: What You Need to Know
NLP 2020: What Works and What's Next
Efficient Deep Learning in Natural Language Processing Production, with Moshe...
From Customer Emotions to Actionable Insights, with Peter Dorrington
Intro to Deep Learning for Medical Image Analysis, with Dan Lee from Dentuit AI
Emotion AI
Text Analytics Market Trends
Text Analytics for NLPers
Our FinTech Future – AI’s Opportunities and Challenges?
Preposition Semantics: Challenges in Comprehensive Corpus Annotation and Auto...
The Ins and Outs of Preposition Semantics:
 Challenges in Comprehensive Corpu...
Fairness in Machine Learning and AI
Classification with Memes–Uber case study
Aspect Detection for Sentiment / Emotion Analysis
Content AI: From Potential to Practice
Text Analytics Market Insights: What's Working and What's Next
Global Analytics: Text, Speech, Sentiment, and Sense
Text Analytics Past, Present & Future: An Industry View
Sentiment, Opinion & Emotion on the Multilingual Web

Recently uploaded (20)

PPTX
Build Your First AI Agent with UiPath.pptx
PPTX
GROUP4NURSINGINFORMATICSREPORT-2 PRESENTATION
PPT
Geologic Time for studying geology for geologist
PDF
Flame analysis and combustion estimation using large language and vision assi...
PDF
Five Habits of High-Impact Board Members
PPTX
2018-HIPAA-Renewal-Training for executives
PDF
OpenACC and Open Hackathons Monthly Highlights July 2025
PDF
A contest of sentiment analysis: k-nearest neighbor versus neural network
PPTX
Configure Apache Mutual Authentication
PPTX
Modernising the Digital Integration Hub
PDF
Architecture types and enterprise applications.pdf
PDF
Consumable AI The What, Why & How for Small Teams.pdf
PDF
Convolutional neural network based encoder-decoder for efficient real-time ob...
PDF
Zenith AI: Advanced Artificial Intelligence
PPTX
Benefits of Physical activity for teenagers.pptx
PPTX
Chapter 5: Probability Theory and Statistics
PDF
sbt 2.0: go big (Scala Days 2025 edition)
PDF
UiPath Agentic Automation session 1: RPA to Agents
PDF
1 - Historical Antecedents, Social Consideration.pdf
PPT
What is a Computer? Input Devices /output devices
Build Your First AI Agent with UiPath.pptx
GROUP4NURSINGINFORMATICSREPORT-2 PRESENTATION
Geologic Time for studying geology for geologist
Flame analysis and combustion estimation using large language and vision assi...
Five Habits of High-Impact Board Members
2018-HIPAA-Renewal-Training for executives
OpenACC and Open Hackathons Monthly Highlights July 2025
A contest of sentiment analysis: k-nearest neighbor versus neural network
Configure Apache Mutual Authentication
Modernising the Digital Integration Hub
Architecture types and enterprise applications.pdf
Consumable AI The What, Why & How for Small Teams.pdf
Convolutional neural network based encoder-decoder for efficient real-time ob...
Zenith AI: Advanced Artificial Intelligence
Benefits of Physical activity for teenagers.pptx
Chapter 5: Probability Theory and Statistics
sbt 2.0: go big (Scala Days 2025 edition)
UiPath Agentic Automation session 1: RPA to Agents
1 - Historical Antecedents, Social Consideration.pdf
What is a Computer? Input Devices /output devices

Text, Content, and Social Analytics: BI for the New World

  • 1. Text, Content, and Social Analytics: BI for the New WorldSeth GrimesAlta Plana Corporation@sethgrimesTDWI – Washington DCJuly 15, 2011
  • 3. Imperatives for the 2010s:Do more with more.“It’s Not Information Overload. It’s Filter Failure”: Clay Shirky, 2008.More sources & types of data.
  • 5. New hardware and methods.Automate more, more intelligently.Analytics.
  • 7. I see three categories of data:Quantities, whether measured, observed, or computed.Content, which I’ll characterize as non-quantitative information.Metadata (semantic & structural) describing quantities and content.Our concern is content, analytics & fusion.
  • 9. Where do relationships fit?DW & BI relate numbers......but by-the-numbers BI lacks doesn’t explain.
  • 10. Questions for business (& government):What are people saying? What’s hot/trending?What are they saying about {topic|person|product} X?... about X versus {topic|person|product} Y?How has opinion about X and Y evolved?How has opinion correlated with {our|competitors’|general} {news|marketing|sales|events}?What’s behind opinion, the root causes?Who are opinion leaders?How does sentiment propagate across multiple channels?
  • 11. The answers are here...But how do you get at them?
  • 12. “In this example, you can quickly see that the Drooling Dog Bar B Q has gotten lots of positive reviews, and if you want to see what other people have said about the restaurant, clicking this result is a good choice.”-- http://googleblog.blogspot.com/2009/05/more-search-options-and-other-updates.html“In the recap of [Searchology] from Google’s Matt Cutts, he tells us that: ‘If you sort by reviews, Google will perform sentiment analysis and highlight interesting comments.’-- Bill Slawski, “Google's New Review Search Option and Sentiment Analysis,” http://www.seobythesea.com/?p=1488
  • 14. Analytics is a collection of tools and techniques that extract insights from data.Apply or embed analytics within business contexts – collect data and information about customers, markets, suppliers, and business processes – use results to inform, drive, and optimize business decision making – and you harness analytics as a core BI asset.
  • 15. Analytics seeks structure in “unstructured” sources.x(t) = t y(t) = ½ a (et/a + e-t/a) = acosh(t/a)http://www.tropicalisland.de/NYC_New_York_Brooklyn_Bridge_from_World_Trade_Center_b.jpghttp://en.wikipedia.org/wiki/Seven_Bridges_of_K%C3%B6nigsberg
  • 16. Text analytics models text.“Statistical information derived from word frequency and distribution is used by the machine to compute a relative measure of significance, first for individual words and then for sentences.”-- H.P. Luhn, The Automatic Creation of Literature Abstracts, IBM Journal, 1958.http://wordle.net
  • 17. Document input and processingKnowledge handling is keyDesk Set (1957): Computer engineer Richard Sumner (Spencer Tracy) and television network librarian Bunny Watson (Katherine Hepburn) and the "electronic brain" EMERAC.Hans Peter Luhn“A Business Intelligence System”IBM Journal, October 1958
  • 18. “This rather unsophisticated argument on ‘significance’ avoids such linguistic implications as grammar and syntax... No attention is paid to the logical and semantic relationships the author has established.” -- H.P. Luhn
  • 19. My 2009 text-analytics market survey asked, [What information] do you need (or expect to need) to extract or analyze:Text Analytics 2009: User Perspectives on Solutions and Providers
  • 21. From document to DB; an IBM example: “The standard features are stored in the STANDARD_KW table, keywords with their occurrences in the KEYWORD_KW_OCC table, and the text list features in the TEXTLIST_TEXT table. Every feature table contains the DOC_ID as a reference to the DOCUMENT table.”
  • 22. Welcome to the New World.The Far Side by Gary LarsonKen Jennings, IBM Watson, and Brad Rutter play Jeopardy!https://secure.wikimedia.org/wikipedia/en/wiki/File:Watson_Jeopardy.jpg
  • 23. In a sense, text analytics, by generating semantics, bridges search and BI to turn Information Retrieval into Information Access.Information AccessSearchBIText AnalyticsIntegrated analyticsSemantic search
  • 24. Have we arrived?2001: A Space Odyssey, Stanley Kubrick
  • 26. Intelligent computing involves:Big (and little) Data.Quantities.
  • 29. Semantics enables better content production, management & use. Semantics captures –MeaningRelationshipsContext Understanding–the sense of “unstructured” online, social, and enterprise information, for content consumers and publishers.Semantics unites data of all types.
  • 33. From connections to influence: What’s wrong with these pictures? (Radian6, Sysomos, Klout)
  • 34. Social analytics:Use social data in analyses (alongside enterprise & online information).Content.
  • 35. Connections.Bring BI to social analyses.3rd & 4th senses of social analytics:Adopt agile, collaborative methods.Share your data.A challenge: Enterprise-social-online data integration.
  • 36. Text, Content, and Social Analytics: BI for the New WorldSeth GrimesAlta Plana Corporation@sethgrimesTDWI – Washington DCJuly 15, 2011