1
What do we do with all this big data?
Fostering insight and trust in the digital age
Susan Etlinger, Industry Analyst, Altimeter Group
February 11, 2015

Photo: qthomasbower, cc 2.0
2
“Orwell feared those who would deprive us of information. Huxley feared those who would give us so much that we would be reduced to passivity and egotism. Orwell feared that the truth would be concealed from us. Huxley feared the truth would be drowned in a sea of irrelevance.”
− Neil Postman, Amusing Ourselves to Death, 1985
3
What’s so hard about big data?
4
“Ninety percent of all the data in the world was created in the past two years.”
− IBM
5
What is big data?
Big Data as defined by Gartner*:
• Volume
• Velocity
• Variety
Variety is the most challenging.
* NB: This is really just a starting point for understanding big data. See Gartner for research on origins and definitions.
6
With Big Data, Size Isn’t Everything
Images, text, video, audio
7
How unstructured data disrupts
1. Does not conform to standard data models
2. Demands new analytical approaches
   · Human expression: images, text. Much is unstructured.
   · Raw material: requires processing to translate it into something a machine can understand and act upon
3. Strains traditional methodologies
Source: http://www.foreignpolicy.com/articles/2014/09/26/why_big_data_missed_the_early_warning_signs_of_ebola?utm_content=buffer6a337&utm_medium=social&utm_source=twitter.com&utm_campaign=buffer#trending
8
A simple example…
Source: http://www.foreignpolicy.com/articles/2014/09/26/why_big_data_missed_the_early_warning_signs_of_ebola?utm_content=buffer6a337&utm_medium=social&utm_source=twitter.com&utm_campaign=buffer#trending
9
“The nature of human language demands rigorous and repeatable processes to extract meaning from it in a transparent and defensible way.”
10
Unstructured Data Requires New Analytics
11
Unstructured Data Requires New Analytics
12
Case Study: Health Media Collaboratory
What can social data tell us about smoking cessation?
• How much electronic promotion exists on Twitter?
• How much organic conversation about e-cigarettes is on Twitter?
Understanding the impact of CDC anti-smoking commercials
• Did the commercials work?
• How can we prove it?
13
Disambiguating e-cigarettes
14
Disambiguating “smoking”
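One way to picture the disambiguation problem is a small rule-based filter. This is only a sketch, not the Health Media Collaboratory's method: the word lists, the labels, and the function name `classify_smoking_mention` are all made up for illustration, and a real project would pair rules like these with human coding.

```python
import re

# Hypothetical context words; a real keyword list would be far longer
# and would include slang and misspellings.
SMOKING_TERMS = {"smoking", "smoke", "smokin"}
TOBACCO_CONTEXT = {"cigarette", "cigarettes", "quit", "tobacco", "nicotine", "cdc"}
OTHER_CONTEXT = {"ribs", "brisket", "bbq", "hot"}


def tokenize(text):
    """Lowercase the text and split it into simple word tokens."""
    return re.findall(r"[a-z']+", text.lower())


def classify_smoking_mention(text):
    """Label a tweet as 'tobacco', 'other', 'ambiguous', or 'no_mention'."""
    tokens = set(tokenize(text))
    if not tokens & SMOKING_TERMS:
        return "no_mention"
    if tokens & TOBACCO_CONTEXT:
        return "tobacco"
    if tokens & OTHER_CONTEXT:
        return "other"
    # No helpful context words: leave it for a human coder.
    return "ambiguous"


if __name__ == "__main__":
    for tweet in [
        "That CDC ad made me want to quit smoking",
        "Smoking ribs all afternoon for the BBQ",
        "She looks smoking tonight",
    ]:
        print(classify_smoking_mention(tweet), "|", tweet)
```

Tweets that land in the "ambiguous" bucket are the ones that most need human review.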
15
Methodology
1. Data collection. Determine appropriate source and sample size of the data.
2. Keyword selection. Generate the most comprehensive possible list of keywords, encompassing nonstandard English usages, slang terms, and misspellings.
3. Metadata. Collect metadata related to the tweets:
   · A tweet ID (a unique numerical identifier assigned to each tweet)
   · The username and biographical profile of the account used to post the tweet
   · Geolocation (if enabled by the user)
   · Number of followers of the posting account
   · The number of accounts the posting account follows
16
Methodology
3. Metadata (continued). Collect metadata related to the tweets:
   · The posting account’s Klout score
   · Hashtags
   · URL links
   · Media content attached to the tweet
   · Filtering for engagement
4. Human coding. To assess relevance and code message content.
5. Precision and relevance. Combination of human and machine coding.
6. Recall. To determine whether data was generalizable.
7. Content coding. To determine message effectiveness.
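Steps 5 and 6 above lean on two standard measures, precision and recall. As a rough sketch only, assuming human coders have labeled a sample of tweets as relevant or not and a machine filter has labeled the same sample, the two measures could be computed as below. The sample labels and the function name `precision_recall` are invented for illustration and do not come from the case study.

```python
def precision_recall(human_labels, machine_labels):
    """Compare machine relevance labels against human-coded labels.

    Both arguments are equal-length lists of booleans (True = relevant).
    Human labels are treated as the reference.
    """
    pairs = list(zip(human_labels, machine_labels))
    true_pos = sum(1 for h, m in pairs if h and m)
    false_pos = sum(1 for h, m in pairs if not h and m)
    false_neg = sum(1 for h, m in pairs if h and not m)

    # Precision: of the tweets the machine kept, how many were relevant?
    kept = true_pos + false_pos
    precision = true_pos / kept if kept else 0.0

    # Recall: of the relevant tweets, how many did the machine keep?
    relevant = true_pos + false_neg
    recall = true_pos / relevant if relevant else 0.0
    return precision, recall


if __name__ == "__main__":
    # Made-up sample of six tweets.
    human = [True, True, False, True, False, False]
    machine = [True, False, True, True, False, False]
    p, r = precision_recall(human, machine)
    print(f"precision={p:.2f} recall={r:.2f}")
```

Low precision means the keyword filter lets in too much noise; low recall means it misses relevant conversation.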
17
“Eighty-seven percent of the tweets about the TV commercials expressed fear, and the ads had the desired result of jolting the audience into a thought process that might have some impact on future behavior.”
− Health Media Collaboratory
18
From data to insight
19
“The type of data (structured, text, etc.) isn’t the point at all. The way of thinking matters.”
− Philip B. Stark, professor and chair of statistics, University of California, Berkeley
20
Logic problems: causation vs. correlation
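A toy illustration of the correlation trap, with entirely made-up numbers: two series, `x` and `y`, both depend on a third factor, `z`, and never on each other. They still correlate strongly, and the correlation mostly vanishes once `z` is controlled for. The partial-correlation formula is the standard first-order one; everything else here is an assumption for the sake of the example.

```python
from math import sqrt


def pearson(xs, ys):
    """Pearson correlation coefficient of two equal-length sequences."""
    n = len(xs)
    mean_x = sum(xs) / n
    mean_y = sum(ys) / n
    cov = sum((a - mean_x) * (b - mean_y) for a, b in zip(xs, ys))
    var_x = sum((a - mean_x) ** 2 for a in xs)
    var_y = sum((b - mean_y) ** 2 for b in ys)
    return cov / sqrt(var_x * var_y)


def partial_corr(xs, ys, zs):
    """Correlation of xs and ys after controlling for zs."""
    r_xy = pearson(xs, ys)
    r_xz = pearson(xs, zs)
    r_yz = pearson(ys, zs)
    return (r_xy - r_xz * r_yz) / sqrt((1 - r_xz ** 2) * (1 - r_yz ** 2))


# Hypothetical data: z drives both x and y; x and y never touch each other.
z = [1, 2, 3, 4, 5, 6, 7, 8]
noise_x = [0.5, -0.5, 0.5, -0.5, 0.5, -0.5, 0.5, -0.5]
noise_y = [0.5, 0.5, -0.5, -0.5, 0.5, 0.5, -0.5, -0.5]
x = [2 * zi + e for zi, e in zip(z, noise_x)]
y = [3 * zi + e for zi, e in zip(z, noise_y)]

if __name__ == "__main__":
    print("raw correlation of x and y:", round(pearson(x, y), 3))
    print("controlling for z:", round(partial_corr(x, y, z), 3))
```

A strong raw correlation here says nothing about x causing y; the shared driver explains it.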
21
From Insight to Trust
22
“In civilized life, law floats in a sea of ethics.”
− Earl Warren, former chief justice of the United States
23
Three Recent Examples
24
Ethical Issues Related to Data
25
Planning for data ubiquity
26
Planning for 2015
① Define data strategy and operating model
② Update analytics methodology to reflect new data realities
③ Seek out critical thinking and diverse skill sets
④ Insist on ethical data use and transparent disclosure
⑤ Reward and reinforce humility and learning
27
Thank You
Disclaimer: Although the information and data used in this report have been produced and processed from sources believed to be reliable, no warranty expressed or implied is made regarding the completeness, accuracy, adequacy or use of the information. The authors and contributors of the information and data shall have no liability for errors or omissions contained herein or for interpretations thereof. Reference herein to any specific product or vendor by trade name, trademark or otherwise does not constitute or imply its endorsement, recommendation or favoring by the authors or contributors and shall not be used for advertising or product endorsement purposes. The opinions expressed herein are subject to change without notice.

Altimeter Group provides research and advisory for companies challenged by business disruptions, enabling them to pursue new opportunities and business models.
Susan Etlinger
susan@altimetergroup.com
susanetlinger.com
@setlinger
