SlideShare a Scribd company logo
Demystifying
   Big Data
        Aaron Weber, Product Architect
        Spiral16
Since 2007, Spiral16′s group of developers, data
analysts and researchers have been quietly building
one of the most powerful data mining platforms on
the planet, backed up by one of the most
experienced social data analysis teams in the
industry.

Spiral16′s social media and web research
platform and data analysis services are designed to
help everyone from CEOs to CMOs, marketing and
PR agencies, and researchers.

                              Demystifying Big Data
  11/8/2012                                           2
So what is Big Data?




                 Demystifying Big Data
11/8/2012                                3
So what is Big Data?


            “A collection of data sets so large and
            complex that it becomes difficult to
            process using on-hand database
            management tools.”
                       - Wikipedia




                         Demystifying Big Data
11/8/2012                                             4
So what is Big Data?

                           Exabytes Created By Year
            3000

            2500

            2000

            1500

            1000

            500

              0
                   2006   2007   2008   2009   2010   2011   2012




                             Demystifying Big Data
11/8/2012                                                           5
So what is Big Data?

                        In 2008 we were generating
                        as much stored data from the
                        dawn of civilization to 2003
                        every two days.

                        And that rate is predicted to
                        double every two years.



                 Demystifying Big Data
11/8/2012                                     6
So what is Big Data?




                 Demystifying Big Data
11/8/2012                                7
So what is Big Data?




            Inevitable

                 Demystifying Big Data
11/8/2012                                8
Big Data vs Little Data
                  Little Data =
              Relational Databases




                  Demystifying Big Data
11/8/2012                                 9
Big Data vs Little Data
                   Big Data =
            Non-Relational Databases




                   Demystifying Big Data
11/8/2012                                  10
Big Data vs Little Data




                  Demystifying Big Data
11/8/2012                                 11
Big Data vs Little Data


                                          • Structured




                  Demystifying Big Data
11/8/2012                                                12
Big Data vs Little Data


                                          • Structured
                                          • Organized




                  Demystifying Big Data
11/8/2012                                                13
Big Data vs Little Data


                                          • Structured
                                          • Organized
                                          • Hierarchical




                  Demystifying Big Data
11/8/2012                                              14
Big Data vs Little Data


                                          •   Structured
                                          •   Organized
                                          •   Hierarchical
                                          •   Rigid




                  Demystifying Big Data
11/8/2012                                                15
Big Data vs Little Data




                  Demystifying Big Data
11/8/2012                                 16
Big Data vs Little Data



                                          • Unstructured




                  Demystifying Big Data
11/8/2012                                             17
Big Data vs Little Data



                                          • Unstructured
                                          • Disparate




                  Demystifying Big Data
11/8/2012                                             18
Big Data vs Little Data



                                          • Unstructured
                                          • Disparate
                                          • Non-Hierarchical




                  Demystifying Big Data
11/8/2012                                             19
Big Data vs Little Data



                                          •   Unstructured
                                          •   Disparate
                                          •   Non-Hierarchical
                                          •   Reusable




                  Demystifying Big Data
11/8/2012                                               20
So why Big Data?


    • Find trends in existing data




                 Demystifying Big Data
11/8/2012                                21
So why Big Data?


    • Find trends in existing data
    • Better consumer targeting




                 Demystifying Big Data
11/8/2012                                22
So why Big Data?


    • Find trends in existing data
    • Better consumer targeting
    • Predictive analysis




                 Demystifying Big Data
11/8/2012                                23
So why Big Data?


    •       Find trends in existing data
    •       Better consumer targeting
    •       Predictive analysis
    •       Making better use of what you
            already know


                      Demystifying Big Data
11/8/2012                                     24
Who is using Big Data?




                  Demystifying Big Data
11/8/2012                                 25
Who is using Big Data?

      o     Financial Services
      o     Healthcare
      o     Retail
      o     Marketing
      o     Politics
                  Demystifying Big Data
11/8/2012                                 26
The Business of Big Data




                   Demystifying Big Data
11/8/2012                                  27
The Business of Big Data
   Vendor (Founded)    Founded Funding (in $US mil.) # of Institutional Rounds                 Investors

                                                                                 SAC Capital, The Founders Fund, Glynn
                                                                                 Capital, In-Q-Tel, Reed Elsevier
            Palantir    2004            $301                     7
                                                                                 Ventures, Ulu Ventures, Youniversity
                                                                                 Ventures and Jeremy Stoppelman
        Mu Sigma        2004            $133                     2               General Atlantic and Sequoia Capital
                                                                                 Silver Lake Sumeru, Accel-KKR, Invus
     Opera Solutions    2004            $84                      1               Financial Advisors, JGE Capital and
                                                                                 Tola Capital
                                                                                 Accel Partners, Greylock Partners and
            Cloudera    2008            $81                      4
                                                                                 Meritech Capital Partners
                                                                                 New Enterprise Associates, Sequoia
             10gen      2008           $73.4                     5               Capital, Flybridge Capital and Union
                                                                                 Square Ventures
                                                                                 Amazon, Menlo Ventures, Mohr
                                                                                 Davidow Ventures, Bay Partners,
            ParAccel    2005            $73                      5               Walden International, Tao Venture
                                                                                 Capital Partners and Silicon Valley
                                                                                 Bank
                                                                                 Andreesen Horowitz, General Catalyst,
                                                                                 O’Reilly AlphaTech Ventures, Windcrest
        GoodData        2007           $53.5                     3
                                                                                 Partners, Tenaya Capital and Next
                                                                                 World Capital
                                                                                 Ignition Partners, August Capital, JK&B
        Splunk(1)       2003            $40                      3
                                                                                 and Sevin Rosen Funds
                                                                                 Meritech Capital, Lightspeed Venture
        DataStax        2010           $38.7                     3               Partners, Sequoia Capital and Crosslink
                                                                                 Capital
        1010data        2000            $35                      1               Norwest Venture Partners




                                          Demystifying Big Data
11/8/2012                                                                                                  28
The Business of Big Data




                   Demystifying Big Data
11/8/2012                                  29
So what is Big Data?




            Inevitable

                 Demystifying Big Data
11/8/2012                                30
So what is Big Data?




            Inevitable
             And that’s a good thing.




                  Demystifying Big Data
11/8/2012                                 31
Big Data’s Big Questions


o What about the data storage and
  utilization we already have?
o How do we know if our data is Big Data?
o What are the primary costs of big data?
o What answers can I get from our data?
o Where do we begin?


                   Demystifying Big Data
11/8/2012                                  32
Better data. Better decisions.


        7171 West 95th Street
              Suite 310
       Overland Park, KS 2208
            913.944.4500

         www.spiral16.com

                                        Aaron Weber
                            aaron.weber@spiral16.com

More Related Content

PDF
Becoming a Data-Driven Organization - Aligning Business & Data Strategy
PPTX
Sr. Jon Ander, Internet de las Cosas y Big Data: ¿hacia dónde va la Industria?
PDF
Big Data Fundamentals
PDF
ADV Slides: The World in 2045 – What Has Artificial Intelligence Created?
PDF
GraphDay Stockholm - Fraud Prevention
PDF
The New Convergence of Data; the Next Strategic Business Advantage
PDF
Everis big data_wilson_v1.4
Becoming a Data-Driven Organization - Aligning Business & Data Strategy
Sr. Jon Ander, Internet de las Cosas y Big Data: ¿hacia dónde va la Industria?
Big Data Fundamentals
ADV Slides: The World in 2045 – What Has Artificial Intelligence Created?
GraphDay Stockholm - Fraud Prevention
The New Convergence of Data; the Next Strategic Business Advantage
Everis big data_wilson_v1.4

What's hot (20)

PDF
Whitepaper: Know Your Big Data – in 10 Minutes! - Happiest Minds
PDF
Big data assignment
PDF
Unlocking Value in the Fragmented World of Big Data Analytics (POV Paper)
PPTX
Big Data, NoSQL, NewSQL & The Future of Data Management
PDF
Big data basics
PDF
Data-Ed Webinar: Demystifying Big Data
PPTX
How Data is Driving AI Innovation
PDF
Smart Data Webinar: Knowledge as a Service
PDF
Big Data Ethics
PDF
What's the Big Deal About Big Data?
PDF
Talking to your CEO about the Chief Data Officer Role
PDF
The Chief Data Officer's Agenda: The Need for Information Governance Controls
PDF
Data Modeling and Relational to NoSQL
PDF
From Big Data to Smart Data
PDF
Trends in Big Data & Business Challenges
PDF
Big data's impact on online marketing
PPTX
Metadata Matters
PDF
Assumptions about Data and Analysis: Briefing room webcast slides
PDF
McKinsey Global Institute - Big data: The next frontier for innovation, compe...
PPTX
The Business of Big Data (IA Ventures)
Whitepaper: Know Your Big Data – in 10 Minutes! - Happiest Minds
Big data assignment
Unlocking Value in the Fragmented World of Big Data Analytics (POV Paper)
Big Data, NoSQL, NewSQL & The Future of Data Management
Big data basics
Data-Ed Webinar: Demystifying Big Data
How Data is Driving AI Innovation
Smart Data Webinar: Knowledge as a Service
Big Data Ethics
What's the Big Deal About Big Data?
Talking to your CEO about the Chief Data Officer Role
The Chief Data Officer's Agenda: The Need for Information Governance Controls
Data Modeling and Relational to NoSQL
From Big Data to Smart Data
Trends in Big Data & Business Challenges
Big data's impact on online marketing
Metadata Matters
Assumptions about Data and Analysis: Briefing room webcast slides
McKinsey Global Institute - Big data: The next frontier for innovation, compe...
The Business of Big Data (IA Ventures)
Ad

Viewers also liked (13)

PPTX
MySQL Nested Datasets #JDNL13
PPSX
Demystifying Columnar Databases
PDF
Drill architecture 20120913
PDF
High Performance JSON Search and Relational Faceted Browsing with Lucene
PPTX
Rise of Column Oriented Database
PPT
Dremel: Interactive Analysis of Web-Scale Datasets
ODP
Google's Dremel
PDF
Trees and Hierarchies in SQL
PDF
More Better Nested Set
PDF
NoSQL? No, SQL! - SQL, the underestimated "Big Data" technology
PDF
Slides: NoSQL Data Modeling Using JSON Documents – A Practical Approach
PDF
Models for hierarchical data
PPTX
Big Data Platforms: An Overview
MySQL Nested Datasets #JDNL13
Demystifying Columnar Databases
Drill architecture 20120913
High Performance JSON Search and Relational Faceted Browsing with Lucene
Rise of Column Oriented Database
Dremel: Interactive Analysis of Web-Scale Datasets
Google's Dremel
Trees and Hierarchies in SQL
More Better Nested Set
NoSQL? No, SQL! - SQL, the underestimated "Big Data" technology
Slides: NoSQL Data Modeling Using JSON Documents – A Practical Approach
Models for hierarchical data
Big Data Platforms: An Overview
Ad

Similar to Demystifying Big Data (20)

PPTX
bigdata introduction for students pg msc
PPTX
A Big Data Concept
PPTX
TOPIC.pptx
PPTX
Big data
PPTX
Introduction to Big Data
PDF
Big Data Analysis
PPTX
introduction to data science
PDF
Big Data - Insights & Challenges
PDF
All About Big Data
PPTX
Big Data, Big Investment
PDF
Bda assignment can also be used for BDA notes and concept understanding.
PDF
Data-Ed: Demystifying Big Data
PDF
DataEd Online: Demystifying Big Data
PPT
Big Data Analytics Materials, Chapter: 1
PPTX
Bigdata Hadoop introduction
PDF
L18 Big Data and Analytics
PPTX
bigdata- Introduction for pg students fo
PPT
big_data.ppt
PPT
big_data.ppt
PPTX
Big Data.pptx
bigdata introduction for students pg msc
A Big Data Concept
TOPIC.pptx
Big data
Introduction to Big Data
Big Data Analysis
introduction to data science
Big Data - Insights & Challenges
All About Big Data
Big Data, Big Investment
Bda assignment can also be used for BDA notes and concept understanding.
Data-Ed: Demystifying Big Data
DataEd Online: Demystifying Big Data
Big Data Analytics Materials, Chapter: 1
Bigdata Hadoop introduction
L18 Big Data and Analytics
bigdata- Introduction for pg students fo
big_data.ppt
big_data.ppt
Big Data.pptx

More from Spiral16 (14)

PPTX
State of Social Media 2014
PPTX
Writing for Social Media: Engagement Strategies for Content Managers
PPTX
Listening & Storytelling for Nonprofits
PPTX
Monitoring & Measuring Social Media - A Practical Guide
PPTX
Listening & Nonprofit Storytelling
PPTX
Measuring Social Media Success
PPT
Boulevard Brewing Co. Social Media - The Tale of the Ale
PPTX
Measuring Social Media: Where's the ROI?
PPTX
Politics & Government in the Social Age
PPTX
How to Get a Grasp on ROI for Social Media
PPTX
What is Internet Monitoring?
PPT
Social Media ROI
PPTX
Measuring Influence: The Value of 3D Data Visualization
PPTX
What the Tweet is going on?
State of Social Media 2014
Writing for Social Media: Engagement Strategies for Content Managers
Listening & Storytelling for Nonprofits
Monitoring & Measuring Social Media - A Practical Guide
Listening & Nonprofit Storytelling
Measuring Social Media Success
Boulevard Brewing Co. Social Media - The Tale of the Ale
Measuring Social Media: Where's the ROI?
Politics & Government in the Social Age
How to Get a Grasp on ROI for Social Media
What is Internet Monitoring?
Social Media ROI
Measuring Influence: The Value of 3D Data Visualization
What the Tweet is going on?

Recently uploaded (20)

PPTX
Slide gioi thieu VietinBank Quy 2 - 2025
PPTX
Slide gioi thieu VietinBank Quy 2 - 2025
PDF
Module 3 - Functions of the Supervisor - Part 1 - Student Resource (1).pdf
PPTX
Project Management_ SMART Projects Class.pptx
PPTX
Negotiation and Persuasion Skills: A Shrewd Person's Perspective
PDF
Introduction to Generative Engine Optimization (GEO)
PDF
Module 2 - Modern Supervison Challenges - Student Resource.pdf
PDF
Solara Labs: Empowering Health through Innovative Nutraceutical Solutions
PDF
ANALYZING THE OPPORTUNITIES OF DIGITAL MARKETING IN BANGLADESH TO PROVIDE AN ...
PDF
Tata consultancy services case study shri Sharda college, basrur
PPT
Lecture notes on Business Research Methods
PPTX
interschool scomp.pptxzdkjhdjvdjvdjdhjhieij
PDF
Cours de Système d'information about ERP.pdf
PDF
Technical Architecture - Chainsys dataZap
PDF
Tortilla Mexican Grill 发射点犯得上发射点发生发射点犯得上发生
PDF
TyAnn Osborn: A Visionary Leader Shaping Corporate Workforce Dynamics
PDF
THE COMPLETE GUIDE TO BUILDING PASSIVE INCOME ONLINE
PDF
Susan Semmelmann: Enriching the Lives of others through her Talents and Bless...
PPTX
BUSINESS CYCLE_INFLATION AND UNEMPLOYMENT.pptx
DOCX
Handbook of Entrepreneurship- Chapter 5: Identifying business opportunity.docx
Slide gioi thieu VietinBank Quy 2 - 2025
Slide gioi thieu VietinBank Quy 2 - 2025
Module 3 - Functions of the Supervisor - Part 1 - Student Resource (1).pdf
Project Management_ SMART Projects Class.pptx
Negotiation and Persuasion Skills: A Shrewd Person's Perspective
Introduction to Generative Engine Optimization (GEO)
Module 2 - Modern Supervison Challenges - Student Resource.pdf
Solara Labs: Empowering Health through Innovative Nutraceutical Solutions
ANALYZING THE OPPORTUNITIES OF DIGITAL MARKETING IN BANGLADESH TO PROVIDE AN ...
Tata consultancy services case study shri Sharda college, basrur
Lecture notes on Business Research Methods
interschool scomp.pptxzdkjhdjvdjvdjdhjhieij
Cours de Système d'information about ERP.pdf
Technical Architecture - Chainsys dataZap
Tortilla Mexican Grill 发射点犯得上发射点发生发射点犯得上发生
TyAnn Osborn: A Visionary Leader Shaping Corporate Workforce Dynamics
THE COMPLETE GUIDE TO BUILDING PASSIVE INCOME ONLINE
Susan Semmelmann: Enriching the Lives of others through her Talents and Bless...
BUSINESS CYCLE_INFLATION AND UNEMPLOYMENT.pptx
Handbook of Entrepreneurship- Chapter 5: Identifying business opportunity.docx

Demystifying Big Data

  • 1. Demystifying Big Data Aaron Weber, Product Architect Spiral16
  • 2. Since 2007, Spiral16′s group of developers, data analysts and researchers have been quietly building one of the most powerful data mining platforms on the planet, backed up by one of the most experienced social data analysis teams in the industry. Spiral16′s social media and web research platform and data analysis services are designed to help everyone from CEOs to CMOs, marketing and PR agencies, and researchers. Demystifying Big Data 11/8/2012 2
  • 3. So what is Big Data? Demystifying Big Data 11/8/2012 3
  • 4. So what is Big Data? “A collection of data sets so large and complex that it becomes difficult to process using on-hand database management tools.” - Wikipedia Demystifying Big Data 11/8/2012 4
  • 5. So what is Big Data? Exabytes Created By Year 3000 2500 2000 1500 1000 500 0 2006 2007 2008 2009 2010 2011 2012 Demystifying Big Data 11/8/2012 5
  • 6. So what is Big Data? In 2008 we were generating as much stored data from the dawn of civilization to 2003 every two days. And that rate is predicted to double every two years. Demystifying Big Data 11/8/2012 6
  • 7. So what is Big Data? Demystifying Big Data 11/8/2012 7
  • 8. So what is Big Data? Inevitable Demystifying Big Data 11/8/2012 8
  • 9. Big Data vs Little Data Little Data = Relational Databases Demystifying Big Data 11/8/2012 9
  • 10. Big Data vs Little Data Big Data = Non-Relational Databases Demystifying Big Data 11/8/2012 10
  • 11. Big Data vs Little Data Demystifying Big Data 11/8/2012 11
  • 12. Big Data vs Little Data • Structured Demystifying Big Data 11/8/2012 12
  • 13. Big Data vs Little Data • Structured • Organized Demystifying Big Data 11/8/2012 13
  • 14. Big Data vs Little Data • Structured • Organized • Hierarchical Demystifying Big Data 11/8/2012 14
  • 15. Big Data vs Little Data • Structured • Organized • Hierarchical • Rigid Demystifying Big Data 11/8/2012 15
  • 16. Big Data vs Little Data Demystifying Big Data 11/8/2012 16
  • 17. Big Data vs Little Data • Unstructured Demystifying Big Data 11/8/2012 17
  • 18. Big Data vs Little Data • Unstructured • Disparate Demystifying Big Data 11/8/2012 18
  • 19. Big Data vs Little Data • Unstructured • Disparate • Non-Hierarchical Demystifying Big Data 11/8/2012 19
  • 20. Big Data vs Little Data • Unstructured • Disparate • Non-Hierarchical • Reusable Demystifying Big Data 11/8/2012 20
  • 21. So why Big Data? • Find trends in existing data Demystifying Big Data 11/8/2012 21
  • 22. So why Big Data? • Find trends in existing data • Better consumer targeting Demystifying Big Data 11/8/2012 22
  • 23. So why Big Data? • Find trends in existing data • Better consumer targeting • Predictive analysis Demystifying Big Data 11/8/2012 23
  • 24. So why Big Data? • Find trends in existing data • Better consumer targeting • Predictive analysis • Making better use of what you already know Demystifying Big Data 11/8/2012 24
  • 25. Who is using Big Data? Demystifying Big Data 11/8/2012 25
  • 26. Who is using Big Data? o Financial Services o Healthcare o Retail o Marketing o Politics Demystifying Big Data 11/8/2012 26
  • 27. The Business of Big Data Demystifying Big Data 11/8/2012 27
  • 28. The Business of Big Data Vendor (Founded) Founded Funding (in $US mil.) # of Institutional Rounds Investors SAC Capital, The Founders Fund, Glynn Capital, In-Q-Tel, Reed Elsevier Palantir 2004 $301 7 Ventures, Ulu Ventures, Youniversity Ventures and Jeremy Stoppelman Mu Sigma 2004 $133 2 General Atlantic and Sequoia Capital Silver Lake Sumeru, Accel-KKR, Invus Opera Solutions 2004 $84 1 Financial Advisors, JGE Capital and Tola Capital Accel Partners, Greylock Partners and Cloudera 2008 $81 4 Meritech Capital Partners New Enterprise Associates, Sequoia 10gen 2008 $73.4 5 Capital, Flybridge Capital and Union Square Ventures Amazon, Menlo Ventures, Mohr Davidow Ventures, Bay Partners, ParAccel 2005 $73 5 Walden International, Tao Venture Capital Partners and Silicon Valley Bank Andreesen Horowitz, General Catalyst, O’Reilly AlphaTech Ventures, Windcrest GoodData 2007 $53.5 3 Partners, Tenaya Capital and Next World Capital Ignition Partners, August Capital, JK&B Splunk(1) 2003 $40 3 and Sevin Rosen Funds Meritech Capital, Lightspeed Venture DataStax 2010 $38.7 3 Partners, Sequoia Capital and Crosslink Capital 1010data 2000 $35 1 Norwest Venture Partners Demystifying Big Data 11/8/2012 28
  • 29. The Business of Big Data Demystifying Big Data 11/8/2012 29
  • 30. So what is Big Data? Inevitable Demystifying Big Data 11/8/2012 30
  • 31. So what is Big Data? Inevitable And that’s a good thing. Demystifying Big Data 11/8/2012 31
  • 32. Big Data’s Big Questions o What about the data storage and utilization we already have? o How do we know if our data is Big Data? o What are the primary costs of big data? o What answers can I get from our data? o Where do we begin? Demystifying Big Data 11/8/2012 32
  • 33. Better data. Better decisions. 7171 West 95th Street Suite 310 Overland Park, KS 2208 913.944.4500 www.spiral16.com Aaron Weber aaron.weber@spiral16.com

Editor's Notes

  • #5: So how large and complex are we talking about?
  • #6: In fact we’ve moved past exabytes into zetabytes. To put this in perspective: That’s 2.7 TRILLION gigabytes of data.
  • #7: So what is Big Data?
  • #8: There is simply no way with our current levels of technology to process data in the manner we’re used to. Big Data is the technological path we have to take if we have any desire to make sense of the world we’re creating every day.
  • #9: There is simply no way with our current levels of technology to process data in the manner we’re used to. Big Data is the technological path we have to take if we have any desire to make sense of the world we’re creating every day.
  • #10: Little Data is a misnomer here. In the real world we’re still talking about massive sets of data. Warehouses full of it, in fact. What we’re really talking about is non-relational vs relational databases. An illustration:
  • #11: Little Data is a misnomer here. In the real world we’re still talking about massive sets of data. Warehouses full of it, in fact. What we’re really talking about is non-relational vs relational databases. An illustration:
  • #12: Little Data is a misnomer here. In the real world we’re still talking about massive sets of data. Warehouses full of it, in fact. What we’re really talking about
  • #13: But more importantly, Big Data is something else:
  • #14: But more importantly, Big Data is something else:
  • #15: But more importantly, Big Data is something else:
  • #16: But more importantly, Big Data is something else:
  • #17: But more importantly, Big Data is something else:
  • #18: But more importantly, Big Data is something else:
  • #19: But more importantly, Big Data is something else:
  • #20: But more importantly, Big Data is something else:
  • #21: This last one is the important one: Non-structured data left in its original state is infinitely reusable. Instead of dozens or hundreds of silos of information, you can reduce data (and management) duplication for a unified pool of information that can be used for vastly different ends.
  • #22: So how large and complex are we talking about?
  • #23: So how large and complex are we talking about?
  • #24: So how large and complex are we talking about?
  • #25: So how large and complex are we talking about?
  • #28: Gartner’s Hype Cycle – The Peak of Inflated Expectations
  • #33: So how large and complex are we talking about?