SlideShare a Scribd company logo
6
Most read
12
Most read
16
Most read
Big Data – Let’s Embrace It!

By Shilpi Sharma




              Nov, 2012


               When Execution Matters   1
                     (Confidential)
Topics Covered
 V3 and Enablers
 3Ts and Challenges
 Use Case: Sales Enablement




                       When Execution Matters   2
                             (Confidential)
Big Data Characteristics – V3



                                                                             A petabyte is one quadrillion bytes,
                                                                             or the equivalent of about 20 million
                                                                             filing cabinets’ worth of text.

                                                                             An exabyte is 1,000 times that
                                                                             amount, or one billion gigabytes.




As of 2012, about 2.5 exabytes of data are created each day, and that number is doubling every 40
                                          months or so.

More data cross the internet every second than were stored in the entire internet just 20 years ago.

                     30Bn pieces of content shared on Facebook every month
                                         When Execution Matters                                                  3
                                               (Confidential)
What is Big Data?




                    When Execution Matters   4
                          (Confidential)
Key Enablers



   When Execution Matters   5
         (Confidential)
Key Enabler – Data Storage




                   When Execution Matters   6
                         (Confidential)
Key Enabler – Computation Capacity




                   When Execution Matters   7
                         (Confidential)
Key Enabler – Data Availability




                     When Execution Matters   8
                           (Confidential)
Key Drivers – Internet of Things & Big Data




           of 2B electricity utility meters
18%        are smart meters
     Intelligent machines fighting
50B+ for bandwidth by 2020

                     When Execution Matters   9
                           (Confidential)
Gartner Emerging Technologies Hype Cycle 2012




            Investments in Big Data
 $5B+       Infrastructure (2009-2011)




                   When Execution Matters       10
                         (Confidential)
Industry Sectors
Rich in Big Data



    When Execution Matters   11
          (Confidential)
Value Potential Across Sectors




  For Hi-Tech Companies, Big Data is
  generated from Value Chain




                    When Execution Matters   12
                          (Confidential)
Readiness Across Sectors




  Information is the only industry that will get
  most value from Big Data with ease.




                   When Execution Matters          13
                         (Confidential)
Big Data – 3Ts
Technologies, Techniques & Talent




            When Execution Matters   14
                  (Confidential)
Big Data Technologies

          Where processing is hosted?
            Distributed Servers/Cloud (e.g. Amazon EC2)

                Where data is stored?
               Distributed Storage (e.g. Hadoop DFS)

          What is programming model?
             Distributed Processing (e.g. MapReduce)

         How data is stored& indexed?
      High-performance schema-free database (e.g. Cassandra)

        What operations are performed?
            Data Analytics, Semantic Processing (e.g. R)


                          When Execution Matters               15
                                (Confidential)
Big Data Techniques
A set of techniques to extract patterns from large datasets by
combining methods from statistics and machine learning with
database management. Few examples:

 Supervised Learning – Support Vector Machine
 Unsupervised learning – Cluster Analysis
 Data fusion – Signal processing, Natural Language
  Processing
 Optimization – Genetic Algorithm, Neural Networks
 Predictive Modeling – Regression, Time Series Analysis




                        When Execution Matters               16
                              (Confidential)
Big Data Talent




                  When Execution Matters   17
                        (Confidential)
Big Data Value Chain
       Aggregate                   Analyze                       Consume
                                                                                  Derive Value
         Data                       Data                           Data
 Data Integration (from    Smart Sampling of             Visualization    Connect the dots
  multiple sources)          Data                                             (Actionable Insights)

 Data harmonization        Finding similar items
  (multi-rate, noisy,
  missing)                  Building Models and
                             incremental updating of
 Data Classification        models




                                     Change Management

                                 Data Policy & Governance

                                  Technology Management




                                            When Execution Matters                                    18
                                                     (Confidential)
Big Data – Management Challenges
   Big data brings the potential for transformation, not the actual
                           transformation



                                                    Change
  Decision Making
                                                  Management


                                                    Clash of
 Shortage of Skills
                                                  Technologies

                           When Execution Matters                     19
                                 (Confidential)
Food for Thought




     When Execution Matters   20
           (Confidential)
Take an example – A Client Meeting
Types of Data:
    Internal Information: Company, Presentations, collateral, pricing, contracts
    Personal Information: Territory assignment, Goal Attainment, Past
     interactions with customer
    External Information: Company, People, Competition, Market

Data Sources:
Suddenly you are going from a few office documents to hundreds of files and
channels that are being continually updated.
    Static like a webpage, personal profile, competitive cheat sheet
    Dynamic like a YouTube channel demonstrating a competitor’s product, a
     blog reviewing an announcement, or twitter channel




                               When Execution Matters                           21
                                     (Confidential)
Some Facts
     $135,262 – Average support costs per year for each
      salesperson

     7 hours/week - Average salesperson spends looking for
      relevant information to prepare for sales calls

     50% of the information is pushed through email; only 10%
      is made available in a useful format




Source: Forrester Research & IDC Sales Advisory Service

                                                          When Execution Matters   22
                                                                (Confidential)
Big Data Application
 Connect the dots across internal and external data for sales
  professional
    What has been sold at client? How it has been working?
    Where the industry is moving? What are top challenges for
     the decision makers? How does it connect to product portfolio
     you are selling?
    What has been the buying pattern at client?
    Any new insights based on Install Base?


 Win More Deals, Increase Productivity, Sell Smarter



                          When Execution Matters                 23
                                (Confidential)
When Execution Matters   24
      (Confidential)

More Related Content

PPT
block ciphers
PPTX
File concept and access method
PPTX
Classification techniques in data mining
PPT
Synchronization in distributed systems
PPTX
OLAP & DATA WAREHOUSE
PPT
12. Indexing and Hashing in DBMS
PPT
OLAP
PDF
Run time storage
block ciphers
File concept and access method
Classification techniques in data mining
Synchronization in distributed systems
OLAP & DATA WAREHOUSE
12. Indexing and Hashing in DBMS
OLAP
Run time storage

What's hot (20)

PPT
process creation OS
PDF
Data in Motion vs Data at Rest
PPTX
Trusted systems
PDF
Data Mining & Data Warehousing Lecture Notes
PPTX
Multilayer & Back propagation algorithm
PDF
Block Ciphers and the Data Encryption Standard
PPTX
Data warehouse
PDF
Data warehouse architecture
PPTX
Developing a Map Reduce Application
PPTX
Data Mining: Application and trends in data mining
PPTX
Multilayer perceptron
PPTX
Chapter1: NoSQL: It’s about making intelligent choices
PPTX
Distributed System - Security
PPTX
connecting smart object in IoT.pptx
PDF
Presentation On NoSQL Databases
PDF
Network security - OSI Security Architecture
PPTX
Ooad unit – 1 introduction
PPT
Message Authentication
PPTX
file system in operating system
PPT
Data warehouse
process creation OS
Data in Motion vs Data at Rest
Trusted systems
Data Mining & Data Warehousing Lecture Notes
Multilayer & Back propagation algorithm
Block Ciphers and the Data Encryption Standard
Data warehouse
Data warehouse architecture
Developing a Map Reduce Application
Data Mining: Application and trends in data mining
Multilayer perceptron
Chapter1: NoSQL: It’s about making intelligent choices
Distributed System - Security
connecting smart object in IoT.pptx
Presentation On NoSQL Databases
Network security - OSI Security Architecture
Ooad unit – 1 introduction
Message Authentication
file system in operating system
Data warehouse
Ad

Viewers also liked (20)

PPTX
The big data value chain r1-31 oct13
PPTX
Big data characteristics, value chain and challenges
PDF
Core concepts and Key technologies - Big Data Analytics
PPTX
Big data ppt
PPTX
What is Big Data?
PDF
Trivadis TechEvent 2016 DWH Modernization – in the Age of Big Data by Gregor ...
PDF
Big data: current technology scope.
PDF
Big data technologies and Hadoop infrastructure
PDF
Apache Spark Briefing
PPTX
Turning Big Data into More Effective Customer Experiences
PPT
Distributed File Systems
PPTX
Big Data Analytics V2
PDF
Distributed File Systems: An Overview
PDF
Introduction to distributed file systems
PPT
Chapter 8 distributed file systems
PPT
Chapter 17 - Distributed File Systems
PDF
The Ecosystem is too damn big
PPTX
structured and unstructured interview
ODP
Distributed File System
 
PDF
BigData_Chp2: Hadoop & Map-Reduce
The big data value chain r1-31 oct13
Big data characteristics, value chain and challenges
Core concepts and Key technologies - Big Data Analytics
Big data ppt
What is Big Data?
Trivadis TechEvent 2016 DWH Modernization – in the Age of Big Data by Gregor ...
Big data: current technology scope.
Big data technologies and Hadoop infrastructure
Apache Spark Briefing
Turning Big Data into More Effective Customer Experiences
Distributed File Systems
Big Data Analytics V2
Distributed File Systems: An Overview
Introduction to distributed file systems
Chapter 8 distributed file systems
Chapter 17 - Distributed File Systems
The Ecosystem is too damn big
structured and unstructured interview
Distributed File System
 
BigData_Chp2: Hadoop & Map-Reduce
Ad

Similar to Big data - Key Enablers, Drivers & Challenges (20)

PDF
EDF2012 Wolfgang Nimfuehr - Bringing Big Data to the Enterprise
PPTX
OWF12/Java Michael hirt
PDF
Evaluating Big Data Predictive Analytics Platforms
PDF
Business with Big data
PDF
Opening keynote gianni cooreman
PDF
Analyze This! Best Practices For Big And Fast Data
 
PPTX
Webinar | Using Hadoop Analytics to Gain a Big Data Advantage
PDF
Turning Big Data to Business Advantage
PPTX
Modernizing Your IT Infrastructure with Hadoop - Cloudera Summer Webinar Seri...
KEY
Exploring Big Data value for your business
PDF
Big Data is Here for Financial Services White Paper
PPTX
Big data and its big opportunity
PPTX
Big Data is on a Collision Course With Your Network - Are You Ready?
PPTX
Big Data’s Big Impact on Businesses
PDF
Hortonworks roadshow
PDF
Telco Big Data Workshop Sample
PDF
Report: CIOs & Big Data
PPTX
Big Data Is Here - Now What?
PPTX
Big Data in Business Application use case and benefits
PDF
The Big Deal About Big Data For Customer Engagement
EDF2012 Wolfgang Nimfuehr - Bringing Big Data to the Enterprise
OWF12/Java Michael hirt
Evaluating Big Data Predictive Analytics Platforms
Business with Big data
Opening keynote gianni cooreman
Analyze This! Best Practices For Big And Fast Data
 
Webinar | Using Hadoop Analytics to Gain a Big Data Advantage
Turning Big Data to Business Advantage
Modernizing Your IT Infrastructure with Hadoop - Cloudera Summer Webinar Seri...
Exploring Big Data value for your business
Big Data is Here for Financial Services White Paper
Big data and its big opportunity
Big Data is on a Collision Course With Your Network - Are You Ready?
Big Data’s Big Impact on Businesses
Hortonworks roadshow
Telco Big Data Workshop Sample
Report: CIOs & Big Data
Big Data Is Here - Now What?
Big Data in Business Application use case and benefits
The Big Deal About Big Data For Customer Engagement

Recently uploaded (20)

PDF
Stem Cell Market Report | Trends, Growth & Forecast 2025-2034
DOCX
unit 1 COST ACCOUNTING AND COST SHEET
PDF
Nidhal Samdaie CV - International Business Consultant
PPT
Chapter four Project-Preparation material
PDF
IFRS Notes in your pocket for study all the time
PDF
WRN_Investor_Presentation_August 2025.pdf
PDF
Power and position in leadershipDOC-20250808-WA0011..pdf
PDF
Katrina Stoneking: Shaking Up the Alcohol Beverage Industry
PDF
Types of control:Qualitative vs Quantitative
PPTX
Belch_12e_PPT_Ch18_Accessible_university.pptx
PDF
Dr. Enrique Segura Ense Group - A Self-Made Entrepreneur And Executive
PPTX
CkgxkgxydkydyldylydlydyldlyddolydyoyyU2.pptx
PDF
Laughter Yoga Basic Learning Workshop Manual
PDF
DOC-20250806-WA0002._20250806_112011_0000.pdf
PDF
Business model innovation report 2022.pdf
PPTX
The Marketing Journey - Tracey Phillips - Marketing Matters 7-2025.pptx
PDF
Chapter 5_Foreign Exchange Market in .pdf
PDF
Elevate Cleaning Efficiency Using Tallfly Hair Remover Roller Factory Expertise
PPT
Data mining for business intelligence ch04 sharda
PPTX
HR Introduction Slide (1).pptx on hr intro
Stem Cell Market Report | Trends, Growth & Forecast 2025-2034
unit 1 COST ACCOUNTING AND COST SHEET
Nidhal Samdaie CV - International Business Consultant
Chapter four Project-Preparation material
IFRS Notes in your pocket for study all the time
WRN_Investor_Presentation_August 2025.pdf
Power and position in leadershipDOC-20250808-WA0011..pdf
Katrina Stoneking: Shaking Up the Alcohol Beverage Industry
Types of control:Qualitative vs Quantitative
Belch_12e_PPT_Ch18_Accessible_university.pptx
Dr. Enrique Segura Ense Group - A Self-Made Entrepreneur And Executive
CkgxkgxydkydyldylydlydyldlyddolydyoyyU2.pptx
Laughter Yoga Basic Learning Workshop Manual
DOC-20250806-WA0002._20250806_112011_0000.pdf
Business model innovation report 2022.pdf
The Marketing Journey - Tracey Phillips - Marketing Matters 7-2025.pptx
Chapter 5_Foreign Exchange Market in .pdf
Elevate Cleaning Efficiency Using Tallfly Hair Remover Roller Factory Expertise
Data mining for business intelligence ch04 sharda
HR Introduction Slide (1).pptx on hr intro

Big data - Key Enablers, Drivers & Challenges

  • 1. Big Data – Let’s Embrace It! By Shilpi Sharma Nov, 2012 When Execution Matters 1 (Confidential)
  • 2. Topics Covered  V3 and Enablers  3Ts and Challenges  Use Case: Sales Enablement When Execution Matters 2 (Confidential)
  • 3. Big Data Characteristics – V3 A petabyte is one quadrillion bytes, or the equivalent of about 20 million filing cabinets’ worth of text. An exabyte is 1,000 times that amount, or one billion gigabytes. As of 2012, about 2.5 exabytes of data are created each day, and that number is doubling every 40 months or so. More data cross the internet every second than were stored in the entire internet just 20 years ago. 30Bn pieces of content shared on Facebook every month When Execution Matters 3 (Confidential)
  • 4. What is Big Data? When Execution Matters 4 (Confidential)
  • 5. Key Enablers When Execution Matters 5 (Confidential)
  • 6. Key Enabler – Data Storage When Execution Matters 6 (Confidential)
  • 7. Key Enabler – Computation Capacity When Execution Matters 7 (Confidential)
  • 8. Key Enabler – Data Availability When Execution Matters 8 (Confidential)
  • 9. Key Drivers – Internet of Things & Big Data of 2B electricity utility meters 18% are smart meters Intelligent machines fighting 50B+ for bandwidth by 2020 When Execution Matters 9 (Confidential)
  • 10. Gartner Emerging Technologies Hype Cycle 2012 Investments in Big Data $5B+ Infrastructure (2009-2011) When Execution Matters 10 (Confidential)
  • 11. Industry Sectors Rich in Big Data When Execution Matters 11 (Confidential)
  • 12. Value Potential Across Sectors For Hi-Tech Companies, Big Data is generated from Value Chain When Execution Matters 12 (Confidential)
  • 13. Readiness Across Sectors Information is the only industry that will get most value from Big Data with ease. When Execution Matters 13 (Confidential)
  • 14. Big Data – 3Ts Technologies, Techniques & Talent When Execution Matters 14 (Confidential)
  • 15. Big Data Technologies Where processing is hosted? Distributed Servers/Cloud (e.g. Amazon EC2) Where data is stored? Distributed Storage (e.g. Hadoop DFS) What is programming model? Distributed Processing (e.g. MapReduce) How data is stored& indexed? High-performance schema-free database (e.g. Cassandra) What operations are performed? Data Analytics, Semantic Processing (e.g. R) When Execution Matters 15 (Confidential)
  • 16. Big Data Techniques A set of techniques to extract patterns from large datasets by combining methods from statistics and machine learning with database management. Few examples:  Supervised Learning – Support Vector Machine  Unsupervised learning – Cluster Analysis  Data fusion – Signal processing, Natural Language Processing  Optimization – Genetic Algorithm, Neural Networks  Predictive Modeling – Regression, Time Series Analysis When Execution Matters 16 (Confidential)
  • 17. Big Data Talent When Execution Matters 17 (Confidential)
  • 18. Big Data Value Chain Aggregate Analyze Consume Derive Value Data Data Data  Data Integration (from  Smart Sampling of  Visualization  Connect the dots multiple sources) Data (Actionable Insights)  Data harmonization  Finding similar items (multi-rate, noisy, missing)  Building Models and incremental updating of  Data Classification models Change Management Data Policy & Governance Technology Management When Execution Matters 18 (Confidential)
  • 19. Big Data – Management Challenges Big data brings the potential for transformation, not the actual transformation Change Decision Making Management Clash of Shortage of Skills Technologies When Execution Matters 19 (Confidential)
  • 20. Food for Thought When Execution Matters 20 (Confidential)
  • 21. Take an example – A Client Meeting Types of Data:  Internal Information: Company, Presentations, collateral, pricing, contracts  Personal Information: Territory assignment, Goal Attainment, Past interactions with customer  External Information: Company, People, Competition, Market Data Sources: Suddenly you are going from a few office documents to hundreds of files and channels that are being continually updated.  Static like a webpage, personal profile, competitive cheat sheet  Dynamic like a YouTube channel demonstrating a competitor’s product, a blog reviewing an announcement, or twitter channel When Execution Matters 21 (Confidential)
  • 22. Some Facts  $135,262 – Average support costs per year for each salesperson  7 hours/week - Average salesperson spends looking for relevant information to prepare for sales calls  50% of the information is pushed through email; only 10% is made available in a useful format Source: Forrester Research & IDC Sales Advisory Service When Execution Matters 22 (Confidential)
  • 23. Big Data Application  Connect the dots across internal and external data for sales professional  What has been sold at client? How it has been working?  Where the industry is moving? What are top challenges for the decision makers? How does it connect to product portfolio you are selling?  What has been the buying pattern at client?  Any new insights based on Install Base? Win More Deals, Increase Productivity, Sell Smarter When Execution Matters 23 (Confidential)
  • 24. When Execution Matters 24 (Confidential)

Editor's Notes

  • #4: Many of the most important sources of big data are relatively new. Facebook was launched in 2004, Twitter in 2006. Smartphones and the other mobile devices iPhone was unveiled only five years ago, and the iPad in 2010.
  • #5: Highly structured data in these systems is typically stored in SQL databasesObservational data tends to come from the ‘Internet of things”Interactions are about how people and things interact with each other or with your business.Web Logs, User Click Streams, Social Interactions & Feeds, and User-Generated Content are classic places to find Interaction data
  • #10: According to Pike Research, in 2008 a mere 4% of the planet’s 1.5 billion electric utility meters were smart meters; today that has jumped to 18% of electric meters installed.Vehicle-to-vehicle (V2V) communications is also rapidly emerging as another M2M market. Currently, the U.S. Department of Transportation is working with the University of Michigan to test V2V systems on 3,000 vehicles. Logistics companies with more than 9 million vehicles in the U.S. are watching the results of that study carefully because of the promised savings in V2V operating costs.2020 researchers expect there to be more than 6 billion wireless subscribers using smartphones. However, Swedish communications giant Ericsson predicts that there will be over 50 billion intelligent machines fighting for bandwidth by then.Undoubtedly, networks will be faster in eight years. I expect there will be improvements in compression techniques and methods to limit M2M interactivity as well as other ways to boost network performance and capacity. Pricing will be another way carriers will be able to manage network loads.
  • #11: http://www.itworld.com/it-managementstrategy/289534/big-data-startups-lure-investment-dollars
  • #16: Computing and storage are typically hosted transparently on cloud infra- provide scale, flexibility and high fail-safety (reduced upfront cost)Distrbuted processing of Big-data requires non-standard programming models – beyong single machines or traditional parallel programming models (like MPI)..aim is to simplify complex programming tasksNoSQL databases support large amount of data cost effectively, flexible and fast (no predefined schema needed) and operate on distributed infra
  • #17: Computing and storage are typically hosted transparently on cloud infra- provide scale, flexibility and high fail-safety (reduced upfront cost)Distrbuted processing of Big-data requires non-standard programming models – beyong single machines or traditional parallel programming models (like MPI)..aim is to simplify complex programming tasksNoSQL databases support large amount of data cost effectively, flexible and fast (no predefined schema needed) and operate on distributed infra
  • #22: BenefitsIncreasing Personal and Team ProductivityDecreasing Ramp TimeIncreasing Win Rate
  • #23: http://www.sambacloud.com/what-is-samba-cloudA typical sales professional is mostly travelling, coordinating via email and work on different, out-of-date versions of content on their laptops or tablet’s cloud drives. The deal history is spread across everyone’s local version.