BIG DATA ANALYTICS

By Rahul Kulkarni
- Big Data
  - Big Data Players in the Market
  - Hadoop Ecosystems
- Analytics
  - Machine Learning Algorithms
- SMAC
WHAT IS BIG DATA?
"Big Data" is high-volume, high-velocity,
high-variety information assets that demand cost-effective,
innovative forms of information processing for enhanced
insight and decision making.
By 2020, 1.7 MB of new information will be created every second of every day for each human being on the planet.
DATA CONTRIBUTIONS
Personalized for each visitor
HADOOP WAS A KEY PART OF IBM'S WATSON
Hadoop's analytics and data discovery abilities were a
big reason that IBM's Watson computer was able
to win a widely publicized "Jeopardy!" showdown in 2011
against two very successful former human champions.
BIG DATA PLAYERS
EVOLUTION OF HADOOP
Simple models do
better than experts

LET US GET STARTED
AN INSURANCE PROBLEM
Product                    Revenue in last quarter (millions)
Car Insurance              110
Life Insurance             180
Health Insurance           220
2-wheeler Insurance        90
Heavy Vehicle Insurance    100
WHAT WE CANNOT EXPLAIN
FIRST MODEL . . .
- Categorize the data as VEHICLE and NON-VEHICLE insurance.
- The average of vehicle insurance = 100
- The unexplained = (90-100)^2 + (100-100)^2 + (110-100)^2 = 200
- The average of non-vehicle insurance = 200
- The unexplained = (180-200)^2 + (220-200)^2 = 800
R^2
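
The slide names R-squared without giving the calculation. A minimal sketch of how it could be worked out from the revenue figures is shown here, assuming the usual definition (1 minus unexplained variation divided by total variation around the overall mean). The function and variable names are illustrative and do not come from the deck.

```python
# Revenues (in millions) from the insurance table.
revenues = {
    "Car Insurance": 110,
    "Life Insurance": 180,
    "Health Insurance": 220,
    "2-wheeler Insurance": 90,
    "Heavy Vehicle Insurance": 100,
}
# Grouping used by the first model: VEHICLE vs NON-VEHICLE.
vehicle = {"Car Insurance", "2-wheeler Insurance", "Heavy Vehicle Insurance"}


def mean(values):
    values = list(values)
    return sum(values) / len(values)


def sum_sq(values, center):
    """Sum of squared deviations of values from center."""
    return sum((v - center) ** 2 for v in values)


def r_squared(revenues, vehicle):
    all_values = list(revenues.values())
    # Baseline: predict the overall mean for every product.
    total = sum_sq(all_values, mean(all_values))
    # First model: predict each group's own mean.
    veh = [v for k, v in revenues.items() if k in vehicle]
    non = [v for k, v in revenues.items() if k not in vehicle]
    unexplained = sum_sq(veh, mean(veh)) + sum_sq(non, mean(non))
    # Assumed definition: share of total variation the model explains.
    return total, unexplained, 1 - unexplained / total


total, unexplained, r2 = r_squared(revenues, vehicle)
print(total, unexplained, round(r2, 3))
```

Under these assumptions the total variation around the overall mean of 140 is 13000, the two groups leave 200 + 800 = 1000 unexplained, and R-squared comes out at about 0.923.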
Analytics

Machine Learning: Supervised & Unsupervised Learning

Let's get started with two different families of techniques:
(Supervised) - Classification and Regression
(Unsupervised) - Clustering
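
To make the distinction concrete, here is a small sketch rather than anything taken from the deck: a least-squares line fit stands in for supervised learning (it is given labelled x, y pairs), and a one-dimensional two-cluster k-means stands in for clustering (it is given values only). The function names and the regression data are hypothetical; the clustering input reuses the insurance revenues from the earlier slide.

```python
def fit_line(xs, ys):
    """Supervised: least-squares fit of y = a*x + b from labelled pairs."""
    n = len(xs)
    mx, my = sum(xs) / n, sum(ys) / n
    a = (sum((x - mx) * (y - my) for x, y in zip(xs, ys))
         / sum((x - mx) ** 2 for x in xs))
    return a, my - a * mx


def two_means(points, iterations=10):
    """Unsupervised: 1-D k-means with k=2; no labels are supplied.

    Assumes the points contain at least two distinct values.
    """
    lo, hi = min(points), max(points)  # initial cluster centers
    for _ in range(iterations):
        # Assign each point to its nearest center.
        near_lo = [p for p in points if abs(p - lo) <= abs(p - hi)]
        near_hi = [p for p in points if abs(p - lo) > abs(p - hi)]
        # Move each center to the mean of its assigned points.
        lo, hi = sum(near_lo) / len(near_lo), sum(near_hi) / len(near_hi)
    return near_lo, near_hi


# Hypothetical labelled data lying on y = 2x + 1.
a, b = fit_line([1, 2, 3, 4], [3, 5, 7, 9])
print(a, b)

# Insurance revenues (millions), with the product names withheld.
low, high = two_means([110, 180, 220, 90, 100])
print(low, high)
```

On the revenue figures the clustering separates 90, 100 and 110 from 180 and 220, which matches the VEHICLE / NON-VEHICLE split even though no product labels were provided.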
Machine Learning
- Grew out of work in AI
- New capability for computers
Examples:
- Database mining
  Large datasets from growth of automation/web.
  E.g., web click data, medical records, biology, engineering
- Applications we can't program by hand
  E.g., autonomous helicopter, handwriting recognition,
  most of Natural Language Processing (NLP), computer vision
- Self-customizing programs
  E.g., Amazon, Netflix product recommendations
- Understanding human learning (brain, real AI)
SUPERVISED LEARNING
PREDICTION AND FORECASTING
UNSUPERVISED LEARNING
"Consumer data will be the biggest
differentiator in the next two to three years.
Whoever unlocks the reams of data and uses
it strategically will win."
- Angela Ahrendts, CEO of Burberry

Big Data is key to any loyalty scheme.
The Obama 2012 campaign used data analytics and the experimental method
to assemble a winning coalition vote by vote. The interests of individual voters
were known and addressed.
In 2008, online media and web analytics helped Obama beat McCain, changing the
political scene in one of the most powerful nations in the world and
influencing the course of history.
- Obama had 2.5 M Facebook friends, compared to a paltry 0.5 M for McCain
  (it seems strange to think of politicians on Facebook, though).
- Obama raised USD 500 M online, versus a total of USD 201 M raised by McCain.
- Percentage of votes cast for Obama by early voters in Hamilton:
  Model - 57.68%, Actual - 57.16%
- Television commercials aired on TV Land (national cable level):
  Obama campaign - 1,710, Romney campaign - 0
- Money spent on online ads through mid-October:
  Romney campaign - $26 million, Obama campaign - $52 million
SMAC (Social, Mobile, Analytics, Cloud) will be the platform that enables
organizations to drive the consumerization of
technology, including IT. Early adopters of
the SMAC stack will have a clear competitive
edge in their line of business.
Cloud computing is sometimes used as a synonym for
distributed computing over a network:
the ability to run a program
or application on many connected
computers at the same time.
THANK YOU . . . . .
