SlideShare a Scribd company logo
The Warranty Data Lake – After, Inc.
The Warranty Data Lake
How Big Data and Data Science
Can Drive Program improvements
March 2016
Richard Vermillion, CEO of After, Inc. and Fulcrum Analytics, Inc.
Who is After, Inc.?
Just What Is a “Data Lake”?
Why a Warranty Data Lake?
Warranty Data Science
•  Customer Data
•  Claims Data
•  Spatial Data
•  Telematics
•  Parts/Supplier/Lot Data
•  Shipment, Engineering, Dealer Data
What does it all mean?
Agenda
Founded in 2005 as a division of Fulcrum Analytics, we leverage our
dual-expertise in data analytics and consumer marketing to help
companies establish world-class warranty businesses.
We help manage program risk and optimize sales and marketing
programs from start to finish — absorbing almost all of the
management and technology burden.
As your partner, we are singularly committed to creating the greatest
possible value for you and your customers.
Who is After, Inc.?
What is a Data Lake?
TRADITIONAL REPOSITORIES
Large, well-organized data warehouses
That deliver some data to retail data marts
Much larger, more loosely organized lakes
That you can sample at will
DATA LAKES
What is a Data Lake?
A ”Data Lake” is a large storage repository and processing engine
Data Lakes are built on a “Big Data” platform like the Hadoop stack
•  Volume
•  Velocity
•  Variety
Focus on variety of data they hold than just volume or velocity:
•  Flat files
•  Relational
•  Hierarchical
•  Unstructured, free-form text
•  Graph/network
•  Real-time
Also distinguished by how they are built and loaded
•  Focus on agility
•  Iterative development to meet changing business needs
•  Support innovation
•  Destroy silos
Not an agile process
Data Warehouse or Mart
INNOVATION KILLER
Generate
Business &
Technical
Requirements
Design
Conformed
Data Model
Develop ETL
for Each Data
Source
Build Reports
& Connect BI
Tools
Too Limited
• Drop fields that don’t fit
• Aggregate away detail
• Only answer questions
you anticipated
Too Slow
• Waste time fully
understanding each data
source
• Business requirements
change by the time it’s
built
Too Expensive
• Costly commercial
hardware/software
• Especially if you aren’t
sure each data source is
valuable
Supports an agile process
Data Lake
INNOVATION ENABLER
Load Data
As-Is
Conform and
Model What
You Need
Iterate ELT for
Each Data
Source
Access All
Data Types
with One
Toolset
Not Limited
• Keep all the fields in their
original format
• Keep the detail data
• Pose and answer
questions you never
anticipated
Not Slow
• Easy to get started, just
copy raw data in
• Incremental changes as
business requirements
change
Not Expensive
• Cheap storage, open
source tools
• Defer hard work until you
have a use for the data
Build to optimize
warranty, extend to
impact other areas
Why a Warranty Data Lake?
Warranty Chain is unusual, if not unique, in how many areas of the
business it touches:
•  Customer – sales, usage, service, satisfaction
•  Engineering – quality, reliability, design
•  Service – repairs, claims
•  Distribution – dealers, retailers
•  Finance – insurance, reserves, credit
Spans front-office and back-office
Many relevant, but siloed, data sets
Large opportunities for analytical improvements
Large (usage) data sets on the horizon
By collecting multiple
types of data, we can
reduce reliance on
lagging indicators
(e.g., claims paid) and also identify,
or develop new leading indicators
for program performance.
Why a Warranty Data Lake?
Key analytical questions a warranty data lake can help answer:
•  Claim reserve and loss modeling
•  Early warning for losses
•  Claim & service triage optimization, “no fault found”
minimization
•  Fraud and/or service anomaly detection
•  Customer satisfaction warning signs
•  Sales & marketing effectiveness
•  Field usage and relation to service events
•  Recall cost estimation
Unstructured Data &
Text
Claim notes include more details than codes… but less structure
Codes impose structure and simplify analysis
•  But only for problems you anticipated and created codes for
Text mining and natural language processing can unlock the secrets in
claim and repair notes
Visualization techniques such as word trees and clouds enable free text
exploration
Unstructured Data &
Text
Tweets, product reviews and other user-generated content also can
be collected and analyzed
Sentiment analysis, topic modeling, keyword & semantic analysis
Bayesian classifiers (like your spam filter) can be trained to spot
quality complaints and failure reports
Possible early warning of service problems
Spatial Data
Spatial data (precipitation, temperature, crop production,
land use, etc.) can be a proxy for usage
For example, snow accumulationData spatially correlated with claims
Can be predictive of frequency
Usage Data &
Telematics
The real ‘big data’ of warranty (high volume, high velocity)
Usage data, telematics, “Internet of Things”, sensor data
Real-time streaming or collected at service event
Tracking Device (GPS + Altimeter + Accelerometer) with post-
processing can identify:
•  Mileage and usage time
•  Hard driving and hard braking incidents
•  Inefficient shifting
•  Idle time
•  Geography and location of usage
•  Altitude changes
More accurately map usage cycles to calendar time (relevant for
warranty)
Segment light vs. heavy usage and model differing expected losses
• 
Graph Data
Graph data models the world as nodes connected by edges
Developed for web search (pages connected by hyperlinks) and
social network analysis (users connected by friendship)
Can capture the complex relationships between parts, assemblies
and sub-assemblies
Store part explosions and proximity natively as directed graphs
Capture failure sets of parts serviced together (causal or not)
Customer and
Marketing Data
Product registration data
Demographic & third-party data
Marketing event data
Promotion participation
Direct mail participation and response
Email sends, opens, click-through
Customer service calls and dispositions
Clickstream data
Mobile app engagement
Consumer research and survey responses
Price and messaging test results
Other Useful Data Claim Workflow Details
Unit Shipment Data
Inventory Data
Dealer Information (locations, ownership, firmographics, etc.)
Retail Unit Sales, Pricing and Discounting
Damaged Product Images (especially for ADH claims)
ESP Terms & Conditions (often unstructured).
What does it all mean? A Data Lake provides a repository and processing engine to gather
all relevant data for analysis
Handling variety is the key to breaking down data silos
A Warranty Data Lake can bring together front-office and back-
office data to optimize program performance and improve loss
projections
No one is going to build a warranty data lake with every data
source discussed, but at After, we see companies beginning to
build – and incrementally add to – starter data lakes
Create the corporate data asset to drive 21st century program
improvement.
Richard Vermillion, CEO
After, Inc. + Fulcrum Analytics, Inc.
https://www.linkedin.com/in/rvermillion
rvermillion@afterinc.com
www.afterinc.com
800.374.4728 212.651.7000
70 West 40th Street, 10th Floor
New York, NY 10018
afterinc.com
Learn more

More Related Content

PPTX
Big data architectures and the data lake
PDF
Modern Data Architecture for a Data Lake with Informatica and Hortonworks Dat...
PDF
Artur Fejklowicz - “Data Lake architecture” AI&BigDataDay 2017
PDF
Hadoop data-lake-white-paper
PPSX
How to use Big Data and Data Lake concept in business using Hadoop and Spark...
PDF
Planing and optimizing data lake architecture
PPTX
Introduction to Microsoft’s Hadoop solution (HDInsight)
PPTX
Anatomy of a data driven architecture - Tamir Dresher
Big data architectures and the data lake
Modern Data Architecture for a Data Lake with Informatica and Hortonworks Dat...
Artur Fejklowicz - “Data Lake architecture” AI&BigDataDay 2017
Hadoop data-lake-white-paper
How to use Big Data and Data Lake concept in business using Hadoop and Spark...
Planing and optimizing data lake architecture
Introduction to Microsoft’s Hadoop solution (HDInsight)
Anatomy of a data driven architecture - Tamir Dresher

What's hot (20)

PPTX
Designing modern dw and data lake
PDF
5 Steps for Architecting a Data Lake
PDF
Data lake benefits
PDF
Big Data: Architecture and Performance Considerations in Logical Data Lakes
PPTX
Eugene Polonichko "Azure Data Lake: what is it? why is it? where is it?"
PDF
The Data Lake and Getting Buisnesses the Big Data Insights They Need
PDF
Incorporating the Data Lake into Your Analytic Architecture
PDF
Data lake
PPTX
Microsoft Data Platform - What's included
PPTX
Microsoft Azure Big Data Analytics
PPTX
Should I move my database to the cloud?
PDF
Enterprise Data Lake - Scalable Digital
PDF
Designing the Next Generation Data Lake
PPTX
Power BI for Big Data and the New Look of Big Data Solutions
PPTX
How to build a successful Data Lake
PDF
Alexandre Vasseur - Evolution of Data Architectures: From Hadoop to Data Lake...
PDF
Building a Data Lake - An App Dev's Perspective
PPTX
Why Data Lake should be the foundation of Enterprise Data Architecture
PDF
Creating a Next-Generation Big Data Architecture
PPTX
Big Data Analytics in the Cloud with Microsoft Azure
Designing modern dw and data lake
5 Steps for Architecting a Data Lake
Data lake benefits
Big Data: Architecture and Performance Considerations in Logical Data Lakes
Eugene Polonichko "Azure Data Lake: what is it? why is it? where is it?"
The Data Lake and Getting Buisnesses the Big Data Insights They Need
Incorporating the Data Lake into Your Analytic Architecture
Data lake
Microsoft Data Platform - What's included
Microsoft Azure Big Data Analytics
Should I move my database to the cloud?
Enterprise Data Lake - Scalable Digital
Designing the Next Generation Data Lake
Power BI for Big Data and the New Look of Big Data Solutions
How to build a successful Data Lake
Alexandre Vasseur - Evolution of Data Architectures: From Hadoop to Data Lake...
Building a Data Lake - An App Dev's Perspective
Why Data Lake should be the foundation of Enterprise Data Architecture
Creating a Next-Generation Big Data Architecture
Big Data Analytics in the Cloud with Microsoft Azure
Ad

Viewers also liked (17)

PDF
Best Practices For Building and Operating A Managed Data Lake - StampedeCon 2016
PDF
AddReality company overview
PPTX
Automate Hadoop Cluster Deployment in a Banking Ecosystem
PDF
Citizens Bank: Data Lake Implementation – Selecting BigInsights ViON Spark/Ha...
PDF
Luxury 3.0- a new Retail Scenario for Product Mass Customization and On Deman...
PDF
Data Lake,beyond the Data Warehouse
PPTX
Reducing Microservice Complexity with Kafka and Reactive Streams
PPT
Oracle Retail
PDF
Business Data Lake Best Practices
PDF
Retail 2.0 Strategy - Perfect Store PDF
PDF
The Emerging Data Lake IT Strategy
PDF
Implementing a Data Lake with Enterprise Grade Data Governance
PDF
Creating a Modern Data Architecture for Digital Transformation
PDF
The Rise of Microservices
PDF
Southwest Power Pool big data case study
PDF
Constant Contact: An Online Marketing Leader’s Data Lake Journey
PDF
Retail Industry Enterprise Architecture Review
Best Practices For Building and Operating A Managed Data Lake - StampedeCon 2016
AddReality company overview
Automate Hadoop Cluster Deployment in a Banking Ecosystem
Citizens Bank: Data Lake Implementation – Selecting BigInsights ViON Spark/Ha...
Luxury 3.0- a new Retail Scenario for Product Mass Customization and On Deman...
Data Lake,beyond the Data Warehouse
Reducing Microservice Complexity with Kafka and Reactive Streams
Oracle Retail
Business Data Lake Best Practices
Retail 2.0 Strategy - Perfect Store PDF
The Emerging Data Lake IT Strategy
Implementing a Data Lake with Enterprise Grade Data Governance
Creating a Modern Data Architecture for Digital Transformation
The Rise of Microservices
Southwest Power Pool big data case study
Constant Contact: An Online Marketing Leader’s Data Lake Journey
Retail Industry Enterprise Architecture Review
Ad

Similar to The Warranty Data Lake – After, Inc. (20)

PDF
Semantic 'Radar' Steers Users to Insights in the Data Lake
PPTX
Webinar - Data Lake Management: Extending Storage and Lifecycle of Data
PDF
Semantic 'Radar' Steers Users to Insights in the Data Lake
PDF
Warranty Fraud Detection
PDF
GE’s Industrial Data Lake Platform
PDF
Modern Data Challenges require Modern Graph Technology
PDF
Platforming the Major Analytic Use Cases for Modern Engineering
PDF
Whitepaper-The-Data-Lake-3_0
PDF
Unlocking-Business-Value-Through-Industrial-Data-Management-whitepaper.pdf
PDF
Are You Killing the Benefits of Your Data Lake?
PDF
Harnessing Big Data_UCLA
PPT
Ruby, rails, no sql and big data
PDF
The New Model
PDF
Data lakehouse fallacies
PPT
Big Data = Big Decisions
PPTX
Bridge the Gap - Connecting Business Applications and Devices / Microsoft Sin...
PDF
Mobile, Wearables, Big Data and A Strategy to Move Forward (with NTT Data Ent...
PPTX
Data-As-A-Service to enable compliance reporting
PDF
Big Data Usecases
PPTX
Managing Data To Drive Competitive Advantage
Semantic 'Radar' Steers Users to Insights in the Data Lake
Webinar - Data Lake Management: Extending Storage and Lifecycle of Data
Semantic 'Radar' Steers Users to Insights in the Data Lake
Warranty Fraud Detection
GE’s Industrial Data Lake Platform
Modern Data Challenges require Modern Graph Technology
Platforming the Major Analytic Use Cases for Modern Engineering
Whitepaper-The-Data-Lake-3_0
Unlocking-Business-Value-Through-Industrial-Data-Management-whitepaper.pdf
Are You Killing the Benefits of Your Data Lake?
Harnessing Big Data_UCLA
Ruby, rails, no sql and big data
The New Model
Data lakehouse fallacies
Big Data = Big Decisions
Bridge the Gap - Connecting Business Applications and Devices / Microsoft Sin...
Mobile, Wearables, Big Data and A Strategy to Move Forward (with NTT Data Ent...
Data-As-A-Service to enable compliance reporting
Big Data Usecases
Managing Data To Drive Competitive Advantage

Recently uploaded (20)

PPTX
IB Computer Science - Internal Assessment.pptx
PPTX
climate analysis of Dhaka ,Banglades.pptx
PDF
TRAFFIC-MANAGEMENT-AND-ACCIDENT-INVESTIGATION-WITH-DRIVING-PDF-FILE.pdf
PPT
Chapter 2 METAL FORMINGhhhhhhhjjjjmmmmmmmmm
PPTX
Global journeys: estimating international migration
PPTX
Database Infoormation System (DBIS).pptx
PPTX
The THESIS FINAL-DEFENSE-PRESENTATION.pptx
PPTX
Moving the Public Sector (Government) to a Digital Adoption
PDF
Launch Your Data Science Career in Kochi – 2025
PDF
Mega Projects Data Mega Projects Data
PPTX
ALIMENTARY AND BILIARY CONDITIONS 3-1.pptx
PPTX
STUDY DESIGN details- Lt Col Maksud (21).pptx
PPTX
Business Acumen Training GuidePresentation.pptx
PPTX
iec ppt-1 pptx icmr ppt on rehabilitation.pptx
PPT
Miokarditis (Inflamasi pada Otot Jantung)
PPT
Quality review (1)_presentation of this 21
PDF
Galatica Smart Energy Infrastructure Startup Pitch Deck
PPTX
Introduction to Firewall Analytics - Interfirewall and Transfirewall.pptx
PPTX
MODULE 8 - DISASTER risk PREPAREDNESS.pptx
PDF
Lecture1 pattern recognition............
IB Computer Science - Internal Assessment.pptx
climate analysis of Dhaka ,Banglades.pptx
TRAFFIC-MANAGEMENT-AND-ACCIDENT-INVESTIGATION-WITH-DRIVING-PDF-FILE.pdf
Chapter 2 METAL FORMINGhhhhhhhjjjjmmmmmmmmm
Global journeys: estimating international migration
Database Infoormation System (DBIS).pptx
The THESIS FINAL-DEFENSE-PRESENTATION.pptx
Moving the Public Sector (Government) to a Digital Adoption
Launch Your Data Science Career in Kochi – 2025
Mega Projects Data Mega Projects Data
ALIMENTARY AND BILIARY CONDITIONS 3-1.pptx
STUDY DESIGN details- Lt Col Maksud (21).pptx
Business Acumen Training GuidePresentation.pptx
iec ppt-1 pptx icmr ppt on rehabilitation.pptx
Miokarditis (Inflamasi pada Otot Jantung)
Quality review (1)_presentation of this 21
Galatica Smart Energy Infrastructure Startup Pitch Deck
Introduction to Firewall Analytics - Interfirewall and Transfirewall.pptx
MODULE 8 - DISASTER risk PREPAREDNESS.pptx
Lecture1 pattern recognition............

The Warranty Data Lake – After, Inc.

  • 2. The Warranty Data Lake How Big Data and Data Science Can Drive Program improvements March 2016 Richard Vermillion, CEO of After, Inc. and Fulcrum Analytics, Inc.
  • 3. Who is After, Inc.? Just What Is a “Data Lake”? Why a Warranty Data Lake? Warranty Data Science •  Customer Data •  Claims Data •  Spatial Data •  Telematics •  Parts/Supplier/Lot Data •  Shipment, Engineering, Dealer Data What does it all mean? Agenda
  • 4. Founded in 2005 as a division of Fulcrum Analytics, we leverage our dual-expertise in data analytics and consumer marketing to help companies establish world-class warranty businesses. We help manage program risk and optimize sales and marketing programs from start to finish — absorbing almost all of the management and technology burden. As your partner, we are singularly committed to creating the greatest possible value for you and your customers. Who is After, Inc.?
  • 5. What is a Data Lake? TRADITIONAL REPOSITORIES Large, well-organized data warehouses That deliver some data to retail data marts Much larger, more loosely organized lakes That you can sample at will DATA LAKES
  • 6. What is a Data Lake? A ”Data Lake” is a large storage repository and processing engine Data Lakes are built on a “Big Data” platform like the Hadoop stack •  Volume •  Velocity •  Variety Focus on variety of data they hold than just volume or velocity: •  Flat files •  Relational •  Hierarchical •  Unstructured, free-form text •  Graph/network •  Real-time Also distinguished by how they are built and loaded •  Focus on agility •  Iterative development to meet changing business needs •  Support innovation •  Destroy silos
  • 7. Not an agile process Data Warehouse or Mart INNOVATION KILLER Generate Business & Technical Requirements Design Conformed Data Model Develop ETL for Each Data Source Build Reports & Connect BI Tools Too Limited • Drop fields that don’t fit • Aggregate away detail • Only answer questions you anticipated Too Slow • Waste time fully understanding each data source • Business requirements change by the time it’s built Too Expensive • Costly commercial hardware/software • Especially if you aren’t sure each data source is valuable
  • 8. Supports an agile process Data Lake INNOVATION ENABLER Load Data As-Is Conform and Model What You Need Iterate ELT for Each Data Source Access All Data Types with One Toolset Not Limited • Keep all the fields in their original format • Keep the detail data • Pose and answer questions you never anticipated Not Slow • Easy to get started, just copy raw data in • Incremental changes as business requirements change Not Expensive • Cheap storage, open source tools • Defer hard work until you have a use for the data
  • 9. Build to optimize warranty, extend to impact other areas Why a Warranty Data Lake? Warranty Chain is unusual, if not unique, in how many areas of the business it touches: •  Customer – sales, usage, service, satisfaction •  Engineering – quality, reliability, design •  Service – repairs, claims •  Distribution – dealers, retailers •  Finance – insurance, reserves, credit Spans front-office and back-office Many relevant, but siloed, data sets Large opportunities for analytical improvements Large (usage) data sets on the horizon
  • 10. By collecting multiple types of data, we can reduce reliance on lagging indicators (e.g., claims paid) and also identify, or develop new leading indicators for program performance. Why a Warranty Data Lake? Key analytical questions a warranty data lake can help answer: •  Claim reserve and loss modeling •  Early warning for losses •  Claim & service triage optimization, “no fault found” minimization •  Fraud and/or service anomaly detection •  Customer satisfaction warning signs •  Sales & marketing effectiveness •  Field usage and relation to service events •  Recall cost estimation
  • 11. Unstructured Data & Text Claim notes include more details than codes… but less structure Codes impose structure and simplify analysis •  But only for problems you anticipated and created codes for Text mining and natural language processing can unlock the secrets in claim and repair notes Visualization techniques such as word trees and clouds enable free text exploration
  • 12. Unstructured Data & Text Tweets, product reviews and other user-generated content also can be collected and analyzed Sentiment analysis, topic modeling, keyword & semantic analysis Bayesian classifiers (like your spam filter) can be trained to spot quality complaints and failure reports Possible early warning of service problems
  • 13. Spatial Data Spatial data (precipitation, temperature, crop production, land use, etc.) can be a proxy for usage For example, snow accumulationData spatially correlated with claims Can be predictive of frequency
  • 14. Usage Data & Telematics The real ‘big data’ of warranty (high volume, high velocity) Usage data, telematics, “Internet of Things”, sensor data Real-time streaming or collected at service event Tracking Device (GPS + Altimeter + Accelerometer) with post- processing can identify: •  Mileage and usage time •  Hard driving and hard braking incidents •  Inefficient shifting •  Idle time •  Geography and location of usage •  Altitude changes More accurately map usage cycles to calendar time (relevant for warranty) Segment light vs. heavy usage and model differing expected losses • 
  • 15. Graph Data Graph data models the world as nodes connected by edges Developed for web search (pages connected by hyperlinks) and social network analysis (users connected by friendship) Can capture the complex relationships between parts, assemblies and sub-assemblies Store part explosions and proximity natively as directed graphs Capture failure sets of parts serviced together (causal or not)
  • 16. Customer and Marketing Data Product registration data Demographic & third-party data Marketing event data Promotion participation Direct mail participation and response Email sends, opens, click-through Customer service calls and dispositions Clickstream data Mobile app engagement Consumer research and survey responses Price and messaging test results
  • 17. Other Useful Data Claim Workflow Details Unit Shipment Data Inventory Data Dealer Information (locations, ownership, firmographics, etc.) Retail Unit Sales, Pricing and Discounting Damaged Product Images (especially for ADH claims) ESP Terms & Conditions (often unstructured).
  • 18. What does it all mean? A Data Lake provides a repository and processing engine to gather all relevant data for analysis Handling variety is the key to breaking down data silos A Warranty Data Lake can bring together front-office and back- office data to optimize program performance and improve loss projections No one is going to build a warranty data lake with every data source discussed, but at After, we see companies beginning to build – and incrementally add to – starter data lakes Create the corporate data asset to drive 21st century program improvement.
  • 19. Richard Vermillion, CEO After, Inc. + Fulcrum Analytics, Inc. https://www.linkedin.com/in/rvermillion rvermillion@afterinc.com www.afterinc.com 800.374.4728 212.651.7000 70 West 40th Street, 10th Floor New York, NY 10018 afterinc.com Learn more