SlideShare a Scribd company logo
Data Refinement:
The missing link between data collection and decisions
Stephen H. Yu
Data Strategy & Analytics Consultant
What we will cover
1
• Database Marketing Landscape
• Analytics and Models
• “Model-Ready” Environment
• Data Summarization & Categorization
• Delivery: Scoring & QC
• Closing the Loop
Big Data, Small Data, Neat Data, Messy Data
How is the "Big Data” working out for you?
2.5 quintillion bytes collected per “day”
1 quintillion (exabytes) = 1 billion gigabytes
• Did all this data improve your decision making process?
• Do you have the results to show for?
• Information Overload? You bet!
 Harness insights, drop the noise
2
Data Refinement
Data Refinement
Raw Data
• Demographic / Firmographic
• RFM
• Products & Services Used
• Promotion / Response History
• Lifestyle / Survey Responses
• Delinquent history
• Call / Communication Log
• Movement Data
• Sentiments
Marketing Answers
• Likely to buy a luxury car
• Likely to take a foreign vacation
• Likely to response to free
shipping offer
• Likely to be a high value customer
• Likely to be qualified for credit
• Likely to upgrade
• Likely to leave
• Likely to come back
Refined Answers
5
Data Refinement
Data Refinement
Data Refinement
Custom Models for every stage of Marketing Lifecycle
9
Data Refinement
Data Refinement
Data Refinement
Data Refinement
• Most modern databases optimized for massive storage and
rapid retrieval, not necessarily for predictive analytics
o Relational databases
o NoSQL databases
• Need “Analytical Sandbox” (or Database/Data-mart)
o Structured & de-normalized
o Variables as descriptors of model targets
o Common analytical language (SAS, R, SPSS)
o Must support “in-database” scoring
Unstructured to Structured
14
Data Refinement
Data Refinement
Data Refinement
Data Refinement
Data Refinement
Data Refinement
Data Refinement
Data Refinement
Data Refinement
Data Refinement
Data Refinement
Data Refinement
Data Refinement
Data Refinement
Data Refinement
Data Refinement
Data Refinement
Data Refinement
Data Refinement
Data Refinement
Data Refinement
Data Refinement
Data Refinement
Data Refinement
Data Refinement
Data Refinement
Data Refinement
Data Refinement
Data Refinement
Data Refinement
Data Refinement
It is not just about modeling, but all surrounding services as well
10 essential items to consider when outsourcing analytical projects
1. Consulting capability: Translate marketing goals into mathematics
2. Data processing: Conversion, edit, summarization, data-append, etc.
3. Pricing structure: Model development is only one part; hidden fees?
4. Track record in the industry: Not in rocket science, but in marketing
5. Types of models supported: Watch out for one-trick ponies
6. Speed of execution: Turnaround time measured in days, not weeks
7. Documentation: Full disclosure of algorithms, charts and reports
8. Scoring validation: Job not done until fully scored and validated
9. Back-end analysis: For true “Closed-loop” marketing
10. Ongoing Support: Periodic review and update
10 essential items to consider when outsourcing analytics
46
Data Refinement
Data Refinement
Data Refinement

More Related Content

PPTX
Webinar: If Your Data Could Talk, What Story Would it Tell? Would it Be a Doc...
PPTX
Thinking Big with Big Data
PDF
Presentation for tech job
PDF
Evaluation of big data analysis
PPTX
Help Me, Help You: Supporting Your Data
PPTX
Data Science at UCSB Information Meeting
PDF
Data vault
PPTX
Data and Strategy: Cultivating Their Relationship
Webinar: If Your Data Could Talk, What Story Would it Tell? Would it Be a Doc...
Thinking Big with Big Data
Presentation for tech job
Evaluation of big data analysis
Help Me, Help You: Supporting Your Data
Data Science at UCSB Information Meeting
Data vault
Data and Strategy: Cultivating Their Relationship

What's hot (20)

PPTX
Big data
PPTX
Umc floortje scheepers
PPTX
Data analytics ppt
PDF
Intro
PDF
"Data Pipelines for Small, Messy and Tedious Data", Vladislav Supalov, CAO & ...
PDF
Data science York_University _2016
PDF
OBIEE, Endeca, Hadoop and ORE Development (on Exalytics) (ODTUG 2013)
PDF
MongoDB World 2019: Hands-on with an open source, serverless Data Warehouse f...
PDF
Introduction to Big Data
PDF
Data Standards – progress and case studies
PPTX
A Brief Introduction to Big Data Analytics.pptx
PPTX
How tech startups can leverage data analytics and visualization
PDF
Introduction to analytics
PPTX
From Good to Great – Tips for Becoming a Great Data Analyst
PDF
Data Science Salon: Culture, Data Engineering and Hamburger Stands: Thoughts ...
PPTX
Make good products great with data and analytics
PPTX
Sig big data
PDF
Health Check: Maintaining Enterprise BI
PPTX
Data-Driven Publishing (Ken Brooks at Publishers Launch Frankfurt 2013)
PDF
Tech Connect - The Importance of an IT Strategy
Big data
Umc floortje scheepers
Data analytics ppt
Intro
"Data Pipelines for Small, Messy and Tedious Data", Vladislav Supalov, CAO & ...
Data science York_University _2016
OBIEE, Endeca, Hadoop and ORE Development (on Exalytics) (ODTUG 2013)
MongoDB World 2019: Hands-on with an open source, serverless Data Warehouse f...
Introduction to Big Data
Data Standards – progress and case studies
A Brief Introduction to Big Data Analytics.pptx
How tech startups can leverage data analytics and visualization
Introduction to analytics
From Good to Great – Tips for Becoming a Great Data Analyst
Data Science Salon: Culture, Data Engineering and Hamburger Stands: Thoughts ...
Make good products great with data and analytics
Sig big data
Health Check: Maintaining Enterprise BI
Data-Driven Publishing (Ken Brooks at Publishers Launch Frankfurt 2013)
Tech Connect - The Importance of an IT Strategy
Ad

Similar to Data Refinement (20)

PPSX
Data Refinement: The missing link between data collection and decisions
PPTX
Unit I-Final MArketing analytics unit 1 ppt
PPTX
Module_I_Marketing_Analytics_Ver1.1.pptx
PPTX
Module 1-Introduction to Marketing Analytics.pptx
PPTX
How Companies Turn Data Into Business Value
PDF
Building a Complete View Across the Customer Experience on Oracle BICS
PDF
Marketing Database Analytics Transforming Data for Competitive Advantage 1st ...
PDF
360i Report: Big Data
PDF
Valtech - Big Data for marketing (EN)
PPTX
Rd big data & analytics v1.0
PPSX
JU Analytics Day Presentation by Naveen Agarwal, Creative Analytics Solutions...
PPT
Get your data analytics strategy right!
PDF
Big Data why Now and where to?
PDF
Occam - Making Data-Driven Marketing Simple
PDF
Turning Big Data Analytics To Knowledge PowerPoint Presentation Slides
PPSX
Integrated Marketing Analytics & Data-Driven Intelligence
PPTX
Hair_EOMA_1e_Chap001_PPT.pptx
PPTX
Data-Driven Marketing Using Analytics to Make Informed Decisions
PDF
handout-final-A4-v2-nobleed
PDF
big-data-better-data-white-paper-final
Data Refinement: The missing link between data collection and decisions
Unit I-Final MArketing analytics unit 1 ppt
Module_I_Marketing_Analytics_Ver1.1.pptx
Module 1-Introduction to Marketing Analytics.pptx
How Companies Turn Data Into Business Value
Building a Complete View Across the Customer Experience on Oracle BICS
Marketing Database Analytics Transforming Data for Competitive Advantage 1st ...
360i Report: Big Data
Valtech - Big Data for marketing (EN)
Rd big data & analytics v1.0
JU Analytics Day Presentation by Naveen Agarwal, Creative Analytics Solutions...
Get your data analytics strategy right!
Big Data why Now and where to?
Occam - Making Data-Driven Marketing Simple
Turning Big Data Analytics To Knowledge PowerPoint Presentation Slides
Integrated Marketing Analytics & Data-Driven Intelligence
Hair_EOMA_1e_Chap001_PPT.pptx
Data-Driven Marketing Using Analytics to Make Informed Decisions
handout-final-A4-v2-nobleed
big-data-better-data-white-paper-final
Ad

More from Vivastream (20)

PDF
Exchange Solutions Datasheet_Ecommerce
PDF
Exchange Solutions Datasheet_Customer Engagement Roadmap
TXT
PDF
PPT
PPTX
Jeeva jessf
PDF
Vivastream Poster
PDF
Vivastream Poster
PDF
PDF
Breaking Up is Hard to Do: Small Businesses’ Love Affair with Checks
PDF
EY Smart Commerce Report
PDF
EY Global Consumer Banking Survey 2014
PDF
EY Global Consumer Banking Survey
PDF
Serano
PDF
Accura XV
PDF
Automation for RDC and Mobile
PDF
Healthcare Payments Automation Center
PDF
Next Generation Recognition Solutions
PDF
Automation Services
PDF
Company Overview
Exchange Solutions Datasheet_Ecommerce
Exchange Solutions Datasheet_Customer Engagement Roadmap
Jeeva jessf
Vivastream Poster
Vivastream Poster
Breaking Up is Hard to Do: Small Businesses’ Love Affair with Checks
EY Smart Commerce Report
EY Global Consumer Banking Survey 2014
EY Global Consumer Banking Survey
Serano
Accura XV
Automation for RDC and Mobile
Healthcare Payments Automation Center
Next Generation Recognition Solutions
Automation Services
Company Overview

Recently uploaded (20)

PDF
Hindi spoken digit analysis for native and non-native speakers
PDF
Mushroom cultivation and it's methods.pdf
PDF
Video forgery: An extensive analysis of inter-and intra-frame manipulation al...
PPTX
Chapter 5: Probability Theory and Statistics
PPTX
OMC Textile Division Presentation 2021.pptx
PDF
NewMind AI Weekly Chronicles - August'25-Week II
PDF
From MVP to Full-Scale Product A Startup’s Software Journey.pdf
PDF
gpt5_lecture_notes_comprehensive_20250812015547.pdf
PDF
Heart disease approach using modified random forest and particle swarm optimi...
PDF
ENT215_Completing-a-large-scale-migration-and-modernization-with-AWS.pdf
PPTX
Digital-Transformation-Roadmap-for-Companies.pptx
PDF
7 ChatGPT Prompts to Help You Define Your Ideal Customer Profile.pdf
PDF
Enhancing emotion recognition model for a student engagement use case through...
PDF
Encapsulation_ Review paper, used for researhc scholars
PDF
MIND Revenue Release Quarter 2 2025 Press Release
PDF
Unlocking AI with Model Context Protocol (MCP)
PPTX
TLE Review Electricity (Electricity).pptx
PDF
DP Operators-handbook-extract for the Mautical Institute
PDF
WOOl fibre morphology and structure.pdf for textiles
PDF
project resource management chapter-09.pdf
Hindi spoken digit analysis for native and non-native speakers
Mushroom cultivation and it's methods.pdf
Video forgery: An extensive analysis of inter-and intra-frame manipulation al...
Chapter 5: Probability Theory and Statistics
OMC Textile Division Presentation 2021.pptx
NewMind AI Weekly Chronicles - August'25-Week II
From MVP to Full-Scale Product A Startup’s Software Journey.pdf
gpt5_lecture_notes_comprehensive_20250812015547.pdf
Heart disease approach using modified random forest and particle swarm optimi...
ENT215_Completing-a-large-scale-migration-and-modernization-with-AWS.pdf
Digital-Transformation-Roadmap-for-Companies.pptx
7 ChatGPT Prompts to Help You Define Your Ideal Customer Profile.pdf
Enhancing emotion recognition model for a student engagement use case through...
Encapsulation_ Review paper, used for researhc scholars
MIND Revenue Release Quarter 2 2025 Press Release
Unlocking AI with Model Context Protocol (MCP)
TLE Review Electricity (Electricity).pptx
DP Operators-handbook-extract for the Mautical Institute
WOOl fibre morphology and structure.pdf for textiles
project resource management chapter-09.pdf

Data Refinement

  • 1. Data Refinement: The missing link between data collection and decisions Stephen H. Yu Data Strategy & Analytics Consultant
  • 2. What we will cover 1 • Database Marketing Landscape • Analytics and Models • “Model-Ready” Environment • Data Summarization & Categorization • Delivery: Scoring & QC • Closing the Loop
  • 3. Big Data, Small Data, Neat Data, Messy Data How is the "Big Data” working out for you? 2.5 quintillion bytes collected per “day” 1 quintillion (exabytes) = 1 billion gigabytes • Did all this data improve your decision making process? • Do you have the results to show for? • Information Overload? You bet!  Harness insights, drop the noise 2
  • 6. Raw Data • Demographic / Firmographic • RFM • Products & Services Used • Promotion / Response History • Lifestyle / Survey Responses • Delinquent history • Call / Communication Log • Movement Data • Sentiments Marketing Answers • Likely to buy a luxury car • Likely to take a foreign vacation • Likely to response to free shipping offer • Likely to be a high value customer • Likely to be qualified for credit • Likely to upgrade • Likely to leave • Likely to come back Refined Answers 5
  • 10. Custom Models for every stage of Marketing Lifecycle 9
  • 15. • Most modern databases optimized for massive storage and rapid retrieval, not necessarily for predictive analytics o Relational databases o NoSQL databases • Need “Analytical Sandbox” (or Database/Data-mart) o Structured & de-normalized o Variables as descriptors of model targets o Common analytical language (SAS, R, SPSS) o Must support “in-database” scoring Unstructured to Structured 14
  • 47. It is not just about modeling, but all surrounding services as well 10 essential items to consider when outsourcing analytical projects 1. Consulting capability: Translate marketing goals into mathematics 2. Data processing: Conversion, edit, summarization, data-append, etc. 3. Pricing structure: Model development is only one part; hidden fees? 4. Track record in the industry: Not in rocket science, but in marketing 5. Types of models supported: Watch out for one-trick ponies 6. Speed of execution: Turnaround time measured in days, not weeks 7. Documentation: Full disclosure of algorithms, charts and reports 8. Scoring validation: Job not done until fully scored and validated 9. Back-end analysis: For true “Closed-loop” marketing 10. Ongoing Support: Periodic review and update 10 essential items to consider when outsourcing analytics 46