SlideShare a Scribd company logo
Virtual Segment
• Current feature assessment
The effect of already existing attribute on the overall base customer value
• New feature assessment
Ingesting new attribute and see how it can change the fabric of
and play manner
• New acquisitions profiling
What kind of market you need to penetrate to optimize the customer
sized sample ingestion
• Recommendation (Next best action)
Which segment to target and tune the top 3 attributes to tune to
one level up
Summary
Model Output
• Uncertainty: Each feature or attribute at the same time is
contributing in resisting the overall customer potential and
increasing the customer overall potential as well.
• Increasing new parallel resistance will decrease the overall
resistance
• Increasing new series resistance will increase the overall
resistance
Model Analogy
Electric circuit resistance analogy (Assumptions)
• Parallel resistance :
1
𝑅 𝑇𝑜𝑡𝑎𝑙 𝑝𝑎𝑟𝑎𝑙𝑙𝑒𝑙
=
1
𝑟1
+
1
𝑟2
+
1
𝑟3
+ ⋯ +
1
𝑟 𝑛
• Series resistance 𝑅 𝑇𝑜𝑡𝑎𝑙 𝑠𝑒𝑟𝑖𝑒𝑠 = 𝑟1 + 𝑟2 + 𝑟3 + ⋯ 𝑟𝑛
• Parallel resistance quotient: 𝑅 𝑇𝑜𝑡𝑎𝑙 𝑃𝑎𝑟𝑎𝑙𝑙𝑒𝑙 =
1
𝑖=1
𝑛 1
𝑟 𝑖
• Series resistance quotient: 𝑅 𝑇𝑜𝑡𝑎𝑙 𝑠𝑒𝑟𝑖𝑒𝑠 = 𝑖=1
𝑛
𝑟𝑖
• 𝑉𝑇𝑜𝑡𝑎𝑙 𝛼𝑅 𝑇𝑜𝑡𝑎𝑙 𝑃𝑎𝑟𝑎𝑙𝑙𝑒𝑙
• 𝑉𝑇𝑜𝑡𝑎𝑙 𝛼𝑅 𝑇𝑜𝑡𝑎𝑙 𝑆𝑒𝑟𝑖𝑒𝑠
• 𝑉𝑇𝑜𝑡𝑎𝑙 = C𝑜𝑛𝑠𝑡𝑎𝑛𝑡 𝑥 𝑖=1
𝑛
𝑟 𝑖
𝑖=1
𝑛 1
𝑟 𝑖
Equation
Analogy
V  Customer potential
r  Customer feature
C  Business Domain/Market constant
Virtual Segment
Data Set
Customer Attribute
Is
Categoric
al?
Calculate Attribute
(Frequency per dataset)
Numerical
Calculate the normalized, standardized,
scaled unitless value from 1 to 100
Per
customer
(Customer Value) Unsupervised clustering
with 10 centers over
the customer value
Get the maximum
difference value
between 2 consecutive
clusters
Query to get the
customer base with the
lower boundary
Tweak or tune
the original attribute
Calculate the Rank per
customer
Yes No
New
Attributes
Remove
correlation,
redundancy
& causality
Get the least 3 value
attributes (X per this
cluster
Recommendation
𝑖=1
𝑛
𝑟𝑖
𝑖=1
𝑛 1
𝑟𝑖
Sampling Criteria
(example to generate sample virtual customers)
Customer
Total Customer revenue from
activation to date
5% Random Sample from the
total base
Calculate the average and
Standard deviation for the total
revenue
100 times
Calculate the
average of means,
final standard
deviation of
standard deviations
Select count (distinct
customer) from base where
total revenue between
average_of_means and ±5
final_standard_deviation *
average_of_means
• Before big data you cannot ingest massive amount of columns with ease to a
dataset, columnar databases made that feasible
• The model is independent of the number data sources, you can add new data
sources instantaneously .
• Processing the model with scalability and efficiency on big data platforms.
Utilizing Big Data - MPP
SWOT
Strength Weakness
Opportunity Threat
• Automating the costly manual analysis
to next best action
• More certainty on variable tuning (ie
age, plan)
• Predict success rate of campaigns with minimal
cost
• Identify which variable to target and test
campaign with virtual segments immediately
• Identifying the correlation for each
added attribute against existing
attributes, can be costly and
complicated
• Some times would need manual
consultancy or advise from domain
experts
• It is complicated to perform the same
analysis on cross net base customers
What’s Next?
Leveraging on Big Data
High Level Example
Source 1
Source 2
Source 3
Source 4
Source 5
Subscriber
Hive- Hbase
Golden Record
per customer from
activation date
(delta Monthly Data
aggregated)
Spark
QL
Hive
Link to Data Model
Final stored data
Model execution
Because of Big Data:
• 100k+ columns can be processed
per dataset
• Billions of records
• Commodity of hardware
• Scalability
• Reliability and supporting
business continuity
Subscriber
Subscriber
Subscriber
Subscriber
Virtual Segment
• Approachable sales/marketing method
• Time to market will decrease
• Better traffic/capacity projections
• Better target customer group
• More data driven organization – centralized data team
11
Thank you!
Implementation Readiness
• Plan -- Gantt Chart
• Project Stakeholders
• Cost Benefit Analysis
• 6 month break even agile sprint
• Data governance plan
• End to end data architecture
• Hardware and resource planning

More Related Content

PPTX
Models ABC
PPTX
Testing of hypothesis case study
PPTX
Master the essentials of conversion optimization
PDF
Initializing & Optimizing Machine Learning Models
PPTX
The Finishing Line
PDF
Hypothesis driven development - Alexander Bertholds, APPRL
PPTX
Query optimization-with-sql
PPTX
Supervised Machine Learning Techniques
Models ABC
Testing of hypothesis case study
Master the essentials of conversion optimization
Initializing & Optimizing Machine Learning Models
The Finishing Line
Hypothesis driven development - Alexander Bertholds, APPRL
Query optimization-with-sql
Supervised Machine Learning Techniques

What's hot (13)

PPTX
Machine Learning Project - 1994 U.S. Census
PDF
Machine learning vs traditional optimization
PDF
Supervised Machine Learning Techniques common algorithms and its application
PPTX
Supervised Machine Learning
PPTX
Calculating a Sample Size
PDF
1555 track 2 ning_using our laptop
PPTX
MSA – Planning & Conducting the MSA
PPTX
Defining the VOC and Defects
PPTX
Using CART For Beginners with A Teclo Example Dataset
DOCX
Btm8107 8 week7 activity apply non-parametric tests a+ work
PPTX
Customer Targeting Augmented Analytics Use Case - Smarten
Machine Learning Project - 1994 U.S. Census
Machine learning vs traditional optimization
Supervised Machine Learning Techniques common algorithms and its application
Supervised Machine Learning
Calculating a Sample Size
1555 track 2 ning_using our laptop
MSA – Planning & Conducting the MSA
Defining the VOC and Defects
Using CART For Beginners with A Teclo Example Dataset
Btm8107 8 week7 activity apply non-parametric tests a+ work
Customer Targeting Augmented Analytics Use Case - Smarten
Ad

Similar to Virtual segment brief (20)

PDF
Machine Learning - Algorithms and simple business cases
PDF
Supercharge your AB testing with automated causal inference - Community Works...
PDF
Customer choice probabilities
PPTX
Customer_Churn_prediction.pptx
PPTX
Customer_Churn_prediction.pptx
PDF
Lead Scoring Group Case Study Presentation.pdf
PDF
1710 track3 zhu
PDF
Barga Galvanize Sept 2015
PPTX
Intro to ml_2021
PPTX
machineLearningTypingTool_Rev1
PDF
Introduction to Artificial Intelligence_ Lec 10
PPTX
Case Study: It’s All About Data – And the Customer
PDF
Introduction to machine learning
PPTX
Predicting House Prices: A Machine Learning Approach
PPTX
Unveiling the Market: Predicting House Prices with Data Science
PPTX
Statistical Learning and Model Selection (1).pptx
PPTX
Maximizing Retention with Minimal Effort
PPTX
What Is Multilayer Perceptron Classifier And How Is It Used For Enterprise An...
PDF
credit card fraud detection
Machine Learning - Algorithms and simple business cases
Supercharge your AB testing with automated causal inference - Community Works...
Customer choice probabilities
Customer_Churn_prediction.pptx
Customer_Churn_prediction.pptx
Lead Scoring Group Case Study Presentation.pdf
1710 track3 zhu
Barga Galvanize Sept 2015
Intro to ml_2021
machineLearningTypingTool_Rev1
Introduction to Artificial Intelligence_ Lec 10
Case Study: It’s All About Data – And the Customer
Introduction to machine learning
Predicting House Prices: A Machine Learning Approach
Unveiling the Market: Predicting House Prices with Data Science
Statistical Learning and Model Selection (1).pptx
Maximizing Retention with Minimal Effort
What Is Multilayer Perceptron Classifier And How Is It Used For Enterprise An...
credit card fraud detection
Ad

Recently uploaded (20)

PPTX
1_Introduction to advance data techniques.pptx
PPTX
Database Infoormation System (DBIS).pptx
PPTX
climate analysis of Dhaka ,Banglades.pptx
PPTX
ALIMENTARY AND BILIARY CONDITIONS 3-1.pptx
PDF
Clinical guidelines as a resource for EBP(1).pdf
PDF
Fluorescence-microscope_Botany_detailed content
PPTX
Business Ppt On Nestle.pptx huunnnhhgfvu
PDF
.pdf is not working space design for the following data for the following dat...
PDF
Lecture1 pattern recognition............
PDF
Galatica Smart Energy Infrastructure Startup Pitch Deck
PPTX
Qualitative Qantitative and Mixed Methods.pptx
PPTX
iec ppt-1 pptx icmr ppt on rehabilitation.pptx
PPTX
AI Strategy room jwfjksfksfjsjsjsjsjfsjfsj
PPTX
STUDY DESIGN details- Lt Col Maksud (21).pptx
PPTX
Introduction-to-Cloud-ComputingFinal.pptx
PPTX
Introduction to Basics of Ethical Hacking and Penetration Testing -Unit No. 1...
PPTX
Introduction to Knowledge Engineering Part 1
PPTX
Computer network topology notes for revision
PDF
TRAFFIC-MANAGEMENT-AND-ACCIDENT-INVESTIGATION-WITH-DRIVING-PDF-FILE.pdf
PPTX
IBA_Chapter_11_Slides_Final_Accessible.pptx
1_Introduction to advance data techniques.pptx
Database Infoormation System (DBIS).pptx
climate analysis of Dhaka ,Banglades.pptx
ALIMENTARY AND BILIARY CONDITIONS 3-1.pptx
Clinical guidelines as a resource for EBP(1).pdf
Fluorescence-microscope_Botany_detailed content
Business Ppt On Nestle.pptx huunnnhhgfvu
.pdf is not working space design for the following data for the following dat...
Lecture1 pattern recognition............
Galatica Smart Energy Infrastructure Startup Pitch Deck
Qualitative Qantitative and Mixed Methods.pptx
iec ppt-1 pptx icmr ppt on rehabilitation.pptx
AI Strategy room jwfjksfksfjsjsjsjsjfsjfsj
STUDY DESIGN details- Lt Col Maksud (21).pptx
Introduction-to-Cloud-ComputingFinal.pptx
Introduction to Basics of Ethical Hacking and Penetration Testing -Unit No. 1...
Introduction to Knowledge Engineering Part 1
Computer network topology notes for revision
TRAFFIC-MANAGEMENT-AND-ACCIDENT-INVESTIGATION-WITH-DRIVING-PDF-FILE.pdf
IBA_Chapter_11_Slides_Final_Accessible.pptx

Virtual segment brief

  • 2. • Current feature assessment The effect of already existing attribute on the overall base customer value • New feature assessment Ingesting new attribute and see how it can change the fabric of and play manner • New acquisitions profiling What kind of market you need to penetrate to optimize the customer sized sample ingestion • Recommendation (Next best action) Which segment to target and tune the top 3 attributes to tune to one level up Summary Model Output
  • 3. • Uncertainty: Each feature or attribute at the same time is contributing in resisting the overall customer potential and increasing the customer overall potential as well. • Increasing new parallel resistance will decrease the overall resistance • Increasing new series resistance will increase the overall resistance Model Analogy Electric circuit resistance analogy (Assumptions)
  • 4. • Parallel resistance : 1 𝑅 𝑇𝑜𝑡𝑎𝑙 𝑝𝑎𝑟𝑎𝑙𝑙𝑒𝑙 = 1 𝑟1 + 1 𝑟2 + 1 𝑟3 + ⋯ + 1 𝑟 𝑛 • Series resistance 𝑅 𝑇𝑜𝑡𝑎𝑙 𝑠𝑒𝑟𝑖𝑒𝑠 = 𝑟1 + 𝑟2 + 𝑟3 + ⋯ 𝑟𝑛 • Parallel resistance quotient: 𝑅 𝑇𝑜𝑡𝑎𝑙 𝑃𝑎𝑟𝑎𝑙𝑙𝑒𝑙 = 1 𝑖=1 𝑛 1 𝑟 𝑖 • Series resistance quotient: 𝑅 𝑇𝑜𝑡𝑎𝑙 𝑠𝑒𝑟𝑖𝑒𝑠 = 𝑖=1 𝑛 𝑟𝑖 • 𝑉𝑇𝑜𝑡𝑎𝑙 𝛼𝑅 𝑇𝑜𝑡𝑎𝑙 𝑃𝑎𝑟𝑎𝑙𝑙𝑒𝑙 • 𝑉𝑇𝑜𝑡𝑎𝑙 𝛼𝑅 𝑇𝑜𝑡𝑎𝑙 𝑆𝑒𝑟𝑖𝑒𝑠 • 𝑉𝑇𝑜𝑡𝑎𝑙 = C𝑜𝑛𝑠𝑡𝑎𝑛𝑡 𝑥 𝑖=1 𝑛 𝑟 𝑖 𝑖=1 𝑛 1 𝑟 𝑖 Equation Analogy V  Customer potential r  Customer feature C  Business Domain/Market constant
  • 5. Virtual Segment Data Set Customer Attribute Is Categoric al? Calculate Attribute (Frequency per dataset) Numerical Calculate the normalized, standardized, scaled unitless value from 1 to 100 Per customer (Customer Value) Unsupervised clustering with 10 centers over the customer value Get the maximum difference value between 2 consecutive clusters Query to get the customer base with the lower boundary Tweak or tune the original attribute Calculate the Rank per customer Yes No New Attributes Remove correlation, redundancy & causality Get the least 3 value attributes (X per this cluster Recommendation 𝑖=1 𝑛 𝑟𝑖 𝑖=1 𝑛 1 𝑟𝑖
  • 6. Sampling Criteria (example to generate sample virtual customers) Customer Total Customer revenue from activation to date 5% Random Sample from the total base Calculate the average and Standard deviation for the total revenue 100 times Calculate the average of means, final standard deviation of standard deviations Select count (distinct customer) from base where total revenue between average_of_means and ±5 final_standard_deviation * average_of_means
  • 7. • Before big data you cannot ingest massive amount of columns with ease to a dataset, columnar databases made that feasible • The model is independent of the number data sources, you can add new data sources instantaneously . • Processing the model with scalability and efficiency on big data platforms. Utilizing Big Data - MPP
  • 8. SWOT Strength Weakness Opportunity Threat • Automating the costly manual analysis to next best action • More certainty on variable tuning (ie age, plan) • Predict success rate of campaigns with minimal cost • Identify which variable to target and test campaign with virtual segments immediately • Identifying the correlation for each added attribute against existing attributes, can be costly and complicated • Some times would need manual consultancy or advise from domain experts • It is complicated to perform the same analysis on cross net base customers
  • 9. What’s Next? Leveraging on Big Data High Level Example Source 1 Source 2 Source 3 Source 4 Source 5 Subscriber Hive- Hbase Golden Record per customer from activation date (delta Monthly Data aggregated) Spark QL Hive Link to Data Model Final stored data Model execution Because of Big Data: • 100k+ columns can be processed per dataset • Billions of records • Commodity of hardware • Scalability • Reliability and supporting business continuity Subscriber Subscriber Subscriber Subscriber
  • 10. Virtual Segment • Approachable sales/marketing method • Time to market will decrease • Better traffic/capacity projections • Better target customer group • More data driven organization – centralized data team
  • 12. Implementation Readiness • Plan -- Gantt Chart • Project Stakeholders • Cost Benefit Analysis • 6 month break even agile sprint • Data governance plan • End to end data architecture • Hardware and resource planning