SlideShare a Scribd company logo
IMPROVING RETAIL SALES CAMPAIGN PROFITABILITY
FINAL CAPSTONE PROJECT : DATA ANALYSIS , FINDINGS AND RECOMMENDATIONS
SHOUNAK MONDAL
POSTGRADUATE DIPLOMA IN DATA SCIENCE, EMERITUS AND COLUMBIA UNIVERSITY
CONTENTS 1 BACKGROUND
2 OBJECTIVES
3 APPROACH
4 ANALYSIS RESULTS
5 RECOMMENDATIONS
1. The Customer is a B2B retailer for office supplies, office electronics and office furniture
2. A marketing campaign was executed to ~16k of its customers
3. Detailed campaign target data and results are available including key data per customer :
a) Resulting Sales
b) Historical sales
c) Type of previous purchase
d) Communication channel preferences
e) Size of target company
f) Language
Background
Analyze the campaign results and provide insights and recommendations on :
1. Which type of customers responded positively to the campaign ?
2. What can the customer be doing for better future campaign performance ?
3. How much can be the financial gains of the improved campaign strategies ?
Objective
Approach
Exploratory Data
Analysis
• Removed outliers
in Number of Year
Prior Transaction
• Drop 10 rows
which has null
values for No of
Employees and
type of previous
purchase
• Impute 442 rows
with Last
Transaction
channel as
Unknown
• Impute 4470 rows
with language as
unknown
• Remove negative
sales and
Historical sales
volume rows
Data Transformation
& Analsysis
• Transform
categorical values
to binary coded
values
• Observe
correlation
between features
and remove highly
corelated features
Model Building
• Build a robust
classification
model with most
influential features
to predict sales or
no sales
• Build a regression
model to predict
sales with the
most influential
features and
predict sales
• Using the
probability of
sales and the
amount of sales
predicted
calculate
profitability given,
gross margin , the
marketing and
transaction costs
Gains / Lift Chart
• Classify the
customers into
deciles and show
profits per decile
• Show profitable
deciles and
expected gains
over random
targeting
• Inner Join based
on customer ID, the
probabilities from
classifier and
predicted sales
sales from linear
regressor model
to formula to find
profits.
• Sort profits and
split by deciles
Recommendations
• Show Which type
of customers
responded
positively to the
campaign ?
• Show What can
XYZ Ltd be doing
for better future
campaign
performance ?
• Show How much
can be the
financial gains of
the improved
campaign
strategies ?
1. Those who have purchased before ( particularly 16-22
Transactions in prior year )
2. Have had historical sales of up to $ 720,000+ ( 75% of the
purchases )
3. Made purchases in the year 1993 and 1994 – these 2 years
seems to have created long term loyal customers
Typical Purchasers are…
0
100
200
300
400
500
600
700
1926
1950
1954
1958
1962
1966
1970
1974
1978
1982
1986
1990
1994
1998
2002
2006
2010
2014
2018
Year of First Purchase
Coefficients–DegreeofInfluence
Numberof
purchaserecords
Year of Purchase
Classification Model was built and tested on ~8000 records test data. The model can be now
used to identify, predict and target positive sales candidates for future campaigns
To build a robust model, Random Forest Classifier was built by training it on ~8000 campaign data which yielded the following :
1. Prediction accuracy score : 85% - Ability of the classifier to make correct predictions
2. Precision score: 77% - The ability of the classifier to predict true sales accurately. Cost of low score is wasted marketing cost.
3. Recall score:65% - The ability of the classifier to find all the true sales. Cost of low score is missed revenue opportunities.
4. f1 Score 70% - balance between the precision and the recall
5. Confusion Matrix
NO YES
NO 5400 438
YES 788 1453
Predicted Sale ( Yes or No )
Actual
Sale
Now that we know targeting which customers will result in sales, next we built a model to predict
“Amount of Sales” for purchasing customers and tested it on ~4384 records positive sales test data
Linear Regression was used which yielded the following :
1. Linear Regression Fit score Training data : 77%
2. Linear Regression Fit score Test data : 75%
3. Root Mean Squared Error of prediction : 559
4. R squared score 0.75 ( degree to which the model
captures and explains the variance of the data )
Findings
1. Size of the company has the largest influence on the
sales amount : larger the company, larger the sales
amounts
2. Previous purchase of office furniture and computer
equipment has next significant influence in amount of
sales.
Coefficients–DegreeofInfluence
Gains Chart for ~1120 records test data representing a future campaign
Total Actual Profit per
customer of the campaign that was executed
( 16,000 records )
The lift chart is built from about 1120 records from test
data for linear regression and same 1120 records from
the classifier by using "inner" join on customer number
i.e common records between the two dataframes in the
linear reg test and classifier test
Deciles
Number of
customers per
decile
Actual Profitability
per customer
Lift over
average
Total Profit % of Profit
Incr Proj Profit
100k Customer base
Total Proj Profit
100k Customer
base
Cuml Incr Profit
100k Customer
base
Cuml Total
Profit
100k Customer
base
(1009.0, 1121.0] 112 504 497 55,630 76% 4,967 5,037 4,967 5,037
(897.0, 1009.0] 112 271 264 29,535 40% 2,637 2,707 7,604 7,744
(785.0, 897.0] 112 75 68 7,607 10% 679 749 8,283 8,493
(673.0, 785.0] 112 21 14 1,518 2% 135 205 8,419 8,699
(561.0, 673.0] 112 -1 -8 (947) -1% (85) (15) 8,334 8,684
(449.0, 561.0] 112 -12 -19 (2,113) -3% (189) (119) 8,145 8,565
(337.0, 449.0] 112 -20 -27 (2,994) -4% (267) (197) 7,878 8,368
(225.0, 337.0] 112 -28 -35 (3,959) -5% (354) (284) 7,525 8,085
(113.0, 225.0] 112 -37 -44 (4,944) -7% (441) (371) 7,083 7,713
(0.999, 113.0] 112 -49 -56 (6,268) -9% (560) (490) 6,523 7,223
Total 1120 7.00 73,063 100%
Recommendations
1. Instead of random targeting of customer base, use the prediction model to target only first 4 deciles type
customers for maximum profitability for future campaigns 1120 record test data.
2. Maximize profitability further by using lower cost channels that reach the above target of customers
effectively since marketing channels showed little or no influence on sales
3. Replicate what was done in the year 1993 and 1994, as it seems to have created long loyal customers
4. Use the model to predict sales, profitability, and expected Return on Investment and leverage it for a more
fact based budget requirements for decision by management / budget approver for next campaigns
THANK YOU

More Related Content

PDF
1 поток 2. аудиторные закупки в реальном времени thibault-oberlin
PPTX
How to make your PPC campaigns more efficient using targeted segmentation
PDF
Case competiton
PDF
Capstone.pptx
PPTX
Sugar crm by sakee
PPTX
13 6e sm module 09
PPT
Chapter17
PPTX
Sales automation CRM
1 поток 2. аудиторные закупки в реальном времени thibault-oberlin
How to make your PPC campaigns more efficient using targeted segmentation
Case competiton
Capstone.pptx
Sugar crm by sakee
13 6e sm module 09
Chapter17
Sales automation CRM

Similar to Improving profitability of campaigns through data science (20)

PDF
Marketelligent Capabilities & Offerings for Sales Analytics
PPTX
Revenue Planning: How Planning for Revenue Impacts a Budget's Effectiveness
PDF
MARKETING INTELLIGENCE_Marketing ROI, come aumentare i ritorni sugli investim...
PDF
B2B Sales Hacks
PPTX
eSUG-April 5-final
PPT
Marketing Mix Models In a Changing Environment
PDF
From Zero to Hero: Building the Internal Reputation of Customer Success
PDF
Building a Successful Cloud and Mobility Practice
PDF
Trivia marketing solutions services v14 uk
PPTX
Business Analytics Forum - Sheffield November 2018
PPTX
Introduction to marketing analytics
PPT
Market Mix Modeling Overview for Data Analytics
PDF
Database Marketing, part two: data enhancement, analytics, and attribution
PDF
Intent Based Segmentation by CleverTap
PPTX
Delivering Personalized Experiences using the Power of Data
PPTX
Wooing the Best Bank Deposit Customers
PDF
Sales Transformation: The Role of Sales Strategy & Operations, Dow Jones & Co...
PDF
5 metrics to strengthen your multichannel sales strategy
PPTX
Segmenting the Market to Find Areas of Opportunity
PDF
CRM Analytics_Marketelligent
Marketelligent Capabilities & Offerings for Sales Analytics
Revenue Planning: How Planning for Revenue Impacts a Budget's Effectiveness
MARKETING INTELLIGENCE_Marketing ROI, come aumentare i ritorni sugli investim...
B2B Sales Hacks
eSUG-April 5-final
Marketing Mix Models In a Changing Environment
From Zero to Hero: Building the Internal Reputation of Customer Success
Building a Successful Cloud and Mobility Practice
Trivia marketing solutions services v14 uk
Business Analytics Forum - Sheffield November 2018
Introduction to marketing analytics
Market Mix Modeling Overview for Data Analytics
Database Marketing, part two: data enhancement, analytics, and attribution
Intent Based Segmentation by CleverTap
Delivering Personalized Experiences using the Power of Data
Wooing the Best Bank Deposit Customers
Sales Transformation: The Role of Sales Strategy & Operations, Dow Jones & Co...
5 metrics to strengthen your multichannel sales strategy
Segmenting the Market to Find Areas of Opportunity
CRM Analytics_Marketelligent
Ad

Recently uploaded (20)

PPTX
CH 2 The Role of IMC in the Marketing Process (combined)
PPTX
Ipsos+Protocols+Playbook+V1.2+(DEC2024)+final+IntClientUseOnly.pptx
PDF
The Role of Search Intent in Shaping SEO Strategies in 2025
PPTX
Transform Your Business with Top Digital Marketing Services_EGlogics.pptx
PDF
Dream Powell - Project and Portfolio 3: Marketing
PDF
Digital Marketing Agency vs Freelancers and VAs: Which Should You Hire in 2025
PDF
Generation Alpha Report 2025 x DKC Analytics.pdf
PPT
Introduction to consumer behavior(1).PPT
PPTX
Unit 2 - Architects Act, COA n competitions.pptx
PDF
Biography of Brady Beitlich
PDF
FSSC 22000 Certification: Meaning, Benefits & Food Safety Compliance
PDF
DigiBrandX: Crafting Identities That Resonate
PDF
digital marketing courses online with od
PPTX
APA Examples Reference Examples Style and
PDF
Instagram Marketing Agency by IIS INDIA.pdf
PPTX
AI usage and the Social Media Marketing World
PDF
RC 14001 Certification: Enhancing ISO 14001 with EHS & Security Standards
PPTX
CH 1 AN INTRODUCTION OF INTEGRATED MARKETING COMMUNICATION (COMBINE)
PDF
Wondershare Filmora Crack Free Download 2025
PPTX
Digital-Marketing-Strategy-Trends-and-Best-Practices-for-2025 PPT3.pptx
CH 2 The Role of IMC in the Marketing Process (combined)
Ipsos+Protocols+Playbook+V1.2+(DEC2024)+final+IntClientUseOnly.pptx
The Role of Search Intent in Shaping SEO Strategies in 2025
Transform Your Business with Top Digital Marketing Services_EGlogics.pptx
Dream Powell - Project and Portfolio 3: Marketing
Digital Marketing Agency vs Freelancers and VAs: Which Should You Hire in 2025
Generation Alpha Report 2025 x DKC Analytics.pdf
Introduction to consumer behavior(1).PPT
Unit 2 - Architects Act, COA n competitions.pptx
Biography of Brady Beitlich
FSSC 22000 Certification: Meaning, Benefits & Food Safety Compliance
DigiBrandX: Crafting Identities That Resonate
digital marketing courses online with od
APA Examples Reference Examples Style and
Instagram Marketing Agency by IIS INDIA.pdf
AI usage and the Social Media Marketing World
RC 14001 Certification: Enhancing ISO 14001 with EHS & Security Standards
CH 1 AN INTRODUCTION OF INTEGRATED MARKETING COMMUNICATION (COMBINE)
Wondershare Filmora Crack Free Download 2025
Digital-Marketing-Strategy-Trends-and-Best-Practices-for-2025 PPT3.pptx
Ad

Improving profitability of campaigns through data science

  • 1. IMPROVING RETAIL SALES CAMPAIGN PROFITABILITY FINAL CAPSTONE PROJECT : DATA ANALYSIS , FINDINGS AND RECOMMENDATIONS SHOUNAK MONDAL POSTGRADUATE DIPLOMA IN DATA SCIENCE, EMERITUS AND COLUMBIA UNIVERSITY
  • 2. CONTENTS 1 BACKGROUND 2 OBJECTIVES 3 APPROACH 4 ANALYSIS RESULTS 5 RECOMMENDATIONS
  • 3. 1. The Customer is a B2B retailer for office supplies, office electronics and office furniture 2. A marketing campaign was executed to ~16k of its customers 3. Detailed campaign target data and results are available including key data per customer : a) Resulting Sales b) Historical sales c) Type of previous purchase d) Communication channel preferences e) Size of target company f) Language Background
  • 4. Analyze the campaign results and provide insights and recommendations on : 1. Which type of customers responded positively to the campaign ? 2. What can the customer be doing for better future campaign performance ? 3. How much can be the financial gains of the improved campaign strategies ? Objective
  • 5. Approach Exploratory Data Analysis • Removed outliers in Number of Year Prior Transaction • Drop 10 rows which has null values for No of Employees and type of previous purchase • Impute 442 rows with Last Transaction channel as Unknown • Impute 4470 rows with language as unknown • Remove negative sales and Historical sales volume rows Data Transformation & Analsysis • Transform categorical values to binary coded values • Observe correlation between features and remove highly corelated features Model Building • Build a robust classification model with most influential features to predict sales or no sales • Build a regression model to predict sales with the most influential features and predict sales • Using the probability of sales and the amount of sales predicted calculate profitability given, gross margin , the marketing and transaction costs Gains / Lift Chart • Classify the customers into deciles and show profits per decile • Show profitable deciles and expected gains over random targeting • Inner Join based on customer ID, the probabilities from classifier and predicted sales sales from linear regressor model to formula to find profits. • Sort profits and split by deciles Recommendations • Show Which type of customers responded positively to the campaign ? • Show What can XYZ Ltd be doing for better future campaign performance ? • Show How much can be the financial gains of the improved campaign strategies ?
  • 6. 1. Those who have purchased before ( particularly 16-22 Transactions in prior year ) 2. Have had historical sales of up to $ 720,000+ ( 75% of the purchases ) 3. Made purchases in the year 1993 and 1994 – these 2 years seems to have created long term loyal customers Typical Purchasers are… 0 100 200 300 400 500 600 700 1926 1950 1954 1958 1962 1966 1970 1974 1978 1982 1986 1990 1994 1998 2002 2006 2010 2014 2018 Year of First Purchase Coefficients–DegreeofInfluence Numberof purchaserecords Year of Purchase
  • 7. Classification Model was built and tested on ~8000 records test data. The model can be now used to identify, predict and target positive sales candidates for future campaigns To build a robust model, Random Forest Classifier was built by training it on ~8000 campaign data which yielded the following : 1. Prediction accuracy score : 85% - Ability of the classifier to make correct predictions 2. Precision score: 77% - The ability of the classifier to predict true sales accurately. Cost of low score is wasted marketing cost. 3. Recall score:65% - The ability of the classifier to find all the true sales. Cost of low score is missed revenue opportunities. 4. f1 Score 70% - balance between the precision and the recall 5. Confusion Matrix NO YES NO 5400 438 YES 788 1453 Predicted Sale ( Yes or No ) Actual Sale
  • 8. Now that we know targeting which customers will result in sales, next we built a model to predict “Amount of Sales” for purchasing customers and tested it on ~4384 records positive sales test data Linear Regression was used which yielded the following : 1. Linear Regression Fit score Training data : 77% 2. Linear Regression Fit score Test data : 75% 3. Root Mean Squared Error of prediction : 559 4. R squared score 0.75 ( degree to which the model captures and explains the variance of the data ) Findings 1. Size of the company has the largest influence on the sales amount : larger the company, larger the sales amounts 2. Previous purchase of office furniture and computer equipment has next significant influence in amount of sales. Coefficients–DegreeofInfluence
  • 9. Gains Chart for ~1120 records test data representing a future campaign Total Actual Profit per customer of the campaign that was executed ( 16,000 records ) The lift chart is built from about 1120 records from test data for linear regression and same 1120 records from the classifier by using "inner" join on customer number i.e common records between the two dataframes in the linear reg test and classifier test Deciles Number of customers per decile Actual Profitability per customer Lift over average Total Profit % of Profit Incr Proj Profit 100k Customer base Total Proj Profit 100k Customer base Cuml Incr Profit 100k Customer base Cuml Total Profit 100k Customer base (1009.0, 1121.0] 112 504 497 55,630 76% 4,967 5,037 4,967 5,037 (897.0, 1009.0] 112 271 264 29,535 40% 2,637 2,707 7,604 7,744 (785.0, 897.0] 112 75 68 7,607 10% 679 749 8,283 8,493 (673.0, 785.0] 112 21 14 1,518 2% 135 205 8,419 8,699 (561.0, 673.0] 112 -1 -8 (947) -1% (85) (15) 8,334 8,684 (449.0, 561.0] 112 -12 -19 (2,113) -3% (189) (119) 8,145 8,565 (337.0, 449.0] 112 -20 -27 (2,994) -4% (267) (197) 7,878 8,368 (225.0, 337.0] 112 -28 -35 (3,959) -5% (354) (284) 7,525 8,085 (113.0, 225.0] 112 -37 -44 (4,944) -7% (441) (371) 7,083 7,713 (0.999, 113.0] 112 -49 -56 (6,268) -9% (560) (490) 6,523 7,223 Total 1120 7.00 73,063 100%
  • 10. Recommendations 1. Instead of random targeting of customer base, use the prediction model to target only first 4 deciles type customers for maximum profitability for future campaigns 1120 record test data. 2. Maximize profitability further by using lower cost channels that reach the above target of customers effectively since marketing channels showed little or no influence on sales 3. Replicate what was done in the year 1993 and 1994, as it seems to have created long loyal customers 4. Use the model to predict sales, profitability, and expected Return on Investment and leverage it for a more fact based budget requirements for decision by management / budget approver for next campaigns