Products Frequently Bought Together in Stores
Using classification models
Supervised by
Dr. Amina dahim
2023 - 2024
Submitted by M.Sc. Students
Sabreen Salih Mahdi
Zahraa Fouad Rasool
Hiba Tullah Ziyad

 Introduction .
 Motivation of frequent item sets in online stores .
 Association rule .
 Association rule Advantages/ Disadvantages .
 A-priori Algorithm for computing frequent item sets .
 A-priori Algorithm Advantages/ Disadvantages .
 What Applications use A-priori Algorithm .
 Conclusion .
Outlines

Introduction
 Frequently bought together purchase
recommendations are one of the most impactful
stores strategies that have the potential to
maximize the return on investment on in-house
product stores.
 Frequently bought together are the products that
your customers usually purchase with specific
items in your store.
 Displaying the right products at the right time can
help drive the discovery and sales of your
products.

One of the most important strategies
for influencing sales is the offering of
discounts.
 A discount would enable the
salesperson to increase the average
order value of the transactions.
 Smart upselling and cross-selling
technique that fits into natural buying
habit.
Introduction

Motivations behind frequent sets of products in
online stores
 An easy way to offer shopping assistance.
 The frequently bought together section act
as a shopping assistant by suggesting the
best-suited complementary products.
 Manually curating frequently bought together
products lets you maximize revenue and
profits for your store

 products as frequently bought together
would help you remind your customers
of the products that they might need.
 Products that bought together helps
you boost your sales and profits.
Motivations behind frequent sets of products in
online stores

 This concept itself is derived from the terminology of market basket
analysis, namely the search for relationships of several products in a
purchase transaction.
 Most machine learning algorithms work with numeric datasets and
hence tend to be mathematical. However, association rule mining is
suitable for non-numeric, categorical data.
 Understanding consumer buying behavior is compulsory in business
Association rule Mining

Given a set of transactions, each of which is a set of items, find all
rules (XY) that satisfy user specified minimum support and
confidence constraints.
Support = (#T containing X and Y)/(#T)
Confidence = (#T containing X and Y)/ (#T containing X)
 Applications
 Cross selling and up selling
 Supermarket shelf management
Association rule Mining

What does association analysis do? Example

Example

Example

Example

Simple Example

Steps

Itemset

Antecedent & Consequent

Support

Confidence

 Advantages The employed algorithms have too many parameters for
someone who is not a data mining expert.
The disadvantage of association algorithms is that they are trying to find
patterns within a potentially very large search space and, hence, can require
much more time to run than a decision tree algorithm.
Association rule Advantages/ Disadvantages

 A priori Algorithm : is an significant algorithm for mining frequent
item sets for Boolean association rules.
It contains two processes:-
 Detect all frequent itemsets by scanning DB.
 Form strong association rules in the frequent itemsets.
 A priori pruning principle: If there is any itemset which is infrequent,
its superset should not be generated/tested!
A- priori Algorithm for computing frequent itemsets

Method:
 Initially, scan DB once to get frequent 1-
itemset
 Generate length (k+1) candidate itemsets
from length k frequent itemsets
 Test the candidates against DB
 Terminate when no frequent or candidate
set can be generated
A- priori Algorithm for computing frequent itemsets

Transaction
ID
Items
T1 Hot Dogs , Buns , ketchup
T2 Hot Dogs , Buns
T3 Hot Dogs , Coke , Chips
T4 Chips , Coke
T5 Chips , ketchup
T6 Hot Dogs , Coke , Chips
Find the frequent itemsets on this table, assume that minimum support
count = 3
TDB
Itemset Sup- count
Hot Dogs 4
Buns 2
ketchup 2
Coke 3
Chips 4
C1
1ST SCAN
Itemset Sup- count
Hot Dogs 4
Coke 3
Chips 4
Compare candidate
support count with
minimum support count
L1

Itemset
Sup-
count
Hot Dogs , Coke 2
Hot Dogs , Chips 2
Coke , Chips 3
Find the frequent itemsets on this table, assume that minimum support
count = 3
Itemset
Hot Dogs , Coke
Hot Dogs , Chips
Coke , Chips
Itemset Sup- count
Coke , Chips 3
C2
2nd scan
L2
Compare candidate
support count with
minimum support
count

 Advantages
- Uses large itemset property
- Easily parallelized
- Easy to implement
 Disadvantages
- Assumes transaction database is memory resident
- Requires many database scans
A- priori Algorithm Advantages/ Disadvantages

 Education.
 Medicine.
 Biology.
 E-commerce & Recommendation.
What Applications use this Algorithm ?

 Market Based Analysis is one of the key techniques used by large relations to show
associations between items.
 it can generate association rules from the given transactional datasets.
 Association rules are useful for analyzing and predicting customer behavior.
 The disadvantage of association algorithms is require much more time to run than a decision
tree algorithm.
 The A priori Algorithm is an instrumental algorithm for mining familiar item sets.
 The disadvantage is more exploration space and computational cost is too expensive.
Conclusion


More Related Content

PPTX
big data seminar.pptx
PPTX
MIning association rules and frequent patterns.pptx
PDF
Product recommendation
PPTX
1.pptx .
PPTX
MODULE 5 _ Mining frequent patterns and associations.pptx
PPTX
Association rule mining and Apriori algorithm
PDF
ASSOCIATION RULE MINING BASED ON TRADE LIST
PDF
Dm unit ii r16
big data seminar.pptx
MIning association rules and frequent patterns.pptx
Product recommendation
1.pptx .
MODULE 5 _ Mining frequent patterns and associations.pptx
Association rule mining and Apriori algorithm
ASSOCIATION RULE MINING BASED ON TRADE LIST
Dm unit ii r16

Similar to Products Frequently Bought Together in Stores Using classification models (20)

PPTX
Dma unit 2
PPTX
Association and Classification Algorithm
PPTX
Association Rule Mining
PPT
Data mining arm-2009-v0
PPT
20IT501_DWDM_U3.ppt
PPT
20IT501_DWDM_PPT_Unit_III.ppt
PPTX
Data mining in market basket analysis
PDF
MCA-IV_DataMining16_DataMining_AssociationRules_APriori_Keerti_Dixit.pdf
PPTX
Frequent Itemset Mining (FIM) using aporiori
PPTX
Fundamental of Data Science BCA 6th Sem Notes
PPTX
Fundamental of Data Science BCA 6th Sem ppt
PPTX
Fundamental of Data Science BCA 6th Sem Notes
PPTX
1. Introduction to Association Rule 2. Frequent Item Set Mining 3. Market Bas...
PDF
Data Mining For Supermarket Sale Analysis Using Association Rule
PPTX
Association Rule Mining in Data Mining.pptx
PPT
Associations1
PDF
BIM Data Mining Unit4 by Tekendra Nath Yogi
PPT
Associations.ppt
PDF
DATA MINING-MODULE II NOTES(S4 BCA)_________.pdf
Dma unit 2
Association and Classification Algorithm
Association Rule Mining
Data mining arm-2009-v0
20IT501_DWDM_U3.ppt
20IT501_DWDM_PPT_Unit_III.ppt
Data mining in market basket analysis
MCA-IV_DataMining16_DataMining_AssociationRules_APriori_Keerti_Dixit.pdf
Frequent Itemset Mining (FIM) using aporiori
Fundamental of Data Science BCA 6th Sem Notes
Fundamental of Data Science BCA 6th Sem ppt
Fundamental of Data Science BCA 6th Sem Notes
1. Introduction to Association Rule 2. Frequent Item Set Mining 3. Market Bas...
Data Mining For Supermarket Sale Analysis Using Association Rule
Association Rule Mining in Data Mining.pptx
Associations1
BIM Data Mining Unit4 by Tekendra Nath Yogi
Associations.ppt
DATA MINING-MODULE II NOTES(S4 BCA)_________.pdf
Ad

Recently uploaded (20)

PPTX
20th Century Theater, Methods, History.pptx
DOC
Soft-furnishing-By-Architect-A.F.M.Mohiuddin-Akhand.doc
PDF
1.3 FINAL REVISED K-10 PE and Health CG 2023 Grades 4-10 (1).pdf
PDF
FORM 1 BIOLOGY MIND MAPS and their schemes
DOCX
Cambridge-Practice-Tests-for-IELTS-12.docx
PDF
CISA (Certified Information Systems Auditor) Domain-Wise Summary.pdf
PDF
A GUIDE TO GENETICS FOR UNDERGRADUATE MEDICAL STUDENTS
PDF
HVAC Specification 2024 according to central public works department
PPTX
Onco Emergencies - Spinal cord compression Superior vena cava syndrome Febr...
PPTX
Share_Module_2_Power_conflict_and_negotiation.pptx
PDF
AI-driven educational solutions for real-life interventions in the Philippine...
PPTX
History, Philosophy and sociology of education (1).pptx
PDF
Vision Prelims GS PYQ Analysis 2011-2022 www.upscpdf.com.pdf
PDF
LDMMIA Reiki Yoga Finals Review Spring Summer
PDF
احياء السادس العلمي - الفصل الثالث (التكاثر) منهج متميزين/كلية بغداد/موهوبين
PPTX
Chinmaya Tiranga Azadi Quiz (Class 7-8 )
PDF
Τίμαιος είναι φιλοσοφικός διάλογος του Πλάτωνα
PDF
BP 704 T. NOVEL DRUG DELIVERY SYSTEMS (UNIT 1)
PDF
ChatGPT for Dummies - Pam Baker Ccesa007.pdf
PDF
Hazard Identification & Risk Assessment .pdf
20th Century Theater, Methods, History.pptx
Soft-furnishing-By-Architect-A.F.M.Mohiuddin-Akhand.doc
1.3 FINAL REVISED K-10 PE and Health CG 2023 Grades 4-10 (1).pdf
FORM 1 BIOLOGY MIND MAPS and their schemes
Cambridge-Practice-Tests-for-IELTS-12.docx
CISA (Certified Information Systems Auditor) Domain-Wise Summary.pdf
A GUIDE TO GENETICS FOR UNDERGRADUATE MEDICAL STUDENTS
HVAC Specification 2024 according to central public works department
Onco Emergencies - Spinal cord compression Superior vena cava syndrome Febr...
Share_Module_2_Power_conflict_and_negotiation.pptx
AI-driven educational solutions for real-life interventions in the Philippine...
History, Philosophy and sociology of education (1).pptx
Vision Prelims GS PYQ Analysis 2011-2022 www.upscpdf.com.pdf
LDMMIA Reiki Yoga Finals Review Spring Summer
احياء السادس العلمي - الفصل الثالث (التكاثر) منهج متميزين/كلية بغداد/موهوبين
Chinmaya Tiranga Azadi Quiz (Class 7-8 )
Τίμαιος είναι φιλοσοφικός διάλογος του Πλάτωνα
BP 704 T. NOVEL DRUG DELIVERY SYSTEMS (UNIT 1)
ChatGPT for Dummies - Pam Baker Ccesa007.pdf
Hazard Identification & Risk Assessment .pdf
Ad

Products Frequently Bought Together in Stores Using classification models

  • 1. Products Frequently Bought Together in Stores Using classification models Supervised by Dr. Amina dahim 2023 - 2024 Submitted by M.Sc. Students Sabreen Salih Mahdi Zahraa Fouad Rasool Hiba Tullah Ziyad
  • 2.   Introduction .  Motivation of frequent item sets in online stores .  Association rule .  Association rule Advantages/ Disadvantages .  A-priori Algorithm for computing frequent item sets .  A-priori Algorithm Advantages/ Disadvantages .  What Applications use A-priori Algorithm .  Conclusion . Outlines
  • 3.  Introduction  Frequently bought together purchase recommendations are one of the most impactful stores strategies that have the potential to maximize the return on investment on in-house product stores.  Frequently bought together are the products that your customers usually purchase with specific items in your store.  Displaying the right products at the right time can help drive the discovery and sales of your products.
  • 4.  One of the most important strategies for influencing sales is the offering of discounts.  A discount would enable the salesperson to increase the average order value of the transactions.  Smart upselling and cross-selling technique that fits into natural buying habit. Introduction
  • 5.  Motivations behind frequent sets of products in online stores  An easy way to offer shopping assistance.  The frequently bought together section act as a shopping assistant by suggesting the best-suited complementary products.  Manually curating frequently bought together products lets you maximize revenue and profits for your store
  • 6.   products as frequently bought together would help you remind your customers of the products that they might need.  Products that bought together helps you boost your sales and profits. Motivations behind frequent sets of products in online stores
  • 7.   This concept itself is derived from the terminology of market basket analysis, namely the search for relationships of several products in a purchase transaction.  Most machine learning algorithms work with numeric datasets and hence tend to be mathematical. However, association rule mining is suitable for non-numeric, categorical data.  Understanding consumer buying behavior is compulsory in business Association rule Mining
  • 8.  Given a set of transactions, each of which is a set of items, find all rules (XY) that satisfy user specified minimum support and confidence constraints. Support = (#T containing X and Y)/(#T) Confidence = (#T containing X and Y)/ (#T containing X)  Applications  Cross selling and up selling  Supermarket shelf management Association rule Mining
  • 9.  What does association analysis do? Example
  • 19.   Advantages The employed algorithms have too many parameters for someone who is not a data mining expert. The disadvantage of association algorithms is that they are trying to find patterns within a potentially very large search space and, hence, can require much more time to run than a decision tree algorithm. Association rule Advantages/ Disadvantages
  • 20.   A priori Algorithm : is an significant algorithm for mining frequent item sets for Boolean association rules. It contains two processes:-  Detect all frequent itemsets by scanning DB.  Form strong association rules in the frequent itemsets.  A priori pruning principle: If there is any itemset which is infrequent, its superset should not be generated/tested! A- priori Algorithm for computing frequent itemsets
  • 21.  Method:  Initially, scan DB once to get frequent 1- itemset  Generate length (k+1) candidate itemsets from length k frequent itemsets  Test the candidates against DB  Terminate when no frequent or candidate set can be generated A- priori Algorithm for computing frequent itemsets
  • 22.  Transaction ID Items T1 Hot Dogs , Buns , ketchup T2 Hot Dogs , Buns T3 Hot Dogs , Coke , Chips T4 Chips , Coke T5 Chips , ketchup T6 Hot Dogs , Coke , Chips Find the frequent itemsets on this table, assume that minimum support count = 3 TDB Itemset Sup- count Hot Dogs 4 Buns 2 ketchup 2 Coke 3 Chips 4 C1 1ST SCAN Itemset Sup- count Hot Dogs 4 Coke 3 Chips 4 Compare candidate support count with minimum support count L1
  • 23.  Itemset Sup- count Hot Dogs , Coke 2 Hot Dogs , Chips 2 Coke , Chips 3 Find the frequent itemsets on this table, assume that minimum support count = 3 Itemset Hot Dogs , Coke Hot Dogs , Chips Coke , Chips Itemset Sup- count Coke , Chips 3 C2 2nd scan L2 Compare candidate support count with minimum support count
  • 24.   Advantages - Uses large itemset property - Easily parallelized - Easy to implement  Disadvantages - Assumes transaction database is memory resident - Requires many database scans A- priori Algorithm Advantages/ Disadvantages
  • 25.   Education.  Medicine.  Biology.  E-commerce & Recommendation. What Applications use this Algorithm ?
  • 26.   Market Based Analysis is one of the key techniques used by large relations to show associations between items.  it can generate association rules from the given transactional datasets.  Association rules are useful for analyzing and predicting customer behavior.  The disadvantage of association algorithms is require much more time to run than a decision tree algorithm.  The A priori Algorithm is an instrumental algorithm for mining familiar item sets.  The disadvantage is more exploration space and computational cost is too expensive. Conclusion
  • 27.