2
Most read
5
Most read
6
Most read
Association Rule Mining
OMega TechEd
Introduction
2
Association rule learning is a type of unsupervised learning technique that
checks for the dependency of one data item on another data item and maps
accordingly so that it can be more profitable.
It tries to find some interesting relations or associations among the variables
of dataset. It is based on different rules to discover the interesting relations
between variables in the database.
For example, if a customer buys bread, he most likely can also buy butter,
eggs, or milk, so these products are stored within a shelf or mostly nearby.
OMega TechEd
Applications
 Market Basket Analysis: It is one of the popular examples and
applications of association rule mining. This technique is commonly used
by big retailers to determine the association between items.
 Medical Diagnosis: With the help of association rules, patients can be
cured easily, as it helps in identifying the probability of illness for a
particular disease.
 Protein Sequence: The association rules help in determining the synthesis
of artificial Proteins.
 It is also used for the Catalog Design and Loss-leader Analysis and many
more other applications.
3
OMega TechEd
Working
Association rule learning works on the concept of If and Else Statement, such
as if A then B.
Here the If element is called antecedent, and then statement is called
as Consequent. These types of relationships where we can find out some
association or relation between two items is known as single cardinality. It is
all about creating rules, and if the number of items increases, then cardinality
also increases accordingly. So, to measure the associations between thousands
of data items, there are several metrics.
• Support
• Confidence
• Lift
4
OMega TechEd
Support
Support is the frequency of A or how frequently an item appears in the dataset.
It is defined as the fraction of the transaction T that contains the itemset X. If
there are X datasets, then for transactions T, It can be written as-
5
TID Items
1 Bread, Milk
2 Bread, Diaper, Beer, Eggs
3 Milk, Diaper, Beer, Coke
4 Bread, Milk, Diaper, Beer
5 Bread, Milk, Diaper, Coke
{Milk, Diaper}{Beer}
Support = σ ({Milk, Diaper, Beer}) / T
= 2/5
= 0.4
OMega TechEd
Confidence
Confidence indicates how often the rule has been found to be true. Or how
often the items X and Y occur together in the dataset when the occurrence of
X is already given. It is the ratio of the transaction that contains X and Y to the
number of records that contain X.
6
TID Items
1 Bread, Milk
2 Bread, Diaper, Beer, Eggs
3 Milk, Diaper, Beer, Coke
4 Bread, Milk, Diaper, Beer
5 Bread, Milk, Diaper, Coke
{Milk, Diaper}{Beer}
Confidence = σ ({Milk, Diaper, Beer}) /
σ ({Milk, Diaper})
= 2/3
= 0.67
OMega TechEd
Lift
It is the strength of any rule, which can be defined as below formula:
It is the ratio of the observed support measure and expected support if X and
Y are independent of each other. It has three possible values:
• If Lift= 1: The probability of occurrence of antecedent and consequent is
independent of each other.
• Lift>1: It determines the degree to which the two itemset are dependent to
each other.
• Lift<1: It tells us that one item is a substitute for other items, which
means one item has a negative effect on another.
7
OMega TechEd
Example
8
TID Items
1 Bread, Milk
2 Bread, Diaper, Beer, Eggs
3 Milk, Diaper, Beer, Coke
4 Bread, Milk, Diaper, Beer
5 Bread, Milk, Diaper, Coke
{Milk, Diaper}{Beer}
Lift= Supp({Milk, Diaper, Beer}) /
Supp({Milk, Diaper})*Supp({Beer})
Supp({Milk, Diaper, Beer})=2/5 = 0.4
Supp({Milk, Diaper}) = 3/5 = 0.6
Supp({Beer}) = 3/5 =0.6
0.4/(0.6*0.6)
= 1.11
High Association
OMega TechEd
Conclusion
The Association rule is very useful in analyzing datasets. The data is collected
using bar-code scanners in supermarkets. Such databases consists of many
transaction records which list all items bought by a customer on a single
purchase. So, the manager could know if certain groups of items are
consistently purchased together and use this data for adjusting store layouts,
cross-selling, promotions based on statistics.
9
OMega TechEd
Thank you
Reference:
Artificial Intelligence: A Modern Approach, 3rd ed.
Stuart Russell and Peter Norvig
https://www.javatpoint.com/reinforcement-learning
Join Telegram channel for AI notes. Link is in the description.
OMega TechEd

More Related Content

PPTX
Mining Association Rules in Large Database
PPTX
Data mining presentation.ppt
PPT
Chapter 8. Classification Basic Concepts.ppt
PPTX
What is Data mining? Data mining Presentation
PPT
Chapter - 6 Data Mining Concepts and Techniques 2nd Ed slides Han &amp; Kamber
PPTX
Introduction to-data-mining chapter 1
PPTX
Predictive analytics
PPTX
One R (1R) Algorithm
Mining Association Rules in Large Database
Data mining presentation.ppt
Chapter 8. Classification Basic Concepts.ppt
What is Data mining? Data mining Presentation
Chapter - 6 Data Mining Concepts and Techniques 2nd Ed slides Han &amp; Kamber
Introduction to-data-mining chapter 1
Predictive analytics
One R (1R) Algorithm

What's hot (20)

PPTX
Deadlock dbms
PPTX
Back face detection
PDF
Unit 3
PPTX
Association Analysis in Data Mining
PPTX
Tcp/ip server sockets
PPTX
Data mining primitives
PPTX
Data Integration and Transformation in Data mining
PPTX
Association rule mining.pptx
PDF
Remote backup system
PPT
Data Mining: Concepts and Techniques (3rd ed.) - Chapter 3 preprocessing
PPT
Intro to scan conversion
PPTX
Association Rule Learning Part 1: Frequent Itemset Generation
PPTX
Attributes of output primitives( curve attributes & area fill attributes)
PPTX
Syntax-Directed Translation into Three Address Code
PPTX
Concurrency Control in Database Management System
PDF
Functional dependency and normalization
PPTX
Decision trees
PPTX
File system structure
PPTX
Modern Block Cipher- Modern Symmetric-Key Cipher
Deadlock dbms
Back face detection
Unit 3
Association Analysis in Data Mining
Tcp/ip server sockets
Data mining primitives
Data Integration and Transformation in Data mining
Association rule mining.pptx
Remote backup system
Data Mining: Concepts and Techniques (3rd ed.) - Chapter 3 preprocessing
Intro to scan conversion
Association Rule Learning Part 1: Frequent Itemset Generation
Attributes of output primitives( curve attributes & area fill attributes)
Syntax-Directed Translation into Three Address Code
Concurrency Control in Database Management System
Functional dependency and normalization
Decision trees
File system structure
Modern Block Cipher- Modern Symmetric-Key Cipher
Ad

Similar to Association Rule mining (20)

PPTX
Unit-II-1-1@dm.pptx .
PDF
BIM Data Mining Unit4 by Tekendra Nath Yogi
PPTX
Association rules
PPTX
Introduction to Association Rules.pptx
PPTX
Unit 4_ML.pptx
PDF
AssociationRule.pdf
PDF
6. Association Rule.pdf
PPTX
WEEK 11 - Association Mining_020520.pptx
PPTX
Lect6 Association rule & Apriori algorithm
PDF
Association rules and frequent pattern growth algorithms
PPTX
What is FP Growth Analysis and How Can a Business Use Frequent Pattern Mining...
PDF
Data Science - Part VI - Market Basket and Product Recommendation Engines
DOCX
5Association AnalysisBasic Concepts an.docx
PPTX
Association Rule Mining || Data Mining
PPT
Pert 06 association rules
PPTX
BAS 250 Lecture 4
PPTX
Business intelligence
PDF
IRJET- Effecient Support Itemset Mining using Parallel Map Reducing
PDF
SURVEY ON FREQUENT PATTERN MINING
PPT
dm14-association-rules (BahanAR-3).ppt
Unit-II-1-1@dm.pptx .
BIM Data Mining Unit4 by Tekendra Nath Yogi
Association rules
Introduction to Association Rules.pptx
Unit 4_ML.pptx
AssociationRule.pdf
6. Association Rule.pdf
WEEK 11 - Association Mining_020520.pptx
Lect6 Association rule & Apriori algorithm
Association rules and frequent pattern growth algorithms
What is FP Growth Analysis and How Can a Business Use Frequent Pattern Mining...
Data Science - Part VI - Market Basket and Product Recommendation Engines
5Association AnalysisBasic Concepts an.docx
Association Rule Mining || Data Mining
Pert 06 association rules
BAS 250 Lecture 4
Business intelligence
IRJET- Effecient Support Itemset Mining using Parallel Map Reducing
SURVEY ON FREQUENT PATTERN MINING
dm14-association-rules (BahanAR-3).ppt
Ad

More from Megha Sharma (20)

PPTX
Designing Printed Circuit boards, Software Choices, The Design Process
PPTX
Manufacturing PCB, Etching board, milling board, Third party manufacturing, a...
PPTX
Business Model, make thing, sell thing, subscription, customization, Key Reso...
PPTX
Funding an IOT startup, Venture Capital, Government funding, Crowdfunding, Le...
PPTX
Sketch, Iterate and Explore, Nondigital Methods.
PPTX
CNC Milling, Software, Repurposing and Recycling.
PPTX
3D printing, Types of 3D printing: FDM, Laser Sintering, Powder bed, LOM, DLP.
PPTX
Laser Cutting, Choosing a laser cutter, Software, Hinges and joints.
PPTX
Memory management, Types of memory, Making the most of your RAM.
PPTX
Performance and Battery Life, Libraries, Debugging.
PPTX
Prototyping Embedded Devices: Arduino, Developing on the Arduino.
PPTX
Raspberry-Pi, Developing on Raspberry Pi, Difference between Arduino & Raspbe...
PPTX
Open Source versus Closed Source in IOT in IOT
PPTX
Why closed? Why Open? Mixing open and closed source
PPTX
Model Performance Metrics. Accuracy, Precision, Recall
PPTX
Graceful Degradation and Affordance in IOT
PPTX
Web thinking connected device, Small Pieces Loosely joined.
PPTX
Production & Mass Personalization, Changing Embedded Platform, Physical proto...
PPTX
Whose data is it anyways? Public vs Private data collection.
PPTX
Thinking about Prototyping: Sketching, Familiarity, Cost versus Ease of proto...
Designing Printed Circuit boards, Software Choices, The Design Process
Manufacturing PCB, Etching board, milling board, Third party manufacturing, a...
Business Model, make thing, sell thing, subscription, customization, Key Reso...
Funding an IOT startup, Venture Capital, Government funding, Crowdfunding, Le...
Sketch, Iterate and Explore, Nondigital Methods.
CNC Milling, Software, Repurposing and Recycling.
3D printing, Types of 3D printing: FDM, Laser Sintering, Powder bed, LOM, DLP.
Laser Cutting, Choosing a laser cutter, Software, Hinges and joints.
Memory management, Types of memory, Making the most of your RAM.
Performance and Battery Life, Libraries, Debugging.
Prototyping Embedded Devices: Arduino, Developing on the Arduino.
Raspberry-Pi, Developing on Raspberry Pi, Difference between Arduino & Raspbe...
Open Source versus Closed Source in IOT in IOT
Why closed? Why Open? Mixing open and closed source
Model Performance Metrics. Accuracy, Precision, Recall
Graceful Degradation and Affordance in IOT
Web thinking connected device, Small Pieces Loosely joined.
Production & Mass Personalization, Changing Embedded Platform, Physical proto...
Whose data is it anyways? Public vs Private data collection.
Thinking about Prototyping: Sketching, Familiarity, Cost versus Ease of proto...

Recently uploaded (20)

PDF
Unlock new opportunities with location data.pdf
PPTX
Group 1 Presentation -Planning and Decision Making .pptx
PDF
Developing a website for English-speaking practice to English as a foreign la...
PPTX
Modernising the Digital Integration Hub
PDF
DP Operators-handbook-extract for the Mautical Institute
PDF
Getting Started with Data Integration: FME Form 101
PDF
Video forgery: An extensive analysis of inter-and intra-frame manipulation al...
PDF
1 - Historical Antecedents, Social Consideration.pdf
PDF
A Late Bloomer's Guide to GenAI: Ethics, Bias, and Effective Prompting - Boha...
PDF
A review of recent deep learning applications in wood surface defect identifi...
PDF
Hybrid horned lizard optimization algorithm-aquila optimizer for DC motor
PPTX
MicrosoftCybserSecurityReferenceArchitecture-April-2025.pptx
PDF
DASA ADMISSION 2024_FirstRound_FirstRank_LastRank.pdf
PPTX
Chapter 5: Probability Theory and Statistics
PDF
A contest of sentiment analysis: k-nearest neighbor versus neural network
PDF
NewMind AI Weekly Chronicles – August ’25 Week III
DOCX
search engine optimization ppt fir known well about this
PPTX
O2C Customer Invoices to Receipt V15A.pptx
PDF
Transform Your ITIL® 4 & ITSM Strategy with AI in 2025.pdf
PDF
Enhancing emotion recognition model for a student engagement use case through...
Unlock new opportunities with location data.pdf
Group 1 Presentation -Planning and Decision Making .pptx
Developing a website for English-speaking practice to English as a foreign la...
Modernising the Digital Integration Hub
DP Operators-handbook-extract for the Mautical Institute
Getting Started with Data Integration: FME Form 101
Video forgery: An extensive analysis of inter-and intra-frame manipulation al...
1 - Historical Antecedents, Social Consideration.pdf
A Late Bloomer's Guide to GenAI: Ethics, Bias, and Effective Prompting - Boha...
A review of recent deep learning applications in wood surface defect identifi...
Hybrid horned lizard optimization algorithm-aquila optimizer for DC motor
MicrosoftCybserSecurityReferenceArchitecture-April-2025.pptx
DASA ADMISSION 2024_FirstRound_FirstRank_LastRank.pdf
Chapter 5: Probability Theory and Statistics
A contest of sentiment analysis: k-nearest neighbor versus neural network
NewMind AI Weekly Chronicles – August ’25 Week III
search engine optimization ppt fir known well about this
O2C Customer Invoices to Receipt V15A.pptx
Transform Your ITIL® 4 & ITSM Strategy with AI in 2025.pdf
Enhancing emotion recognition model for a student engagement use case through...

Association Rule mining

  • 2. Introduction 2 Association rule learning is a type of unsupervised learning technique that checks for the dependency of one data item on another data item and maps accordingly so that it can be more profitable. It tries to find some interesting relations or associations among the variables of dataset. It is based on different rules to discover the interesting relations between variables in the database. For example, if a customer buys bread, he most likely can also buy butter, eggs, or milk, so these products are stored within a shelf or mostly nearby. OMega TechEd
  • 3. Applications  Market Basket Analysis: It is one of the popular examples and applications of association rule mining. This technique is commonly used by big retailers to determine the association between items.  Medical Diagnosis: With the help of association rules, patients can be cured easily, as it helps in identifying the probability of illness for a particular disease.  Protein Sequence: The association rules help in determining the synthesis of artificial Proteins.  It is also used for the Catalog Design and Loss-leader Analysis and many more other applications. 3 OMega TechEd
  • 4. Working Association rule learning works on the concept of If and Else Statement, such as if A then B. Here the If element is called antecedent, and then statement is called as Consequent. These types of relationships where we can find out some association or relation between two items is known as single cardinality. It is all about creating rules, and if the number of items increases, then cardinality also increases accordingly. So, to measure the associations between thousands of data items, there are several metrics. • Support • Confidence • Lift 4 OMega TechEd
  • 5. Support Support is the frequency of A or how frequently an item appears in the dataset. It is defined as the fraction of the transaction T that contains the itemset X. If there are X datasets, then for transactions T, It can be written as- 5 TID Items 1 Bread, Milk 2 Bread, Diaper, Beer, Eggs 3 Milk, Diaper, Beer, Coke 4 Bread, Milk, Diaper, Beer 5 Bread, Milk, Diaper, Coke {Milk, Diaper}{Beer} Support = σ ({Milk, Diaper, Beer}) / T = 2/5 = 0.4 OMega TechEd
  • 6. Confidence Confidence indicates how often the rule has been found to be true. Or how often the items X and Y occur together in the dataset when the occurrence of X is already given. It is the ratio of the transaction that contains X and Y to the number of records that contain X. 6 TID Items 1 Bread, Milk 2 Bread, Diaper, Beer, Eggs 3 Milk, Diaper, Beer, Coke 4 Bread, Milk, Diaper, Beer 5 Bread, Milk, Diaper, Coke {Milk, Diaper}{Beer} Confidence = σ ({Milk, Diaper, Beer}) / σ ({Milk, Diaper}) = 2/3 = 0.67 OMega TechEd
  • 7. Lift It is the strength of any rule, which can be defined as below formula: It is the ratio of the observed support measure and expected support if X and Y are independent of each other. It has three possible values: • If Lift= 1: The probability of occurrence of antecedent and consequent is independent of each other. • Lift>1: It determines the degree to which the two itemset are dependent to each other. • Lift<1: It tells us that one item is a substitute for other items, which means one item has a negative effect on another. 7 OMega TechEd
  • 8. Example 8 TID Items 1 Bread, Milk 2 Bread, Diaper, Beer, Eggs 3 Milk, Diaper, Beer, Coke 4 Bread, Milk, Diaper, Beer 5 Bread, Milk, Diaper, Coke {Milk, Diaper}{Beer} Lift= Supp({Milk, Diaper, Beer}) / Supp({Milk, Diaper})*Supp({Beer}) Supp({Milk, Diaper, Beer})=2/5 = 0.4 Supp({Milk, Diaper}) = 3/5 = 0.6 Supp({Beer}) = 3/5 =0.6 0.4/(0.6*0.6) = 1.11 High Association OMega TechEd
  • 9. Conclusion The Association rule is very useful in analyzing datasets. The data is collected using bar-code scanners in supermarkets. Such databases consists of many transaction records which list all items bought by a customer on a single purchase. So, the manager could know if certain groups of items are consistently purchased together and use this data for adjusting store layouts, cross-selling, promotions based on statistics. 9 OMega TechEd
  • 10. Thank you Reference: Artificial Intelligence: A Modern Approach, 3rd ed. Stuart Russell and Peter Norvig https://www.javatpoint.com/reinforcement-learning Join Telegram channel for AI notes. Link is in the description. OMega TechEd