DM Module 3 - Online
DM Module 3 - Online
ASSOCIATION RULES
Unit 3
UNIT 3
❑ Mining Association Rules
• Basics Concepts
• Single Dimensional Boolean Association
• Rules from Transaction Databases
• Multilevel Association Rules from transaction databases
• Multi dimension Association Rules from Relational Database and Data Warehouses.
• Apriori algorithm
• FP-Tree algorithm
Basic Concepts
■ Association Rule in data mining, involves discovering relationships between
independent relational databases or other data repositories through simple
If/Then statements.
■ The procedure aims to identify frequently occurring patterns, correlations, or
associations in datasets across various relational and transactional
databases.
■ The Association rule is a learning technique that helps identify the
dependencies between two data items.
■ An association rule has 2 parts:
– an antecedent (if) and a consequent (then)
– An antecedent is something that’s found in data, and a consequent is
an item that is found in combination with the antecedent.
– “If a customer buys bread, he’s 70% likely of buying milk.”
– In the above association rule, bread is the antecedent and milk is the
consequent.
■ Association rules are created by thoroughly analyzing data and looking for
frequent if/then patterns.
■ Depending on the following two parameters, the important relationships are
observed:
– Support: Support indicates how frequently the if/then relationship
appears in the database.
– Confidence: Confidence talks about the number of times these
relationships have been found to be true.
■ Association rules are a commonly used technique in data
mining and warehousing for discovering interesting
relationships between variables or items in a dataset.
Dimensional
between the purchase of a certain product and the purchase of
another product.
Association
B is the consequent (the item that is being predicted or
associated with A).
■ For example, a single-dimensional association rule could be “If
a customer buys bread, they are likely to also buy milk.”
■ If the items or attributes in an association rule reference only
one dimension, then it is a single-dimensional association rule.
■ For example, the rule
– computer => antivirus software [support = 2%,
confidence = 60%] could be written as
– buys(X, "computer”) = buys(X, “antivirus software")
Single Dimensional Boolean Association