0% found this document useful (0 votes)

2 views5 pages

Data Mining Unit-III

The document discusses key concepts in data mining, focusing on concept description, frequent pattern mining, and association rules. It outlines the differences between concept description and OLAP, defines frequent patterns, and explains data generalization and summarization. Additionally, it covers attribute relevance, methods for class comparison, and the measurement of rule quality in association mining.

Uploaded by

kundurathin06101964

We take content rights seriously. If you suspect this is your content, claim it here.

Available Formats

Download as DOCX, PDF, TXT or read online on Scribd

0% found this document useful (0 votes)

2 views5 pages

Data Mining Unit-III

Uploaded by

kundurathin06101964

We take content rights seriously. If you suspect this is your content, claim it here.

Available Formats

Download as DOCX, PDF, TXT or read online on Scribd

You are on page 1/ 5

DATA MINING (UNIT-III)

1. What is concept description of Data mining.

Ans. Concept Description is a definitive type of data mining. It defines a
set of data including frequent buyers, graduate candidates, etc. It
describes the characterization and comparison of the data. It is also
known as a class description when the concept to be described is
defined as a class of objects. These descriptions can be determined with
the support of data characterization.
2.State the difference between Concept description and OLAP.
ANS. the comparison between concept descriptions in large databases and OLAP
tools.

Concept description in large OLAP tools

databases

The database attributes can be of The data warehouses and OLAP tools are
several types, such as numeric, non- established on a multidimensional data model
numeric, spatial, text, or image. that views the data in the form of a data cube,
making attributes and measuring and
constraining dimensions to non-numeric data.

With aggregation, concept descriptions OLAP defines a simplified model for data
in databases can manage complex data analysis, because of its condition on the
types of the attributes. possible dimension and measure types.

Concept description in data mining OLAP in data warehouses is a simply user-

needed a more automated process that controlled process. The selection of
supports users to decide which dimensions and the application of OLAP
attributes should be included in the operations, including drill-down, roll-up,
analysis, and the degree to which given slicing, and dicing are supervised and
data should be generalized to make an controlled by the users. In OLAP, users are
interesting summarization of the data. required to define a long series of OLAP
operations.

3.Define frequent pattern in mining. State the advantages and disadvantages.

Ans. Frequent pattern mining in data mining is the process of identifying
patterns or associations within a dataset that occur frequently. This is
typically done by analyzing large datasets to find items or sets of items that
appear together frequently.
There are several different algorithms used for frequent pattern mining,
including:
1. Apriori algorithm: This is one of the most commonly used
algorithms for frequent pattern mining. It uses a “bottom-up”
approach to identify frequent itemsets and then generates
association rules from those itemsets.
2. ECLAT algorithm: This algorithm uses a “depth-first search”
approach to identify frequent itemsets. It is particularly efficient for
datasets with a large number of items.
3. FP-growth algorithm: This algorithm uses a “compression”
technique to find frequent patterns efficiently. It is particularly
efficient for datasets with a large number of transactions.
4. Frequent pattern mining has many applications, such as Market
Basket Analysis, Recommender Systems, Fraud Detection, and
many more.

Advantages:

1. It can find useful information which is not visible in simple data

browsing
2. It can find interesting association and correlation among data items

Disadvantages:

1. It can generate a large number of patterns

2. With high dimensionality, the number of patterns can be very large,
making it difficult to interpret the results.

4.What is association and correlation in data mining?

Ans. Association is a very general relationship: one variable provides
information about another. Correlation is more specific: two variables are
correlated when they display an increasing or decreasing trend. For example, in
an increasing trend, observing that X > μX implies that it is more likely that Y >
μY.

5.What is data generalization and summarization?

Ans. Data generalization is the process that abstracts a large set of task-
relevant data in a database from a low conceptual level to higher ones.
It is a summarization of general features of objects in a target class and
produces what is called characteristic rules.
There are two basic approaches of data generalization :
1. Data cube approach :
 It is also known as OLAP approach.
 It is an efficient approach as it is helpful to make the past selling
graph.
 In this approach, computation and results are stored in the Data
cube.
2. Attribute oriented induction :
 It is an online data analysis, query oriented and generalization
based approach.
 It performs off-line aggregation before an OLAP or data mining query
is submitted for processing.

6.What is attribute relevance? State the reason for attribute

relevance.
Ans. The basic concept behind attribute relevance analysis is to evaluate some
measure that can compute the relevance of an attribute regarding a given class or
concept. Such measures involve information gain, ambiguity, and correlation
coefficient.
Attribute relevance analysis for concept description is implemented as follows −
Data collection − It can collect data for both the target class and the contrasting
class by query processing.
Preliminary relevance analysis using conservative AOI − This step recognizes a
set of dimensions and attributes on which the selected relevance measure is to be
used.
Remove − This process removes irrelevant and weakly relevant attributes using the
selected relevance analysis measure.
Generate the concept description using AOI − It can implement AOI using a less
conservative set of attribute generalization thresholds.
There are several reasons for attribute relevance analysis are as follows −
 It can decide which dimensions must be included.
 It can produce a high level of generalization.
 It can reduce the number of attributes that support us to read patterns
easily.

7.What are the methods for class comparison?

Ans. There are several procedures which is as follows −
 Data collection − The set of relevant records in the database is
collected by query processing and is separate accordingly into a target
class and one or a set of contrasting classes.
 Dimension relevance analysis − If there are several dimensions, then
dimension relevance analysis must be implemented on these classes to
choose only the highly relevant dimensions for more analysis.
 Synchronous generalization − Generalization is implemented on the
target class to the level managed by a user-or professional-specified
dimension threshold, which outcomes in a prime target class relation.
 Presentation of the derived comparison − The resulting class
comparison description can be anticipated in the form of tables, graphs,
and rules. This presentation generally involves a “contrasting”.

8.State the basic concept of scalable frequent item set mining

methods.
Ans. Frequent item sets, also known as association rules, are a
fundamental concept in association rule mining, which is a technique used in
data mining to discover relationships between items in a dataset. The goal of
association rule mining is to identify relationships between items in a dataset
that occur frequently together.

9.What is association rule? State the various kind association

rules.

Ans. Association Mining searches for frequent items in the data set. In
frequent mining usually, interesting associations and correlations between
item sets in transactional and relational databases are found. In short,
Frequent Mining shows which items appear together in a transaction or
relationship.
There are various types of association rules in data mining:-
1. Multi-relational association rules: Multi-Relation Association Rules
(MRAR) is a new class of association rules, different from original, simple,
and even multi-relational association rules (usually extracted from multi-
relational databases), each rule element consists of one entity but many a
relationship. These relationships represent indirect relationships between
entities.
2. Generalized association rules: Generalized association rule extraction is
a powerful tool for getting a rough idea of interesting patterns hidden in data.
However, since patterns are extracted at each level of abstraction, the mined
rule sets may be too large to be used effectively for decision-making.
3. Quantitative association rules: Quantitative association rules is a
special type of association rule. Unlike general association rules, where both
left and right sides of the rule should be categorical (nominal or discrete)
attributes, at least one attribute (left or right) of quantitative association rules
must contain numeric attributes

10. Explain association mining to corelation analysis.

Ans. Most association rule mining algorithms employ a support-confidence
framework. Often, many interesting rules can be found using low support
thresholds.

Strong Rules Are Not Necessarily Interesting: An Example

Whether or not a rule is interesting can be assessed either subjectively

or objectively. Ultimately, only the user can judge if a given rule is
interesting, and this judgment, being subjective, may differ from one user
to another. However, objective interestingness measures, based on the
statistics “behind” the data, can be used as one step toward the goal of
weeding out uninteresting rules from presentation to the user.

11. How to measure the quality of rules?

Ans. There are three steps for measuring data quality. 1) Extract all
association rules. 2) Select compatible association rules. 3) Add
confidence factor of compatible rules as criteria of data quality of
transaction.

Technical Manual: Includes
No ratings yet
Technical Manual: Includes
13 pages
Machine Learning with Clustering: A Visual Guide for Beginners with Examples in Python
From Everand
Machine Learning with Clustering: A Visual Guide for Beginners with Examples in Python
Artem Kovera
No ratings yet
Data Mining and Warehousing
100% (3)
Data Mining and Warehousing
30 pages
Cs1004: Data Warehousing and Mining Two Marks Questions and Answers Unit I
No ratings yet
Cs1004: Data Warehousing and Mining Two Marks Questions and Answers Unit I
31 pages
CS1004 DWM 2marks 2013
No ratings yet
CS1004 DWM 2marks 2013
22 pages
Data Mining Unit 1-1
No ratings yet
Data Mining Unit 1-1
11 pages
DWM Unit 2
No ratings yet
DWM Unit 2
4 pages
Data Mining-Unit-1
No ratings yet
Data Mining-Unit-1
21 pages
Thabet Slimani - Efficiant Analysis of Pattern and Association Rule Mining Approaches
No ratings yet
Thabet Slimani - Efficiant Analysis of Pattern and Association Rule Mining Approaches
14 pages
Data Mining Unit2
No ratings yet
Data Mining Unit2
9 pages
Solutions To DM I MID (A)
100% (1)
Solutions To DM I MID (A)
19 pages
Concept Description: Characterization and Comparision: Chapter-10
No ratings yet
Concept Description: Characterization and Comparision: Chapter-10
5 pages
UNIT 1 Introduction of Data Mining
No ratings yet
UNIT 1 Introduction of Data Mining
11 pages
Data Mining: Concepts and Techniques: - Slides For Textbook - Chapter 5
No ratings yet
Data Mining: Concepts and Techniques: - Slides For Textbook - Chapter 5
64 pages
Unit 3
No ratings yet
Unit 3
38 pages
DM UNIT-1 Question and Answer
No ratings yet
DM UNIT-1 Question and Answer
25 pages
Web Minng - Mining Association Rules in Large Databases
No ratings yet
Web Minng - Mining Association Rules in Large Databases
108 pages
CH 4
No ratings yet
CH 4
58 pages
Unit-1 Notes
No ratings yet
Unit-1 Notes
24 pages
Mining Frequent Patterns, Association and Correlations
No ratings yet
Mining Frequent Patterns, Association and Correlations
42 pages
DWM Important Answer
No ratings yet
DWM Important Answer
8 pages
Data Mining Unit-II
No ratings yet
Data Mining Unit-II
4 pages
Soln 1
100% (1)
Soln 1
6 pages
DM Concepts
No ratings yet
DM Concepts
64 pages
Questions and Answers On The Concept of Data Mining
No ratings yet
Questions and Answers On The Concept of Data Mining
3 pages
Data Mining: Concepts and Techniques: - Slides For Textbook - Chapter 5
No ratings yet
Data Mining: Concepts and Techniques: - Slides For Textbook - Chapter 5
73 pages
Chapter 5 Concept Description Characterization and Comparison 395
No ratings yet
Chapter 5 Concept Description Characterization and Comparison 395
64 pages
BCA Data Mining
No ratings yet
BCA Data Mining
116 pages
Understanding Association Rule in Data Mining
No ratings yet
Understanding Association Rule in Data Mining
4 pages
Question Bank: Data Warehousing and Data Mining Semester: VII
No ratings yet
Question Bank: Data Warehousing and Data Mining Semester: VII
4 pages
5 Desc
No ratings yet
5 Desc
60 pages
CHAPTER1 Datamining
No ratings yet
CHAPTER1 Datamining
33 pages
Unit 4
No ratings yet
Unit 4
27 pages
6asso ST
No ratings yet
6asso ST
77 pages
DM 100
No ratings yet
DM 100
17 pages
Module 4
No ratings yet
Module 4
24 pages
Data Mining: An Overview From A Database Perspective
No ratings yet
Data Mining: An Overview From A Database Perspective
30 pages
Data Mining Unit I Notes
No ratings yet
Data Mining Unit I Notes
24 pages
Data Mining Tutorials
No ratings yet
Data Mining Tutorials
52 pages
UNIT-1 Introduction To Data Mining
No ratings yet
UNIT-1 Introduction To Data Mining
29 pages
Data Mining Unit 1
No ratings yet
Data Mining Unit 1
39 pages
DMDW Qa-3.2
No ratings yet
DMDW Qa-3.2
11 pages
DWDM Syllabus
No ratings yet
DWDM Syllabus
2 pages
Lecture 2.1.1 2.1.2
No ratings yet
Lecture 2.1.1 2.1.2
23 pages
DM Unit 2
No ratings yet
DM Unit 2
330 pages
Fundamentals of Data Mining
No ratings yet
Fundamentals of Data Mining
36 pages
Unit-4 DWM
No ratings yet
Unit-4 DWM
73 pages
Ch5 DataMIning
No ratings yet
Ch5 DataMIning
99 pages
2-Concept Hierarchy To Classification of DMS
No ratings yet
2-Concept Hierarchy To Classification of DMS
75 pages
Chapter - 4 - Association Rule Mining
No ratings yet
Chapter - 4 - Association Rule Mining
86 pages
Association Rule-A Tool For Data Mining: Praveen Ranjan Srivastava
No ratings yet
Association Rule-A Tool For Data Mining: Praveen Ranjan Srivastava
6 pages
02-Data Mining Functionalities-2
No ratings yet
02-Data Mining Functionalities-2
23 pages
DWM Mid 2 Question Bank
No ratings yet
DWM Mid 2 Question Bank
5 pages
Unit 1
No ratings yet
Unit 1
21 pages
BCA-404: Data Mining and Data Ware Housing
No ratings yet
BCA-404: Data Mining and Data Ware Housing
19 pages
Statistical Classification: Fundamentals and Applications
From Everand
Statistical Classification: Fundamentals and Applications
Fouad Sabry
No ratings yet
Data Analytics with Generative AI
From Everand
Data Analytics with Generative AI
Younish P
No ratings yet
DATA MINING and MACHINE LEARNING. PREDICTIVE TECHNIQUES: ENSEMBLE METHODS, BOOSTING, BAGGING, RANDOM FOREST, DECISION TREES and REGRESSION TREES.: Examples with MATLAB
From Everand
DATA MINING and MACHINE LEARNING. PREDICTIVE TECHNIQUES: ENSEMBLE METHODS, BOOSTING, BAGGING, RANDOM FOREST, DECISION TREES and REGRESSION TREES.: Examples with MATLAB
César Pérez López
No ratings yet
Data Mining: Fundamentals and Applications
From Everand
Data Mining: Fundamentals and Applications
Fouad Sabry
No ratings yet
Técnicas Estadísticas para la Ciencia de Datos a través de R. Aprendizaje Supervisado: Análisis Discriminante, Árboles de Decisión, Redes Neuronales y Modelos Lineales Generalizados
From Everand
Técnicas Estadísticas para la Ciencia de Datos a través de R. Aprendizaje Supervisado: Análisis Discriminante, Árboles de Decisión, Redes Neuronales y Modelos Lineales Generalizados
César Pérez López
No ratings yet
An Investigation into the Use of a Neural Tree Classifier for Knowledge Discovery in OLAP Databases
From Everand
An Investigation into the Use of a Neural Tree Classifier for Knowledge Discovery in OLAP Databases
David R Swinburne
No ratings yet
Invitation FSWT 29072025
No ratings yet
Invitation FSWT 29072025
3 pages
10 SSC Holiday Homework (25-26) - 1
No ratings yet
10 SSC Holiday Homework (25-26) - 1
3 pages
Tense 2 Class 6
50% (2)
Tense 2 Class 6
8 pages
Tense Class 6
100% (1)
Tense Class 6
3 pages
Appendix C - Machine Language: Code Operand Description
No ratings yet
Appendix C - Machine Language: Code Operand Description
1 page
Logout Edit
No ratings yet
Logout Edit
5 pages
Tutorial - SurvCE.01.Rev3.NTRIP Connections S9III S8
No ratings yet
Tutorial - SurvCE.01.Rev3.NTRIP Connections S9III S8
18 pages
Amani's Resume 2025
No ratings yet
Amani's Resume 2025
2 pages
Sae Arp741c 2016
No ratings yet
Sae Arp741c 2016
22 pages
Ks2 Mathematics 2001 Marking Scheme
No ratings yet
Ks2 Mathematics 2001 Marking Scheme
30 pages
CG Report Final-Full
No ratings yet
CG Report Final-Full
24 pages
Com - Upgadata.up7723 Logcat
No ratings yet
Com - Upgadata.up7723 Logcat
47 pages
Regulation of Streams in The Skopje Region With Measures For Regulation and Rehabilitation of The River Beds
No ratings yet
Regulation of Streams in The Skopje Region With Measures For Regulation and Rehabilitation of The River Beds
29 pages
Design Rotor V-Shape Permanent Magnets-Good
No ratings yet
Design Rotor V-Shape Permanent Magnets-Good
4 pages
Woodmizer LT15 Parts
No ratings yet
Woodmizer LT15 Parts
39 pages
Mono Pump 80 - Manual
No ratings yet
Mono Pump 80 - Manual
162 pages
Go Bag Policy March 2023
No ratings yet
Go Bag Policy March 2023
5 pages
Curriculum Vitae: Nguyen Viet Anh
No ratings yet
Curriculum Vitae: Nguyen Viet Anh
7 pages
VL2900 Inverter Instruction
No ratings yet
VL2900 Inverter Instruction
51 pages
G Suite Interview Questions
No ratings yet
G Suite Interview Questions
7 pages
LJ CG Unit 2
No ratings yet
LJ CG Unit 2
2 pages
Exp22 Excel Ch04 CumulativeAssessment Variation Rockville Auto Sales Instructions
No ratings yet
Exp22 Excel Ch04 CumulativeAssessment Variation Rockville Auto Sales Instructions
2 pages
19 - Heating and Ventilating Systems - HVAC
No ratings yet
19 - Heating and Ventilating Systems - HVAC
6 pages
DBMS File
No ratings yet
DBMS File
96 pages
Project Documentation: File: Examen - Project Date: 16/06/2021 Profile: Codesys V3.5 Sp17
No ratings yet
Project Documentation: File: Examen - Project Date: 16/06/2021 Profile: Codesys V3.5 Sp17
9 pages
Cyber Insurance Policy
No ratings yet
Cyber Insurance Policy
4 pages
Sf6 Gas Density Monitor
No ratings yet
Sf6 Gas Density Monitor
2 pages
Safetica Datasheet EN 2024-04-11
No ratings yet
Safetica Datasheet EN 2024-04-11
8 pages
Extensometer: Types, How It Works, Applications: What Is An Extensometer?
No ratings yet
Extensometer: Types, How It Works, Applications: What Is An Extensometer?
4 pages
Penjelasan Listing Program
No ratings yet
Penjelasan Listing Program
63 pages
Lesson 2 Current Trends and Emerging Technologies - JENCY JOY MALASIG
No ratings yet
Lesson 2 Current Trends and Emerging Technologies - JENCY JOY MALASIG
15 pages
PDF Succinctly
100% (1)
PDF Succinctly
60 pages
JD - Android Developer - Fresher
No ratings yet
JD - Android Developer - Fresher
2 pages

Note: This service is not intended for secure transactions such as banking, social media, email, or purchasing. Use at your own risk. We assume no liability whatsoever for broken pages.

Data Mining Unit-III

Uploaded by

Data Mining Unit-III

Uploaded by

DATA MINING (UNIT-III)

1. What is concept description of Data mining.

Concept description in large OLAP tools

Concept description in data mining OLAP in data warehouses is a simply user-

3.Define frequent pattern in mining. State the advantages and disadvantages.

1. It can find useful information which is not visible in simple data

1. It can generate a large number of patterns

4.What is association and correlation in data mining?

5.What is data generalization and summarization?

6.What is attribute relevance? State the reason for attribute

7.What are the methods for class comparison?

8.State the basic concept of scalable frequent item set mining

9.What is association rule? State the various kind association

10. Explain association mining to corelation analysis.

Strong Rules Are Not Necessarily Interesting: An Example

Whether or not a rule is interesting can be assessed either subjectively

11. How to measure the quality of rules?

You might also like

Pfad - The Proxy pFad of © 2024 Garber Painting. All rights reserved.