Data Mining Unit-III
Data Mining Unit-III
The database attributes can be of The data warehouses and OLAP tools are
several types, such as numeric, non- established on a multidimensional data model
numeric, spatial, text, or image. that views the data in the form of a data cube,
making attributes and measuring and
constraining dimensions to non-numeric data.
With aggregation, concept descriptions OLAP defines a simplified model for data
in databases can manage complex data analysis, because of its condition on the
types of the attributes. possible dimension and measure types.
Advantages:
Disadvantages:
Ans. Association Mining searches for frequent items in the data set. In
frequent mining usually, interesting associations and correlations between
item sets in transactional and relational databases are found. In short,
Frequent Mining shows which items appear together in a transaction or
relationship.
There are various types of association rules in data mining:-
1. Multi-relational association rules: Multi-Relation Association Rules
(MRAR) is a new class of association rules, different from original, simple,
and even multi-relational association rules (usually extracted from multi-
relational databases), each rule element consists of one entity but many a
relationship. These relationships represent indirect relationships between
entities.
2. Generalized association rules: Generalized association rule extraction is
a powerful tool for getting a rough idea of interesting patterns hidden in data.
However, since patterns are extracted at each level of abstraction, the mined
rule sets may be too large to be used effectively for decision-making.
3. Quantitative association rules: Quantitative association rules is a
special type of association rule. Unlike general association rules, where both
left and right sides of the rule should be categorical (nominal or discrete)
attributes, at least one attribute (left or right) of quantitative association rules
must contain numeric attributes
Ans. There are three steps for measuring data quality. 1) Extract all
association rules. 2) Select compatible association rules. 3) Add
confidence factor of compatible rules as criteria of data quality of
transaction.