Data Mining: Concepts and Techniques
— Chapter 4 —
Jiawei Han
Department of Computer Science
University of Illinois at Urbana-Champaign
www.cs.uiuc.edu/~hanj
©2006 Jiawei Han and Micheline Kamber, All rights reserved
Gender \ Birth_Region    Canada    Foreign    Total
M                        16        14         30
F                        10        22         32
Total                    26        36         62
C_j = target class
q_a = a generalized tuple that covers some tuples of the target class, but may also cover some tuples of the contrasting classes
d-weight (range: [0, 1])
\[ d\text{-weight} = \frac{\mathrm{count}(q_a \in C_j)}{\sum_{i=1}^{m} \mathrm{count}(q_a \in C_i)} \]
Count distribution between graduate and undergraduate students for a generalized tuple
Quantitative discriminant rule
\[ \forall X,\ \mathit{graduate\_student}(X) \Leftarrow \mathit{major}(X) = \text{``Science''} \wedge \mathit{age\_range}(X) = \text{``21-25''} \wedge \mathit{gpa}(X) = \text{``good''} \quad [d: 30\%] \]
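As a minimal sketch of how a d-weight such as the 30% in the rule above could be computed, the Python snippet below divides the count of a generalized tuple in the target class by its total count over all classes. The function name `d_weight` and the class counts (90 graduate, 210 undergraduate) are illustrative assumptions, chosen only so that the result matches the 30% in the rule; they are not taken from the slides.

```python
from typing import Dict


def d_weight(counts: Dict[str, int], target: str) -> float:
    """Return the d-weight of a generalized tuple q_a for the target class.

    counts maps each class name C_i to count(q_a in C_i);
    target is the name of the target class C_j.
    """
    total = sum(counts.values())
    if total == 0:
        raise ValueError("the generalized tuple covers no tuples in any class")
    return counts[target] / total


# Hypothetical count distribution for one generalized tuple (assumed values).
counts = {"graduate": 90, "undergraduate": 210}

weight = d_weight(counts, "graduate")
print(f"d-weight = {weight:.0%}")
```

A d-weight near 1 means the generalized tuple mostly represents the target class, while a value near 0 means it mostly represents the contrasting classes.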
BUC
H-cubing
Star-cubing
Multi-feature cubes
Cube-gradient analysis