
ISOM3360 Data Mining for Business Analytics, Session 4

Decision Trees (I)

Instructor: Rong Zheng


Department of ISOM
Fall 2020
Recap: Data Understanding

Preliminary investigation of the data to better understand its specific characteristics
 Helps in selecting appropriate data mining algorithms

Things to look at
 Class imbalance
 Dispersion of data attribute values
 Skewness, outliers, missing values
 Correlation analysis

Visualization tools are important
 Histograms, box plots
 Scatter plots
2
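A minimal Python sketch of these checks, assuming a pandas DataFrame loaded from a hypothetical customers.csv with columns such as Default, Balance, and Age (none of these names come from the slides):

import pandas as pd
import matplotlib.pyplot as plt

df = pd.read_csv("customers.csv")                     # hypothetical file

print(df["Default"].value_counts(normalize=True))     # class imbalance
print(df.describe())                                  # dispersion / skewness of attribute values
print(df.isna().sum())                                # missing values per column
print(df.corr(numeric_only=True))                     # correlation analysis

df["Balance"].plot(kind="hist")                       # histogram
plt.show()
df[["Age", "Balance"]].plot(kind="box")               # box plots
plt.show()
df.plot(kind="scatter", x="Age", y="Balance")         # scatter plot
plt.show()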
Recap: Data Preprocessing

Data transformation might be needed
 Handling missing values
 Handling categorical variables
 Feature transformation
 e.g., log transformation
 Normalization (back to this when clustering is discussed)
 Feature discretization

3
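A rough pandas sketch of these transformations, again using the hypothetical customers.csv and column names from the earlier sketch:

import numpy as np
import pandas as pd

df = pd.read_csv("customers.csv")                            # hypothetical file

# Handling missing values: e.g., fill numeric gaps with the column median
df["Balance"] = df["Balance"].fillna(df["Balance"].median())

# Handling categorical variables: one-hot (dummy) encoding
df = pd.get_dummies(df, columns=["Employed"])

# Feature transformation: e.g., log transformation of a skewed variable
df["LogBalance"] = np.log1p(df["Balance"])

# Feature discretization: e.g., bin Age into ranges
df["AgeBand"] = pd.cut(df["Age"], bins=[0, 30, 45, 60, 120])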
Question

Which of the following normalization method(s) may transform the original variable to a negative value: min-max, z-score, both, or neither?

4
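If you want to check your answer empirically, here is a tiny NumPy sketch applying both normalizations to a made-up toy vector:

import numpy as np

x = np.array([2.0, 4.0, 6.0, 8.0])               # toy values

min_max = (x - x.min()) / (x.max() - x.min())    # rescales to the range [0, 1]
z_score = (x - x.mean()) / x.std()               # centers at 0, unit variance

print(min_max)   # all values fall in [0, 1]
print(z_score)   # values below the mean come out negative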
What Induction Algorithm Shall We Use?

5
Commonly Used Induction Algorithms

Post-mortem analysis of a popular data mining competition

6
Why Decision Trees?

Decision trees are one of the most popular data mining tools.
 Classification trees (used when the target variable is categorical)
 Regression trees (used when the target variable is numeric)

They are easy to understand, implement, and use, and computationally cheap.
Model comprehensibility is important for communication to non-DM-savvy stakeholders.

7
Classification Tree

Employed   Balance   Age   Default
Yes        123,000   50    No
No          51,100   40    Yes
No          68,000   55    No
Yes         34,000   46    Yes
Yes         50,000   44    No
No         100,000   50    Yes

Objective: predicting borrowers who will default on loan payments.

8
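As an illustration (not part of the original slides), the six examples above can be fed to scikit-learn to fit a classification tree; encoding Employed as 1/0 is an arbitrary choice made just for this sketch:

import pandas as pd
from sklearn.tree import DecisionTreeClassifier, export_text

# The six training examples from the slide (Employed encoded as Yes=1, No=0).
data = pd.DataFrame({
    "Employed": [1, 0, 0, 1, 1, 0],
    "Balance":  [123000, 51100, 68000, 34000, 50000, 100000],
    "Age":      [50, 40, 55, 46, 44, 50],
    "Default":  ["No", "Yes", "No", "Yes", "No", "Yes"],
})

X, y = data[["Employed", "Balance", "Age"]], data["Default"]

tree = DecisionTreeClassifier(criterion="entropy")   # entropy-based splits, as in these slides
tree.fit(X, y)
print(export_text(tree, feature_names=list(X.columns)))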
Classification Tree: Upside-Down

Employed?  (root node)
  Yes -> Class = Not Default  (leaf)
  No  -> Balance?
           >= 50K -> Class = Not Default
           <  50K -> Age?
                       >= 45 -> Class = Not Default
                       <  45 -> Class = Default
9
Classification Tree: Divide and Conquer
“Recursive Partitioning”
(The tree from the previous slide is shown alongside.)

Nodes
 Each node represents one attribute
 Tests on a nominal attribute: the number of splits (branches) is the number of possible values, or 2 (one value vs. the rest)
 Continuous attributes are discretized

Leaves
 A class assignment (e.g., default / not default)
 Also provide a distribution over all possible classes (e.g., default with probability 0.25, not default with prob. 0.75)
10
How a Tree is Used for Classification?
To determine the class of a new example, e.g., Mark: age 40, retired, balance 38K.
 The example is routed down the tree according to the values of its attributes.
 At each node, a test is applied to one attribute.
 When a leaf is reached, the example is assigned to a class, or alternatively to a distribution over the possible classes (e.g., default with probability 0.25, not default with prob. 0.75).

(The tree from the previous slides is shown alongside.)
11
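The routing logic for this tree can be written as a few nested tests; a minimal sketch (the function name classify is just for illustration):

def classify(employed, balance, age):
    """Route one example down the tree from the slides and return the predicted class."""
    if employed == "Yes":
        return "Not Default"          # left branch: employed borrowers
    if balance >= 50_000:
        return "Not Default"          # not employed, but high balance
    if age >= 45:
        return "Not Default"          # not employed, low balance, older
    return "Default"                  # not employed, low balance, younger

# Mark: age 40, retired (not employed), balance 38K
print(classify(employed="No", balance=38_000, age=40))   # -> "Default"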
Assigning Probability Estimates

[Figure: the entire population is partitioned into nested subgroups by Balance (>= 50k / < 50k) and, within each, by Age (>= 45 / < 45).]

Q: How would you assign estimates of class probability (e.g., probability of default / not default) based on such trees?
12
Exercise: Assume this is the classification tree you learned from the past defaulting data. A new guy, who is 45 and has a 20K balance, is applying for a credit card issued by your company. Can you predict whether this new guy is going to default? How confident are you about your prediction? Another girl is also applying for the same credit card, but the only information we have about her is that she has a 70K balance. Can you predict whether she will default? How sure are you about that?

13
Classification Tree Learning

Objective: based on customer attributes, partition the customers into subgroups that are less impure with respect to the class (i.e., such that in each group most instances belong to the same class).

[Figure: 12 example customers labeled Yes / No (7 Yes, 5 No), before any partitioning.]
14
Classification Tree Learning

Partitioning into “purer” groups

[Figure: splitting on body color into Orange Bodies (5 Yes, 1 No) and Purple Bodies (2 Yes, 4 No).]
15
Classification Tree Learning

Partitioning into “purer” groups, recursively

[Figure: the Purple Bodies subgroup (2 Yes, 4 No) is selected for further splitting.]
16
Classification Tree Learning

Purple Bodies

[Figure: the Purple Bodies subgroup is split by head color: Red Head (2 Yes) and Green Head (4 No).]
17
Classification Tree Learning

Partitioning into “purer” groups

[Figure: the body-color split shown again: Orange Bodies (5 Yes, 1 No) and Purple Bodies (2 Yes, 4 No).]
18
Classification Tree Learning

Partitioning into “purer” groups, recursively

[Figure: the Orange Bodies subgroup (5 Yes, 1 No) is selected for further splitting.]
19
Classification Tree Learning
Orange Bodies

[Figure: the Orange Bodies subgroup is split by limb color: Blue Limbs (5 Yes) and Yellow Limbs (1 No).]
20
Classification Tree Learning
[Figure: the full tree. The root splits on Body color into Orange Bodies (5 Yes, 1 No) and Purple Bodies (2 Yes, 4 No); the Orange branch is then split on Limbs (Blue Limbs: 5 Yes; Yellow Limbs: 1 No) and the Purple branch on Head color (Red Head: 2 Yes; Green Head: 4 No).]
21
Summary: Classification Tree Learning

A tree is constructed by recursively partitioning the examples.
With each partition, the examples are split into subgroups that are “increasingly pure”.

22
Let’s play a game. I have someone in mind, and your job is to guess this person. You can only ask yes/no questions.

This person is an entrepreneur.

Go!

23
Next…

Some important questions have not been answered yet:
 How to automatically choose which attribute to use to split the data?
 When to stop splitting?

24
How to Choose Which Attribute to Split Over?

Objectives
 For each splitting node, choose the attribute that best partitions the population into less impure groups.
 All else being equal, fewer nodes is better (more comprehensible, easier to use, and less prone to overfitting).

Impurity measures: many are available, but the most common one (from information theory) is entropy.

25
Entropy

Entropy = − Σᵢ pᵢ log₂ pᵢ , where pᵢ is the proportion of class i in the data.

For example: our initial population is composed of 16 cases of class “Default” and 14 cases of class “Not default”.

Entropy (entire population of examples) = −(16/30) log₂(16/30) − (14/30) log₂(14/30) ≈ 0.997
26
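A small Python helper that computes this entropy from class counts, checked against the slide’s example (16 “Default” vs. 14 “Not default”):

import math

def entropy(counts):
    """Entropy of a class distribution given as a list of class counts (0 * log2(0) is treated as 0)."""
    total = sum(counts)
    return -sum((c / total) * math.log2(c / total) for c in counts if c > 0)

# The slide's population: 16 cases of "Default" and 14 of "Not default".
print(entropy([16, 14]))   # ~0.997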
Entropy Exercise

A dataset is composed of 10 cases of class “Positive” and 10 cases of class “Negative”. Entropy = ?

A dataset is composed of 0 cases of class “Positive” and 20 cases of class “Negative”. Entropy = ?

[Figure: the two-class entropy function, with entropy (0 to 1) plotted against the % of one class.]

 Tip: 0 log₂ 0 = 0
27
Information Gain (based on Entropy)

The information gain is the reduction in entropy after a dataset is split on an attribute.

Information Gain = entropy(parent) − [weighted average] entropy(children)

where the weight of each child is given by the proportion of the examples in that child.

[Figure: a parent node split into two children, Balance < 50k and Balance >= 50k.]
28
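A sketch of this formula in code, reusing the entropy helper from the previous slide; the example counts in the docstring are placeholders, not numbers taken from the slides:

def information_gain(parent_counts, children_counts):
    """entropy(parent) minus the weighted average entropy of the children.

    parent_counts:   class counts at the parent node, e.g. [16, 14]
    children_counts: one list of class counts per child, e.g. [[10, 3], [6, 11]]
    """
    n = sum(parent_counts)
    weighted_children = sum(
        (sum(child) / n) * entropy(child) for child in children_counts
    )
    return entropy(parent_counts) - weighted_children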
Information Gain Example
30 instances at the parent node (entropy = 0.997).

Split on Balance (< 50k vs. >= 50k): one child receives 17 instances and the other 13 instances.

(Weighted) average impurity of children = (17/30) × entropy(child with 17) + (13/30) × entropy(child with 13) = 0.615

Information Gain = 0.997 − 0.615 = 0.382
29
What If We Split Over “Age” First?
30 instances at the parent node.

Split on Age: Age < 45 → 15 instances; Age >= 45 → 15 instances.

Impurity of each child = ?  (Exercise!)

Information Gain = ?
(Recall: the gain from first splitting on “Balance” was 0.382.)
30
Our Original Question

Now we are ready to answer the anxiously awaited question: how to choose which attribute to split over?

Answer: at each node, choose the attribute that obtains the maximum information gain!

31
Decision Tree Algorithm (Full Tree)

 Step 1: Calculate the information gain from splitting over each attribute, using the dataset.
 Step 2: Split the set into subsets using the attribute for which the information gain is maximum.
 Step 3: Make a decision tree node containing that attribute, divide the dataset by its branches, and repeat the same process on every node.
 Step 4a: A node with entropy of 0 is a leaf.
 Step 4b: A node with entropy more than 0 needs further splitting.

32
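A rough ID3-style sketch of these steps for categorical attributes, reusing the entropy and information_gain helpers sketched on the earlier slides; it illustrates the procedure above and is not the exact implementation used in the course:

from collections import Counter

def build_tree(rows, attributes, target):
    """rows: list of dicts; attributes: candidate attribute names; target: name of the label column."""
    labels = [row[target] for row in rows]

    # Step 4a: a node with entropy 0 (all labels identical) is a leaf;
    # we also stop if no candidate attributes remain (an extra guard not on the slide).
    if len(set(labels)) == 1 or not attributes:
        return Counter(labels).most_common(1)[0][0]

    # Step 1: information gain of splitting on each attribute.
    def gain(attr):
        parent = list(Counter(labels).values())
        values = set(row[attr] for row in rows)
        children = [
            list(Counter(row[target] for row in rows if row[attr] == v).values())
            for v in values
        ]
        return information_gain(parent, children)

    # Step 2: pick the attribute with maximum information gain.
    best = max(attributes, key=gain)

    # Steps 3 and 4b: make a node for that attribute and recurse on every branch.
    node = {best: {}}
    for value in set(row[best] for row in rows):
        subset = [row for row in rows if row[best] == value]
        remaining = [a for a in attributes if a != best]
        node[best][value] = build_tree(subset, remaining, target)
    return node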
