Unit - III
Classification
❑ Classification and Prediction
❑ Basic concepts
❑ Decision tree induction
❑ Bayesian classification
❑ Rule–based classification
❑ Lazy learner.
Supervised vs. Unsupervised Learning (1)
❑ Supervised learning (classification)
❑ Supervision: The training data (observations or measurements) are accompanied by labels indicating the classes to which they belong
❑ New data is classified based on the model built from the training set
Training data with class labels (the training instances are fed to a learning algorithm, which builds the model):
age income student credit_rating buys_computer
<=30 high no fair no
<=30 high no excellent no
31…40 high no fair yes
>40 medium no fair yes
>40 low yes fair yes
>40 low yes excellent no
31…40 low yes excellent yes
<=30 medium no fair no
<=30 low yes fair yes
>40 medium yes fair yes
<=30 medium yes excellent yes
Test instances (fed to the learned model, which predicts a positive or negative class for each):
age income student credit_rating buys_computer
31…40 medium no excellent yes
31…40 high yes fair yes
>40 medium no excellent no
(Figure: training instances → learning → model; test instances → model → prediction.)
Supervised vs. Unsupervised Learning (2)
❑ Unsupervised learning (clustering)
❑ The class labels of training data are unknown
❑ Given a set of observations or measurements, establish the possible existence
of classes or clusters in the data
Prediction Problems: Classification vs. Numeric Prediction
❑ Classification
❑ Predict categorical class labels (discrete or nominal)
❑ Construct a model based on the training set and the class labels (the values in a
classifying attribute) and use it in classifying new data
❑ Numeric prediction
❑ Model continuous-valued functions (i.e., predict unknown or missing values)
❑ Typical applications of classification
❑ Credit/loan approval
❑ Medical diagnosis: whether a tumor is cancerous or benign
❑ Fraud detection: whether a transaction is fraudulent
❑ Web page categorization: which category a page belongs to
Classification—Model Construction, Validation and Testing
❑ Model construction
❑ Each sample is assumed to belong to a predefined class (as indicated by its class label)
❑ The set of samples used for model construction is the training set
❑ Model: represented as decision trees, rules, mathematical formulas, or other forms
❑ Model Validation and Testing
❑ Test: estimate the accuracy of the model
❑ The known label of each test sample is compared with the class predicted by the model
❑ Accuracy: % of test-set samples that are correctly classified by the model
❑ The test set is independent of the training set
❑ Validation: if a held-out set is used to select or refine models, it is called a validation (or development) set
❑ Model Deployment: if the accuracy is acceptable, use the model to classify new data (see the code sketch below)
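To make this construction/validation/testing workflow concrete, here is a minimal sketch using scikit-learn; the library, the synthetic dataset, and the choice of a decision tree classifier are assumptions for illustration, not part of the slides.

```python
# Minimal train / test / deploy sketch (assumes scikit-learn is installed)
from sklearn.datasets import make_classification
from sklearn.model_selection import train_test_split
from sklearn.tree import DecisionTreeClassifier
from sklearn.metrics import accuracy_score

# Labeled samples: each observation comes with a class label (supervised learning)
X, y = make_classification(n_samples=300, n_features=5, random_state=0)

# Keep the test set independent of the training set
X_train, X_test, y_train, y_test = train_test_split(X, y, test_size=0.3, random_state=0)

# Model construction on the training set
model = DecisionTreeClassifier(max_depth=3, random_state=0).fit(X_train, y_train)

# Testing: compare the known labels of the test samples with the model's predictions
print("accuracy:", accuracy_score(y_test, model.predict(X_test)))

# Deployment: if the accuracy is acceptable, classify new, unlabeled data with model.predict(...)
```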
Decision Tree Induction: An Example
❑ Training data set: Who buys a computer? (adapted from the “Playing Tennis” example of R. Quinlan)
age income student credit_rating buys_computer
<=30 high no fair no
<=30 high no excellent no
31…40 high no fair yes
>40 medium no fair yes
>40 low yes fair yes
>40 low yes excellent no
31…40 low yes excellent yes
<=30 medium no fair no
<=30 low yes fair yes
>40 medium yes fair yes
<=30 medium yes excellent yes
31…40 medium no excellent yes
31…40 high yes fair yes
>40 medium no excellent no
❑ Decision tree construction: a top-down, recursive, divide-and-conquer process
❑ Resulting tree:
age? = <=30 → student? (no → Not-buy, yes → Buy)
age? = 31..40 → Buy
age? = >40 → credit_rating? (excellent → Not-buy, fair → Buy)
From Entropy to Info Gain: A Brief Review of Entropy
❑ Entropy (Information Theory)
❑ A measure of the uncertainty associated with a random variable
❑ Calculation: for a discrete random variable Y taking m distinct values {y1, y2, …, ym} with probabilities pi = P(Y = yi):
   H(Y) = −Σi pi log2(pi)
❑ Interpretation
❑ Higher entropy → higher uncertainty
❑ Lower entropy → lower uncertainty
❑ Conditional entropy: H(Y|X) = Σx P(X = x) H(Y|X = x)
(Figure: entropy of a binary variable (m = 2) as a function of p, maximal at p = 0.5. A small Python sketch of these definitions follows.)
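A minimal Python sketch of these definitions; the function names and the toy label lists are illustrative, not from the slides.

```python
from collections import Counter
from math import log2

def entropy(labels):
    """H(Y) = -sum_i p_i log2(p_i) over the observed label distribution."""
    n = len(labels)
    return -sum((c / n) * log2(c / n) for c in Counter(labels).values())

def conditional_entropy(xs, ys):
    """H(Y|X): entropy of Y within each group of identical X values, weighted by group size."""
    n = len(xs)
    groups = {}
    for x, y in zip(xs, ys):
        groups.setdefault(x, []).append(y)
    return sum(len(g) / n * entropy(g) for g in groups.values())

print(entropy(["yes", "no"]))             # 1.0   (binary variable with p = 0.5: maximum uncertainty)
print(entropy(["yes"] * 9 + ["no"] * 5))  # ~0.940 (the buys_computer class distribution)
```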
Information Gain: An Attribute Selection Measure
❑ Select the attribute with the highest information gain (used in the typical decision tree induction algorithms ID3/C4.5)
❑ Let pi be the probability that an arbitrary tuple in D belongs to class Ci, estimated by |Ci,D|/|D|
❑ Expected information (entropy) needed to classify a tuple in D:
   Info(D) = −Σi pi log2(pi)
❑ Information still needed after using attribute A to split D into v partitions D1, …, Dv:
   InfoA(D) = Σj (|Dj|/|D|) × Info(Dj)
❑ Information gained by branching on A:
   Gain(A) = Info(D) − InfoA(D)
Example: Attribute Selection with Information Gain
❑ Class P: buys_computer = “yes” (9 tuples); Class N: buys_computer = “no” (5 tuples)
   Info(D) = −(9/14) log2(9/14) − (5/14) log2(5/14) = 0.940
❑ Splitting on age partitions D into <=30 (2 yes, 3 no), 31…40 (4 yes, 0 no), and >40 (3 yes, 2 no):
   Infoage(D) = (5/14) I(2,3) + (4/14) I(4,0) + (5/14) I(3,2) = 0.694
   Gain(age) = Info(D) − Infoage(D) = 0.246
❑ Similarly, Gain(income) = 0.029, Gain(student) = 0.151, Gain(credit_rating) = 0.048
❑ age has the highest information gain, so it is chosen as the splitting attribute at the root
Decision Tree Induction: Algorithm
❑ Basic algorithm (a compact code sketch follows this slide)
❑ The tree is constructed in a top-down, recursive, divide-and-conquer manner
❑ At the start, all the training examples are at the root
❑ Examples are partitioned recursively based on selected attributes
❑ At each node, the splitting attribute is chosen based on the training examples at that node, using a heuristic or statistical measure (e.g., information gain)
❑ Conditions for stopping partitioning
❑ All samples at a given node belong to the same class
❑ There are no remaining attributes for further partitioning
❑ There are no samples left
❑ Prediction
❑ Majority voting among the training examples at a leaf determines its class label
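The following is a minimal ID3-style sketch of this algorithm in Python, using information gain as the selection measure; the function and variable names are illustrative, and the data is the buys_computer table from the earlier slides.

```python
from collections import Counter
from math import log2

def entropy(rows, target):
    n = len(rows)
    return -sum(c / n * log2(c / n) for c in Counter(r[target] for r in rows).values())

def info_gain(rows, attr, target):
    n, parts = len(rows), {}
    for r in rows:
        parts.setdefault(r[attr], []).append(r)
    return entropy(rows, target) - sum(len(p) / n * entropy(p, target) for p in parts.values())

def build_tree(rows, attrs, target):
    labels = [r[target] for r in rows]
    # Stopping conditions: all samples in one class, or no attributes left -> majority vote
    if len(set(labels)) == 1 or not attrs:
        return Counter(labels).most_common(1)[0][0]
    best = max(attrs, key=lambda a: info_gain(rows, a, target))   # attribute selection
    branches = {}
    for value in set(r[best] for r in rows):                      # divide and conquer
        subset = [r for r in rows if r[best] == value]
        branches[value] = build_tree(subset, [a for a in attrs if a != best], target)
    return (best, branches)

cols = ["age", "income", "student", "credit_rating", "buys_computer"]
data = [dict(zip(cols, row)) for row in [
    ("<=30", "high", "no", "fair", "no"), ("<=30", "high", "no", "excellent", "no"),
    ("31..40", "high", "no", "fair", "yes"), (">40", "medium", "no", "fair", "yes"),
    (">40", "low", "yes", "fair", "yes"), (">40", "low", "yes", "excellent", "no"),
    ("31..40", "low", "yes", "excellent", "yes"), ("<=30", "medium", "no", "fair", "no"),
    ("<=30", "low", "yes", "fair", "yes"), (">40", "medium", "yes", "fair", "yes"),
    ("<=30", "medium", "yes", "excellent", "yes"), ("31..40", "medium", "no", "excellent", "yes"),
    ("31..40", "high", "yes", "fair", "yes"), (">40", "medium", "no", "excellent", "no")]]
print(build_tree(data, cols[:-1], "buys_computer"))  # root split on 'age', as on the earlier slide
```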
How to Handle Continuous-Valued Attributes?
❑ Method 1: Discretize continuous values and treat them as categorical values
❑ E.g., age: < 20, 20..30, 30..40, 40..50, > 50
❑ Method 2: Determine the best split point for a continuous-valued attribute A (see the sketch below)
❑ Sort the values of A in increasing order, e.g., 15, 18, 21, 22, 24, 25, 29, 31, …
❑ Candidate split points: the midpoint between each pair of adjacent values
❑ (ai + ai+1)/2 is the midpoint between the values of ai and ai+1
❑ E.g., (15+18)/2 = 16.5, then 19.5, 21.5, 23, 24.5, 27, 30, …
❑ The point with the maximum information gain for A is selected as the split point for A
❑ Split: based on split point P
❑ The set of tuples in D satisfying A ≤ P vs. those with A > P
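A minimal sketch of Method 2 in Python; the toy ages and labels are made up for illustration.

```python
from collections import Counter
from math import log2

def entropy(labels):
    n = len(labels)
    return -sum(c / n * log2(c / n) for c in Counter(labels).values())

def best_split_point(values, labels):
    """Try the midpoint of every pair of adjacent sorted values; keep the one
    that yields the highest information gain for the binary split A <= P vs. A > P."""
    pairs = sorted(zip(values, labels))
    xs = [v for v, _ in pairs]
    ys = [y for _, y in pairs]
    base, n = entropy(ys), len(ys)
    best_p, best_gain = None, -1.0
    for i in range(n - 1):
        if xs[i] == xs[i + 1]:
            continue  # no midpoint between equal values
        p = (xs[i] + xs[i + 1]) / 2
        left, right = ys[:i + 1], ys[i + 1:]
        gain = base - (len(left) / n) * entropy(left) - (len(right) / n) * entropy(right)
        if gain > best_gain:
            best_p, best_gain = p, gain
    return best_p, best_gain

ages   = [15, 18, 21, 22, 24, 25, 29, 31]
labels = ["no", "no", "no", "yes", "yes", "yes", "yes", "no"]
print(best_split_point(ages, labels))  # for these toy labels, the best split point is 21.5
```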
Gain Ratio: A Refined Measure for Attribute Selection
❑ The information gain measure is biased towards attributes with a large number of values
❑ Gain ratio overcomes this problem (as a normalization of information gain):
   SplitInfoA(D) = −Σj (|Dj|/|D|) log2(|Dj|/|D|)
   GainRatio(A) = Gain(A) / SplitInfoA(D)
❑ The attribute with the maximum gain ratio is selected as the splitting attribute
❑ Gain ratio is used in the popular algorithm C4.5 (a successor of ID3) by R. Quinlan
❑ Example: income splits D into 4 “low”, 6 “medium”, and 4 “high” tuples
   SplitInfoincome(D) = −(4/14) log2(4/14) − (6/14) log2(6/14) − (4/14) log2(4/14) = 1.557
   GainRatio(income) = 0.029 / 1.557 = 0.019
(A short computational check follows.)
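A few lines of Python to verify the example; the helper name split_info is illustrative.

```python
from math import log2

def split_info(partition_sizes):
    """SplitInfo_A(D) = -sum_j (|Dj|/|D|) log2(|Dj|/|D|) over the partitions induced by A."""
    n = sum(partition_sizes)
    return -sum(s / n * log2(s / n) for s in partition_sizes)

si = split_info([4, 6, 4])      # income: 4 low, 6 medium, 4 high
print(round(si, 3))             # 1.557
print(round(0.029 / si, 3))     # 0.019  (GainRatio(income) = Gain(income) / SplitInfo_income(D))
```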
Another Measure: Gini Index
❑ Gini index: used in CART, and also in IBM IntelligentMiner
❑ If a data set D contains examples from n classes, the gini index gini(D) is defined as
   gini(D) = 1 − Σj pj², where pj is the relative frequency of class j in D
❑ If D is split on attribute A into two subsets D1 and D2, the gini index of the split is defined as
   giniA(D) = (|D1|/|D|) gini(D1) + (|D2|/|D|) gini(D2)
❑ Reduction in impurity:
   Δgini(A) = gini(D) − giniA(D)
❑ The attribute that provides the smallest ginisplit(D) (or, equivalently, the largest reduction in impurity) is chosen to split the node (this requires enumerating all possible splitting points for each attribute)
Computation of Gini Index
❑ Example: D has 9 tuples with buys_computer = “yes” and 5 with “no”
❑ Suppose the attribute income partitions D into 10 tuples in D1: {low, medium} and 4 tuples in D2: {high}
   giniincome ∈ {low,medium}(D) = (10/14) gini(D1) + (4/14) gini(D2)
      = (10/14)(1 − (7/10)² − (3/10)²) + (4/14)(1 − (2/4)² − (2/4)²)
      = 0.443
      = giniincome ∈ {high}(D)
❑ Gini{low,high} is 0.458 and Gini{medium,high} is 0.450
❑ Thus, split on {low, medium} (vs. {high}), since it has the lowest Gini index (a code check follows)
❑ All attributes are assumed continuous-valued
❑ May need other tools, e.g., clustering, to get the possible split values
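A small sketch that reproduces these numbers; the (yes, no) class counts per income subset are read off the buys_computer table, and the function names are illustrative.

```python
def gini(counts):
    """gini(D) = 1 - sum_j p_j^2, given the class counts of one partition."""
    n = sum(counts)
    return 1 - sum((c / n) ** 2 for c in counts)

def gini_split(partitions):
    """Weighted gini index of a split; partitions = list of per-subset class-count lists."""
    n = sum(sum(p) for p in partitions)
    return sum(sum(p) / n * gini(p) for p in partitions)

# (yes, no) counts per income subset of the buys_computer table
print(f"{gini_split([[7, 3], [2, 2]]):.3f}")  # {low,medium} vs {high}    -> 0.443
print(f"{gini_split([[5, 3], [4, 2]]):.3f}")  # {low,high}   vs {medium}  -> 0.458
print(f"{gini_split([[6, 4], [3, 1]]):.3f}")  # {medium,high} vs {low}    -> 0.450
```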
Other Attribute Selection Measures
❑ Minimal Description Length (MDL) principle
❑ Philosophy: The simplest solution is preferred
❑ The best tree as the one that requires the fewest # of bits to both (1) encode
the tree, and (2) encode the exceptions to the tree
❑ CHAID: a popular decision tree algorithm, measure based on χ2 test for
independence
❑ Multivariate splits (partition based on multiple variable combinations)
❑ CART: finds multivariate splits based on a linear combination of attributes
❑ There are many other measures proposed in research and applications
❑ E.g., G-statistics, C-SEP
❑ Which attribute selection measure is the best?
❑ Most give good results; none is significantly superior to the others
Overfitting and Tree Pruning
❑ Overfitting: An induced tree may overfit the training data
❑ Too many branches, some may reflect anomalies due to noise or
outliers
❑ Poor accuracy for unseen samples
❑ Two approaches to avoid overfitting
❑ Prepruning: Halt tree construction early; do not split a node if this would result in the goodness measure falling below a threshold
❑ Difficult to choose an appropriate threshold
❑ Postpruning: Remove branches from a “fully grown” tree, yielding a sequence of progressively pruned trees
❑ Use a set of data different from the training data to decide which is the “best pruned tree”
What Is Bayesian Classification?
❑ A statistical classifier
❑ Performs probabilistic prediction (i.e., predicts class membership probabilities)
❑ Foundation—Based on Bayes’ Theorem
❑ Performance
❑ A simple Bayesian classifier, naïve Bayesian classifier, has comparable
performance with decision tree and selected neural network classifiers
❑ Incremental
❑ Each training example can incrementally increase/decrease the probability that
a hypothesis is correct—prior knowledge can be combined with observed data
❑ Theoretical Standard
❑ Even when Bayesian methods are computationally intractable, they can provide
a standard of optimal decision making against which other methods can be
measured
Bayes’ Theorem: Basics
❑ Total probability theorem:
   P(B) = Σi P(B | Ai) P(Ai)
❑ Bayes’ theorem (X is a data tuple, the “evidence”; H is a hypothesis, e.g., that X belongs to class C):
   P(H | X) = P(X | H) P(H) / P(X) ∝ P(X | H) P(H)
Naïve Bayes Classifier: Training Dataset
❑ Training data: the 14-tuple buys_computer table (attributes age, income, student, credit_rating; class buys_computer) used throughout this unit
❑ Tuple to classify: X = (age <= 30, income = medium, student = yes, credit_rating = fair)
Naïve Bayes Classifier: An Example
❑ Training data: the buys_computer table (9 “yes” tuples, 5 “no” tuples)
❑ P(Ci):
   P(buys_computer = “yes”) = 9/14 = 0.643
   P(buys_computer = “no”) = 5/14 = 0.357
❑ Compute P(X|Ci) for each class:
   P(age = “<=30” | buys_computer = “yes”) = 2/9 = 0.222
   P(age = “<=30” | buys_computer = “no”) = 3/5 = 0.6
   P(income = “medium” | buys_computer = “yes”) = 4/9 = 0.444
   P(income = “medium” | buys_computer = “no”) = 2/5 = 0.4
   P(student = “yes” | buys_computer = “yes”) = 6/9 = 0.667
   P(student = “yes” | buys_computer = “no”) = 1/5 = 0.2
   P(credit_rating = “fair” | buys_computer = “yes”) = 6/9 = 0.667
   P(credit_rating = “fair” | buys_computer = “no”) = 2/5 = 0.4
❑ For X = (age <= 30, income = medium, student = yes, credit_rating = fair):
   P(X | buys_computer = “yes”) = 0.222 × 0.444 × 0.667 × 0.667 = 0.044
   P(X | buys_computer = “no”) = 0.6 × 0.4 × 0.2 × 0.4 = 0.019
   P(X | buys_computer = “yes”) × P(buys_computer = “yes”) = 0.028
   P(X | buys_computer = “no”) × P(buys_computer = “no”) = 0.007
❑ Therefore, X belongs to the class buys_computer = “yes” (a code sketch of this computation follows)
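A minimal sketch of this computation in Python; the function names are illustrative, and the counts come from the buys_computer table above.

```python
from collections import Counter, defaultdict

def train_naive_bayes(rows, target):
    """Estimate P(C) and P(attribute = value | C) by relative frequencies."""
    class_counts = Counter(r[target] for r in rows)
    cond = defaultdict(Counter)          # (attribute, class) -> counts of attribute values
    for r in rows:
        for attr, val in r.items():
            if attr != target:
                cond[(attr, r[target])][val] += 1
    return class_counts, cond, len(rows)

def predict(x, class_counts, cond, n):
    """Return argmax_C P(C) * prod_k P(x_k | C), together with the scores."""
    scores = {}
    for c, c_count in class_counts.items():
        p = c_count / n
        for attr, val in x.items():
            p *= cond[(attr, c)][val] / c_count
        scores[c] = p
    return max(scores, key=scores.get), scores

cols = ["age", "income", "student", "credit_rating", "buys_computer"]
data = [dict(zip(cols, row)) for row in [
    ("<=30", "high", "no", "fair", "no"), ("<=30", "high", "no", "excellent", "no"),
    ("31..40", "high", "no", "fair", "yes"), (">40", "medium", "no", "fair", "yes"),
    (">40", "low", "yes", "fair", "yes"), (">40", "low", "yes", "excellent", "no"),
    ("31..40", "low", "yes", "excellent", "yes"), ("<=30", "medium", "no", "fair", "no"),
    ("<=30", "low", "yes", "fair", "yes"), (">40", "medium", "yes", "fair", "yes"),
    ("<=30", "medium", "yes", "excellent", "yes"), ("31..40", "medium", "no", "excellent", "yes"),
    ("31..40", "high", "yes", "fair", "yes"), (">40", "medium", "no", "excellent", "no")]]

X = {"age": "<=30", "income": "medium", "student": "yes", "credit_rating": "fair"}
print(predict(X, *train_naive_bayes(data, "buys_computer")))  # ('yes', {'yes': ~0.028, 'no': ~0.007})
```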
Avoiding the Zero-Probability Problem
❑ Naïve Bayesian prediction requires each conditional probability be non-zero; otherwise, the predicted probability will be zero:
   P(X | Ci) = Πk P(xk | Ci) = P(x1 | Ci) × P(x2 | Ci) × … × P(xn | Ci)
❑ Example: suppose a dataset with 1000 tuples where income = low (0 tuples), income = medium (990), and income = high (10)
❑ Use the Laplacian correction (or Laplacian estimator): add 1 to each case
   Prob(income = low) = 1/(1000 + 3)
   Prob(income = medium) = (990 + 1)/(1000 + 3)
   Prob(income = high) = (10 + 1)/(1000 + 3)
❑ The “corrected” probability estimates are close to their “uncorrected” counterparts (see the snippet below)
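A few lines of Python for the Laplacian correction; the helper name laplace_probs is illustrative.

```python
def laplace_probs(counts, k=1):
    """Laplacian correction: add k to every count before normalizing."""
    total = sum(counts.values()) + k * len(counts)
    return {value: (c + k) / total for value, c in counts.items()}

income_counts = {"low": 0, "medium": 990, "high": 10}
for value, p in laplace_probs(income_counts).items():
    print(value, f"{p:.4f}")   # low 0.0010, medium 0.9880, high 0.0110 -- no zero probabilities
```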
Naïve Bayes Classifier: Strength vs. Weakness
❑ Strength
❑ Easy to implement
❑ Good results obtained in most of the cases
❑ Weakness
❑ Assumption: class-conditional independence of attributes, which causes a loss of accuracy
❑ In practice, dependencies exist among variables
❑ E.g., Patients: Profile: age, family history, etc.
Symptoms: fever, cough etc.
Disease: lung cancer, diabetes, etc.
❑ Dependencies among these cannot be modeled by Naïve Bayes Classifier
❑ How to deal with these dependencies?
❑ Use Bayesian Belief Networks (to be covered in the next chapter)
Lazy vs. Eager Learning
❑ Lazy vs. eager learning
❑ Lazy learning (e.g., instance-based learning): simply stores the training data (or performs only minor processing) and waits until it is given a test tuple
❑ Eager learning (the methods discussed above): given a set of training tuples, constructs a classification model before receiving new (e.g., test) data to classify
❑ Lazy learners spend less time in training but more time in prediction
❑ Accuracy
❑ Lazy method effectively uses a richer hypothesis space since it uses many local
linear functions to form an implicit global approximation to the target function
❑ Eager: must commit to a single hypothesis that covers the entire instance space
Lazy Learner: Instance-Based Methods
❑ Instance-based learning:
❑ Store training examples and delay the processing (“lazy evaluation”) until a
new instance must be classified
❑ Typical approaches
❑ k-nearest neighbor approach
❑ Instances represented as points in a Euclidean space.
❑ Locally weighted regression
❑ Constructs local approximation
❑ Case-based reasoning
❑ Uses symbolic representations and knowledge-based inference
The k-Nearest Neighbor Algorithm
❑ All instances correspond to points in an n-D space
❑ The nearest neighbors are defined in terms of Euclidean distance, dist(X1, X2)
❑ The target function can be discrete- or real-valued
❑ For discrete-valued targets, k-NN returns the most common value among the k training examples nearest to xq
❑ Voronoi diagram: the decision surface induced by 1-NN for a typical set of training examples
(Figure: Voronoi cells around the “+” and “−” training points, with query point xq. A code sketch follows.)
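A minimal k-NN sketch in Python; the 2-D coordinates for the “+” and “−” points are made up for illustration.

```python
from collections import Counter
from math import dist  # Euclidean distance (Python 3.8+)

def knn_classify(query, examples, k=3):
    """Majority vote among the k training examples closest to the query point.
    examples: list of (point, label) pairs; points are equal-length numeric tuples."""
    neighbors = sorted(examples, key=lambda e: dist(query, e[0]))[:k]
    return Counter(label for _, label in neighbors).most_common(1)[0][0]

# Toy 2-D data with '+' and '-' classes, as in the slide's sketch
train = [((1, 1), "+"), ((1, 2), "+"), ((2, 2), "+"),
         ((5, 5), "-"), ((6, 5), "-"), ((6, 6), "-")]
print(knn_classify((2, 1), train, k=3))  # '+'
print(knn_classify((5, 6), train, k=3))  # '-'
```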
Discussion on the k-NN Algorithm
❑ k-NN for real-valued prediction for a given unknown tuple
❑ Returns the mean values of the k nearest neighbors
❑ Distance-weighted nearest neighbor algorithm (see the sketch below)
❑ Weight the contribution of each of the k neighbors according to its distance to the query xq
❑ Give greater weight to closer neighbors
❑ Robust to noisy data by averaging the k nearest neighbors
❑ Curse of dimensionality: the distance between neighbors can be dominated by irrelevant attributes
❑ To overcome it, stretch the axes or eliminate the least relevant attributes
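A short sketch of distance-weighted k-NN; the 1/d² weighting scheme and the toy data are illustrative choices, not prescribed by the slides.

```python
from collections import defaultdict
from math import dist

def weighted_knn_classify(query, examples, k=3):
    """Distance-weighted k-NN: each of the k nearest neighbors votes with weight 1/d^2."""
    neighbors = sorted(examples, key=lambda e: dist(query, e[0]))[:k]
    votes = defaultdict(float)
    for point, label in neighbors:
        d = dist(query, point)
        if d == 0:
            return label          # exact match: return its label directly
        votes[label] += 1.0 / d ** 2
    return max(votes, key=votes.get)

train = [((1, 1), "+"), ((2, 2), "+"), ((6, 6), "-")]
print(weighted_knn_classify((2, 1), train, k=3))  # '+' (the two nearby '+' points outweigh the far '-')
```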
Case-Based Reasoning (CBR)
❑ CBR: Uses a database of problem solutions to solve new problems
❑ Store symbolic description (tuples or cases)—not points in a Euclidean space
❑ Applications: Customer-service (product-related diagnosis), legal ruling
❑ Methodology
❑ Instances represented by rich symbolic descriptions (e.g., function graphs)
❑ Search for similar cases, multiple retrieved cases may be combined
❑ Tight coupling between case retrieval, knowledge-based reasoning, and problem
solving
❑ Challenges
❑ Finding a good similarity metric
❑ Indexing based on a syntactic similarity measure and, when retrieval fails, backtracking and adapting to additional cases
END OF UNIT - III