0% found this document useful (0 votes)
391 views2 pages

Assignment 3: 1. Consider A Binary Classification Problem With The Following Set of Attributes and

This document contains 5 assignments related to classification algorithms: 1. Analyzing properties of a rule-based classifier and its rules. 2. Evaluating candidate rules for a binary classification problem based on different measures. 3. Calculating probabilities from data about student smoking rates and enrollment levels. 4. Estimating conditional probabilities from sample data and using naive Bayes classification. 5. Determining linear separability of Boolean functions.

Uploaded by

Saptarick Mishra
Copyright
© © All Rights Reserved
We take content rights seriously. If you suspect this is your content, claim it here.
Available Formats
Download as PDF, TXT or read online on Scribd
0% found this document useful (0 votes)
391 views2 pages

Assignment 3: 1. Consider A Binary Classification Problem With The Following Set of Attributes and

This document contains 5 assignments related to classification algorithms: 1. Analyzing properties of a rule-based classifier and its rules. 2. Evaluating candidate rules for a binary classification problem based on different measures. 3. Calculating probabilities from data about student smoking rates and enrollment levels. 4. Estimating conditional probabilities from sample data and using naive Bayes classification. 5. Determining linear separability of Boolean functions.

Uploaded by

Saptarick Mishra
Copyright
© © All Rights Reserved
We take content rights seriously. If you suspect this is your content, claim it here.
Available Formats
Download as PDF, TXT or read online on Scribd
You are on page 1/ 2

Assignment 3

1. Consider a binary classification problem with the following set of attributes and
attribute values:
• Air Conditioner = {Working, Broken}
• Engine = {Good, Bad}
• Mileage = {High, Medium, Low}
• Rust = {Yes, No}
Suppose a rule-based classifier produces the following rule set:
Mileage = High -→ Value = Low
Mileage = Low -→ Value = High
Air Conditioner = Working, Engine = Good -→ Value = High
Air Conditioner = Working, Engine = Bad -→ Value = Low
Air Conditioner = Broken -→ Value = Low
(a) Are the rules mutually exclusives?
(b) Is the rule set exhaustive?
(c) Is ordering needed for this set of rules?
(d) Do you need a default class for the rule set?

2. Consider a training set that contains 100 positive examples and 400 negative
examples. For each of the following candidate rules,
R1: A -→ + (covers 4 positive and 1 negative examples),
R2: B -→ + (covers 30 positive and 10 negative examples),
R3: C -→ + (covers 100 positive and 90 negative examples),
determine which is the best and worst candidate rule according to:

I. FOIL’s information gain.


II. The likelihood ratio statistic
III. The Laplace measure.
IV. The m-estimate measure (with k = 2 and p+ = 0.2).
V. Rule accuracy.
3. (a) Suppose the fraction of undergraduate students who smoke is 15% and the
fraction of graduate students who smoke is 23%. If one-fifth of the college students
are graduate students and the rest are undergraduates, what is the probability that a
student who smokes is a graduate student?
(b) Given the information in part (a), is a randomly chosen college student more
likely to be a graduate or undergraduate student?
(c) Repeat part (b) assuming that the student is a smoker.
(d) Suppose 30% of the graduate students live in a dorm but only 10% of the
undergraduate students live in a dorm. If a student smokes and lives in the dorm, is
he or she more likely to be a graduate or undergraduate student? You can assume
independence between students who live in a dorm and those who smoke.
4. Consider the data set shown in Table 1
(a) Estimate the conditional probabilities for P(A|+), P(B|+), P(C|+), P(A|-), P(B|-),
and P(C|-).
(b) Use the estimate of conditional probabilities given in the previous question to
predict the class label for a test sample (A = 0, B = 1, C = 0) using the naive Bayes
approach.
(c) Estimate the conditional probabilities using the m-estimate approach, with p =
1/2 and m = 4.
(d) Repeat part (b) using the conditional probabilities given in part (c).

Table 1
5. For each of the Boolean functions given below, state whether the problem is linearly
separable.
(a) A AND B AND C
(b) NOT A AND B
(c) (A OR B) AND (A OR C)
(d) (A XOR B) AND (A OR B)

You might also like

pFad - Phonifier reborn

Pfad - The Proxy pFad of © 2024 Garber Painting. All rights reserved.

Note: This service is not intended for secure transactions such as banking, social media, email, or purchasing. Use at your own risk. We assume no liability whatsoever for broken pages.


Alternative Proxies:

Alternative Proxy

pFad Proxy

pFad v3 Proxy

pFad v4 Proxy