Data Mining Exam

Data mining questions examples

Uploaded by

mariazaqout377


2024/8/31 Data Mining Final Exam Solutions

By: Mohamed Suhail El-Ejel

The Apriori principle states that if an itemset is frequent, then all of its subsets must also be frequent.
Question 1 Answer

a. FALSE

b. TRUE
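
The statement in Question 1 can be checked numerically. The sketch below uses a small, made-up set of transactions (not taken from the exam) and an assumed minsup of 0.4. It shows that no subset of an itemset can have lower support than the itemset itself, which is what the Apriori principle relies on.

```python
from itertools import combinations

# Hypothetical toy transactions, invented purely for illustration.
transactions = [
    {"bread", "milk"},
    {"bread", "diapers", "beer", "eggs"},
    {"milk", "diapers", "beer", "cola"},
    {"bread", "milk", "diapers", "beer"},
    {"bread", "milk", "diapers", "cola"},
]

def support(itemset, transactions):
    """Fraction of transactions that contain every item in `itemset`."""
    itemset = set(itemset)
    return sum(itemset <= t for t in transactions) / len(transactions)

itemset = ("bread", "milk", "diapers")
minsup = 0.4  # assumed threshold, not from the exam

# All non-empty proper subsets of the itemset.
subsets = [s for r in range(1, len(itemset)) for s in combinations(itemset, r)]

# A subset appears in every transaction that contains the full itemset,
# so its support can never be smaller.
subsets_at_least_as_frequent = all(
    support(s, transactions) >= support(itemset, transactions) for s in subsets
)
print(support(itemset, transactions) >= minsup, subsets_at_least_as_frequent)
```

Because support can only stay the same or grow when items are removed, a frequent itemset implies frequent subsets, and an infrequent subset rules out every superset.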

The “Religion Set” data field can best be defined as _______.

Question 2 Answer

a. ratio data

b. ordinal data

c. interval data

d. nominal data

The probability that an individual owns a horse, given that they subscribe to at least one royal equestrian
club, is 25%. We also know that 8% of the adult population subscribes to at least one royal equestrian club.
Finally, the probability that an individual owns a horse, given that they do not subscribe to any royal
equestrian club, is 15%. Use Bayes' theorem to compute the probability that an individual subscribes to at
least one royal equestrian club given that they own a horse.
Question 3 Answer

a. None of these

b. ≈ 0.13

c. ≈ 0.28

d. ≈ 0.34

e. ≈ 0.09
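
The calculation asked for in Question 3 can be sketched as follows. The function name `posterior` and its parameter names are illustrative choices; the three probabilities are the ones given in the question. P(E) is expanded with the law of total probability.

```python
def posterior(p_e_given_h, p_h, p_e_given_not_h):
    """P(H|E) by Bayes' theorem, with P(E) from the law of total probability."""
    p_e = p_e_given_h * p_h + p_e_given_not_h * (1 - p_h)
    return p_e_given_h * p_h / p_e

# H = subscribes to at least one club, E = owns a horse (values from the question).
p_club_given_horse = posterior(p_e_given_h=0.25, p_h=0.08, p_e_given_not_h=0.15)
print(round(p_club_given_horse, 2))  # 0.13
```

Written out, this is 0.25 × 0.08 / (0.25 × 0.08 + 0.15 × 0.92) = 0.02 / 0.158.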

If the training data classes are unknown, which of the following algorithms could be used to find useful classes?
Question 4 Answer

a. Clustering

b. Pruning Analysis

c. Bayesian Analysis

d. Binary Sort

Which of these types of variables is the set of odd integers from n = 5 to n = 41?
Question 5 Answer

a. Categorical

b. Independent

c. Interval

d. Dependent

e. Ordinal

Discretization is the process of converting a continuous attribute into a nominal attribute.
Question 7 Answer

a. FALSE

b. TRUE

In data mining classification, a false positive is an instance that the algorithm classifies as positive
although it is actually negative.
Question 8 Answer

a. TRUE

Transactions from market baskets show:
Question 9 Answer

a. data relationships

b. monthly customer purchases

c. daily customer purchase data

d. tea, sugar, and biscuit

When conducting mining operations, which of these data attributes is of interest?

Question 10 Answer

a. Dissimilarity between any given attribute of data items/objects in terms of the Supremum distance

b. Dissimilarity between two data items/objects in terms of the Hamming distance of the bits

c. Dissimilarity between points in terms of the Euclidean definition of distance

d. All of these
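
The three dissimilarity measures named in the options of Question 10 can be sketched in a few lines. The function names and the sample points are illustrative assumptions; the definitions are the standard ones (the supremum distance is also known as the Chebyshev distance).

```python
import math

def euclidean(p, q):
    """Straight-line distance between two numeric points."""
    return math.sqrt(sum((a - b) ** 2 for a, b in zip(p, q)))

def supremum(p, q):
    """Supremum (Chebyshev) distance: the largest difference on any single attribute."""
    return max(abs(a - b) for a, b in zip(p, q))

def hamming(x, y):
    """Number of positions at which two equal-length bit strings differ."""
    if len(x) != len(y):
        raise ValueError("bit strings must have the same length")
    return sum(a != b for a, b in zip(x, y))

# Hypothetical sample data.
p, q = (1, 2), (4, 6)
print(euclidean(p, q))            # 5.0
print(supremum(p, q))             # 4
print(hamming("10110", "11100"))  # 2
```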

For the rule set extracted from a decision tree, which statement is most true?
Question 11 Answer

a. Such rules are mutually exclusive, exhaustive, and unordered

b. Such rules are non-exclusive, exhaustive, and ordered

c. Logical OR exists between such rules, they are unordered

d. None of these

In prediction methods, which statement is true?
Question 12 Answer

a. The designed model is used to classify current behaviors

b. A numeric output/class attribute must be

c. A categorical output/class attribute must be

d. The designed model is used to determine future outcomes

Association analysis is a way of finding and grouping together sets of closely related observations.
Question 13 Answer

a. FALSE

b. TRUE

Proximity refers to similarity measure only.

Question 14 Answer

a. FALSE

b. TRUE

How many types of data mining functions are involved?

Question 15 Answer

a. 5

b. 4

c. 2

d. 3

A methodology useful for discovering interesting relationships within large data sets is _________.
Question 16 Answer

a. Algorithm

b. Data Mining

c. Association analysis

d. Big Data

K-means clustering requires the number of clusters to be specified in advance as an input.
Question 17 Answer

a. FALSE

b. TRUE

In a partition with 10 instances of a binary class (4 As and 6 Bs), the entropy, using log base 2, is:
Question 18 Answer

a. ≈ 0.88

b. ≈ 0.72

c. ≈ 0.47

d. ≈ 0.97
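
The entropy in Question 18 can be worked out with a short sketch. The helper name `entropy` is an illustrative choice; the counts (4 and 6) come from the question, and the formula is the usual Shannon entropy, H = -Σ p_i log2(p_i).

```python
import math

def entropy(counts):
    """Shannon entropy (base 2) of a class distribution given as raw counts."""
    total = sum(counts)
    # Empty classes contribute nothing, so skip them to avoid log2(0).
    return -sum((c / total) * math.log2(c / total) for c in counts if c > 0)

h = entropy([4, 6])
print(round(h, 2))  # 0.97
```

Written out, this is -(0.4 × log2 0.4 + 0.6 × log2 0.6) ≈ 0.529 + 0.442.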

When evaluating accuracy from a confusion matrix, add the true positives and true negatives, then divide by
the sum of the false negatives and false positives.

Question 19 Answer

FALSE

TRUE
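
For comparison with the statement in Question 19, here is a minimal sketch of accuracy under its standard definition, (TP + TN) / (TP + TN + FP + FN). The four counts are hypothetical values chosen for illustration, not data from the exam.

```python
def accuracy(tp, tn, fp, fn):
    """Standard accuracy: correct predictions divided by all predictions."""
    return (tp + tn) / (tp + tn + fp + fn)

# Hypothetical confusion-matrix counts.
tp, tn, fp, fn = 40, 45, 5, 10

acc = accuracy(tp, tn, fp, fn)
# The ratio described in the statement, which divides by the errors only.
ratio_in_statement = (tp + tn) / (fp + fn)

print(acc)                 # 0.85
print(ratio_in_statement)  # greater than 1 for these counts
```

With these counts, dividing by the errors alone gives a value above 1, so that ratio cannot be read as a proportion of correct predictions.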

Training examples whose attributes are relatively close to those of the test example are considered its
nearest neighbors.
Question 20 Answer

a. FALSE

b. TRUE

Which field of data mining applications analyzes information and establishes rules to differentiate between
specified classes?
Question 21 Answer

a. Visualization

b. Classification

c. Clustering

d. Associations

Cluster analysis is a way of finding patterns based on closely correlated characteristics in the data.
Question 22 Answer

a. FALSE

b. TRUE

Ratio data is a categorical data type.
Question 23 Answer

a. TRUE

b. FALSE

_________ is the basis for the existing decision tree algorithms ID3, C4.5, and CART.
Question 24 Answer

a. Gini index

b. Hunt’s Algorithm

c. ID4

d. Information gain

A graphical evaluation approach for binary classification models in which the true positive rate is plotted on
the y-axis and the false positive rate on the x-axis.
Question 25 Answer

a. Ratio data

b. Decision tree

c. Distance measure

d. Area under the ROC curve

A decision tree is a predictive model.

Question 26 Answer

TRUE

__________ are quantitative attributes.
Question 27 Answer

a. Random

b. Alphabetical

c. Numeric

d. Nominal

Suppose we have a dataset containing the details of 200 people. One hundred of these people have paid for car
insurance. The following rule was discovered by a supervised data mining session:
IF age ≥ 18 & driving license = yes
THEN vehicle insurance = yes
Rule Precision: 80%
Rule Coverage: 40%
How many people in the class vehicle insurance = no have a driving license and are at least 18 years old?

Question 28 Answer

a. 120

b. 80

c. 16

d. 64

e. 8
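
The arithmetic behind Question 28 depends on how "rule coverage" is defined, and the exam does not say. The sketch below assumes that coverage is the share of all 200 records that satisfy the rule's IF part, and that precision is the share of those matching records with vehicle insurance = yes. Under a different definition of coverage (for example, relative to the insured class only) the result would differ.

```python
total_people = 200
coverage = 0.40   # ASSUMPTION: share of all records matching the IF part
precision = 0.80  # share of matching records with vehicle insurance = yes

# Records with age >= 18 and a driving license.
matching = round(total_people * coverage)
# Of those, records correctly predicted as insured.
matching_insured = round(matching * precision)
# The remainder match the IF part but fall in the class vehicle insurance = no.
matching_uninsured = matching - matching_insured

print(matching, matching_insured, matching_uninsured)  # 80 64 16
```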

A _____________ shows correctly and incorrectly predicted counts of test records by a classification model.
Question 29 Answer

a. decision tree

b. learning model

c. attribute class

d. confusion matrix

Using the confusion matrix below, what are the accuracy and the precision of the classifier respectively?

A pattern that does not satisfy a minsup threshold is called:

Question 31 Answer

a. Infrequent/Rare

b. Frequent/Regular

c. Small/Minimal

A random error or variation in calculated variables is noise.
Question 32 Answer

a. TRUE

b. FALSE

In Bayes' theorem, the probability of hypothesis H, denoted P(H), is referred to as ___________.

Question 33 Answer

a conditional probability

an a priori probability

a posterior probability

a bidirectional probability

Classification of data is a ______-step process.

Question 34 Answer

a. three

b. two

c. four

d. one

The _____________ strategy aims to find all the itemsets that satisfy the minimum support (minsup) threshold.
Question 35 Answer

a. Frequent Itemset Generation

b. Association Rule Discovery

c. Rule Generation

d. Rule-pruning

To deal with missing data items during the learning process, some data mining techniques __________.
Question 36 Answer

a. replace missing items of real-value data with class means

b. remove records with missing data

c. replace missing values of attributes with values found in related instances

d. ignore missing attribute values

When calculating the accuracy of data mining classification models, the true positive rate is __________.
Question 37 Answer

a. the ratio of correctly classified positives divided by the sum of correctly classified positives and
incorrectly classified positives

b. the ratio of correctly classified positives divided by the total positive count

c. the ratio of correctly classified positives divided by the sum of correctly classified positives and
incorrectly classified negatives

d. the ratio of correctly classified negatives divided by the total negative count
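
As a reference point for Question 37, the true positive rate (also called recall or sensitivity) is commonly defined as TP / (TP + FN), that is, correctly classified positives over all actual positives. A minimal sketch with hypothetical counts:

```python
def true_positive_rate(tp, fn):
    """TP / (TP + FN): share of actual positives that the model identified."""
    actual_positives = tp + fn
    return tp / actual_positives

# Hypothetical counts: 40 positives found, 10 positives missed.
tpr = true_positive_rate(tp=40, fn=10)
print(tpr)  # 0.8
```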

To apply Bayes' theorem, the following relationship must hold between hypothesis H and evidence E.
Question 38 Answer

a. P(H|E) + P(H|~E) = 1

b. P(H|E) + P(~H|E) = 1

c. P(H|E) + P(H|~E) = 0

d. P(H|E) + P(~H|E) = 0

“If an itemset is frequent, then all of its subsets must also be frequent” is referred to as:
Question 39 Answer

a. Apriori Principle

b. A theorem which can never be proven

c. The key to understanding market basket analysis

d. All of these

In the data classification process, which of these terms describes the first major task?
Question 40 Answer

a. Choose training data

b. Classify

c. Learning

d. Data preprocessing

The ______________ strategy aims to extract all the high-confidence rules from the frequent itemsets found in
the Frequent Itemset Generation step.
Question 41 Answer

a. Association Rule

b. Rule Generation

c. Association Analysis

d. Association Generation

Assume that a group of 900 individuals has been surveyed. Evaluate the following survey observations:
participants who read history books only = 400, participants who read non-history books only = 150, and
participants who read both = 100. What is the confidence of the rule reads (X, "history books") → reads (X,
"non-history books")?
Question 42 Answer

a. 30%

b. None of these

c. 20%

d. 25%

e. 15%
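
The confidence in Question 42 can be sketched as support(history and non-history) divided by support(history). The variable names are illustrative; the counts come from the question, read so that the "only" groups exclude participants who read both kinds of book.

```python
surveyed = 900
history_only = 400
non_history_only = 150
both = 100

# Everyone who reads history books, whether or not they read anything else.
reads_history = history_only + both

support_history = reads_history / surveyed
support_both = both / surveyed

# confidence(history -> non-history) = support(both) / support(history)
confidence = support_both / support_history
print(round(confidence, 2))  # 0.2
```

The total of 900 cancels out, so the confidence reduces to 100 / 500.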

