0% found this document useful (0 votes)
65 views2 pages

Reg. No.: Name:: Architecture You Would Choose. What Is The Purpose of Each Component of This Architecture?

This document contains a 10 question model question paper for a Data Mining Techniques course. The questions cover a range of topics in data mining including: 1. Designing a data mining system architecture to examine university course data. 2. Examples of problems that can be solved using association rule mining, classification, and clustering. 3. Using the Apriori algorithm to find frequent itemsets in market basket data.

Uploaded by

Delvin company
Copyright
© © All Rights Reserved
We take content rights seriously. If you suspect this is your content, claim it here.
Available Formats
Download as PDF, TXT or read online on Scribd
0% found this document useful (0 votes)
65 views2 pages

Reg. No.: Name:: Architecture You Would Choose. What Is The Purpose of Each Component of This Architecture?

This document contains a 10 question model question paper for a Data Mining Techniques course. The questions cover a range of topics in data mining including: 1. Designing a data mining system architecture to examine university course data. 2. Examples of problems that can be solved using association rule mining, classification, and clustering. 3. Using the Apriori algorithm to find frequent itemsets in market basket data.

Uploaded by

Delvin company
Copyright
© © All Rights Reserved
We take content rights seriously. If you suspect this is your content, claim it here.
Available Formats
Download as PDF, TXT or read online on Scribd
You are on page 1/ 2

Reg. No.

:
Name :

Model Question Paper


Programme : M.Tech Software Engineering (Integrated) Semester : Fall 2021 -22
Code : SWE2009
Course : Data Mining Techniques
Faculty : Prof. Pattabiraman.V Slot(s) : A2+TA2

Answer ALL Questions

Q.No. Question Description Marks.

1. Suppose your task as a software engineer at VIT-University is to design a data mining 10


system to examine their university course database, which contains the following
information: the name, address, and status (e.g., undergraduate or graduate) of each student,
the courses taken, and their cumulative grade point average (CGPA). Describe the
architecture you would choose. What is the purpose of each component of this architecture?
2. Consider some problems in which the solution can be obtained through data mining 10
techniques like association rule mining, classification, and clustering? Give examples and
explain.
3. Consider the Market Basket Data in the table given below to illustrate the mechanism of 10
frequent itemset mining, let assume min-support=60% and min_confidence=80%. Find all
frequent itemsets using Apriori Algorithm.
Transaction ID Items bought
101 Tomato, Potato, Onion
102 Tomato, Potato, Brinjal, Pumpkin
103 Tomato, Potato, Onion, Chilly
104 Lemon, Tamarind, Chilly
105 Tomato, Potato, Brinjal,
106 Potato, Brinjal, Onion, Chilly
107 Tomato, Potato, Onion, Chilly
4. Consider the sample dataset given below, apply FP-Growth algorithm to find the frequent 10
pattern. Illustrate the FP-tree construction and mining the FP-Tree with algorithm.
TID Items_bought

10 f, a, c, d, g, I, m, p
20 a, b, c, f, l, m, o
30 b, f, h, I, o, w
40 b, c, k, s, p
50 a, f, c, e, l, p, m, n

5. Consider the below Table contains X – is the Midterm Exam Mark and Y – is the Final 10
Exam Mark.
X 72 50 81 74 94 86 59 83 65 33 88 81 86
Y 84 63 77 78 90 75 49 79 77 52 74 90 ??
Predict the FINAL exam grade of a student who received an 86 on the midterm exam using
Bayesian Classification.

Page 1 of 2
6. The following data set will be used to learn a decision tree for predicting whether a 10
mushroom is edible or not based on its shape, color and odor.
Shape Color Odor Edible
C B 1 YES
D B 1 YES
D W 1 YES
D W 2 YES
C B 2 YES
D B 2 NO
D G 2 NO
C U 2 NO
C B 3 NO
C W 3 NO
D W 3 NO
What is entropy H (Edible|Odor = 1 or Odor = 3)?
Which attribute would the ID algorithm choose to use as the root of the tree?
Draw the full decision tree that would be learned for this data (no pruning).

7. Assume that metallurgical department maintains a repository of the rain fall, maximum and 10
minimum temperature of each district during June to September 2018 in the following
format. Group the rainfall according to the temperature recorded using R Programming and
with the help of visual analytics show the correlation exits with temperature and rainfall.
Predict which districts have the more impact on the temperature.

District District Min. Max. % Rain fall during


Id Name Temperature Temperature June to Sep. 2018
D001 Chennai 32 38 62%
D002 Kanchipuram 31 37 78%
D003 Thiruvallur 30 36 87%
D004 Vellore 33 39 80%

8. Consider the following eight points (with (x; y) representing location) 10


{(2; 10) (2; 5) (8; 4) (5; 8) (7; 5) (6; 4) (1; 2) (4; 9)}, cluster these point into 3 clusters
using K-Mean algorithms with the distance function is Euclidean distance.

9. Given a set of 5-D categorical samples 10


A = (1, 0, 1, 1, 0)
B = (1, 1, 0, 1, 0)
C = (0, 0, 1, 1, 0)
D = (0, 1, 0, 1, 0)
E = (1, 0, 1, 0, 1)
F = (0, 1, 1, 0, 0)
Cluster the above samples using agglomerative clustering with single-linkage as well as
complete-linkage. And also plot the dendograms for the solutions with single-linkage and
multi-linkage measures respectively.

10. Suppose that a city transportation department would like to perform data analysis on 10
highway traffic for the planning of highway construction based on the city traffic data
collected at different hours every day. What information can we mine from such a spatial
data warehouse to help city planners?

Describe in details about the Adaptive Neuro Fuzzy Inference System with example.


Page 2 of 2

You might also like

pFad - Phonifier reborn

Pfad - The Proxy pFad of © 2024 Garber Painting. All rights reserved.

Note: This service is not intended for secure transactions such as banking, social media, email, or purchasing. Use at your own risk. We assume no liability whatsoever for broken pages.


Alternative Proxies:

Alternative Proxy

pFad Proxy

pFad v3 Proxy

pFad v4 Proxy