Question Bank For All 5 Units: Department of Computer Science and Engineering & Department of Information Technology
Question Bank For All 5 Units: Department of Computer Science and Engineering & Department of Information Technology
&
Department of Information Technology
QUESTION BANK
1
UNIT-1
COURSE PROGRAM BLOOM’S
S.No QUESTION MARKS OUTCOME OUTCOME TAXONOMY
(CO) (PO) LEVEL(BTL)
Why Data Analytics is so 2M 1 1 1
a).
important?
What do you mean by Enterprise 3M 1 1 1
1. b).
requirements in Data architecture?
Describe the Factors that influence 10M 1 2 2
c).
the Data Architecture?
2
UNIT 2
BLOO
PROG
COURSE M’S
RAM
OUTCO TAXON
S.No QUESTION MARKS OUTC
ME OMY
OME
(CO) LEVEL(
(PO)
BTL)
a). What is the Importance of Analytics? 2M 2 1 1
1. b). What is the role of Data Analytics? 3M 2 1 2
c). What are the ways to use the Data Analytics? 10M 2 2 4
3
UNIT 3
BLOOM’S
COURSE PROGRAM
TAXONOMY
S.No QUESTION MARKS OUTCOME OUTCOME
LEVEL(BTL
(CO) (PO)
)
a). When a Regression is chosen? 2M 3 1 1
b). List the Regression Analysis Techniques 3M 3 1 2
1. Explain in detail about the OLS 10M 3 3 4
c). Regression with the inclusion of Error
term?
4
UNIT 4
COURSE PROGRAM BLOOM’S
S. No QUESTION MARKS OUTCOME OUTCOME TAXONOMY
(CO) (PO) LEVEL(BTL)
a). What is Supervised Learning? 2M 4 2 1
What is the major difference between 3M 4 1 2
b).
1. Supervised & Unsupervised Learning?
Compare and contrast Supervised & 10M 4 3 4
c).
Unsupervised Learning.
5
UNIT 5
COURSE PROGRAM BLOOM’S
S. No QUESTION MARKS OUTCOME OUTCOME TAXONOMY
(CO) (PO) LEVEL(BTL)
What do you mean by Data Visualization? 2M 5 2 1
a).
Brief.
Why Data Visualization is required? 3M 5 1 2
b).
1. Elaborate
Explore in detail about the different 10M 5 3 5
c). Geometric Projection Visualization
Techniques.
6
OBJECTIVE QUESTIONS
UNIT – 1
Choose the Correct Answer
1. Most of the data is generated from _________ [ ]
A. Print media B. Organizations
C. Social media D. e-commerce
4. __________ policies and rules will help describe the manner in which enterprise wishes [ ]
to process their data.
A. Working Policies B. Labor Policies
C. Business policies D. Administration
7. The data which is Raw, original, and extracted directly from the official sources is [ ]
known as _____________.
A. Secondary Data B. primary data
C. Input Data D. Processed Data
8. CRD _______________ [ ]
A. Complete Randomized design B. Complete Rough Data
C. Complete Raw Data D. Complete Raw Design
9. LSD – Latin Square Design is _________ squares with an equal number of rows and [ ]
columns
A. N x N B. N x M
C. N x 1 D. 1 x N
10. __________ is the assessment of how much the data is usable and fits its serving [ ]
context.
A. data quality B. Data Integrity
7
C. Data Quantity D. Data Interpretability
UNIT-2
Choose the Correct Answer:
1. ______ is leading analytics tool used for statistics and data modeling. [ ]
A. Java Programming B. C Programming
C. R Programming D. C++ Programming
2. ___________ software that connects to any data source such as Excel, corporate [ ]
Data Warehouse, etc.
A. Tableau B. R
C. Java D. Python
3. _________ can be assembled on any platform like SQL server, a MongoDB database [ ]
or JSON.
A. Java Programming B. Python Programming
C. R Programming D. C++ Programming
5. _______ is one of the largest large-scale data processing engine that executes [ ]
applications in Hadoop clusters.
A. Python B. R
C. Ruby D. Apache Spark
8
together so that they act as a single entity.
A. Cluster computing B. Wide Computing
C. Big Computing D. Close Computing
10. ______is a component on top of Spark Core that introduces a new data abstraction [ ]
A. SQL B. Spark SQL
C. Spark D. NoSQL
9
UNIT-3
MULTIPLE CHOICE QUESTIONS:
1. The term __________ is used to indicate the estimation or prediction of the average [ ]
value of one variable for a specified value of another variable.
A. Segregation B. Progression
C. Regression D. Aggregation
A. y = B0*x * B1 B. y = B0 + B1 * x
C. y = B1 * x D. y = B0 + x
pi xi y x
2 2
i i
Err i 1
Err i 1
A. n B. n
n n
p y y x
2 2
i i i i
Err i 1
Err i 1
C. n D. n 1
4. When we have a single input attribute (x) and we want to use linear regression, this is [ ]
called ________________
A. Multiple Linear Regression B. Continuous Linear Regression
C. simple linear regression D. Auto Linear Regression
6. The goal of is to improve the Data Processing in an optimal way through attribute [ ]
subset selection
A. Rationalization B. Variable correlation
C. Variable Rationalization D. Various Rationalization
10
A. ordered B. Semi-ordered
C. unordered D. Under ordered
UNIT-4
MULTIPLE CHOICE QUESTIONS:
1. Supervised learning is a learning method in which models are trained using _______ [ ]
A. Unlabeled data B. Raw Data
C. Labeled data D. Complete Data
4. The purpose of ___________ is to better understand your customers rather than data [ ]
A. segmentation B. Regression
C. Segregation D. Correlation
11
6. Decision Tree is a _________________technique [ ]
A. supervised learning B. unsupervised learning
C. Semi-supervised learning D. Non-supervised learning
9. _________________is the process of removing the unwanted branches from the tree [ ]
A. Edging B. Pruning
C. Regression D. predicting
12
UNIT- 5
MULTIPLE CHOICE QUESTIONS:
1. _____________ is the art and practice of gathering, analyzing, and graphically [ ]
representing empirical information.
A. Data Modification B. Data visualization
C. Data Validation D. Data Updating
2. __________ is used to get graphical output from data predictive analytics results. [ ]
3. Data Visualization induce the viewer to think about the substance rather than [ ]
about________ through graphic design
A. output B. outcome
C. methodology D. error
4. We need to choose the dimensions and measures in the process of __________ Data. [ ]
A. Extracting B. Estimating
C. Expressing D. Exploring
5. _____________are the category type data points such as landing page, source [ ]
medium, etc.
A. Directions B. Dimensions
C. Detections D. Du-points
6. One of the great __________ qualities Tableau has is its ability to filter data in real [ ]
time
A. show room B. Show space
C. Show case D. Work space
7. DPA: ____________________ [ ]
A. Data presentation architecture B. Dual presentation architecture
C. Data preparation architecture D. Directive presentation architecture
13
FILL IN THE BLANKS:
11. Gain insight into an information space by mapping data onto ______________ provide
qualitative overview of large data sets.
12. ________________ is a Forecast Accuracy can be defined as the deviation of Forecast or
Prediction from the actual results.
13. In MFA, Error = _______________.
14. CHAID stands for ___________________.
15. Regression tree analysis is when the predicted outcome can be considered a
________________.
16. A ___________ tree is a binary decision tree that is constructed by splitting a node into two
child nodes repeatedly
17. Decision Tree Leaning can be able to handle both numerical and categorical data (True/False)
______________.
18. Decision Tree uses a White box Model (True/False) _______________.
19. Regression trees / parallel regression modeling, in which the dependent variable
is______________ .
20. The CART growing method attempts to _____________ within-node homogeneity.
14