0% found this document useful (0 votes)
549 views11 pages

CS 4407 Data Mining and Machine Learning - Term 3, 2018-2019

The document summarizes the results of a graded quiz on data mining and machine learning topics. The student scored poorly, answering only 4 out of 12 questions correctly for a grade of 36.36%. The quiz covered topics like linear regression, classification, and predicting categorical vs continuous variables.

Uploaded by

studyjunky
Copyright
© © All Rights Reserved
We take content rights seriously. If you suspect this is your content, claim it here.
Available Formats
Download as PDF, TXT or read online on Scribd
0% found this document useful (0 votes)
549 views11 pages

CS 4407 Data Mining and Machine Learning - Term 3, 2018-2019

The document summarizes the results of a graded quiz on data mining and machine learning topics. The student scored poorly, answering only 4 out of 12 questions correctly for a grade of 36.36%. The quiz covered topics like linear regression, classification, and predicting categorical vs continuous variables.

Uploaded by

studyjunky
Copyright
© © All Rights Reserved
We take content rights seriously. If you suspect this is your content, claim it here.
Available Formats
Download as PDF, TXT or read online on Scribd
You are on page 1/ 11

3/31/2019 Graded Quiz Unit 3

CS 4407 Data Mining and Machine Learning - Term 3, 2018-2019


Home ► My courses ► CS 4407 - AY2019-T3 ► 14 February - 20 February ► Graded Quiz Unit 3

Started on Wednesday, 20 February 2019, 3:58 PM


State Finished
Completed on Wednesday, 20 February 2019, 4:58 PM
Time taken 1 hour
Marks 8.00/22.00
Grade 36.36 out of 100.00

Question 1 Which of the following statements will generate a multiple linear


Incorrect regression model within R where the output or predicted variables is Sales
Mark 0.00 out of and the prediction variables include temperature and
1.00 unemploymentrate?  

Select one:
a. lm(sales~temperature+unemploymentrate)

b. lm(temperature+unemploymentrate=sales)

c. lm(sales+temperature~unemploymentrate)

d. None of these commands are valid

Your answer is incorrect.


The correct answer is: lm(sales~temperature+unemploymentrate)

Question 2 True/False:  Reinforcement learning features elements of both supervised


Correct learning and unsupervised learning as the outcome variable or predicted
Mark 1.00 out of values are validated over time and feedback is used to continuously train
1.00 the learning algorithm.

Select one:
True

False

The correct answer is 'True'.

https://my.uopeople.edu/mod/quiz/review.php?attempt=1692649&cmid=167013 1/11
3/31/2019 Graded Quiz Unit 3
Question 3 The sales of a company (in million dollars) for each year are shown in the
Not answered table below, identify the linear regression model in the form y=mx+b and
Marked out of report the values of m (slope) and b (intercept) as well as the estimated
1.00 value of y when the value of x is 10. 

x (year) 2005 2006 2007 2008 2009

y (sales) 12 19 29 37 45

NOTE: You should consider the value x as the elapsed time.  For 2005 this
would be 0 years, for 2006 it would be 1 year and for 2012 it would be 7
years.
What is the value of b? 

Answer:

The correct answer is: 11.6

Question 4 True/False: A linear regression model can be used to predict categorical


Incorrect data values.
Mark 0.00 out of
1.00 Select one:
True

False

The correct answer is 'False'.

https://my.uopeople.edu/mod/quiz/review.php?attempt=1692649&cmid=167013 2/11
3/31/2019 Graded Quiz Unit 3
Question 5 When data observations are placed into speci c groups according to their
Incorrect observed characteristics this is known as: ______________
Mark 0.00 out of
1.00 Select one:
a. Classi cation

b. Decision Tree Analysis

c. Clustering

d. Regression

Your answer is incorrect.


The correct answer is: Classi cation

Question 6 The following diagram represents which technique? 


Correct

Mark 1.00 out of


1.00

Select one:
a. Linear Regression

b. Curvilinear Regression

c. Spline Regression

d. Polynomial curve tting

Your answer is correct.

The correct answer is: Curvilinear Regression

https://my.uopeople.edu/mod/quiz/review.php?attempt=1692649&cmid=167013 3/11
3/31/2019 Graded Quiz Unit 3
Question 7 When using a relational database engine as the backend for analytics
Incorrect processing, the acronym _____ is used to describe it.  
Mark 0.00 out of
1.00 Select one:
a. MOLAP

b. ROLAP

c. OLAP

d. RDBMS

Your answer is incorrect.


The correct answer is: ROLAP

Question 8 ≈
A linear regression model is expressed as y β0+ β1x where β0 is the
Incorrect intercept and β1 is the slope of the line).  The following equations can be
Mark 0.00 out of used to compute the value of the coe cients β0 and β1. 
1.00

Using the following set of data, nd the coe cients β0 and β1rounded to
the nearest thousandths place and the predicted value of y when x is 10.

{(-1 , 0),(0 , 2),(1 , 4),(2 , 5)}

What is the value of β0. 

Answer: 0.9797

The correct answer is: 1.9

https://my.uopeople.edu/mod/quiz/review.php?attempt=1692649&cmid=167013 4/11
3/31/2019 Graded Quiz Unit 3
Question 9 Assume that you have a data set which produces the following data plot. 
Correct You wish to predict if a new case would be a ‘red’ case as opposed to a
Mark 1.00 out of ‘blue’ case based upon the input attribute data.  Which technique should
1.00 you use?  

Select one:
a. Linear Regression

b. Curvilinear Regression

c. Spline Regression

d. Logistic Regression

Your answer is correct.

The correct answer is: Logistic Regression

https://my.uopeople.edu/mod/quiz/review.php?attempt=1692649&cmid=167013 5/11
3/31/2019 Graded Quiz Unit 3
Question 10 Assume that you had a variety of data including medical history, diet,
Correct heredity factors on individuals who developed cancer and you wanted to
Mark 1.00 out of use this data to determine whether a person is likely to develop cancer. 
1.00 Which technique would be the most promising to start with? 

Select one:
a. Classi cation

b. Regression

c. Clustering

d. Estimation

Your answer is correct.

The correct answer is: Classi cation

Question 11 ≈
A linear regression model is expressed as y β0+ β1x where β0 is the
Incorrect intercept and β1 is the slope of the line).  The following equations can be
Mark 0.00 out of used to compute the value of the coe cients β0 and β1. 
1.00

Using the following set of data, nd the coe cients β0 and β1rounded to
the nearest thousandths place and the predicted value of y when x is 10.

{(-1 , 0),(0 , 2),(1 , 4),(2 , 5)}


What is the value of β1. 

Answer: 0.9898

The correct answer is: 1.7

https://my.uopeople.edu/mod/quiz/review.php?attempt=1692649&cmid=167013 6/11
3/31/2019
Question 12
Graded Quiz Unit 3

A linear regression model is expressed as y β0+ β1x where β0 is the
Incorrect intercept and β1 is the slope of the line).  The following equations can be
Mark 0.00 out of used to compute the value of the coe cients β0 and β1. 
1.00

Using the following set of data, nd the coe cients β0 and β1rounded to
the nearest thousandths place and the predicted value of y when x is 10.

{(-1 , 0),(0 , 2),(1 , 4),(2 , 5)}


What is the value of y. 

Answer: 21

The correct answer is: 18.9

Question 13 The sales of a company (in million dollars) for each year are shown in the
Incorrect table below, identify the linear regression model in the form y=mx+b and
Mark 0.00 out of report the values of m (slope) and b (intercept) as well as the estimated
1.00 value of y when the value of x is 10. 

x (year) 2005 2006 2007 2008 2009

y (sales) 12 19 29 37 45

NOTE: You should consider the value x as the elapsed time.  For 2005 this
would be 0 years, for 2006 it would be 1 year and for 2012 it would be 7
years.

What is the value of m? 


(NOTE: The next 2 question relate to this same problem but are separated
to be acomodated by moodle)

Answer: 40

The correct answer is: 8.4

https://my.uopeople.edu/mod/quiz/review.php?attempt=1692649&cmid=167013 7/11
3/31/2019 Graded Quiz Unit 3
Question 14 The income of a company that produces disaster equipment has been
Correct expressed as a linear regression model based upon the input variable
Mark 1.00 out of which is the number of hurricanes projected for the upcoming hurricane
1.00 season.   The model is express as Y = mX + b where Y is the estimated sales
in millions of dollars, m = .67 and b = 8.2.  Assuming that the weather
service is predicting 12 hurricanes during the season what are the sales in
millions of dollars expected to be?  

Answer: 16.24

The correct answer is: 16.24

Question 15 True/False: According to our textbook, residual plots are a useful tool for
Incorrect identifying clusters.
Mark 0.00 out of
1.00 Select one:
True

False

The correct answer is 'False'.

Question 16  True/False: Shared nothing architectures distribute the processing of


Correct queries to access large volumes of data and provide near linear scalability
Mark 1.00 out of in both storage volume and query performance. 
1.00
Select one:
True

False

The correct answer is 'True'.

https://my.uopeople.edu/mod/quiz/review.php?attempt=1692649&cmid=167013 8/11
3/31/2019 Graded Quiz Unit 3
Question 17 True/False: Data Mining can be said to be a process designed to detect
Correct patterns in data sets.
Mark 1.00 out of
1.00 Select one:
True

False

The correct answer is 'True'.

Question 18 True/False:  Supervised learning features both input variables or attributes


Correct and an output or predicted variable.  
Mark 1.00 out of
1.00 Select one:
True

False

The correct answer is 'True'.

https://my.uopeople.edu/mod/quiz/review.php?attempt=1692649&cmid=167013 9/11
3/31/2019 Graded Quiz Unit 3
Question 19 The sales of a company (in million dollars) for each year are shown in the
Not answered table below, identify the linear regression model in the form y=mx+b. 
Marked out of x (year) 2005 2006 2007 2008 2009
1.00
y (sales) 12 19 29 37 45

NOTE: You should consider the value x as the elapsed time.  For 2005 this
would be 0 years, for 2006 it would be 1 year and for 2012 it would be 7
years.
What is the predicted value of y  (in millions of dollars) when the year is
2012?

Answer:

The correct answer is: 70.4

Question 20 True/False: The snow ake schema di ers from the star schema in that the
Not answered table holding the dimensional data are normalized.
Marked out of
1.00 Select one:
True

False

The correct answer is 'True'.

https://my.uopeople.edu/mod/quiz/review.php?attempt=1692649&cmid=167013 10/11
3/31/2019 2 3
Graded Quiz Unit
Question 21 True/False: A regression model has a R statistic of .15.   This indicates that
Not answered the regression model is NOT a good t and does a poor job of predicting
Marked out of the outcome based upon the input variables.
1.00
Select one:
True

False

The correct answer is 'True'.

Question 22 Assuming you have a linear model in which the value of m is .05 and the
Not answered value of b is 10 that explains the relationship between income and credit
Marked out of extended.  If income is 50,000, what credit will be extended?
1.00
Select one:
a. 500

b. 5010

c. 2050

d. 2510

Your answer is incorrect.


The correct answer is: 2510

◄ Self-Quiz Unit 3

Jump to...

Unit Forum ►

https://my.uopeople.edu/mod/quiz/review.php?attempt=1692649&cmid=167013 11/11

You might also like

pFad - Phonifier reborn

Pfad - The Proxy pFad of © 2024 Garber Painting. All rights reserved.

Note: This service is not intended for secure transactions such as banking, social media, email, or purchasing. Use at your own risk. We assume no liability whatsoever for broken pages.


Alternative Proxies:

Alternative Proxy

pFad Proxy

pFad v3 Proxy

pFad v4 Proxy