0% found this document useful (0 votes)
28 views9 pages

Practice Exam Final 3024

stat

Uploaded by

kimaninduta49
Copyright
© © All Rights Reserved
We take content rights seriously. If you suspect this is your content, claim it here.
Available Formats
Download as PDF, TXT or read online on Scribd
0% found this document useful (0 votes)
28 views9 pages

Practice Exam Final 3024

stat

Uploaded by

kimaninduta49
Copyright
© © All Rights Reserved
We take content rights seriously. If you suspect this is your content, claim it here.
Available Formats
Download as PDF, TXT or read online on Scribd
You are on page 1/ 9

STA 3024 Exam 3 Practice Problems

NOTE: These are just Practice Problems. This is NOT meant to look just like the test,
and it is NOT the only thing that you should study. Make sure you know all the
material from the notes, quizzes, suggested homework and the corresponding chapters
in the book.

Questions 1 – 7 Former kicker for the Gator football team, Chris Hetland, was very good at
making field goals in the 2005 season, but in the 2006 regular season had only made 3 out
of 12. The following is the Logistic Regression Output to predict the probability of making
a field goal (yes/no), based on how far the kick is (in yards) and the year (2005 or 2006).

Logistic Regression Table

Predictor Coef SE Coef Z P


Constant 8312.97 3073.50 2.70 0.007
yards -0.173760 0.0901421 -1.93 0.054
year -4.14174 1.53141 -2.70 0.007

1. What kind of variables do we have here?


a) a quantitative predictor and a quantitative response
b) two quantitative predictors and a quantitative response
c) a quantitative predictor and a categorical response
d) two quantitative predictors and a categorical response

2. Write down the fitted logistic regression equation:


a) ŷ = 8312.97 – 0.173760 – 4.14174
b) ŷ = 8312.97 – 0.173760 yards – 4.14174 year
c) p̂ = e 8312.97 – 0.173760 – 4.14174 / 1+ e 8312.97 – 0.173760 – 4.14174
d) p̂ = e 8312.97 – 0.173760 yards – 4.14174 year / 1+ e 8312.97 – 0.173760 yards – 4.14174 year

3. The coefficients of yards and years are both negative. This means:
a) that neither variable is a good predictor of whether the kicker will make or not the field
goal
b) that simple linear regression would have been more appropriate than logistic regression
c) that there was a mistake in the way the data was entered into the computer
d) that the chances of making the field goal go down as the yardage increases, and as the
years increase

Find the probability of making a field goal:


4. from the 30 yd line in 2006
5. from the 30 yd line in 2005
6. from the 40 yd line in 2006
7. from the 40 yd line in 2005
Question 8 - 13 As part of a project for their Intro Stat course, two students compared two
brands of chips, Frito Lays and Golden Flakes, to see which company gives you more for
your money. Five bags of each brand (which, according to the label, each contained 35.4
grams) were measured with a very accurate scale. Use the Wilcoxon Rank-Sum test to see
if there are any significant differences between the two brands in the amount of product they
put in their bags.

Frito Lays: 35.3 35.4 35.8 35.9 35.9

Golden Flake: 35.3 37.8 38.8 38.1 42.5

8. The null hypothesis is about:


a) the mean contents of the bags for Frito Lays and Golden Flakes brands
b) the mode of the contents of the bags for Frito Lays and Golden Flakes brands
c) the distribution of the contents of the bags for the two brands
d) the number of bags with contents below the label weight for the two brands

9. The alternative hypothesis, according to the problem stated above, is that:


a) Frito Lays gives you more chips than Golden Flakes
b) Frito Lays gives you less chips than Golden Flakes
c) Frito Lays gives you either more or less chips than Golden Flakes
d) Golden Flakes gives you more chips than the amount stated on the label

10. The bags that contained 35.9 grams will receive a rank of:
a) 4 b) 4.5
c) 5 d) 5.5 e) 6

11. The p-value for the test was .1164. We conclude that:
a) Frito Lays gives you more chips.
b) Golden Flakes gives you more chips.
c) There is not enough evidence to prove a difference between the two brands.
d) There is enough evidence to prove a difference between the two brands.

12. If the assumptions for the Normal based procedure were satisfied, we could analyze the
data with a confidence interval for:
a) μ b) μ1-μ2 c) μd d) η 1- η 2

13. Why is it not a good idea to use the Normal-based procedure here?
a) the data was not randomly selected
b) the data does not have a continuous distribution
c) the outlier violates the assumption of Normality
d) the nonparametric method is always better
Questions 14 - 18 Do plain and peanut m&m's have the same distribution of colors?
Several bags of each variety (plain and peanut) were randomly selected, and the number of
candies of each color were counted before eating any of them. The data appears below.

brown yellow red blue green orange


plain 81 84 41 17 30 41 294
peanut 17 7 27 13 14 16 94
98 91 68 30 44 57 388

14. The null hypothesis is that:


a) plain and peanut varieties are independent
b) the colors are independent of each other
c) color and variety are independent of each other
d) all of the above

15. The expected number of blue, peanut m&m's (under independence) is:
a) 15.0 b) 15.67
c) 7.27 d) 32.33

16. The sampling distribution is χ 2 with degrees of freedom equal to:


a) 5 b) 10 c) 11 d) 12

17. The test statistic was 32.67. Use the table to approximate the p-value for this test:
a) smaller than .001
b) equal to .05
c) between .05 and .10
d) between .950 and .975

18. What conclusions can you reach from this analysis, based on the data and the test
statistic given?
a) There is something wrong with the data, maybe the m&m's were not randomly selected.
b) The distribution of colors is not significantly different for plain and peanut m&m's.
c) Peanut m&m's are significantly more colorful than plain m&m's.
d) There are significantly more brown m&m's than orange m&m's.
Questions 19 - 21 Match each of the Nonparametric procedures presented on the left with
the corresponding experimental design from the list on the right (use each alternative only
once).

19. Kruskal-Wallis H Test a) two independent samples


20. Wilcoxon Rank-Sum Test b) paired samples
21. Wilcoxon Signed-Rank Test c) several independent samples

Questions 22 -24 Five sets of identical twins were selected at random from a population of
identical twins. One child was selected at random from each pair to form an "experimental
group." These five children were sent to school. The other five children were kept at home
as a control group. At the end of the school year the following IQ scores were obtained.
Does this evidence justify the conclusion that lack of school experience has a depressing
effect on IQ scores? Analyze the data with the Wilcoxon Signed-Rank Test

Experimental Control
Pair Group Group
1 110 112
2 125 120
3 139 128
4 142 135
5 127 126

22. The sums of the ranks for this test are:


a) W+ =13 W- =2
b) W+ =24 W- =2
c) W =9
d) W =30

23. The data shows some evidence that:


a) the experimental (school) group tends to have higher IQs than the control (home)group.
b) the experimental (school) group tends to have lower IQs than the control (home)group.
c) the experimental (school) group tends to have IQs similar to the control (home)group.
d) the experimental (school) group tends to have IQs different from the control
(home)group.

24. Which of the following (one-sided) p-values looks reasonable for this data?
a) 0.0001
b) 0.9663
c) 0.0885
d) 0.4367
Questions 25 – 29 Data collected to study the relationship between child obesity and parental
obesity is shown in the following contingency table.

Child
Obese Nonobese

Obese 34 29
Parent
Nonobese 16 21

25. What is the null hypothesis being tested?


a) the proportion of obese and nonobese parents are the same
b) the proportion of obese and nonobese children are the same
c) the proportion of obese children is the same for obese and nonobese parents
d) all of the above

26. How many obese children were involved in the study?


a) 34 b) 50 c) 63 d) 100

27. What are the expected counts for each category under the null hypothesis (in the same
order as the given table)?
a) 34 29
16 21

b) 35 30
15 20

c) 31.5 31.5
18.5 18.5

d) 22.1 22.1
22.1 22.1

28. How many degrees of freedom are associated with the X2 test?
a) 4 b) 3 c) 2 d) 1

29. Find the contribution to the Test Statistc of the parent obese/child obese cell.
Questions 30 – 32 An experiment was conducted to determine whether a test designed to
identify a certain form of mental illness could be easily interpreted with little psychological
training. The test was given to 100 people (half of which had the illness, and half didn't) and
fifteen people were asked to evaluate them. The fifteen judges were five staff members of a
mental hospital, five trainees at the hospital, and five undergraduate psychology majors. The
results in the table give the number of the 100 tests correctly classified by each judge.
Analyze the data with the Kruskal-Wallis Test.

Staff Trainees Students


78 80 65
76 69 74
80 75 78
79 81 80
86 72 75

30. The ranks for the observations on the first row should be:
a) 2 3 1
b) 7 9 1
c) 8.5 12 1
d) none of the above

31. The highest rank given to any observation is:


a) 11 b) 5 c) 15 d) 3

32. If the p-value of the test is small we would conclude that there are:
a) differences between staff, trainees and students in their ability to interpret the test
b) no differences between staff, trainees and students in their ability to interpret the test
c) differences in the individual judges abilities to interpret the test
d) no differences in the individual judges abilities to interpret the test

Questions 33 – 35 Questions regarding the use of Nonparametric procedures:

33. Which of the following kinds of data can be analyzed with Nonparametric procedures?
a) normal b) continuous c) ranks d) all of the above

34. Which of the following kinds of data should be analyzed with Nonparametric
procedures?
a) normal b) continuous c) ranks d) all of the above

35. Given that all the necessary assumptions for each test are satisfied, which are more
powerful at finding significant differences?
a) Nonparametric procedures, since their assumptions are generally easier to satisfy.
b) Normal-based procedures, since they take into consideration the shape of the
distribution.
c) Nonparametric procedures, since their assumptions are generally harder to satisfy.
d) Normal-based procedures, since they work for distributions of almost any shape.
Questions 36 - 40 For each of the following stories, determine which would be the simplest type of
statistical analysis that would be appropriate to use. Use each type of analysis only once.
a) Paired t test
b) Two sample t-test
c) ANOVA
d) Kruskal-Wallis
e) Wilcoxon Rank-Sum Test

__36. Compare the average number of hours per week spent on Facebook for Freshmen, Sophomore,
Juniors and Seniors at UF, based on a random sample of 100 students.

__37. Compare the distribution of the number of hours per week spent on Facebook for Freshmen,
Sophomore, Juniors and Seniors at UF, based on random samples of 10 students per group, which had
quite different standard deviations.

__38. Compare the average number of hours per week spent on Facebook during the first week in
April and the first week in May (finals week) for random students at UF, measured on the same 100
students.

__39. Compare the distribution of the number of hours per week spent on Facebook for male and
female students at UF, based on a random sample of 10 students. There was an outlier in one of the
groups.

__40. Compare the average number of hours per week spent on Facebook for male and female
students at UF, based on a random sample of 100 students.

Questions 41 - 45 For each of the following stories, determine which would be the simplest type of
statistical analysis that would be appropriate to use. Use each type of analysis only once.
a) Confidence Interval for One Proportion
b) Contingency Table
c) Simple Linear Regression
d) Multiple Regression
e) Logistic Regression

__41. Predict the average number of hours per week UF students spend on Facebook, based on their
age and gender.

__42. Estimate the fraction of UF students who have Facebook accounts.

__43. Determine if the fraction of UF students who have Facebook accounts is different for Males
and Females.

__44. Determine how the probability that a UF student has a Facebbok account changes with the
student’s age.

__45. Predict the average number of hours per week UF students spend on Facebook, based on the
student’s age.
Questions 46 – 49 Which drug slows reaction time the most? The following are the reaction times (in
milliseconds) for randomly selected subjects who took either Drug A or Drug B.
Drug A Drug B
1.96 2.11
2.24 2.43
1.71 2.07
2.41 2.71
1.62 2.50
1.93 4.84
2.88
46. These data represents:
a) two independent samples, but it would have been better to collect data for matched
pairs, since reaction times can vary greatly by individual.
b) matched pairs, but it would have been better to collect data for two independent
samples, since reaction times can vary greatly by individual.
c) quantitative data, but it would have been better to collect categorical data on whether
each subject reacted more slowly under Drug A or B.
d) categorical data, but it would have been better to collect quantitative data on whether
each subject reacted more slowly under Drug A or B.

47. We could analyze this data with a t test or a Nonparametric procedure. When choosing
which procedure works best here it’s important to note that:
a) there are more observations for one treatment than the other
b) the variances of the two groups are quite different
c) there is an outlier in the data
d) all of the above
e) none of the above

48. If we conduct the Wilcoxon Rank-Sum test on this data, the sum of ranks for drug B is:
a) 25 b) 66 c) 28 d) 21 e) 53

49. The best interpretation of the results from the computer output shown below is:
a) There are significant differences in the mean reaction time for Drug A and B.
b) There are significant differences in the median reaction time for Drug A and B.
c) Reaction times are slower for Drug B, on average.
d) Reaction times are slower for Drug B, overall.

50. Are there any problems with the assumptions for the analysis below?
a) No problems if we can trust the subjects were really random.
b) No problem since the story states the subjects were chosen randomly.
c) There is a problem since looking at the data we don’t trust the subjects were really random.
d) There is a problem since the sample size requirement is not satisfied.

Mann-Whitney Test and CI: DrugA, DrugB


N Median
DrugA 6 1.945
DrugB 7 2.500
Point estimate for ETA1-ETA2 is -0.520
96.2 Percent CI for ETA1-ETA2 is (-1.260,-0.110)
W = 25.0
Test of ETA1 = ETA2 vs ETA1 not = ETA2 is significant at 0.0184
Solutions

1. d
2. d
3. d
4. 36%
5. 97%
6. 9%
7. 86%
8. c
9. c
10. d
11. c
12. b
13. c
14. c
15. c
16. a
17. a
18. c
19. c
20. a
21. b
22. a
23. a
24. c.
25. c
26. b
27. c
28. d
29. 0.1984
30. c
31. c
32. a
33. d
34. c
35. b
36. c
37. d
38. a
39. e
40. b
41. d
42. a
43. b
44. e
45. c
46. a
47. c
48. b
49. d
50. a

You might also like

pFad - Phonifier reborn

Pfad - The Proxy pFad of © 2024 Garber Painting. All rights reserved.

Note: This service is not intended for secure transactions such as banking, social media, email, or purchasing. Use at your own risk. We assume no liability whatsoever for broken pages.


Alternative Proxies:

Alternative Proxy

pFad Proxy

pFad v3 Proxy

pFad v4 Proxy