Quiz Questions and Answers For 2024
Quiz Questions and Answers For 2024
Suppose now that the sample size is fixed. I will consider looking at the following components
to determine whether my sample size will be useful:
A. Size of the effect D: This is the smallest effect that is worth detecting (implying that any
larger effect is also worth detecting). This is a major factor in determining the sample size. The
smaller the size effect that you want to detect, the larger the sample size that is needed.
Significance level α: This specifies the risk you are prepared to take of being wrong when you
say that there is a difference when the null hypothesis is really true. If α changes from 5% to 1%
the sample size will increase. I have to adjust it to make sure that the sample size is still useful.
B. Size of the effect D: This is the smallest effect that is worth detecting (implying that any
larger effect is also worth detecting). This is a major factor in determining the sample size. The
smaller the size effect that you want to detect, the larger the sample size that is needed.
Significance level α: This specifies the risk you are prepared to take of being wrong when you
say that there is a difference when the null hypothesis is really true. If α changes from 5% to 1%
the sample size will increase. I have to adjust it to make sure that the sample size is still useful.
Standard deviation sigma of the population from which the sample is drawn: This is the standard
deviation of the population specified under the null hypothesis, so that it is likely to be the
standard treatment. Adjust the standard deviation to make sure that the sample size is still
useful.
C. Size of the effect D: This is the smallest effect that is worth detecting (implying that any
larger effect is also worth detecting). This is a major factor in determining the sample size. The
smaller the size effect that you want to detect, the larger the sample size that is needed.
Significance level α: This specifies the risk you are prepared to take of being wrong when you
say that there is a difference when the null hypothesis is true. If α changes from 5% to 1%the
sample size will increase. I have to adjust it to make sure that the sample size is still useful.
Standard deviation sigma of the population from which the sample is drawn: This is the standard
deviation of the population specified under the null hypothesis, so that it is likely to bethe
standard treatment. Adjust the standard deviation to make sure that the sample size is still
useful.
Power of the test: This is defined as the probability of detecting a difference that really exists and
denoted by 1-beta. This is the probability of making a correct decision and rejecting null
hypothesis when it is false. In order to determine the power, we need to specify the size of the
difference between the means that one is interested in. Adjust the power to make sure that the
sample size is still useful.
1
D. Size of the effect D: This is the smallest effect that is worth detecting (implying that any
larger effect is also worth detecting). This is a major factor in determining the sample size. The
smaller the size effect that you want to detect, the larger the sample size that is needed.
Significance level α: This specifies the risk you are prepared to take of being wrong when you say
that there is a difference when the null hypothesis is really true. If α changes from 5% to 1%the
sample size will increase. I have to adjust it to make sure that the sample size is still useful.
Power of the test: This is defined as the probability of detecting a difference that really exists
and denoted by 1-beta. This is the probability of making a correct decision and rejecting H0
when it is false. In order to determine the power, we need to specify the size of the difference
between the means that one is interested in. Adjust the power to make sure that the sample
size is still useful.
E. Significance level α: This specifies the risk you are prepared to take of being wrong when
you say that there is a difference when the null hypothesis is really true. If α changes from5% to
1% the sample size will increase. I have to adjust it to make sure that the sample size is still
useful.
Standard deviation sigma of the population from which the sample is drawn: This is the standard
deviation of the population specified under the null hypothesis, so that it is likely to be the
standard treatment. Adjust the standard deviation to make sure that the sample size is still
useful.
Power of the test: This is defined as the probability of detecting a difference that really exists and
denoted by 1-beta. This is the probability of making a correct decision and rejecting null
hypothesis when it is false. In order to determine the power, we need to specify the size of the
difference between the means that one is interested in. Adjust the power to make sure that the
sample size is still useful.
Answer Key:C
Question 13 (5 points)
Which components do you need to calculate the sample size for a paired t-test?
B. Level of significance, population standard deviation, size effect, power of the test and
the correlation between the pairs of observations
C. Level of significance, size effect and correlation between the pairs of observations
D. Level of significance, population standard deviation, size effect and power of the test
2
Question 15 (5 points)
What kind of sampling technique is the researcher using when asking every university student
with ID number ending in 11 to complete a survey?
A. random
B. stratified
C. systematic
D. snowball
E. quota
Answer Key:C
The bias which occurs for example, using a telephone directory to obtain a sample of people
from Johannesburg wil be one of the following kinds of biasedness.
A. non- response
B. non-reports
C. non-representative
D. self-selected
Answer Key:C
Match the example with the source of error which might make the point estimate quite different
from the true population value. One team of research field workers decided they did not like
travelling by foot, so they ignoredtheir randomly selected list of clusters and only weight and
measured children who lived in villages near the road in the rural areas.
A. measurement bias
B. sampling error
C. sampling bias
3
Question 29 (5 points)
Question 1:
A medical researcher wanted to find out if a new medication is effective in reducing the length and severity
of the influenza. The researcher took the next 18 patients that visited the clinic complaining of influenza
symptoms and, after a medical exam to verify that the patients did have influenza, the patients were given
the new medication and were informed about the new drug that they were given. A week later, the patients
were contacted, and 12 patients stated the new medication was helpful in reducing the severity and length
of the influenza. Which of the following in NOT CORRECT?
A. This is a poor experiment because there is no control group. The researcher does not
know how many would feel better in a week without treatment.
B. This is a poor experiment because it is not double-blinded. The patients may feel
relief because they thought the drug should work.
C. This is a poor experiment because a convenience sample was selected. Patients who visit
the clinic may have more severe influenza than people who do not.
D. This is a poor experiment because the medication was not given to people without the
influenza to assess its effect in a control group.
E. This is a poor experiment because the sample size is likely to be too small to detect
anything but a coarse improvement in measuring the proportion of people reporting an
improvement.
Answer Key:D
Question 2 (5 points)
4
Question 3: (5 points)
An experiment was conducted by a botanist who analysed the results of a plant growth experiment
after the amount of water and seed variety was manipulated by the botanist.
Which of the following is CORRECT?
A. The botanist randomized the plants to plots to eliminate any effect of hidden variables.
B. The botanist could determine the best combination of water and seed variety by examining the
difference in the plant height in the final week of the experiment.
C. The variability in growth among plants of the same variety who received the same amount of
water was constant over time.
D. The growth of a particular plant in week 4 is likely to be independent (unrelated) of the growth of
the same plant in week 3.
E. The growth of the plants was linear over time.
Answer Key: B
Question 4: (5 points)
A recent survey by a large-circulation South African tourism magazine on the contribution of tourism
to the South African economy was circulated to 400 people who the magazine decided are the people
who are most likely to know how important tourism are to the South African economy. What is the
main problem with using these results to draw conclusions about the general public’s perception?
A. Selection bias
B. Measurement bias
C. No control group
D. Non-response bias
E. Interviewer bias
Answer key: A
5
Question 30
Question 1 (5 points)
Question 2 (5 points)
6
Question 31 (5 points)
What is the difference between a randomized block design and two-way factorial design? Which
one of the following statements is correct?
A. A randomized block design has only one subject per cell.
B. A randomized block design has more than one subject per cell.
C. The assumptions are different for a factorial design than for the randomized block
design.
D. There are interactions terms in the randomized block design.
Question 32 (5 points)
What is the difference between a crossed design and a nested design when you have twofactors?
Which one of the following statements is correct?
B. If two factors are crossed then you won’t have at least one observation in every
combination of categories for the two factors.
C. If two factors are crossed then you can calculate an interaction.
D. If a factor is nested within another factor, then an observation does not have to be within
one category of Factor 2 in order to have a specific category of Factor 1.
E. If two factors are nested you can calculate an interaction.
Answer Key:C
7
Question 35 ( 5 points)
A researcher uses a regression equation to predict residential electricity usage (rand cost),
based on residence size (square metre). The correlation between predicted electricity usage
and residence size is 0.8. What is the correct interpretation of this finding?
A. 80% of the variability in residential electricity usage can be explained by residence size
B. 64% of the variability in residential electricity usage can be explained by residence size.
C. For each added square metre of residence size, electricity usage increased by 80 cents.
D. For each added square metre of residence size, electricity usage increased by 64 cents.
8
Question 36
Question 1 (5 points)
A researcher applied the linear regression technique to determine the effect of temperature on
revenue for coca cola sales on different days of the month. She examined the residual plots for
this particular set of data. What will she conclude from these residual plots?
9
Question 2 (5 points)
Once the researcher has examined the residual plots and identified the problem with the
residual plots, how will the researcher be able to rectify the problem?
E. She will add the square of the current exploratory variable to the linear regression
equation
Answer Key:D
10
Question 38 (5 points)
What can you say about the relationship of two independent variables in a multiple linear
regression if b1 = +3.5?
B. The estimated value of Y increases by an average of 3.5 units for each increase of 1
C. The estimated value of Y increases by an average of 3.5 units for each increase of 1
Question 42 (5 points)
A. The relationship between the dependent and independent variables must be linear in
parameters.
B. It evaluates whether population means that has been adjusted for differences on
C. ANCOVA does not compare a response variable by both a factor and a continuous
independent variable.
E. ANCOVA assumes that covariates must be linearly related to the dependent variables.
Answer Key:C
11
Question 46 (5 points)
Which one of the following statements is NOT TRUE for the log-linear model for two- way tables?
A. Log-linear models for two-way tables describe associations and interaction patterns
among two categorical random variables.
B. Log-linear models express the cell counts depending on levels of two categorical
variables.
C. Given two categorical random variables, A and B, there are two main types of models that
we will consider; the independence model, (A, B) and the saturated model (AB).
D. If the interaction term is significant in the log-linear model, then we can remove one ofthe
E. If in hierarchical log-linear models a term is set to zero, all its higher-order relatives arealso
set to zero.
Answer Key:D
Question 49 (5 points)
Which one of the following statements is NOT CORRECT when comparing between
correspondence analysis and principal component analysis?
Answer Key:C
12
Question 51 (5 points)
Which one of the following statements is NOT CORRECT for the Chi-square automatic
interactiondetector technique?
B. CHAID can also be extended to apply to the case where we have a continuous
response variable.
C. Chi-square tests are applied at each of the stages in building the CHAID tree.
D. A statistically significant result indicates that the two variables are independent.
E. If a statistically significant difference is observed, then the most significant factor is used
Question 52 (5 points)
Which one of the following statements is not correct for survival analysis?
A. If a subject does not have an event during the observation time, they are described as
censored.
B. Survival estimates are reliable toward the end of a study when there are small numbers
C. It is used for analysing the expected duration of time until one or more events happen.
D. The logrank test compares the survival times of two or more groups.
E. A nonparametric estimator of the survival function, The Kaplan Meier method is widely
13
Question 54 (5 points)
Which one of the following statements will be the correct choice to choose a multiplicative time
series model?
A. Choose the model when the magnitude of the seasonal pattern in the data does not
depend on the magnitude of the data.
B. Choose the model when the magnitude of the seasonal pattern in the data depends on the
magnitude of the data.
C. Choose the model when the amplitude of both the seasonal and irregular variations
does not change as the level of the trend rises or falls.
D. The model assumes that the residuals are roughly the same size throughout the series.
Question 56 (5 points)
Which one of the statements below best describes the ARMA model?
A. One of the assumptions for the model is the presence of linear dependence in the
observations in the series.
C. The model is a deterministic model in the sense that the conditional variance is a purely
14