MSC MA5120
Faculty of Engineering
Department of Mathematics
INSTRUCTIONS TO CANDIDATE:
Question 1:
The crosstabulation in Table 1.1 was obtained using a statistical package to analyze the
relationship between Sex and the Source of Stress in a sample of individuals.
Table 1.1
Source of Stress (columns): Work; Spouse or Partner; Relationships; Family or Children;
Health issues; General Life; Isolation; Other; Total
(i) State a suitable null hypothesis for testing the relationship between the two categorical
variables using the Chi-square test. [5 Marks]
(ii) Conduct the Chi-square test of association at 5% significance level using the above data
and write your conclusion. [15 Marks]
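The calculation behind a Chi-square test of association can be sketched as below. This is a minimal illustration only: the counts are hypothetical placeholders, not the values of Table 1.1, and the layout assumes two Sex categories against the eight sources of stress. The function name chi_square_association is introduced here for illustration.

```python
def chi_square_association(observed):
    """Return (chi-square statistic, degrees of freedom) for a table of counts."""
    row_totals = [sum(row) for row in observed]
    col_totals = [sum(col) for col in zip(*observed)]
    grand_total = sum(row_totals)
    statistic = 0.0
    for i, row in enumerate(observed):
        for j, count in enumerate(row):
            # Expected count under independence of rows and columns
            expected = row_totals[i] * col_totals[j] / grand_total
            statistic += (count - expected) ** 2 / expected
    df = (len(observed) - 1) * (len(observed[0]) - 1)
    return statistic, df


# HYPOTHETICAL counts (not Table 1.1): 2 sexes x 8 sources of stress
hypothetical_counts = [
    [30, 12, 8, 10, 9, 14, 5, 12],
    [25, 15, 12, 18, 11, 10, 6, 3],
]
statistic, df = chi_square_association(hypothetical_counts)

# Upper 5% point of the chi-square distribution with 7 degrees of freedom
CRITICAL_5PCT_DF7 = 14.067
reject_null = statistic > CRITICAL_5PCT_DF7
```

The null hypothesis of no association is rejected when the statistic exceeds the critical value for (rows - 1) x (columns - 1) degrees of freedom. The approximation is usually considered reliable only when the expected counts are not too small.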
Question 2:
A university conducts a placement test in four subject areas: English, Math, IQ, and
Drawing. Students in the batch completed all four placement tests when they enrolled in
the university. A particular course coordinator is interested in only the English and Math
sections and wants to determine whether students tended to score higher, on average, on
their English test or on their Math test.
The test scores (out of 100 points) for each subject for all students are available for the
analysis. The summary statistics of the paired samples are in the tables below.
Use a paired t-test to test whether there is a significant difference between the average
scores of the two tests.
Clearly state all the steps of testing with hypotheses, calculations, assumptions and
conclusions.
[20 Marks]
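The paired t statistic can be computed from summary statistics of the differences (English minus Math), as in the sketch below. The numbers passed in are hypothetical placeholders, not the summary statistics of this question, and the function name paired_t_statistic is introduced here for illustration.

```python
import math


def paired_t_statistic(mean_diff, sd_diff, n):
    """Return (t statistic, degrees of freedom) for a paired t-test.

    mean_diff: mean of the paired differences
    sd_diff:   sample standard deviation of the paired differences
    n:         number of pairs
    """
    standard_error = sd_diff / math.sqrt(n)
    return mean_diff / standard_error, n - 1


# HYPOTHETICAL summary statistics, for illustration only
t_stat, df = paired_t_statistic(mean_diff=2.5, sd_diff=10.0, n=100)
```

The statistic is compared with the two-tailed critical value of the t distribution with n - 1 degrees of freedom. The test assumes that the differences are approximately normally distributed and that the pairs are independent of each other.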
Question 3:
(i) Table 3.1 contains data collected by following up the mortality bills issued by a
hospital during one week for adult CVD patients who died of the disease. Using these data,
find the expected number of dependents who lose their guardian as a result of a single death.
Table 3.1
No of dependents    0     1     2     3     4     5     More than 5
No of deaths        123   69    79    55    2     1     0
[10 marks]
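The expected value in part (i) is the frequency-weighted mean of the number of dependents. A minimal sketch using the counts of Table 3.1 follows; the "More than 5" category is left out because it has zero deaths and so contributes nothing to the sum.

```python
# Counts from Table 3.1; "More than 5" has 0 deaths and is omitted
dependents = [0, 1, 2, 3, 4, 5]
deaths = [123, 69, 79, 55, 2, 1]

total_deaths = sum(deaths)

# E[X] = sum(x * f) / sum(f)
expected_dependents = (
    sum(x * f for x, f in zip(dependents, deaths)) / total_deaths
)
```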
(ii) Comment on the use of the expected value to describe the central tendency of these data
and suggest a better measure, if any. [05 marks]
(iii) Compute another summary statistic to describe the data. [05 marks]
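Other summaries of Table 3.1 can be obtained in the same way. The sketch below computes the median, the mode and the variance from the frequency table. These are offered as possible choices, not as the intended answers, and the variance uses the population form (dividing by n), which is an assumption.

```python
import math

# Counts from Table 3.1; "More than 5" has 0 deaths and is omitted
dependents = [0, 1, 2, 3, 4, 5]
deaths = [123, 69, 79, 55, 2, 1]
n = sum(deaths)

# Median: first value whose cumulative frequency reaches rank (n + 1) / 2
middle_rank = (n + 1) / 2
cumulative = 0
median = None
for value, freq in zip(dependents, deaths):
    cumulative += freq
    if cumulative >= middle_rank:
        median = value
        break

# Mode: the value with the largest frequency
mode = dependents[deaths.index(max(deaths))]

# Population variance and standard deviation around the mean
mean = sum(x * f for x, f in zip(dependents, deaths)) / n
variance = sum(f * (x - mean) ** 2 for x, f in zip(dependents, deaths)) / n
std_dev = math.sqrt(variance)
```

Because the distribution is right-skewed with most deaths at zero or one dependent, the median or mode may describe a typical case more faithfully than the mean.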
Question 4:
A sample data set is drawn from a population with the intention of fitting a plane to predict
water usage.
Table 4.1: Correlations
                                  WaterUsage   Production   MeanTemp   Days
WaterUsage   Pearson Correlation  1            0.282        0.473      -0.072
             Sig. (2-tailed)                   0.374        0.090      0.825
             N                    12           12           12         12
(i) After a careful analysis of correlations between the variables, decide whether there is
enough evidence to fit a plane. Justify your answer. [05 marks]
(ii) If you decide to fit a regression line, state the significance level and the
independent variable you choose. [05 marks]
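One way to screen the candidate predictors is to compare each two-tailed significance value in Table 4.1 with a chosen significance level. The sketch below does only that; the function name significant_predictors is introduced here for illustration, and the levels 0.05 and 0.10 are example choices.

```python
# Sig. (2-tailed) values for the correlation with WaterUsage, from Table 4.1
sig_two_tailed = {"Production": 0.374, "MeanTemp": 0.090, "Days": 0.825}


def significant_predictors(p_values, alpha):
    """Return the predictors whose correlation is significant at level alpha."""
    return [name for name, p in p_values.items() if p < alpha]


at_5_percent = significant_predictors(sig_two_tailed, 0.05)
at_10_percent = significant_predictors(sig_two_tailed, 0.10)
```

With the tabulated values, no predictor passes at the 5% level, and only MeanTemp passes at the 10% level.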
(iii) Estimate the coefficients of the regression line and predict the water usage at one of
the following values, depending on the independent variable in use.
Prediction to be done at Production 90, Mean Temperature 60, Days 30 [10 marks]
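For a simple regression line, the coefficients can be estimated from the correlation together with the means and standard deviations of the two variables. The sketch below assumes MeanTemp is the chosen predictor and takes r = 0.473 from Table 4.1. The means and standard deviations are hypothetical placeholders, since they are not given here, and the function name fit_line_from_summary is introduced for illustration.

```python
def fit_line_from_summary(r, mean_x, sd_x, mean_y, sd_y):
    """Return (intercept, slope) of the least-squares line y = b0 + b1 * x."""
    slope = r * sd_y / sd_x
    intercept = mean_y - slope * mean_x
    return intercept, slope


# r is from Table 4.1 (WaterUsage vs MeanTemp)
# The means and standard deviations below are HYPOTHETICAL placeholders
b0, b1 = fit_line_from_summary(
    r=0.473, mean_x=50.0, sd_x=10.0, mean_y=3000.0, sd_y=500.0
)

# Prediction at Mean Temperature 60
predicted_usage = b0 + b1 * 60
```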