
STA404 Exam Booklet - 20.03.2023

The document contains instructions for a final assessment consisting of 7 questions related to statistics. It provides details such as the date and time of the exam, instructions for candidates, and a note that the assessment contains 12 printed pages. The questions cover topics such as analyzing data to test differences in means, linear regression, descriptive vs inferential statistics, paired t-tests, and independent t-tests.

Uploaded by

Nur Daniel

CONFIDENTIAL 1 CS/JUL 2021/STA404

UNIVERSITI TEKNOLOGI MARA


FINAL ASSESSMENT (ASSESSMENT 4)

COURSE : STATISTICS FOR BUSINESS AND SOCIAL SCIENCES
COURSE CODE : STA404
EXAMINATION : 6TH AUGUST 2021
TIME : 2 HOURS (1445 – 1645)

INSTRUCTIONS TO CANDIDATES

1. This question paper consists of SEVEN (7) questions.

2. Answer ALL questions on foolscap paper. Start each answer on a new page.

3. Candidates must complete this assessment within 2 hours.

4. Candidates are required to convert their completed answer into one PDF file before
submission (<FULLNAME_UiTM ID_GROUP>.pdf).

5. Candidates are given 30 minutes to email their completed answer to the respective
lecturers.

6. Candidates are required to include the following details on every page of the answer script :

i) Full Name
ii) Student Number
iii) Group
iv) HP Number

7. Please check to make sure that this assessment pack consists of :

i) the Question Paper


ii) a five-page Appendix 1

8. Answer ALL questions in English.

PLEASE READ THE INSTRUCTIONS CAREFULLY BEFORE STARTING THE EXAMINATION


This assessment paper consists of 12 printed pages
© Hak Cipta Universiti Teknologi MARA CONFIDENTIAL

QUESTION 1

A random sample of employees was selected from three different types of stores at the
mall and their ages were recorded. Assume that the assumptions for the parametric test
are met. The data were analyzed and the result is shown below.

Department Store Music Sport

a) Calculate the sum of squares between groups.


(4 marks)

b) Given that the between-group variance is 727.925 and the total sum of squares is 2823,
compute the value of the test statistic for the above data.
(3 marks)

c) At the 0.05 significance level, test the claim that there is a difference in mean ages for
three types of stores at the mall.
(4 marks)

QUESTION 2

and the
number of accidents he or she had over a 3-year period. The data collected for 10 drivers
are shown below.

Age               16  24  16  18  23  27  32  24  28  21
No. of accidents   3   2   5   2   0   1   1   1   0   3

is normally distributed, the data


were analyzed and the output is indicated below.

a) Identify the predictor and response variables.


(1 mark)

b) Compute the value of Z. Explain the value obtained.


(5 marks)


c) State the value of slope. Interpret its meaning.


(2 marks)

d) Write down the linear regression equation for this data.


(1 mark)

e) Predict the number of accidents for a driver who is 30 years old.


(2 marks)
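
The regression quantities asked for in Question 2 can be checked with a short script. This is a minimal sketch, assuming the ten age and accident values in the table above were transcribed correctly and using the SS formulas from Appendix 1 (5); the variable names are illustrative only.

```python
import math

# Data transcribed from the Question 2 table (10 drivers).
age = [16, 24, 16, 18, 23, 27, 32, 24, 28, 21]
accidents = [3, 2, 5, 2, 0, 1, 1, 1, 0, 3]
n = len(age)

sum_x, sum_y = sum(age), sum(accidents)

# Sums of squares as defined in Appendix 1 (5).
ss_xy = sum(x * y for x, y in zip(age, accidents)) - sum_x * sum_y / n
ss_xx = sum(x * x for x in age) - sum_x ** 2 / n
ss_yy = sum(y * y for y in accidents) - sum_y ** 2 / n

b = ss_xy / ss_xx                      # slope
a = sum_y / n - b * sum_x / n          # intercept: y-bar minus b times x-bar
r = ss_xy / math.sqrt(ss_xx * ss_yy)   # linear correlation coefficient
y_hat_30 = a + b * 30                  # predicted accidents for a 30-year-old driver

print(round(r, 3), round(b, 3), round(a, 3), round(y_hat_30, 3))
```

With unrounded coefficients the prediction is about 0.266; substituting the rounded values 6.747 and 0.216 gives 0.267 instead.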

QUESTION 3

A manager at a popular telco company is conducting a survey regarding
service failure at its service counters. The main objective of the survey is to find out the
factors that cause the failure. He randomly selected five service counters from the ten
service counters available across Malaysia. A questionnaire was distributed to all the customers at the
five selected service counters. The information collected from the customers includes
their age, gender, occupation, income, rating of service (0 to 100) and service quality (poor,
moderate and good).

a) State the population in the study.


(1 mark)

b) State the sampling technique used in the study.


(1 mark)

c) Identify ONE ordinal variable and ONE ratio variable obtained from the study.
(2 marks)

d) The following statistics were produced from the study. Identify whether each
statement is a descriptive or an inferential statistic.
i) 45% of the sample customers work in the government sector.

ii) Based on the sample, it can be concluded that there is an association between
gender and service quality.

iii) We are 90% confident that the average rating of service for the customers falls
between 60 and 90.
(3 marks)


QUESTION 4

The depression scores (the higher the score, the more stressed are the patients) of 25
patients were recorded before and after they had undergone a therapy. The scores were
analyzed to see the effectiveness of the therapy. The outputs are indicated below.

Paired Samples Statistics

                    Mean    N    Std. Deviation   Std. Error Mean
Pair 1   depress1   42.56   25   4.331            .866
         depress2   41.36   25   4.957            .991

Paired Samples Test

Pair 1: depress1 - depress2
  Paired Differences
    Mean                                               X
    Std. Deviation                                     2.121
    Std. Error Mean                                    Y
    95% Confidence Interval of the Difference, Lower   .324
    95% Confidence Interval of the Difference, Upper   2.076
  t                                                    2.828
  df                                                   24
  Sig. (2-tailed)                                      .009
Note: depress1 represents the depression scores before the patients underwent the therapy.
Assume that the depression scores are normally distributed.
a) Find the values of X and Y.
(3 marks)

b) State the 95% confidence interval for the mean depression scores.
(1 mark)

c) Based on the confidence interval in b), can it be concluded that the therapy is effective?

(2 marks)
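
The missing entries in the paired-samples output can be recovered from the printed figures. The sketch below is illustrative only: it assumes X is the difference of the two sample means, Y is the standard error of the differences, and takes the critical value t(0.025, 24) = 2.064 from a standard t table rather than from the question.

```python
import math

n = 25                                  # number of pairs
mean_before, mean_after = 42.56, 41.36  # depress1 and depress2 means
sd_diff = 2.121                         # std. deviation of the paired differences
t_crit = 2.064                          # assumed table value t(0.025, df = 24)

mean_diff = mean_before - mean_after    # X
se_diff = sd_diff / math.sqrt(n)        # Y = s_d / sqrt(n)
t_stat = mean_diff / se_diff            # should agree with the printed t

# 95% confidence interval for the mean difference (Appendix 1 (2)).
lower = mean_diff - t_crit * se_diff
upper = mean_diff + t_crit * se_diff

print(round(mean_diff, 3), round(se_diff, 3), round(t_stat, 3))
print(round(lower, 3), round(upper, 3))
```

Because the interval lies entirely above zero, the mean score before therapy exceeds the mean score after therapy at the 95% confidence level.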

QUESTION 5

A study was conducted to compare the average sugar content of two brands of
energy drinks. Ten energy drinks of brand P and eight energy drinks of brand Q were
sampled and the sugar content of each was recorded. The following is the
SPSS output for the analysis of sugar content.

Group Statistics

Sugar Content
Brand of Energy Drink   N    Mean    Std. Deviation   Std. Error Mean
Energy Drink P          10   10.80   4.185            1.323
Energy Drink Q          8    11.00   1.069            .378


Independent Samples Test

Sugar Content
                                                      Equal variances   Equal variances
                                                      assumed           not assumed
Levene's Test for Equality of Variances
  F                                                   2.176
  Sig.                                                .160
t-test for Equality of Means
  t                                                   -.131             -.145
  df                                                  16                10.439
  Sig. (2-tailed)                                     .897              .887
  Mean Difference                                     -.200             -.200
  Std. Error Difference                               1.526             1.376
  95% Confidence Interval of the Difference, Lower    -3.435            -3.249
  95% Confidence Interval of the Difference, Upper    3.035             2.849

a) Determine whether the variances of the amount of sugar for the two brands are equal.
Use
(3 marks)

b) Assume the sugar content is normally distributed. If it is claimed that energy drink P has
less sugar than energy drink Q, conduct a hypothesis test of this
claim. Use α = 0.05.
(4 marks)
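
The "equal variances assumed" column of the SPSS output can be reproduced from the Group Statistics using the pooled-variance formulas in Appendix 1 (3). This is a sketch under that assumption; halving the two-tailed Sig. value to obtain a one-tailed p-value is a common convention, applied here because the sample difference points in the direction of the claim.

```python
import math

# Group statistics for brands P and Q, as printed in the SPSS output.
n_p, mean_p, sd_p = 10, 10.80, 4.185
n_q, mean_q, sd_q = 8, 11.00, 1.069

# Pooled variance and standard error of the difference.
pooled_var = ((n_p - 1) * sd_p ** 2 + (n_q - 1) * sd_q ** 2) / (n_p + n_q - 2)
se_diff = math.sqrt(pooled_var * (1 / n_p + 1 / n_q))

t_stat = (mean_p - mean_q) / se_diff
df = n_p + n_q - 2

# One-tailed p-value for H1: mu_P < mu_Q, from the printed two-tailed Sig.
p_one_tailed = 0.897 / 2
alpha = 0.05
reject_h0 = p_one_tailed < alpha

print(round(se_diff, 3), round(t_stat, 3), df, reject_h0)
```

Since the one-tailed p-value is well above 0.05, the data do not support the claim that brand P has less sugar than brand Q.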

QUESTION 6

Laptop purchases are on an increasing trend due to the Covid-19 pandemic. Hence,
a group of researchers intends to describe this scenario using descriptive statistics. The SPSS
output below shows the number of laptop purchases (in a month) for
fifteen states in Malaysia. Assume that the number of
laptop purchases (in a month) is normally distributed.


a) Calculate the mean and standard deviation for this study.


(2 marks)

b) State the median value in this study. Hence interpret the value.
(2 marks)

c) Construct a box and whisker plot for the above study.


(2 marks)

d) Based on the plot in c), describe the shape of distribution.


(1 mark)

e) If the researchers wish to describe the shape of distribution by using an appropriate
measurement, which descriptive measure would you suggest to the researchers?

(1 mark)

QUESTION 7

A dairy products factory wants to know the milk flavour preferred by the buyers. The
researchers randomly selected several supermarket visitors and conducted an experiment.
Buyers were given three cups of milk with different flavours to drink. After that, they were
asked to choose the one flavour that they preferred the most. The data collected are shown in
the following table.


a) Find the value of M by using expected value formula.


(2 marks)

b) Calculate the value of the test statistic for this study.


(3 marks)

c) At 10% significance level, what can you conclude about the relationship between the
two variables?
(4 marks)

d) If the variables are measured in ratio, can the Chi-Square Test of Independence be
used?
(1 mark)

END OF QUESTION PAPER


APPENDIX 1 (1)

SAMPLE MEASUREMENTS

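Mean: $\bar{x} = \dfrac{\sum x}{n}$

Standard deviation: $s = \sqrt{\dfrac{1}{n-1}\left[\sum x^2 - \dfrac{(\sum x)^2}{n}\right]}$ or $s = \sqrt{\dfrac{1}{n-1}\sum (x - \bar{x})^2}$

Coefficient of Variation: $CV = \dfrac{s}{\bar{x}} \times 100\%$

Coefficient of Skewness = $\dfrac{3(\text{mean} - \text{median})}{\text{standard deviation}}$ OR $\dfrac{\text{mean} - \text{mode}}{\text{standard deviation}}$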


APPENDIX 1 (2)

CONFIDENCE INTERVAL

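Parameter and description, followed by the $(1 - \alpha)100\%$ confidence interval:

Mean $\mu$, for large samples, $\sigma^2$ unknown:
$\bar{x} \pm z_{\alpha/2}\dfrac{s}{\sqrt{n}}$

Mean $\mu$, for small samples, $\sigma^2$ unknown:
$\bar{x} \pm t_{\alpha/2}\dfrac{s}{\sqrt{n}}$ ; df = n – 1

Difference in means of two normal distributions, $\mu_1 - \mu_2$, $\sigma_1^2 = \sigma_2^2$ and unknown:
$(\bar{x}_1 - \bar{x}_2) \pm t_{\alpha/2}\, s_p\sqrt{\dfrac{1}{n_1} + \dfrac{1}{n_2}}$ ; df = n1 + n2 – 2
where $s_p = \sqrt{\dfrac{(n_1 - 1)s_1^2 + (n_2 - 1)s_2^2}{n_1 + n_2 - 2}}$

Difference in means of two normal distributions, $\mu_1 - \mu_2$, $\sigma_1^2 \neq \sigma_2^2$ and unknown:
$(\bar{x}_1 - \bar{x}_2) \pm t_{\alpha/2}\sqrt{\dfrac{s_1^2}{n_1} + \dfrac{s_2^2}{n_2}}$ ;
$df = \dfrac{\left(\dfrac{s_1^2}{n_1} + \dfrac{s_2^2}{n_2}\right)^2}{\dfrac{\left(s_1^2/n_1\right)^2}{n_1 - 1} + \dfrac{\left(s_2^2/n_2\right)^2}{n_2 - 1}}$

Mean difference of two normal distributions for paired samples, $\mu_d$:
$\bar{d} \pm t_{\alpha/2}\dfrac{s_d}{\sqrt{n}}$ ; df = n – 1 where n is no. of pairs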


APPENDIX 1 (3)

HYPOTHESIS TESTING

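Null hypothesis, followed by the test statistic:

$H_0 : \mu = \mu_0$, $\sigma^2$ unknown, large samples:
$z = \dfrac{\bar{x} - \mu_0}{s/\sqrt{n}}$

$H_0 : \mu = \mu_0$, $\sigma^2$ unknown, small samples:
$t = \dfrac{\bar{x} - \mu_0}{s/\sqrt{n}}$ ; df = n – 1

$H_0 : \mu_1 - \mu_2 = 0$, $\sigma_1^2 = \sigma_2^2$ and unknown:
$t = \dfrac{(\bar{x}_1 - \bar{x}_2) - (\mu_1 - \mu_2)}{s_p\sqrt{\dfrac{1}{n_1} + \dfrac{1}{n_2}}}$ ; df = n1 + n2 – 2
where $s_p = \sqrt{\dfrac{(n_1 - 1)s_1^2 + (n_2 - 1)s_2^2}{n_1 + n_2 - 2}}$

$H_0 : \mu_1 - \mu_2 = 0$, $\sigma_1^2 \neq \sigma_2^2$ and unknown:
$t = \dfrac{(\bar{x}_1 - \bar{x}_2) - (\mu_1 - \mu_2)}{\sqrt{\dfrac{s_1^2}{n_1} + \dfrac{s_2^2}{n_2}}}$ ;
$df = \dfrac{\left(\dfrac{s_1^2}{n_1} + \dfrac{s_2^2}{n_2}\right)^2}{\dfrac{\left(s_1^2/n_1\right)^2}{n_1 - 1} + \dfrac{\left(s_2^2/n_2\right)^2}{n_2 - 1}}$

$H_0 : \mu_d = 0$:
$t = \dfrac{\bar{d} - \mu_d}{s_d/\sqrt{n}}$ ; df = n – 1, where n is no. of pairs

Hypothesis for categorical data:
$\chi^2 = \sum \dfrac{(o_{ij} - e_{ij})^2}{e_{ij}}$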


APPENDIX 1 (4)

ANALYSIS OF VARIANCE FOR A COMPLETELY RANDOMIZED DESIGN

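Let:
$k$ = the number of different samples (or treatments)
$n_i$ = the size of sample i
$T_i$ = the sum of the values in sample i
$n$ = the number of values in all samples = $n_1 + n_2 + n_3 + \dots$
$\sum x$ = the sum of the values in all samples = $T_1 + T_2 + T_3 + \dots$
$\sum x^2$ = the sum of the squares of values in all samples

Degrees of freedom for the numerator = k – 1

Degrees of freedom for the denominator = n – k

Total sum of squares: $SST = \sum x^2 - \dfrac{(\sum x)^2}{n}$

Sum of squares between groups: $SSB = \left(\dfrac{T_1^2}{n_1} + \dfrac{T_2^2}{n_2} + \dfrac{T_3^2}{n_3} + \dots\right) - \dfrac{(\sum x)^2}{n}$

Sum of squares within groups = SST - SSB

Variance between groups: $MSB = \dfrac{SSB}{k - 1}$

Variance within groups: $MSW = \dfrac{SSW}{n - k}$

Test statistic for a one-way ANOVA test: $F = \dfrac{MSB}{MSW}$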


APPENDIX 1 (5)

SIMPLE LINEAR REGRESSION

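Sum of squares of xy, xx, and yy:

$SS_{xy} = \sum xy - \dfrac{(\sum x)(\sum y)}{n}$, $SS_{xx} = \sum x^2 - \dfrac{(\sum x)^2}{n}$ and $SS_{yy} = \sum y^2 - \dfrac{(\sum y)^2}{n}$

Least Square Regression Line:

Y = a + bx

Least Squares Estimates of a and b:

$b = \dfrac{SS_{xy}}{SS_{xx}}$ and $a = \bar{y} - b\bar{x}$

Total sum of squares: $SST = \sum y^2 - \dfrac{(\sum y)^2}{n}$

Linear correlation coefficient: $r = \dfrac{SS_{xy}}{\sqrt{SS_{xx}\,SS_{yy}}}$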



CONFIDENTIAL 1 CS/JUL 2022/STA404

UNIVERSITI TEKNOLOGI MARA


FINAL ASSESSMENT

COURSE : STATISTICS FOR BUSINESS AND SOCIAL SCIENCES
COURSE CODE : STA404
EXAMINATION : JULY 2022
TIME : 2 HOURS

INSTRUCTIONS TO CANDIDATES

1. This question paper consists of SEVEN (7) questions.

2. Answer ALL questions on foolscap paper. Start each answer on a new page.

3. Candidates must complete this assessment within 2 hours.

4. Candidates are required to convert their completed answer into one PDF file before
submission (<FULLNAME STUDENTNO GROUP>.pdf).

5. Candidates are given 30 minutes to email their finalized and completed answer to the
respective lecturers.

6. Candidates are required to include the following details on every page of the answer script :

i) Full Name
ii) Student Number
iii) Group
iv) HP Number

7. Please check to make sure that this assessment pack consists of :

i) the Question Paper


ii) a five-page Appendix 1

8. Answer ALL questions in English.

PLEASE READ THE INSTRUCTIONS CAREFULLY BEFORE STARTING THE EXAMINATION


This assessment paper consists of 11 printed pages

QUESTION 1

A researcher is interested in studying E-wallet usage among customers of Pasaraya Intan
Belian. The researcher intends to obtain information from 50 respondents by interviewing
every 5th customer of Pasaraya Intan Belian on a particular day. The respondents are asked
about their marital status, age, the frequency of using E-wallet (never, seldom, often, very
often), the preferred E-wallet provider (Boast, Grabpay, Touch n Go, BigPay) and the last
E-wallet transaction amount (RM).

a) State the population and the sample of the study.


(2 marks)

b) Identify THREE (3) variables from the study. Hence, state their scales of measurement.

(3 marks)

c) Identify the sampling method used.


(1 mark)

d) Give ONE (1) advantage of the data collection method used by the researcher.
(1 mark)

QUESTION 2

The director of a government agency heard that their financial department is receiving an
average of 6 complaints from the customers in a week. To solve the problem, he assigned
his secretary to collect some data to see if he needs to replace the supervisor of that
department. The director will replace the supervisor if the actual mean number of complaints
towards the financial department is greater than 6 per week. The secretary gathered data
over the next 12 weeks and discovered that the mean number of weekly complaints towards
the financial department is 7 with a variance of 3.25.

a) Determine an appropriate statistical analysis to be used in this study.


(1 mark)

b) Calculate the t-statistic for this study.


(2 marks)

c) At the 5% significance level, test whether the director is going to replace the department
supervisor. Show the relevant steps.
(5 marks)
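
The one-sample t statistic in Question 2 follows directly from the formula in Appendix 1 (3). The sketch below assumes a right-tailed test (H1: mean > 6) and takes the critical value t(0.05, 11) = 1.796 from a standard t table, which is not part of the question text.

```python
import math

n = 12               # weeks observed
sample_mean = 7      # mean weekly complaints
sample_var = 3.25    # sample variance, so s = sqrt(3.25)
mu_0 = 6             # hypothesised mean under H0

se = math.sqrt(sample_var / n)        # s / sqrt(n)
t_stat = (sample_mean - mu_0) / se

t_crit = 1.796                        # assumed table value t(0.05, df = 11)
reject_h0 = t_stat > t_crit           # right-tailed test

print(round(t_stat, 4), reject_h0)
```

The statistic exceeds the critical value, so H0 is rejected and the data support replacing the supervisor.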


QUESTION 3

The scores of 18 students are summarised as below.

        N    Σx     Σx²     Mode    Q3
score   18   1116   73366   67.00   74.25

a) Calculate the mean and standard deviation.


(4 marks)

b) Compute the coefficient of skewness. Hence, comment on the shape of the distribution.

(3 marks)

c) Explain the meaning of the value for third quartile (Q3) for this study.
(1 mark)
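
The summary measures for Question 3 can be computed from the totals alone. This sketch assumes the two unlabelled figures in the summary table are Σx = 1116 and Σx² = 73366 (the column headers were lost in extraction), and it uses the mode-based form of Pearson's coefficient of skewness from Appendix 1 (1) because only the mode is given.

```python
import math

n = 18
sum_x = 1116       # assumed to be the sum of the scores
sum_x2 = 73366     # assumed to be the sum of the squared scores
mode = 67.00

mean = sum_x / n
# Sample standard deviation from the computational formula.
sd = math.sqrt((sum_x2 - sum_x ** 2 / n) / (n - 1))
# Pearson's coefficient of skewness, mode-based version.
skewness = (mean - mode) / sd

print(round(mean, 2), round(sd, 4), round(skewness, 4))
```

A negative coefficient indicates a distribution skewed to the left.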

QUESTION 4

A professor at a local university wishes to determine whether there is a significant difference in
the average final examination marks between the students who took his STA404 course
online and those who took it face-to-face. Fifteen students were randomly selected from each group and the
final examination marks were recorded. He analysed the data using IBM SPSS and
the results are as follows.

Independent Samples Test

Mark
                                                      Equal variances   Equal variances
                                                      assumed           not assumed
Levene's Test for Equality of Variances
  F                                                   2.041
  Sig.                                                .164
t-test for Equality of Means
  t                                                   -1.524            -1.524
  df                                                  W                 24.625
  Sig. (2-tailed)                                     .139              .140
  Mean Difference                                     -6.42000          -6.42000
  Std. Error Difference                               4.21320           4.21320
  95% Confidence Interval of the Difference, Lower    X                 -15.10390
  95% Confidence Interval of the Difference, Upper    Y                 2.26390
a) Are the variances of the two populations equal? Use α = 0.05.


(3 marks)

b) Find the value of W.


(1 mark)


c) Calculate the values of X and Y.


(4 marks)

d) Based on the confidence interval obtained in c), is there any evidence to support that
the average of final examination marks for students who took online class is different
from face-to-face class? Give a reason to support your answer.
(2 marks)
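
The missing values W, X and Y in the "equal variances assumed" column can be worked out from the printed mean difference and standard error. This is a sketch only; the critical value t(0.025, 28) = 2.048 is taken from a standard t table and is an assumption, not part of the SPSS output.

```python
n1, n2 = 15, 15
mean_diff = -6.42000
se_diff = 4.21320

df = n1 + n2 - 2          # W
t_crit = 2.048            # assumed table value t(0.025, df = 28)

# 95% confidence interval for the difference in means (Appendix 1 (2)).
lower = mean_diff - t_crit * se_diff   # X
upper = mean_diff + t_crit * se_diff   # Y

# If the interval contains zero, there is no evidence of a difference.
contains_zero = lower < 0 < upper

print(df, round(lower, 4), round(upper, 4), contains_zero)
```

Because the interval includes zero, the data give no evidence that the mean marks of the online and face-to-face groups differ.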

QUESTION 5

A grocery chain wants to know if three types of advertisements affect the mean sales
differently. They used each type of advertisement at four different randomly selected stores
for a month and measured the sales (RM ‘000) for each store at the end of the month. The
results are as follows.

Descriptives

Sales
Advertisement   Mean      Std. Deviation   Sum
Type 1          11.5000   3.41565          46.00
Type 2          10.0000   3.26599          40.00
Type 3          7.5000    2.51661          30.00

ANOVA

Sales
                 Sum of Squares   df   Mean Square   F   Sig.
Between Groups   A                2    16.333        D   .235
Within Groups    86.000           9    C
Total            B                11

a) Using the sum of squares between groups formula, calculate the value of A.
(3 marks)

b) Compute the values of B, C and D.


(3 marks)

c) State the null and alternative hypothesis for the above study.
(1 mark)


d) Using the p-value method, is there any evidence to support that the types of
advertisements affect the mean sales? Test at α = 0.01.
(3 marks)
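
The blank cells A to D of the ANOVA table can be filled in from the group sums and the formulas in Appendix 1 (4). The sketch below uses only figures printed in the output (group sums, four stores per group, SSW = 86.000 and Sig. = .235); the variable names are illustrative.

```python
# Group totals (Sum column) and group sizes from the Descriptives table.
totals = [46.00, 40.00, 30.00]
sizes = [4, 4, 4]
ssw = 86.000                  # within-groups sum of squares, as printed

n = sum(sizes)                # total number of observations
k = len(totals)               # number of groups
grand_total = sum(totals)

# SSB = sum(T_i^2 / n_i) - (sum x)^2 / n
ssb = sum(t ** 2 / m for t, m in zip(totals, sizes)) - grand_total ** 2 / n   # A
sst = ssb + ssw                                                               # B
msb = ssb / (k - 1)
msw = ssw / (n - k)                                                           # C
f_stat = msb / msw                                                            # D

# p-value method: reject H0 only if Sig. < alpha.
reject_h0 = 0.235 < 0.01

print(round(ssb, 4), round(sst, 4), round(msw, 4), round(f_stat, 4), reject_h0)
```

The computed MSB matches the printed mean square of 16.333, which is a useful check that the group sums were read correctly.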

QUESTION 6

The lecturers of the Mathematical Science Department at University M intended to study the
association between stress levels and the hours of online lessons in a week among
accounting students. A questionnaire which aimed to assess the stress levels was
administered to the respondents of the study. Their responses on stress levels
were categorised into low, medium, and high levels. The students were also asked to state
the number of hours of their online lessons each week, according to the following categories:
less than 16 hours, 16 to 18 hours, 19 to 21 hours and more than 21 hours. The data were
collected and the results are as follows.

Hours of Online Lessons * Stress Levels Crosstabulation

                                                  Stress Levels
Hours of Online Lessons                  Low     Medium   High    Total
Less than 16 hours    Count              17      71       18      106
                      Expected Count     15.4    69.4     21.2    106.0
16 - 18 hours         Count              18      92       37      147
                      Expected Count     21.3    S        29.5    147.0
19 - 21 hours         Count              22      97       28      147
                      Expected Count     21.3    96.2     29.5    147.0
More than 21 hours    Count              T       60       15      89
                      Expected Count     12.9    58.2     17.8    89.0
Total                 Count              71      320      98      489
                      Expected Count     71.0    320.0    98.0    489.0

Chi-Square Tests

                               Value   df
Pearson Chi-Square             4.032   U
Likelihood Ratio               3.963   6
Linear-by-Linear Association   .148    1
N of Valid Cases               489

a) Give a reason for conducting the Chi-square Test of Independence for the above study.

(1 mark)

b) Compute the value of S using expected value formula.


(1 mark)

c) Calculate the values of T and U.


(2 marks)


d) State the null and alternative hypotheses for the above study.
(1 mark)

e) At the 10% significance level, is there any sufficient evidence to conclude that the stress
level is associated with the hours of online lessons in a week among the accounting
students?
(4 marks)
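
The unknowns S, T and U in the crosstabulation follow from the row and column totals. The sketch below uses the standard expected-count formula (row total × column total ÷ grand total); the critical value χ²(0.10, 6) = 10.645 is taken from a standard chi-square table and is an assumption, not part of the output.

```python
# Totals read from the crosstabulation.
row_total_16_18 = 147      # "16 - 18 hours" row
col_total_medium = 320     # "Medium" column
grand_total = 489

# S: expected count for the (16 - 18 hours, Medium) cell.
s = row_total_16_18 * col_total_medium / grand_total

# T: observed Low count in the "More than 21 hours" row (row total 89).
t = 89 - 60 - 15

# U: degrees of freedom = (rows - 1) * (columns - 1) for a 4 x 3 table.
u = (4 - 1) * (3 - 1)

chi_sq = 4.032             # Pearson Chi-Square, as printed
chi_crit = 10.645          # assumed table value chi-square(0.10, df = 6)
reject_h0 = chi_sq > chi_crit

print(round(s, 4), t, u, reject_h0)
```

The same value of T is obtained from the Low column: 71 - 17 - 18 - 22 = 14.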

QUESTION 7

A study was conducted to investigate the influence of the fathers’ height on the sons’
height. The heights (cm) of a random sample of fathers and sons were recorded and
analysed by using IBM SPSS. The following results were obtained from
the bivariate analysis.

Model Summary

Model   R      R Square   Adjusted R Square   Std. Error of the Estimate
1       .446   .199       .065                6.071

Coefficients

                               Unstandardized Coefficients   Standardized Coefficients
Model                          B        Std. Error           Beta                        t       Sig.
1   (Constant)                 96.281   60.053                                           1.603   .160
    Heights of fathers (cm)    .432     .354                 .446                        1.220   .268

Answer the following questions based on the above output.

a) Name the independent and dependent variables involved in this study.


(2 marks)

b) State the correlation coefficient value. Hence, interpret the relationship between the
variables.
(2 marks)

c) Write the least square regression equation.


(1 mark)

d) Based on the equation in c), comment on the slope value in the context of the above
study.
(1 mark)

e) Predict the height of a son if the height of his father is 192 cm.
(2 marks)
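
The fitted line from the Coefficients table can be used for prediction as follows. This is a minimal sketch; the function name `predict_son_height` is hypothetical and the coefficients are copied from the SPSS output.

```python
# Unstandardized coefficients from the SPSS Coefficients table.
intercept = 96.281   # (Constant)
slope = 0.432        # Heights of fathers (cm)
r = 0.446            # correlation coefficient from the Model Summary


def predict_son_height(father_height_cm: float) -> float:
    """Predicted son's height (cm) from the least squares line y = a + bx."""
    return intercept + slope * father_height_cm


prediction = predict_son_height(192)
r_squared = r ** 2   # should agree with the printed R Square of .199

print(round(prediction, 3), round(r_squared, 3))
```

Note that the slope is not statistically significant in this output (Sig. = .268), so predictions from this line should be interpreted with caution.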

END OF QUESTION PAPER



APPENDIX 1 (1)

SAMPLE MEASUREMENTS

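Mean: $\bar{x} = \dfrac{\sum x}{n}$

Standard deviation: $s = \sqrt{\dfrac{1}{n-1}\left[\sum x^2 - \dfrac{(\sum x)^2}{n}\right]}$ or $s = \sqrt{\dfrac{1}{n-1}\sum (x - \bar{x})^2}$

Coefficient of Variation: $CV = \dfrac{s}{\bar{x}} \times 100\%$

Coefficient of Skewness = Pearson’s Measure of Skewness:
$\dfrac{3(\text{mean} - \text{median})}{\text{standard deviation}}$ OR $\dfrac{\text{mean} - \text{mode}}{\text{standard deviation}}$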


APPENDIX 1 (2)

CONFIDENCE INTERVAL

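Parameter and description, followed by the $(1 - \alpha)100\%$ confidence interval:

Mean $\mu$, for large samples, $\sigma^2$ unknown:
$\bar{x} \pm z_{\alpha/2}\dfrac{s}{\sqrt{n}}$

Mean $\mu$, for small samples, $\sigma^2$ unknown:
$\bar{x} \pm t_{\alpha/2}\dfrac{s}{\sqrt{n}}$ ; df = n – 1

Difference in means of two normal distributions, $\mu_1 - \mu_2$, $\sigma_1^2 = \sigma_2^2$ and unknown:
$(\bar{x}_1 - \bar{x}_2) \pm t_{\alpha/2}\, s_p\sqrt{\dfrac{1}{n_1} + \dfrac{1}{n_2}}$ ; df = n1 + n2 – 2
where $s_p = \sqrt{\dfrac{(n_1 - 1)s_1^2 + (n_2 - 1)s_2^2}{n_1 + n_2 - 2}}$

Difference in means of two normal distributions, $\mu_1 - \mu_2$, $\sigma_1^2 \neq \sigma_2^2$ and unknown:
$(\bar{x}_1 - \bar{x}_2) \pm t_{\alpha/2}\sqrt{\dfrac{s_1^2}{n_1} + \dfrac{s_2^2}{n_2}}$ ;
$df = \dfrac{\left(\dfrac{s_1^2}{n_1} + \dfrac{s_2^2}{n_2}\right)^2}{\dfrac{\left(s_1^2/n_1\right)^2}{n_1 - 1} + \dfrac{\left(s_2^2/n_2\right)^2}{n_2 - 1}}$

Mean difference of two normal distributions for paired samples, $\mu_d$:
$\bar{d} \pm t_{\alpha/2}\dfrac{s_d}{\sqrt{n}}$ ; df = n – 1 where n is no. of pairs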


APPENDIX 1 (3)

HYPOTHESIS TESTING

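Null hypothesis, followed by the test statistic:

$H_0 : \mu = \mu_0$, $\sigma^2$ unknown, large samples:
$z = \dfrac{\bar{x} - \mu_0}{s/\sqrt{n}}$

$H_0 : \mu = \mu_0$, $\sigma^2$ unknown, small samples:
$t = \dfrac{\bar{x} - \mu_0}{s/\sqrt{n}}$ ; df = n – 1

$H_0 : \mu_1 - \mu_2 = 0$, $\sigma_1^2 = \sigma_2^2$ and unknown:
$t = \dfrac{(\bar{x}_1 - \bar{x}_2) - (\mu_1 - \mu_2)}{s_p\sqrt{\dfrac{1}{n_1} + \dfrac{1}{n_2}}}$ ; df = n1 + n2 – 2
where $s_p = \sqrt{\dfrac{(n_1 - 1)s_1^2 + (n_2 - 1)s_2^2}{n_1 + n_2 - 2}}$

$H_0 : \mu_1 - \mu_2 = 0$, $\sigma_1^2 \neq \sigma_2^2$ and unknown:
$t = \dfrac{(\bar{x}_1 - \bar{x}_2) - (\mu_1 - \mu_2)}{\sqrt{\dfrac{s_1^2}{n_1} + \dfrac{s_2^2}{n_2}}}$ ;
$df = \dfrac{\left(\dfrac{s_1^2}{n_1} + \dfrac{s_2^2}{n_2}\right)^2}{\dfrac{\left(s_1^2/n_1\right)^2}{n_1 - 1} + \dfrac{\left(s_2^2/n_2\right)^2}{n_2 - 1}}$

$H_0 : \mu_d = 0$:
$t = \dfrac{\bar{d} - \mu_d}{s_d/\sqrt{n}}$ ; df = n – 1, where n is no. of pairs

Hypothesis for categorical data:
$\chi^2 = \sum \dfrac{(o_{ij} - e_{ij})^2}{e_{ij}}$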


APPENDIX 1 (4)

ANALYSIS OF VARIANCE FOR A COMPLETELY RANDOMIZED DESIGN

Let:
$k$ = the number of different samples (or treatments)
$n_i$ = the size of sample i
$T_i$ = the sum of the values in sample i
$n$ = the number of values in all samples = $n_1 + n_2 + n_3 + \dots$
$\sum x$ = the sum of the values in all samples = $T_1 + T_2 + T_3 + \dots$
$\sum x^2$ = the sum of the squares of values in all samples

Degrees of freedom for the numerator = k – 1

Degrees of freedom for the denominator = n – k

Total sum of squares: $SST = \sum x^2 - \dfrac{(\sum x)^2}{n}$

Sum of squares between groups: $SSB = \left(\dfrac{T_1^2}{n_1} + \dfrac{T_2^2}{n_2} + \dfrac{T_3^2}{n_3} + \dots\right) - \dfrac{(\sum x)^2}{n}$

Sum of squares within groups = SST - SSB

Variance between groups: $MSB = \dfrac{SSB}{k - 1}$

Variance within groups: $MSW = \dfrac{SSW}{n - k}$

Test statistic for a one-way ANOVA test: $F = \dfrac{MSB}{MSW}$


APPENDIX 1 (5)

SIMPLE LINEAR REGRESSION

Sum of squares of xy, xx, and yy:

$SS_{xy} = \sum xy - \dfrac{(\sum x)(\sum y)}{n}$, $SS_{xx} = \sum x^2 - \dfrac{(\sum x)^2}{n}$ and $SS_{yy} = \sum y^2 - \dfrac{(\sum y)^2}{n}$

Least Square Regression Line:

Y = a + bx

Least Squares Estimates of a and b:

$b = \dfrac{SS_{xy}}{SS_{xx}}$ and $a = \bar{y} - b\bar{x}$

Total sum of squares: $SST = \sum y^2 - \dfrac{(\sum y)^2}{n}$

Linear correlation coefficient: $r = \dfrac{SS_{xy}}{\sqrt{SS_{xx}\,SS_{yy}}}$



Dec 2016
Jul 2017
Jan 2018
Aug 2021
1. a) 1455.84
b) F = 7.9866
c) Reject 𝐻0

2. a) Predictor variable: Driver’s age


Response variable: number of accidents
b) r = -0.736. There is a strong negative linear relationship between drivers’ age and number of
accidents.
c) Slope = -0.216. If age increases by 1 year, the number of accidents will decrease by 0.216.
d) 𝑦̂ = 6.747 – 0.216x
e) 𝑦̂ = 0.267

3. a) Population: All customers from the 10 service counters all over Malaysia
b) Cluster sampling
c) Ordinal variable: Service quality, Ratio variable: Age / Income / Rating of service
d) (i) Descriptive (ii) Inferential (iii) Inferential

4. a) X = 1.2, Y = 0.424
b) (0.324 < μd < 2.076)
c) The therapy is effective.

5. a) Equal variance assumed.


b) Do not reject 𝐻0

6. a) 𝑥̅ = 478.6667, s = 141.5489
b) Median = 492. In 50% of the 15 states the number of laptop purchases is less than 492, and
in the other 50% of the states it is more than 492.
c)

d) Negatively skewed/Skewed to the left.


e) Pearson’s Coefficient of Skewness.

7. a) M = 6.9841
b) 0.5314
c) Do not reject 𝐻0
d) No
Feb 2022

1. a) - Response variable must be normally distributed (or approximately normally distributed)


- samples are independent
- variances of populations are equal (any two)
b) P = 2865.271, Q = 2, R = 806.886, S = 3.379
c) Do not reject 𝐻0

2. a) –
b) There is a very strong positive linear relationship between age and mileage of the cars.
c) 𝑟² = 0.951. 95.1% of the variation in mileage of the cars is explained by age of the car and the
other 4.9% is explained by other factors.
d) 𝑦̂ =3.927 + 13.997x; y = mileage, x = age
e) for every 1-year increase in age of the car, the mileage of the car will increase by 13,997km.
f) 𝑦̂ =64.114 (‘000km)

3. a) Population – all employees in the banking sector in Town A.


Sampling frame – a list of the names of all employees in the banking sector in Town A.
b) Gender – qualitative
Length of service – quantitative continuous
Type of welfare facilities – qualitative
Satisfaction towards welfare services provided by the employer – qualitative
c) Cluster sampling
d) Internet survey (google form)/electronic questionnaire, survey, questionnaire, F2F
Fast and short in time span to complete the questionnaire, cheaper, any relevant answer.

4. a) T = 0.1783
b) (3.0965, 3.7955)
c) Yes

5. a) because the observations are on the same subjects or employees


b) –
c) Reject 𝐻0

6. a) 𝑥̅ = 87.5, s = 23.1862
b) The number of cars sold in May is the most consistent.

7. a) to determine the association between two categorical variables.


b) A = 17, B = 1.838
c) Do not reject 𝐻0
July 2022
1. a) Population : All customers of Pasaraya Intan Belian
Sample : 50 customers of Pasaraya Intan Belian
b) Any 3 of the following:
Marital status – nominal
Age – ratio
Frequency of e-wallet usage – ordinal
Preferred e-wallet provider – nominal
Last e-wallet transaction amount – ratio
c) Systematic sampling
d) Advantage : High response rate or high face validity or any acceptable reason.
2. a) One sample t-test
b) 1.9216
c) Reject H0
3. a)
b) -0.3191
c) 75% of the students managed to score less than 74.25 and the other 25% of the students
managed to score more than 74.25.
4. a) Equal variance assumed
b) W = 28
c) X = -15.0486, Y = 2.2086
d)
5. a) A = 32.6667
b) B = 118.6667, C = 9.5556, D = 1.7093
c) 𝐻0 : 𝜇1 = 𝜇2 = 𝜇3 , 𝐻1 : at least two means are different
d) Do not reject 𝐻0
6. a) to determine the association between two categorical variables.
b) S = 96.1963
c) T = 14, U = 6
d) 𝐻0 : the stress level is not associated with the hours of online lessons in a week among the accounting
students,
𝐻1 : the stress level is associated with the hours of online lessons in a week among the accounting
students
e) Do not reject 𝐻0
7. a) Independent variable : Height of father (cm)
Dependent variable : Height of son (cm)
b) r = 0.446. Interpretation : There is a moderate positive linear relationship between the height of
father and the height of son.
c) 𝑦̂ =96.281 + 0.432x
d) For every 1 cm increase in the father’s height, there will be 0.432 cm increase in the son’s
height.
e) 𝑦̂ =179.225 cm
Feb 2023
1. b) 𝐻0 : 𝜇 = 700, 𝐻1 : 𝜇 > 700,
c) Yes
2. a) 𝑑̅ = 4.625
c) (1.165, 8.085)
d) Dependent/Paired sample t-test
3. a) 91.47%
b) coefficient of determination
d) 5.4045 (in RM thousand)
4. a) Population – all employees in the company, sampling frame – a complete list of all 1000
employees in the company
b) stratified random sampling
c) email questionnaire/any relevant answer
d) any advantages
5. a) 23.7
b) 𝐻0 : There is no relationship between students’ educational level and their preference for a
learning method
𝐻1 : There is a relationship between students’ educational level and their preference for a
learning method
c) Reject 𝐻0
6. a) Mean = 6.687, Std. deviation = 1.293kg
b) Median = 6.5kg
c) 50% of the children weigh less than 6.5kg and the remaining children weigh more than
6.5kg.
d) Stem and leaf plot

7. a) 𝐻0 : 𝜇1 = 𝜇2 = 𝜇3 = 𝜇4
𝐻1 : at least two means are different
b) Q = 136.5, R = 51.5
c) Reject 𝐻0
