Week 2 Lecture 1
Week 2 Lecture 1
(i) graphs,
(ii) sample statistics and
(iii) formal hypothesis tests.
We can use two types of graphs to study whether a data set is characterised by
a normal distribution: histogram and QQ-plot.
10
UoM, ECON 20003, Week 2
Ex 1: (Week 1, Ex 2)
The histogram is skewed to the right and on the QQ-plot the points are
scattered around the straight line. Hence, both graphs suggest that diff is
unlikely to be normally distributed.
UoM, ECON 20003, Week 2 7
ii. Quantifying normality with numerical descriptive measures
There are four simple numerical descriptive measures that can help us
decide whether a data set is characterised by a normal distribution: mean,
median, skewness and kurtosis.
A distribution whose tails are relatively long and thus has more outliers,
is called leptokurtic (leptos is Greek for thin, fine).
A distribution whose tails are relatively short and thus has fewer
outliers, is called platykurtic (platus is Greek for broad, flat).
i. The sample mean (12.175) is bigger than the sample median (10.500), so the sample of diff
is skewed to the right (non-normal).
ii. SK-hat = 0.556 is positive, so the sample of diff is skewed to the right. SK-hat divided by
twice of the standard error is skew.2SE = 1.151 > 1, so the distribution of diff is unlikely
normal.
iii. The estimate of excess kurtosis is K-hat – 3 = -0.548. It is negative, so the sample of diff is
platykurtic. However, the absolute value of K-hat – 3 divided by twice of the standard error is
|kurt.2SE | = 0.573 < 1, so the distribution of diff might be normal.
iv. The reported p-value of the SW test is normtest.p = 0.001 < 0.05, thus normality is rejected
at the 5% level.
Since 3 out of 4 checks cast doubt on normality, the t-test in Ex 2, Week 1 might be misleading.
UoM, ECON 20003, Week 2 14