Lab Sta680 Week 8
Data Description
Data Exploration
CHAPTER 4
4.0 Introduction
The figure displays the distribution of age groups among the 174 children. The majority of
children belong to group 3 (37.93%), followed by group 2 (31.62%), group 1 (25.88%), and
group 0 (4.60%)…
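The percentages in a frequency description like the one above can be computed as follows. This is a minimal sketch: the function name group_percentages and the ten example codes are hypothetical and are not the lab data.

```python
from collections import Counter

def group_percentages(groups):
    """Return {group code: percentage of observations}, rounded to 2 decimals."""
    counts = Counter(groups)
    n = len(groups)
    return {g: round(100 * c / n, 2) for g, c in sorted(counts.items())}

# Hypothetical age-group codes for 10 children (illustration only)
example_groups = [3, 3, 3, 3, 2, 2, 2, 1, 1, 0]
percentages = group_percentages(example_groups)
print(percentages)
```

A quick check worth doing on any such table is that the percentages add up to 100, apart from rounding.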
Table 1 provides descriptive statistics for the information level, comprehension level and
arithmetic level of the 174 children. Comprehension level has the highest mean (9.99),
followed by information level (9.47) and arithmetic level (8.98).
Table 1. Descriptive Statistics of Independent Variables
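A table of this kind could be produced with a short script such as the one below. It is a sketch only: the variable names and scores are hypothetical, and the standard deviation used is the sample version (n - 1 in the denominator), which is an assumption about how the lab table was computed.

```python
import statistics

def describe(scores):
    """Basic descriptive statistics for one variable."""
    return {
        "n": len(scores),
        "mean": round(statistics.mean(scores), 2),
        "sd": round(statistics.stdev(scores), 2),  # sample SD (n - 1)
        "min": min(scores),
        "max": max(scores),
    }

# Hypothetical scores for three variables (illustration only)
example_scores = {
    "information": [8, 10, 9, 11, 12],
    "comprehension": [10, 12, 9, 11, 13],
    "arithmetic": [7, 9, 8, 10, 11],
}
summary = {name: describe(values) for name, values in example_scores.items()}
for name, stats in summary.items():
    print(name, stats)
```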
Table 2 shows that there are no missing data in any of the three variables for the 174
children.
Table 2. Data Missingness Detection
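The missingness check can be sketched as a count of missing entries per variable. The names is_missing and count_missing and the example values are hypothetical; the sketch assumes missing values are stored as None or NaN.

```python
import math

def is_missing(value):
    """Treat None and float NaN as missing."""
    return value is None or (isinstance(value, float) and math.isnan(value))

def count_missing(data):
    """Return {variable name: number of missing values}."""
    return {name: sum(is_missing(v) for v in values)
            for name, values in data.items()}

# Hypothetical data with a few gaps (illustration only)
example_data = {
    "information": [9, 10, None, 8],
    "comprehension": [10, 11, 9, 12],
    "arithmetic": [8, float("nan"), 9, None],
}
missing_counts = count_missing(example_data)
print(missing_counts)
```

A result of zero for every variable corresponds to the situation reported in Table 2.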
Table 3 summarizes the Q-Q plots and boxplots of all independent variables. For
arithmetic level (X1), both diagrams suggest possible outliers at the higher end of the
scores. In the Q-Q plot, 4 values appear to deviate from the straight line. Meanwhile,
in the boxplot, several extreme values are observed in the upper tail.
[Figure panels: Q-Q plot for Information Level; boxplot for Information Level]
Table 4 summarizes the univariate outliers across the three independent variables.
For information level, the score of 19 belonging to child 136 has a z-value of 3.29
(more than +3). Therefore, the 95th percentile value of 14 was used to replace the original value.
…………..
Table 4. Summary of univariate outliers across three independent variables
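The outlier rule described above (flag a score whose z-value exceeds +3, then replace it with the 95th percentile) can be sketched as follows. The function names and the 25 example scores are hypothetical. The percentile uses the default method of Python's statistics.quantiles, which may differ slightly from the definition used by the software behind Table 4.

```python
import statistics

def z_scores(values):
    """Standardize values using the sample mean and sample SD."""
    mean = statistics.mean(values)
    sd = statistics.stdev(values)
    return [(v - mean) / sd for v in values]

def replace_high_outliers(values, z_cutoff=3.0):
    """Replace scores with z > z_cutoff by the 95th percentile.

    Returns (flagged indices, 95th percentile, cleaned values).
    Only the upper tail is handled, as in the text.
    """
    zs = z_scores(values)
    p95 = statistics.quantiles(values, n=20)[18]  # 19 cut points; index 18 is 95%
    flagged = [i for i, z in enumerate(zs) if z > z_cutoff]
    cleaned = [p95 if i in flagged else v for i, v in enumerate(values)]
    return flagged, p95, cleaned

# Hypothetical scores with one extreme value at the end (illustration only)
scores = [8, 9, 10, 9, 8, 10, 9, 11, 9, 8, 10, 9, 9,
          10, 8, 9, 10, 11, 9, 8, 10, 9, 9, 10, 19]
flagged, p95, cleaned = replace_high_outliers(scores)
print("flagged positions:", flagged)
print("95th percentile:", p95)
```

Note that the percentile here is computed from the data including the extreme score; whether the lab procedure does the same is not stated in the text.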
1. Specify H0 and H1
H0: Data follow a trivariate normal distribution
H1: Data do not follow a trivariate normal distribution
3. Calculate the Mahalanobis distance for each observation and compare it with the critical value
4. Altogether there are (91/174) × 100% ≈ 52.3%
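Steps 3 and 4 can be sketched in code as below. The sketch is dependency-free, so the covariance matrix and the linear solve are written out by hand. Two things are assumptions rather than part of the lab text: the cut-off is taken to be the median of the chi-square distribution with 3 degrees of freedom (about 2.366), under which roughly half of the squared distances should fall if the data are trivariate normal, and the eight data rows are hypothetical.

```python
def mean_vector(rows):
    n = len(rows)
    return [sum(col) / n for col in zip(*rows)]

def covariance_matrix(rows):
    """Sample covariance matrix (n - 1 in the denominator)."""
    n = len(rows)
    mu = mean_vector(rows)
    p = len(mu)
    return [[sum((r[i] - mu[i]) * (r[j] - mu[j]) for r in rows) / (n - 1)
             for j in range(p)] for i in range(p)]

def solve(a, b):
    """Solve a x = b by Gaussian elimination with partial pivoting."""
    n = len(b)
    m = [row[:] + [b[i]] for i, row in enumerate(a)]
    for col in range(n):
        pivot = max(range(col, n), key=lambda r: abs(m[r][col]))
        m[col], m[pivot] = m[pivot], m[col]
        for r in range(col + 1, n):
            f = m[r][col] / m[col][col]
            for c in range(col, n + 1):
                m[r][c] -= f * m[col][c]
    x = [0.0] * n
    for i in reversed(range(n)):
        x[i] = (m[i][n] - sum(m[i][j] * x[j] for j in range(i + 1, n))) / m[i][i]
    return x

def mahalanobis_sq(rows):
    """Squared Mahalanobis distance of each row from the mean vector."""
    mu = mean_vector(rows)
    s = covariance_matrix(rows)
    distances = []
    for r in rows:
        d = [x - m for x, m in zip(r, mu)]
        z = solve(s, d)  # z = S^-1 d
        distances.append(sum(di * zi for di, zi in zip(d, z)))
    return distances

# Assumed cut-off: chi-square median with 3 degrees of freedom
CHI2_DF3_MEDIAN = 2.366

# Hypothetical (information, comprehension, arithmetic) scores
rows = [(9, 10, 8), (11, 12, 9), (8, 9, 10), (10, 11, 7),
        (12, 9, 11), (7, 8, 9), (10, 13, 10), (9, 10, 12)]
d2 = mahalanobis_sq(rows)
share_below = sum(d <= CHI2_DF3_MEDIAN for d in d2) / len(d2)
print("share of distances at or below the cut-off:", share_below)
```

A useful sanity check on any implementation is that the squared distances sum to (n - 1) × p when the sample covariance matrix is used, here 7 × 3 = 21.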
H0: All variance-covariance matrices are the same across the 4 age groups
At α = 0.05, there is no evidence that the variance-covariance matrices differ across the 4 age groups. Therefore,
H0: μ1 = μ2 = μ3 = μ4
α = 0.05
ρ = 0.219