MVN: An R Package For Assessing Multivariate Normality
MVN: An R Package For Assessing Multivariate Normality
Abstract Assessing the assumption of multivariate normality is required by many parametric mul-
tivariate statistical methods, such as MANOVA, linear discriminant analysis, principal component
analysis, canonical correlation, etc. It is important to assess multivariate normality in order to proceed
with such statistical methods. There are many analytical methods proposed for checking multivariate
normality. However, deciding which method to use is a challenging process, since each method may
give different results under certain conditions. Hence, we may say that there is no best method, which
is valid under any condition, for normality checking. In addition to numerical results, it is very useful
to use graphical methods to decide on multivariate normality. Combining the numerical results from
several methods with graphical approaches can be useful and provide more reliable decisions. Here,
we present an R package, MVN, to assess multivariate normality. It contains the three most widely
used multivariate normality tests, including Mardia’s, Henze-Zirkler’s and Royston’s, and graphical
approaches, including chi-square Q-Q, perspective and contour plots. It also includes two multivariate
outlier detection methods, which are based on robust Mahalanobis distances. Moreover, this package
offers functions to check the univariate normality of marginal distributions through both tests and
plots. Furthermore, especially for non-R users, we provide a user-friendly web application of the
package. This application is available at http://www.biosoft.hacettepe.edu.tr/MVN/.
Introduction
Many multivariate statistical analysis methods, such as MANOVA and linear discriminant analysis
(MASS, Venables and Ripley (2002)), principal component analysis (FactoMineR, Husson et al. (2014),
psych, Revelle (2014)), and canonical correlation (CCA, González and Déjean (2012)), require multivari-
ate normality (MVN) assumption. If the data are multivariate normal (exactly or approximately), such
multivariate methods provide more reliable results. The performances of these methods dramatically
decrease if the data are not multivariate normal. Hence, researchers should check whether data are
multivariate normal or not before continuing with such parametric multivariate analyses.
Many statistical tests and graphical approaches are available to check the multivariate normality
assumption. Burdenski (2000) reviewed several statistical and practical approaches, including the Q-Q
plot, box-plot, stem and leaf plot, Shapiro-Wilk, and Kolmogorov-Smirnov tests to evaluate univariate
normality, contour and perspective plots for assessing bivariate normality, and the chi-square Q-Q plot
to check multivariate normality. The author demonstrated each procedure using real data from George
and Mallery (1999). Ramzan et al. (2013) reviewed numerous graphical methods for assessing both
univariate and multivariate normality and showed their use in a real life problem to check the MVN
assumption using chi-square and beta Q-Q plots. Holgersson (2006) stated the importance of graphical
procedures and presented a simple graphical tool, which is based on the scatter plot of two correlated
variables to assess whether the data belong to a multivariate normal distribution or not. Svantesson
and Wallace (2003) applied Royston’s and Henze-Zirkler’s tests to multiple-input multiple-output
data to test MVN. According to the review by Mecklin and Mundfrom (2005), more than fifty statistical
methods are available for testing MVN. They conducted a comprehensive simulation study based
on type I and type II error and concluded that no single test excelled in all situations. The authors
suggested using Henze-Zirkler’s and Royston’s tests among others for assessing MVN because of their
good type I error control and power. Moreover, to diagnose the reason for deviation from multivariate
normality, the authors suggested the use of Mardia’s multivariate skewness and kurtosis statistics test
as well as graphical approaches such as the chi-square Q-Q plot. Deciding which test to use can be a
daunting task for researchers (mostly for non-statisticians) and it is very useful to perform several
tests and examine the graphical methods simultaneously. Although there are a number of studies
describing multifarious approaches, there is no single easy-to-use, up-to-date and comprehensive tool
to apply various statistical tests and graphical methods together at present.
In this paper, we introduce an R package, MVN (Korkmaz et al., 2014), which implements the
three most widely used MVN tests, including Mardia’s, Henze-Zirkler’s, and Royston’s. In addition to
statistical tests, the MVN also provides some graphical approaches such as chi-square Q-Q, perspective,
and contour plots. Moreover, this package includes two multivariate outlier detection methods, which
are based on Mahalanobis distance. In addition to multivariate normality, users can also check
univariate normality tests and plots to diagnose deviation from normality via package version 3.7
and later. First, we discuss the theoretical background on the corresponding MVN tests. Second, two
illustrative examples are presented in order to demonstrate the applicability of the package. Finally,
we present a newly developed web interface of the MVN package, which can be especially handy for
non-R users. The R version of MVN is publicly available from the Comprehensive R Archive Network
(CRAN, http://CRAN.R-project.org/package=MVN).
Mardia (1970) proposed a multivariate normality test which is based on multivariate extensions of
skewness (ĝ1,p ) and kurtosis (ĝ2,p ) measures as follows:
1 n n 3 1 n 2
ĝ1,p = ÂÂm
n2 i=1 j=1 ij
and ĝ2,p =
n iÂ
mii (1)
=1
where mij = ( xi x̄ )0 S 1 ( x j x̄ ), the squared Mahalanobis distance, and p is the number of variables.
The test statistic for skewness, (n/6)ĝ1,p , is approximately c2 distributed with p( p + 1)( p + 2)/6
degrees of freedom. Similarly, the test statistic for kurtosis, ĝ2,p , is approximately normally distributed
with mean p( p + 2) and variance 8p( p + 2)/n.
For small samples, the power and the type I error could be violated. Therefore, Mardia (1974)
introduced a correction term into the skewness test statistic, usually when n < 20, in order to
control type I error. The corrected skewness statistic for small samples is (nk/6)ĝ1,p , where k =
( p + 1)(n + 1)(n + 3)/(n(n + 1)( p + 1) 6). This statistic is also distributed as c2 with degrees of
freedom p( p + 1)( p + 2)/6.
The Henze-Zirkler’s test is based on a non-negative functional distance that measures the distance
between two distribution functions. If data are distributed as multivariate normal, the test statistic
is approximately log-normally distributed. First, the mean, variance, and smoothness parameter are
calculated. Then, the mean and the variance are log-normalized and the p-value is estimated (Henze
and Zirkler, 1990; Johnson and Wichern, 1992; Henze and Wagner, 1997; Mecklin and Mundfrom, 2003;
Alpar, 2013). The test statistic of Henze-Zirkler’s multivariate normality test is given in equation 2.
⇣ ⌘ b2 ⇣ ⌘
1 n n n
p p
b2 Di
Dij
2 1 + b2 + n 1 + 2b2
n i Âe Âe
2 2 ( 1+ b2 ) 2
HZ = 2 (2)
=1 j =1 i =1
where
p: number of variables
✓ ◆ 1
1 n (2p + 1) p+4
b= p
2 4
0 1
Dij = ( xi xj ) S ( xi xj )
0
1
Di = ( x i x̄ ) S ( xi x̄ ) = mii
From equation 2, Di gives the squared Mahalanobis distance of ith observation to the centroid and
Dij gives the Mahalanobis distance between ith and jth observations. If data are multivariate normal,
the test statistic (HZ) is approximately log-normally distributed with mean µ and variance s2 as given
below:
p
⇣ 2
⌘
a 2 1 + pb a + p ( p + 2) b4
µ =1
2a2
⇣ ⌘ p 2a p 1 + 2pb4
3p ( p + 2) b8
s2 = 2 1 + 4b2
2
+ 2
+
a
! 4a4
p 3pb4 p ( p + 2) b8
4w b 2 1+ +
2w b 2w b 2
where a = 1 + 2b2 and w b = (1 + b2 )(1 + 3b2 ). Hence, the log-normalized mean and variance of the
By using the log-normal distribution parameters, µ and s, we can test the significance of multivariate
normality. The Wald test statistic for multivariate normality is given in equation 4.
log( HZ ) log(µ)
z= (4)
log(s)
Royston’s test uses the Shapiro-Wilk/Shapiro-Francia statistic to test multivariate normality. If kurtosis
of the data is greater than 3, then it uses the Shapiro-Francia test for leptokurtic distributions, otherwise
it uses the Shapiro-Wilk test for platykurtic distributions (Shapiro and Wilk, 1964; Royston, 1982, 1983,
1992; Johnson and Wichern, 1992; Royston, 1995; Mecklin and Mundfrom, 2005).
Let Wj be the Shapiro-Wilk/Shapiro-Francia test statistic for the jth variable ( j = 1, 2, . . . , p) and Zj
be the values obtained from the normality transformation proposed by Royston (1992).
h ⇣ ⌘i
if 4 n 11; x=n and w j = log g log 1 Wj
⇣ ⌘ (5)
if 12 n 2000; x = log(n) and w j = log 1 Wj
As seen from equation 5, x and w j -s change with the sample size (n). By using equation 5, transformed
values of each random variable can be obtained from equation 6.
wj µ
Zj = (6)
s
where g, µ and s are derived from the polynomial approximations given in equation 7. The polynomial
coefficients are provided by Royston (1992) for different sample sizes.
where e is the equivalent degrees of freedom (edf) and F(·) is the cumulative distribution function for
the standard normal distribution such that,
e = p/[1 + ( p 1)c̄]
n h io2
yj = F 1 F( Zj )/2 , j = 1, 2, . . . , p. (9)
As seen from equation 9, another extra term c̄ has to be calculated in order to continue with the
statistical significance of Royston’s test statistic given in equation 8. Let R be the correlation matrix
and rij be the correlation between ith and jth variables. Then, the extra term c̄ can be found by using
equation 10.
cij n o
c̄ = Â Â , cij (10)
i j
p ( p 1) i6= j
where ⇢
g(rij , n) if i 6= j
cij =
1 if i = j
with the boundaries of g(·) as g(0, n) = 0 and g(1, n) = 1. The function g(·) is defined as follows:
h µ i
g(r, n) = r l 1 (1 r ) µ
n
The unknown parameters µ, l, and n were estimated from a simulation study conducted by Ross
(1980). He found µ = 0.715 and l = 5 for sample size 10 n 2000 and n is a cubic function which
where x = log(n).
Similarly, the Iris data can be loaded from the R database by using the following R code:
The Iris data set consists of 150 samples from each of the three species of Iris including setosa, virginica,
and versicolor. For each sample, four variables were measured including the length and width of the
sepals and petals, in centimeters.
Example I: For simplicity, we will work with a subset of these data which contain only 50 samples of
setosa flowers, and check MVN assumption using Mardia’s, Royston’s and Henze-Zirkler’s tests.
The mardiaTest function is used to calculate the Mardia’s test multivariate skewness and kurtosis
coefficients as well as their corresponding statistical significance. This function can also calculate the
corrected version of the skewness coefficient for small sample size (n < 20).
g1p : 3.079721
chi.skew : 25.66434
p.value.skew : 0.1771859
g2p : 26.53766
z.kurtosis : 1.294992
p.value.kurt : 0.1953229
chi.small.skew : 27.85973
p.value.small : 0.1127617
Here:
g1p: Mardia’s estimate of multivariate skewness, i.e ĝ1,p given in equation 1,
chi.skew: test statistic for multivariate skewness,
p.value.skew: significance value of skewness statistic,
As seen from the results given above, both the skewness (ĝ1,p = 3.0797, p = 0.1772) and kurtosis
(ĝ2,p = 26.5377, p = 0.1953) estimates indicate multivariate normality. Therefore, according to
Mardia’s MVN test, this data set follows a multivariate normal distribution.
One may use the hzTest function in the MVN package to perform the Henze-Zirkler’s test.
HZ : 0.9488453
p-value : 0.04995356
Here, HZ is the value of the Henze-Zirkler’s test statistic at significance level 0.05 and p-value is
the significance value of this test statistic, i.e., the significance of multivariate normality. As the
p-value, which is derived from hzTest, is mathematically lower than 0.05, one can conclude that this
multivariate data set deviates slightly from multivariate normality (HZ = 0.9488, p = 0.04995). As the
p-value is very close to 0.05, researchers should also check the multivariate graphical approaches as
well as univariate tests and plots to make a more reliable decision on multivariate normality.
In order to carry out the Royston’s test, roystonTest function in the MVN package can be used as
follows:
H : 31.51803
p-value : 2.187653e-06
Here, H is the value of the Royston’s test statistic at significance level 0.05 and p-value is an approximate
significance value for the test with respect to edf. According to Royston’s test, the setosa data set does
not appear to follow a multivariate normal distribution ( H = 31.518, p < 0.001).
One can clearly see that different MVN tests may come up with different results. MVN assumption
was rejected by Henze-Zirkler’s and Royston’s tests; however, it was not rejected by Mardia’s test at a
significance level of 0.05. In such cases, examining MVN plots along with hypothesis tests can be quite
useful in order to reach a more reliable decision.
The Q-Q plot, where “Q” stands for quantile, is a widely used graphical approach to evaluate
the agreement between two probability distributions. Each axis refers to the quantiles of probability
distributions to be compared, where one of the axes indicates theoretical quantiles (hypothesized
quantiles) and the other indicates the observed quantiles. If the observed data fit hypothesized
distribution, the points in the Q-Q plot will approximately lie on the line y = x.
MVN has the ability to create three multivariate plots. One may use the qqplot = TRUE option in
the mardiaTest, hzTest, and roystonTest functions to create a chi-square Q-Q plot. We can create this
plot for the setosa data set to see whether there are any deviations from multivariate normality. Figure
1 shows the chi-square Q-Q plot of the first 50 rows of the Iris data, which are setosa flowers. It can be
seen from Figure 1 that there are some deviations from the straight line and this indicates possible
departures from a multivariate normal distribution.
●
10
Chi−Square Quantile
●
●
8
●
●
●
●
●
6
●
●
●
●
●●
● ●
●
●●
4
●
●●
●
●●
● ●●
●●
●
●
●●
●
2
●
●●
●
●
●
●
●●
●●●
●
0
0 2 4 6 8 10 12
As a result, we can conclude that this data set does not satisfy MVN assumption based on the
fact that the two test results are against it and the chi-square Q-Q plot indicates departures from
multivariate normal distribution.
As noted by several authors (Burdenski, 2000; Stevens, 2012; Kass et al., 2014), if data have a multi-
variate normal distribution, then, each of the variables has a univariate normal distribution; but the
opposite does not have to be true. Hence, checking univariate plots and tests could be very useful to
diagnose the reason for deviation from MVN. We can check this assumption through uniPlot and
uniNorm functions from the package. The uniPlot function is used to create univariate plots, such as
Q-Q plots (Figure 2a), histograms with normal curves (Figure 2b), box-plots and scatterplot matrices.
As seen from Figure 2, Petal.Width has a right-skewed distribution whereas other variables have
approximately normal distributions. Thus, we can conclude that problems with multivariate normality
arise from the skewed distribution of Petal.Width. In addition to the univariate plots, one can also
perform univariate normality tests using the uniNorm function. It provides several widely used
univariate normality tests, including Shapiro-Wilk, Cramer-von Mises, Lilliefors, and Anderson-
Darling. For example, the following code chunk is used to perform the Shapiro-Wilk’s normality test
on each variable:
1.2
● ●
●● ●
Sample Quantiles
Sample Quantiles
●
0.8
●● ●
●●●●● ●●
●●●●
0.8
● ●●●
Density
Density
●●● ●
●●
●
●●
●●
●
●●
●●
●●
●● ●●
●●
●●
●●
●
5.0
●●
●●
●●
●● ●●
●●●
●●
0.4
●●
●● ●●●●
0.4
●●
●●● ●●●●●●
●● ●
●●●●
4.5
●
● ●●
0.0
0.0
● ●
0 1 2 3 4 5 6 7
● ● ●
1.8
Sample Quantiles
Sample Quantiles
0.5
2.0
●●●● ●
●●●●●●●
Density
Density
●●●●●●●
●
●●
●●
●●
●●
●●
●●
1.4
●●
●●●
●●
●●
●●
●●
0.3
1.0
●
●●●●●●
●●●●●●●
●● ●●●●●●●●
●●●
●●
●●
●●
●●
●●
●●
●●
●●
●●
●
1.0
0.1
0.0
● ● ● ●●●
−2 −1 0 1 2 −2 −1 0 1 2 1.0 1.2 1.4 1.6 1.8 0.1 0.2 0.3 0.4 0.5 0.6
From the above results, we can see that all variables, except Petal.Width in the setosa data set, have
univariate normal distributions at significance level 0.05. We can now drop Petal.With from setosa data
and recheck the multivariate normality. MVN results are given in Table 1.
According to the three MVN test results in Table 1, setosa without Petal.Width has a multivariate
normal distribution at significance level 0.05.
Example II: Whilst the Q-Q plot is a general approach for assessing MVN in all types of numerical
multivariate datasets, perspective and contour plots can only be used for bivariate data. To demonstrate
the applicability of these two approaches, we will use a subset of Iris data, named setosa2, including
the sepal length and sepal width variables of the setosa species.
Univariate normal marginal densities are a necessary but not a sufficient condition for MVN. Hence,
in addition to univariate plots, creating perspective and contour plots will be useful. The perspective
plot is an extension of the univariate probability distribution curve into a 3-dimensional probability
distribution surface related with bivariate distributions. It also gives information about where data
are gathered and how two variables are correlated with each other. It consists of three dimensions
where two dimensions refer to the values of the two variables and the third dimension, which like
in univariate cases, is the value of the multivariate normal probability density function. Another
alternative graph, which is called the “contour plot”, involves the projection of the perspective plot
into a 2-dimensional space and this can be used for checking multivariate normality assumption. For
bivariate normally distributed data, we expect to obtain a three-dimensional bell-shaped graph from
the perspective plot. Similarly, in the contour plot, we can observe a similar pattern.
To construct a perspective and contour plot for Example 2, we can use the mvnPlot function in
MVN. This function requires an object of the "MVN" class that is the result from one of the MVN
functions. In the following codes, the object from hzTest is used for the perspective plot given in
Figure 3a. It is also possible to create a contour plot of the data. Contour graphs are very useful as they
give information about normality and correlation at the same time. Figure 3b shows the contour plot
of setosa flowers. As can be seen from the graph, this is simply a top view of the perspective plot where
the third dimension is represented with ellipsoid contour lines. From this graph, we can say that there
is a positive correlation among the sepal measures of flowers since the contour lines lie around the
main diagonal. If the correlation were zero, the contour lines would be circular rather than ellipsoid.
0.1
0.2
0.3
4.0
0.5
0.7
0.9
1.1
Sepal.Width
3.5
1.4
1.3
1.2
3.0
0.8 1
0.4 0.6
Dens
0.2
Sepa
0.1
2.5
ity
l.Wid
0.1
th
Sepal.Length Sepal.Length
Figure 3: Perspective and contour plot for bivariate setosa2 data set.
As neither the univariate plots in Figure 2 nor the multivariate plots in Figure 3 show any significant
deviation from MVN, we can now perform the MVN tests to evaluate the statistical significance of
bivariate normal distribution of the setosa2 data set.
All three tests in Table 2 indicate that the data set satisfies bivariate normality assumption at the
significance level 0.05. Moreover, the perspective and contour plots are in agreement with the test
results and indicate approximate bivariate normality.
Figures 3a and 3b were drawn using a pre-defined graphical option by the authors. However,
users may change these options by setting function entry to default = FALSE. If the default is
FALSE, optional arguments from the plot, persp and contour functions may be introduced to the
corresponding graphs.
Multivariate outliers
Multivariate outliers are the common reason for violating MVN assumption. In other words, MVN
assumption requires the absence of multivariate outliers. Thus, it is crucial to check whether the data
have multivariate outliers, before starting multivariate analysis. The MVN includes two multivariate
outlier detection methods which are based on robust Mahalanobis distances (rMD( x )). Mahalanobis
distance is a metric which calculates how far each observation is to the center of joint distribution,
which can be thought of as the centroid in multivariate space. Robust distances are estimated from
minimum covariance determinant estimators rather than the sample covariance (Rousseeuw and
Leroy, 1987). These two approaches, defined as Mahalanobis distance and adjusted Mahalanobis
distance in the package, detect multivariate outliers as given below,
Mahalanobis Distance:
The mvOutlier function is used to detect multivariate outliers as given below. It also returns a new
data set in which declared outliers are removed. Moreover, Q-Q plots can be created by setting qqplot
= TRUE within mvOutlier for visual inspection of the possible outliers. For this example, we will use
another subset of the Iris data, which is versicolor flowers, with the first three variables.
● ●
● Outliers (n=2) ● Outliers (n=0)
● Non−outliers (n=48) ● Non−outliers (n=50)
10
10
● ●
Chi−Square Quantile
Chi−Square Quantile
8
● ●
● ●
● ●
● ●
6
● ●
● ●
● ●
● ●
● ●
Quantile: 9.348
● ●
● ●
4
●
● ●
●
●●
● ●●
●
●● ●●
● ●● ● ●●
●
● ●
●
● ●
●●● ●●●
2
●●
●
● ●●
●
●
●●● ●●●
● ●
● ● ●
●
●
●● ●
●●
●●
●● ●●
●●
●● ●●
●● ●●
0
0 2 4 6 8 10 0 2 4 6 8 10
Anderson-Darling) in the Univariate analysis tab (Figure 5b) and possible multivariate outliers can
be detected by using the Mahalanobis distances via the Outlier detection tab as seen in Figure 5c.
Finally, users can assess MVN in the Multivariate analysis tab by choosing one of the MVN tests
including Mardia’s, Henze-Zirkler’s, and Royston’s and graphical approaches, including chi-square
Q-Q, perspective, and contour plots (Figure 5d). All the results can be downloaded by using download
buttons in the respective tabs.
Currently, MVN works with several statistical tests and graphical approaches. It will continue to
add new statistical approaches as they are developed. The package and the web-tool will be regularly
updated based on these changes.
Acknowledgments
We would like to thank Prof. Dr. C. Reha Alpar and the anonymous reviewer for their very useful
comments and suggestions which helped us to improve the quality of our paper. We would also like
to thank Izzet Parug Duru from Marmara University Department of Physics and Vahap Eldem from
Istanbul University Department of Biology for making the web-tool version of the package possible.
This study was supported by the Research Fund of Marmara University [FEN-C-DRP-120613-0273].
Bibliography
R. Alpar. Uygulamalı Çok Değişkenli İstatistiksel Yöntemler. Detay Yayıncılık, Ankara, Turkey, fourth
edition, 2013. ISBN 978-605-5437-42-8. [p152]
E. Anderson. The species problem in Iris. Missouri Botanical Garden Press, 23(3):457–509, 1936. [p154]
T. Burdenski. Evaluating univariate, bivariate, and multivariate normality using graphical and
statistical procedures. Multiple Linear Regression Viewpoints, 26(2):15–28, 2000. [p151, 156]
R. A. Fisher. The use of multiple measurements in taxonomic problems. Annals of Eugenics, 7(2):
179–188, 1936. [p154]
D. George and P. Mallery. SPSS for Windows step by step. Allyn & Bacon, Boston, 1999. [p151]
I. González and S. Déjean. CCA: Canonical correlation analysis, 2012. URL http://CRAN.R-project.
org/package=CCA. R package version 1.2. [p151]
N. Henze and T. Wagner. A new approach to the BHEP tests for multivariate normality. Journal of
Multivariate Analysis, 62(1):1–23, 1997. [p152]
N. Henze and B. Zirkler. A class of invariant consistent tests for multivariate normality. Communications
in Statistics - Theory and Methods, 19(10):3595–3617, 1990. [p152]
F. Husson, J. Josse, S. Le, and J. Mazet. FactoMineR: Multivariate Exploratory Data Analysis and Data
Mining with R, 2014. URL http://CRAN.R-project.org/package=FactoMineR. R package version
1.26. [p151]
R. A. Johnson and D. W. Wichern. Applied Multivariate Statistical Analysis. Prentice Hall, New Jersey,
third edition, 1992. [p152, 153]
R. E. Kass, U. T. Eden, and E. N. Brown. Analysis of Neural Data. Springer, 2014. [p156]
S. Korkmaz, D. Goksuluk, and G. Zararsiz. MVN: Multivariate Normality Tests, 2014. URL http:
//www.biosoft.hacettepe.edu.tr/MVN/. R package version 3.7. [p151]
K. V. Mardia. Measures of multivariate skewness and kurtosis with applications. Biometrika, 57(3):
519–530, 1970. [p152]
K. V. Mardia. Applications of some measures of multivariate skewness and kurtosis in testing normality
and robustness studies. Sankhyā: The Indian Journal of Statistics, Series B (1960–2002), 36(2):115–128,
1974. [p152]
C. J. Mecklin and D. J. Mundfrom. On using asymptotic critical values in testing for multivariate
normality. InterStat, 1:1–12, 2003. [p152]
C. J. Mecklin and D. J. Mundfrom. A monte carlo comparison of the type I and type II error rates of
tests of multivariate normality. Journal of Statistical Computation and Simulation, 75(2):93–107, 2005.
[p151, 153]
W. Revelle. psych: Procedures for Psychological, Psychometric, and Personality Research. Northwestern
University, Evanston, Illinois, 2014. URL http://CRAN.R-project.org/package=psych. R package
version 1.4.8. [p151]
G. J. S. Ross. MLP, Maximum Likelihood Program. Harpenden: Rothamsted Experimental Station, 1980.
[p153]
P. J. Rousseeuw and A. M. Leroy. Robust Regression and Outlier Detection. John Wiley & Sons, Inc., New
York, NY, USA, 1987. ISBN 0-471-85233-3. [p158]
J. P. Royston. An extension of Shapiro and Wilk’s W test for normality to large samples. Applied
Statistics, 31(2):115–124, 1982. [p153]
J. P. Royston. Some techniques for assessing multivariate normality based on the Shapiro-Wilk W.
Applied Statistics, 32(2):121–133, 1983. [p153]
P. Royston. Approximating the Shapiro-Wilk W test for non-normality. Statistics and Computing, 2(3):
117–119, 1992. [p153]
P. Royston. Remark as r94: A remark on algorithm AS 181: The W test for normality. Applied Statistics,
44(4):547–551, 1995. [p153]
RStudio, Inc. shiny: Web Application Framework for R, 2014. URL http://CRAN.R-project.org/
package=shiny. R package version 0.10.1. [p159]
S. S. Shapiro and M. B. Wilk. An analysis of variance test for normality (complete samples). Biometrika,
52:591–611, 1964. [p153]
J. P. Stevens. Applied multivariate statistics for the social sciences. Routledge, 2012. [p156]
T. Svantesson and J. W. Wallace. Tests for assessing multivariate normality and the covariance structure
of mimo data. In Acoustics, Speech, and Signal Processing, 2003. Proceedings.(ICASSP’03). 2003 IEEE
International Conference on, volume 4, pages 656–659. IEEE, 2003. [p151]
W. N. Venables and B. D. Ripley. Modern Applied Statistics with S. Springer, New York, fourth edition,
2002. URL http://www.stats.ox.ac.uk/pub/MASS4. ISBN 0-387-95457-0. [p151]
Selcuk Korkmaz
Department of Biostatistics
Hacettepe University
Turkey
selcuk.korkmaz@hacettepe.edu.tr
Dincer Goksuluk
Department of Biostatistics
Hacettepe University
Turkey
dincer.goksuluk@hacettepe.edu.tr
Gokmen Zararsiz
Department of Biostatistics
Hacettepe University
Turkey
gokmen.zararsiz@hacettepe.edu.tr