Little1988 Test
Little1988 Test
To cite this article: Roderick J. A. Little (1988) A Test of Missing Completely at Random for Multivariate Data with Missing Values,
Journal of the American Statistical Association, 83:404, 1198-1202
Taylor & Francis makes every effort to ensure the accuracy of all the information (the “Content”) contained in the
publications on our platform. However, Taylor & Francis, our agents, and our licensors make no representations or
warranties whatsoever as to the accuracy, completeness, or suitability for any purpose of the Content. Any opinions
and views expressed in this publication are the opinions and views of the authors, and are not the views of or endorsed
by Taylor & Francis. The accuracy of the Content should not be relied upon and should be independently verified with
primary sources of information. Taylor and Francis shall not be liable for any losses, actions, claims, proceedings,
demands, costs, expenses, damages, and other liabilities whatsoever or howsoever caused arising directly or indirectly
in connection with, in relation to or arising out of the use of the Content.
This article may be used for research, teaching, and private study purposes. Any substantial or systematic
reproduction, redistribution, reselling, loan, sub-licensing, systematic supply, or distribution in any form to anyone is
expressly forbidden. Terms & Conditions of access and use can be found at http://www.tandfonline.com/page/terms-
and-conditions
A Test of Missing Completely at Random for
Multivariate Data With Missing Values
RODERICK J. A. LITTLE*
A common concern when faced with multivariate data with missing values is whether the missing data are missing completely
at random (MCAR); that is, whether missingness depends on the variables in the data set. One way of assessing this is to
compare the means of recorded values of each variable between groups defined by whether other variables in the data set are
missing or not. Although informative, this procedure yields potentially many correlated statistics for testing MCAR, resulting
in multiple-comparison problems. This article proposes a single global test statistic for MCAR that uses all of the available
data. The asymptotic null distribution is given, and the small-sample null distribution is derived for multivariate normal data
with a monotone pattern of missing data. The test reduces to a standard t test when the data are bivariate with missing data
confined to a single variable. A limited simulation study of empirical sizes for the test applied to normal and nonnormal data
suggests that the test is conservative for small samples.
KEY WORDS: Incomplete data; Multivariate normal distribution; Nonresponse.
1198
Little: A Test c Missing Completely at Random 1199
Lemma. (a) d i is the likelihood ratio statistic for test- swept covariance matrix computed by EM (see Little and
ing Model (2) against the alternative (3). (b) Under (2), Rubin 1987).
d i has a chi-squared distribution with f = Cp, - p df. (c)
If the data are MCAR and y, has any distribution with 3.4 The Test Statistic for Monotone Missing Data
mean p and covariance matrix 2,d i is asymptotically chi- The small-sample null distribution of d2 is extremely
squared with f df. That is, for large samples the assumption complex for a general pattern of missing data, but sim-
of normality in (2) can be relaxed. plifies for particular missing-data patterns. Consider first
Proof. The log-likelihood under (3) (to within a con- the special case of p = 2 variables Yl and Y2,where Yl is
stant) is observed for all n cases and Y2is observed for n2< n cases,
say i = 1, . . . , n2. There are J = 2 patterns: Pattern 1
l(v I Yobs)
denotes cases with Yl and Y2present and pattern 2 denotes
1 ’ cases with only Yl present. Then, Y0bs.i = (yil, yiz) for i =
= -- 2
J-1
mj(yobs,, - vobs,j)Zob&(yobs.j - vobs.j)T, (4) 1, . . . , n2 and yobs,i= yil for i = n2 + 1, . . . , n; yobs,l
= (Fl, J 2 ) , the sample means of Yl and Y2based on the
where v denotes the set of means for all of the patterns.
- first n2 cases; yobs,2 = jjt, the sample mean of Yl based on
Substituting ML estimates = Y0bs.l yields l(* I yobs) =
the last n - n2 cases; Zobs.1 = 2; and 2 0 b s . 2 = 611.n u s
0. Substituting ML estimates of the means under Model
( 5 ) becomes
(2) yields 1(p*I Yobs) = -di/2. Hence the likelihood ratio
Downloaded by [Michigan State University] at 12:57 26 December 2014
where nq is the number of cases with Yq observed; k, is Table 1 shows the empirical sizes of the test for each
the number of patterns with Y, observed; SSB1, MSTl, problem. For example, 20.2 in the table indicates that for
and Fl are, respectively, the between-groups sum of squares, this problem the null hypothesis of MCAR was rejected
total mean square, and F statistic from the ANOVA of Yl at the 20% level in 202 out of 1,000 data sets. Superscripts
on all kl patterns; SSB2I , MST, and F2 I are the between- a and b indicate that the empirical size differs significantly
groups sum of squares, total mean square, and F statistic from the nominal size at, respectively, the 1% and 5%
from the analysis of covariance of Y2 on all k2 patterns levels of significance.
with Y2observed, adjusting for Y l ;and the remaining terms The empirical sizes do not differ significantly from nom-
are defined similarly. Under normality and MCAR, each inal levels for the data sets with 80 observations. For the
of the contributions in (8) is independent, so the small- smaller sample sizes the test appears overly conservative,
sample null distribution of d2 is a sum of functions of particularly at the lower nominal levels. An encouraging
independent F statistics. In large samples, these functions feature of the results is the relatively small impact of non-
become chi-squared distributed, and d2has the asymptotic normality (in the form of long tails or skewness) on the
chi-squared distribution discussed in Section 3.3. empirical sizes, suggesting a fair degree of robustness for
the method. This reinforces the fact that asymptotically
3.5 Simulation Study the test does not require normality.
A limited simulation study was conducted to examine These results on size should be treated as suggestive
Downloaded by [Michigan State University] at 12:57 26 December 2014
the empirical size of tests based on d2, for incomplete rather than definitive, given the modest scope of the sim-
multivariate normal, skewed (lognormal), and long-tailed ulation study. Power calculations are not included, since
(multivariate r3) data sets with n = 20, 40, or 80 obser- the power depends greatly on what departures from the
vations. To generate the multivariate normal and multi- MCAR assumption are contemplated. For example, in the
variate r data sets, observations i on p = 4 variables Y l , bivariate monotone case of Section 3.4, power may be
Y2,Y,, and Y, were generated from independent standard high if missingness of Y, depends on the fully observed
normal deviates Z1,Z2,Z3,and Z4,using variable Y l . On the other hand, if missingness of Y2de-
pends on Y,, then the test statistic only has good power
Yll = z 1 , / V z ,
if Yl and Y2are highly correlated. For a general pattern
of missing data, the global nature of the test leads to a
y, = z l l m + z 1 2 m , loss of power relative to others that test specific alternative
y13 = Z I l + Z~ I 2 2 / - l / q r + z 1 3 m ,
hypotheses, such as an alternative that specifies that miss-
ingness is a function of a particular variable. In most cir-
and cumstances such specific alternative hypotheses are hard
to formulate with much certainty, so this loss of power
y,, = - Z l 1 +~ z 1 2 m may be tolerated in the interests of achieving a single
+ z 1 3 W z14-. + global test statistic. Since power may be low, it is prudent
to keep in mind that accepting the null hypothesis of MCAR
For the multivariate normal data sets qi = 1 for all i; for does not imply its correctness.
the multivariate r3 data sets qi equals a chi-squared deviate
with 3 df. The resulting data sets all have mean vector (0, 4. DISCUSSION
0, 0, 0) and variances (1, 1, 1, 1). The correlations are p12 I conclude by discussing some limitations of the pro-
= .9487, pi3 = .4472, p23 = .5243, = w.7746, p24 = posed test and outlining some alternative procedures.
- .5767, and pM = .0763, thus encompassing a range of Close relatives of the test statistic d2 in Equation ( 5 ) are
values. The lognormal data sets were obtained by expo- d:, where 3 is replaced by the ML estimate of Z under
nentiating the values (yij) generated for the multivariate the alternative model (3), and d i , the likelihood ratio test
normal case.
Missing data were then created in the data set by an Table 1. Percent Empirical Sizes for a Test of the MCAR
MCAR mechanism such that for every data set exactly Assumption, From N = 1,000 Simulated Data Sets
40% of the cases were complete (i.e., had the pattern ri
~~~
statistic obtained by subtracting twice the maximized log- lett’s test for comparing dispersions, I expect the test to
likelihoods for Models (3) and (2). These statistics have be sensitive to departures from the normality assumption,
the same asymptotic null distribution as d2. They both and even under normality the asymptotic null distribution
require ML estimates to be calculated under both (2) and seems unlikely to be reliable unless the sample size is large.
(3), and hence involve a bit more computation.
[ReceivedJuly 1986. Revised March 1988.1
We have seen that the test based on d2 is valid asymp-
totically without normal assumptions; however, it seems
most appropriate when the variables are quantitative. If REFERENCES
they are categorical, with data forming a contingency table Amemiya, T. (1984), “Tobit Models: A Survey,” Journal of Economet-
with supplemental margins, more appropriate large-sam- rics, 24, 3-61.
ple tests of the MCAR assumption can be based on the Anderson, T. W. (1957), “Maximum Likelihood Estimators for the
Multivariate Normal Distribution When Some Observations Are Miss-
chi-squared statistics in Fuchs [1982, eqs. (4.1) and (4.2)], ing,” Journal of the American Statistical Association, 52, 200-203.
with estimates of the cell probabilities based on the sat- Cramer, H. (1946), Mathematical Methods in Statistics, Princeton, NJ:
urated model. Princeton University Press.
Dixon, W. J. (ed.) (1983), BMDP Statistical Software, Berkeley: Uni-
An important limitation of d2, d:, and di is that they versity of California Press.
are derived from an alternative hypothesis (3) that allows Efron, B., and Hinkley, D. V. (1978), “Assessing the Accuracy of the
missingness to affect the means but constrains the vari- Maximum Likelihood Estimator: Observed Versus Expected Infor-
mation,” Biometrika, 65, 457-487.
ances and covariances to be the same for all patterns. A
Downloaded by [Michigan State University] at 12:57 26 December 2014