Bayesian Statistics For Small Area Estimation
Bayesian Statistics For Small Area Estimation
Abstract. National statistical offices are often required to provide statistical infor-
mation about characteristics of the population, such as mean income or unem-
ployment rate, at several administrative or small area levels. Having good area
level estimates is important because policies will often be based on this type of
information.
In this paper we describe how Bayesian hierarchical models can help in the task of
providing good quality small area estimates. Starting from direct estimates ob-
tained from survey data, we describe a range of Bayesian hierarchical models
that incorporate different types of random effects and show that these give im-
proved estimates. Models that synthesise individual and aggregated information
are considered as well. Finally, we highlight some additional applications that
further exploit the estimates produced, such as the classification and ranking of
areas and how to approach the problem of having no direct information in several
areas.
1. Introduction
Small Area Estimation (Rao, 2003) tackles the important statistical problem of
providing reliable estimates of a target variable in a set of small geographical
areas. The main difficulty is that it is nearly always impossible to measure
the value of the target variable for all the individuals in the areas of interest
and, hence, a survey is conducted to obtain a representative sample (Cochran,
1977). Surveys are often designed to include different sorts of data in order to
E-mail: v.gomezrubio@imperial.ac.uk
2 V. Gómez-Rubio et al.
make the best use of them. The information collected is used to produce some
direct estimate of the target variable that relies only on the survey design and
the sampled data. Unfortunately, sampling from all areas can be expensive in
resources and time. A more practical approach is to select a subset of areas
where the survey is conducted; estimates for all areas are then produced using
the sample and some additional auxiliary information which must be available
for all small areas (Särndal et al., 1992).
Regression models are often used in this context to provide estimates for
the non-sampled areas. The model is fitted on data from the sample and used
together with additional information from auxiliary data to compute estimates
for non-sampled areas. This method is often known as synthetic estimation
(Gonzalez, 1973).
Regression estimators can be extended to include random effects, which can
be estimated by Empirical Best Linear Unbiased Predictors (usually known as
EBLUP estimators, see Robinson, 1991). Jiang and Lahiri (2006) provide a re-
view on this topic for small area estimation. In addition to the usual covariates,
mixed-effect models include random effects to model different types of individ-
ual variation and space and time interactions (Singh et al., 2005; Petrucci and
Salvati, 2005). Different types of responses can be modelled with this frame-
work, but the computations for non-Normal responses can be quite involved
and techniques such as Penalised Quasi-Likelihood are required (Breslow and
Clayton, 1993). EURAREA Consortium (2004) make a summary of direct and
model-based likelihood-based small area estimators for several target variables
for different national datasets from different countries.
The potential of spatial and spatio-temporal modelling in Small Area Es-
timation has been addressed by Jiang and Lahiri (2006) and the discussion
therein. The two main benefits that they point out are the possibility of borrow-
ing information from neighbouring areas when estimating spatially-correlated
random effects and improving estimation in non-sampled areas. Petrucci and
Salvati (2005) have also addressed the use of spatial random effects to produce
improved Small Area estimates.
Bayesian alternatives of both the non-spatial and spatial mixed effects mod-
els for Small Area Estimation have been proposed (see, for example, Datta and
Ghosh (1991), Ghosh et al. (1998), and Rao (2003) for a recent review). In
particular, Bayesian small area spatial modelling has already been successful in
other similar contexts, such as the estimation of the rate of disease in different
geographic regions (Best et al., 2005). Complex mixed-effects and correlation
between areas can be easily handled and modelled hierarchically in different
layers of the model. For example, Besag et al. (1991) propose a spatial model
in which the area variation not explained by the available covariates is split
into two components: one which is unstructured (and independent) for each
Bayesian Statistics for Small Area Estimation 3
area and another one which reflects likely correlation between neighbouring re-
gions. Note that disease mapping applications are based on data available on
disease status for all individuals in every area, whilst Small Area Estimation
is usually based on a survey which provides access to a limited sample of the
population under study and does not cover all areas, hence creating its own set
of methodological issues.
Although implementation of complex Bayesian models requires computa-
tionally intensive Markov Chain Monte Carlo simulation algorithms (Gilks et al.,
1995), there are a number of potential benefits of the Bayesian approach for
small area estimation. It offers a coherent framework that can handle different
types of target variable (e.g. continuous, dichotomous, categorical), different
random effects structures (e.g. independent, spatially correlated), areas with no
direct survey information, models to smooth the survey sample variance esti-
mates, and so on, in a consistent way using the same computational methods
and software whatever the model. Uncertainty about all model parameters is
automatically captured by the posterior distribution of the small area estimates
and any functions of these (such as their rank), and by the predictive distribu-
tion of estimates for small areas not included in the survey sample. Bayesian
methods are particularly well suited to sparse data problems (for example, when
the survey sample size per area is small) since Bayesian posterior inference is
exact (modulo Monte Carlo simulation error associated with the estimation algo-
rithms) and does not rely on asymptotic arguments. The posterior distribution
obtained from a Bayesian model also provides a much richer output than the
traditional point and interval estimates from a corresponding likelihood-based
model. In particular, the ability to make direct probability statements about
unknown quantities — for example, the probability that the target variable ex-
ceeds some specified threshold in each area — and to quantify all sources of
uncertainty in the model, make Bayesian small area estimation well suited to
informing and evaluating policy decisions.
In this paper, we aim to illustrate some of these points by considering a
range of Bayesian hierarchical models for small area estimation that incorporate
different types of spatial and non-spatial random effects structures. We compare
the predictive accuracy of the small area estimates produced by each model, and
focus in particular on two common problems faced by statistical bureaus when
dealing with Small Area Estimation: (1) ranking and classification of areas,
and (2) providing estimates in areas that have been left out of the survey. In a
Bayesian framework, we will tackle the ranking problem by means of posterior
ranks and posterior probabilities of being among a certain proportion of areas,
whilst for the problem of missing direct information we will rely on observed area
level covariates and the use of spatial random effects at different administrative
levels to predict the missing data.
4 V. Gómez-Rubio et al.
We consider the case in which the response variable is Normal, but all these
techniques can be easily extended to the more general case. The use of the
models and methods that we are proposing is illustrated through an example
based on the average equivalised income per household in the 284 municipalities
of Sweden. This data set has also been considered by EURAREA Consortium
(2004), where many different types of likelihood-based estimators were com-
puted and compared.
The paper is structured as follows. In Section 2 an introduction to typical
survey design and synthetic estimators for Small Area Estimation is presented.
Section 3 includes a general description of Bayesian methods and different types
of models for Small Area Estimation. How to provide estimates for areas with
no direct observations is considered in Section 4. The classification of areas for
policy making is described in Section 5. An example with real data is shown in
Section 6. Finally, some conclusions and remarks are presented in Section 7.
Yˆ D,i = µi + ei (1)
where µi is the true area mean and ei is a random term which reflects the vari-
ation of a direct estimator Yˆ D,i around the mean and which we will assume
Normally distributed with zero mean and variance Vi2 . In practice, Vi2 is re-
placed by an estimate V̂i2 . Here we have taken V̂i2 equal to the variance of the
direct estimator (often termed the design variance; Särndal et al., 1992). Al-
ternatively, area level variances can be smoothed, for example using generalized
variance functions (Jiang and Lahiri, 2006, see)[page 6], which should yield more
stable estimates when the within-area sample size is small. Note that, rather
than follow this 2-stage approach, in Section 3, we propose a Bayesian model
with a hierarchical structure on both the small area means and variances, that
smooths the sample variances as an integral part of the model fitting.
Standard regression techniques can then be employed to model the mean
µi on area level covariates X i . These covariates are the area average of the
Bayesian Statistics for Small Area Estimation 5
PNi
individual values xij over the population: X i = j=1 xij /Ni .
Hence, the mean is modelled as
µi = α∗ + Xi β ∗
and the coefficients α∗ and β ∗ can be estimated using typical model fitting
algorithms.
Note that when the direct estimators are missing for some areas and only the
values of X i are available, a synthetic estimator (Rao, 2003) can be computed
as
where the value of the regression coefficients α̂∗ and β̂ ∗ are computed using the
data available from other areas.
Alternatively, a unit level version of this model can be fitted by regressing
yij on the unit level covariates xij
Note that the coefficients for this model may be different from those estimated
for the aggregate model, hence the different notation for the regression coef-
ficients (see Section 2.2). The area level average can then be computed by
averaging over all the values of µij :
Ni
X µij
µi = = α + Xi β (3)
j=1
Ni
Hence, a small area estimate that combines aggregated and individual infor-
mation by means of the fitted values of α and β from the unit level model
is
µ̂s,i = α̂ + X i β̂ (4)
where µij is the true value of the target variable of individual j in the sample
from area i and σe2 reflects the individual sampling variation which we will
assume the same in all areas for the time being (i.e., σi2 = σe2 ).
Although covariates are typically introduced to model dependence between
the mean and some explanatory factors, it is likely that some residual variance
will remain unexplained by the covariates. This can be accounted for effectively
by including random effects in the model. These random effects capture the
unobserved patterns such as spatial dependence and between area variation. In
particular, we consider the following random effects regression model for the
unit level means:
Here α is the intercept of the model, β is the vector of coefficients of the co-
variates xij , ui is a random effect which accounts for area level variation and is
distributed independently as
where δi is the set of neighbours of area i and |δi | the number of neighbours.
In addition, we have added the constraint that the sum of the values of all
the random effects vi is zero to make the intercept and the random effects
identifiable (see, Banerjee et al., 2004, pages 163–164).
As an alternative to this conditional specification, we can model the mean
µij by including spatial random effects wi which are correlated according to the
distance dkl between two areas k and l (Diggle et al., 1998):
2
w|Σ ∼ M V N (0, Σ); Σkl = σw exp{−(φdkl )} (10)
2
σw is the variance at any given point and φ is a smoothing parameter that
controls the scale of the correlation between areas.
Unlike model (7), we do not include a separate independent random effect ui
in model (9). The motivation for doing so in model (7) lies with the fact that the
spatial dependence of the intrinsic CAR random effects (8) is pre-determined by
the neighbourhood structure. Hence unstructured effects are also included to
allow for Bayesian learning about the strength of spatial dependence in the data,
via the relative contribution of the ui and vi to the posterior (Besag et al., 1991;
Eberley and Carlin, 2000). In the case of model (9), Bayesian learning about
the strength of spatial dependence of the wi random effects takes place directly
8 V. Gómez-Rubio et al.
effects. Kleffe and Rao (1992) approximate the MSE of this unit level model
with different area level variances with common prior distribution.
of the random effects, but we have not found differences in the small area esti-
mates.
r
(k)
ˆ −Y ) 1
PK ˆ (k) − Y )2
k=1 (Y i
PK
1 k=1 (Y i i K i
RBi = , RRM SEi =
K Yi Yi
where k indexes the survey samples.
As global measures, we can take the Mean Absolute Relative Bias (MARB)
and the Mean Relative Root Mean Square Error (MRRMSE):
m m
1 X 1 X
M ARB = |RBi |, M RRM SE = RRM SEi (14)
m i=1 m i=1
where m is the total number of areas. In this way, we can assess if the estimators
are biased and if they are variable. Either of these latter two criteria can be
used to decide on the best model. Better estimators will be those which produce
smaller values of the MARB and MRRMSE.
Bayesian Statistics for Small Area Estimation 11
We can also calculate the Relative Bias (RBvar) and Relative Root Mean
Square Error (RRMSEvar) of the variance estimates in a simulation study, to
asses how well they estimate the true error of the small area estimator. They
are defined as
PK (k)
1 k=1 (var(Y
ˆ − EM SEi )
i )
RBvari = ,
K EM SEi
q
1
PK (k)
K k=1 (var(Y
ˆ i ) − EM SEi )2
RRM SEvari =
EM SEi
where EM SEi is the true empirical error
K
1 X ˆ (k)
EM SEi = (Y i − Y i )2 .
K
k=1
The Mean Absolute Relative Bias (MARBvar) and Mean Relative Root Mean
Square Error (MRRMSEvar) of the variance estimates are then defined analo-
gously to (14).
In order to reduce costs, surveys are often carried out in a subset of areas by
taking a sample of the population which is representative of the whole study
region. This means that direct estimates can only be provided for a few areas
and that estimates for the out-of-sample areas must be obtained by other means.
Hence, we can split the areas between in-sample and off-sample areas, according
to whether or not they have been included in the survey.
In area level models we will therefore have missing the values of Yˆ D,i and
2
σ̂i for the off-sample areas, whilst in unit level models we will miss the values
yij and xij from off-sample areas. However, we will assume that the area level
covariates X i are available for all areas since they are obtained from a different
source to the survey. This is a key (but realistic) assumption in order to be able
to provide small area estimates for all areas.
Note that here we only consider the problem of data that are missing by
design of the survey (see section 6 for further particulars). Non-response in
surveys, for example, is another common source of missing data in Small Area
Estimation, but we do not address this issue here.
A simple approach to tackle the problem of not having direct observations
in several areas is to employ a regression model (such as described in Section
12 V. Gómez-Rubio et al.
2.1) on some covariates, which is fit using survey data (from in-sample areas).
Estimates for the off-sample areas are imputed by relying on the estimated model
and additional information (e.g. area level covariates). The main drawback of
this method is that the imputed values do not account for the uncertainty in
the estimation of the regression coefficients or spatial correlation between the
target variable in different areas.
Yˆ D,s
" # " # " #
Xs zs
es
=α+ X β+ z + (15)
µs s s 0
where z represents spatially correlated random effects (which can have different
specifications, as discussed below), the subindex s refers to the observed (in-
sample) areas and s to the unobserved (off-sample) areas. The value of zs can
be estimated by exploiting the (spatial) correlation with zs .
M V N (Σss Σ−1 T
ss ws , Σss − Σss Σss Σss )
as explained in Diggle et al. (1998). The estimate of ŵs is then the posterior
expectation of the mean of the above conditional MVN, i.e. E·|y [Σss Σ−1
ss ws ].
The final estimator for the set of areas in s becomes
Yˆ s = α̂ + β̂X s + ŵs
Bayesian Statistics for Small Area Estimation 13
In principle, the model can still be fitted, even if there are areas with all neigh-
bours missing (provided the area is not an island), but if too many of them
are missing, the estimation procedure will be very unstable. We discuss some
possible remedies to this problem in the next section.
14 V. Gómez-Rubio et al.
Non-spatial random effects u can be included in the model as well, but all the
elements of us must be set to zero to avoid problems of identifiability with vs .
In a sense, these unstructured random effects measure the discrepancy between
the response (observed data) and the predicted data (fixed and spatial random)
effects in the model. If we lack the response, given that the random effects u are
not correlated across areas, it is impossible to estimate the scale of the values
in us .
µi = α + βX i + ui + rk(i) (16)
where index k(i) represents the higher administrative level index of area i. A
similar formulation can be proposed for µij to create a unit level model. rk(i)
is given a spatial structure to capture the large scale spatial variation. A CAR
prior similar to that assigned to vi is assumed, but adjacency is defined according
to the higher administrative level and it is assumed that at least one area in each
region is sampled. If the spatial pattern at this higher level is too weak or non-
existent, the random effects rk(i) may be assigned a non-spatial distribution,
such a N (0, σr2 ).
Policy makers are often interested in targeting areas with particular needs in
order to conduct specific actions. For example, areas with the highest unem-
ployment rates can be selected to carry out training programmes to improve
the possibilities of finding a job or becoming self-employed. In statistical terms,
this is the same as selecting the areas which are in the (lower or upper) tails of
the distribution of the area level target variable using some form of rank-based
estimator.
Bayesian Statistics for Small Area Estimation 15
We note the advice of Goldstein and Spiegelhalter (1996) and consider the
ranking of the areas with care. As they argue, comparison of areas is difficult
due to the (often considerable) uncertainty in estimating the ranks and the
results must be taken more as a guidance than a definitive classification of the
areas. Nevertheless, a number of different approaches to ranking a set of units
(small areas, hospitals, schools etc.) have been proposed in the literature, and
we consider how some of these can be applied in the present context.
The first approach consists of estimating the posterior distribution of each
area’s rank, Ri . This is straightforward to do using Bayesian sampling-based
(MCMC) methods, and just involves ranking the values of the predicted target
variable (µi ) across areas for every posterior sample. The mean of the posterior
distribution of each area’s rank, R̂i = E·|y [Ri ], has been shown to be the optimal
point estimate under squared error loss (Shen and Louis, 1998). Bell (2004)
offers some insight into this approach and its applications to poverty mapping
in the U.S. in an unpublished work. Note that the posterior mean of the ranks
will not necessarily be the same as the ranks based on the posterior mean of
the target variable itself, since the rank is a non-linear transformation of the
latter. This inconsistency may be seen as an undesirable feature if both the
posterior means of the target variable and posterior means of the ranks are to
be reported.
To address this problem, Shen and Louis (1998) considered the problem of
producing a set of area-specific estimates that satisfy the ‘triple goals’ of being
good estimates of (1) the true histogram (distribution) of parameters across
areas; (2) the true ranks; (3) the true parameter values. They note that no single
set of estimates can simultaneously optimise all three goals, but that producing
a single set of estimates with good performance on each criterion is important in
many policy settings. Their proposed estimator produces point estimates of the
area-specific target parameter that optimise the first two criteria (in particular,
the rank of these point estimates is equivalent to the posterior mean ranks), and
that produce estimates of the individual area-specific parameters that, although
not as good as the posterior means of the target parameter, generally produce
acceptable estimates.
Other authors who have worked on trying to produce an ensemble of small
area estimates whose empirical distribution (histogram) is not overshrunk, in
order to provide good rankings, include Louis (1984); Spjøtvoll and Thomsen
(1987); Lahiri (1990) and Ghosh (1992). In all these cases, the estimates are
based on some form of ‘constrained’ hierarchical or empirical Bayes estimation.
The point estimates are optimised under a particular loss function (usually
squared error loss) subject to constraints on the mean and variance of the en-
semble of estimators — usually that they match the posterior expectation of the
mean and variance, respectively, of the true ensemble distribution. However,
16 V. Gómez-Rubio et al.
Shen and Louis (1998, 2000) have shown that, although constrained estimators
yield improved estimates of the histogram of small area target values, they al-
ways produce rankings equivalent to the ranking of the posterior means of the
target variables, which are not optimal. On the other hand, triple goal esti-
mates have similar or improved performance in terms of estimating both the
ranks and the shape of the empirical distribution. Relatively little work has
been done on evaluating posterior mean ranks or constrained or triple goal es-
timators in the context of spatial hierarchical models, although work by Conlon
and Louis (1999) suggests that performance should be similar to that found for
non-spatial settings. On a cautionary note, a number of authors have found
constrained and triple goal estimates to be sensitive to model mis-specification
(Shen and Louis, 2000; Paddock et al., 2006).
Ranking is relative to the other areas and, for example, will not determine
whether a particular area reaches the desired level of income or wealth. For this
purpose, a different approach can be followed by defining a specific threshold,
T , and estimating the probability that the target variable in each area is above
(or below) it. For example, when estimating the average income per household,
we can set this threshold to the poverty line (see section 6.2 for details). This
‘exceedence probability’ is simply the tail-area probability of the posterior distri-
bution of the target variable in each area that is above (or below) the threshold,
i.e.
pex
i (T ) = pr·|y (µi ≥ T ) (17)
These exceedence probabilities can either be used directly, or can be used to
rank areas. Morris and Christiansen (1996) and Normand et al. (1997) propose
a similar approach to ranking the performance of different hospitals and the
identification of those that might be under-performing.
A difficulty with the previous approach is that it can be problematic to set a
suitable threshold because it may have to be chosen subjectively. An alternative
approach which combines the ideas of ranks and exceedence probabilities is to
estimate the probability of being ranked in the top (or bottom) Q% of areas.
Different values of Q can be used at the same time if required. These ‘percentile
probabilities’ are computed by first converting the ranks to percentiles as follows
Pi = Ri /(m + 1)
(where m is the total number of small areas) and then calculating the tail
area probability of the posterior distribution of Pi that is above (or below) the
threshold Q, i.e.
ppc
i (Q) = pr·|y (Pi ≥ Q) (18)
Again, these percentile probabilities can either be used directly, or used to
compute a final ranking of the areas.
Bayesian Statistics for Small Area Estimation 17
Lin et al. (2006) consider the theoretical and empirical performance of vari-
ous ranking criteria, including R̂i (or its percentile equivalent, P̂i = R̂i /(m+1)),
pc pc
P̂iex (T ) = rank(pex
i (T ))/(m + 1) and P̂i (Q) = rank(pi (Q))/(m + 1), in the con-
text of normal-normal and Poisson-gamma hierarchical models for comparing
mortality estimates in around 4000 renal dialysis centres in the US. They con-
clude that the optimal estimator will depend on the specific purpose for which
the ranking or classification of units is to be used, but that in most applications,
the above three criteria perform well. They also note that all the ranking esti-
mates can perform poorly if the underlying model parameters (equivalent to our
small area estimates µi ) are imprecisely estimated. We illustrate and compare
these various criteria in the context of small area estimation in the example in
Section 6.
EqInc = NetIncome /(1+ 0.5*(# of adults -1) + 0.3*(# of children under 16))
for each household include the equivalised income plus the five covariates al-
ready described. We have simulated 100 replicated surveys from the complete
population survey in order to evaluate the sample-to-sample variability of the
small area estimates. A discussion of the impact of the sample size used in the
survey is given in Section 6.3.
In addition to the different survey data, the area means (or proportions) of
the five covariates are available for every municipality. They have been com-
puted using the whole of the records from the LOUISE register, but other
sources of information could have been used as well.
The adjacency of the areas has been computed considering that two regions
are neighbours if they have a common boundary. Furthermore, the centroids
of the areas (not population-weighted but computed from the boundaries) are
also available. This information will be used when considering the correlation
structure between the spatial random effects as in (10).
Comparing the different Bayesian estimators, the biggest impact comes from
the way the sampling variances are treated. Unit level model 1, which assumes
a common variance for all areas, has higher bias and mean square error than
the other models which all assume area-specific variances. When the sample
size per area is sufficiently large, as in the present case, the design variance is a
good estimate and so there is little to choose between treating the area-specific
variances as fixed (as in our area-level model) or modelling them independently
or hierarchically (unit level models 2 and 3 respectively). A different picture
emerges in the sparse data case (see Section 6.3).
Inclusion of spatial random effects with a CAR structure (vi ), in combina-
tion with unstructured random effects, tends to reduce bias and mean square
error of the small area estimates compared to models with only unstructured
or only spatial random effects. Models including spatial random effects based
on the distance between areas (wi , equation 10) do not perform well compared
to the models with spatial random effects based on the CAR specification (re-
sults not shown). In practice, the performance is very similar to the model
with unstructured random effects ui . We believe that this happens because
the stationarity assumption underlying the former specification does not hold.
Another drawback of using this model is that fitting takes much longer than
with other spatial models.
Table 1 also shows that, broadly speaking, the ranking of models by DIC is
similar to that based on MARB or MRRMSE, indicating that DIC can be useful
for model selection in a real application setting (although note that the latter
cannot be used to compare models based on different data, i.e. area versus unit
level models).
The right half of Table 1 summarises the bias and mean square error of
the variance estimates from the different models. For all models, the variance
estimates have much higher relative bias and mean square error than do the
small area estimates themselves. Nevertheless, most of the Bayesian variance
estimates (except unit level model 1 and the unit level model 2 with only spa-
tial random effects) are substantially better than the corresponding synthetic
variance estimates, and comparable or better than the variance of the direct
estimator. In contrast to the findings for the small area point estimates, how-
ever, the Bayesian models with spatial random effects only tend to perform
poorly relative to models including unstructured random effects (either alone
or in combination with spatial effects). The variance estimates from the former
models tend to be systematically smaller than those from the latter (data not
shown), suggesting that while borrowing of information from neighbouring areas
can improve the small area estimates, it tends to over-estimate their precision.
Thus models with both unstructured and spatial random effects may represent
the best compromise in terms of producing accurate small area estimates which
20 V. Gómez-Rubio et al.
Table 1. Summary of the performance of the small area estimates and their variance estimates
for different models fitted to the 1% survey samples. The values are averaged over the n
replicates.
Model MARB MRRMSE DIC∗ MARBvar† MRRMSEvar† Cov. n
×100 ×100 ×100 ×100 rate]
Direct 0.5 6.5 — 68.7 68.8 0.94 100
Area level
Synthetic 3.5 3.6 — 89.3 89.3 0.17 100
Bayesian
ui 2.0 3.1 3246 58.8 69.1 0.93 100
vi 2.1 2.8 3279 80.2 89.7 0.91 100
ui + vi 1.8 2.9 3232 60.9 72.0 0.93 96
Unit level
Synthetic 6.0 6.0 — 94.6 99.4 0.06 100
Bayesian
Model 1
ui 3.0 4.3 495714 122.8 185.4 0.94 98
vi 3.7 4.3 495825 109.4 144.6 0.85 98
ui + vi 2.8 3.6 490784 109.6 130.0 0.93 72
Bayesian
Model 2
ui 2.0 3.5 474461 42.4 61.5 0.93 100
vi 2.8 3.3 474682 102.5 124.6 0.88 100
ui + vi 1.9 3.2 474122 49.4 69.6 0.91 75
Bayesian
Model 3
ui 2.0 3.5 474118 43.9 64.0 0.93 94
vi 2.0 3.0 474077 62.3 72.6 0.90 94
ui + vi 1.9 3.2 474363 56.2 76.5 0.94 76
∗
Area and unit DICs are not comparable because they are computed using
different data.
†
Variance estimate for direct estimator is the design variance; for synthetic estimator it is
V ar(X i β̂); for Bayesian estimators it is the posterior variance of µ̂i .
]
95% intervals for direct and synthetic estimators are the 95% confidence intervals
assuming normality; for the Bayesian estimators they are the central 95% posterior
credible interval for µ̂i .
22 V. Gómez-Rubio et al.
1700
●●
●
●
●●
●
● ●●●●
●
●●
●
●●
●●●
● ●
●●
●●
●
●●
●●
●
●
●● ● ●
●
●
● ●● ●
●●● ●
●●
●
● ●●
250
● ●
1600
●
● ●
●
●
●
●●
●● ● ● ● ●
●●
●● ●
●
●
● ●
●
●●
●● ●
●●
● ●● ●
●
●
● ●
●
● ●
● ● ●
1500
●
●
●
●●●
●●● ●
●
● ●
● ● ●
●
● ● ●
●
●
●● ● ●
● ●
● ●●●
●● ●● ● ●
● ● ● ● ●
●
●
● ●●
●● ● ●● ● ●
●
●
●●
●
●
●● ● ●
● ●
Estimated rank
●
●● ●
●
● ●● ●
● ●
●
● ● ● ●
1400
● ● ●
●● ●
● ●● ●
● ●●
●
● ●● ●
● ● ● ● ●
● ●●
●
● ● ●●
●● ●
●● ● ●
●●
● ● ●
● ● ● ●
150
● ●●●●● ●●●
●● ●
● ●●●
● ●
● ●
● ●
●
● ●● ●
●
●●
●●
●● ● ●
●
●
●
● ● ●
● ●
●
●● ●
● ●
●
● ● ●
● ●
●
● ●
●●
●●
● ●●
●
●
●
● ● ●
● ● ●
● ● ●● ● ●
● ●
● ● ●
● ● ●●
●
1300
● ●
●●
● ●
● ● ● ● ●●
●
●
●●●●
●
●●● ●● ● ●
● ●
● ● ●
●● ●
● ●●
● ●● ● ● ●
● ●●
●
● ●● ●
● ●● ●● ● ●
● ●● ●●
●●● ● ●
●● ●
● ●
●● ● ●●
●● ●
● ● ● ● ●●
● ●
●
● ●● ● ●
● ● ●● ● ● ●●
●
● ● ● ● ●●
●
●
●●●
●
100
●
●●
●
●
●
● ● ●●
●●
●● ●
●● ●
●● ●
●
●
●
● ●●
●● ●
● ●●●● ●● ●● ●● ●
●●● ●●
●
●
● ●
● ●● ● ● ● ●
●● ● ● ●
● ●●●
●●● ● ●
● ● ● ● ● ● ● ●● ●●
● ● ● ●● ● ● ●
●● ●
1200
● ●●●
●● ●●
● ●
●●● ● ● ● ●
●
● ● ●● ● ●● ●●
● ● ●
●●
●● ● ● ●
●● ●
● ● ●● ●● ●●
●
● ●
● ●●
● ● ●● ●● ●●
● ●
● ● ●
●●
● ●
●●
●●
● ●
● ●
●● ● ● ● ●
●
● ●
● ●
● ●●
●●
●
●●
● ●
●●
●●
●● ●●● ●
● ●●●
●● ● ●●●●
●● ● ● ●● ●
●
●●
● ● ● ●● ● ●●
● ●●● ● ● ● ●● ● ●
● ●● ●
● ● ●
● ●● ●
●
● ●● ● ●
● ● ●
● ● ●
● ●● ● ● ● ●
● ●
● ● ● ●
●● ●
●● ● ● ●● ● ● ●
● ●
● ●●
●●
● ● ● ●
● ●●
●● ●●● ● ●
● ● ●
●●
●● ●● ● ●●●● ●
●● ●●●
● ●● ● ●
● ● ●● ●
● ● ●● ●●
●● ●● ● ●
● ● ● ● ● ●
●
●
●● ●●
● ●● ● ●
●
●
● ● ●●
● ●● ● ●
●
●●
● ●
●
● ● ●
●●
●●●●● ● ●●● ● ●●
● ●
●● ● ●
● ● ● ● ● ● ● ●
●
●
● ● ● ● ●● ●
● ●● ●
●●● ●
● ●
●
●●
●
●●
● ● ● ● ● ● ● ●●
●
●●
● ●● ●
● ● ● ●●
●●● ● ● ●
●● ●
●● ●●
●●●
●● ●●●
●●● ● ● ● ● ● ●●
● ● ●● ●●●
● ●●
● ● ● ● ● ●
● ●
●
●●
● ● ●●
●
● ●● ● ●
●●●●●●●
● ●●
●● ●● ●
●
●● ● ● ●
●● ●● ●●
●●
●● ●● ●
●
50
● ● ●
●●
●
● ●● ● ● ● ●●● ● ● ●
1100
●
●● ●
● ●●● ● ●
●● ● ●
●
●
●●
●● ● ●
● ● ●
● ● ●
●●●
●●● ● ●
●●
●●
● ● ● ●●●
●● ●●
●
●● ●● ● ● ●
● ● ● ● ●
●●
● ●
●
●
●
● ● ●
● ● ●●
●● ●●
● ●● ●
● ●●
●● ●●
●●
●●●
● ●
●●
● ●●
●
●
●●
1000
0
(a) (b)
Figure 1. Posterior estimates of (a) the area rank, and (b) the target variable (income)
for a randomly chosen dataset, estimated using the area level model with unstructured
and CAR spatial random effects. 95% credible intervals (in grey) show the high uncer-
tainty about the estimates, particularly for the ranks.
• Posterior probabilities against true area rank for a single randomly chosen
data set
• Mean of posterior probabilities across datasets against true area rank,
with 95% sampling intervals (see below).
• Mean of the ranks of the posterior probabilities across datasets, with 95%
sampling intervals (see below).
The first two rows displayed in Figure 2 show the probabilities of being
among the 20% and 10% of the most deprived areas, respectively. These poste-
rior probabilities are not simple monotonic functions of the ranks and some ad-
ditional variability is observed. Dotted lines representing the uncertainty about
the replication variability have also been displayed. These sampling intervals
correspond to the variability between the different survey samples, because in
this case the Bayesian approach does not provide them for an individual data
set. As with the ranks, there is considerable uncertainty about these probabili-
ties, highlighting the difficulty of finding a reliable way to classify or discriminate
a number of areas from the rest.
As alternatives to reduce sample-to-sample variance of these probabilities,
we consider more extreme criteria, and compute the posterior probability of
being the most deprived area and the posterior probability of the average in-
come being below the threshold of 1050. In both of these plots, the sampling
variability of the probability is greatly reduced, and we can be quite confident
that the areas with non-zero probabilities are poorer than those with zero prob-
abilities. However, it is still difficult to discriminate between the areas with
non-zero probabilities due to overlapping intervals.
In order to measure the performance of the ranking methods based on com-
puting different percentile probabilities, we have computed the Operating Char-
acteristic (OC) proposed in Lin et al. (2009, see page 28 for details), which are
shown in Table 2. The Operating Characteristic is similar to computing the
probability of wrongly classifying an area as below (or above) the specified
quantile. The OC’s are then averaged over all samples and 95% sampling inter-
vals have been computed from the OC values for all the data sets. The results
indicate the OC’s for a particular quantile are quite consistent across data sets
and across models, although models that include a spatial structured random
effect generally have slightly lower OC values than those with only unstructured
random effects, showing better classification performance. There are substan-
tial differences in the OC’s for different quantiles however. In this particular
example, all the models perform better at discriminating between the richest
areas and the rest (for example, the OC is around 0.3 for classifying areas as
above or below the 80th percentile), and less good at correctly identifying the
24 V. Gómez-Rubio et al.
Table 2. Operating characteristic (with 95% sampling intervals). Low values of the OC
indicate good performance in ranking the areas.
Percentile (r)
Model 80% 50% 20% 10% (1/284)%
Area level
ui 0.28 (0.24,0.33) 0.39 (0.34,0.41) 0.47 (0.43,0.51) 0.53 (0.47,0.58) 0.81 (0.54,0.91)
vi 0.24 (0.20,0.28) 0.36 (0.32,0.39) 0.45 (0.41,0.49) 0.50 (0.45,0.55) 0.71 (0.47,0.86)
u i + vi 0.25 (0.20,0.31) 0.37 (0.33,0.40) 0.45 (0.40,0.50) 0.51 (0.46,0.57) 0.74 (0.43,0.90)
Unit level
Model 2
ui 0.36 (0.33,0.39) 0.42 (0.39,0.45) 0.48 (0.44,0.52) 0.54 (0.49,0.59) 0.82(0.61,0.93)
vi 0.33 (0.30,0.36) 0.44 (0.39,0.49) 0.50 (0.44,0.56) 0.52 (0.45,0.61) 0.61 (0.39,0.82)
u i + vi 0.30 (0.24,0.39) 0.41 (0.38,0.44) 0.47 (0.42,0.51) 0.52 (0.45,0.58) 0.72 (0.39,0.90)
Model 3
ui 0.36 (0.33,0.40) 0.42 (0.39,0.46) 0.49 (0.45,0.53) 0.55 (0.50,0.60) 0.82 (0.61,0.93)
vi 0.28 (0.24,0.32) 0.41 (0.38,0.45) 0.47 (0.42,0.52) 0.52 (0.45,0.59) 0.69 (0.37,0.88)
u i + vi 0.33 (0.30,0.37) 0.43 (0.40,0.46) 0.50 (0.46,0.54) 0.55 (0.50,0.60) 0.77 (0.51,0.89)
poorest areas (for example, the OC is around 0.5 for classifying areas as above
or below the 10th or 20th percentile). This is reflected in the plots in Figure
2, which show much less uncertainty about the probabilities and ranks of the
richer areas.
square error of both the estimates and their variances are generally higher than
for the 1% sample size. For the unit level model 3, the overall picture for the
0.1% and 1% sample sizes is similar, however, with the Bayesian estimates out-
performing the synthetic estimates on most criteria, and inclusion of spatial
random effects leading to improved small area estimates but over-precise vari-
ance estimates. For the Bayesian area level model, the reduction in sample size
has a particularly detrimental impact on the mean square error and coverage
rates of the small area estimates. This probably reflects the fact that the area
level model considered here uses the design variance as the (fixed) estimate of
the sampling variance in each area, and this is likely to be a poor estimate when
the sample size in each area is small. Specifying a hierarchical model to smooth
the variances, as in unit level model 3, could be used to address this problem.
26 V. Gómez-Rubio et al.
Regarding the convergence of the models, it seems to be better for unit level
model 3 than the other Bayesian models, probably because of the hierarchi-
cal structure on the area level variances. In all cases, the model with both
random effects ui and vi showed that it is difficult to disentangle spatial and
non-spatial variation when data are sparse because of poor convergence and
possible confounding between the two effects; a phenomenon documented in
disease mapping (Best et al., 2005).
7. Discussion
In this paper we have described different Bayesian area and unit level models
for the estimation of variables in small areas by combining information from
survey data and other sources. We have studied the importance of taking into
account non-spatial and spatially correlated area level variation, and we have
found that by including random effects to model both these sources of variation
the small area estimates can be improved. These improvements are somewhat
offset by a tendency for spatial models to under-estimate the variances of the
estimates, however. Despite this, we have shown that it becomes particularly
important to model spatial dependence when some areas have no direct survey
estimates, as information from nearby areas can be used to improve prediction
in the off-sample areas.
When comparing area versus unit level models, the models performed sim-
ilarly when using a 1% survey sample. However, when a 0.1% sample was
employed, area level models had a smaller bias but were worse than unit level
models in terms of MRRMSE. We believe that this is due to the fact that the
direct estimators on which area level models rely are unbiased by design and
have a wide design variance in this case and are not very reliable. Smoothing
the within-area variances, as we did for the Bayesian unit level models, would
help to address this problem.
Regarding unit level models, we considered three ways of modelling the
within-area variance. Clearly, allowing for a different within-area variance for
each area improves the fitting. When there are sufficient data to estimate the
within area variance with accuracy there is not much difference between Models
2 and 3. However, the hierarchical structure on the area level variances as in
Model 3 leads to better convergence of the MCMC simulations. In a more
general framework, a hierarchical structure based on covariates (for example,
Bayesian Statistics for Small Area Estimation 29
linear regression) could be employed to model the different area level variances
(see, for example, Gelman, 2006). Hence, we have shown the importance of
borrowing strength across areas when data are sparse not only to estimate the
area level means but also the area level variances.
In this work we have only considered models with a Normal response. Gen-
eralised Linear Mixed Models can be used to deal with non-Normal variables
and the ideas presented here can still be applied. However, combining individ-
ual and aggregated data is not so straightforward because in these models the
response and the explanatory covariates are not linked linearly any more and
so care is required to specify an appropriate aggregate form of the individual-
level model. For example, as shown by Jackson et al. (2006), it is possible to
synthesise different sources of data, with different levels of aggregation when an
appropriately specified model is used. Otherwise, a bias in the estimation of
the coefficients of the covariates is introduced, which may bias the small area
estimates of the target variable as well. We intend to explore these ideas in the
future and tackle their application to the estimation of area level counts and
rates, such as the number of persons per household and rate of unemployment.
Areas can be classified to help inform policy issues by exploiting the results
provided by Bayesian inference. We have considered different approaches to
the ranking of areas. Accounting for the uncertainty of the estimates is crucial
because when areas tend to be similar it will be difficult to separate low-ranked
areas from the rest. Alternatively, the probability of being among the q% lowest
ranked areas can be used instead, for some suitable quantile, q. We have shown
that choosing more extreme quantiles (e.g. the lowest ranked area rather than
the bottom 10% or 20%) reduces uncertainty about the ranks due to sampling
variation. However, it is still difficult to confidently identify all but the most
extreme ranked areas.
When some areas are not included in the survey, it is still possible to pro-
vide estimates for those areas by relying on the fitted Bayesian models (using
in-sample areas) and their spatial correlation to off-sample areas. Area level
covariates are still required in all areas to compute the small area estimates. As
expected, these estimates are less accurate than in the case with survey data in
all areas but, despite the loss of performance, the results are still reasonable and
have lower bias and better coverage than traditional synthetic estimates. When
data are very sparse, spatial random effects can be incorporated at a regional
level, so that larger-scale spatial patterns are modelled. This can help to cope
with large amounts of areas with no direct observations and provide reliable
results.
30 V. Gómez-Rubio et al.
8. Acknowledgements
We would like to thank ONS for providing access to the Swedish data, which
were originally provided by Statistics Sweden. This work was carried out as part
of the Imperial College BIAS node of the ESRC National Centre for Research
Methods (grant numbers RES-576-25-5003 and RES-576-25-0015). We would
also want to thank the associate editor and two anonymous referees whose
comments have helped to improve the quality of this paper.
References
Arora, V. and P. Lahiri (1997). On the superiority of the Bayesian method over
the BLUP in Small Area estimation problems. Statistica Sinica 7, 1053–1063.
Besag, J., J. C. York, and A. Mollié (1991). Bayesian image restoration, with
two applications in spatial statistics. Annals of the Institute of Statistical
Mathematics 43, 1–59.
Cochran, W. G. (1977). Sampling Techniques (3rd ed.). John Wiley & Sons,
Inc., New York.
Conlon, E. M. and T. A. Louis (1999). Addressing multiple goals in evaluat-
ing region-specific risk using Bayesian methods. In A. Lawson, A. Biggeri,
D. Böhning, E. Lesaffre, J.-F. Viel, and R. Bertollini (Eds.), Disease Mapping
and Risk Assessment for Public Health, pp. 31–47. Wiley.
Datta, G. S. and M. Ghosh (1991). Bayesian prediction in linear models: Appli-
cations to small area estimation. The Annals of Statistics 19 (4), 1748–1770.
Diggle, P., R. Moyeed, B. Rowlingson, and M. Thomson (2002). Childhood
malaria in the Gambia: a case-stuy in model-based geostatistics. Applied
Statistics 51, 493–506.
Diggle, P., J. Tawn, and R. Moyeed (1998). Model-based geostatistics. Applied
Statistics 47, 299–350.
Eberley, L. E. and B. P. Carlin (2000). Identifiability and convergence issues for
Markov Chain Monte Carlo fitting of spatial models. Statistics in Medicine 19,
2279–2294.
EURAREA Consortium (2004). Project reference volume. Technical report,
EURAREA Consortium.
Fay, R. E. and R. A. Herriot (1979). Estimates of income for small places:
An application of James-Stein procedures to census data. Journal of the
American Statistical Association 74 (366), 269–277.
Gelman, A. (2006). Prior distributions for variance parameters in hierarchical
models (Comment on article by Browne and Draper). Bayesian Analysis 31,
515 – 533.
Gelman, A., J. B. Carlin, H. S. Stern, and D. B. Rubin (1995). Bayesian Data
Analysis. Chapman & Hall/CRC, Boca Raton, Florida.
Ghosh, M. (1992). Constrained Bayes estimation with applications. Journal of
the American Statistical Association 87 (418), 533–540.
Ghosh, M., K. Natarajan, T. W. F. Stroud, and B. P. Carlin (1998). Generalized
linear models for small-area estimation. Journal of the American Statistical
Association 93 (441), 273–282.
Gilks, W. R., S. Richardson, and D. J. Spiegelhalter (Eds.) (1995). Markov
Chain Monter Carlo in Practice. Chapman & Hall/CRC, Boca Raton,
Florida.
32 V. Gómez-Rubio et al.
Heady, P., P. Clarke, and others (2003). Small Area Estimation Project Report.
Technical report, Office for National Statistics, United Kingdom.
Jiang, J. and P. Lahiri (2006). Mixed model prediction and small area estima-
tion. Test 15 (1), 1–96.
Let us assume that we have an area level model and that we have data from
the first l areas, i.e., we have the values of direct estimates and their sampling
variances for these areas. In this context, s = {1, . . . , l} and s = {l + 1, . . . , m}.
The full conditionals to be used in Gibbs Sampling for this model are:
(X V −1 X)−1 X V −1 (Yˆ − v), (X V −1 X)−1
T T T
Π(α, β| . . .) = N
−1 ˆ −1
Π(vi |...) ∝ exp{ (Y i − α − βX i − vi )2 + 2 ni (vi − v i )2 }, i ∈ s
σ̂i2 σv
−1
Π(vj |...) ∝ exp{ nj (vj − v j )2 }, j ∈ s
σv2
where ni and nj represent the number of neighbours of areas i and j, respec-
tively.
n 1X
Π(σv2 |...) = Ga−1 (ε + ,ε + (vk − vl )2 )
2 2
k∼l
Note that the full conditional for each vi depends directly on observed data
and its neighbours, whilst vj only depends on its neighbours through v j and
not directly on the observed data. Furthermore, α and β are only informed by
the data from the areas included in the survey, as we would expect.
36 V. Gómez-Rubio et al.
Mean of the ranks of the posterior prob.
Prob. in poorest 20% of areas Prob. in poorest 20% of areas
(in poorest 20% of areas)
1.0
1.0
●
●●
●●
●
250
● ● ●●
●●
● ●●
● ● ●●
● ●●
●● ●●
●
0.8
0.8
● ● ●
● ● ● ●
● ●
●
●
●
●
● ● ● ●
● ●●●● ●
●●●
200
●
● ● ● ●●●● ●
● ●
●●
● ●●●
● ● ●
● ● ●●●●
●●●●●
Probability
● ● ● ● ●
●● ●
●● ●
0.6
0.6
● ● ●●
● ● ●●● ●●
●● ● ● ● ● ●● ●
● ● ● ● ● ● ●
● ●
●●
Rank
● ● ● ●●
● ● ●●● ● ●
150
● ● ●
● ● ● ● ●●●
●
● ● ●
●●●●
●●
● ●
● ● ●
●● ● ● ●
●●
●
● ● ● ● ●● ●
●● ●●● ●
●● ● ● ●
●
● ● ● ● ●●● ● ● ●
● ●
●●●
● ● ●
●● ● ●
● ●
● ● ● ●●●●
● ●●●
● ●●
● ● ● ● ● ● ● ●● ● ●●
0.4
0.4
● ● ●● ●
●
●
●
● ● ● ●● ● ●●●
●●
● ●● ● ● ●
● ● ● ● ● ●
● ●●
100
● ●
● ● ●● ● ● ●●●●● ●● ● ●●● ●
● ● ●●● ●●●
● ● ● ● ● ●●
● ● ●● ●●●
●● ●●●
● ● ●
● ●
● ●
●
●
●●● ● ●● ● ● ●● ●● ● ●
● ● ● ● ● ● ● ●● ● ●
● ● ● ● ● ● ●●
● ● ● ●● ● ● ● ● ●● ●
● ● ● ● ● ● ● ●● ● ●●● ●●
● ● ●
● ● ● ● ●● ● ● ● ●● ● ●
● ● ●
● ● ● ●● ● ●● ●● ●
●
●●● ●●● ● ● ●●
● ●●
0.2
0.2
● ● ● ●
● ● ● ● ●
● ● ●● ●
●● ●
● ● ●● ●● ●
●
50
● ● ● ● ● ● ●
● ● ● ● ● ● ● ●● ● ● ●
● ● ● ● ● ● ● ●
● ● ● ●● ● ● ●●
● ● ● ● ● ● ● ● ● ● ●● ●
● ●●●●●
●
● ● ● ● ● ●● ● ●
●
● ●
●● ●● ● ● ● ● ● ● ●
● ● ●● ● ● ● ● ●
●●
●
●
● ● ● ● ● ●
● ● ●● ● ●●
● ●● ● ●●
●● ●
●
● ● ● ●● ● ●
● ● ●● ●●● ● ●
●
●● ●
●
● ● ● ●
● ● ● ●
● ●
● ● ● ●● ● ●
● ●
●
● ●
●● ●● ●
●●
● ● ● ● ● ●
● ● ●● ●●● ● ●● ● ● ● ● ● ● ●● ●
● ●●
● ● ● ● ● ●●●● ●● ● ●
0.0
0.0
● ● ● ●● ● ●●●● ● ●
●
● ●●●● ● ●● ● ● ●●● ● ● ● ● ● ●●
●●
●
● ●
● ●● ●●●● ● ●●
●●●●●● ● ● ●●●●● ● ●● ●
0
● ●●●●●●● ●●
●●●●●●●●●
●●
●●●●
● ●
●●●
●●●
●●
●●
●●
●●
●●
●●
●●
●●
●●
●●
●●
●●
● ● ●
●●●●●
●●●●
●●
● ●
●●
●●
●●
●●
●●
●●
●●
●●
●●
●●
●●
●●
●●
0 50 100 150 200 250 0 50 100 150 200 250 0 50 100 150 200 250
1.0
●
●●
●●
●
Mean of the posterior probabilities
●
●●
●●
●
●
●●
●●
●
●
●●
●●
●
●●
250
●●
●
●●
● ●
●
●
●
●●
●
●
●●
●
●
●
●
0.8
0.8
●
●
●
●
●
●
● ●
●
●
●
● ●
●
200
●
● ●● ● ●
●
●
● ●
●
●
●
●
●
●
Probability
●
●
0.6
0.6
● ●●● ●
●
● ● ●●
● ● ● ● ●● ●●
● ●●
●
●●●● ● ●
Rank
● ● ● ●● ● ●
● ●● ●
150
●
● ●● ●●● ●●●●
●
● ●● ● ● ●●
●●
● ● ● ●● ●
● ● ●● ●●● ●
● ● ● ● ●●●●
● ●●
● ● ●
●
● ●●
● ●●●●●●●
● ● ● ●
●●●● ● ●
●
●● ●●●●● ●
0.4
0.4
● ●●● ● ●●
● ● ● ●●●●
● ●●● ●● ●
●● ●
100
●●● ● ●
●
● ● ● ● ●● ●● ●● ● ●●
● ●●● ●●●
● ● ● ● ●●● ●
● ● ● ● ● ●● ● ● ●● ●
● ● ● ●●
● ● ● ● ●●● ●
● ●● ● ● ● ● ●● ● ●
● ● ● ● ●●●●●●
●●
● ● ● ● ● ●● ● ●
●● ● ●
● ● ● ●●
●●
●● ● ● ● ● ●
●
● ●
●●●●● ●
● ●
0.2
●
0.2
● ● ● ● ●● ●● ●●
● ● ● ●
● ●
● ● ●●●●
50
● ● ● ●
●● ●
●● ● ● ● ●●● ● ● ● ● ● ●●
● ● ● ● ● ● ● ●●●
●
●
● ● ●
● ● ● ●● ●
●● ● ● ● ● ● ● ●● ● ● ● ●
●●● ●
● ● ● ● ● ● ● ● ● ● ● ● ● ● ● ● ● ●
●●
● ● ●
●
● ● ● ● ● ● ● ●●
● ● ● ●● ●
● ● ● ● ● ● ●● ● ● ● ●
● ● ● ●● ● ●● ●
● ●● ● ●
●
●●●
● ●●
●
● ● ● ● ● ●● ● ●
● ●● ● ●
● ●
●● ● ●● ● ● ●
● ● ● ● ● ● ● ● ● ● ● ● ●● ● ●
● ● ● ● ● ● ● ●● ● ●● ●
● ●● ● ● ●●
●
●
● ●● ●● ●● ● ● ● ● ● ●● ●● ● ●● ●
● ●●● ●● ●● ● ● ● ●● ● ● ●
●●●
●●
● ● ●
0.0
0.0
0
● ● ● ●●●● ● ●●● ● ●
●●●●●● ●
●●
●●●●●●●
●●
●●
●●
● ●●
●●●●
●●
●●●●
● ●
●●
●●
●●
●●
●●
●●
●●
●●
●●
●●
●●
●●
●●
●●
●●
●●
● ●●●
●●
●●●●●
● ●●
●●● ●● ●●
● ●
●●
●●
●●●
● ●
●●
●●
●●
●●
●●
●●
●●
●●
●●
●●
●●
●●
●●
●●
●●
●●
0 50 100 150 200 250 0 50 100 150 200 250 0 50 100 150 200 250
1.0
●●
●
●
●●
Mean of the posterior probabilities
●●
●
●
●●
●●
●
●
●●
●●
●
●
●●
●●
●
250 ●
●
●
●
●
●
●
●
●
●●
●●
●
●
●●
●●
●
●
●●
●
0.8
0.8
●
●●
●●
●
●
●●
●
●
●●
●
●
●●
●●
●
●
●●
●
200
●
●●
●●
●
●
●●
●●
●
●
●●
●●
●
●
●●
●
Probability
●
●●
●
●
0.6
0.6
●●
●
●
●●
●●
●
●
●●
●
Rank
●
●●
150
●●
●
●
●●
●●
●
●●
●
●●
●
●
●●
●
●
●●
●●
●
●●
●
●●
0.4
0.4
●
●
●
●
●
●
●
●
●
100
●●
●●
●●
●●
●●
●●
● ● ● ●●
●●● ● ● ●● ●●
● ● ●
● ● ●●● ● ●
●
● ●●● ● ●●●
● ●●
●●● ●
●
●●
●●● ●●●
●● ● ●
●●
●●●●
●●●● ●● ● ● ● ●●
●
●● ● ● ● ●
●
● ●● ●
● ● ●
0.2
0.2
●
● ●
●
●●●
50
●
●●
●
●●
● ● ●●
●●●
●●
● ●●●
●●
● ●
●
●●
●
●
●● ● ●
● ●●
● ● ●● ● ● ● ●
●●●● ●●
0.0
0.0
●● ●● ●● ● ● ●●
●● ●●● ●● ●● ●● ● ●●●
●●●
●●●●●●● ● ●● ●
● ●●●● ● ● ●● ●● ●● ●●●●● ●●●●
●●
●●●●●● ●●
● ●● ● ●
0
●●●● ● ●● ●●●●
●●●●● ● ●
●●
● ●●●
●●●
●●
●●
●●●
●●
●●
●●
●●
●●
●●
●●
●●
●● ●
●●●
●●
●●
●●
●●
●●
●●
● ●
●●
●●
●●
●●
●●
●●
●●
●●
●●
●●
●●
●●
●●
●●
●●
●●
●●
●●
●●
●●
●●
●●
●●
●●
●●
●●
●●
●●
●●
●●
●●
●●
●●
●●
●●
●●
●●
●●
●●
●●
●●
●●
●●
●●
●●
●●
●●
●●
●●
●●
●●
●●
●●
●●
●●
●●
●●
●●
●●
●●
●●
●●
●●
●●
●●
●●
●●
●●
●●
●●
●●
●●
●●
●●
●●
●●
●●
●●
●●
●●
●●
●●
●●
●●
●● ●
●● ●● ● ●●
●●
● ● ●● ●
● ●●
●●
●●
●●●●
●● ●
●● ●●●
●● ●
●●
●●
●●
●●
●●
●●
●●
●●
●●
●●
●●
●●
●●
●●
●●
●●
●●
●●
●●
●●
●●
●●
●●
●●
●●●
●●
●●
●●
●●
●●
●●
●●
●●
●●
●●
●●
●●
●●
●●
●●
●●
●●
●●
●●
●●
●●
●●
●●
●●
●●
●●
●●
●●
●●
●●
●●
●●
●●
●●
●●
●●
●●
●●
●●
●●
●●
●●
●●
●●
●●
●●
●●
●●
●●
●●
●●
●●
●●
●●
●●
●●
●●
●●
●●
●●
●●
●●
●●
●●
●●
●●
●●
●●
●●
0 50 100 150 200 250 0 50 100 150 200 250 0 50 100 150 200 250
1.0
●
●●
●●
●
Mean of the posterior probabilities
●
●●
●●
●
●
●●
●●
●
●
●●
●●
●
●
●●
250
●●
●
●
●●
●●
●
●
●●
●●
●
●
●●
●●
●
●●
0.8
0.8
●
●●
●●
●
●
●●
●
●
●●
●
●
●●
●●
●
●
200
●
●
●●
●
●
●●
●●
●
●●
●●
●●
●●
Probability
●●
●
0.6
0.6
●
●
●
●
●
●
●
●
●
Rank
●●
150
●●
●●
●●
●●
● ●●
●●
●●
●●
●●
●●
●
0.4
0.4
● ● ● ●
●
●● ●● ●● ●
●●
●●●●●●● ● ● ●● ●
● ●●●● ●● ●● ●●● ●● ●●
●●●●
●● ●●
●● ● ●
100
●●●
● ●● ●● ● ●●●●●
●● ● ●
●●●●
● ●
● ●●●● ● ●
● ● ●● ●
●
● ●● ● ● ●
●●
● ●●● ●
● ● ●●
●
●●● ● ●●
●● ●●
● ●●●●
● ●●●
● ●●
0.2
0.2
● ●
● ●●
●●●
50
● ● ●●●●
●
● ● ● ● ● ●●●●
● ●● ● ●●●
● ● ●●
●●
● ● ●● ●
● ● ●
●● ●●●
● ● ● ● ● ●
● ●
● ● ●
● ●●● ● ●●● ● ● ● ● ●● ●
● ●● ● ● ● ● ●●
●
● ● ●●● ● ●● ● ●● ● ●● ●
● ● ●●● ●● ● ●●
●● ●● ● ● ●●
● ● ● ● ● ● ● ● ●● ● ●● ● ● ●
0.0
0.0
● ● ●● ●
●
● ● ●●●● ●●● ●● ●● ● ● ● ● ●●●● ●● ● ● ● ● ● ●●●
●● ● ● ●● ●●●● ● ●●●●●● ● ● ●●●
●●●
● ●● ● ●● ● ●● ●
●●● ● ● ●● ●●
● ●● ● ● ● ● ●●●●
● ●
●●●● ●● ● ●
●● ● ● ●●● ● ●●●● ●
0
●● ●● ● ●●●●●●●
● ●●●●● ●
● ●●●
● ●
●●
●●
●●●●
●●
●●
●●
●●●
●●●
●●
●●
●●
●●●
●●
●●
●●
●●
●●
●●
●●
●●
●●●
● ●
●●
●●
●●
●●
●●
●●
●●
●●
●●
●●
●●
●●●
● ●●●
●●
●●
●●
●●
●●
●●
●●
●●●
●●
●●
●●
●●
●●●
●●
●●
●●
●●
●●
●●
●●
●●●
●●
●●
●●
●●
●●
●●
●●
●●
●●
●●
●●
●●
●●
●●
●●
●●
●●
●●
●●
●●
●●
● ● ● ●●
● ●
●●
●●●●
●●●●● ●
●●●
●● ●
●●●● ●
●●●
●●
●●
●●
●●●
●●
●●
●●●●●●
●●
●●
●●●●
●●
●●
●●●
● ●
●●
●●
●●
●●●
●●●
●●
●●
●●
●●
●●
●●●
●●
●●
●●
●●
●●
●●
●●
●●
●●
●●
●●
●●
●●
●●
●●
●●
●●
●●
●●
●●
●●
●●
●●
●●
●●
●●
●●
●●
●●
●●
0 50 100 150 200 250 0 50 100 150 200 250 0 50 100 150 200 250
Figure 2. Ranking of the areas according to four different criteria. Left column shows
the exceedence or percentile probabilities for each area for a randomly selected data
set; middle column shows the mean of these probabilities across all datasets; right col-
umn shows mean of the rank of these probabilities across datasets. Grey dashed lines
are 95% ’sampling intervals’ and show the uncertainty about the replication variability
for the probabilities and ranks across the survey samples.