Geostatistics - Noel Cressie
Geostatistics - Noel Cressie
Geostatistics
a
Noel Cressie
a
Department of Statistics , Iowa State University , Ames , IA , 50011 , USA
Published online: 27 Feb 2012.
To cite this article: Noel Cressie (1989) Geostatistics, The American Statistician, 43:4, 197-202
Taylor & Francis makes every effort to ensure the accuracy of all the information (the “Content”) contained in the
publications on our platform. However, Taylor & Francis, our agents, and our licensors make no representations or
warranties whatsoever as to the accuracy, completeness, or suitability for any purpose of the Content. Any opinions
and views expressed in this publication are the opinions and views of the authors, and are not the views of or endorsed
by Taylor & Francis. The accuracy of the Content should not be relied upon and should be independently verified with
primary sources of information. Taylor and Francis shall not be liable for any losses, actions, claims, proceedings,
demands, costs, expenses, damages, and other liabilities whatsoever or howsoever caused arising directly or indirectly
in connection with, in relation to or arising out of the use of the Content.
This article may be used for research, teaching, and private study purposes. Any substantial or systematic
reproduction, redistribution, reselling, loan, sub-licensing, systematic supply, or distribution in any form to anyone is
expressly forbidden. Terms & Conditions of access and use can be found at http://www.tandfonline.com/page/terms-
and-conditions
Geostatistics
NOEL CRESSIE”
Most data have a space and time label associated with them; is dependence between measurements at different locations.
data that are close together are usually more correlated than The models need to be more flexible than their temporal
those that are far apart. Prediction (or forecasting) of a counterparts; for example, it is not reasonable to assume
process at a particular label where there is no datum, from that spatial locations of data occur regularly, and “past,”
observed nearby data, is the subject of this article. One “present,” and “future” make way for (possibly aniso-
approach, known as geostatistics, is featured, from which tropic) dependence in a multitude of directions. Cliff and
linear methods of spatial prediction (kriging) will be con- Ord (1981), Ripley (1981), and Upton and Fingleton (1985)
sidered. Brief reference is made to other lineadnonlinear, have written books concerned with the statistical analysis
stochasticldeterministic predictors. The (linear) geostatis- of spatial data.
tical method is applied to piezometric-head data around a A scientific problem that is both temporal and spatial in
potential nuclear-waste repository site. nature is the study of the effects of atmospheric pollution,
and in particular acid rain (e.g., see Bilonick 1985; Cressie,
Downloaded by [Michigan State University] at 19:12 20 January 2015
0 1989 American Statistical Association The American Statistician, November 1989, Vol. 43, No. 4 197
typically the spatial index s is two- or three-dimensional 2y(h) = Ihl ( - m < h < m), but cov(W(t), W ( U ) )=
(i.e., d = 2 or 3). The quantity 2y(.), which is a function min(t, u), which is not a function of It - uI. Thus the class
only of the dzfference between the spatial locations s and of intrinsically stationary processes strictly contains the class
s + h, has been called the variograrn by Matheron (1963), of second-order stationary processes.
although earlier appearances in the scientific literature can Now I turn to finding an estimator of the variogram,
be found. It has been called a structurefunction by Yaglom which also serves as a measure of spatial dependence among
(1957) in probability and Gandin (1963) in meteorology, the data {Z(s,): i = 1 , . . ., n}. Suppose these are obser-
and a mean-squared difference by Jowett (1952) in time vations on an intrinsically stationary process [i.e., a process
series. Kolmogorov (1941a) introduced it in physics to study that satisfies (1) and (3)], taken at the n spatial locations
the local structure of turbulence in a fluid. Nevertheless, it +
{s;: i = 1 , . . ., n}. Because of (3), var(Z(s h) - Z ( s ) )
has been Matheron’s mining terminology that has persisted. = E ( Z ( s + h) - Z(s))*; hence the method-of-moments
The variogram must satisfy the conditional negative semi- estimator of the variogram 2y(h) is
definiteness condition, Xf= Xjk= a,aj2y(si - sj) 5 0, for
any finite number of spatial locations {s;: i = 1, . . ., k}, 2&h) =
Wh)
C ( Z ( S i ) -z(sj))*/IN(h)I, h E Rd, ( 5 )
and real numbers {a;: i = 1 , . . ., k} satisfying Xf=, ai =
0. When 2y(h) = 2f(llhll), the variogram is said to be where the average in (5) is taken over N(h) = {(s;, sj): s;
isotropic. Various parametric models were presented by - sj = h} and (N(h)l is the number of distinct elements
Journel and Huijbregts (1978, sec. II1.B). in N(h). For irregularly spaced data, N(h) is usually mod-
Downloaded by [Michigan State University] at 19:12 20 January 2015
Replacing (1) with the stronger assumption ified to {&, sj):si - sj € T(h)}, where T(h) is a tolerance
region of Rd surrounding h (see Sec. 4). Other estimators,
cov(Z(s + h), Z ( s ) ) = C(h) for all s, s + h € D, (2) more robust than ( 5 ) , are given in Cressie and Hawkins
and specifying the mean function to be constant, (1980). Parametric models can be fit to the estimator (5)
by various means; as a compromise between efficiency and
E ( Z ( s ) ) = p for all s E D, (3) simplicity, Cressie (1985) proposes minimizing a weighted
defines the class of second-order (or wide-sense) stationary sum of squares,
processes in D, with covariance function C(.). Time series
analysts usually assume (2) and work with the quantity p( * )
= C(.)/C(O).Conditions ( 1 ) and (3) define the class of
intrinsically stationary processes, which is now shown to with respect to variogram model parameters 8. The sequence
contain the class of second-order stationary processes. h(l), . . ., h(K) denote the “lags” at which an estimator
Assuming only (2), ( 5 ) was obtained, and that satisfy conditions such as those
given by Journel and Huijbregts (1978, p. 194, sec. 111.42).
y(h) = C(0) - C(h); (4) Zimmerman and Zimmerman (1989) summarize and com-
that is, the sernivariogram (half of the variogram) is related pare several methods of variogram-parameter estimation.
very simply to the covariance function. An example of a They find that no method dominates, but their simulation
process for which 2y( .) exists but C( .) does not is a one- results indicate that the weighted least squares approach
dimensional standard Wiener process {W(t):t 2 0). Here usually performs well and never does poorly.
I I I‘
Figure 7. Location and Levels of Piezometric-Head Data in the Wolfcamp Aquifer: 9, Location of Amarillo, Texas (Potter County).
To derive the kriging equations in Section 3, I assume where A = ( A l , A,, . . ., A,,, m ) ‘ , y = (y(s, - so), . . .,
that the variogram is known. In practice the estimation and y(s, - so), I)’, and
fitting stage of the spatial analysis has to be carried out first,
and this I do for the example in Section 4. Discussion of I’= y ( s , - s j ) , i = 1, . . . , n , j = 1, . . . , n
the effect of this initial variogram fitting is given in Section = 1, i =n + 1, j = 1 , . . . , n
5.
= 0, i = n + l , j = n + l ;
3. KRIGING
In Matheron’s geostatistics and Gandin’s objective anal-
+ +
r is a symmetric (n 1) X (n 1) matrix. The minimized
root mean squared prediction error (sometimes called kng-
ysis, the variogram is used to define coefficients in an op- ing standard error) is given by
timal linear predictor. Suppose it is desired to predict Z(%)
at some unsampled spatial location s,using a linear function
of the data
n
Matheron (1963) called this spatial prediction method
kriging, after D. G. Krige (a South African mining engineer
who, in the 1950s, developed empirical methods for deter-
which is unbiased and minimizes the mean squared predic-
mining true ore grade distributions from distributions based
tion error E(Z(s,,) - 2 ( ~ ~Then) ) ~a .straightforward min-
on sampled ore grades), whereas Gandin (1963) called it
imization using Lagrange multipliers to ensure unbiasedness
optimal interpolation. In fact, derivation of an optimal linear
+
yields optimal A’s that satisfy an ( n 1)-dimensional equa-
predictor (with known mean p ) can be found earlier in the
tion depending on 2y(si - s j ) , i, j = 0 , l , . . ., n:
literature in works of Wold (1938), Kolmogorov (1941b),
rh = y, (7) and Wiener (1949); see Cressie (1989b) for further historical
are sited at the spatial locations, and their heights are equal to the The goal in analyzing this (purely) spatial data set is to
corresponding piezometric heads (in feet above sea level). draw a map of a predicted surface, based on the (irregularly
located) 85 data available. An advantage of using a sto-
observations. The kriging predictor given by (6) and (7)
chastic method of prediction (kriging) is that a map of root
assumes that the process Z contains no measurement error;
mean squared prediction error can also be drawn, quanti-
Cressie (1988) has details on how to predict noiseless ver-
fying the uncertainty in the predicted piezometric-head sur-
sions of Z.
face.
In the notation of Section 3, it is desired to use the pre-
4. FLOW OF GROUNDWATER FROM A
dictor 2(s,) defined by (6), along with a&,) defined by
NUCLEAR-WASTE SITE
(8), as s, varies over the extent of the Wolfcamp Aquifer.
Several years ago there were three potential high-level The kriging equations (7) are straightforward to solve once
nuclear-waste sites proposed in Nevada, Texas, and the state the variogram (1) is known.
of Washington. This section contains a geostatistical anal- Assume that the data { Z ( s , ) , . . ., Z(sss)} given in Table
ysis of groundwater flow in the surrounding regions of one 1 are observed without measurement error, and that they
of those sites. are a sampling from an intrinsically stationary stochastic
The chosen site will eventually contain more than 68,000 process. [In fact, Harper and Furr (1986) did not make
high-level waste canisters placed about 30 feet apart in holes Assumption (3), assuming instead linear deterministic trend
or trenches surrounded by salt, covering an area of about (see Sec. 5 for further discussion).] Estimation of the var-
two square miles, and buried deep in the ground. Leaks iogram defined by (1) allows (7) to be solved and hence
could occur, however, or the radioactive heat could cause maps based on (6) and (8) to be drawn.
the tiny quantities of water in the salt to migrate toward the It is clear from the data shown in Figure 2 that the behavior
heat until eventually each canister is surrounded by water of the process in the NE-SW direction is different from that
(about six gallons). The chemical reaction of salt and water in the NW-SE direction. For this reason variogram esti-
would create hydrochloric acid that could slowly corrode mators based on ( 5 ) were calculated in these two directions.
the canisters. Figure 3a shows 2T(h) for h E {h(l)e, . . ., h(24)e);
Using geostatistics, I address the question Where would e is a vector of length llell = 5 miles, and direction d 4 ;
radionuclide contamination flow for the site in Deaf Smith h(k) = k - Yz for k = 1, . . ., 24. Figure 3b shows a
County, Texas? (As a matter of interest, a decision was similar plot, but e has direction 37~14.Since the data are
0 I 10 I5 20 25 0 I 10 I5 20 21
h h
Figure 3. Experimental Variogram in the (a) NE-SW and (b) NW-SE Directions. The dashed line is the weighted-least-squares fit of a
power model: 2y(h; co, b, p) = 2{c0 + b ( h ( p }(h > 0). On the horizontal axis, 7 unit = 5 miles; on the vertical axis, 7 unit = 705ft2.
160.00
12O.OcI 120.00
40.00
I I l ! l I I I I I I I I 0.00 I I I I I I I I l - i o , I o . w
-135.W -95.00 -55.00 -15.00 25.00 65.00 105.00 --135.00 -95.00 -55.W -15.00 25.00 65.03 105.00
Figure 4. (a) Predicted Surface (6) and (b) Standard Error Surface (8) (in ft.) Obtained by Ioiging: x, Center of Deaf Smith County;
8, Location of Amarillo, Texas.
not on a regular grid, tolerance regions for N(h) in (5) have 5. CONCLUDING REMARKS
Downloaded by [Michigan State University] at 19:12 20 January 2015