GS+ User's Guide COKRIGING
GS+ User's Guide COKRIGING
Version 7
GS
D E S I G N
S O F T W A R E
GS
GeoStatistics for
the Environmental Sciences
Version 7
Copyright
Copyright 1989-2005 Gamma Design Software, LLC. All Rights Reserved
ISBN number 0-9707410-0-6
Information in this document is subject to change without notice and does not represent a commitment on the part of Gamma Design Software. The software described
is provided under a license agreement and may be used or copied only as specified
in the agreement. No part of this document may be reproduced in any manner
whatsoever without the express written permission of Gamma Design Software.
Gamma Design Software
P.O. Box 201
Plainwell, Michigan 49080
U.S.A.
Citation
The appropriate citation for this documnent is
Gamma Design Software. 2004. GS+: Geostatistics for the Environmental Sciences.
Gamma Design Software, Plainwell, Michigan USA.
Trademarks
Microsoft and Windows are trademarks or registered trademarks of Microsoft Corporation. Surfer is a registered trademark of Golden Software, Inc. ArcView and
Arc/Info are registered trademarks of ESRI, Inc. Other brands and their products
are trademarks or registered trademarks of their respective holders and should be
noted as such. GS+ is a trademark of Gamma Design Software.
February 2004
October 2004 rev
Table of Contents
Table of Contents
Chapter 1
Introduction
Overview ..............................................................................................1
System Requirements ..........................................................................2
Installation ............................................................................................2
Updates ................................................................................................2
Licensing and Copy Protection ............................................................3
Single-User License Agreement ..........................................................3
Chapter 2
Getting Started
From Data to Maps: How to Proceed...................................................5
General Screen Layout ........................................................................5
Menus...................................................................................................6
User Preferences ...............................................................................13
Graph Settings ...................................................................................17
Printing Graphs ..................................................................................23
Printing Data or Text ..........................................................................24
Using Older-Version GS+ Files...........................................................24
Chapter 3
Working with Data
The Data Worksheet Window ............................................................25
Column Assignments .........................................................................28
Covariate Values Warning..................................................................29
Missing Values ...................................................................................29
Data Filter Dialog................................................................................30
Duplicate Values ................................................................................31
Chapter 4
Importing Data from External Files
File Import Dialog ...............................................................................33
Importing Text Files............................................................................34
Text Input File Formats
GS+ .............................................................................................36
GeoEas.......................................................................................37
ArcView XYZ.............................................................................38
Surfer XYZ ................................................................................39
Spreadsheet and Database Input Files..............................................40
Input File Formats (File Import Properties) ........................................41
i
Table of Contents
Viewing Files (File View Window) ..................................................... 42
Appending Data to an Existing Worksheet (Data Append Dialog).... 42
Chapter 5
Summary Statistics
Z Variate Summary............................................................................ 43
Frequency Distributions
Frequency Distribution Histograms ........................................... 45
Cumulative Frequency Distribution............................................ 47
Normal Probability Distribution .................................................. 49
X,Y Coordinates Summary ................................................................ 51
Coordinate Postings (Quantile Plots)
2-Dimensional Data ................................................................... 52
1-Dimensional Data ................................................................... 54
Defining Posting Intervals .......................................................... 55
Regression Analysis Tab................................................................... 56
Cross-Variate Regression Window............................................ 57
Regression Values..................................................................... 58
Chapter 6
Semivariance Analysis
Overview ................................................................................................... 59
The Autocorrelation Window ..................................................................... 60
Define Nonuniform Lag Class Intervals............................................. 64
Variograms Window .......................................................................... 66
Semivariance Values Window........................................................... 67
Isotropic Variogram Models............................................................... 68
The Spherical Isotropic Model ................................................... 71
The Exponential Isotropic Model ............................................... 72
The Linear Isotropic Model ........................................................ 73
The Gaussian Isotropic Model................................................... 74
Anisotropic Variogram Models .......................................................... 75
The Spherical Anisotropic Model............................................... 77
The Exponential Anisotropic Model ........................................... 78
The Linear Anisotropic Model.................................................... 78
The Gaussian Anisotropic Model............................................... 79
Anisotropic Semivariance Surface (2D Variogram Map)................... 80
Cross-Autocorrelation or Cross-Semivariance Analysis ................... 83
Chapter 7
Other Autocorrelation Measures
Standardized Variograms .................................................................. 85
Madograms........................................................................................ 86
ii
Table of Contents
Rodograms.........................................................................................87
Drift ....................................................................................................88
Correlograms......................................................................................89
Covariance Analysis...........................................................................90
General Relative Variograms .............................................................91
Pairwise Relative Variograms ............................................................92
Morans I Analysis ..............................................................................93
Fractal Analysis ..................................................................................94
Chapter 8
Variance Clouds and h-Scattergrams
Overview ............................................................................................95
Variance Clouds .................................................................................95
Variance Cloud Pairs .........................................................................99
h-Scattergrams ................................................................................100
h-Scattergram Pairs ........................................................................104
Chapter 9
Interpolation Basics
Overview ..........................................................................................105
The Interpolation Window ................................................................107
Z Estimate Boundaries .....................................................................109
Defining a Uniform Interpolation Grid ..............................................110
Defining a NonUniform Interpolation Grid ........................................112
Polygon Outlines (Interpolation Masks) ...........................................114
Polygon Outline Map .......................................................................116
Cross-Validation Analysis ................................................................117
Cross-Validation Values ..................................................................118
Chapter 10
Conditional Simulation
Overview ..........................................................................................119
Secondary Data for Simulations.......................................................121
Number of Simulations .....................................................................122
Chapter 11
Kriging and Cokriging
Kriging ..............................................................................................123
Cokriging ..........................................................................................125
Chapter 12
Inverse Distance Weighting
Inverse (IDW) and Normal (NDW) Distance Weighting ...................127
iii
Table of Contents
Chapter 13
Interpolation Output File Formats
GS+ (.krg) Format ............................................................................ 129
ArcInfo or ArcView (.asc) Format............................................... 130
Surfer (.grd) Format ...................................................................... 131
Chapter 14
Mapping
The Mapping Window ..................................................................... 133
Map Contour Intervals .................................................................... 137
3D Maps ......................................................................................... 139
Standard Deviations ............................................................... 140
Rotation ................................................................................... 141
2D Maps ......................................................................................... 142
Standard Deviations ............................................................... 142
Sample Posting ....................................................................... 143
Transects (1d Maps) ....................................................................... 144
Standard Deviation ................................................................. 144
Chapter 15
Bibliography ......................................................................................... 145
Chapter 16
Glossary ................................................................................................. 147
Index
iv
................................................................................................. 155
Chapter 1 Introduction
Chapter 1
Introduction
GS+ is a GeoStatistical Analysis program that allows you to readily measure and
illustrate spatial relationships in geo-referenced data.
GS+ analyzes spatial data for autocorrelation and then uses this information to make
optimal, statistically rigorous maps of the area sampled. The maps can be created in
GS+ or in other mapping programs or geographic information systems.
Chapter 1 Introduction
GS+ provides basic parametric statistics:
Sample means and variance;
Frequency distributions, probability distributions, and skewness and kurtosis
measures for determining departures from normality; and
Quantile plots or coordinate maps show the distribution of sample values
across the spatial domain;
Transformations for returning the data to normality; and
Regression analysis for covariates vs. primary variates.
System Requirements
PC Compatible with Pentium or faster processor
Windows NT4 / 2000 / XP or higher Operating System
A minimum of 40 MB of free hard disk space
A minimum of 64 MB RAM
Installation
To install GS+:
1) Insert the CD-ROM or download the installation file to a temporary folder
2) From the Windows Start Button, click Run
3) Type g:\Setup [use your CD-ROM drive letter for g:]
4) The Setup program will prompt you through the installation process. Follow
the instructions on the screen. The serial number for your copy of GS+ can
be found on the GS+ CD-ROM package.
Updates
Maintenance updates are available free of charge to registered users. Update files
are available by download only from http://www.gammadesign.com. The current
version of GS+ can be checked from the Help menu as described in Chapter 2.
Chapter 1 Introduction
Chapter 1 Introduction
ment constitutes a waiver of Gamma Designs rights under any federal or state law.
Limited Warranty
1. Gamma Design warrants that GS+ will conform substantially to the accompanying writ+
ten materials for a period of 1 year from the date of purchase, provided that GS is
used on computer hardware and with the operating system for which it was designed.
2. Gamma Design disclaims all other warranties, either express or implied, including implied warranties of merchantability and fitness for a particular purpose. This applies
to both the software itself and accompanying written materials. This limited warranty
gives you specific legal rights; you may have others that vary from state to state.
3. Under no circumstances shall Gamma Design be liable for any damages whatsoever
arising out of the use of or inability to use GS+, even if Gamma Design has been advised of the possibility of such damages. Such damages include but are not limited
to damages for loss of profits or revenue, loss of use of the software, loss of data, the
cost of recovering such software or data, the cost of substitute software, or claims by
third parties. In no case shall Gamma Design be liable for more than the amount of
the license fee, as set forth below. Some states do not allow the exclusion or limitation of liability for consequential or incidental damages, so this limitation may not apply to you.
User Remedies
1. Gamma Designs entire liability and your exclusive remedy shall be, at Gamma Designs discretion, either (1) refund of the purchase price or (b) replacement of the
software that does not meet Gamma Designs limited warranty. In either case software must be returned to Gamma Design with a copy of the sales receipt. This warranty is void if failure has resulted from accident, abuse, or misapplication. Any replacement will be warranted for one year.
2. The warranties and remedies set forth above are exclusive and in lieu of all others,
oral or written, express or implied. No Gamma Design Software distributor or employee is authorized to make any modification or addition to this warranty.
U.S. Government Restricted Rights
+
GS software and documentation are provided with RESTRICTED AND LIMITED RIGHTS.
Use, duplication, or disclosure by the Government is subject to restrictions as noted in subparagraph (c)(1)(ii) of The Rights in Technical Data and Computer Software clause at 52.2277013. The manufacturer is Gamma Design Software, P.O. Box 201, Plainwell, MI 49080.
General
You must fill out and return the Warranty Registration Card to be eligible for customer support
and service. If you have questions about this agreement, write to Gamma Design Software,
P.O. Box 201, Plainwell, MI 49080, U.S.A.
Chapter 2
Getting Started
From Data to Maps: How to Proceed
To make a map using GS+:
First, collect samples from known locations. The sample locations do not
need to be evenly spaced or even to lie on a grid, you simply need to know
their location in a Cartesian (x,y) coordinate system;
Second, bring the data into the GS+ Data Worksheet; you can enter the data
directly into the worksheet or import the data from a text file, spreadsheet, or
another source; often the easiest way to import data is to cut-and-paste from
the source spreadsheet or text file;
Main Menu
The Main Menu presents access to the windows that provide GS+ analyses. Below
the command menus are icons that represent short-cuts to many of these functions.
Commands and toolbar buttons that are shaded out are not selectable. For example, the print icon is not available when viewing a window without a graphic or worksheet to print. The six covariate icons for covariate functions are not available
unless a covariate has been defined in the Data Worksheet window.
The tool bar is customizable via the Edit menu or by right-clicking on the main menu.
You may add or remove shortcut icons for most analyses. It is also movable and
reconfigurable; it can be dragged to any spot on the screen with the mouse.
Open File Allows the user to load an existing parameter file. To open a text
data file, use the Import Command in the Worksheet window.
Sort sort the highlighted column; the sort alternates between ascending
and descending order. If the cursor is not in a worksheet cell the sort command will be dimmed, as it is here.
Export Data export the contents of either the active worksheet or the active
data arrays to a text file. The active data arrays are the non-missing data in
the ID, X-Coordinate, Y-Coordinate, and Z-data columns of the Data Worksheet window. If the Z data have been transformed (using a transformation
command), the transformed z values will be printed in addition to the original
z values in the Data Worksheet.
Summary Statistics display basic statistics for the primary variate (Z) and
covariate (Z2) if defined.
Variogram
Correlogram
Madogram
Covariance Analysis
Morans I Analysis
Drift
Fractal Analysis
Standardized Variogram
For each analysis you may choose to analyze the primary variate Z, the covariate
Z2, or the cross-variate Z x Z2, although cross-variate analysis is not available for
every autocorrelation measure. If Z or Z2 are undefined the menu command will be
dimmed.
10
12
Reset
The Reset command returns all user-default values on this tab to original default
values. To reset all values on all tabs, use the Global Reset command.
Missing Value Indicator
Specify the value used by GS+ on output files and when importing data files to indicate that a value is missing. Missing values are ignored during data builds and subsequent analyses. This value can be overridden by values specified in individual
dialog windows.
Show tips
Display short explanations of commands and input boxes when mouse hovers over
buttons and boxes.
Places Past Decimal
For different types of variates, allow GS+ to format values automatically or specify
directly the number of places past the decimal to report in windows and printouts.
All calculations are performed on double-precision values regardless of the values
requested here. These values can be overridden by values on specific dialog windows such as the Field Assignment Dialog of the Data Worksheet Window.
Global Reset
Sets all user-default values on all tabs to original (GS+ - defined) default values.
13
Reset
The Reset command returns all user-default values on this tab to original (GS+ - defined) default values. To reset all values on all tabs, use the Global Reset command.
Filename Extension
Default extension for the data file name specified when importing data files to the
Data Worksheet.
Default Fields
When importing data files, these values indicate which fields to assign initially to different variates. Field (column) assignments can be changed in the Data Worksheet
window.
Default Import File Type
When importing data files, this file type will be the default type.
Places Past Decimal to Show
When importing data files, report this many places past the decimal in the Data
Worksheet cells. This value is overridden when specifying places past the decimal
in the Field (Column) Assignment Dialog of the Data Worksheet window.
Automatically Rebuild Data Arrays
When new data is imported and when columns are reassigned (e.g. when the X14
Reset
The Reset command returns all user-default values on this tab only to original (GS+
- defined) default values. To reset all values on all tabs, use the Global Reset
command.
Variogram Defaults
Offset start of first lag by interval with this check box GS+ will (when
automatically defining uniform interval lag classes) shift the first lag class to
of its normal separation distance. This can provide better resolved
variograms when there are sufficient pairs of points in shorter separation distance classes. The disadvantage is that if there are few pairs of points for
the shortest distance classes when the first lag is not offset (a common prob15
Automatically make anisotropic surface map the Surface Map command on the Semivariance Window will be automatically enacted after
variograms are calculated. This will make the exploration of anisotropic relationships more automated but there is a performance penalty as creating a
surface map requires further computation.
Smoothing Factor (>0) this value provides the initial default smoothing
factor for inverse distance weighting (IDW) interpolation. This value may be
overridden from the Interpolate Window; see the Interpolate Window for further information on this parameter.
Default to Normal Distance Weighting (NDW) sets the default IDW interpolation to Normal Distance Weighting (NDW). This value may be overridden from the Interpolate Window; see the Interpolate Window for further information on this parameter.
Interpolation Defaults
Draw map after interpolating check this box to automatically draw the
map of interpolated values after interpolation.
Global Reset
Sets all user-default values on all tabs to original (GS+ defined) default values.
16
17
Lines Tab
On graphs with lines (for example variogram models) you may specify line pattern,
width, and color:
Graph Title
A title is text that appears at the top center of the graph area. To change the font of
the title press Change, which will bring up a Font Dialog Window.
Graph Legend
The legend appears on maps and quantile postings to indicate the value ranges for
different symbols or colors. Legend text appears at the top of the legend table. To
change the font of the legend title and values press Change, which will bring up a
Font Dialog Window.
Graph Footnote
A footnote is text that appears at the bottom left of the graph area. To change the
font of the footnote press Change, which will bring up a Font Dialog Window. To
have variogram model parameters printed as a footnote to variogram graphs, place
in this field the exact phrase (model results) [do not include quotes]. This notation
is placed in the field by default whenever a new variogram and model are calculated;
you will need to remove it if you do not want variogram model parameters to appear
as a graph footnote.
Apply Now
Press Apply Now to apply any changes made to the graph and keep the Graph Settings dialog window open.
Cancel
Press Cancel to exit the Graph Settings Dialog without applying any changes since
the last Apply Now command.
Exit
Press Exit to close the Graph Settings Dialog window. Any changes made since the
last Apply Now command will be applied to the graph.
18
Axis Format
Use these boxes to set the axis titles and how axis values are formatted. Decimals
refers to the number of places past the decimal to format axis values (e.g. 3.1415
has 4 places past the decimal); check exponential to format the axis value in scientific notation (e.g. 3.14E0). For 3D maps, boxes for the Z-Axis are also provided as
shown above; for x-y and bar graphs, there will be no boxes for the Z-Axis. The font
for axis titles and values can be reset with one of the Change commands.
20
Projection
3D displays a 3-dimensional map of the data. The height of the map can
be adjusted by changing the 3D Proportion; the perspective can be adjusted
with the Rotate (Mouse Action) command of the calling window
2D displays a flat, 2-dimensional map of the data
3D Proportion
for 3D projections, sets the height-to-width ratio of the map. For tall, skinny
maps set this value to 2 or higher. For flatter maps, set to less than 0.5.
This setting is not available for 2D projections. This command will not affect
the aspect ratio of the map, i.e. the x and y axes will remain proportional to
one another
Map Surface
Contour lines draws lines between contour intervals
Color bands fills the space between contour lines with different colors; colors can be specified from the Define Contour dialog window by pressing Define Contour Levels, below
Smoothing apply slight smoothing to the data to improve visualization
21
22
Printing Graphs
The Graph Print dialog window allows you to specify how you would like graphs
printed to what device or file, where on the page, and how big the image should
be:
Destination
Choose where to send the image. If Printer is checked, the graph will be sent to the
printer specified in the Printer box. If File is checked, the graph will be sent to a file
that is specified in a subsequent Select File dialog window. If Clipboard is checked,
the graph will be sent to the Windows clipboard, from which it can be transferred to
another program via the Edit-Paste command.
Printer and Page Setup
Provides access to printer settings as provided by the operating system, including
printer selection, orientation, and margins. All printers available to your Windows
system are available to GS+. This command is not available unless Printer is the
destination specified elsewhere in this dialog window.
Scale
Set the scale of the finished graph image as % of full page for the printer specified.
The Scale command is not available unless Printer is the selected destination.
Center on page
Check this box to have the graphic centered on the output device. Otherwise the
graph will be printed to the upper left corner of the margin area specified in the Settings command. This box is not available unless Printer is the selected destination.
File Format
Use the File Format list box to choose a graphics file format. File format options
include a standard Windows Metafile (.wmf) format, an Enhanced Windows Metafile
(.emf) format, a standard Bitmap (.bmp) format, a web-ready JPEG format (.jpg),
and a web-ready PNG format (.png) . These commands are not available unless
File is specified as the destination for the image.
23
Destination
Choose where to send the image. If Printer is checked, the graph will be sent to the
printer specified by Settings. If File is checked, the graph will be sent to a file that is
specified in a subsequent Select File dialog window. If Clipboard is checked, the
graph will be sent to the Windows clipboard, from which it can be transferred to another program via the Edit-Paste command.
Range
Choose how much of the data to print. Selection prints only the highlighted portion
of spreadsheets, First Page prints only page 1.
Printer and Page Setup
Provides access to printer settings as provided by the operating system, including
printer selection, orientation, and margins. All printers available to your Windows
system are available to GS+. This command is not available unless Printer is the
destination specified elsewhere in this dialog window.
Make backup
Check this box to make a backup copy of the file before converting. The backup
copy will have the same name as the original file except will have the extension .old.
24
Chapter 3
Working with Data
The Data Worksheet Window
The Data Worksheet contains the data for GS+ analyses. Data can be entered
manually or can be imported via the Import file command. Entered data can be
edited, filtered (or bounded), and can be temporarily or permanently deleted from
subsequent analyses. Field assignments (assigning fields or columns to xcoordinate values, y-coordinate values, etc.) are made in the worksheet window by
clicking on the second row.
27
Column Assignments
In the Data Worksheet window, you may specify which field or column to associate
with the Sample ID, X-Coordinate, Y-Coordinate, the Z variate, or the Z2 variate by
clicking on the second cell from the top in any column. This will bring up the Column
Assignment dialog:
Column name
The name of the column or variate name.
Assign column as
None make column unassigned.
Sample ID the specified column contains Sample Number or Sample ID information. This can be text or numeric data.
X Coordinate the specified column contains values for the X-Coordinate location. If you choose a column that is already assigned, the other variates
column will switch with the original X Coordinate column.
Y Coordinate -- the specified column contains values for the Y-Coordinate location. If you choose a column that is already assigned, the other variates
column will switch with the original Y Coordinate column.
Z (Primary Variate) the specified column contains values for the Z variate.
If you choose a column that is already assigned, the other variates column
will switch with the original Z column.
Z2 (Covariate) the specified column contains values for the Covariate.
The Covariate is used in cokriging. If you choose a column that is already
assigned to another variate, the other variates column will switch with the
original Z2 column.
Rebuild automatically when reassign
Rebuild the internal worksheet when window closes. This is equivalent to pressing
the Rebuild button on the Data Worksheet window.
28
Missing Values
Data in the Worksheet that are marked as Missing are ignored during analyses.
Permanent missing values appear as blank cells and temporary missing values appear in red. You can use the right mouse button to make cells temporarily missing
and vice versa.
When importing or exporting files special placeholders (values or symbols) can be
used to indicate missing values These placeholders are specified by the user in the
Preferences window, or during file imports by a value or symbol specified in the File
Import Properties window. In GS+, the default missing value indicator is the numeric
value -99.0 but this can be changed in the User Preferences window (Chapter 2).
Missing values appear in output files when a value cannot be interpolated because
the location appears in an exclusive polygon or because numerical limitations disallowed its computation (such as when a variogram model is inappropriately used during kriging).
29
Range to Use
In these fields specify how to constrain the data in the Worksheet to a particular
range. In the example shown, only those records for which the X direction field is
within the range of 0.60 to 40.00 will be included in analyses even though there are
records across the entire range of 0.60 79.90 in the worksheet. Although Worksheet values are treated as missing during data builds, they are not marked as missing in the Worksheet.
Reset
Reset Range to Use to the actual data range for a given coordinate or variate. Data
ranges can be set individually or (with the Reset All command) all at once.
Cancel
Cancel filter dialog.
OK
Accept the ranges specified. Note that for the filter to take effect you must check the
filter box on the Worksheet window.
30
Duplicate Values
When worksheet data are rebuilt into data arrays, GS+ checks each record against
every other record to check that no duplicate coordinates are present. If one or
more records are found to be duplicates, a warning screen will appear:
32
Chapter 4
Importing Data from External Files
File Import Dialog
A number of different file types can be imported into GS+. These include spreadsheets (e.g. Excel formats), databases (e.g. Access, dBase, Paradox, and FoxPro
formats), HTML (web) pages, and text files. Files are imported via the Import command within the Data Worksheet Window, which brings up the dialog box below.
From this dialog you choose the file and the format (as illustrated in the pull-down
list box at the bottom of the menu).
Open
Press Open to open the identified file. The window that next appears will depend on
the type of file to be opened. A text file will bring up the Import Text File dialog
whereas spreadsheet, database, and HTML formats will bring up the Import Spreadsheet dialog.
33
File Format
A variety of text file types can be imported into GS+, including formats defined by the
user. Each type has its own manner for separating fields within data records, for
handling missing values, for allowing header records, and for specifying names of
fields (column titles) within the file. Predefined input format types include the following; examples of files appear in the next section:
GS+ format , in which fields are separated by spaces (free format) or tabs,
missing values are indicated with the placeholder -99, the number of header
records is automatically detected, and column titles (variate names) appear
on the 2nd record separated by commas.
GeoEas format, in which fields are separated with commas, there are no
missing value indicators, the number of header records is specified on the
2nd record of the file, and column titles appear as individual records following
this second record.
34
Surfer XYZ format, in which fields are separated by spaces (free format),
missing values are indicated by blank fields, and the first record in the file is
a header record in which column titles (variate names) appear as fields
separated by spaces. Note that the Surfer XYZ format also allows fields to
be separated by commas, which should be specified separately as described below. Note that this format is not the same as the Surfer Grid file
format that can be used for Krig output files or Map input files.
ArcView XYZ format, in which fields are separated with commas, missing
values are indicated by placeholder commas, there is a single header record, and within the header record variate names are separated by quotes
and commas.
Custom, in which any of these format specifications can be changed or customized as specified in File Import Properties.
Properties
The Properties Command brings up a File Import Properties dialog window (described below) within which you can specify how GS+ should adjust its input parameters.
Change Column Assignments
The Change command brings up a Field or Column Assignment window from which
you can assign variates (e.g. X coordinate) to columns or fields in the data file. Columns can be reassigned later from the Data Worksheet window.
OK
Press OK to import the specified files data into the GS+ Worksheet and close the
dialog window. If data already exists in the GS+ Worksheet, you will be asked what
to do with it.
Cancel
Press Cancel to exit the Import Text File dialog without importing data.
35
36
Pb,
0.42
0.6
0.6
0.43
0.51
0.34
0.37
0.61
0.46
Al
0.42
0.45
0.08
-99.00
0.14
0.32
0.12
-99.00
0.49
37
38
Ydata
11.9
29.4
32.6
44.5
64
71.8
3.5
10.2
16.3
Zdata
0.42
0.45
0.08
0.14
0.32
0.12
0.49
39
Worksheets Available
The worksheets or tables within individual spreadsheet or database files are listed
here and can be chosen via the pull-down listing.
Properties
The Properties Command brings up a File Import Properties dialog window within
which you can specify how GS+ should identify missing values. When importing
spreadsheets and databases, other file import properties are limited to single predefined choices e.g. spreadsheet columns are used to denote different GS+ Worksheet fields.
Change Column Assignments
The Change command brings up a Field or Column Assignment window from which
you can assign variates (e.g. X coordinate) to columns or fields in the data file. Columns can be reassigned later from the Data Worksheet window.
OK/Cancel
Press OK to import the specified files data into the GS+ Worksheet and close the
dialog window. If data already exists in the GS+ Worksheet, you will be asked what
to do with it. Press Cancel to exit without importing data.
40
Data Records
The Field (Column) Delimiter specifies how individual values within the data records
are formatted. For text files values can be comma separated, tab separated, space
separated (free format), character separated, or binary data. When spreadsheets
are imported, columns are delimited by Table Cells only.
The Missing Value Indicator specifies the value or character within the file that indicates that a value is missing and that the record should be ignored during importing. A missing value that is imported becomes a blank cell in the GS+ Data Worksheet. The indicator can be absent, a decimal point, a numeric value, or a character.
Header Records
The Number of Header Records indicates whether the first records in the file contain header records descriptive text that should be ignored as the file is read into
the Data Worksheet. For text files choose None, Varies, or Fixed Number. Header
records will be read into the Data Description field of the Data Worksheet Window.
Spreadsheet files are assumed to contain a single header row containing column
names.
The Column Title Separator refers to whether column titles (variate names) appear
in the second record of the file, and if so, how names are separated from one another within the record. For text files choose No Field Names, Same as for Data Records, Quotes, Brackets, Comma, Tab, Space, or Character. When spreadsheets
are imported, columns titles appear in different cells of the first spreadsheet r ow
OK/Cancel
Press OK to accept the settings shown and return to the Import Text File or Import
Spreadsheet dialog. If the settings are different from the standard settings for the
format originally defined, the format will change to Custom with the new settings defining the new Custom format. Press Cancel to close without importing.
41
42
Chapter 5
Summary Statistics
Data Summary Window - Z Tab
The Data Summary window provides standard descriptive statistics for the variates
defined in the Data Worksheet window. Information is provided for both the Zvariate (as below), for the coordinate variates in a separate X,Y Coordinates tab,
and when a covariate is being analyzed, for the regression of Z vs. the Covariate Z2
in a separate Regression tab.
For the Z-variate it is also possible to specify a lognormal (base e) or square-root
transformation in order to better normalize the variates distribution prior to geostatistical analysis. If you do transform the variate, you may choose to have GS+ report
the interpolated (Kriged) values either in transformed form or backtransformed to the
original measurement domain. The backtransformation occurs after all analyses
have been performed, and it is not applied to autocorrelation results.
Also from the Data Summary window you can access a full-window frequency distribution, a cumulative probability distribution, or a normal probability distribution by
clicking on the small frequency distribution image.
Transformation
It is often helpful to apply a lognormal or a square root transformation to a Z variate
in order to normalize for skewed frequency distributions. It can also be useful to
scale data to a range of 0-1 if the values are extremely large. The transformation
specified is applied to every Z value in the data set prior to geostatistical analysis;
43
44
List Values
Press List Values to bring up a window containing the data used for the frequency
distribution.
Edit Graph
Press Edit Graph to bring up the Graph Settings Dialog Window, which will allow
you to make changes to the graph including changes in the number of distribution
classes (bars).
Print Graph
Press Print Graph to bring up the Graph Print dialog window.
Plot
Type of plot to graph. Choose either Frequency (as displayed above), Cumulative
Frequency, or Normal Probability curves (shown later).
45
Actions
You may Print or Copy the contents of the worksheet using the menu commands of
the main GS+ window. You may also change the Decimal Places reported by highlighting a column and pressing the Increase or Decrease Decimals icon (or use the
Data | Change Decimals menu command). Clicking on the top of a column will Sort
the worksheet based on the column selected in alternating ascending or descending order. You may also change Column Widths by placing the cursor over the line
between two columns and dragging to a new location.
46
List Values
Press List Values to bring up a window containing a listing of cumulative frequency
distribution values.
Edit Graph
Press Edit Graph to bring up the Graph Settings Dialog Window, which will allow
you to make changes to the graph.
Print Graph
Press Print Graph to bring up the Graph Print Dialog Window.
Plot
Type of plot to graph. Choose either Frequency, Cumulative Frequency (as displayed above), or Normal Probability curves (displayed below).
47
Actions
You may Print or Copy the contents of the worksheet using the menu commands of
the main GS+ window. You may also change the Decimal Places reported by highlighting a column and pressing the Increase or Decrease Decimals icon (or use the
Data | Change Decimals menu command). Clicking on the top of a column will Sort
the worksheet based on the column selected in alternating ascending or descending order. You may also change Column Widths by placing the cursor over the line
between two columns and dragging to a new location.
48
List Values
Press List Values to bring up a window containing a listing of normal probability distribution values.
Edit Graph
Press Edit Graph to bring up the Graph Settings Dialog Window, which will allow
you to make changes to the graph.
Print Graph
Press Print Graph to bring up the Graph Print Dialog Window.
Plot
Type of plot to graph. Choose either Frequency, Cumulative Frequency, or Normal
Probability (as displayed above) curves.
49
Actions
You may Print or Copy the contents of the worksheet using the menu commands of
the main GS+ window. You may also change the Decimal Places reported by highlighting a column and pressing the Increase or Decrease Decimals icon (or use the
Data | Change Decimals menu command). Clicking on the top of a column will Sort
the worksheet based on the column selected in alternating ascending or descending order. You may also change Column Widths by placing the cursor over the line
between two columns and dragging to a new location.
50
51
53
1d Scattergrams
The 1d Coordinate Posting window provides a map of coordinate locations for 1dimensional data. The location of each data point in the active data set is marked
by a symbol corresponding to its relative value. Both the number of levels displayed
as well as the range for each level is user-defined. A similar window provides postings for 2D data, as described above.
54
Value Range
Specify the upper threshold of each value range. Changing a Percentile or Intervals
value will change the Type of Level (defined in the Coordinate Posting window) to
Custom. The number of values displayed is based on the number of Levels chosen
in the Coordinate Posting window.
Symbol
Choose a symbol for the specified interval.
Name
Choose a name to appear in the chart legend.
Color
Choose a color for the specified symbol by clicking on the colored box.
Size
Choose a size for the specified symbol
Legend title
Specify the title to appear at the top of the legends. No legend will appear if only 1
interval level is chosen.
55
Enlarge
Press Enlarge to view a larger version of the Cross-Variate Regression scatterplot.
From the Regression window you will be able to edit and print the graph as well as
list graphed data values, as described below.
56
Cross-Variate Regression
The best test of whether a covariate is related to the primary variate a prerequisite
for cokriging is to perform a regression analysis. In GS+ the results of the regression of the primary variate Z vs. the Covariate Z2 is presented in the Data Summary
Regression tab. Clicking the Enlarge command in that window brings up the CrossVariate Regression window:
List Values
Press List Values to bring up a window containing a listing of the values used in the
regression, as described below.
Edit Graph
Press Edit Graph to bring up the Graph Settings Dialog Window, which will allow
you to make changes to the graph.
Print Graph
Press Print Graph to bring up the Graph Print Dialog Window.
57
Actions
You may Print or Copy the contents of the worksheet using the menu commands of
the main GS+ window. You may also change the Decimal Places reported by highlighting a column and pressing the Increase or Decrease Decimals icon (or use the
Data | Change Decimals menu command). Clicking on the top of a column will Sort
the worksheet based on the column selected in alternating ascending or descending order. You may also change Column Widths by placing the cursor over the line
between two columns and dragging to a new location.
58
Chapter 6
Semivariance Analysis - Overview
Semivariance is an autocorrelation statistic defined as
class = INT([D/DI]+0.5) + 1
Autocorrelation Window
The Autocorrelation Window is where various options for variogram calculations and
the end results appear; see the Semivariance Analysis summary for a definition of
semivariance and formulas for lag class distance intervals.
Variogram Options
Show Sample Variance
Check this option to show the sample variance for the data as a dashed line on the
variogram graphs.
Show Variogram Model
Check this option to show a model for the variogram points. If the model has already been defined, either automatically or manually, the variograms will be redrawn
with the model now graphed. If a model has not yet been defined, or upon executing the Calculate command, a best-fit model will be calculated and graphed.
To see the model parameters and to change the model, use the Model command at
the bottom of the variogram image.
Expand
The Expand command brings up a separate variogram window, from which the
variogram can be printed or formatted. Variance Cloud Analysis, the ability to view
individual semivariance values, and the number of pairs per variogram class interval
are also available from these windows.
Model
The Model command brings up a Model Dialog window within which you may
change the variogram model. The Model command is enabled only when the Show
Model Variogram Option is selected.
Surface
The Surface command brings up the Anisotropic Variogram Surface Map window.
The surface map is useful for visualizing anisotropic autocorrelation when present,
as described below.
Calculate
The Calculate command causes the semivariogram to be calculated.
63
Clear
Clear the worksheet.
Import
Import a text file containing the lag class interval bounds. The default extension for
interval files is .int. The format of the file to be imported is numeric-only records following a variable number of alphanumeric header records. For example:
line 1:
line 2:
line 3:
line 4:
line 5:
line 6:
line 7:
line 8:
64
65
Variograms Window
The Variograms window presents a full-window variogram that can be edited and
printed. Different tabs hold variograms for the different anisotropic directions. Additionally, the semivariance values that were used to produce the variogram can be
listed, and Variance Cloud Analysis can be performed. Variance Clouds provide a
means for detecting outlier pairs of points that may be artificially skewing the variogram.
You can use the mouse cursor to identify the number of pairs in specific lag classes
(reported at the bottom of the window; in the example below, the cursor (not shown)
was over the 2nd symbol in the variogram), and to begin variance cloud analysis.
Note also that parameters for the variogram model (if present) are presented in a
graph footnote. If you do not want parameters to appear as a footnote you must use
the Edit Graph command to bring up the Graph Settings dialog and in the Footnote
field remove the notation (model results).
List Values
Bring up a Semivariance Values Window, including for each lag class the average
separation distance for pairs of points in that class, the average semivariance for
those points, and the number of pairs of points upon which the average distance and
semivariance are based.
66
Semivariance Values
In this worksheet are listed for each lag class the average separation distance for
pairs of points in that class, the average semivariance for those points, and the
number of pairs of points upon which the average distance and semivariance are
based. This is a read-only worksheet.
Actions
You may Print or Copy the contents of the worksheet using the menu commands of
the main GS+ window. You may also change the Decimal Places reported by highlighting a column and pressing the Increase or Decrease Decimals icon (or use the
Data | Change Decimals menu command). Clicking on the top of a column will Sort
the worksheet based on the column selected in alternating ascending or descending order. You may also change Column Widths by placing the cursor over the line
between two columns and dragging to a new location.
67
Semivariance
0.19
0.12
Nugget (C0) = 0.04
0.06
Range (A) = 13.3
C0
0.00
0
20
40
68
60
80
GS calculates default values for each parameter of the five models. You may
change any of these three model parameters from the Isotropic Variogram Model
dialog window:
70
for h A0
for h > A0
where
71
72
73
74
Nugget Variance or C0 the y-intercept of the model; this value is the same
for all directions.
Sill or C0+C the model asymptote; this value is the same for all directions.
Range or A the separation distance over which spatial dependence is apparent for the direction examined. It is the sum of:
GS+ calculates default values for each parameter of the five models. You may
change any of these four model parameters from the Anisotropic Variogram Model
dialog window:
for h A
(h) = C0 + C
for h > A
where
(h) = C0 + h(C / A)
where
79
Edit Graph
Change axis scales and other graph settings via the Graph Settings dialog window.
Print
Print the graph to a file, printer, or other device.
3D
Change the graph projection to 3D from 2D or to 2D from 3D.
80
Commands for the 3D map projection are the same as those for the 2D projection,
above.
82
Commands in the Cross-Autocorrelation window are the same as in the Autocorrelation window, earlier in this chapter.
83
84
Chapter 7
Other Autocorrelation Measures
Standardized Variograms
In standardized variogram analysis the variogram is computed as a proportion of
sample variance:
The List, Edit, Print, Cloud, Scatter, and Exit buttons work here as they do for the
Variograms Window.
85
Madograms
Madograms are similar to traditional variograms (see Semivariance Analysis), except that the absolute difference between zi and zi+h is used rather than the square
of the difference as for traditional semivariance. The formula thus becomes
The List, Edit, Print, Cloud, Scatter, and Exit buttons work here as they do for the
Variograms Window.
86
Rodograms
Rodograms are similar to traditional variograms (see Semivariance Analysis), except
that the square root of the difference between zi and zi+his calculated rather than the
square of the difference. The formula thus becomes
The List, Edit, Print, Cloud, Scatter, and Exit buttons work here as they do for the
Variograms Window.
87
Drift
Drift (also called trend) is calculated as
F(h) = [1 / N(h)] [ (zi zi+h) ]
where
F(h) = drift for interval distance class h,
zi = measured sample value at point i,
zi+h = measured sample value at point i+h, and
N(h) = total number of sample couples for the lag interval h.
Drift graphs (both isotropic and anisotropic) appear in the Autocorrelation Window,
on the Drift tab. Press the Expand button on this tab to bring up the Drift window:
The List, Edit, Print, Cloud, Scatter, and Exit buttons work here as they do for the
Variograms Window.
88
Correlograms
Correlograms are calculated using the formula
(h) = C(h) / (-h +h)
where
(h) = correlation for interval distance class h,
+h = standard deviation of all head values (zi) for lag h, and
-h = standard deviation of all tail values (zi+h) for lag h.
Correlograms (both isotropic and anisotropic) appear in the Autocorrelation Window,
on the Correlogram tab. Press the Expand button on this tab to bring up the Correlogram window:
The List, Edit, Print, Cloud, Scatter, and Exit buttons work here as they do for the
Variograms Window.
89
Covariance Analysis
Covariance in GS+ is computed as:
C(h) = [1 / N(h)] zi zi+h m-h m+h
where
C(h) = covariance for interval distance class h,
N(h) = total number of sample couples for the lag interval h,
zi = measured sample value at point i,
zi+h = measured sample value at point i+h,
m-h = mean of all head values for lag h, or [1/N(h)] zi, and
m+h = mean of all tail values for lag h, or [1/N(h)] zi+h.
Cross-covariance is computed in the same manner but zi and zi+h represent two different variates, Z and Z2, respectively.
The covariance graph appears in the Autocorrelation Window, on the Covariance
tab. Press the Expand button to bring up the Covariance window:
The List, Edit, Print, Cloud, Scatter, and Exit buttons work here as they do for the
Variograms Window.
90
m-h = mean of all head values for lag h, or [1/N(h)] zi, and
m+h = mean of all tail values for lag h, or [1/N(h)] zi+h.
General relative variograms (both isotropic and anisotropic) appear in the Autocorrelation Window, on the General Relative Variogram tab. Press the Expand button on
this tab to bring up the General Relative Variograms window:
The List, Edit, Print, Cloud, Scatter, and Exit buttons work here as they do for the
Variograms Window.
91
The List, Edit, Print, Cloud, Scatter, and Exit buttons work here as they do for the
Variograms Window.
92
Morans I Analysis
The Morans I statistic is a conventional measure of autocorrelation, similar in interpretation to the Pearsons Product Moment correlation statistic for independent
samples in that both statistics range between -1.0 and 1.0 depending on the degree
and direction of correlation. The statistic is defined as:
I(h) = N(h) zi zi+h / zi+h2
where
I(h) = autocorrelation for interval distance class h,
zi = the measured sample value at point i, and
zi+h = the measured sample value at point i+h.
Note that in this analysis all of the weights in the adjacency matrix (Sokal and Oden
1978) are set to 1, i.e. I is weighted by distance (h) between sample points rather
than by simple adjacency. Calculating this statistic for a variety of lag distances
yields the Morans I autocorrelogram.
Morans I correlograms (both isotropic and anisotropic) appear in the Autocorrelation
Window, on the Morans I tab. Press the Expand button on this tab to bring up the
Moranss I window:
The List, Edit, Print, Cloud, Scatter, and Exit buttons work here as they do for the
Variograms Window.
93
Fractal Analysis
The Fractal Analysis window is used to set up the parameters for fractal analysis
and then calculate the fractal dimension D for the data set defined in the Data Definition Menu. GS+ calculates the fractal dimension D as a function of the slope of a loglog variogram plot (Burrough 1981, 1986 ):
D=2-m/2
where
D = the Hausdorff-Besicovitch statistic, and
m = the slope of a log-log variogram
Because D is based on an analysis of semivariance, it is sensitive to the same
analysis parameters that affect semivariance analysis.
Fractal variograms (both isotropic and anisotropic) appear in the Autocorrelation
Window, on the Fractal tab. Press the Expand button on this tab to bring up the
Fractal window:
The List, Edit, Print, Cloud, Scatter, and Exit buttons work here as they do for the
Variograms Window.
94
Chapter 8
Variance Clouds and h-Scattergrams
Variograms can be sensitive to outlier values in a data set, and sometimes an erratic-looking variogram can be traced to a single extreme value that inflates (h)
whenever they are compared with other values. One way to locate such outliers is
to draw a variance cloud for each separation distance interval (or lag class) h; another way is to draw an h-Scattergram for each interval.
Variance clouds and h-Scattergrams are measures of pairwise variance in autocorrelation graphs. In a variogram each point on the graph represents the average
semivariance for all pairs of points in a particular lag class. h-Scattergrams and
variance clouds let you see the individual pairs of points that are used to calcualte
the average semivariance for that class.
Variance Clouds
The formula to calculate variance for any given pair of points at locations i and j can
be reduced to the mean square difference between the points:
Varij = [ zi zj ]2
where
Varij = variance for pair (i,j),
zi = measured sample value at point i, and
zj = measured sample value at point j.
Any given pair(i,j) are separated by a specific distance; this distance is plotted along
the x axis of the variance cloud graph. All pairs on a specific graph are in the same
separation distance (lag) class.
Note that a variance cloud is specific to both direction (isotropic or a specific anisotropic direction) and to a particular lag class. In the variogram below, the cursor is on
the point representing lag class 7 of the isotropic variogram, which one might suspect contains an outlier because it is so different from the other points:
95
Clicking on this variogram point or the Cloud button brings up the variance cloud for
lag class 7 below, and it becomes apparent that a number of pairs are very different
from the others placing the cursor over each of the outlying points reveals that all
of them contain record 4 as a member of the pair (the cursor below is over the point
represented by records 4 and 96 at a separation distance of 52.77):
96
Since record 4 is a member of all of these outlier pairs, we can choose to mask record 4, which has a transformed value of 2.639 (if the data were not transformed,
this value would match the value in the Data Worksheet). This gives us a much
more reasonable variogram:
Re-examination of the variance cloud for lag class 7 (below) reveals that the highest
y-axis values are substantially lower than before (0.35 vs. 6.12) and, more importantly, all of the major outlier pairs are gone. This was accomplished by removing a
single data record from the analysis:
97
Lag Class
The lag class for which the variance cloud is created. The variance for every pair of
points in the lag class is plotted against the distance interval separating that pair.
List Values
Bring up a listing of Variance Cloud Pairs, including for each pair the variance, separation distance, and the identity (data record number) of each member of the pair.
Edit Graph
Bring up a Graph Settings dialog window for editing the graph.
Print Graph
Print the graph via a Graph Print dialog window.
Cloud and Scatter
Choose either Variance Cloud or h-Scattergram analysis.
98
Actions
You may Print or Copy the contents of the worksheet using the menu commands of
the main GS+ window. You may also change the Decimal Places reported by highlighting a column and pressing the Increase or Decrease Decimals icon (or use the
Data | Change Decimals menu command). Clicking on the top of a column will Sort
the worksheet based on the column selected in alternating ascending or descending order. You may also change Column Widths by placing the cursor over the line
between two columns and dragging to a new location.
99
h-Scattergrams
In an h-Scattergram each head value zi is plotted against each tail value zj for a
given lag class h. The formula to calculate this difference for any given pair of points
at locations i and j is
Dij = zi zj
where
Dij = difference for pair (i,j),
zi = measured sample value at point I, and
zj = measured sample value at point j.
Note that an h-Scattergram is specific to both direction (isotropic or a specific anisotropic direction) and to a particular lag class. In the variogram below, displayed earlier, the cursor is on the point representing lag class 7 of the isotropic variogramwhich looks odd:
Clicking on this variogram point or pressing the Scatter button brings up the hScattergram for lag class 7 below, and it becomes apparent that a number of pairs
are very different from the others. Placing the cursor over each point reveals that all
of the points farthest from the 45-degree line contain record 4 as a member of the
pair (the cursor below is over the point represented by records 4 and 92 with a separation distance of 52.28):
100
Clicking on this h-Scattergram point brings up the Sample Details window, which
gives us the option to temporarily mask (remove from the active data set) one of the
data records for this pair:
Since record 4 is a member of all of these outlier pairs, we choose to mask record 4,
which has a transformed value of 2.639 (if the data were not transformed, this value
would match the value in the Data Worksheet). This gives us a much more reasonable variogram:
101
Re-examination of the h-scattergram for lag class 7 reveals that the greatest differences between head and tail values of individual pairs are substantially lower than
before (0.8 vs. 2.8) and all of the major outlier pairs are gone. This was accomplished by removing a single data record from the analysis:
102
103
h-Scattergram Pairs
This read-only window contains a listing of all pairs of points within a specific lag
class, including for each pair the difference, separation distance, and the identity
(data record number) of each member of the pair. These pairs are graphed in the hScattergram window.
Note that in the example below, sorted by difference (with pairs of greatest difference listed first), record 4 shows up as a member of 13 high-variance pairs. This
confirms the record as an outlier, as was found in the h-Scattergram above:
Actions
You may Print or Copy the contents of the worksheet using the menu commands of
the main GS+ window. You may also change the Decimal Places reported by highlighting a column and pressing the Increase or Decrease Decimals icon (or use the
Data | Change Decimals menu command). Clicking on the top of a column will Sort
the worksheet based on the column selected in alternating ascending or descending order. You may also change Column Widths by placing the cursor over the line
between two columns and dragging to a new location.
104
Chapter 9
Interpolation Basics
Interpolation is the estimation of values in an area for points not actually sampled.
There are many different interpolation techniques, ranging from simple linear techniques that average the values of nearby sampled points, to more complex techniques like kriging that use in the average weights based on distance to nearby
sample points and the degree of autocorrelation for those distances.
GS+ provides three broad types of interpolation. All are nearest-neighbor techniques
in which values at locations close to the interpolation point are used to estimate the
interpolation point value. They differ in the way that nearby locations are weighted
and interpolations calculated. The three techniques are:
Ordinary Kriging, in which interpolation estimates are based on values at
neighboring locations plus knowledge about the underlying spatial relationships in a data set. Semivariograms provide knowledge about the underlying
relationships. The estimated value at a given location is a weighted moving
average of best estimates calculated to minimize local area variance.
Conditional Simulation, in which interpolations are based on a form of stochastic simulation in which data values are honored at their locations. This
means that local details are not obscured by smoothing as they are in
kriging.
Inverse Distance Weighting (IDW) and Normal Distance Weighting
(NDW), in which interpolation estimates are made based on values at nearby
locations weighted only by distance from the interpolation location. Neither
IDW nor NDW make assumptions about spatial relationships except the basic assumption that nearby points ought to be more closely related than distant points to the value at the interpolate location. IDW applies stronger
weights to nearby points than does NDW.
All three techniques are described in greater detail on the following pages.
105
Interpolation Range
Defines where to place interpolation estimates in a uniformly spaced grid across a
rectangular area or at user-specified locations, in either case with or without masks
that can define areas to include or exclude.
Uniform x-y grid (specified intervals)
A grid is defined by a rectangle that has an X-direction length, a Y-direction
length, and for each direction, intervals between the grid intersections. Interpolation locations are at every grid intersection.
The default range is defined by the minimum and maximum X-coordinate and
Y-coordinate values, and an interval based on a certain number of points in
each direction. For 1-dimensional data sets, there is no y direction.
The grid can be changed with the Define command, which will display an Interpolation Grid dialog window.
106
108
109
Interpolation Range
Specify the beginning and ending values for the region to be interpolated. The region may exceed the Data Range, which is noted on the right side of the dialog window. Irregularly-shaped areas within the bounds of the interpolation range may be
interpolated by specifying polygon masks from the Krig window.
Important Note: Interpolation will begin at the minimum X and Y values, but may
not reach the maximum values if the total distance (e.g. the maximum X value less
the minimum X value) is not evenly divisible by the Distance Interval. For example,
interpolation of an X Range of 0 105 with a Distance Interval of 10 will stop interpolating at X = 100, prior to exceeding the maximum range of 105.
Data Range
This is the range covered by the actual data set; these are read-only values, they
cannot be changed by the user from this dialog window.
Distance Interval
Specify the distance interval between locations within the interpolation range. A distance interval of 2.0 over a range of 0 to 10.0 means that interpolations will be made
at points 0.0, 2.0, 4.0, 6.0, 8.0, and 10.0. Changing the distance interval will change
the number of points value.
Number of Points
The number of points within the range to be interpolated. For any given range, the
110
111
Block Size
For Block Kriging, this is the size of the block around each point that will be kriged.
For uniform grids, block size is equal to the interpolation grid size. The local (discretization) grid is placed within the box. For Point or Punctual Kriging, block size is
irrelevant and is ignored during analysis.
Clear
Clears the worksheet.
Import
The Import command brings into the worksheet a text file containing interpolate locations. The text file can be formatted in a variety of ways. Header records are initial records containing non-numeric data that are ignored during import. Data records follow header records (if present) and contain a value for the X-Coordinate
followed by a value for the Y-Coordinate; additional fields are ignored (as is the Y112
Other Actions
You may Print or Copy the contents of the worksheet using the menu commands of
the main GS+ window. You may also change the Decimal Places reported by highlighting a column and pressing the Increase or Decrease Decimals icon (or use the
Data | Change Decimals menu command). Clicking on the top of a column will Sort
the worksheet based on the column selected in alternating ascending or descending order. You may also change Column Widths by placing the cursor over the line
between two columns and dragging to a new location.
113
The example below defines two polygons: the first is a 6-sided area that is excluded
from interpolation, the second defines an inclusive 4-sided area (rectangle) inside
the 6-sided area. You can use the Map command to produce a picture of these
polygons (see below). The first line in this example (Polygon) is unnecessary. The
word "Exclude" or "Include" is not optional; each set of polygon vertices must be
preceded by either Exclude or Include:
114
115
Edit Graph
Make format and text changes to the graph via a Graph Settings dialog window.
Print Graph
Print the graph to a printer or file with the Graph Print dialog window.
116
Cross-Validation Analysis
Cross-validation analysis is a means for evaluating effective parameters for kriging
and IDW interpolations. In cross-validation analysis each measured point in a spatial domain is individually removed from the domain and its value estimated via
kriging or IDW as though it were never there. In this way a graph can be constructed of the estimated vs. actual values for each sample location in the domain.
Placing the cursor over a point provides information on the estimate for a specific
location as in the example below.
Each point on the graph represents a location in the input data set for which an actual and estimated value are available. Information about individual points is provided at the bottom of the screen; points are displayed by placing the cursor on
them; in the case above the cursor was placed about the point representing record
130.
The regression coefficient at the bottom of the graph represents a measure of the
goodness of fit for the least-squares model describing the linear regression equation. A perfect 1:1 fit would have a regression coefficient (slope) of 1.00 and the
best-fit line (the solid line in the graph above) would coincide with the dotted 45degree line on the graph. The standard error (SE = 0.105, above) refers to the
standard error of the regression coefficient; the r2 value is the proportion of variation
explained by the best-fit line (in this case 39.0%; it is the square of the correlation
coefficient); and the y-intercept of the best-fit line is also provided. The SE Prediction
term is defined as SD x (1 - r2)0.5 , where SD = standard deviation of the actual data
(the data graphed on the y-axis).
117
Cross-Validation Values
In cross-validation analysis each measured point in a spatial domain is individually
removed from the domain and its value estimated via kriging as though it were never
there. Results can be graphed as well as listed in this read-only worksheet table.
Note that Estimated Z values may or may not be backtransformed depending on
settings in the Data Summary worksheet. If you have chosen a transformation without a backtransformation, then the Estimated Z values will appear very different from
Actual Z values regardless of the integrity of the interpolation system.
Table
The record number refers to the actual record in the Data Worksheet. For each record successfully kriged the actual Z value is presented in the middle column and
the estimated Z value is provided to its right.
Other Actions
You may Print or Copy the contents of the worksheet using the menu commands of
the main GS+ window. You may also change the Decimal Places reported by highlighting a column and pressing the Increase or Decrease Decimals icon (or use the
Data | Change Decimals menu command). Clicking on the top of a column will Sort
the worksheet based on the column selected in alternating ascending or descending order. You may also change Column Widths by placing the cursor over the line
between two columns and dragging to a new location.
118
Chapter 10
Conditional Simulation
Conditional simulation is an advanced interpolation technique whereby Z estimates
are based on a form of stochastic simulation in which measured data values are
honored at their locations. Other interpolation methods, including kriging and IDW,
will smooth out local details of spatial variation, especially as interpolated locations
become more distant from measured locations. This can be a problem when you are
trying to map sharp spatial discontinuities such as contamination hotspots or fault
lines. GS+ uses a sequential gaussian simulation method.
The Interpolation Window contains the Simulate tab:
Variogram Model
Variogram models for isotropic and anisotropic variograms are chosen with the
Model command in the Autocorrelation Analysis window. Here in the Simulate window you can specify whether to use the isotropic or anisotropic model for the
variogram used in the kriging system.
Output
Two types of output are available with conditional simulation: either a) the estimated
Z value and its standard deviation, or b) the probability that the estimated value for
that location is greater than some threshold value t.
An individual Z estimate and its standard deviation is the mean and standard deviation of n simulations for a specific location. Thus the number of
simulations will affect both of these values.
119
Residuals. This is the same as simple kriging with a locally varying mean.
Press the Define button to define the residuals to be used.
External drift. Press the Define button to define the drift values.
Colocated cokriging. Press the Define button to define the covariate values.
120
Clear
Clear existing data from worksheet.
Fill X,Y
Fill the X and Y Coordinate columns with calculated values based on the uniform
interpolation grid defined in the Interpolation window. If a uniform grid is not defined
in the interpolation window this button is not available.
Import
Import values from an external text file. Each record of the file must contain a separate x, y, and secondary data value separated by commas, tabs, or spaces. You
can also import data by copying and pasting into the Windows clipboard.
Other Actions
You may Print or Copy the contents of the worksheet using the menu commands of
the main GS+ window. You may also change the Decimal Places reported by highlighting a column and pressing the Increase or Decrease Decimals icon (or use the
Data | Change Decimals menu command). Clicking on the top of a column will Sort
the worksheet based on the column selected in alternating ascending or descending order. You may also change Column Widths by placing the cursor over the line
between two columns and dragging to a new location.
121
Number of Simulations
In conditional simulation the number of simulations used to produce estimates of Z
can strongly affect the outcome of the interpolation. Presented below are maps of
an 80 x 80 m grid interpolated at a density of 0.5 m, with different numbers of simulations. Below left is a map of block kriged data; on the right is the same data set
interpolated by conditional simulation (n = 1 simulation).
Additional simulations improve clarity; the map below left represents n = 10 simulations, below right n = 100 simulations:
And still more simulations resolve further detail. The map below left represents n =
1,000 simulations; below right n = 10,000 simulations:
You may wish to set the number of simulations high enough that additional simulations reveal little further resolution.
122
Chapter 11
Kriging and Cokriging
Kriging provides a means of interpolating values for points not physically sampled
using knowledge about the underlying spatial relationships in a data set to do so.
Variograms provide this knowledge. Kriging is based on regionalized variable theory
and provides an optimal interpolation estimate for a given coordinate location. GS+
performs ordinary kriging.
The Interpolation Window contains the Krig tab:
Krig Methods
Block vs. Point Kriging. You may choose either block or point (punctual) kriging.
Your choice should be made on the basis of sampling design and variate characteristics. If samples were taken to represent an area around the actual sample point
(e.g. if soil samples from an area around the sampling coordinate were composited
before analysis), then block kriging may be more appropriate than punctual. If samples were taken to represent point values then punctual kriging may be more appropriate. For uniform grids, block size is equal to the interpolation grid size. For nonuniform grids, block size is defined in the
Local grid. The Discretization or Local grid describes the size of the grid
placed around the interpolation point and inside the block when block kriging.
The interpolation estimate for that point is based on the mean value of estimates for each of the discretization grid points. A single discretization point
describes punctual kriging. Larger discretization grids take longer to interpolate.
Cokrig. Cokriging provides a means for interpolating values using a secondary variate Z2 that covaries with the primary variate Z. The covariate is
sampled from the same locations as Z, and also from a number of additional
locations. The kriging estimate is based not only on distance to nearby sam123
124
Cokriging
Cokriging is an interpolation technique that allows one to better estimate map values
if the distribution of a secondary variate sampled more intensely than the primary
variate is known. If the primary variate is difficult or expensive to measure, then cokriging can greatly improve interpolation estimates without having to more intensely
sample the primary variate.
Consider the following example. After a radioactive spill, plutonium was sampled
across an 80 x 80 m area at a sample density indicated by the quartile plot below.
Block kriging (following variography) resulted in the adjacent map:
Soil carbon, easier to measure than Uranium, was sampled at the same locations as
Uranium and additionally at another 60 locations as noted in the quartile map below
left. Regression of carbon against plutonium showed that the variates were highly
correlated (right), suggesting that cokriging might improve the map of plutonium.
125
How do you perform cokriging? Prior to cokriging you must a) define a covariate in
the Data Worksheet Field Assignment dialog, b) perform semivariance analysis (including variogram modeling) for both the primary variate Z and the covariate Z2, and
c) perform cross-semivariance Z x Z2 analysis (including variogram modeling). You
should also check that the covariate is in fact correlated with the primary variate.
126
Chapter 12
Inverse (IDW) and Normal (NDW) Distance Weighting
Inverse Distance Weighting (IDW) and Normal Distance Weighting (NDW) are interpolation techniques in which interpolated estimates are made based on values at
nearby locations weighted only by distance from the interpolation location. Neither
IDW nor NDW make assumptions about spatial relationships except the basic assumption that nearby points ought to be more closely related than distant points to
the value at the interpolate location. IDW applies stronger weights to nearby points
than does NDW. The formula used for Inverse Distance Weighting is:
Zestj = [ zi / (hij + s)p] / [1 / (hij + s)p]
where
Zestj = estimated value for location j
zi = measured sample value at point i,
hij = distance between Zestj and zi;
s = smoothing factor; and
p = weighting power.
The formula used for Normal Distance Weighting is the same as that for IDW except
that the denominator [distance plus smoothing factor] rather than [inverse of distance plus smoothing factor].
The weighting power p defines the rate at which weights fall off with hij, the distance
between the interpolated and sample locations. A value of 1-5 is typical.
The smoothing factor s reduces the likelihood that any one sample value will overly
influence an estimated value for a given interpolation location. IDW is an exact interpolator, so where an interpolation location j coincides with a sample location i,
Zestj = zi and a sharp peak or valley may result; setting s > 0 reduces this peaking effect when it occurs.
127
128
Chapter 13
Interpolation Output File Formats
GS+ Format Output Files
The GS+ format for Krig (.krg), IDW, and Conditional Simulation output files (which is
also the GS+ format for Map input files) contains a number of header records containing information about the file, followed by XYZ-style records for each interpolated
points. Each data record contains fields for an x-coordinate value, a y-coordinate
value (except 1-dimensional transects do not contain a value for the y-coordinate),
an estimated z value for that x-y location, an estimation standard deviation for the
estimated z-value (except IDW files), and the number of pairs used in the interpolation for that x-y location. Missing values are denoted by whatever value is specified
in the Preferences window. Note that this format is different from the GS+ Input File
format.
The following listing is for the first 6 coordinates for a standard GS+ krig output file. A GS+ IDW Output file would look identical except that there would
be no field for Estimated Standard Deviation in the data records.
GS+ Output: Block Kriging Interpolation File v7.0
Set: Field 54, Second Tier
Dimensions: 2; Interval source: Calculated
Interpolation interval (x; y): 1.3333; 1.3333
X-coor: m east
range: 0.00 - 80.00
Y-coor: m north
range: 0.00 - 80.00
Z-est: Pb
range: 0.151 - 0.813
Z-sd:
range: 0.0000 - 0.3655
Mean Z-estimate (sd): 0.391 (0.0186)
Valid N: 3721; Missing N: 0; Missing Value Indicator: -99
X-Coordinate
Y-Coordinate
Z-Estimate
EstStdDev
0.00
0.00
0.431
0.3608 16
0.00
1.33
0.458
0.3486 16
0.00
2.67
0.462
0.3329 16
0.00
4.00
0.466
0.3178 16
0.00
5.33
0.470
0.3034 16
0.00
6.67
0.474
0.2900 16
129
To convert this file to an ArcGIS grid file, use ArcToolbox, either the light version that comes with ArvView or the full version that comes with ArcInfo:
a. Open ArcToolbox
b. Choose the Import to Raster tool
c. Specify the GS+ ouptut file name, the grid type (integer or floating point),
the name of the grid output file, and press OK.
d. You may now open the file in ArcMap.
130
131
132
133
Chapter 14 Mapping
Chapter 14
Mapping
GS+ produces 2D and 3D maps of spatial data following interpolation. The data to be
mapped come from kriging or IDW analysis, and are thus contained in interpolation
output files. Maps can be displayed in a variety of ways with a variety of different
contouring schemes, 3-dimensional maps can be rotated on the fly, and both 2dimensional and 3-dimensional maps can be zoomed to more closely view a transition or other map feature. Additionally, sample postings (original data locations) can
be displayed, and estimation standard errors can be mapped for input files that have
been saved in the standard GS+ format.
134
Format specify the format to use to read the input file. Files with a .krg,
.idw, .blk, or .pun extension are assumed to be in GS+ format. The format
specified in this field will override any assumed format:
Chapter 14 Mapping
+
GS Krig format (.krg) in this format a header area defines the interpolation grid, variate names, and other information about the file needed to
initiate mapping later, and the data records include for each X and Y
Coordinate location that is kriged the interpolation or Z-estimate, the
standard deviation of the Z-estimate, and the number of neighbors that
were used to make the estimate. See detailed example in Chapter 11.
Surfer Grid format (.grd) in this format a short header area defines
information needed for mapping, and the data is written as a continuous
stream of Z-estimates beginning from a specific corner of the interpolation grid. The standard deviation of the estimate and the number of
neighbors used for interpolation are NOT included in this format. This
format is compatible with Golden Softwares Surfer mapping program.
Note that this format is not the same as the Surfer XYZ Input file format.
See detailed example in Chapter 11.
ArcView Format (.asc) this is similar to the Surfer format but the
header area is formatted differently and the Z-estimates are written in a
pattern that begins from a different corner of the interpolation grid. The
standard deviation of the estimate and the number of neighbors used for
interpolation are NOT included in this format. Also for this format, the x
and y interpolation intervals must be the same (you can set them to be
the same from the Interpolation Grid dialog window). This format is
compatible with ESRIs Arc-Info Geographic Information System. See
detailed example in Chapter 11.
Map Grid
X direction the range within the file for values in the x (horizontal) direction.
Y direction the range within the file for values in the y (vertical) direction; for
1-dimensional data sets this field is blank.
Z values the range of estimated Z values in the input file.
Z SD values the range of estimation Z standard deviation values in the input file; only when input files are in a GS+ format that contain both Z values
and Z standard deviation values will this box be selectable;
N (N missing) the number of valid records in the file and (in parentheses)
the number of missing values. The default missing value indicator (e.g. 999.) is specified in the Preferences - General dialog; the active missing
value indicator for map files is indicated in the Map Contour Intervals window.
Missing values are mapped as transparent regions. Exclusive polygons are
mapped as missing values, as is any interpolate location for which there is no
kriging solution.
135
Chapter 14 Mapping
Variate to Map
Z values maps the estimated Z values in the file
Sample Posting map only the original sample locations. Requires the
presence of a posting file, which has the same name as the input file but a
.pos extension. This file is created during interpolation; if it is not present you
will not be able to map a sample posting. Coordinate Postings (Quantile
Plots) of the active data are viewed through the Data Summary X,Y Coordinates Tab window, described earlier.
Graph Type
2D displays a flat, 2-dimensional map of the data
3D displays a 3-dimensional map of the data. The height of the map can
be adjusted with the Edit Graph command of the Map window; the perspective can be adjusted with the Rotate command of the Map window.
1-d displays 1-dimensional data (e.g. a geographic transect or a time series) as an x-y graph. This choice is not available for 2-dimensional data.
Surface
Contour lines draws lines between contour intervals
136
Color bands fills the space between contour lines with different colors; colors can be specified from the Define Contour dialog window by pressing Define Contour Levels, below.
Solid pedestal for a 3D map fill in the area beneath the surface with a solid
color.
Chapter 14 Mapping
Contour Levels
Number the number of contour levels to put on the map.
Map Legend
Show legend displays the contour legend next to the map.
Continuous vs. Stepped display legend as continuous scale with values
next to break points (e.g. between color bands) or display legend as stepped
format, with values next to boxed colors.
Ceiling
Contour lines project contour lines above the plot surface (3D maps only).
Color bands project color bands onto the ceiling of the plot (3D maps only).
Floor
Contour lines project contour lines under the plot surface (3D maps only).
Color bands project color bands onto the floor of the plot (3D maps only).
Grid Lines
X axis place a vertical grid line along the back walls of the 3D plot x axis.
Y axis place a vertical grid line along the back wall of the 3D plot y axis.
Z axis place horizontal grid lines along the back walls of 3D plots.
Draw
Create the map in a Map Image window.
137
Chapter 14 Mapping
Contour Intervals
Color buttons change the color of a specified interval by clicking the adjacent color button.
138
Break points change the break point between adjacent contour intervals by
providing a new value in the space provided. Note that the new break point
must be greater than the preceding point and less than the one that follows.
Missing value specify the missing value indicator for the data file. This
value is usually embedded in the input file as part of the header information,
and is automatically extracted by GS+. This value overrides the missing value
indicator specified in the Preferences General dialog or the map input file.
Chapter 14 Mapping
Intervals
Get read contour intervals from a text file; the first value of each record of
line of the file is presumed to be a separate interval.
Color Source
Standard specifies a default color pattern of red-orange-yellow.. violet.
Save save the existing color pattern as the custom color pattern.
Reset reset the custom color pattern to the default Standard pattern.
139
Chapter 14 Mapping
Map Image - 3D
The 3D map image is produced by the Draw command from the Mapping window.
Edit Graph
Change axis scales and other graph formats via the Graph Settings dialog window.
Print Graph
Print the active graph to a file, printer, or other device.
Mouse Action
Off returns the mouse to normal operation.
Rotate turns the cursor into a rotator cuff when the left mouse button is
pushed, allowing the image to be rotated as desired.
Move allows the graph to be moved within the window by clicking the left
mouse button and dragging the cursor.
Scale shrinks the graph image with the left mouse button.
Zoom allows you to zoom in on a graph area by using the left mouse button
to define a rectangular zoom area. Within the zoomed area the location of
the cursor is noted on the Mouse Location panel.
Reset resets the image to the default rotation angle and scale.
Mouse Location
The current cursor location when on the map surface. Units are map units.
140
Chapter 14 Mapping
141
Chapter 14 Mapping
142
Chapter 14 Mapping
Map Image - 2D
The 2D map image is produced when the Graph Type is set to 2D.
143
Chapter 14 Mapping
144
Chapter 14 Mapping
Chapter 15 Bibliography
Chapter 15
Bibliography
The following references may be useful for those seeking further background about
geostatistics and its use in the environmental sciences.
Burrough, P.A. 1981. Fractal dimensions of landscapes and other environmental data. Nature 294:240-242.
Burrough, P.A. 1986. Principles of Geographical Information Systems for
Land Resources Assessment. Oxford University Press, Oxford.
Burgess, T.M. and R. Webster. 1980a. Optimal interpolation and isarithmic
mapping of soil properties. I. The semivariogram and punctual kriging. Journal of Soil Science 31:315-331.
Burgess, T.M. and R. Webster. 1980a. Optimal interpolation and isarithmic
mapping of soil properties. II. Block kriging. Journal of Soil Science 31:333341.
Cressie, N. 1985. Fitting variogram models by weighted least squares.
Mathematical Geology 17: 563-586.
Cressie, N. A. C. 1991. Statistics for Spatial Data. John Wiley, New York,
USA.
David, M. 1977. Geostatistical Ore Reserve Estimation. Elsevier, Scientific
Publishing Co., Amsterdam, The Netherlands.
Deutsch, C.V. and A.G. Journel 1992. GSLIB Geostatistical Software Library.
Oxford University Press, New York.
Goovaerts, P. 1997. Geostatistics for Natural Resources Evaluation. Oxford
University Press, New York.
Griffith, D.A. 1987. Spatial Autocorrelation: A Primer. Association of American Geographers, Washington, D.C. 86 p.
Haan, C.T. 1977. Statistical Methods in Hydrology. Iowa State University
Press, Ames, Iowa.
Isaaks, E.H. and R.M. Srivastava. 1989. An Introduction to Applied Geostatistics. Oxford University Press, NY.
Journel, A.G. and C.J. Huijbregts. 1978. Mining Geostatistics. Academic
Press, New York.
Krige, D.G. 1966. Two dimensional weighted moving average trend surfaces
for ore-evaluation. Journal of the South African Institute of Mining and Metallurgy 66:13-38.
Krige, D.G. 1981. Lognormal-de Wijsian geostatistics for ore evaluation.
South African Institute of Mining and Metallurgy Monograph Series. Geosta146
Chapter 15 Bibliography
tistics I. South Africa Institute of Mining and Metallurgy, Johannesburg, South
Africa.
Mandelbrot, B.B. 1982. The Fractal Geometry of Nature. W.H. Freeman,
London.
Matheron, G. 1971. The theory of regionalized variables and its applications.
Cahiers du Centre de Morphologie Mathematique, Fontainebleau, No. 5.
Robertson, G.P. 1987. Geostatistics in ecology: interpolating with known
variance. Ecology 68:744-748.
Robertson, G. P., and K. L. Gross. 1994. Assessing the heterogeneity of below-ground resources: Quantifying pattern and scale. Pages 237-253 in M.
M. Caldwell and R. W. Pearcy, eds. Plant Exploitation of Environmental Heterogeneity. Academic Press, New York, New York , USA.
Rossi, R. E., D. J. Mulla, A. G. Journel, and E. H. Franz. 1992. Geostatistical
tools for modeling and interpreting ecological spatial dependence. Ecological
Monographs 62: 277-314.
Sokal, R.R. and N.L. Oden. 1978. Spatial autocorrelation in biology. 1. Methodology. 2. Some biological implications and four applications of evolutionary and ecological interest. Biological Journal of the Linnean Society 10:199228.
Trangmar, B.B., R.S. Yost and G. Uehara. 1985. Applications of geostatistics
to spatial studies of soil properties. Pages 45-94 in N.C. Brady, editor. Advances in Agronomy Volume 38. Academic Press, New York.
Vieira, S.R., J.L. Hatfield, D.R. Nielsen, and J.W. Biggar. 1983. Geostatistical
theory and application to variability of some agronomical properties. Hilgardia
51:1-75.
Webster, R. 1985. Quantitative spatial analysis of soil in the field. Pages 170 in B.A. Stewart, editor. Advances in Soil Science Volume 3. SpringerVerlag, New York.
Webster, R. and M.A. Oliver. 1990. Statistical Methods in Soil and Land Resource Survey. Oxford University Press, NY. 316 p.
147
Chapter 16 Glossary
Chapter 16
Glossary
A0
Range parameter in variogram models. The relationship of A0 to range A (effective
range) depends on the model. See specific model definitions for more information.
A1
Range parameter for the major axis in anisotropic variogram models. The relationship of A1 to major range depends on the model. See specific model definitions for
more information.
A2
Range parameter for the minor axis in anisotropic variogram models. The relationship of A2 to major range depends on the model. See specific model definitions for
more information.
Binary Data Record Format
Binary data from many spreadsheet and database programs can be imported directly; if the file can be viewed using the View command then it can be imported.
Eligible files include Excel and Lotus 1-2-3 worksheets, dBase, Paradox, Access,
etc.
A Character as Missing Value Indicator
Specify the character to be used as a missing value indicator in the field that will appear when a character value is specified in the list box. The example below presumes that % has been specified the missing value indicator, and thus the third
value of this record will be read into the worksheet as a missing value:
13.2, 34.5, % , 0.15
Autocorrelation
Autocorrelation is the degree to which a property is related to itself in time or space.
For example, in a spatial domain values for samples taken close to one another are
more likely to be similar than are samples taken farther apart autocorrelation is a
formal measure of this self-similarity.
Backtransformation
When data values are transformed in order to make their distributions more normally
distributed, after analysis of the transformed data the output data are customarily
(but not necessarily) back-transformed to the original data domain for final reporting.
For example, the backtransformation for a ln(z) transform is exp(z-transformed); for
a z2 transform the backtransformation is the square root of the transformed value.
Offset values (if applied) are subtracted from the backtransformed values.
Binary Data Record Format
Binary data from many spreadsheet and database programs can be imported di148
Chapter 16 Glossary
rectly. Eligible files include Excel and Lotus 1-2-3 worksheets, dBase, Paradox, Access, and FoxPro.
Block Kriging
Kriging provides a means of interpolating values for points not physically sampled
using knowledge about the underlying spatial relationships in a data set to do so.
Block kriging provides an estimate for a discrete area around an interpolation point.
Punctual or point kriging provides an estimate for a precise point. In environmental
work block kriging is usually more appropriate. The block is defined as the rectangular area around a point that is not included in an adjacent block.
Brackets as Column Title Separators
Column or variate names are separated by brackets, e.g.:
<m east> <m north> <Pb (ug/g)> <pH>
C or Structural Variance
C of the variogram model represents spatially-structured variance. Compare to C0
or nugget variance, which is the portion of the variance not spatially structured.
C0 or Nugget Variance
C0 or Nugget Variance is the y-intercept of the variogram model. Nugget variance
represents variation not spatially dependent over the range examined.
Characters as Column Title Separators
Column or variate names are separated by a character specified in the field that appears when this option is selected. For example if "#" were specified as the separated:
m east # m north # Pb (ug/g) # pH
Character Separated Values
Values within data records are separated by a specific character, defined in the field
below the list box; e.g. if the character # is the delimiter:
13.2 # 34.5 # 35.6 # 0.15
Cokriging
An interpolation technique that allows one to use a more intensely sampled covariate in the estimation of values for a related variate. If the primary variate is difficult
or expensive to measure and it is correlated with a more available covariate, cokriging can greatly improve interpolation estimates. See Cokriging Analysis for further
information.
Commas as Column Title Separators
Column or variate names are separated by commas e.g.:
m east, m north, Pb (ug/g), pH
149
Chapter 16 Glossary
Comma Separated Values
Values within data records separated by commas:
13.2, 34.5, 35.6, 0.15
Conditional Simulation
An advanced interpolation technique whereby Z estimates are based on a form of
stochastic simulation in which measured data values are honored at their locations.
Covariate
The Covariate is a second Z-variate that covaries with the primary Z-variate. It is
used in cokriging.
Data Builds
Data in the GS+ Worksheet is read into the actual arrays that get analyzed during
Data Builds. Data builds occur whenever a new data set is read into GS+ (via the
Import command) or whenever a field assignment or data value is changed and the
Rebuild Button (Data Worksheet Window) is pressed. During data builds missing
values are ignored and filters and transformations are applied. Most Data Builds are
automatic; youll need to force a Data Build whenever the Rebuild Button is red (the
Data Summary Window will also be empty).
Data Fields
A data field is an individual item on the data record. In the GS+ Worksheet, records
appear as rows and fields appear as columns. Prior to analysis you must identify an
X-coordinate field, a Y-coordinate field (for 2-dimensional analyses), and the field
(column) that corresponds to the Z-variate. For cokriging you will also need to assign a field to hold the covariate (Z2) values. Field assignments are performed by
clicking on the top row of the Worksheet to bring up a Field Assignment dialog window.
Data Records
Data Records are records in text files that contain data values. Data records follow
header records that contain descriptive information about the file. Data records are
imported into the GS+ Worksheet; header records are imported into the Data Description portion of the Worksheet Window.
Decimal Point Missing Value Indicator
In the record below the third value appears as a decimal point and will be read into
the worksheet as a missing value:
13.2, 34.5, . , 0.15
Discretization Grid
The discretization grid describes the size of the grid placed around the interpolation
point when block kriging. The estimate for the interpolated block is based on the
mean value of estimates for each of the discretization grid points. A single discretization point describes punctual kriging. A 2x2 grid averages 4 grid points. Larger
150
Chapter 16 Glossary
discretization grids take longer to interpolate.
Effective Range
See Range.
Geo-referenced Data
Geo-referenced data are data that have been collected from a specific location, i.e.
any data for which there is a spatial x,y coordinate associated with each data value.
Header Records
Records at the top of a data file that precede the data records and that contain information about the data records.
Interpolate
An interpolate is an estimated value for a location in a domain based on values of
nearby measured locations. In GS+ interpolation is performed by kriging or by inverse distance weighting methods. Maps are based on interpolations at grid points
across the domain.
Kriging
An interpolation method based on regionalized variable theory that provides an optimal interpolation estimate for a given coordinate location. GS+ performs ordinary
kriging.
Lag Class Distance Interval
Each point on a variogram corresponds to the average semivariance for all values in
the spatial domain that are separated by a given distance interval h. This interval
(also called a lag distance class) is a range whose width is specified by the user in
the Semivariance Analysis window. A corresponding definition is also relevant to
Morans I, Fractal, Cross-Semivariance, and Madogram Analyses.
Missing Value
A value not present in a record. A data record can contain fields for coordinates (e.g.
X, Y) and for multiple Z variates. If any one of the fields used for an analysis is missing, the record is ignored during analyses, i.e. when the active data set is built. In the
GS+ Worksheet a value is considered missing if the cell is blank (a permanent missing value) or if the cell contents appear as red italicized text (a temporarily missing
value). Cells can be assigned temporarily missing by right-clicking on them; a second right-click returns them to non-missing. In Input and Output files missing values
are indicated by characters or values (e.g. -99) defined in the Preferences window.
Model Parameters
See Variogram Model Parameters.
No Header Records
The first record of the file contains numeric records that should be treated as data.
151
Chapter 16 Glossary
Number of Header Records are Fixed
An input field beneath the list box appears when this choice is made, and this field
specifies the number of records at the top of the file to consider descriptive text
when reading the file; the first data record is the next record.
Number of Header Records Varies
The number of header records will be assumed to be all records prior to the first allnumeric record. If all records contain an alphanumeric Sample ID field then all records will be assumed to be header records and no data records will be read. In the
example below there are 2 header records:
File GMD Test Site
<m east> <m north> <Pb> <pH>
34.5
45.6
0.231 5.8
36.7
46.5
0.241 5.9.
Numeric Value as a Missing Value Indicator
The indicator value that will appear when a numeric value is specified in the list box.
The example below presumes that -99 has been specified the missing value indicator and thus the third value of this record will be read into the worksheet as a missing value:
13.2, 34.5, -99.0 , 0.15
Parameter File
Files that store analysis parameters and data for a particular data set. GS+ parameter files typically have a .par extension.
Polygons
Polygons are Irregular shapes that can be interpolated or excluded from interpolation during kriging. Polygons are defined by at least 3 vertices (coordinate pairs) that
define the polygon outline. Polygons can be as complex as continent outlines or as
simple as building outlines.
Posting
A Coordinate Posting is a map of coordinate locations. The location of each data
point in the active data set can be marked by a symbol corresponding to its relative
value.
Punctual Kriging
A type of kriging that provides an estimate for a precise point. Block kriging provides
an estimate for a discrete area around a specific interpolation point.
Quotes as Column Title Separators
Column or variate names are separated by quotes. E.G.:
m east m north Pb (ug/g) pH
152
Chapter 16 Glossary
Range
The range (also called effective range) is the separation distance over which sample
locations are autocorrelated, i.e. over which there is spatial dependence among
sample locations. In variogram models the range is calculated from the range parameter A0 (isotropic models) or A1 and A2 (anisotropic models). See model definitions for further information.
Same as for Data Records
The delimiter for column or variate titles is the same as specified for data records.
E.G. if the data records in a file with four data columns are formatted with "#" as a
delimiter:
m east # m north # Pb (ug/g) # pH
Sample ID
The Sample ID is text or a numeric value that identifies a particular data record. It is
optional. To specify a Sample ID column click on the top row of the data worksheet.
Sill or C0+C
The Sill of the variogram model (C0 + C) represents spatially-independent variance.
Data locations separated by a distance beyond which semivariance does not
change i.e. after the model asymptote or sill are spatially independent of one
another. Theoretically the sill is equivalent to sample variance.
Space Separated (free format) Values
Values within data records are separated by spaces, e.g.:
13.2 34.5 35.6 0.15
Spaces as Column Title Separators
Column or variate names are separated by spaces; note that this option limits column titles to single words. e.g.:
east
north Pb pH
Chapter 16 Glossary
better normalize its distribution. The success of a transformation can be judged by
observing its frequency distribution before and after transformation. Values may be
backtransformed prior to reporting results.
Uniform Lag Interval Classes
The Lag Class Distance Interval defines how pairs of points will be grouped into lag
classes. Each point in a variogram or autocorrelogram represents the average
semivariance or Morans I for a single lag class, which is a group of pairs separated
by a certain Lag Class Distance Interval, sometimes called a step size. This interval
can either be calculated by GS+, in which case it will be uniformly distributed across
the active lag distance, or it can be manually set by the user
Variogram Model Parameters
Model parameters for isotropic variograms include terms for nugget variance (C0),
the combined nugget plus structural variance (C0 + C, also called the model sill), and
the range parameter A0 or in the case of anisotropic variograms, A (a function of A1
and A2). The range parameter may be different from the effective range, which
should be used to compare ranges among models.
What is a valid record
A valid record is any record that contains non-missing values for the coordinate locations (both an X-Coordinate value and a Y-Coordinate value for 2-dimensional domains) AND a non-missing value for the Z-Variate. Missing values appear as blank
cells OR as red colored text; use the right mouse button to turn cells with values into
cells with missing values and vice versa.
X,Y-Coordinates
X,Y coordinates describe a physical location at which a Z variate is measured. The
coordinates are presumed to be in Cartesian space, i.e. with a 0,0 origin that increases for x in an easterly direction and decrease in a westerly direction, and for y
increase in a northerly direction and decrease in a southerly (note that values can be
less than or greater than 0). Thus a value of 10,20 means 10 units east of the origin
and 20 units north.
In GS+, if a Y-Coordinate Column is not specified the data are assumed to be 1dimensional, i.e. collected along a transect or through time. To specify (or remove)
a Y-Coordinate column, click on the top row of the worksheet.
Note also that latitude and longitude are not part of a Cartesian coordinate system
because coordinate distances (measured in degrees) represent different physical
distances (measured in meters) in different places on the globe. For a rough conversion from latitude and longitude to meters, consider that one second of latitude
equals 30.92 meters on the ground; for longitude, calculate the cosine of the latitude
then multiply by 30.92.
154
Chapter 16 Glossary
Z-Variate Data (Z)
The Z-Variate is the variate being analyzed and mapped, e.g. elevation for a topographic map, pH for a map of soil acidity, chlorophyll content for a map of lake productivity, population density for a map of rural population growth, etc. A Worksheet
can contain many variates but only one can be analyzed at a time, thus only one
column can contain the Z-Variate data. A Z-Variate Column must be defined prior to
spatial analysis. To specify a column, click on the top row of the worksheet.
Z2 Variate Data (Z2 or Covariate)
The Z2 Variate or Covariate is a second Z-variate that covaries with the primary Zvariate. It is used in cokriging.
155
Chapter 16 Glossary
Index
1D transects, 145
2D Maps, 21, 134, 143
3D
Map proportion (height to width ratio),
21
Maps, 21, 134, 140
Standard deviations map, 141
A0, 68, 69, 71, 72, 73, 74, 153
A1, A2, 75, 76, 77, 78, 79, 154
Active lag distance, 60
Adjacency matrix, 93
Anisotropy
Axis orientation, 62, 80, 82
Semivariance surface, 16, 62, 63, 80,
82
Variogram map, 80, 82
Variogram models, 75
Appending data, 42
ArcView
Input file format, 35
Map input file format, 135
Output file format, 108, 130
Assign field, 28
Autocorrelation
Autocorrelation window, 60
Correlograms, 89
Covariance analysis, 90
definition, 148
Drift, 88
Fractal analysis, 94
General relative variograms, 91
Madograms, 86
Moran's I analysis, 93
Pairwise relative variograms, 92
Rodograms, 87
Semivariance, 59
156
Standardized variograms, 85
Autocorrelation menu, 9
Axis formats, 20
Azimuth, 81
Backtransformations, 44, 118, 148
Base input file, 25
Bibliography, 146
Binary data record format, 148
Block kriging, 149
Block size, 112
C0, 68, 149
Cartesian (x,y) coordinate system, 5, 154
Ceiling, 22, 137
Character Separated Values, 149
Cokriging, 29, 60, 123, 125
Column, 26
Field assignments, 25, 28
Titles, 36, 149, 152, 153
Widths, 27
Comma separated values, 41
Commas as column title separators, 41
Conditional simulation, 105, 119
Cokriging, 120
External drift, 120
Number of simulations, 122
Residuals, 120
Secondray data, 120, 121
Contour levels, 22, 137, 138
Convert GS+ DOS File, 24
Coordinate postings, 51
1d scattergrams, 54
2d scattergrams, 52
Definition, 152
Map data, 136, 144
Posting intervals, 55
Coordinates
Assigning, 28
Cartesian, 154
Index
Latitude, Longitude, 154
Correlograms, 89
Covariance analysis, 90
Covariate
Assignment, 28
Definition, 150, 155
Icons, 6
Regression, 56
Values warning, 29
Cross-semivariance analysis, 9, 83, 124
Cross-validation
Analysis, 108, 117
Values, 118
Cross-variate regression, 56, 57, 58
Cross-variograms, 9, 83, 124
Cumulative frequency distribution
Graph, 47
Values, 48
Cumulative probability distribution, 44
Data
Append dialog, 42
Bounding, 30
Builds, 150
Fields, 28, 150
File import, 33, 41
Filter, 25, 30
Input file format, 34, 36
Menu, 8
Records, 26, 36, 37, 38, 39, 41, 112,
150
Summary window, 51, 56
Regression analysis tab, 56
X,Y coordinates tab, 51
Z tab, 43
Title, 26
Worksheet, 25
Database input files, 33, 40
Decimal places, 13, 14, 27
Define posting intervals, 55
Descriptive statistics, 44
Discretization grid, 123, 150
Distance interval, 110
DOS GS files, 24
Drift, 88
Duplicate values, 31, 44
Edit menu, 7
Effective range, 68, 69, 76, 154
Email, 12
Excel files, 40
Exclusive polygons, 107, 114, 116
Exponential anisotropic model, 78
Exponential isotropic model, 72
Export data, 8
Field (column)
Assignments, 25, 26, 28
Default assignments, 14
Delimiter, 41
Names, 26
File
Extensions, 23, 24, 64, 107, 129, 130,
132, 135, 152
Import command, 25
Import dialog, 33
Import properties, 40, 41
Menu, 6
View window, 42
Filtering data, 25, 30
Floor, 22, 137
Formulas in worksheets, 27
Fractal analysis, 94
Frequency distribution
graph, 44
Graph, 45
Values, 46
Gaussian anisotropic model, 79
Gaussian isotropic model, 74
General relative variograms, 91
GeoEas input file format, 34, 37
Geometric anisotropy, 62
Geo-referenced data, 151
Graph settings
Axis scaling, 19
157
Index
Axis titles & labels, 20
Colors, 17
Contour details, 21
Footnote, 18
General, 17
Legend, 18
Title, 18
Grid lines, 22, 137
GS+ Input file format, 34
GS+ Interpolation Output file format, 129
Header records, 36, 37, 38, 39, 41, 112,
151, 152
Help menu, 11
Histogram bars, 20, 45
h-Scattergrams, 95, 100, 104
IDW, 127
Interpolation defaults, 16
Output files, 129, 130, 132
Smoothing factor, 16, 127, 128
Weighting power, 16, 127, 128
Importing
Default file type, 14
Default filename, 14
File import dialog, 33
Import file command, 25
Spreadsheets, 40
Text files, 34
Inclusive polygons, 107, 114, 116
Input file format, 34
ArcView XYZ format, 38
GeoEas format, 37
GS+ format, 36
Spreadsheet and database formats, 40
Surfer XYZ format, 39
Insert rows or columns, 27
Installation, 2
Interpolation, 105
Cokriging, 123, 125
Conditional simulation, 105, 119
Interpolate locations, 112
Interpolate menu, 10
158
Index
Index
Coefficient, 117
Regression analysis, 56, 57
Regression coefficient, 70, 76
Regression values, 58
Residual sums of squares, 69, 76
Rodograms, 87
Rotate graph, 140, 142
Sample ID, 28, 153
Sample postings. See Coordinate postings
Sample variance, 63
Scaling data, 43
Scattergrams. See Coordinate postings
Screen layout, 5
Search neighborhood, 108
Secondary variate (Z2), 125
Semivariance
Overview, 59
Semivariance analysis window, 60
Semivariance offset, 59
Values, 67
Variograms window, 66
Sill, 68, 69, 75, 76, 153
Single-user license agreement, 3
Site licensing, 3
Smoothing factor, 16, 127, 128
Space separated (free format) values, 41
Spaces as column title separators, 41
Spherical anisotropic model, 77
Spherical isotropic model, 71
Spreadsheet input files, 33, 40, 41
Standard deviations map, 141
Standardized variogram, 85
Structural variance, 68, 149, 153
Summary statistics, 43
Surface, 136
Surfer
Grid output file format, 107
Map input file format, 39, 132, 135
XYZ input file format, 35
Symbols, 20, 55
Tab separated values, 41
160
Index
Worksheet
Formulas, 27
Limits, 27
Window, 25
X axis range, 19
X-coordinate
Column assignment, 28
Defintion, 154
Y axis range, 19
Y-coordinate
Column assignment, 28
Defintion, 154
Z (primary variate)
Column assignment, 28
Definition, 155
Z standard deviations, 136
Z axis range, 20
Z Estimate Boundaries, 109
Z2 (covariate)
Cokriging, 123
Column assignment, 28
Definition, 155
Values warning, 29
Zoom, 140
161