OGI352
OGI352
Anna University
Choice Based Credit System (CBCS)
Semester - V (CSE / IT / ECE / Mechanical) Open Elective-I
Geographic Information
System
T. Graceshalini
M.E., M.B.A., Ph.D. (Pursuing in VANETS)
Assistant Professor,
Velammal College of Engineering and Technology,
Madurai
S. Kavitha
M.E. (CSE), B.Tech. (IT)
Assistant Professor,
Velammal College of Engineering and Technology,
Madurai
Price : ` 190/-
I S B N 9 7 8 - 9 3 - 8 9 4 2 0 - 11 - 1
® TM
TECHNICAL
PUBLICATIONS
SINCE 1993 An Up-Thrust for Knowledge 9 789389 420111
(i)
Geographic Information
System
Subject Code : OCE552
Published by :
® TM
TECHNICAL Amit Residency, Office No.1, 412, Shaniwar Peth, Pune - 411030, M.S. INDIA
P h . : + 9 1 - 0 2 0 - 2 4 4 9 5 4 9 6 / 9 7 , Te l e f a x : + 9 1 - 0 2 0 - 2 4 4 9 5 4 9 7
PUBLICATIONS
SINCE 1993 An Up-Thrust for Knowledge Email : sales@technicalpublications.org Website : www.technicalpublications.org
Printer :
Yogiraj Printers & Binders
Sr.No. 10\1A,
Ghule Industrial Estate, Nanded Village Road,
Tal-Haveli, Dist-Pune - 411041.
Price : ` 190/-
I S B N 9 7 8 - 9 3 - 8 9 4 2 0 - 11 - 1
9 789389 420111 AU 17
TM
Technical Publications - An up thrust for knowledge
Geographic Information System 1-3 Fundamentals of GIS
TM
Technical Publications - An up thrust for knowledge
Geographic Information System 1-4 Fundamentals of GIS
Longitude : Imaginary lines that run vertically around the globe. Also known as meridians,
longitudes are measured from 180º east to 180º west. Longitudes meet at the poles and are widest
apart at the equator
Prime meridian : Zero degree longitude which divides the earth into two halves-Eastern and
Western hemisphere. As it runs through the Royal Greenwich Observatory in Greenwich, England
it is also known as Greenwich meridian
Equator (0º) is the reference for the measurement of latitude. Latitude is measured north or
south of the equator. For measurement of longitude, prime meridian (0º) is used as a reference.
Longitude is measured east or west of prime meridian. The grid of latitude and longitude over the
globe is known as graticule. The intersection point of the equator and the prime meridian is the
origin (0, 0) of the graticule.
Coordinate measurement
The geographic coordinates are measured in angles. The angle measurement can be understood
as per following :
TM
Technical Publications - An up thrust for knowledge
Geographic Information System 1-5 Fundamentals of GIS
Date
The International Date Line is the imaginary line on the Earth that separates two consecutive
calendar days. Generally, it is said to be lying exactly opposite to the prime meridian having a
measurement of 180º meridian but it is not so. It zigs and zags the 180º meridian following the
political jurisdiction of the states but for sake of simplicity it is taken as 180º meridian. Starting at
midnight and going east to the International Date Line, the date is one day ahead of the date on the
rest of the Earth.
TM
Technical Publications - An up thrust for knowledge
Geographic Information System 1-6 Fundamentals of GIS
In such a coordinate system the location of a point on the grid is identified by (x, y) coordinate
pair and the origin lies at the centre of grid. The x coordinate determines the horizontal position
and y coordinate determines the vertical position of the point.
TM
Technical Publications - An up thrust for knowledge
Geographic Information System 1-7 Fundamentals of GIS
“GIS is an integrated system of computer hardware, software, and trained personnel linking
topographic, demographic, utility, facility, image and other resource data that is geographically
referenced.” NASA.
TM
Technical Publications - An up thrust for knowledge
Geographic Information System 1-8 Fundamentals of GIS
The first known use of the term "Geographic Information System" was by Dr. Roger
Tomlinson in the year 1968 in his paper "A Geographic Information System for Regional
Planning“. Tomlinson is also acknowledged as the "father of GIS”.
1.6.1 Hardware
It consists of the equipments and support devices that are required to capture, store process and
visualize the geographic information. These include computer with hard disk, digitizers, scanners,
printers and plotters etc.
1.6.2 Software
Software is at the heart of a GIS system. The GIS software must have the basic capabilities of
data input, storage, transformation, analysis and providing desired outputs. The interfaces could be
different for different software’s.
Key software components are
Tools for the input and manipulation of geographic information
A database management system (DBMS)
Tools that support geographic query, analysis, and visualization
A graphical user interface (GUI) for easy access to tools
The GIS software’s being used today belong to either of the category –proprietary or open
source. ArcGIS by ESRI is the widely used proprietary GIS software. Others in the same category
are MapInfo, Microstation, Geomedia etc. The development of open source GIS has provided us
with freely available desktop GIS such as Quantum, uDIG, GRASS, MapWindow GIS etc., GIS
softwares.
1.6.3 Data
The data is captured or collected from various sources (such as maps, field observations,
photography, satellite imagery etc) and is processed for analysis and presentation.
1.6.4 Methods/Procedures
These include the methods or ways by which data has to be input in the system, retrieved,
processed, transformed and presented.
TM
Technical Publications - An up thrust for knowledge
Geographic Information System 1-9 Fundamentals of GIS
1.6.5 People
This component of GIS includes all those individuals (such as programmer, database manager,
GIS researcher etc.) who are making the GIS work, and also the individuals who are at the user end
using the GIS services, applications and tools
TM
Technical Publications - An up thrust for knowledge
Geographic Information System 1 - 10 Fundamentals of GIS
maps, field observations, and sensors into a compatible digital form. A wide range of computer
tools is available for this purpose, including the digitizer, lists of data in text files, scanners and the
devices necessary for recording data already written on magnetic media such as tapes, drums and
disks
Various sources for data input may be :
text files
existing maps
aerial photographs
satellite imagery
airborne scanners
field measurements
TM
Technical Publications - An up thrust for knowledge
Geographic Information System 1 - 11 Fundamentals of GIS
It is important to understand that the GIS is not a new invention. In fact, geographic
information processing has a rich history in a variety of disciplines. In particular, natural resource
specialists and environmental scientists have been actively processing geographic data and
promoting their techniques since the 1960's.
1.8.2 Business
Approximately 80 percent of all business data are related to location. Businesses manage a
world of information about sales, customers, inventory, demographic profiles etc. Demographic
analysis is the basis for many other business functions: customer service, site analysis, and
marketing. Understanding your customers and their socioeconomic and purchasing behavior is
essential for making good business decisions. A GIS with relevant data such as number of
consumers, brands and sites they go for shopping can give any business unit a fair idea whether
their unit if set up is going to work at a particular location the way they want it to run.
TM
Technical Publications - An up thrust for knowledge
Geographic Information System 1 - 13 Fundamentals of GIS
1.8.3 Logistics
Logistics is a field that takes care of transporting goods from one place to another and finally
delivering them to their destinations. It is necessary for the shipping companies to know where
their warehouses should be located, which routes should the transport follow that ensures minimum
time and expenditures to deliver the parcels to their destinations. All such logistics decisions need
GIS support.
1.8.5 Environment
GIS is being increasingly involved in mapping the habitat loss, urban sprawl, land-use change
etc. Mapping such phenomena need historical landuse data, anthropogenic effects which greatly
affect these phenomena are also brought into GIS domain. GIS models are then run to make
predictions for the future.
TM
Technical Publications - An up thrust for knowledge
Geographic Information System 1 - 14 Fundamentals of GIS
TM
Technical Publications - An up thrust for knowledge
Geographic Information System 1 - 15 Fundamentals of GIS
Numeric data
Numeric data is statistical data which includes a geographical component or field that can be
joined with vector files so the data can be queried and displayed as a layer on a map in a GIS.
The most common type of numeric data is demographic data from the US Census.
Vector data
Vector data is a data that has a spatial component, or X,Y coordinates assigned to it. Vector
files can contain sets of points, lines, or polygons that are referenced in a geographic space.
There are three types of features :
Point (vertex, node) is a 0-dimensional object and has the property of location (x,y).
Line (edge, link, chain, arc) is a one-dimensional object that has the property of length. An
arc starts with a node, has zero or more vertices, and ends with a node.
Polygon is a two-dimensional object with properties of area and perimeter.
TM
Technical Publications - An up thrust for knowledge
Geographic Information System 1 - 16 Fundamentals of GIS
Raster data
Raster data is a data in a .JPG, .TIF, .GIF or similar format.
Items scanned using a flatbed scanner like the map given below is examples of raster files.
Images taken with a digital camera produce these same types of files
These characteristics can be quantitative and/or qualitative in nature. Attribute data is often
referred to as tabular (Numeric) data. Non-spatial data are generally one-dimensional and
independent. It’s a separate data model used to store and maintain attribute data for GIS
software. These data models may exist internally within the GIS software, or may be reflected in
external commercial Database Management Software (DBMS).
Variety of different data models exist for the storage and management of attribute data.
The most common are :
Tabular Model
Hierarchical Model
Network Model
Relational Model
Object Oriented Model
Most early GIS software packages stored their attribute data in Tabular model. The next three
models are those most commonly implemented in Database Management Systems (DBMS).
TM
Technical Publications - An up thrust for knowledge
Geographic Information System 1 - 18 Fundamentals of GIS
related record. Each pointer establishes a parent child relationship where a parent can have more
than one child but a child can only have one parent. There is no connection between the elements
at the same level. To locate a particular record, you have to start at the top of the tree with a parent
record and trace down the tree to the child.
The figure above describes the electronic gadgets in day today use. We can see that flash is a
child of mp3 players, which is a child of portable electronics, which is a child of electronics. The
topmost element electronics has no parent. Tube, LCD, plasma, CD players and 2 way radios are
leaf nodes (don’t have any children)
TM
Technical Publications - An up thrust for knowledge
Geographic Information System 1 - 19 Fundamentals of GIS
Measurement Scales
The scale determines the amount of information contained in the data.
The scale indicates the data summarization and statistical analyses that are most appropriate.
The attributes shown in a thematic map can be recorded by four different scales.
Numerical values may be defined with respect to nominal, ordinal, interval or ratio scales of
measurement.
It is important to recognize the scales of measurement used in GIS data as this determines
the kinds of mathematical operations that can be performed on the data.
TM
Technical Publications - An up thrust for knowledge
Geographic Information System 1 - 21 Fundamentals of GIS
o Nominal scales - Qualitative, not quantitative distinction (no absolute zero, not equal
intervals, not magnitude)
o Ordinal scales - Ranking individuals (magnitude, but not equal intervals or absolute
zero)
o Interval scales - Scales that have magnitude and equal intervals but not absolute zero
o Ratio scales - Have magnitude, equal intervals, and absolute zero (so can compute
ratios)
Nominal Scale
Nominal Scales - There must be distinct classes but these classes have no quantitative
properties. Therefore, no comparison can be made in terms of one category being higher than the
other.
For example - There are two classes for the variable gender - males and females.
There are no quantitative properties for this variable or these classes and, therefore, gender is a
nominal variable.
Other Examples :
Country of origin
Biological sex (male or female)
Animal or non-animal
Married vs. Single
Sometimes numbers are used to designate category membership
Ordinal Scale
Ordinal Scales - There are distinct classes but these classes have a natural ordering or ranking.
The differences can be ordered on the basis of magnitude.
For example - Final position of horses in a race is an ordinal variable. The horses finish first,
second, third, fourth, and so on.
TM
Technical Publications - An up thrust for knowledge
Geographic Information System 1 - 22 Fundamentals of GIS
The difference between first and second is not necessarily equivalent to the difference between
second and third, or between third and fourth.
Does not assume that the intervals between numbers are equal
Fig. 1.11.1 Example : finishing place in a race (first place, second place)
Interval Scale
The data have the properties of ordinal data, and the interval between observations is expressed
in terms of a fixed unit of measure.
Designates an equal-interval ordering - The distance between, for example, a1 and a2 is the
same as the distance between a4 and a5.
Example - Celsius temperature is an interval variable. It is meaningful to say that 25 degrees
Celsius is 3 degrees hotter than 22 degrees Celsius, and that 17 degrees Celsius is the same amount
hotter (3 degrees) than 14 degrees Celsius. Notice, however, that 0 degrees Celsius does not have a
natural meaning. That is, 0 degrees Celsius does not mean the absence of heat!
Ratio Scale
TM
Technical Publications - An up thrust for knowledge
Geographic Information System 1 - 23 Fundamentals of GIS
Ratio Scales - captures the properties of the other types of scales, but also contains a true zero,
which represents the absence of the quality being measured. Has an absolute zero that is
meaningful. Can construct a meaningful ratio (fraction), for example, number of clients in past six
months.
It is meaningful to say that “...we had twice as many clients in this period as we did in the
previous six months.
TM
Technical Publications - An up thrust for knowledge
Geographic Information System 1 - 24 Fundamentals of GIS
Emergency evacuation
Environment
Q.5 Differentiate Spatial and Attribute data.
Ans. :
Sr. Spatial data Attribute data
No.
3. Spatial data are generally multi- Non-spatial data are generally one-
dimensional and auto correlated dimensional and independent.
TM
Technical Publications - An up thrust for knowledge
Geographic Information System 1 - 26 Fundamentals of GIS
TM
Technical Publications - An up thrust for knowledge
Geographic Information System 1 - 27 Fundamentals of GIS
existing maps
aerial photographs
satellite imagery
airborne scanners
field measurements
other GIS databases
Q.20 Differentiate Latitude and longitude.
Ans. : Latitude : Imaginary lines that run horizontally around the globe and are measured from
90º north to 90º south. Also known as parallels, latitudes are equidistant from each other.
Longitude : Imaginary lines that run vertically around the globe. Also known as meridians,
longitudes are measured from 180º east to 180º west. Longitudes meet at the poles and are
widest apart at the equator
TM
Technical Publications - An up thrust for knowledge
Geographic Information System 1 - 28 Fundamentals of GIS
Notes
TM
Technical Publications - An up thrust for knowledge
Geographic Information System 2-2 Spatial Data Models
Database in GIS:
A database is a collection of related information that permits the entry, storage, input, output
and organization of data. A database management system (DBMS) serves as an interface between
users and their database.
A spatial database includes location. It has geometry as points, lines and polygons.
GIS combines spatial data from many sources with many different people. Databases connect
users to the GIS database.
For example, a city might have the waste water division, land records, transportation and fire
departments connected and using datasets from common spatial databases.
The database structure is the collection of record type and field type definitions that comprise
your database:
Record Types: These define the type of entities or research objects you wish to capture
(e.g. Person).
Fields: These are the properties or attributes that describe your record types
(e.g. Gender, Age, Height etc.).
Collectively, these define the information or data that can be stored in any record of that type.
TM
Technical Publications - An up thrust for knowledge
Geographic Information System 2-3 Spatial Data Models
tuple and represents a basic fact. No two rows of the same table may have identical values in all
columns.
There are two crucial data integrity constraints viz. primary key and foreign key. A primary key
is an attribute whose value is unique across all tuples (rows) in a relation (table). The primary key
of one table appearing as an attribute of another table is known as a foreign key in that table.
Column: The column represents the set of values for a specific attribute.
Relation instance – Relation instance is a finite set of tuples in the RDBMS system.
Relation instances never have duplicate tuples.
Relation key - Every row has one, two or multiple attributes, which is called relation key.
Attribute domain – Every attribute has some pre-defined value and scope which is known
as attribute domain.
TM
Technical Publications - An up thrust for knowledge
Geographic Information System 2-5 Spatial Data Models
TM
Technical Publications - An up thrust for knowledge
Geographic Information System 2-6 Spatial Data Models
For an example consider an Employee database to understand this model better. In this database
we have different types of employees – Engineer, Accountant, Manager, Clerk. But all these
employees belong to Person group. Person can have different attributes like name, address, age and
phone.
Advantages
Because of its inheritance property, we can re-use the attributes and functionalities. It
reduces the cost of maintaining the same data multiple times. Also, these informations are
encapsulated and, there is no fear being misused by other objects. If we need any new
feature we can easily add new class inherited from parent class and adds new features.
Hence it reduces the overhead and maintenance costs.
Because of the above feature, it becomes more flexible in the case of any changes.
Codes are re-used because of inheritance.
Since each class binds its attributes and its functionality, it is same as representing the real
world object. We can see each object as a real entity. Hence it is more understandable.
Disadvantages
It is not widely developed and complete to use it in the database systems. Hence it is not
TM
Technical Publications - An up thrust for knowledge
Geographic Information System 2-7 Spatial Data Models
Mapping cardinalities
1. one to one 2. one to many
3. many to one 4. many to many
Attributes
Entities are represented by means of their properties, called attributes. All attributes have
values. For example, a student entity may have name, class, and age as attributes.
TM
Technical Publications - An up thrust for knowledge
Geographic Information System 2-8 Spatial Data Models
There exists a domain or range of values that can be assigned to attributes. For example, a
student's name cannot be a numeric value. It has to be alphabetic. A student's age cannot be
negative, etc.
Types of Attributes
Simple attribute − Simple attributes are atomic values, which cannot be divided further. For
example, a student's phone number is an atomic value of 10 digits.
Composite attribute − Composite attributes are made of more than one simple attribute.
For example, a student's complete name may have first_name and last_name.
Derived attribute − Derived attributes are the attributes that do not exist in the physical
database, but their values are derived from other attributes present in the database. For
example, average_salary in a department should not be saved directly in the database,
instead it can be derived. For another example, age can be derived from data_of_birth.
Single-value attribute − Single-value attributes contain single value. For example −
Social_Security_Number.
Multi-value attribute − Multi-value attributes may contain more than one values. For
example, a person can have more than one phone number, email_address, etc.
In the below diagram, Entities or real world objects are represented in a rectangular box.
Their attributes are represented in ovals. Primary keys of entities are underlined. All the
entities are mapped using diamonds. This is one of the methods of representing ER model.
There are many different forms of representation.
TM
Technical Publications - An up thrust for knowledge
Geographic Information System 2-9 Spatial Data Models
Basically, ER model is a graphical representation of real world objects with their attributes and
relationship. It makes the system easily understandable. This model is considered as a top down
approach of designing a requirement.
Advantages
It makes the requirement simple and easily understandable by representing simple diagrams.
One can covert ER diagrams into record based data model easily.
Easy to understand ER diagrams
Disadvantages
No standard notations are available for ER diagram. There is great flexibility in the notation.
It’s all depends upon the designer, how he draws it.
It is meant for high level designs. We cannot simplify for low level design like coding.
TM
Technical Publications - An up thrust for knowledge
Geographic Information System 2 - 10 Spatial Data Models
Since, the real world is made up of complex spatial objects and phenomena, it is practically
impossible for a single data model to represent everything that is present.
This means that different users may have different data models when they attempt to collect
data in the same location.
TM
Technical Publications - An up thrust for knowledge
Geographic Information System 2 - 11 Spatial Data Models
Line/Arc : Ordered sets of (x, y) coordinate pairs arranged to form a linear feature. The curves
in a linear feature are generated by increasing the density of points/vertices.
The roads, rails and telephone cables are the examples of the spatial features described by lines.
Polygon : The set of (x, y) coordinate pairs enclosing a homogeneous area
The land parcels, agricultural farms and water bodies are the examples of the spatial features
described by polygons.
TM
Technical Publications - An up thrust for knowledge
Geographic Information System 2 - 12 Spatial Data Models
scanned data
Simulation is easy because each unit has Network linkages are difficult to establish
the same size and shape
a) Entity model :
TM
Technical Publications - An up thrust for knowledge
Geographic Information System 2 - 13 Spatial Data Models
Let us assume that the raster data belongs to an area where land is surrounded by water.
Here a particular entity (land) is shown in SHADED color and the area where land is not
present is shown by white.
b) Pixel values :
The pixel value for the full image is shown.
Cells having a part of the land are encoded as 1 and others where land is not present are
encoded as 0.
The huge size of the data is a major problem with raster data.
An image consisting of twenty different land-use classes takes the same storage space as a
similar raster map showing the location of a single forest.
TM
Technical Publications - An up thrust for knowledge
Geographic Information System 2 - 14 Spatial Data Models
To address this problem many data compaction (Compression) methods have been
developed.
Compression techniques
Run length encoding
Block encoding
Chain encoding
Quadtree
TM
Technical Publications - An up thrust for knowledge
Geographic Information System 2 - 15 Spatial Data Models
In the example , the first row is blank and is stored as (0,8). This means there are 8 cells and
they are all zeros. In the second row, there are 4 consecutive zeros so it gets a value of (0,4).
After this, we have three consecutive cells with the value 1 so it gets a value of (1,3). This
continues until it reaches the bottom-right cell.
TM
Technical Publications - An up thrust for knowledge
Geographic Information System 2 - 16 Spatial Data Models
TM
Technical Publications - An up thrust for knowledge
Geographic Information System 2 - 17 Spatial Data Models
TM
Technical Publications - An up thrust for knowledge
Geographic Information System 2 - 18 Spatial Data Models
a. Simple features
These are easy to create, store and are rendered on screen very quickly.
They lack connectivity relationships and so are inefficient for modeling phenomena
conceptualized as fields.
This is also called Feature data. Shapefiles are ArcView's native file format for geographic
features and attribute data. ArcView can also display Arc/Info Coverages, which comprises a
more complex representation of vector data. Vector comprise the following:
Point - a pair of x and y coordinates.
Line - a sequence of points
Polygon - a closed set of lines
Attribute information is stored in Feature Tables.
Point entities : These represent all geographical entities that are positioned by a single XY
coordinate pair. Along with the XY coordinates the point must store other information such as
what does the point represent etc.
Line entities : Linear features made by tracing two or more XY coordinate pair.
Simple line: It requires a start and an end point.
Arc: A set of XY coordinate pairs describing a continuous complex line. The shorter the line
segment and the higher the number of coordinate pairs, the closer the chain approximates a
complex curve.
Simple Polygons : Enclosed structures formed by joining set of XY coordinate pairs. The
structure is simple but it carries few disadvantages which are mentioned below:
Lines between adjacent polygons must be digitized and stored twice, improper digitization
give rise to slivers and gaps
Convey no information about neighbor
Creating islands is not possible
b. Topological features
A topology is a mathematical procedure that describes how features are spatially related and
ensures data quality of the spatial relationships.
Topological relationships include following three basic elements:
I. Connectivity : Information about linkages among spatial objects
II. Contiguity : Information about neighbouring spatial object
III. Containment : Information about inclusion of one spatial object within another spatial
object
TM
Technical Publications - An up thrust for knowledge
Geographic Information System 2 - 19 Spatial Data Models
The from node and to node of an arc indicate its direction, and it helps determining the
polygons on its left and right side.
Left-right topology refers to the polygons on the left and right sides of an arc.
In the illustration above, polygon B is on the left and polygon C is on the right of the arc 4.
Polygon A is outside the boundary of the area covered by polygons B, C and D.
It is called the external or universe polygon, and represents the world outside the study area.
The universe polygon ensures that each arc always has a left and right side defined.
Containment
Geographic features cover distinguishable area on the surface of the earth. An area is
represented by one or more boundaries defining a polygon. The polygons can be simple or they can
be complex with a hole or island in the middle. In the illustration given below assume a lake with
an island in the middle. The lake actually has two boundaries, one which defines its outer edge and
the other (island) which defines its inner edge. An island defines the inner boundary of a polygon.
The polygon D is made up of arc 5, 6 and 7. The 0 before the 7 indicates that the arc 7 creates an
island in the polygon.
TM
Technical Publications - An up thrust for knowledge
Geographic Information System 2 - 21 Spatial Data Models
Polygons are represented as an ordered list of arcs and not in terms of X, Y coordinates. This is
called Polygon-Arc topology. Since arcs define the boundary of polygon, arc coordinates are stored
only once, thereby reducing the amount of data and ensuring no overlap of boundaries of the
adjacent polygons.
Topologic Features
Networks : A network is a topologic feature model which is defined as a line graph composed
of links representing linear channels of flow and nodes representing their connections. The
topologic relationship between the features is maintained in a connectivity table. By consulting
connectivity table, it is possible to trace the information flowing in the network.
Polygons with explicit topological structures : Introducing explicit topological relationships
takes care of islands as well as neighbors. The topological structures are built either by creating
topological links during data input or using software. Dual Independent Map Encoding (DIME)
system of US Bureau of the Census is one of the first attempts to create topology in geographic
data.
Fig. 2.6.7 Arc node Topology, Polygon Topology, Polygon Arc Topology
TM
Technical Publications - An up thrust for knowledge
Geographic Information System 2 - 22 Spatial Data Models
Vector : Advantages
Data can be represented at its original resolution and form without generalization.
Graphic output is usually more aesthetically pleasing (traditional cartographic
representation);
Since most data, e.g. hard copy maps, is in vector form no data conversion is required.
Accurate geographic location of data is maintained.
Allows for efficient encoding of topology, and as a result more efficient operations that
require topological information, e.g. proximity, network analysis.
Vector : Disadvantages
The location of each vertex needs to be stored explicitly.
For effective analysis, vector data must be converted into a topological structure. This is
often processing intensive and usually requires extensive data cleaning. As well, topology is
static, and any updating or editing of the vector data requires re-building of the topology.
Algorithms for manipulative and analysis functions are complex and may be processing
intensive. Often, this inherently limits the functionality for large data sets, e.g. a large
number of features.
Continuous data, such as elevation data, is not effectively represented in vector form.
Usually substantial data generalization or interpolation is required for these data layers.
Spatial analysis and filtering within polygons is impossible.
Raster Advantages :
The geographic location of each cell is implied by its position in the cell matrix.
TM
Technical Publications - An up thrust for knowledge
Geographic Information System 2 - 23 Spatial Data Models
Accordingly, other than an origin point, e.g. bottom left corner, no geographic coordinates
are stored.
Due to the nature of the data storage technique data analysis is usually easy to program and
quick to perform.
The inherent nature of raster maps, e.g. one attribute maps, is ideally suited for mathematical
modeling and quantitative analysis.
Discrete data, e.g. forestry stands, is accommodated equally well as continuous data, e.g.
elevation data, and facilitates the integrating of the two data types.
Grid-cell systems are very compatible with raster-based output devices, e.g. electrostatic
plotters, graphic terminals.
Raster : Disadvantages
The cell size determines the resolution at which the data is represented.
It is especially difficult to adequately represent linear features depending on the cell
resolution. Accordingly, network linkages are difficult to establish.
Processing of associated attribute data may be cumbersome if large amounts of data exist.
Raster maps inherently reflect only one attribute or characteristic for an area.
Since most input data is in vector form, data must undergo vector-to-raster conversion.
Besides increased processing requirements this may introduce data integrity concerns due to
generalization and choice of inappropriate cell size.
Most output maps from grid-cell systems do not conform to high-quality cartographic needs.
It is often difficult to compare or rate GIS software that use different data models. Some
personal computer (PC) packages utilize vector structures for data input, editing, and display but
convert to raster structures for any analysis. Other more comprehensive GIS offerings provide both
integrated raster and vector analysis techniques. They allow users to select the data structure
appropriate for the analysis requirements. Integrated raster and vector processing capabilities are
most desirable and provide the greatest flexibility for data manipulation and analysis.
TM
Technical Publications - An up thrust for knowledge
Geographic Information System 2 - 24 Spatial Data Models
The TIN data structure is defined by two elements: a set of input points with x,y, and z
values, and a series of edges connecting these points to form triangles. Each input point
becomes the node of a triangle in the TIN structure, and the output is a continuous faceted
surface of triangles
The triangles are constructed according to a mathematical technique called Delaunay
triangulation. The technique guarantees that a circle drawn through the three nodes of any
triangle will contain no other input point.
Because points can be placed irregularly over a surface a TIN can have higher resolution in
areas where surface is highly variable. The model incorporates original sample points
providing a check on the accuracy of the model. The information related to TIN is stored in a
file or a database table. Calculation of elevation, slope, and aspect is easy with TIN but these
are less widely available than raster surface models and more time consuming in term of
construction and processing.
TM
Technical Publications - An up thrust for knowledge
Geographic Information System 2 - 25 Spatial Data Models
The TIN model is a vector data model which is stored using the relational attribute tables. A
TIN dataset contains three basic attribute tables: Arc attribute table that contains length, from node
and to node of all the edges of all the triangles.
Node attribute table that contains x, y coordinates and z (elevation) of the vertices
Polygon attribute table that contains the areas of the triangles, the identification number of
the edges and the identifier of the adjacent polygons.
Storing data in this manner eliminated redundancy as all the vertices and edges are stored only
once even if they are used for more than one triangle. As TIN stores topological relationships, the
datasets can be applied to vector based geoprocessing such as automatic contouring, 3D landscape
visualization, volumetric design, surface characterization etc.
TIN GRID
Advantages Ability to describe the surface at Easy to store and manipulate
different Easy integration with raster database
Efficiency in storing data Smoother, more natural appearance of
derived terrain features
Disadvantages In many cases require visual Inability to use various grid sizes to reflect
inpection and manual control of areas of different complexity of relief.
the network.
TM
Technical Publications - An up thrust for knowledge
Geographic Information System 2 - 26 Spatial Data Models
TM
Technical Publications - An up thrust for knowledge
Geographic Information System 2 - 27 Spatial Data Models
The requests being made by the client are commonly known as Hypertext Transfer Protocol
(HTTP) requests. HTTP is the protocol used for data communication on the World Wide Web and
there are two types of HTTP requests that need to be defined for this course:
So to appropriately define the use of web services: A web service is used to provide access to
data and information from a server via the internet to a client.
TM
Technical Publications - An up thrust for knowledge
Geographic Information System 2 - 28 Spatial Data Models
TM
Technical Publications - An up thrust for knowledge
Geographic Information System 2 - 29 Spatial Data Models
● KML
This standard is a XML language that is based around the visualisation of geographic data, and
is used within Google Earth and Google Maps. Geographic data that this standard commonly
represents include place marks, image overlays, polygon features and paths.
● open GeoSMS
This standard provides an encoding and interface that offers the potential for the
communication of location content betvveen different Location Based Service (LBS) devices or
applications via a Short Message Service (SMS).
● GeoAP1
This standard defines a Java language application programming interface (API) that can be used
to manipulate geographic information via the use of a Java based standard library that contains the
types and methods that can be implemented.
TM
Technical Publications - An up thrust for knowledge
Geographic Information System 2 - 30 Spatial Data Models
Data quality is a pillar in any GIS implementation and application as reliable data are
indispensable to allow the user obtaining meaningful results. The following review of data quality
focuses on three distinct components, data accuracy, quality, and error.
2.10.1 Accuracy
The fundamental issue with respect to data is accuracy. Accuracy is the closeness of results of
observations to the true values or values accepted as being true. This implies that observations
of most spatial phenomena are usually only considered to estimates of the true value. The
difference between observed and true (or accepted as being true) values indicates the accuracy of
the observations.
Basically two types of accuracy exist. These are positional and attribute accuracy
Positional accuracy is the expected deviance in the geographic location of an object from its
true ground position. There are two components to positional accuracy. These
are relative and absolute accuracy.
Absolute accuracy concerns the accuracy of data elements with respect to a coordinate scheme.
Relative accuracy concerns the positioning of map features relative to one another.
Attribute accuracy is equally as important as positional accuracy. It also reflects estimates of
the truth. Interpreting and depicting boundaries and characteristics for forest stands or soil
polygons can be exceedingly difficult and subjective.
2.10.2 Quality
Quality can simply be defined as the fitness for use for a specific data set. Data that is
appropriate for use with one application may not be fit for use with another. It is fully dependent
on the scale, accuracy, and extent of the data set, as well as the quality of other data sets to be used.
The recent U.S. Spatial Data Transfer Standard (SDTS) identifies five components to data quality
definitions. These are :
Lineage
Positional Accuracy
Attribute Accuracy
Logical Consistency
Completeness
Lineage
The lineage of data is concerned with historical and compilation aspects of the data such as the:
source of the data;
TM
Technical Publications - An up thrust for knowledge
Geographic Information System 2 - 31 Spatial Data Models
● Positional Accuracy
The identification of positional accuracy is important. This includes consideration of inherent
error (source error) and operational error (introduced error).
● Attribute Accuracy
Consideration of the accuracy of attributes also helps to define the quality of the data. This
quality component concerns the identification of the reliability, or level of purity (homogeneity), in
a data set.
● Logical Consistency
This component is concerned with determining the faithfulness of the data structure for a data
set. This typically involves spatial data inconsistencies such as incorrect line intersections,
duplicate lines or boundaries, or gaps in lines. These are referred to as spatial or topological errors.
● Completeness
The final quality component involves a statement about the completeness of the data set. This
includes consideration of holes in the data, unclassified areas, and any compilation procedures that
may have caused data to be eliminated.
The ease with which geographic data in a GIS can be used at any scale highlights the
importance of detailed data quality information. Although a data set may not have a specific scale
once it is loaded into the GIS database, it was produced with levels of accuracy and resolution that
make it appropriate for use only at certain scales, and in combination with data of similar scales.
2.10.3 Error
Two sources of error, inherent and operational, contribute to the reduction in quality of the
products that are generated by geographic information systems.
Inherent error is the error present in source documents and data.
Operational error is the amount of error produced through the data capture and manipulation
functions of a GIS.
TM
Technical Publications - An up thrust for knowledge
Geographic Information System 2 - 32 Spatial Data Models
● DataInformationExchange:
Data information exchange is basically the information about the data provided by the client to
organization. The degree of information provided by the client defines the accuracy and
completeness of data.
● TypeandSource:
Data type and source must be evaluated in order to get appropriate data values. There are many
spatial data formats and each one of them is having some beneficiary elements as well as some
drawbacks.
● DataCapture:
There are many tools that incorporate manual skills to capture the data using various softwares
like ArcGIS. These softwares allows user to capture information from the base data. During this
data capture, the user may misinterpret features from the base data and captures the features with
errors. Data capture must be performed on a perfect scale where one must be able to view the
features distinctly.
TM
Technical Publications - An up thrust for knowledge
Geographic Information System 2 - 33 Spatial Data Models
● CartographicEffects :
After capturing the data, some cartographic effects like symbology, pattern, colors, orientation
and size are assigned to the features. This is required for a better representation of reality. These
effects must be assigned according to the domain of the features
● DataTransfer:
Some discrepancies may occur while transferring the data from one place to another “There is
no bad or good data. There are only data which are suitable for a specific purpose.” So, Data must
be evaluated according to the domain for which it is supposed to be used.
● Metadata:
Sometimes metadata is not updated according to the original features. So, metadata must be
updated with the original data.
TM
Technical Publications - An up thrust for knowledge
Geographic Information System 2 - 34 Spatial Data Models
constructed. The vector spatial entity, the point is represented by a single x,y coordinate pair.
Line and area entities are constructed by a series of points into chains.
TM
Technical Publications - An up thrust for knowledge
Geographic Information System 2 - 35 Spatial Data Models
TM
Technical Publications - An up thrust for knowledge
Geographic Information System 2 - 36 Spatial Data Models
systems map overlay is time consuming, complex and computationally expensive. In raster
based systems it is quick, straightforward and efficient
TM
Technical Publications - An up thrust for knowledge
Geographic Information System 2 - 37 Spatial Data Models
Q.2 Compare raster and vector data representation with suitable examples. (Refer section 2.7)
Q.3 With neat sketch explain briefly about Raster Data structures. (Refer section 2.4)
Q.6 Describe TIN. Give the difference between TIN and GRID. (Refer section 2.8)
TM
Technical Publications - An up thrust for knowledge
Geographic Information System 2 - 38 Spatial Data Models
Notes
TM
Technical Publications - An up thrust for knowledge
Geographic Information System 3-2 Data Input and Topology
TM
Technical Publications - An up thrust for knowledge
Geographic Information System 3-3 Data Input and Topology
Keyboard entry may also be used during manual digitizing to enter the attribute information.
However this is usually more efficiently handled as a separate operation.
Roads files versus the census file -- roads file will use codes for the various road types while
the census file uses exact numbers for things like total population, age range, etc.
TM
Technical Publications - An up thrust for knowledge
Geographic Information System 3-4 Data Input and Topology
3.1.4 Scanning
Scanning provides a faster means of data entry compared to manual digitizing.
The process of conversion of paper maps into digital format usable by computer is known as
scanning.
It is used to convert an analog map into a scanned file, which is again converted to vector
format through tracing.
Scanning automatically captures map features, text and symbols as individual cells, or pixels
and produces an automated image.
The scanned file shows map features as raster lines (a series of connected pixels). And must
be vectorized to complete the process of digitizing.
Vectorization is converting raster lines into vector lines in a process known as tracing.
TM
Technical Publications - An up thrust for knowledge
Geographic Information System 3-5 Data Input and Topology
a. Mechanical scanner
It is called drum scanner since a map or an image placed on a drum is digitized mechanically
with rotation of the drum and shift of the sensor as shown in Fig. 3.2.1(a). It is accurate but slow.
TM
Technical Publications - An up thrust for knowledge
Geographic Information System 3-6 Data Input and Topology
b. Video Camera
Video camera with CRT (cathode ray tube) is often used to digitize a small part of map of firm.
This is not very accurate but cheap. (See Fig. 3.2.1(b))
c. CCD Camera
Area CCD camera (called digital still camera) instead of video camera will be also convenient
to acquire digital image data (See Fig. 3.2.1 (c)). It is more stable and accurate than video camera.
d. CCD Scanner
Flat bed type or roll feed type scanner with linear CCD (charge coupled device) is now
commonly used to digitize analog maps in raster format, either in mono-tone or color mode. It is
accurate but expensive. (See Fig. 3.2.1 (d)).
( 25 - 100 μm)
TM
Technical Publications - An up thrust for knowledge
Geographic Information System 3-7 Data Input and Topology
Flatbed Scanner
The most commonly used scanner is a flatbed scanner also
known as desktop scanner. It has a glass plate on which the
picture or the document is placed. The scanner head placed
beneath the glass plate moves across the picture and the result is
a good quality scanned image. For scanning large maps or
toposheets wide format flatbed scanners can be used.
Drum Scanner
Then there are the drum scanners which are mostly used by
the printing professionals. In this type of scanner, the image
or the document is placed on a glass cylinder that rotates at
very high speeds around a centrally located sensor
containing photo-multiplier tube instead of a CCD to scan.
Prior to the advances in the field of sheet fed scanners, the
drum scanners were extensively used for scanning maps and
other documents.
TM
Technical Publications - An up thrust for knowledge
Geographic Information System 3-8 Data Input and Topology
Image scanned in Gray colour scale and (especially) colour images can be quite large.
It must be made sure that the system is capable of handling files whose size is often
measured in tens of megabytes.
Because virtually every pixel is populated with a value, an attempt to compress the file
results in little or no reduction in file size.
TM
Technical Publications - An up thrust for knowledge
Geographic Information System 3-9 Data Input and Topology
There is a wide range of raster file formats used in the GIS world. Some of the most popular
ones are listed below.
GEO-TIFF
As part of a header in a TIFF format it puts Lat/Long at the edges of the pixels.
DEM
Digital Elevation Models or DEM have two types of displays. The first is 30-meter elevation
data from 1:24,000 seven-and-a-half minute quadrangle map. The second is the 1:250,000 3
arc-second digital terrain data. DEMs are produced by the National Mapping Division of
USGS.
RS Landsat
Landsat satellite imagery and BIL information are used in RS Landsat. In one format, using
BIL, pixel values from each band are pulled out and combined. Programs that use this kind
of information include IDRISI, GRASS, and MapFactory. It is fairly easy to exchange
information from within these raster formats.
TM
Technical Publications - An up thrust for knowledge
Geographic Information System 3 - 11 Data Input and Topology
Step 2 : Control points or tics at four corners of this map sheet should be digitized by the
digitizer and input to PC together with the map coordinates of the four corners.
Step 3 : Map contents are digitized according to the map layers and map code system in either
point mode or stream mode at short time interval.
Step 4 : Editing errors such as small gaps at line junctions, overshoots, duplicates etc. should
be made for a clean dataset without errors.
Step 5 : Conversion from digitizer coordinates to map coordinates to store in a spatial database.
Heads-up digitization
This method uses scanned copy of the map or image and digitization is done on the screen of
the computer monitor. The scanned map lays vertical which can be viewed without bending the
head down and therefore is called as heads up digitization. Semi-automatic and automatic methods
of digitizing requires post processing but saves lot of time and resources compared to manual
method .
TM
Technical Publications - An up thrust for knowledge
Geographic Information System 3 - 12 Data Input and Topology
Heads-down digitization
Digitizers are used to capture data from hardcopy maps. Heads down digitization is done on a
digitizing table using a magnetic pen known as Puck. The position of a cursor or puck is detected
when passed over a table inlaid with a fine mesh of wires. The function of a digitizer is to input
correctly the coordinates of the points and the lines. Digitization can be done in two modes.
Point mode : In this mode, digitization is started by placing a point that marks the beginning of
the feature to be digitized and after that more points are added to trace the particular feature (line or
a polygon). The number of points to be added to trace the feature and the space interval between
two consecutive points are decided by the operator
Stream mode : In stream digitizing, the cursor is placed at the beginning of the feature, a
command is then sent to the computer to place the points at either equal or unequal intervals as per
the position of the cursor moving over the image of the feature
3.5 Topology
Topology expresses explicitly the spatial relationships between connecting or adjacent
vector features (points, polylines and polygons) in a GIS, such as two lines meeting perfectly
at a point and directed line having an explicit left and right side.
Topological or topology based data are useful for detecting and correcting digitizing error in
geographic data set and are necessary for some GIS analyses.
Topologic data structures help insure that information is not unnecessarily repeated. The
database stores one line only in order to represent a boundary (as opposed to two lines, one
for each polygon). The database tells us that the line is the “left side” of one polygon and the
“right side” of the adjacent polygon.
Topology is the study of those properties of geometric objects that remain invariant under
certain transformations such as bending or stretching.
Topology is often explained through graph theory.
Topology has least two main advantages.
i) The assurance of data quality
ii) Enhance GIS analysis
Topological relationships are built from simple elements into complex elements: points
(simplest elements), arcs (sets of connected points), areas (sets of connected arcs), and routes
(sets of sections, which are arcs or portions of arcs).
TM
Technical Publications - An up thrust for knowledge
Geographic Information System 3 - 13 Data Input and Topology
3. Contiguity (Adjacency) :
Every arc has a direction
A GIS maintains a list of Polygons on the left and right side of each arc.
The computer then uses this information to determine which features are next to one another.
TM
Technical Publications - An up thrust for knowledge
Geographic Information System 3 - 14 Data Input and Topology
1. Coverage
Coverage is a topology based vector data format. Coverage can be a point coverage, line
coverage, or polygon coverage.
The coverage model supports three basic topological relationships.
Connectivity : Arc connects to each other at nodes.
Area definition : An Area is defined by a series of connected arcs.
Contiguity : Arcs have directions and left and right polygon.
2. Shapefile
Shapefile is a standard non topological data format. Shape file are a first attempt an object
spatial features.
TM
Technical Publications - An up thrust for knowledge
Geographic Information System 3 - 15 Data Input and Topology
They are very simple floating point geometry feature. A Shapefile is a digital vector storage
format for storing geometric location and associated attribute information.
A shapefile is actually a set of several files
.shp - shape format; the feature geometry itself
.shx - shape index format; a positional index of the feature geometry to allow seeking
forwards and backwards quickly
.dbf - attribute format; columnar attributes for each shape, in dBase III format
The geometry of a shapefile is stored in two basic files .shp and .shx :
Must Be Disjoint
Points cannot overlap within the same feature class or
subtype. Use this rule when points within one feature class
or subtype should never occupy the same space, for
example, when fittings in a water distribution network
should not overlap.
TM
Technical Publications - An up thrust for knowledge
Geographic Information System 3 - 17 Data Input and Topology
POLYLINE
TM
Technical Publications - An up thrust for knowledge
Geographic Information System 3 - 18 Data Input and Topology
POLYGON
TM
Technical Publications - An up thrust for knowledge
Geographic Information System 3 - 19 Data Input and Topology
Must be Covered by
Polygons in one feature class or subtype must be covered by a
single polygon from another feature class or subtype. Use this rule
when you want one set of polygons to be covered by some part of
another single polygon in another feature class or subtype, for
example, when countries must be covered by states.
TM
Technical Publications - An up thrust for knowledge
Geographic Information System 3 - 21 Data Input and Topology
Goals of ODBC
Access any data from any application, regardless of which DBMS is handling the data Insert
a middle layer between an application and the database management system.
A database driver this layer translates the application's data queries into commands that the
DBMS understands
Allow application programs to use SQL to access data from any kinds of sources
TM
Technical Publications - An up thrust for knowledge
Geographic Information System 3 - 22 Data Input and Topology
Application
Performs processing and calls ODBC functions to submit SQL statements and retrieve results.
A number of tasks are common to all applications, no matter how they use ODBC.
Taken together, they largely define the flow of any ODBC application. The tasks are :
Selecting a data source and connecting to it.
Submitting an SQL statement for execution.
Retrieving results (if any).
Processing errors.
Committing or rolling back the transaction enclosing the SQL statement.
Disconnecting from the data source.
Driver Manager
Loads and unloads drivers on behalf of an application. Processes ODBC function calls or
passes them to a driver.
The Driver Manager exists mainly as a convenience to application writers and solves a
number of problems common to all applications. These include determining which driver to
load based on a data source name, loading and unloading drivers, and calling functions in
driver
Driver Processes ODBC function calls, submits SQL requests to a specific data source, and
returns results to the application. If necessary, the driver modifies an application's request so
that the request conforms to syntax supported by the associated DBMS.
TM
Technical Publications - An up thrust for knowledge
Geographic Information System 3 - 23 Data Input and Topology
Drivers are libraries that implement the functions in the ODBC API. Each is specific to a
particular DBMS; Drivers expose the capabilities of the underlying DBMSs; they are not required
to implement capabilities not supported by the DBMS. The only major exception to this is that
drivers for DBMSs that do not have stand-alone database engines, such as Xbase, must implement
a database engine that at least supports a minimal amount of SQL.
Data Source consists of the data the user wants to access and its associated operating system,
DBMS, and network platform (if any) used to access the DBMS.
A data source is simply the source of the data. It can be a file, a particular database on a
DBMS, or even a live data feed. The data might be located on the same computer as the program,
or on another computer somewhere on a network.
The purpose of a data source is to gather all of the technical information needed to access the
data - the driver name, network address, network software, and so on - into a single place and hide
it from the user.
What is GPS ?
GPS or Global Positioning System is a satellite navigation system that furnishes location and
time information in all climate conditions to the user. GPS is used for navigation in planes, ships,
cars and trucks also. The system gives critical abilities to military and civilian users around the
globe. GPS provides continuous real time, 3-dimensional positioning, navigation and timing
worldwide.
TM
Technical Publications - An up thrust for knowledge
Geographic Information System 3 - 24 Data Input and Topology
Space Segment :
The space segment is the number of satellites in the constellation. It comprises of 29 satellites
circling the earth every 12 hours at 12,000 miles in altitude. The function of the space segment is
utilized to route/navigation signals and to store and retransmit the route/navigation message sent by
the control segment. These transmissions are controlled by highly stable atomic clocks on the
satellites. The GPS Space Segment is formed by a satellite constellation with enough satellites to
ensure that the users will have, at least, 4 simultaneous satellites in view from any point at the
Earth surface at any time.
Control Segment :
The control segment comprises of a master control station and five monitor stations outfitted
with atomic clocks that are spread around the globe. The five monitor stations monitor the GPS
satellite signals and then send that qualified information to the master control station where
abnormalities are revised and sent back to the GPS satellites through ground antennas. Control
segment also referred as monitor station.
User Segment :
The user segment comprises of the GPS receiver, which receives the signals from the GPS
satellites and determine how far away it is from each satellite. Mainly this segment is used for the
U.S military, missile guidance systems, civilian applications for GPS in almost every field. Most of
TM
Technical Publications - An up thrust for knowledge
Geographic Information System 3 - 25 Data Input and Topology
the civilian uses this from survey to transportation to natural resources and from there to
agriculture purpose and mapping too.
The distance calculated from the third satellite will add one more sphere to be imagined on
whose surface lies the receiver. This gives rise to only one valid intersection i.e. the point
where the three spheres intersect is the position of the receiver in a two dimensional space.
TM
Technical Publications - An up thrust for knowledge
Geographic Information System 3 - 26 Data Input and Topology
A GPS receiver determines its position by using the signals that it receives from different
satellites. Since the receiver must solve for its position (X,Y,Z) and the clock error (d), four
satellite are required to solve receiver’s position using the following four equations:
R 12 = (X X1 ) 2 (Y Y1 )2 (Z Z1 ) 2 d 2
R 22 = (X X 2 ) 2 (Y Y2 )2 (Z Z2 ) 2 d 2
R 32 = (X X3 ) 2 (Y Y3 ) 2 (Z Z3 ) 2 d 2
R 24 = (X X 4 ) 2 (Y Y4 )2 (Z Z4 ) 2 d 2
Where (X1, Y1, Z1) (X2, Y2, Z2) (X3, Y3, Z3 ) and (X4, Y4, Z4) are the locations of the satellites
and R1, R2, R3, R4 are the distances of satellites from the receiver position. Hence solving the four
equations for four unknown factors X, Y, Z and d, the location of the receiver is calculated.
TM
Technical Publications - An up thrust for knowledge
Geographic Information System 3 - 27 Data Input and Topology
Satellite visibility : The more number of satellites a GPS receiver can observe, the better the
accuracy. Buildings, terrain, or dense foliage can block signals which can cause inaccurate
estimation of the position or no position reading at all.
Satellite geometry : The satellite geometry refers to the relative position of the satellites at
any given time. Better GPS signals are obtained when satellites are separated from each than
when they are in tight grouping.
Agriculture
Allows accurate field navigation, and maximum ground coverage in the shortest possible
time.
Enhancement of crop productivity by having precision soil sampling, correct estimation of
variation in chemical applications and planting density.
Environment
Environmental disasters such as fires and oil spills can be tracked accurately.
GPS tracking and mapping to facilitate monitoring and preservation of endangered species.
Aviation
Free, continuous and accurate positioning information of flights on a global basis.
Safe and fuel-efficient routes for airspace service providers.
TM
Technical Publications - An up thrust for knowledge
Geographic Information System 3 - 28 Data Input and Topology
Advantages of GPS :
GPS satellite based navigation system is an important tool for military, civil and commercial
users
Vehicle tracking systems GPS-based navigation systems can provide us with turn by turn
directions
Very high speed
Disadvantages of GPS :
GPS satellite signals are too weak when compared to phone signals, so it doesn’t work as
well indoors, underwater, under trees, etc.
The highest accuracy requires line-of-sight from the receiver to the satellite, this is why GPS
doesn’t work very well in an urban environment.
TM
Technical Publications - An up thrust for knowledge
Geographic Information System 3 - 30 Data Input and Topology
TM
Technical Publications - An up thrust for knowledge
Geographic Information System 3 - 31 Data Input and Topology
Disadvantages of GPS :
GPS satellite signals are too weak when compared to phone signals, so it doesn’t work as
well indoors, underwater, under trees, etc.
The highest accuracy requires line-of-sight from the receiver to the satellite, this is why
GPS doesn’t work very well in an urban environment.
Q.15 Define ODBC.
Ans. : Open Database Connectivity-or ODBC-is an application programming interface (API)
that lets software connect with database management systems while remaining independent of
them. This is important, because it allows applications to interact with multiple databases
simultaneously using SQL
Q.16 What are the four components of ODBC ?
Ans. :
Application
Driver Manager
Driver
Data Source
Q.17 Name the three segments in GPS.
Ans. :
The GPS system consists of three segments :
The space segment : the GPS satellites
The user segment, which includes both military and civilian users and their GPS equipment.
Q.18 Write few advantages of ODBC.
Ans. :
Some of the advantages of ODBC are :
ODBC provides a consistent interface regardless of the kind of database server used.
Applications do not have to be bound to each database on which they will run.
Q.19 Name some Topological Consistency rules for Line .
Ans. :
Must Coincide With
Must Be Disjoint
TM
Technical Publications - An up thrust for knowledge
Geographic Information System 3 - 32 Data Input and Topology
TM
Technical Publications - An up thrust for knowledge
DOWNLOADED FROM STUCOR APP
OCE552 – GEOGRAPHIC INFORMATION SYSTEM UNIT- I
systems. The process of projection transforms the Earth’s surface to a plane, and the outcome
P
The map shows the interstate highways in Idaho and The map shows the connected interstate
AP
Montana based on different coordinate systems. networks based on the same coordinate system.
or west from the prime meridian, and latitude measures the angle north or south of the
equatorial plane. For example, the longitude at point X is the angle a west of the prime
meridian, and the latitude at point Y is the angle b north of the equator.
U
ST
can measure the longitude value of a point on the Earth’s surface as 0° to 180° east or west of
the prime meridian. Meridians are therefore used for measuring location in the E–W
The flattening is based on the difference between the semimajor axis a and the semiminor axis b.
The angular measures of longitude and latitude may be expressed in degrees-minutes-
seconds (DMS), decimal degrees (DD), or radians (rad). Given that 1 degree equals 60
minutes and 1 minute equals 60 seconds, we can convert between DMS and DD. For
example, a latitude value of 45°52'30" would be equal to 45.875° (45 + 52/60 + 30/3600).
P
Radians are typically used in computer programs. One radian equals 57.2958°, and one
degree equals 0.01745 rad.
AP
Map Projections:
A map projection transforms the geographic coordinates on an ellipsoid into locations
on a plane. The outcome of this transformation process is a systematic arrangement of
R
parallels and meridians on a flat surface representing the geographic coordinate system. A
map projection provides a couple of distinctive advantages. First, a map projection allows us
CO
to use two-dimensional maps, either paper or digital. Second, a map projection allows us to
work with plane coordinates rather than longitude and latitude values.
Map projections can be grouped by either the preserved property or the projection
surface. Cartographers group map projections by the preserved property into the following
U
four classes: conformal, equal area or equivalent, equidistant, and azimuthal or true direction.
A conformal projection preserves local angles and shapes. An equivalent projection
ST
P
A map projection is defined by its parameters. Typically, a map projection has five or more
parameters. A standard line refers to the line of tangency between the projection surface and
AP
the reference globe. The standard line is called the standard parallel if it follows a parallel,
and the standard meridian if it follows a meridian. The principal scale, or the scale of the
reference globe, can be derived from the ratio of the globe’s radius to the Earth’s radius
(3963 miles or 6378 kilometers). The scale factor is the normalized local scale, defined as the
R
ratio of the local scale to the principal scale. The false easting is the assigned x-coordinate
value and the false northing is the assigned y-coordinate value. Essentially, the false easting
CO
and false northing create a false origin so that all points fall within the NE quadrant and have
positive coordinates. The following are the commonly used map projections: Transverse
Mercator, Lambert Conformal Conic, Albers Equal-Area Conic, Equidistant Conic, Web
Mercator.
U
ST
(a) (b)
The Projected Coordinate System (a): Representation of points in Geographic Coordinate System
(b): Equivalent representation in Projected coordinate system
P
Three coordinate systems are commonly used in the United States: the Universal
Transverse Mercator (UTM) grid system, the Universal Polar Stereographic (UPS) grid
AP
system, and the State Plane Coordinate (SPC) system.
R
CO
U
COMPONENTS OF GIS:
A GIS is an organized collection of computer hardware, software, geographic data,
and personnel designed to efficiently capture, store, update, manipulate, analyze, and display
all forms of geographically referenced information. GIS technology integrates common
database operations, such as query and statistical analysis, with the unique visualization and
geographic analysis benefits offered by maps. A working GIS integrates the following key
components: hardware, software, data, people, and methods.
o Hardware - GIS hardware includes computers for data processing, data storage,
and input/output; printers and plotters for reports and hard-copy maps; digitizers
and scanners for digitization of spatial data; and GPS (Global Positioning System)
P
and mobile devices for fieldwork.
AP
o Software - GIS software, either commercial or open source, includes programs
and applications to be executed by a computer for data management, data analysis,
data display, and other tasks. Additional applications, written in Python,
JavaScript, VB.NET, or C++, may be used in GIS for specific data analyses.
o Method - A successful GIS operates according to a well-designed plan and
R
business rules, which are the models and operating practices unique to each
CO
organization. Any organization has documented their process plan for GIS
operation. These document address number question about the GIS methods:
number of GIS expert required, GIS software and hardware, Process to store the
data, what type of DBMS (database management system) and more. Well
U
system and to develop plans for applying it. GIS users range from technical
specialists who design and maintain the system, to those who use it to help them
do their everyday work.
P
o Data - Maybe the most important component of a GIS is the data. Geographic
AP
data and related tabular data can be collected in-house or bought from a
commercial data provider. Most GIS employ a DBMS to create and maintain a
database to help organize and manage data. The data that a GIS operates on
consists of any data bearing a definable relationship to space, including any data
R
about things and events that occur in nature. At one time this consisted of hard-
copy data, like traditional cartographic maps, surveyor’s logs, demographic
CO
statistics, geographic reports, and descriptions from the field. Advances in spatial
data collection, classification, and accuracy have allowed more and more standard
digital base-maps to become available at different scales.
o Organization - GIS operations exist within an organizational environment;
U
therefore, they must be integrated into the culture and decision-making processes
of the organization for such matters as the role and value of GIS, GIS training,
ST
WORKING OF GIS:
GIS consists of the following elements i.e. geospatial data, data acquisition, data
management, data display, data exploration, and data analysis.
Geospatial Data: By definition, geospatial data cover the location of spatial features.
To locate spatial features on the Earth’s surface, we can use either a geographic or a
projected coordinated systems are available for use in GIS. A GIS represents
geospatial data as either vector data or raster data.
The vector data model uses points, lines, and polygons to represent spatial features
with a clear spatial location and boundary such as streams, land parcels, and
P
vegetation stands. Each feature is assigned an ID so that it can be associated with its
AP
attributes.
The raster data model uses a grid and grid cells to represent spatial features: point
features are represented by single cells, line features by sequences of neighbouring
cells, and polygon features by collections of contiguous cells. The cell value
R
corresponds to the attribute of the spatial feature at the cell location. Raster data are
ideal for continuous features such as elevation and precipitation.
CO
U
The raster data model uses cells in a grid to represent point features
ST
Data Acquisition: Data acquisition is usually the first step in conducting a GIS
project. The need for geospatial data by GIS users has been linked to the development
of data clearinghouses and geoportals. Since the early 1990s, government agencies at
different levels in the United States as well as many other countries have set up
websites for sharing public data and for directing users to various data sources.
Data acquisition involves compilation of existing and new data. To be used in a GIS,
a newly digitized map or a map created from satellite images requires geometric
transformation (i.e., geo-referencing). Additionally, both existing and new spatial data
must be edited if they contain digitizing and/or topological errors.
P
(DBMS) to handle attribute data, which can be large in size in the case of vector data.
Each polygon in a soil map, for example, can be associated with dozens of attributes
AP
on the physical and chemical soil properties and soil interpretations. Attribute data are
stored in a relational database as a collection of tables. These tables can be prepared,
maintained, and edited separately, but they can also be linked for data search and
retrieval.
R
Data Display: A routine GIS operation is mapmaking because maps are an interface
CO
scale bar, and other elements together to convey geographic information to the map
reader.
To make a “good” map, we must have a basic understanding of map symbols, colors,
ST
and typology, and their relationship to the mapped data. Additionally, we must be
familiar with map design principles such as layout and visual hierarchy. After a map
is composed in a GIS, it can be printed or saved as a graphic file for presentation. It
can also be converted to a KML file, imported into Google Earth, and shared publicly
on a web server.
Feature-based query can involve either attribute or spatial data. Attribute data query is
basically the same as database query using a DBMS. In contrast, spatial data query
allows GIS users to select features based on their spatial relationships such as
containment, intersect, and proximity. A combination of attribute and spatial data
queries provides a powerful tool for data exploration.
Data Analysis: A GIS has a large number of tools for data analysis. Some are basic
P
tools, meaning that they are regularly used by GIS users. Other tools tend to be
AP
discipline or application specific. Two basic tools for vector data are buffering and
overlay: buffering creates buffer zones from select features, and overlay combines the
geometries and attributes of the input layers.
R
CO
Four basic tools for raster data are local, neighbourhood, zonal, and global operations,
depending on whether the operation is performed at the level of individual cells, or
ST
P
AP
Basic operation of Raster Data
GIS SOFTWARE PRODUCTS:
The below table shows a select list of commercial GIS software in the left column and
free and open source software (FOSS) for GIS in the right column.
R
CO
U
ST
extensions include 3D Analyst, Network Analyst, Spatial Analyst, Geostatistical Analyst, and
others.
GRASS GIS (Geographic Resources Analysis Support System), the first FOSS for
GIS, was originally developed by the U.S. Army Construction Engineering Research
Laboratories in the 1980s. Well known for its analysis tools, GRASS GIS is currently
maintained and developed by a worldwide network of users. Academicians, government
agencies (NASA, NOAA, USDA and USGS) and GIS practitioners use this open source
P
software because its code can be inspected and tailored to their needs.
AP
SAGA GIS (System for Automated Geoscientific Analyses) is one of the classics in
the world of free GIS software. It started out primarily for terrain analysis such as
hillshading, watershed extraction and visibility analysis. Now, SAGA GIS is a powerhouse
because it delivers a fast growing set of geoscientific methods to the geoscientific
R
community.
GeoDa is a free GIS software program primarily used to introduce new users into
CO
spatial data analysis. Its main functionality is data exploration in statistics. One of the nicest
things about it is how it comes with sample data for you to give a test-drive. From simple
box-plots all the way to regression statistics, GeoDa has complete arsenal of statistics to do
nearly anything spatially.
U
APPLICATION OF GIS:
ST
natural hazards (volcano, flood, and landslide), aquifer depletion, and ground water
management.
In the private sector, most GIS applications are integrated with the Internet, GPS,
wireless technology, and Web services. The following shows some of these applications:
Online mapping websites offer locators for finding real estate listings, vacation
rentals, banks, restaurants, coffee shops, and hotels.
Location-based services allow mobile phone users to search for nearby banks,
restaurants, and taxis; and to track friends, dates, children, and the elderly.
Mobile GIS allows field workers to collect and access geospatial data in the field.
Mobile resource management tools track and manage the location of field crews and
P
mobile assets in real time.
AP
Automotive navigation systems provide turn by-turn guidance and optimal routes
based on precise road mapping using GPS and camera.
Augmented reality lets a smart phone user look through the phone’s camera with
superimposed data or images (e.g., 3-D terrain from a GIS, monsters in Pokemon Go)
R
about the current location.
CO
SCALES/LEVELS OF MEASUREMENTS:
Scales of Measurement or level of measurement is a system for classifying attribute
data into four categories namely nominal, ordinal, interval and ratio.
Nominal: In this level of measurement, the numbers in the variable are used only to
U
classify the data. In this level of measurement, words, letters, and alpha-numeric
symbols can be used. Suppose there are data about people belonging to three
ST
different gender categories. In this case, the person belonging to the female gender
could be classified as F, the person belonging to the male gender could be classified
as M, and transgendered classified as T. This type of assigning classification is
nominal level of measurement.
Ordinal: This level of measurement depicts some ordered relationship among the
variable’s observations. Suppose a student scores the highest grade of 100 in the
class. In this case, he would be assigned the first rank. Then, another classmate
scores the second highest grade of an 92; she would be assigned the second rank. A
third student scores a 81 and he would be assigned the third rank, and so on. The
ordinal level of measurement indicates an ordering of the measurements.
P
AP
Levels of Measurements
Interval: The interval level of measurement not only classifies and orders the
R
measurements, but it also specifies that the distances between each interval on the
scale are equivalent along the scale from low interval to high interval. For
CO
intervals, can have a value of zero as well. A common geographic example of ratio
data is density (i.e. population, ethnicity, etc.). Any percent value from 0 to 100 will
have a meaningful zero.
*****
DATABASE MODEL:
Data model defines the logical structure of a database. Data Models are fundamental
entities to introduce abstraction in a DBMS. Data models define how data is connected to
P
each other and how they are processed and stored inside the system. There are a number of
different database data models. Amongst those that have been used for attribute data in GIS
AP
are the hierarchical, network, relational, object-relational and object-oriented data models. Of
these the relational data model has become the most widely used model.
possible on individual tables or on groups of tables. For the Happy Valley data, the below
figure illustrates an example of one such table.
U
ST
The data in a relational database are stored as a set of base tables with the
characteristics described above. Other tables are created as the database is queried and these
represent virtual views. The table structure is extremely flexible and allows a wide variety of
queries on the data. Queries are possible on one table at a time (for example, you might ask
‘which hotels have more than 14 rooms?’ or ‘which hotels are luxury standard?’), or on more
than one table by linking through key fields (for instance, ‘which passengers originating from
the UK are staying in luxury hotels?’ or ‘which ski lessons have pupils who are over 50 years
of age?’). Queries generate further tables, but these new tables are not usually stored. There
are few restrictions on the types of query possible.
P
‘query by example’ systems. Frequently, queries are built up of expressions based on
relational algebra, using commands such as SELECT (to select a subset of rows), PROJECT
AP
(to select a subset of columns) or JOIN (to join tables based on key fields). SQL (standard
query language) has been developed to facilitate the querying of relational databases. The
advantages of SQL for database users are its completeness, simplicity, pseudo English-
language style and wide application. However, SQL has not really developed to handle
R
geographical concepts such as ‘near to’, ‘far from’ or‘connected to’.
CO
ER Diagram:
An Entity–relationship model (ER model) describes the structure of a database with
the help of a diagram, which is known as Entity Relationship Diagram (ER Diagram). An ER
model is a design or blueprint of a database that can later be implemented as a database. ER
U
Model is best used for the conceptual design of a database. The main components of E-R
model are:
ST
P
AP
R
CO
U
ST
The figure shows the ER diagram for the GPS tracking system. The design has three
entities namely User-generated Data, Publisher and Subscriber.
P
AP
ER Diagram of GPS System
R
SPATIAL DATA MODEL:
CO
block is the individual grid cell, and the shape and character of an entity is created by the
grouping of cells. The size of the grid cell is very important as it influences how an entity
ST
appears.
Topology:
Topology refers to the study of those properties of geometric objects that remain
invariant under certain transformations such as bending or stretching. An example of a
topological map is a subway map.
A subway map depicts correctly the connectivity between the subway lines and
stations on each line but has distortions in distance and direction. In GIS, vector data can be
topological or non-topological, depending on whether topology is built into the data or not.
Topology can be explained through directed graphs (digraphs), which show the arrangements
of geometric objects and the relationships among objects. An edge or arc is a directed line
P
with a starting point and an ending point. The end points of an arc are nodes, and
AP
intermediate points, if any, are vertices. And a face refers to a polygon bounded by arcs. If an
arc joins two nodes, the nodes are said to be adjacent and incident with the arc.
TIGER:
R
An early application example of topology is the Topologically Integrated Geographic
CO
Encoding and Referencing (TIGER) data base from the U.S. Census Bureau. The TIGER
database links statistical area boundaries such as counties, census tracts, and block groups to
roads, railroads, rivers, and other features by topology.
U
ST
Topology has three main advantages. First, it ensures data quality and integrity.
Second, topology can enhance GIS analysis. Third, topological relationships between spatial
features allow GIS users to perform spatial data query.
P
The more complex the shape of a line or area feature the greater the number of points
AP
required to represent it. Selecting the appropriate number of points to construct an entity is
one of the major dilemmas when using the vector approach.
R
CO
U
ST
P
Effect of changing resolution in the vector and raster worlds
AP
SPATIAL DATA STRUCTURES:
Data structures provide the information that the computer requires to reconstruct the
spatial data model in digital form. There are many different data structures in use in GIS. This
diversity is one of the reasons why exchanging spatial data between different GIS software
R
can be problematic. However, despite this diversity data structures can be classified
according to whether they are used to structure raster or vector data.
CO
storage and representation in the computer. The below figure shows the most straightforward
method of coding raster data. The cells in each line of the image (Figure: a) are mirrored by
an equivalent row of numbers in the file structure (Figure: c). The first line of the file tells the
ST
computer that the image consists of 10 rows and 10 columns and that the maximum cell value
is 1. In this example, a value of 0 has been used to record cells where the entity is not present
and a value of 1 for cells where the entity is present (Figure: b).
In a simple raster data structure, such as illustrated in the above figure, different
spatial features must be stored as separate data layers. Thus, to store more raster entities,
separate data files would be required, each representing a different layer of spatial data.
However, if the entities do not occupy the same geographic location (or cells in the raster
model), then it is possible to store them all in a single layer, with an entity code given to each
cell. This code informs the user which entity is present in which cell.
P
AP
Feature coding of cells in the raster world
R
Above figure shows how different land uses can be coded in a single raster layer. The values
CO
1, 2 and 3 have been used to classify the raster cells according to the land use present at a
given location. The value 1 represents residential area; 2, forest; and 3, farmland.
One of the major problems with raster data sets is their size, because a value must be
recorded and stored for each cell in an image. Thus, a complex image made up of a mosaic of
U
different features (such as a soil map with 20 distinct classes) requires the same amount of
storage space as a similar raster map showing the location of a single forest. To address this
ST
The above figure shows such a vector data structure for the Happy Valley car park.
P
Note how a closed ring of co-ordinate pairs defines the boundary of the polygon. The
AP
limitations of simple vector data structures start to emerge when more complex spatial
entities are considered. For example, consider the Happy Valley car park divided into
different parking zones (Figure: b). The car park consists of a number of adjacent polygons.
If the simple data structure, illustrated in Figure: a, were used to capture this entity then the
R
boundary line shared between adjacent polygons would be stored twice. This may not appear
too much of a problem in the case of this example, but consider the implications for a map of
CO
which points are associated with which polygon. This is known as a point dictionary. The
data structure in Figure: b, shows how such an approach has been used to store data for the
ST
P
compression. Other methods include LZW (Lempel—Ziv-Welch) and its variations (e.g.,
LZ77,LZMA).
AP
A lossy compression cannot reconstruct fully the original image but can achieve
higher compression ratios than a lossless compression. Lossy compression is therefore useful
for raster data that are used as background images rather than for analysis. Image degradation
through lossy compression can affect GIS-related tasks such as extracting ground control
R
points from aerial photographs or satellite images for the purpose of georeferencing.
Run length encoding:
CO
Run length encoding stores cells on a row-by-row basis. Instead of recording each
individual cell’s values, run length encoding groups cell values by row.
U
ST
Block coding:
The block coding raster storage technique assigns areas that are blocks to reduce
redundancy. The block coding raster image compression method subdivides an entire raster
image into hierarchical blocks. It’s an extension of the run length encoding technique, but
extends it to two dimensions.
Chain Coding:
Chain coding defines the outer boundary using relative positions from a start point.
The sequence of the exterior is stored where the endpoint finishes at the start point. During
the encoding, the direction is stored as an integer. However, in this example we use cardinal
directions for simplicity. For example, the value 0 is north and 1 is east.
P
Quadtree encoding:
Quadtrees are raster data structures based on the successive reduction of
AP
homogeneous cells. It recursively subdivides a raster image into quarters. The subdivision
process continues until each cell is classed.
R
CO
MrSID uses the wavelet transform for data compression. The wavelet-based
compression is also used by JPEG 2000 and ECW (Enhanced Compressed Wavelet). The
wavelet transform treats an image as a wave and progressively decomposes the wave into
U
simpler wavelets (Addison 2002). Using a wavelet (mathematical) function, the transform
repetitively averages groups of adjacent pixels (e.g., 2, 4, 6, 8, or more) and, at the same time,
ST
records the differences between the original pixel values and the average. The differences,
also called wavelet coefficients, can be 0, greater than 0, or less than 0. In parts of an image
that have few significant variations, most pixels will have coefficients of 0 or very close to 0.
To save data storage, these parts of the image can be stored at lower resolutions by rounding
off low coefficients to 0, but storage at higher resolutions is required for parts of the same
image that have significant variations (i.e., more details). Box 4.4 shows a simple example of
using the Haar function for the wavelet 3transform.
P
VECTOR vs RASTER:
AP
Vector Raster
Usually Complex. Usually Simple.
Difficult for overlay operation. Efficient for overlay operation.
High spatial variability is inefficiently High spatial variability is efficiently
R
represented. represented.
CO
Example: Example:
ST
observations. Thus, an appropriate number of observations must be selected, along with their
geographical location.
The ‘resolution’ of a DTM is determined by the frequency of observations used.
DTMs are created from a series of either regularly or irregularly spaced (x,y,z) data points
(where x and y are the horizontal co-ordinates and z is the vertical or height co-ordinate).
DTMs may be derived from a number of data sources. These include contour and spot height
information found on topographic maps, stereoscopic aerial photography, satellite images and
field surveys.
Triangulated Irregular Networks:
A commonly used data structure in GIS software is the triangulated irregular network
P
(TIN). It is on the standard implementation techniques for digital terrain models, but it can be
AP
used to represent any continuous field. The principles behind a TIN are simple. It is built
from a set of locations for which we have a measurement for instance an elevation. The
locations can be arbitrarily scattered in space and are usually not on a nice regular grid. Any
location together with its elevation value can be viewed as a point in three dimensional space.
This is illustrated in below figure. From these 3D points, we can construct an irregular
R
tessellation made of triangles.
CO
U
our elevation computation for location `P` on the left hand shaded triangle, we will get
another value than from the right hand shaded triangle. The second will provide a better
approximation because the average distance from `P` to the three triangle anchors is smaller.
The triangulation shown in below figure happens to be a Delaunay triangulation, which in a
sense is an optimal triangulation. There are multiple ways of defining what such a
triangulation is, but we suffice here to state two important properties. The first is that the
triangles are as equilateral (‘equal-sided’) as they can be, given the set of anchor points. The
second property is that for each triangle, the circumcircle through its three anchor points does
not contain any other anchor point. One such circumcircle is depicted on the right of Figure
(b).
P
AP
R
Two triangulations based on the input locations (a) one with many ‘stretched’ triangles;
(b) the triangles are more equilateral – Delaunay triangulation.
CO
A TIN clearly is a vector representation: each anchor point has a stored georeference.
Yet, we might also call it an irregular tessellation, as the chosen triangulation provides a
partitioning of the entire study space. However, in this case, the cells do not have an
associated stored value as is typical of tessellations, but rather a simple interpolation function
U
vendors, is working to deliver spatial interface specifications that are available for global use
(OGC, 2001). The OGC has proposed the Geography Markup Language (GML) as a new
GIS data standard.
The Geography Markup Language (GML) is a non-proprietary computer language
designed specifically for the transfer of spatial data over the Internet. GML is based on XML
(eXtensible Markup Language), the standard language of the Internet, and allows the
exchange of spatial information and the construction of distributed spatial relationships.
GML has been proposed by the Open Geospatial Consortium as a universal spatial
data standard. GML is likely to become very widely used because it is:
Internet friendly;
P
not tied to any proprietary GIS;
AP
specifically designed for feature-based spatial data;
open to use by anyone;
compatible with industry-wide IT standards.
It is also likely to set the standard for the delivery of spatial information content to
R
PDA and WAP devices, and so form an important component of mobile and location-based
(LBS) GIS technologies. The collection of geoportals and various other compliemntary
CO
them available on a device through a single entry point often referred to as a 'geoportal'. They
facilitate data providers and users to participate in the digital spatial community at a national
scale and provide a basis for spatial data discovery, evaluation and application for users
within government, commercial and non-profit sectors, and academia and by citizens in
general. The Global Spatial Data Infrastructure (GSDI) Association links national SDIs to
establish a connection for all users in the world to share and reuse the available datasets.
Data Accuracy:
In GIS, data quality is used to give an indication of how good data are. It describes
the overall fitness or suitability of data for a specific purpose or is used to indicate data free
from errors and other problems. Examining issues such as error, accuracy, precision and bias
can help to assess the quality of individual data sets. In addition, the resolution and
generalization of source data, and the data model used, may influence the portrayal of
features of interest. Data sets used for analysis need to be complete, compatible and
consistent, and applicable for the analysis being performed.
Accuracy is the extent to which an estimated data value approaches its true value
(Aronoff, 1989). If a GIS database is accurate, it is a true representation of reality. It is
impossible for a GIS database to be 100 per cent accurate, though it is possible to have data
that are accurate to within specified tolerances. For example, a ski lift station co-ordinate may
be accurate to within plus or minus 10 metres.
Several types of error can arise when accuracy and/or precision requirements are not
P
met during data capture and creation. The five types of error in a geospatial dataset are
AP
related to -
Positional Accuracy:
The identification of positional accuracy is important. This includes consideration of
inherent error (source error) and operational error (introduced error). A more detailed review
is provided in the next section.
R
Attribute Accuracy:
CO
Consideration of the accuracy of attributes also helps to define the quality of the data.
This quality component concerns the identification of the reliability, or level of purity
(homogeneity), in a data set.
Logical Consistency:
U
This component is concerned with determining the faithfulness of the data structure
for a data set. This typically involves spatial data inconsistencies such as incorrect line
ST
intersections, duplicate lines or boundaries, or gaps in lines. These are referred to as spatial
or topological errors.
Completeness:
The final quality component involves a statement about the completeness of the data
set. This includes consideration of holes in the data, unclassified areas, and any compilation
procedures that may have caused data to be eliminated.
*****
Scanner - Raster Data Input – Raster Data File Formats – Vector Data Input –Digitiser –
Topology - Adjacency, connectivity and containment – Topological Consistency rules –
Attribute Data linking – ODBC – GPS - Concept GPS based mapping.
Introduction:
Data encoding is the process of getting data into the computer. It is a process that is
fundamental to almost every GIS project. For example:
P
An archaeologist may encode aerial photographs of ancient remains to integrate with
newly collected field data.
AP
A planner may digitize outlines of new buildings and plot these on existing
topographical data.
An ecologist may add new remotely sensed data to a GIS to examine changes in
habitats.
R
A historian may scan historical maps to create a virtual city from the past.
A utility company may encode changes in pipeline data to record changes and
CO
Once in a GIS, data almost always need to be corrected and manipulated to ensure
that they can be structured according to the required data model. Problems that may have to
U
This unit looks in detail at the range of methods available to get data into a GIS.
These include keyboard entry, digitizing, scanning and electronic data transfer. Then,
methods of data editing and manipulation are reviewed, including re-projection,
transformation and edge matching. The whole process of data encoding and editing is often
called the ‘data stream’.
Analogue data are normally in paper form, and include paper maps, tables of statistics
and hard-copy (printed) aerial photographs. These data all need to be converted to digital
form before use in a GIS, thus the data encoding and correction procedures are longer than
those for digital data. Digital data are already in computer-readable formats and are supplied
on CD-ROM or across a computer network. Map data, aerial photographs, satellite imagery,
data from databases and automatic data collection devices (such as data loggers and GPS) are
all available in digital form.
P
AP
R
CO
U
SCANNER:
Scanning coverts paper maps into digital format by capturing features as individual
cells, or pixels, producing an automated image. Maps are generally considered the backbone
of any GIS activity. But many a time paper maps are not easily available in a form that can be
readily used by the computers. Most of the paper maps had been prepared on the basis of old
conventional surveys. New maps can be produced using improved technologies but this
requires time as it increases the volume of work. Thus, we have to resort to the available
maps. These paper maps have to be first converted into a digital format usable by the
computer. This is a critical step as the quality of the analog document must be preserved in
the transition to the computer domain.
The technology used for this kind of conversions is known as scanning and the
instrument used for this kind of operation is known as a scanner. A scanner can be thought of
as an electronic input device that converts analog information of a document like a map,
photograph or an overlay into a digital format that can be used by the computer. Scanning
automatically captures map features, text, and symbols as individual cells, or pixels, and
produces an automated image.
Working of a Scanner:
The most important component inside a scanner is the scanner head which can move
along the length of the scanner. The scanner head contains either a charged-couple device
P
(CCD) sensor or a contact image (CIS) sensor. A CCD consists of a number of photosensitive
AP
cells or pixels packed together on a chip. The most advanced large format scanners use
CCD’s with 8000 pixels per chip for providing a very good image quality.
While scanning a bright white light from the scanner strikes the image to be scanned
and is reflected onto the photosensitive surface of the sensor placed on the scanner head.
Each pixel transfers a gray tone value (values given to the different shades of black in the
R
image ranging from 0 (black) – 255 (white) i.e. 256 values to the scan board (software). The
CO
form of the image. This type of scanning is known as one pass scanning.
Scanning a colour image is slightly different in which the scanner head has to scan the
ST
same image for three different colours i.e. red, green, blue. In older colour scanners, this was
accomplished by scanning the same area three times over for the three different colours. This
type of scanner is known as three-pass scanner. However, most of the colour scanners now
scan in one pass scanning all the three colours in one go by using colour filters. In principle, a
colour CCD works in the same way as a monochrome CCD. But in this each colour is
constructed by mixing red, green and blue. Thus, a 24-bit RGB CCD presents each pixel by
24 bits of information. Usually, a scanner using these three colours (in full 24 RGB mode)
can create up to 16.8 million colours.
Types of Scanners:
Hand-held scanners although portable, can only scan images up to about four inches
wide. They require a very steady hand for moving the scan head over the document. They are
useful for scanning small logos or signatures and are virtually of no use for scanning maps
and photographs.
The most commonly used scanner is a flatbed scanner also known as desktop scanner.
It has a glass plate on which the picture or the document is placed. The scanner head placed
beneath the glass plate moves across the picture and the result is a good quality scanned
image. For scanning large maps or top sheets wide format flatbed scanners can be used.
P
AP
R
CO
U
Then there are the drum scanners which are mostly used by the printing professionals.
ST
In this type of scanner, the image or the document is placed on a glass cylinder that rotates at
very high speeds around a centrally located sensor containing photo-multiplier tube instead of
a CCD to scan. Prior to the advances in the field of sheet fed scanners, the drum scanners
were extensively used for scanning maps and other documents.
P
information about its data values. For example,
this includes projection, statistics, attributes,
pyramids and whether or not it’s a continuous or
AP
discrete type of raster.
ASCII uses a set of numbers (including floats)
between 0 and 255 for information storage and
processing. They also contain header
information with a set of keywords.
American Standard Code for
R
Information Interchange .ASC In their native form, ASCII text files store GIS
ASCII Grid data in a delimited format. This could be
comma, space or tab-delimited format. Going
CO
other files:
P
Band Interleaved for Line (BIL) stores
pixel information based on rows for all
AP
bands in an image.
.BIL Whereas Band interleaved by pixel (BIP)
Envi RAW Raster .BIP assigns pixel values for each band by
.BSQ rows.
Finally, Band sequential format (BSQ)
stores separate bands by rows.
R
BIL files consist of a header file (HDR) that
describes the number of columns, rows, bands,
CO
Vector data is not made up of grids of pixels. Instead, vector graphics are comprised
of vertices and paths. The three basic symbol types for vector data are points, lines and
polygons (areas).
P
AP
R
CO
U
ST
Esri Shapefile .SHP, But you’ll need a complete set of three files
.DBF, that are mandatory to make up a shapefile. The
.SHX three required files are:
P
SHP is the feature geometry.
SHX is the shape index position.
DBF is the attribute data.
AP
The GeoJSON format is mostly for web-based
mapping. GeoJSON stores coordinates as text
in JavaScript Object Notation (JSON) form.
This includes vector points, lines and polygons
as well as tabular information.
R
GeoJSON store objects within curly braces {}
.GEOJSON and in general have less markup overhead
CO
JSON to binary.
GML allows for the use of geographic
coordinates extension of XML. And
eXtensible Markup Language (XML) is both
human-readable and machine-readable.
P
System of 1984 (WGS84). The vertical
component (altitude) is measured in meters
from the WGS84 EGM96 Geoid vertical
AP
datum.
GPS Exchange format is an XML schema that
describes waypoints, tracks and routes
captured from a GPS receiver. Because GPX is
an exchange format, you can openly transfer
GPS data from one program to another based
R
GPS eXchange Format (GPX) .GPX on its description properties.
DIGITIZING:
Digitizing in GIS is the process of converting geographic data either from a hardcopy
or a scanned image into vector data by tracing the features. During the digitzing process,
features from the traced map or image are captured as coordinates in either point, line, or
polygon format.
P
digitizing. It is an appropriate technique when selected features are required from a paper
AP
map. Manual digitizing requires a digitizing table that is linked to a computer workstation.
The digitizing table is essentially a large flat tablet, the surface of which is underlain by a
very fine mesh of wires. Attached to the digitizer via a cable is a cursor (puck) that can be
moved freely over the surface of the table. Buttons on the cursor allow the user to send
instructions to the computer. The position of the cursor on the table is registered by reference
R
to its position above the wire mesh.
CO
U
ST
The procedure followed when digitizing a paper map using a manual digitizer has the
following five stages:
Registration: The map to be digitized is fixed firmly to the table top with sticky tape.
Five or more control points are identified (usually the four corners of the map sheet
and one or more grid intersections in the middle). The geographic co-ordinates of the
control points are noted and their locations digitized by positioning the cross-hairs on
the cursor exactly over them and pressing the ‘digitize’ button on the cursor. This
sends the co-ordinates of a point on the table to the computer and stores them in a file
as ‘digitizer co-ordinates’.
Digitizing point features: Point features, for example spot heights, hotel locations or
P
meteorological stations, are recorded as a single digitized point. A unique code
AP
number or identifier is added so that attribute information may be attached later. For
instance, the hotel with ID number ‘1’ would later be identified as ‘Mountain View’.
Digitizing line features: Line features (such as roads or rivers) are digitized as a series
of points that the software will join with straight line segments. In some GIS packages
R
lines are referred to as arcs, and their start and end points as nodes. This gives rise to
the term arc–node topology, used to describe a method of structuring line features.
CO
Digitizing area (polygon) features: Area features or polygons, for example forested
areas or administrative boundaries, are digitized as a series of points linked together
by line segments in the same way as line features. Here it is important that the start
and end points join to form a complete area. Polygons can be digitized as a series of
U
individual lines, which are later joined to form areas. In this case it is important that
each line segment is digitized only once.
ST
Manual digitizers may be used in one of two modes: point mode or stream mode. In
point mode the user begins digitizing each line segment with a start node, records each
change in direction of the line with a digitized point and finishes the segment with an end
node. Thus, a straight line can be digitized with just two points, the start and end nodes. For
more complex lines, a greater number of points are required between the start and end nodes.
Smooth curves are problematic since they require an infinite number of points to record their
true shape.
P
AP
(a) Point mode - person digitizing decides where to place each individual point such as to most accurately
R
represent the line within the accepted tolerances of the digitizer.
(b) Stream mode – person digitizing decides on time or distance interval between the digitizing hardware
registering each point as the person digitizing moves the cursor along the line.
CO
In stream mode the digitizer is set up to record points according to a stated time
interval or on a distance basis. Once the user has recorded the start of a line the digitizer
U
might be set to record a point automatically every 0.5 seconds and the user must move the
cursor along the line to record its shape. An end node is required to stop the digitizer
ST
recording further points. The speed at which the cursor is moved along the line determines
the number of points recorded. Thus, where the line is more complex and the cursor needs to
be moved more slowly and with more care, a greater number of points will be recorded.
Conversely, where the line is straight, the cursor can be moved more quickly and fewer
points are recorded.
The choice between point mode and stream mode digitizing is largely a matter of
personal preference. Stream mode digitizing requires more skill than point mode digitizing,
and for an experienced user may be a faster method. Stream mode will usually generate more
points, and hence larger files, than point mode.
TOPOLOGY:
Topology is the mathematical representation of the physical relationships that exists
between the geographical elements. Topology has long been a key GIS requirement for data
management and integrity. In general, a topological data model manages spatial relationships
by representing spatial objects (point, line, and area features) as an underlying graph of
topological primitives—nodes, faces, and edges. These primitives, together with their
relationships to one another and to the features whose boundaries they represent, are defined
by representing the feature geometries in a planar graph of topological elements.
Topology is useful in GIS because many spatial modeling operations don’t require
coordinates, only topological information. For example, to find an optimal path between two
P
points requires a list of the arcs that connect to each other and the cost to traverse each arc in
AP
each direction. Coordinates are only needed for drawing the path after it is calculated.
Connectivity
Connectivity is defined through arc-node topology. This is the basis for many network
tracing and path finding operations. Connectivity allows you to identify a route to the airport,
U
connect streams to rivers, or follow a path from the water treatment plant to a house.
In the arc-node data structure, an arc is defined by two endpoints: the from-node
ST
indicating where the arc begins and a to-node indicating where it ends. This is called arc-node
topology.
Arc-node topology is supported through an arc-node list. The list identifies the from-
and to-nodes for each arc. Connected arcs are determined by searching through the list for
common node numbers. In the above example, it is possible to determine that arcs 1, 2, and 3
all intersect because they share node 11. The computer can determine that it is possible to
travel along arc 1 and turn onto arc 3 because they share a common node (11), but it's not
possible to turn directly from arc 1 onto arc 5 because they don't share a common node.
Containment:
Many of the geographic features that may be represented cover a distinguishable area
on the surface of the earth, such as lakes, parcels of land, and census tracts. An area is
P
represented in the vector model by one or more boundaries defining a polygon. Although this
sounds counterintuitive, consider a lake with an island in the middle. The lake actually has
AP
two boundaries: one that defines its outer edge and the island that defines its inner edge. In
the terminology of the vector model, an island defines an inner boundary (or hole) of a
polygon.
The arc-node structure represents polygons as an ordered list of arcs rather than a
R
closed loop of x,y coordinates. This is called polygon-arc topology. In the illustration below,
polygon F is made up of arcs 8, 9, 10, and 7 (the 0 before the 7 indicates that this arc creates
CO
Contiguity:
Two geographic features that share a boundary are called adjacent. Contiguity is the
topological concept that allows the vector data model to determine adjacency. Polygon
topology defines contiguity. Polygons are contiguous to each other if they share a common
arc. This is the basis for many neighbor and overlay operations.
Recall that the from-node and to-node define an arc. This indicates an arc's direction
so the polygons on its left and right sides can be determined. Left-right topology refers to the
polygons on the left and right sides of an arc. In the below example, polygon B is on the left
of arc 6, and polygon C is on the right. Thus we know that polygons B and C are adjacent.
P
AP
Figure: Left-Right topology example
R
Notice that the label for polygon A is outside the boundary of the area. This polygon
is called the external, or universe, polygon and represents the world outside the study area.
CO
The universe polygon ensures that each arc always has a left and right side defined.
Topology Rules:
There are many topology rules you can implement in your geodatabase, depending on
U
the spatial relationships that are most important for your organization to maintain. You
should carefully plan the spatial relationships you will enforce on your features. Some
ST
topology rules govern the relationships of features within a given feature class, while others
govern the relationships between features in two different feature classes or subtypes.
Topology rules can be defined between sub types of features in one or another feature class.
This could be used, for example, to require street features to be connected to other street
features at both ends, except in the case of streets belonging to the cul-de-sac or dead-end
subtypes.
Many topology rules can be imposed on features in a geodatabase. A well-designed
geodatabase will have only those topology rules that define key spatial relationships needed
by an organization. Most topology violations have fixes that you can use to correct errors.
P
AP
Must Be Disjoint:
Requires that points be separated spatially from other points in the same feature class
(or subtype). Any points that overlap are errors. This is useful for ensuring that points are not
coincident or duplicated within the same feature class, such as in layers of cities, parcel lot ID
R
points, wells, or streetlamp poles.
CO
U
ST
P
AP
Must Be Properly Inside Polygons:
Requires that points fall within area features. This is useful when the point features
are related to polygons, such as wells and well pads or address points and parcels.
R
CO
Requires that points fall on the boundaries of area features. This is useful when the
point features help support the boundary system, such as boundary markers, which must be
ST
rule is used when line features must form closed loops, such as when they are defining the
boundaries of polygon features. It may also be used in cases where lines typically connect to
other lines, as with streets. In this case, exceptions can be used where the rule is occasionally
violated, as with cul-de-sac or dead-end street segments.
P
Must Not Overlap:
Requires that lines not overlap with lines in the same feature class (or subtype). This
AP
rule is used where line segments should not be duplicated, for example, in a stream feature
class. Lines can cross or intersect but cannot share segments.
R
CO
themselves but must not have coincident segments. This rule is useful for features, such as
streets, where segments might touch in a loop but where the same street should not follow the
ST
P
endpoints of) back lot lines and cannot overlap each other.
AP
R
Must Not Have Pseudo Nodes:
Requires that a line connect to at least two other lines at each endpoint. Lines that
CO
connect to one other line (or to themselves) are said to have pseudo nodes. This rule is used
where line features must form closed loops, such as when they define the boundaries of
polygons or when line features logically must connect to two other line features at each end,
as with segments in a stream network, with exceptions being marked for the originating ends
U
of first-order streams.
ST
P
for modeling administrative boundaries, such as ZIP Codes or voting districts, and mutually
AP
exclusive area classifications, such as land cover or landform type.
R
CO
perimeter of the surface. You can either ignore this error or mark it as an exception. Use this
rule on data that must completely cover an area. For example, soil polygons cannot include
ST
Contains Point:
Requires that a polygon in one feature class contain at least one point from another
feature class. Points must be within the polygon, not on the boundary. This is useful when
every polygon should have at least one associated point, such as when parcels must have an
address point.
P
administrative boundaries and their capital cities. Each point must be properly inside exactly
AP
one polygon and each polygon must properly contain exactly one point. Points must be
within the polygon, not on the boundary.
R
CO
two feature classes can share edges or vertices or be completely disjointed. This rule is used
when an area cannot belong to two separate feature classes. It is useful for combining two
ST
mutually exclusive systems of area classification, such as zoning and water body type, where
areas defined within the zoning class cannot also be defined in the water body class and vice
versa.
vertices. Any area defined in either feature class that is not shared with the other is an error.
This rule is used when two systems of classification are used for the same geographic area,
and any given point defined in one system must also be defined in the other. One such case
occurs with nested hierarchical datasets, such as census blocks and block groups or small
watersheds and large drainage basins. The rule can also be applied to non-hierarchically
related polygon feature classes, such as soil type and slope class.
P
Area Boundary Must Be Covered By Boundary of:
AP
Requires that boundaries of polygon features in one feature class (or subtype) be
covered by boundaries of polygon features in another feature class (or subtype). This is useful
when polygon features in one feature class, such as subdivisions, are composed of multiple
polygons in another class, such as parcels, and the shared boundaries must be aligned.
R
CO
U
There are two types of GIS data: spatial data (coordinate and projection information
for spatial features) and attribute data. Attribute data is additional information appended in
tabular format linked with spatial features. The attribute data is linked with spatial data
through unique id (i.e. feature ID). The spatial data contains information about where and
attribute data can contain information about what, where, and why. Attribute data provides
characteristics about spatial data.
Joins:
When our data was all in a single table, we could easily retrieve a particular row from
P
that table. But if the data we are looking for is available in two or more tables then joins can
AP
be used to retrieve those data. Join is used to fetch data from two or more tables, which is
joined to appear as single set of data. It is used for combining column from two or more
tables by using values common to both tables.
There are several types of JOINs: INNER, LEFT OUTER and RIGHT OUTER; they
all do slightly different things, but the basic theory behind them all is the same.
R
CO
Inner Join:
An INNER JOIN returns a result set that contains the common elements of the tables,
i.e. the intersection where they match on the joined condition. An INNER JOIN focuses on
the commonality between two tables. When using an INNER JOIN, there must be at least
U
some matching data between two (or more) tables that are being compared. INNER JOINs
are the most frequently used JOIN operation.
ST
P
AP
R
CO
P
AP
R
Relates:
Relates can help us to discover specific information within our data. A relate (also
CO
called a table relate) is a property of a layer. We can create a table relate so that we can query
and select features in one layer and see all the related features in another layer or
table. Unlike joining tables, relating tables simply defines a relationship between two tables.
The associated data isn't appended to the layer's attribute table like it is with a join. Instead,
U
we can access the related data through selected features or records in your layer or table.
ST
Relation Class:
A relationship class is an object in a geo-database that stores information about a
relationship between two feature classes, between a feature class and a non-spatial table, or
between two non-spatial tables. Both participants in a relationship class must be stored in the
same geo-database.
A relationship class stores information about associations among features and records
in a geo-database and can help ensure your data's integrity. Relates that are added to a layer
or table in a map are essentially the same as simple relationship classes defined in a geo-
database, except that they are saved with the map instead of in a geo-database.
P
AP
R
CO
an application to gain access to one or more data sources. ODBC provides a standard
interface to allow application developers and vendors of database drivers to exchange data
ST
application’s request so that the request conforms to syntax supported by the associated
database. A framework to easily build an ODBC drivers is available from Simba
Technologies, as are ODBC drivers for many data sources, such as Salesforce, MongoDB,
Spark and more.
The following are the steps involved in connecting application programs with the
database using ODBC API:
Load ODBC driver: The forName() method of Class class is used to register the driver
class. This method is used to dynamically load the driver class.
Establish Connection: The getConnection() method of DriverManager class is used to
establish connection with the database.
P
Prepare and Execute SQL Statement: The createStatement() method of Connection
AP
interface is used to create statement. The executeQuery() and execute() method is
used to execute queries to the database.
Process the result: The executeQuery() method returns the object of ResultSet that can
be used to get all the records of a table.
R
Close connection: The close() method is used to close the connection in order to free
the allocated resource used by the connection.
CO
The below java code is used for connecting with mysql database using ODBC
application programming interface.
U
ST
Today, GPS receivers are included in many commercial products, such as automobiles, smart
phones, exercise watches, and GIS devices.
P
Figure: Three Segments of GPS
Space Segment:
AP
The GPS space segment consists of a constellation of satellites transmitting radio
signals to users. The United States is committed to maintaining the availability of at least 24
operational GPS satellites, 95% of the time. To ensure this commitment, the Air Force has
been flying 31 operational GPS satellites for the past few years. GPS satellites fly in medium
R
Earth orbit (MEO) at an altitude of approximately 20,200 km (12,550 miles). Each satellite
circles the Earth twice a day. The satellites in the GPS constellation are arranged into six
CO
equally-spaced orbital planes surrounding the Earth. Each plane contains four "slots"
occupied by baseline satellites. This 24-slot arrangement ensures users can view at least four
satellites from virtually any point on the planet.
U
ST
repositioned, so that three of the extra satellites became part of the constellation baseline. As
a result, GPS now effectively operates as a 27-slot constellation with improved coverage in
most parts of the world.
Control Segments:
The GPS control segment consists of a global network of ground facilities that track
the GPS satellites, monitor their transmissions, perform analyses, and send commands and
data to the constellation. The current Operational Control Segment (OCS) includes a master
control station, an alternate master control station, 11 command and control antennas, and 16
monitoring sites. The locations of these facilities are shown in the map above.
P
AP
R
CO
U
efforts of its operators — the men and women of the U.S. Air Force's 2nd Space Operations
Squadron (2SOPS) and the Air Force Reserve's 19th Space Operations Squadron (19SOPS)
at Schriever Air Force Base, Colorado.
User Segments:
Like the Internet, GPS is an essential element of the global information infrastructure.
The free, open, and dependable nature of GPS has led to the development of hundreds of
applications affecting every aspect of modern life. GPS technology is now in everything from
cell phones and wristwatches to bulldozers, shipping containers, and ATM's.
P
on maps and in geographic information systems (GIS) that store, manipulate, and display
AP
geographically referenced data.
Governments, scientific organizations, and commercial operations throughout the
world use GPS and GIS technology to facilitate timely decisions and wise use of resources.
Any organization or agency that requires accurate location information about its assets can
benefit from the efficiency and productivity provided by GPS positioning. Unlike
R
conventional techniques, GPS surveying is not bound by constraints such as line-of-sight
CO
visibility between survey stations. The stations can be deployed at greater distances from
each other and can operate anywhere with a good view of the sky, rather than being confined
to remote hilltops as previously required.
GPS is especially useful in surveying coasts and waterways, where there are few land-
U
based reference points. Survey vessels combine GPS positions with sonar depth soundings to
make the nautical charts that alert mariners to changing water depths and underwater hazards.
ST
Bridge builders and offshore oil rigs also depend on GPS for accurate hydrographic surveys.
Land surveyors and mappers can carry GPS systems in backpacks or mount them on vehicles
to allow rapid, accurate data collection. Some of these systems communicate wirelessly with
reference receivers to deliver continuous, real-time, centimeter-level accuracy and
unprecedented productivity gains. To achieve the highest level of accuracy, most survey-
grade receivers use two GPS radio frequencies: L1 and L2. Currently, there is no fully
functional civilian signal at L2, so these receivers leverage a military L2 signal using
"codeless" techniques.
*****