'Digital Image Classification' - 250330 - 115910
'Digital Image Classification' - 250330 - 115910
In this spectral plot it is evident that band 4 is best for the extraction
of feature A, and band 7 is best for the extraction of feature B.
Source: http://www.sc.chula.ac.th/courseware/2309507/Lecture/remote18.htm
Selection of suitable classification method
Spatial resolution of the RS data, different sources of additional data, a
classification system, and availability of classification software must be
taken into account when selecting a classification method for use.
- hybrid approaches
Hard VS Fuzzy Classifiers
Hard Classifiers
Fuzzy Classifiers
- Fieldwork
- Interpretation form Aerial Maps
- Map analysis
- Personal experience
From Lillesand, Kiefer and Chipman (2015)
Supervised classification: classification stage
21
Supervised classification: parallelepiped (‘box’)
• Assign boundaries around the spread of a class in feature space i.e. take
account of variance
• typically use minimum/maximum of DN in a particular class to define
limits, giving a rectangle in 2D, box in 3D (if we have > 2 bands) etc.
• pixels outside of these regions are unclassified (which is good or bad,
depending on what you want!!)
• problems if class regions overlap or if high covariance between different
bands (rectangular box shape inappropriate)
– can modify algorithm by using stepped boundaries with a series of
rectangles to partially overcome such problems
• simple and fast technique
• takes some account of variations in the variance of each class
Supervised classification: parallelepiped (‘box’)
24
Supervised classification: Gaussian maximum likelihood
25
Unsupervised Classification (clustering)
27
Unsupervised classification: ISODATA (Iterative self-organising data
analysis) algorithm
28
ISODATA example: 2 classes, 2 bands
Pixel 2 Pixel 1
Pixel 1 b
DN
DN Ch 2
Ch 2
DN DN
Ch 1 All pixels Ch 1
assigned to a or SD of cluster a Split a into 2,
b - update stats too large? recalculate.
Repeat….
New positions of
New positions of
cluster means
cluster means DN
DN
Ch 2
Ch 2
29
Hybrid Approaches
30
Assessing classification accuracy
Accuracy assessment should be an important part of any classification.
Usual procedure: compare the classification with actual reference data that
is known to accurately reflect the true land-cover.
Such reference data could include field or ground truth, higher
resolution satellite images or aerial photos, topographical maps derived
from aerial photo interpretation and GIS vector datasets.
It is important to choose an appropriate sampling method and sample
size. There must be at least some observations for each class.
The accuracy assessment is a reflection of the difference between the
classification and the corresponding reference data.
Assessing classification accuracy
Ideally, accuracy assessment should not be based on the training pixels.
Reason: training pixels are usually not randomly selected and the
classification itself depends on these training pixels.
Increasing sample size results in higher precision of the estimated
accuracy measures. Decreasing sample size will lower the precision.
The Confusion Matrix
One of the most common methods of accuracy assessment is the use of a
confusion matrix. It consists of a table that relates the classes of the
classified pixels to their actual classes based on reference data.
The producer’s accuracy is referred to as the omission error. It describes
how well a certain area can be classified. The producer’s accuracy refers to
the probability that a certain land-cover of an area on the ground is
classified correctly
User’s accuracy is referred to as the commission error. The user’s accuracy
refers to the probability that a pixel labelled as a certain land-cover class on
the classification map really belongs to this class.
Overall accuracy provides an estimate of the ‘correctness’ of the
classification after talking both user and producer accuracy into account.
Kappa accuracy reflects the difference between actual agreement and the
agreement expected by chance. Kappa of 0.65 means there is a 65% better
agreement than by chance alone
Confusion matrix
In this table there were 61 possible pixels of water but only 46 were
correctly attributed to water. Thus, 15 were omitted (omission error)
On the other hand 50 pixels were attributed to water but 4 of them (2 of
agriculture and 2 of forest) were mistakenly included (commission error)
The overall classification accuracy is calculated by adding up all the
correctly classified pixels along the diagonal and dividing by the total
number of pixels:
46 + 37 + 44 x 100 = 84,7%
150
Source: http://www.yale.edu/ceo/OEFS/Accuracy_Assessment.pdf
Confusion matrix
Source: http://www.yale.edu/ceo/OEFS/Accuracy_Assessment.pdf
Confusion matrix
Whether parameters Parametric classifiers Gaussian distribution is assumed. The parameters (e.g. mean Maximum likelihood, linear
such as mean vector vector and covariance matrix) are often generated from discriminant analysis.
and covariance training samples. When landscape is complex,
matrix are used or parametric classifiers often produce ‘noisy’ results. Another
not major drawback is that it is difficult to integrate ancillary
data, spatial and contextual attributes, and non-statistical
information into a classification procedure.
Non-parametric No assumption about the data is required. Non-parametric Artificial neural network,
classifiers classifiers do not employ statistical parameters to calculate decision tree classifier,
class separation and are especially suitable for incorporation evidential reasoning, support
of non-remote-sensing data into a classification procedure. vector machine, expert
system.
Which kind of pixel Per-pixel classifiers Traditional classifiers typically develop a signature by combining Most of the classifiers, such as
information is used the spectra of all training-set pixels from a given feature. The maximum likelihood,
resulting signature contains the contributions of all materials minimum distance, artificial
present in the training-set pixels, ignoring the mixed pixel neural network, decision tree,
problems. and support vector machine.
Subpixel classifiers The spectral value of each pixel is assumed to be a linear or Fuzzy-set classifiers, subpixel
non-linear combination of defined pure materials (or classifier, spectral mixture
endmembers), providing proportional membership of each analysis.
pixel to each endmember.
Source: Lu, D. & Weng, Q. 2007.
Taxonomy of image classification methods
Criteria Categories Characteristics Example of classifiers
Which kind of pixel Object-oriented classifiers Image segmentation merges pixels into objects and classification eCognition.
information is used is conducted based on the objects, instead of an individual
pixel. No GIS vector data are used.
Per-field classifiers GIS plays an important role in per-field classification, integrating GIS-based classification
raster and vector data in a classification. The vector data are approaches.
often used to subdivide an image into parcels, and
classification is based on the parcels, avoiding the spectral
variation inherent in the same class.
Whether output is a Hard classification Making a definitive decision about the land cover class that each Most of the classifiers, such as
definitive decision pixel is allocated to a single class. The area estimation by hard maximum likelihood,
about land cover classification may produce large errors, especially from coarse minimum distance, artificial
class or not spatial resolution data due to the mixed pixel problem. neural network, decision tree,
and support vector machine.
Soft (fuzzy) classification Providing for each pixel a measure of the degree of similarity for Fuzzy-set classifiers, subpixel
every class. Soft classification provides more information and classifier, spectral mixture
potentially a more accurate result, especially for coarse spatial analysis.
resolution data classification.
Whether spatial Spectral classifiers Pure spectral information is used in image classification. A Maximum likelihood, minimum
information is used ‘noisy’ classification result is often produced due to the high distance, artificial neural
or not variation in the spatial distribution of the same class. network.
Contextual classifiers The spatially neighbouring pixel information is used in image Iterated conditional modes,
classification. point-to-point contextual
correction, frequency-based
contextual classifier.
Spectral-contextual Spectral and spatial information is used in classification. ECHO, combination of para
classifiers Parametric or non-parametric classifiers are used to generate metric or non-parametric and
initial classification images and then contextual classifiers are contextual algorithms.
implemented in the classified images.
Source: Lu, D. & Weng, Q. 2007.