Cirrus Detection Based on RPCA and Fractal Dictionary Learning in Infrared imagery

Lyu, Yuxiao; Peng, Lingbing; Pu, Tian; Yang, Chunping; Wang, Jun; Peng, Zhenming

doi:10.3390/rs12010142

Open AccessArticle

Cirrus Detection Based on RPCA and Fractal Dictionary Learning in Infrared imagery

by

Yuxiao Lyu

^1,2,

Lingbing Peng

^2,3,

Tian Pu

^2,3,

Chunping Yang

^2,3,

Jun Wang

^2,3,4 and

Zhenming Peng

^2,3,*

¹

School of Optoelectronic Science and Engineering, University of Electronic Science and Technology of China, Chengdu 610054, China

²

Laboratory of Imaging Detection and Intelligent Perception, University of Electronic Science and Technology of China, Chengdu 610054, China

³

School of Information and Communication Engineering, University of Electronic Science and Technology of China, Chengdu 611731, China

⁴

The Science and Technology on Optical Radiation Laboratory, Beijing 100854, China

^*

Author to whom correspondence should be addressed.

Remote Sens. 2020, 12(1), 142; https://doi.org/10.3390/rs12010142

Submission received: 25 November 2019 / Revised: 23 December 2019 / Accepted: 26 December 2019 / Published: 1 January 2020

(This article belongs to the Special Issue Computer Vision and Machine Learning Application on Earth Observation)

Download

Browse Figures

Versions Notes

Abstract

:

In earth observation systems, especially in the detection of small and weak targets, the detection and recognition of long-distance infrared targets plays a vital role in the military and civil fields. However, there are a large number of high radiation areas on the earth’s surface, in which cirrus clouds, as high radiation areas or abnormal objects, will interfere with the military early warning system. In order to improve the performance of the system and the accuracy of small target detection, the method proposed in this paper uses the suppression of the cirrus cloud as an auxiliary means of small target detection. An infrared image was modeled and decomposed into thin parts such as the cirrus cloud, noise and clutter, and low-order background parts. In order to describe the cirrus cloud more accurately, robust principal component analysis (RPCA) was used to get the sparse components of the cirrus cloud, and only the sparse components of infrared image were studied. The texture of the cirrus cloud was found to have fractal characteristics, and a random fractal based infrared image signal component dictionary was constructed. The k-cluster singular value decomposition (KSVD) dictionary was used to train the sparse representation of sparse components to detect cirrus clouds. Through the simulation test, it was found that the algorithm proposed in this paper performed better on the the receiver operating characteristic (ROC) curve and Precision-Recall (PR) curve, had higher accuracy rate under the same recall rate, and its F-measure value and Intersection-over-Union (IOU) value were greater than other algorithms, which shows that it has better detection effect.

Keywords:

fractal dictionary learning; robust principal component analysis (RPCA); cirrus detection; infrared imagery

Graphical Abstract

1. Introduction

Space infrared detector is an essential part of the earth observation and remote sensing system, which plays a vital role in early warning, missile interception, and other aspects, and is one of the research hotspots in the military field [1]. Infrared detection technology has such advantages as strong survivability, good portability, and the ability to detect radar blind areas [2]. The infrared and visible light detectors and telescopes carried by satellites are used to detect and track target aircraft, ships, etc. These infrared radiations are represented as infrared dim small targets in satellite infrared images, and the performance of infrared target detection algorithm is mainly reflected in the detection ability of infrared dim small targets. With the continuous development of infrared imaging detection system, algorithms for small target detection and recognition have been emerging in recent years [3,4,5,6,7,8,9,10,11]. However, because there are a large number of natural landscapes with high radiation in the imaging band of infrared images, such as cirrus, which is similar to the target in the satellite infrared image and has high gray level, it may cause false alarm of early warning system and interfere with small target detection, thus, it is difficult to detect small targets directly. In order to solve the problems existing in small target detection, it is necessary to study the imaging characteristics and detection methods of cirrus, so as to improve the accuracy and response speed of the ground detection system.

Cirrus cloud detection is also a vital part of data processing, which plays an essential role in ecological environment monitoring, weather forecasting, natural disaster prevention, and so on. Domestic and foreign scholars have proposed many cirrus cloud detection methods. They can detect cirrus clouds based on physical models such as infrared radiation and atmospheric attenuation [12,13], which requires prior knowledge. It can also be based on the time series of automatic screening detection methods [14,15], but there are difficulties in data acquisition. In recent years, with the development of artificial intelligence, many methods based on machine learning and neural network have been proposed [16,17], but this method relies on large sample image data and is not suitable for most cases. The method proposed in this paper is based on small sample image data, and the cirrus cloud is detected from the visual features and sparse representation of the cirrus cloud.

The classic principal component analysis (PCA) [18] model is to transform the high-dimensional data to the low-dimensional data, obtain the main information by reducing the dimension, and remove the sparse irrelevant information. At the same time, principal component analysis can be used to obtain sparse components, so as to obtain sparse images with cirrus clouds. PCA has always been an essential research hotspot, widely used in the signal field [19,20], but it is highly dependent on data because the noise of data assumed by PCA is Gaussian. In order to improve the PCA algorithm, Wright et al. proposed robust principal component analysis (RPCA) [21]. RPCA, on the other hand, does not assume Gaussian noise, and its core idea is that the data matrix Y can be represented as the superposition of a low-rank matrix L and a sparse matrix S under the optimization criterion, that is, Y = L + S, as shown in Figure 1. In a physical sense, the rank of a matrix measures the correlation between the columns and columns of a matrix. If the rows or columns of the matrix are linearly independent, the matrix is full rank, which means the rank is equal to the number of rows. There’s some correlation between the rows in this matrix, thus, this matrix is generally low rank. The sparse matrix means that the number of 0 elements in the matrix is much larger than the number of non-zero elements, and the distribution of 0 elements is irregular. Typical practical applications are face shadow removal [22], background estimation [23,24], and infrared dim target detection [25,26]. Face shadow removal and background estimation are mainly used to analyze the low-rank components obtained by RPCA, because faces are low-rank relative to shadows and backgrounds are low-rank relative to moving objects. The detection of infrared small and small targets is to analyze the sparse components obtained by RPCA, because small and small targets are sparse compared with the infrared background. Since different setting parameters can obtain different degrees of sparse components, while the virtual alarm source is sparse compared with the infrared background, while the background is low-rank, RPCA can be used to obtain the sparse components of the infrared image, including the virtual alarm source, noise, and clutter.

With the development of image algorithms, sparse representation, and dictionary learning are increasingly applied to target detection [27], image reconstruction [28], image denoising [29], image compression [30], and other aspects. Sparse representation is to express most or all of the data matrix Y with a linear combination of fewer basic signals. Find a coefficient matrix A and a dictionary matrix D, so that D*A can restore Y as much as possible, and A is as sparse as possible. A is the sparse representation of Y. Dictionary learning is to find the appropriate dictionary for the samples of common dense expressions and transform the samples into appropriate sparse expressions, so as to simplify the learning task and reduce the complexity of the model. The overall strategy for solving the above problems is to optimize the dictionary D and sample sparse representation A iteratively. To start, initialize dictionary D, 1. Fix dictionary D to optimize A. 2. Fix A to optimize dictionary D. Repeat the above two steps to obtain the sparse representation of A for the final D and Y, where each column d_i in D represents the dictionary atom, and each row α_i in A represents the sparse coefficient corresponding to d_i. In recent years, dictionary construction in sparse representation has developed from an orthogonal basis to over complete dictionary [31]. Compared with a complete orthogonal basis, the basis of an overcomplete basis is usually redundant, that is, the number of base elements is larger than the number of dimensions. Given an initial dictionary and a signal to be trained, the dictionary learning algorithm constantly adjusts the dictionary atoms to make the description of the signal more accurate, and finally achieve the goal of constructing redundant dictionary. K-clustering with singular value decomposition (K-SVD) [32], a representative dictionary learning algorithm, is used to construct a dictionary by minimizing the reconstruction error of the original sample in compressed sensing of images [33] and image denoising [34,35], which achieves good results.

In the cirrus cloud detection based on sparse representation, the construction of redundant dictionary is a difficult problem. Through the texture analysis of the false alarm source in the infrared image, it is found that it has fractal characteristics. Fractal characteristics mainly refer to the self-similarity of objective things, which is embodied in fractal dimension, fractal error and multifractal index. In recent years, fractal features have been widely applied in texture analysis [36,37], Image sampling [38,39] and segmentation of medical signals (one-dimensional (1D), two-dimensional (2D), or three-dimensional (3D)) [40,41]. The study shows that most of the natural objects in nature have strong fractal characteristics, which can be consistent with the fractal model. Different types of signals have different shapes and attributes, with low correlation, while the same type of signal has high correlation; thus, the specific type of signal components in infrared image can be efficiently represented by the same type of over complete dictionary [42]. The random fractal image constructed by diamond square method is similar to cloud image, thus, the redundant dictionary constructed by fractal image can effectively represent cirrus.

In this paper, a new method based on RPCA and fractal dictionary learning to detect the cirrus cloud is proposed. By studying the component composition of infrared images, it was found that cirrus, noise, and clutter are sparse relative to the background, while the background is low-rank. Infrared images are composed of low-rank images and sparse images, as shown in Figure 1. In order to study the cirrus cloud more accurately, the sparse components of the infrared image were obtained by means of Robust Principal Component Analysis (RPCA), and only the sparse components of the infrared image were studied. Because the fractal can describe cirrus well, the over-complete dictionary based on the fractal structure can characterize cirrus well. The construction of fractal dictionary can be generated according to random fractal images. The fractal dictionary can be constructed from the random fractal image obtained by diamond square algorithm [43]. Then, the fractal dictionary D_s was studied and sparsely coded by the k-clustering with singular value decomposition (KSVD) method, and the sparsely represented images were obtained. Finally, the sparse represented images were segmented by threshold values to obtain the cirrus cloud false alarm source detection results. The method proposed in this paper has a higher accuracy under the same recall rate and a larger F-measure value and Intersection-over-Union (IOU) value for the best detection effect, indicating that it has a better detection effect. As a matter of convenience, Table 1 represents the nomenclature of this paper.

2. Materials and Methods

In this paper, a new method based on fractal dictionary learning to detect cirrus was proposed. The key is to learn the constructed fractal dictionary to detect cirrus. In this section, we first introduce Robust Principal Component Analysis (RPCA), which is used to obtain sparse components of the original image. Then, the algorithm of generating random fractal image is introduced, and the fractal dictionary is constructed by fractal image. Finally, a dictionary learning algorithm based on KSVD is introduced to obtain sparse representation images of sparse components and detect cirrus.

2.1. Robust Principal Component Analysis

The component composition of infrared image shows that cirrus, noise and clutter are sparse with respect to the background, while the background is low-rank. Then, the original infrared image can be superposed by sparse component and low-rank component, that is, Y = L + S; Y represents the original infrared image, L represents low-rank background component, and S represents sparse cirrus, noise, and clutter component. Among them, the noise refers to the point-like salt and pepper noise, while the clutter refer to the coastline, water ripple, and other long impurities that will interfere with cirrus detection. Because cirrus clouds have fractal features and fractal-based dictionaries can better sparsely represent cirrus clouds, these clutters will not be detected incorrectly. Robust Principal Component Analysis (RPCA) is a current popular model, which is used in this paper to obtain the sparse Component S of infrared images, as shown in Figure 2.

Principal component analysis (PCA) is to find a low rank matrix L, which minimizes the difference between L and Y. It is considered that Y is contaminated by Gaussian noise and the optimal solution can be obtained by singular value decomposition (SVD). However, due to the existence of cirrus, noise, and clutter, the effect of PCA is poor, and the proposal of RPCA makes up for the shortcomings of PCA. Because the noise of the data assumed by the PCA is Gaussian, the PCA will be affected by it, while the RPCA does not exist this hypothesis, but only assumes that the noise is sparse. Therefore, RPCA can be used to obtain the sparse image with the cirrus cloud. Restoring sparse matrix S is a two-objective optimization problem:

m i n_{L, S} (r a n k (L) + λ {‖ S ‖}_{0}) s . t . Y = L + S

(1)

where rank(∙) is the rank of the matrix;

{‖ \cdot ‖}_{0}

is the zero norm of the matrix, which represents the non-zero number of the matrix; and λ is the compromise factor, which can control the proportion of low-rank images and sparse images.

The optimal convex approximation is as follows:

m i n_{L, S} {‖ L ‖}_{*} + λ {‖ S ‖}_{1} s . t . Y = L + S

(2)

RPCA is often used to remove image noise, and the sparse components containing cirrus can also be obtained. Different sparse images can be obtained by changing the value of λ, where the larger the λ is, the smaller the original image component of the sparse image is, as shown in Figure 2.

There are many models to solve RPCA, such as the dual method, Accelerated Proximal Gradient (APG) [44], Iterative Thresholding (IT) [45], Exact Augmented Lagrange Multiples (EALM), and Inexact Augmented Lagrange Multiples (IALM) [46]. IALM is an improvement of EALM, which requires fewer SVD times and has higher accuracy and convergence speed. Therefore, IALM is used to solve RPCA problems.

Sparse images obtained from RPCA mainly consist of cirrus component Y_S, noise, and clutter component n. When the sparse component is acquired by the RPCA method, the images with different sparseness can be acquired by controlling parameter λ. In order to get more complete cirrus clouds, there are still many noises and clutters. The specific type of signal in infrared image can be efficiently represented by the over-complete dictionary of the same type of signal, thus, the dictionary constructed by random fractal image can be used to represent cirrus clouds in infrared image. Then, the method based on fractal dictionary learning can remove noise and cirrus clouds that do not have fractal features.

2.2. Random Fractal

For the objective existence of coastlines, cirrus clouds, rivers, snow mountains, etc. in nature, when some of them are taken out and enlarged appropriately, the images obtained are not the same as the original ones. However, the complexity of dense bending is similar to the original ones, thus, the self-similarity of natural landscape is called random self-similarity. Fractals with random self-similarity are called random fractals. The random fractal images obtained by the Diamond-Square algorithm are similar to the texture images of clouds, thus, the fractal dictionary constructed from this image can efficiently represent cirrus. Next, the Diamond-Square algorithm [43] is introduced to generate random fractal images.

To generate random fractal images, the number of iterations n is first determined, and the square ABCD is meshed to generate

(2^{n} + 1) \times (2^{n} + 1)

resolution fractal images. The generation process is shown in Figure 3. The random value X at the midpoint M of the square ABCD is generated, and the calculation formula is as follows:

X = H^{t} \times 2^{- t H}

(3)

where H represents the value of Hurst index, and t represents the number of current iterations; the calculation formula for the gray value at the middle point M is as follows:

M = X + \frac{(A + B + C + D)}{4}

(4)

The midpoints of edge AB, BC, CD, and DA are E, F, G, and H, respectively. The gray values of point E are calculated according to the gray values of A, B, and M. The formulas are as follows:

E = X + \frac{(A + B + M + M)}{4}

(5)

Similarly, the gray value of point F is calculated according to the gray value of B, C, and M, the gray value of point G is calculated according to the gray value of C, D, and M, and the gray value of point H is calculated according to the gray value of D, A, and M.

The average m of four vertices gray value of small square EBFM is calculated, and the sum of average m and random value X is taken as the gray value of the middle point of small square EBFM. By analogy, the gray value of the middle point of small square MFCG, HMGD, and AEMH is obtained. Repeat the above steps until the current iteration times satisfy n < t, and get the random fractal image, as shown in Figure 4.

Each pixel point in a random fractal image of M × N size is taken as the center, and the atomic sample block with the size of

s \times s

pixels is selected to convert the atomic sample block into a column vector, so as to obtain (M × N) sample atoms. According to the sample atoms, an over-complete dictionary D is formed, that is, an original dictionary atomic matrix with

(s \times s)

rows and (M × N) columns is obtained. The construction process is shown in Figure 5.

2.3. Sparse Representation and Dictionary Learning

The purpose of sparse representation is to represent the signal with as few atoms as possible in a given over-complete dictionary, so as to obtain the information in the signal more easily and facilitate further processing of the signal, such as compression and encoding. The sparse representation model can be described as:

S = ψ \times A

(6)

where

ψ

is expressed as a sparse transform basis and A is expressed as a sparse coefficient. The key to sparse representation is the choice of

ψ

. At present, the most widely used is the sparse representation based on redundant dictionary D. The redundant dictionary is composed of vectors, in which each column is the atom of the dictionary. Dictionary learning is mainly to update the dictionary after the initial dictionary is fixed and adjust the redundant dictionary according to the specific iterative method, so as to get a better sparse representation.

Sparse representation and dictionary learning were first used to solve the signal processing problem in compressed sensing, but now they are increasingly used in image processing. By applying sparse representation and dictionary learning methods to image processing, noise in image can be separated simply and efficiently, and image quality can be improved.

2.3.1. Orthogonal Matching Pursuit Algorithm

The sparse representation model can be transformed into the following forms:

\hat{α_{i}} = a r g m i n_{a_{i}} ‖ S_{i} - D_{s} α_{i} ‖_{2}^{2} s u b j e c t t o ‖ α_{i} ‖_{0} \leq T_{0}

(7)

where S =

{s_{1}, s_{2}, \dots, s_{n - 1}, s_{n}}

, A =

{α_{1}, α_{2}, \dots, α_{n - 1}, α_{n}}

,

\hat{α_{i}}

represents the column with index i in the sparse coefficient matrix A, and T₀ represents the sparsity. The goal of sparse representation is to solve for A.

The greedy algorithm based on redundant dictionary is widely used because of its high efficiency and high accuracy. Matching Pursuit (MP) and Orthogonal Matching Pursuit (OMP) [47] are commonly used in greedy algorithms.

OMP algorithm is an improvement on the MP algorithm with faster convergence speed. The improvement is to orthogonalize all selected atoms at each step of decomposition. The main idea of OMP algorithm is to select the best atom to enter the atom set according to the matching degree, find the projection of the measured signal in the orthogonal space of the atom set, get the optimal sparse approximate solution of the original signal by solving the least square problem, update the signal margin, and make it enter the next iteration. Finally, the signal is linearly represented by atoms through a certain iteration process.

2.3.2. Dictionary Learning Based on KSVD

KSVD algorithm is mainly divided into two stages, the first is sparse coding and the second is dictionary learning. The KSVD algorithm is used to fix the initial dictionary D first, and then, the following two stages are carried out. The objective function of dictionary learning is:

D, A = a r g m i n_{D, A} ‖ {S - D A ‖}_{2}^{2} s . t . \forall i, ‖ α_{i} ‖_{0} < T_{0}

(8)

where

S \in R^{m \times n}

is a matrix to be decomposed,

D \in R^{m \times k}

is a dictionary (when

k ≫ m

, D is an over-complete dictionary),

A \in R^{k \times n}

is a sparse coefficient matrix, and

α_{i}

denotes the row with the subscript i in the sparse coefficient matrix A.

In the first stage, the OMP algorithm is mainly used to solve the sparse coefficient matrix A.

In the second stage, dictionary learning is a further operation of sparse representation. The objective function can ignore the penalty term

‖ α_{i} ‖_{0}

and change it to the following form:

m i n {‖ S - D A ‖}_{2}^{2} = m i n \sum_{i} (‖ S_{i} - D α_{i} ‖_{2}^{2})

(9)

The KSVD algorithm is used to update dictionary and sparse coding simultaneously. The dictionary is updated by column by column. When column k is updated, other atoms remain unchanged.

m i n {‖ (S - \sum_{i \neq k} d_{i} α_{i}) - d_{k} α_{k} ‖}_{2}^{2} = m i n {‖ E_{k} - d_{k} α_{k} ‖}_{2}^{2}

(10)

where E is the error estimation matrix and

α_{k}

is the sparse coefficient corresponding to the atom in the kth column to be updated, which is the kth row of the sparse coefficient matrix.

The Singular Value Decomposition (Singular Value Decomposition, SVD) method can be used to solve the two solutions. Firstly, the zero element in

E_{k}

should be removed, that is, the position of 0 in the corresponding

α_{k}

of

E_{k}

is removed, and the new

E_{k}^{'}

matrix and

α_{k}^{'}

vector can be obtained. In this case, the optimization problem can be described as:

m i n_{d_{k}, α_{k}} ‖ E_{k}^{'} - d_{k} α_{k}^{'} ‖_{2}^{2}

(11)

Singular value decomposition of

E_{k}^{'}

is performed to obtain

E_{k}^{'} = U Δ V^{T}

. Take the first column vector

u 1 = U (:, 1)

of left singular value matrix U as

d_{k}

, that is,

d_{k} = u 1

. Take the product of the first row vector of the right singular value matrix and the first singular value as a product of

α_{k}^{'}

, that is,

α_{k}^{'} = Δ (1, 1) V^{T} (1, :)

, and get the corresponding

α_{k}

according to

α_{k}^{'}

.

After fixing the fractal dictionary, learn the dictionary according to ksvd algorithm, as shown in Figure 6. Figure 6a shows the initial dictionary constructed according to the random fractal image, Figure 6b shows the initial fractal dictionary displayed by converting each column of atoms into image blocks, and Figure 6c shows the learned fractal dictionary.

2.4. Cirrus Detection by RPCA and Fractal Dictionary Learning

The algorithm flow is shown in Table 2. First of infrared image

Y \in R^{m \times n}

RPCA decomposition, get sparse component

S \in R^{m \times n}

, the parameter

λ

value of 0.03. Next, block column vectorization is carried out for S. In this paper, image blocks of size

s \times s

are selected, with each pixel point as the center point, image blocks are selected and converted into column vectors to obtain a matrix of size

s^{2} \times (m \times n)

, which is still named S. Because the sparse component of cirrus has fractal characteristic, thus, the use of the diamond–square algorithm to generate random fractal image

I \in R^{M \times N}

to construct a complete dictionary

D_{s} \in R^{s^{2} \times k}

(if

M \times N > k

, then D_s has k columns; if M by N is less than k, let k be M by N). Then the sparse component S is sparsely represented by KSVD algorithm, and the learned dictionary D_l and sparse coefficient matrix A are obtained. Sparse representation of sparse component S is reconstructed by D_l and A; then, morphological filtering and threshold segmentation are performed. Morphological filtering is the application of open and close operations to selectively remove noise and irrelevant targets at specified scales in texture details while retaining other useful information. Open operation removes the smaller points in the image. Closed operation transforms the fracture structure into a whole. Finally, the detected cirrus image

C \in R^{m \times n}

is obtained.

3. Results

In order to better illustrate the performance of infrared imaging cirrus detection method based on fractal dictionary learning, nine representative cirrus infrared images are tested in this paper, as shown in Figure 7. The test data was derived from the near-infrared band of Landsat8 data set. Let us introduce the morphology and distribution of cirrus. The cirrus in test image (a) are slender and sparsely distributed in the image, with sky and large clouds in the background. The image size is 320 × 256. The cirrus cloud shape of the test image (b) is filamentous and coiled, which are densely distributed in the whole image. The image size is 230 × 162. The cirrus clouds in the test image (c) are densely distributed point-shaped, with mountains and coastlines in the background. The image size is 232 × 162. In the test image (d), strip and cluster cirrus clouds are randomly distributed over the coast and sea water. The image size is 247 × 156. The test images (e) and (f) are similar to each other and both are clustered cirrus clouds with sparse distribution. The image sizes are 349 × 265 and 255 × 171, respectively. The cirrus clouds in the test images (g) and (h) are spot-shaped and densely distributed in the images. The image sizes are both 2035 × 1291. The cirrus clouds in the test image (g) are densely distributed in the lower left, and the test image (h) is densely distributed in the entire image. The cirrus clouds in the test image (i) are large and small, and are sparsely distributed in the entire image. The image size is 329 × 241. These nine test images cover the shape and distribution of most cirrus images, and their test experiments are more convincing.

In order to objectively evaluate the method proposed in this paper, it is compared with the cirrus detection method based on extracting fractal features and the classical detection method. The objective evaluation methods include the receiver operating characteristic (ROC) curve, Precision-Recall (PR) curve, comprehensive evaluation index (F-Measure), and Intersection-over-Union (IOU). The software used is MATLAB R2018.

3.1. Parameter Settings

First, the method proposed in this paper is to perform block decomposition of image

I \in R^{m \times n}

. The specific step is to select an image block of size s × s, with each pixel as the center point, and convert it into column vectors to obtain a matrix of size

s^{2} \times (m \times n)

. The key problem is to find the appropriate s value. In this paper, the s value was set to 8, 15, 20, 30, 40, 45 to find the best s value.

In order to objectively evaluate the value of s, the receiver operating characteristic (ROC) curve and Precision-Recall (PR) curve were used to evaluate six of the images.

The ROC curve is a functional image that describes the sensitivity. ROC curve can be achieved by describing true positive rate (TPR) and false positive rate (FPR). The ROC curve is also known as the correlation operation characteristic curve, because it is used as the standard by comparing two operation characteristics (TPR and FPR). Its abscissa is FPR and its ordinate is TPR. In addition, Area Under Curve (AUC) can be used as a quantitative evaluation index of ROC Curve. Generally, the larger the AUC, the better the detection effect of ROC curve.

ROC and PR curves are supervised evaluations, which need to manually mark the ground truth image as shown in Figure 8a. The predicted image is shown in Figure 8b. The concepts of TP, FP, FN, and TN are illustrated by the obfuscation matrix in Table 3.

TPR = \frac{T P}{T P + F N}

(12)

FPR = \frac{F P}{F P + T N}

(13)

where TP represents the total number of pixels in which the pixel value after I threshold segmentation is 1 and the pixel value in ground truth is also 1. FP represents the total number of pixels whose I threshold value is 1 and the corresponding ground truth is 0. FN represents the total number of pixels whose pixel value after I threshold segmentation is 0 and the pixel value in ground truth is also 1. TN represents the total number of pixels whose pixel value after I threshold segmentation is 0 and the pixel value in ground truth is also 0.

In this paper, the ROC curve of Figure 9 will be used to represent the detection effect of different images under different s values, and the closer to the upper left corner, the better. The PR curve in Figure 10 is closer to the upper right corner, the better the detection effect. Table 4 shows the area under the curve of ROC curve in Figure 9, and Table 5 shows the area under the curve of PR curve in Figure 10. The closer the value is to 1, the better the detection effect is.

In order to solve the shortcomings of ROC curve, PR curve is proposed, which is precision-recall curve, recall as abscissa axis, precision as ordinate axis. When the output image is labeled as the target, the recall rate will be equal to 100%, but the precision rate is very low. However, for ROC images, the evaluation effect is still very good. At this time, the PR curve will play a vital role.

precision = \frac{T P}{T P + F P}

(14)

recall = \frac{T P}{T P + F N}

(15)

According to ROC curve of Figure 9 and PR curve of Figure 10, it was found that the effect is better when s is 8 and 15, but when s is 15, the running time is 19.620 s, and when s is 8, the running time is 12.983 s. Therefore, the s value was set to 8.

3.2. Experimental Results and Analysis

The experimental results of the proposed algorithm for the test image of Figure 7 are shown in Figure 11. (a) Represents the sparse image obtained by RPCA decomposition. (b) The coefficients obtained by updating and sparse encoding fractal dictionary using KSVD algorithm are used to represent the image. (c) Represents the final result of threshold segmentation.

From Figure 11, it can be seen that the low rank components of infrared images can be removed by robust principal component analysis, and the sparse components including cirrus can be obtained. Then the sparse representation image is obtained by updating the fractal dictionary and sparse coding according to the KSVD algorithm. At this time, most of the noise and clutter in the image have been removed, and some image selection has been normalized. Finally, threshold segmentation is carried out according to OTSU method to obtain the final detection result image.

3.3. Evaluation

In order to evaluate the performance of the algorithm objectively, the receiver operating characteristic (ROC) curve, Precision-Recall (PR) curve, comprehensive evaluation index (F-measure), and Intersection-over-Union (IOU) are used to evaluate the performance of the algorithm. The proposed method will be compared with fractaldim [40], DivisorstepTP [48], MaxMedian, EightpixelTP [49], singularityExponent [50], and areaMeasure methods [51].

The method based on fractal dictionary proposed in this paper has time advantages in other methods of extracting fractal features. The time complexity of RPCA is O(CNlgN), where C represents the number of iterations and N represents the number of image pixels. The KSVD algorithm is mainly divided into two stages, the first is sparse coding and the second is dictionary learning. The time complexity of dictionary learning algorithm based on KSVD is O (t(n^2*m+m^2*n)), where t is the number of iterations and m*n is the size of the image. The total time complexity is O(CNlgN+t(n^2*m+m^2*n)). Table 6 shows the average running time of different methods.

In order to observe the experimental results more intuitively, the ROC curve in Figure 12 shows the overall evaluation of the detection effect of different algorithms in different test images. Each point on the curve represents the false alarm rate and recall rate under different thresholds. Where, the ROC curves of (a–i) in Figure 12 respectively represent the ROC curves of (a–i) images in Figure 7. Table 7 shows the area under the ROC curves in Figure 12. The closer the value is to 1, the better the detection effect. The bold number in the table represent the maximum value.

Figure 13 shows the PR curves of different test images. Each point on the curve represents recall and precision under different thresholds. Through the analysis of 9 PR curve images, it can be seen that the proposed algorithm has a higher accuracy under the same recall rate, which indicates that it has a better detection effect. Where, the PR curves of (a–i) in Figure 13 respectively represent the PR curves of (a–i) images in Figure 7. Table 8 shows the area under the PR curves in Figure 13. The closer the value is to 1, the better the detection effect.

The conflict between precision and recall may occur, thus, they need to be considered comprehensively. The most common method is F-Measure (also known as F-Score). F-Measure is the weighted harmonic average of precision, and recall:

F - Measure = \frac{(α^{2} + 1) p r e c i s o n \times r e c a l l}{α^{2} (p r e c i s o n + r e c a l l)}

(16)

The value of

α^{2}

is generally 0.3, which increases the weight of precision and considers the precision to be more essential than the recall. Because when the model marks all the output images as targets, the recall rate will be equal to 100%, but the precision rate is very low.

Table 9 shows the F-Measure corresponding to the detection results of the above methods in nine test images. For each test image, the maximum value is shown in bold. It can be seen that the method proposed in this paper not only has better precision rate, but also has a good recall rate and better detection effect in the detection of false alarm source.

The full Intersection of IOU is called Intersection over Union, which is the ratio of intersection and union of result image obtained by threshold segmentation of image I (predicted) and ground truth image.

I O U = \frac{p r e d i c t e d \cap g r o u n d t r u t h}{p r e d i c t e d \cup g r o u n d t r u t h}

(17)

Table 10 shows the IOU corresponding to the detection results of the above method in 9 test images. For each test image, the maximum value is shown in bold. It can be seen that the method proposed in this paper has better IOU on the cirrus detection and better detection effect.

4. Discussion

With the continuous development of infrared imaging detection system, in recent years, small target detection and recognition algorithms continue to emerge, but there are few algorithms to assist small and weak target detection by detecting the false alarm source. In this paper, a new method to detect the false alarm source of the cirrus cloud based on RPCA and fractal dictionary learning was proposed. Considering the sparsity of the cirrus cloud, the sparse component of infrared image was obtained by RPCA, which includes the cirrus cloud and noise. Then, the noise in the image was removed by using the fractal dictionary learning method, and finally, the cirrus cloud image was obtained.

Fractal is more and more widely used, and new methods of extracting fractal features are emerging, including the box counting method (fractaldim) to extract fractal dimension, the step-by-step triangular prism method (DivisorstepTP) to extract fractal dimension, the eight pixel triangular prism method (EightpixelTP) to extract fractal dimension, multi-scale fractal area (areaMeausre), and the singular index of multifractal analysis (singularityExponent). Because the cirrus cloud has self-similarity, it can be detected by extracting fractal features. The fractal Dictionary of this paper is also based on the fractal characteristics of the cirrus cloud, and it also verifies that the algorithm of fractal dictionary is better than other fractal algorithms.

The performance of the proposed algorithm is fully verified by experiments. According to the ROC curve in Figure 12, it can be seen that the proposed algorithm curve is generally closer to the upper left corner, so its detection effect is better. Figure 13 shows the PR curve of nine images. The algorithm curve proposed in this paper is closer to the upper right corner, with better effect. Table 7 and Table 8, respectively, represent the area under the ROC curve (AUC) in Figure 12 and the area under the PR curve (AUCpr) in Figure 13. Generally, the higher the AUC value, the better the detection effect. The algorithm proposed in this paper showed that the effect of the ROC curve and AUC value was lower than that of other algorithms, for example, as shown in Figure 12g, the Fractaldim algorithm, DivisorstepTP algorithm, and singularityExponent algorithm correspond to the ROC curve with a large AUC value. However, when the points on the curve were observed, the false alarm rate was relatively high. As can be seen from the PR curve evaluation in Figure 13g, the accuracy rate was not high and the detection effect was not good. According to the ROC curve, it can be seen that in the proposed algorithm, the recall rate is high, there is a low false alarm rate, the AUC value is bigger, and it gives better detection results. Because ROC curve ignores the accuracy, in order to evaluate the detection effect more accurately, the ROC curve and PR curve are used to evaluate the algorithm at the same time. Table 9 shows the comprehensive evaluation index (F-measure) of nine test images. It can be seen from the bold value that the F-measure of the proposed method is higher than other methods after combining the precision and recall indexes. Table 10 shows the intersection over union (IOU). It can be seen from the bold value that the IOU value of this method is higher than other algorithms, indicating that the segmentation detection effect of this method is better. In conclusion, the method based on RPCA and fractal dictionary learning proposed in this paper has good detection performance for the detection of cirrus false alarm source.

5. Conclusions

In this paper, a novel infrared cirrus detection method based on RPCA and fractal dictionary learning was proposed to suppress the false alarm sources in infrared detection system. The algorithm focuses on the construction of fractal dictionary for dictionary learning, in order to characterize cirrus cloud more reasonably and completely. Cirrus clouds usually satisfy fractal distribution, such as irregular shape, rough gray surface, complex texture, self-similarity, etc. Since the signal components of a specific type in infrared images can be effectively represented by an over-complete dictionary of the same type of signals, fractal dictionaries based on random fractal construction can well represent false alarm sources. Compared with the traditional detection method, the improved scheme has better detection performance and precision; its quality index, such as ROC, PR, AUC, IOU value, and F-measure, also shows better performance. As an auxiliary scheme, cirrus false alarm source detection and forecast is effective approach to improve the performance of photoelectric detection system, especially in small target detection. The proposed method is suitable for infrared images with single false alarm source. If there are several false alarm sources coexisting in the imaging area, more complex algorithms need to be further considered, such as hybrid modeling with multiple features for infrared imagery and more complete adaptive dictionary learning scheme.

Author Contributions

Y.L. proposed the original idea, performed the experiments, and wrote the manuscript. L.P., T.P., C.Y., and J.W. reviewed and edited the manuscript. Z.P. contributed to the direction, content, and revised the manuscript. All authors have read and agreed to the published version of the manuscript.

Funding

This work was funded by the National Natural Science Foundation of China (61571096 and 61775030), Open Research Fund of Key Laboratory of Optical Engineering, Chinese Academy of Sciences (2017LBC003), and Sichuan Science and Technology Program (2019YJ0167 and 2019YFG0307).

Acknowledgments

The authors would thank Geospatial Data Cloud Website for providing experimental data, and also appreciate that this research works are supported by Science and Technology on Optical Radiation Laboratory.

Conflicts of Interest

The authors declare no conflict of interest.

References

Wang, L.; Zhang, R.; Lin, Y.; Xu, S.L. Application in the military of the IR detection technology. Infrared Laser Eng. 2008, 37, 570–574. [Google Scholar]
Wang, H.; Xiaoliang, S.; Yang, S.; Qifeng, Y. Present State and Perspectives of Small Infrared Targets Detection Technology. Infrared Technol. 2015, 37, 1–10. [Google Scholar]
Liu, X.; Chen, Y.; Peng, Z.; Wu, J.; Wang, Z. Infrared image super-resolution reconstruction based on quaternion fractional order total variation with Lp quasinorm. Appl. Sci. 2018, 8, 1864. [Google Scholar] [CrossRef] [Green Version]
Peng, Z.; Zhang, Q.; Wang, J.; Zhang, Q.P. Dim target detection based on nonlinear multi-feature fusion by Karhunen-Loeve transform. Opt. Eng. 2004, 43, 2954–2958. [Google Scholar]
Zhang, L.; Peng, L.; Zhang, T.; Cao, S.; Peng, Z. Infrared small target detection via non-convex rank approximation minimization joint l2, 1 norm. Remote Sens. 2018, 10, 1821. [Google Scholar] [CrossRef] [Green Version]
Zhang, L.; Peng, Z. Infrared Small Target Detection Based on Partial Sum of the Tensor Nuclear Norm. Remote Sens. 2019, 11, 382. [Google Scholar] [CrossRef] [Green Version]
Zhang, T.; Wu, H.; Liu, Y.; Peng, L.; Yang, C.; Peng, Z. Infrared Small Target Detection Based on Non-Convex Optimization with Lp-Norm Constraint. Remote Sens. 2019, 11, 559. [Google Scholar] [CrossRef] [Green Version]
Wang, X.; Peng, Z.; Kong, D.; He, Y. Infrared dim and small target detection based on stable multi-subspace learning in heterogeneous scene. IEEE Trans. Geosci. Remote Sens. 2017, 55, 5481–5493. [Google Scholar] [CrossRef]
Wang, X.; Peng, Z.; Zhang, P.; He, Y. Infrared small target detection via nonnegativity-constrained variational mode decomposition. IEEE Geosci. Remote Sens. Lett. 2017, 14, 1700–1704. [Google Scholar] [CrossRef]
Wang, X.; Peng, Z.; Kong, D.; Zhang, P. Infrared dim target detection based on total variation regularization and principal component pursuit. Image Vis. Comput. 2017, 63, 1–9. [Google Scholar] [CrossRef]
Peng, L.; Zhang, T.; Liu, Y.; Li, M.; Peng, Z. Infrared Dim Target Detection using Shearlet’s Kurtosis Maximization Under Non-Uniform Background. Symmetry 2019, 11, 723. [Google Scholar] [CrossRef] [Green Version]
Foga, S.; Scaramuzza, P.L.; Guo, S.; Zhu, Z.; Dilley, R.D., Jr.; Beckmann, T.; Schmidt, G.L.; Dwyer, J.L.; Hughes, M.J.; Laue, B. Cloud detection algorithm comparison and validation for operational Landsat data products. Remote Sens. Environ. 2017, 194, 379–390. [Google Scholar] [CrossRef] [Green Version]
Qiu, S.; He, B.; Zhu, Z.; Liao, Z.; Quan, X. Improving Fmask cloud and cloud shadow detection in mountainous area for Landsats 4–8 images. Remote Sens. Environ. 2017, 199, 107–119. [Google Scholar] [CrossRef]
Xiaolin, Z.; Helmer, E.H. An automatic method for screening clouds and cloud shadows in optical satellite image time series in cloudy regions. Remote Sens. Environ. 2018, 214, 135–153. [Google Scholar]
Chen, Q.; Wu, Y.; Ye, J.; Xie, D. Cloud Detection Method for Remote Sensing Image in Urban Area. Remote Sens. Inf. 2018, 33, 57–61. [Google Scholar]
Hughes, M.; Daniel, H. Automated Detection of Cloud and Cloud Shadow in Single-Date Landsat Imagery Using Neural Networks and Spatial Post-Processing. Remote Sens. 2014, 6, 4907–4926. [Google Scholar] [CrossRef] [Green Version]
Gu, Y.; Wang, S.; Shi, T.; Lu, Y.; Clothiaux, E.E.; Yu, B. Multiple-kernel learning-based unmixing algorithm for estimation of cloud fractions with MODIS and CloudSat data. In Proceedings of the Geoscience & Remote Sensing Symposium, Munich, Germany, 22–27 July 2012. [Google Scholar]
Meng, S.; Huang, L.T.; Wang, W.Q. Tensor Decomposition and PCA Jointed Algorithm for Hyperspectral Image Denoising. IEEE Geosci. Remote Sens. Lett. 2016, 13, 1–5. [Google Scholar] [CrossRef]
Feldman, D.; Schmidt, M.; Sohler, C. Turning big data into tiny data: Constant-size coresets for k-means, pca and projective clustering. In Proceedings of the Twenty-Fourth Annual ACM-SIAM Symposium on Discrete Algorithms. Society for Industrial and Applied Mathematics, Philadelphia, PA, USA, 6–8 January 2013; pp. 1434–1453. [Google Scholar]
Huizinga, W.; Poot, D.H.; Guyader, J.M.; Klaassen, R.; Coolen, B.F.; van Kranenburg, M.; Van Geuns, R.J.; Uitterdijk, A.; Polfliet, M.; Vandemeulebroucke, J.; et al. PCA-based groupwise image registration for quantitative MRI. Med. Image Anal. 2016, 29, 65–78. [Google Scholar] [CrossRef]
Wright, J.; Ganesh, A.; Rao, S.; Peng, Y.; Ma, Y. Robust principal component analysis: Exact recovery of corrupted low-rank matrices via convex optimization. In Proceedings of the Advances in neural information processing systems, Vancouver, BC, Canada, 7–10 December 2009; pp. 2080–2088. [Google Scholar]
Wang, L.; Cheng, H. Robust principal component analysis for sparse face recognition Intelligent Control and Information Processing (ICICIP). In Proceedings of the 2013 Fourth International Conference on, Beijing, China, 9–11 June 2013. [Google Scholar]
Ebadi, S.E.; Izquierdo, E. Foreground segmentation via dynamic tree-structured sparse RPCA. In European Conference on Computer Vision; Springer: Cham, Switzerland, 2016; pp. 314–329. [Google Scholar]
Yang, B.; Zou, L. Robust foreground detection using block-based RPCA. Opt. Int. J. Light Electron Opt. 2015, 126, 4586–4590. [Google Scholar] [CrossRef]
Yang, D.; Liao, G.; Zhu, S.; Yang, X. RPCA based moving target detection in strong clutter background. In Proceedings of the 2015 IEEE Radar Conference (RadarCon), Arlington, VA, USA, 10–15 May 2015. [Google Scholar]
Oveis, A.H.; Sebt, M.A. Dictionary-Based Principal Component Analysis for Ground Moving Target Indication by Synthetic Aperture Radar. IEEE Geosci. Remote Sens. Lett. 2017, 99, 1–5. [Google Scholar] [CrossRef]
Zhang, Y.; Wang, X.; Xie, X.; Li, Y. Salient Object Detection via Recursive Sparse Representation. Remote Sens. 2018, 10, 652. [Google Scholar] [CrossRef] [Green Version]
Hu, Z.; Gao, J.; Zhang, N.; Yang, Y.; Liu, X.; Zheng, H.; Liang, D. An improved statistical iterative algorithm for sparse-view and limited-angle CT image reconstruction. Sci. Rep. 2017, 7, 10747. [Google Scholar] [CrossRef] [PubMed]
Zhuang, L.; Bioucas-Dias, J.M. Fast Hyperspectral Image Denoising and Inpainting Based on Low-Rank and Sparse Representations. IEEE J. Sel. Top. Appl. Earth Obs. Remote Sens. 2018, 99, 1–13. [Google Scholar] [CrossRef]
Hou, J.; Chau, L.P.; Magnenat-Thalmann, N.; He, Y. SLRMA: Sparse Low-Rank Matrix Approximation for Data Compression. IEEE Trans. Circuits Syst. Video Technol. 2015, 27, 1043–1054. [Google Scholar] [CrossRef]
Qayyum, A.; Malik, A.S.; Naufal, M.; Saad, M.; Mazher, M.; Abdullah, F.; Abdullah, T.A. Designing of overcomplete dictionaries based on DCT and DWT. In Proceedings of the 2015 IEEE Student Symposium in Biomedical Engineering & Sciences ISSBES, Shah Alam, Malaysia, 4 November 2015. [Google Scholar]
Aharon, M.; Elad, M.; Bruckstein, A. K-SVD: An algorithm for designing overcomplete dictionaries for sparse representation. IEEE Trans. Signal Process. 2006, 54, 4311–4322. [Google Scholar] [CrossRef]
Zhai, X.; Zhu, W.; Kang, B. Compressed sensing of images combining KSVD and classified sparse representation. Comput. Eng. Appl. 2015, 51, 193–198. [Google Scholar]
Yan, X.; Yang, B.; Zhang, W.; Liu, C.; Wang, Y. An Improved Denoising Algorithm of Feather and Down Image Based on KSVD. In Proceedings of the 2016 8th International Conference on Information Technology in Medicine and Education (ITME), IEEE Computer Society, Fuzhou, China, 23–25 December 2016. [Google Scholar]
Zhang, Y.; Ji, K.; Deng, Z.; Zhou, S.; Zou, H. Clustering-based SAR image denoising by sparse representation with KSVD. In Proceedings of the IGARSS 2016-2016 IEEE International Geoscience and Remote Sensing Symposium, Beijing, China, 10–15 July 2016. [Google Scholar]
Xu, Y.; Yang, X.; Ling, H.; Ji, H. A new texture descriptor using multifractal analysis in multi-orientation wavelet pyramid. In Proceedings of the Computer Vision and Pattern Recognition, San Francisco, CA, USA, 13–18 June 2010; pp. 161–168. [Google Scholar]
Xu, Y.; Quan, Y.; Ling, H.; Ji, H. Dynamic texture classification using dynamic fractal analysis. In Proceedings of the 2011 International Conference on Computer Vision, Barcelona, Spain, 6–13 November 2011; pp. 1219–1226. [Google Scholar]
Liu, M.; Zhao, Y.; Liang, J.; Lin, C.; Bai, H.; Yao, C. Depth Map Up-sampling with Fractal Dimension and Texture-Depth Boundary Consistencies. Neurocomputing 2017, 257, 185–192. [Google Scholar] [CrossRef]
Zhang, Y.; Fan, Q.; Bao, F.; Liu, Y.; Zhang, C. Single-Image Super-Resolution Based on Rational Fractal Interpolation. IEEE Trans. Image Process. 2018, 27, 3782–3797. [Google Scholar]
Zhang, Y.D.; Chen, X.Q.; Zhan, T.M.; Jiao, Z.Q.; Sun, Y.; Chen, Z.M.; Yao, Y.; Fang, L.T.; Lv, Y.D.; Wang, S.H. Fractal Dimension Estimation for Developing Pathological Brain Detection System Based on Minkowski-Bouligand Method. IEEE Access 2017, 4, 5937–5947. [Google Scholar] [CrossRef]
Jin, X.; Qi, D.; Wu, H.; Yu, L.; Zhang, P. Log x-ray image edge detection based on fractal-morphology analysis. In Proceedings of the Chinese Control and Decision Conference, Xuzhou, China, 26–28 May 2010; pp. 2862–2866. [Google Scholar]
Liu, D.; Li, Z.; Liu, B.; Chen, W.; Liu, T.; Cao, L. Infrared Small Target Detection in Heavy Sky Scene Clutter Based on Sparse Representation. Infrared Phys. Technol. 2017, 85, 13–31. [Google Scholar] [CrossRef]
Rian, I.M.; Asayama, S. Computational Design of a nature-inspired architectural structure using the concepts of self-similar and random fractals. Autom. Constr. 2016, 66, 43–58. [Google Scholar] [CrossRef]
Toh, K.C.; Yun, S. An Accelerated Proximal Gradient Algorithm for Nuclear Norm Regularized Least Squares Problems. Pac. J. Optim. 2010, 6, 615–640. [Google Scholar]
Fan, R.Y.; Wang, H.X.; Zhang, H. A New Analysis of the Iterative Threshold Algorithm for RPCA by Primal-Dual Method. Adv. Mater. Res. 2014, 989–994, 2462–2466. [Google Scholar] [CrossRef]
Lin, Z.; Chen, M.; Ma, Y. The Augmented Lagrange Multiplier Method for Exact Recovery of Corrupted Low-Rank Matrices. arXiv 2010, arXiv:1009.5055. [Google Scholar]
Cai, T.T.; Wang, L. Orthogonal Matching Pursuit for Sparse Signal Recovery with Noise. IEEE Trans. Inf. Theory 2011, 57, 4680–4688. [Google Scholar] [CrossRef]
Ju, W.; Lam, S.N. An improved algorithm for computing local fractal dimension using the triangular prism method. Comput. Geosci. 2009, 35, 1224–1233. [Google Scholar] [CrossRef]
Zhou, Y.; Fung, T.; Leung, Y. Improved triangular prism methods for fractal analysis of remotely sensed images. Comput. Geosci. 2016, 90, 64–77. [Google Scholar] [CrossRef]
Salat, H.; Murcio, R.; Arcaute, E. Multifractal methodology. Phys. A Stat. Mech. Appl. 2017, 473, 467–487. [Google Scholar] [CrossRef]
Zhang, K.; Zhang, Q.; Yang, X. Extended target detection in complex background based on fractal theory. Proc. SPIE Int. Soc. Opt. Eng. 2009, 7283, 728331. [Google Scholar]

Figure 1. Description of the composition of the proposed infrared image. The data is decomposed into low rank components and sparse components, which include false alarm sources and noise. The three parts are described separately, i.e., learning low rank components and sparse components, in which D_s is a fractal dictionary constructed from random fractal image.

Figure 2. (a) Original infrared image; (b) sparse image of

λ

= 0.01; (c) sparse image of

λ

= 0.03.

Figure 2. (a) Original infrared image; (b) sparse image of

λ

= 0.01; (c) sparse image of

λ

= 0.03.

Figure 3. Generation process of random fractal images.

Figure 4. (a) random image generated when n = 4 (resolution 17 × 17) (b) random fractal image generated when n = 6 (resolution 65 × 65) (c) random fractal image generated when n = 9 (resolution 513 × 513).

Figure 5. Process of constructing fractal dictionary based on random fractal image.

Figure 6. (a) Fractal dictionary; (b) fractal dictionary represented by image blocks; (c) learned dictionary.

Figure 7. Cirrus images of nine scenes. (a) Slender cirrus; (b) wispy and curly cirrus; (c)pointy cirrus; (d) strip and cluster cirrus; (e) cluster cirrus; (f) cluster cirrus; (g) pointy cirrus; (h) densely distributed punctate cirrus; (i) sparse distribution of large and small cirrus.

Figure 8. (a) Groundtruth image; (b) predicted image.

Figure 9. ROC curves of six images under different s values.

Figure 10. PR curves of 6 images under different s values.

Figure 11. Detecting results. (a) Sparse images obtained by RPCA. (b) Sparse representation image reconstructed by KSVD algorithm. (c) The image after threshold segmentation.

Figure 12. ROC curves of different test images. The ROC curve of (a–i) in the figure respectively corresponds to the detection effect of (a–i) image in Figure 7. The closer a curve is to the top-left corner, the better the corresponding method is.

Figure 13. PR curves of different test images. The PR curve of (a–i) in the figure respectively corresponds to the detection effect of (a–i) image in Figure 7. The closer a curve is to the top-right corner, the better the corresponding method is.

Table 1. This table represents the nomenclature of this paper.

Nomenclature
PCA	principal component analysis	DA	the sparse representation image
RPCA	robust principal component analysis	E	the error estimation matrix
KSVD	k-clustering singular value decomposition	E′	the error estimation matrix after zero removal
OMP	Orthogonal Matching Pursuit	α_i	the ith row in the sparse coefficient matrix
ROC	the receiver operating characteristic
PR	Precision -Recall	α_i′	the sparse coefficient after zero removal
AUC	Area Under ROC Curve
AUCpr	Area Under PR Curve	d_i	the ith atom in the over-complete dictionary
F-measure	comprehensive evaluation index
IOU	intersection over union	s	the size of atomic sample block for constructing fractal dictionary
Y	the data matrix
L	the low-rank matrix	$ψ$	the sparse transform basis
S	the sparse matrix	T₀	the sparsity
D	the over-complete dictionary	TPR	true positive rate
D_s	the fractal dictionary	FPR	false positive rate
D_l	the learnt dictionary	TP	true positive
A	the coefficient matrix	FP	false positive
A_s	the coefficient matrix of sparse component	TN	true negative
		FN	false negative

Table 2. Cirrus detection method based on RPCA and fractal dictionary learning.

INPUT: Infrared image

Y \in R^{m \times n}

,

λ

,

k

, Hurst exponent
OUTPUT: Cirrus detection image

C \in R^{m \times n}

1. The infrared image Y is decomposed by RPCA, and the appropriate sparse component S is obtained according to

λ

2. According to Hurst exponent, random fractal image

I \in R^{M \times N}

is obtained by Diamond–Square algorithm
3. An over-complete dictionary

D_{s} \in R^{s^{2} \times k} = {d_{1}, d_{2} \dots d_{k}}

based on random fractal images is constructed
(if M × N > k, then D_s has k columns; if M by N is less than k, let k be M by N)
4. Sparse coding and dictionary updating are carried out by using KSVD algorithm:
Block column vectorization of sparse component S is used to obtain block matrix

S^{'}

of image
Obtaining Sparse Coefficient Matrix

A \in R^{k \times n} = {α_{1}, α_{2} \dots α_{k}}^{T}

by OMP Algorithm
for i = 1:k do
The error estimation matrix

E_{i}

is obtained when updating column i of the dictionary

E_{i} = S^{'} - \sum_{j \neq i} d_{j} α_{j}

SVD is performed after de-zero operation of

E_{i}

E_{i} = U Δ V^{T}

u

1 = U (:, 1)

d_{i} = u 1

α_{i} = Δ (1, 1) V^{T} (1, :)

end for
5. Sparse Representation Image DA Based on D and A
6. DA was processed by morphological filtering and threshold segmentation
7. The cirrus detection image C is obtained

Table 3. Cirrus detection method based on RPCA and fractal dictionary learning.

	groundtruth 1	groundtruth 0
predicted 1	TP	FP
predicted 0	FN	TN

Table 4. AUC of ROC curve in Figure 9.

Size	blockSize8	blockSize15	blockSize20	blockSize30	blockSize40	blockSize45
Img1	0.9981	0.9995	0.9862	0.9982	0.9966	0.9956
Img2	1	0.9893	0.9737	0.9586	0.9499	0.9477
Img3	0.9882	0.9828	0.9743	0.9227	0.9052	0.8956
Img4	0.9652	0.9813	0.9689	0.9672	0.9591	0.9578
Img5	0.9878	0.9869	0.9806	0.9574	0.9526	0.9507
Img6	0.9769	0.9864	0.9844	0.9792	0.9738	0.9689

Table 5. AUCpr of PR curve in Figure 10.

Size	blockSize8	blockSize15	blockSize20	blockSize30	blockSize40	blockSize45
Img1	0.9701	0.9226	0.8690	0.7852	0.7207	0.6857
Img2	0.9994	0.8771	0.7974	0.6969	0.6296	0.5972
Img3	0.7951	0.8349	0.8032	0.6966	0.6403	0.6215
Img4	0.8449	0.8469	0.7959	0.7078	0.6443	0.6196
Img5	0.8736	0.8776	0.8276	0.7362	0.6861	0.6643
Img6	0.7908	0.8136	0.7831	0.7286	0.6820	0.6600

Table 6. The average running time of the different methods.

Methods	DivisorstepTP	EightpixelTP	MaxMedian	areaMeasure	fractaldim	SingularityExponent	Proposed
Time(s)	13.287	13.083	4.124	22.089	15.440	51.059	12.983

Table 7. AUC of ROC curve in Figure 12. (The bold number represent the maximum value.)

Methods	DivisorstepTP	EightpixelTP	MaxMedian	areaMeasure	fractaldim	SingularityExponent	Proposed
Img1	0.9520	0.5702	0.8020	0.9572	0.8894	0.8805	1
Img2	0.7866	0.5090	0.6603	0.8163	0.7723	0.5954	1
Img3	0.9810	0.5212	0.9623	0.9686	0.9670	0.8322	1
Img4	0.9216	0.9202	0.8643	0.8495	0.9449	0.8320	1
Img5	0.9710	0.9651	0.7689	0.9811	0.9618	0.8741	0.9656
Img6	0.9143	0.9105	0.6902	0.9827	0.9176	0.7660	0.9550
Img7	0.9644	0.4705	0.6117	0.8111	0.9485	0.9577	0.8505
Img8	0.8579	0.5264	0.5997	0.7149	0.8311	0.7200	0.8988
Img9	0.9541	0.9481	0.7759	0.9768	0.9458	0.8475	0.9729

Table 8. AUCpr of PR curve in Figure 13. (The bold number represent the maximum value.)

Methods	Divisorstep TP	Eightpixel TP	MaxMedian	areaMeasure	fractaldim	SingularityExponent	Proposed
Img1	0.0455	0.0053	0.2048	0.2218	0.0266	0.0597	0.8259
Img2	0.6480	0.3772	0.6079	0.6843	0.6095	0.4428	0.7668
Img3	0.6487	0.0500	0.8050	0.5956	0.4578	0.1371	0.9993
Img4	0.2390	0.2299	0.4871	0.4791	0.3633	0.1676	0.9994
Img5	0.3714	0.3180	0.3370	0.8053	0.2928	0.1257	0.8878
Img6	0.2132	0.2038	0.2609	0.7875	0.2389	0.1336	0.8075
Img7	0.3886	0.0154	0.0760	0.3304	0.3355	0.2511	0.6601
Img8	0.5310	0.1898	0.3171	0.4804	0.5023	0.3488	0.8455
Img9	0.3385	0.2985	0.3378	0.7672	0.2759	0.1401	0.8782

Table 9. F-Measure of nine test images. (The bold number represent the maximum value.)

Methods	Divisorstep TP	Eightpixel TP	MaxMedian	areaMeasure	fractaldim	SingularityExponent	Proposed
Img1	0.1168	0.0127	0.4124	0.3989	0.0717	0.1803	1
Img2	0.2951	0.0122	0.2886	0.3171	0.3108	0.0222	0.9275
Img3	0.6315	0.4334	0.5450	0.5450	0.6318	0.4750	0.9963
Img4	0.6181	0.0824	0.7882	0.8228	0.4831	0.1772	0.9973
Img5	0.3782	0.3348	0.4603	0.5261	0.3222	0.1998	0.8287
Img6	0.4015	0.3587	0.4844	0.5327	0.3504	0.1822	0.8371
Img7	0.5094	0.0274	0.1988	0.2709	0.4661	0.3542	0.8172
Img8	0.5229	0.2163	0.3327	0.4339	0.5019	0.3994	0.8661
Img9	0.3283	0.3162	0.5214	0.6046	0.4006	0.2476	0.9723

Table 10. IOU of nine test images. (The bold number represent the maximum value.)

Methods	DivisorstepTP	EightpixelTP	MaxMedian	areaMeasure	fractaldim	SingularityExponent	Proposed
Img1	0.0685	0.0097	0.2054	0.2048	0.0543	0.1295	1
Img2	0.1676	0.0084	0.1409	0.1535	0.1545	0.0168	0.8722
Img3	0.5016	0.3704	0.3722	0.3722	0.4952	0.3704	0.9967
Img4	0.4602	0.0493	0.5897	0.6705	0.3538	0.1352	0.9886
Img5	0.2472	0.2201	0.2858	0.3744	0.2309	0.1453	0.6627
Img6	0.2852	0.2441	0.2953	0.3829	0.2483	0.1260	0.6884
Img7	0.3041	0.0212	0.0881	0.1299	0.2876	0.2426	0.5495
Img8	0.3801	0.1706	0.1892	0.2394	0.3460	0.2790	0.6411
Img9	0.2272	0.2242	0.3477	0.4032	0.2712	0.1571	0.9283

© 2020 by the authors. Licensee MDPI, Basel, Switzerland. This article is an open access article distributed under the terms and conditions of the Creative Commons Attribution (CC BY) license (http://creativecommons.org/licenses/by/4.0/).

Share and Cite

MDPI and ACS Style

Lyu, Y.; Peng, L.; Pu, T.; Yang, C.; Wang, J.; Peng, Z. Cirrus Detection Based on RPCA and Fractal Dictionary Learning in Infrared imagery. Remote Sens. 2020, 12, 142. https://doi.org/10.3390/rs12010142

AMA Style

Lyu Y, Peng L, Pu T, Yang C, Wang J, Peng Z. Cirrus Detection Based on RPCA and Fractal Dictionary Learning in Infrared imagery. Remote Sensing. 2020; 12(1):142. https://doi.org/10.3390/rs12010142

Chicago/Turabian Style

Lyu, Yuxiao, Lingbing Peng, Tian Pu, Chunping Yang, Jun Wang, and Zhenming Peng. 2020. "Cirrus Detection Based on RPCA and Fractal Dictionary Learning in Infrared imagery" Remote Sensing 12, no. 1: 142. https://doi.org/10.3390/rs12010142

APA Style

Lyu, Y., Peng, L., Pu, T., Yang, C., Wang, J., & Peng, Z. (2020). Cirrus Detection Based on RPCA and Fractal Dictionary Learning in Infrared imagery. Remote Sensing, 12(1), 142. https://doi.org/10.3390/rs12010142

Note that from the first issue of 2016, this journal uses article numbers instead of page numbers. See further details here.

Article Menu

Cirrus Detection Based on RPCA and Fractal Dictionary Learning in Infrared imagery

Abstract

1. Introduction

2. Materials and Methods

2.1. Robust Principal Component Analysis

2.2. Random Fractal

2.3. Sparse Representation and Dictionary Learning

2.3.1. Orthogonal Matching Pursuit Algorithm

2.3.2. Dictionary Learning Based on KSVD

2.4. Cirrus Detection by RPCA and Fractal Dictionary Learning

3. Results

3.1. Parameter Settings

3.2. Experimental Results and Analysis

3.3. Evaluation

4. Discussion

5. Conclusions

Author Contributions

Funding

Acknowledgments

Conflicts of Interest

References

Share and Cite

Article Metrics

Article Access Statistics

Further Information

Guidelines

MDPI Initiatives

Follow MDPI

Pfad - The Proxy pFad of © 2024 Garber Painting. All rights reserved.