Steel Surface Defect Detection Using Deep Learning
ISSN No:-2456-2165
Abstract:- Steel defects are a frequent problem for steel companies. Proper quality control can reduce the quality problems that arise from them. Nowadays, steel defects can be detected by automated methods that apply suitable algorithms, and deep learning can make such detection more sophisticated. In this study, we use a deep learning CNN with the Xception architecture to detect steel defects in images taken by high-frequency, high-resolution cameras. Two techniques are used, and they produce accuracies of 0.94 and 0.85, respectively. The Xception architecture used in this case shows optimal and stable performance, both during the process and in its results.

Keywords:- Defect Detection, Steel Defect, Deep Learning, Xception.

I. INTRODUCTION

In the steel industry, quality control during the production process is very important. Steel quality control is performed by detecting defects on the steel surface. Quality degrades if steel defects are not detected correctly, whereas detecting them at the right time helps in handling quality problems in the steel being produced. The quality of the steel directly determines its durability and lifetime.

Nowadays, automation is used to carry out quality control. One defect detection method applies algorithms to images from high-frequency, high-resolution cameras [1]. This detection requires a sophisticated learning algorithm that can improve its detection quality over time. Deep learning supports this technique because of its ability to learn on its own.

Deep learning has revolutionized various industries because of its excellent performance in computer vision. Unlike earlier machine learning methods, deep learning can work without explicit instructions from its creator to produce fast and accurate predictions [2], so it can reduce the workload of engineers in the steel industry.

The deep learning model commonly used for image recognition is the Convolutional Neural Network (CNN). A CNN helps find defective objects in steel surface images. Therefore, deep learning with the CNN model is used in this study to detect steel defects.

II. RELATED WORKS

Many algorithms have previously been applied to detect steel defects, including SIFT and Voting Strategy [3], the YOLO network [4], and ANN and DAN algorithms [5]. S. Tian and K. Xu [6] detected steel defects with genetic algorithms and an extreme learning machine (ELM). The ELM is combined with genetic algorithms to improve the results obtained. Experiments with steel defect samples show that the resulting G-ELM algorithm effectively increases the ELM algorithm's identification accuracy.

M. S. Kim et al. [7] applied deep learning to classify steel defects using a CNN model combined with a Siamese neural network. In their research, the CNN is used for feature extraction and the Siamese network serves as the network structure, with a scheme that uses convolutional layers with a small filter size. Among the algorithms used, this approach produces good accuracy in classifying steel defects. S. Y. Lee et al. [8] and J. L. Greece et al. [9] have also used deep learning and CNNs in their research to detect steel defects.

From the previous studies that applied deep learning with the CNN model, we observed that the performance of a CNN can be affected by the choice of architecture (pre-trained model). CNNs come in a variety of architectures, which generally differ in their layers. The three best-known and most commonly used CNN architectures are VGG16 [10], Inception V3 [11], and Xception [12]. Based on the ImageNet results reported by Keras [13], Table 1 describes the performance of each architecture.

Models        Size     Top-1 Acc   Top-5 Acc   Depth
VGG-16        528 MB   0.713       0.901       23
InceptionV3   92 MB    0.779       0.937       159
Xception      88 MB    0.790       0.945       126
Table 1:- Performance of pre-trained CNNs

In the table, top-1 and top-5 accuracy refer to the performance of the model on the ImageNet validation dataset. The Xception architecture has the highest top-1 and top-5 accuracy and uses less memory than the VGG-16 and InceptionV3 architectures.

Some studies compare the three architectures in terms of performance. The comparison of the Xception and Inception V3 architectures in F. Chollet's research [12] shows that, compared to InceptionV3, Xception has a small advantage in its classification
Fig 2:- Image with a defect of a single class

Fig 3:- Image with multiple defect classes

B. Preprocessing
Deep learning for image recognition requires image preprocessing, which influences how features are extracted from the image. The preprocessing method used here is data augmentation, a technique that modifies data without removing its core content. Because the amount of data is large, data augmentation helps speed up the preprocessing stage. The augmentation methods used in this study are (a) rescale with 1./255, (b) shear range with 0.1, (c) zoom range with 0.1, (d) brightness range with [0.6, 1.0], (e) fill mode "constant", (f) cval with 0, (g) horizontal flip, and (h) vertical flip. For use with the architecture, the images are resized to 256 x 256 pixels. Resizing the images speeds up the training process.

C. Xception
In this study, the CNN architecture used is Xception [11], created by Francois Chollet. Xception is an extreme version of the Inception model whose architecture is based on depthwise separable convolution layers. This extreme version of Inception rests on a strong hypothesis: a 1 x 1 convolution can map the cross-channel correlations, while the spatial correlations are mapped separately by n x n convolutions, of which there may be many. In deep learning, a depthwise separable convolution (often called a separable convolution) consists of a depthwise convolution, which is a spatial convolution performed independently on each input channel, followed by a pointwise (1 x 1) convolution, which projects the channels output by the depthwise convolution onto a new channel space.

Fig 7:- Xception Module [11]

Xception consists of 36 convolutional layers that form a network for feature extraction. The Xception architecture consists of three flows, namely entry, middle, and exit. Each flow comprises several blocks of layers that support feature extraction. The feature extraction blocks end with a global average pooling layer, followed by an optional fully connected layer and finally a logistic regression layer. In this study, the logistic regression layer is not used; instead, the head consists of a global average pooling 2D layer and a Dense layer whose size equals the number of classification classes.
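
The augmentation settings from the Preprocessing subsection and the classification head described for Xception could be sketched in Keras roughly as follows. This is a minimal illustration, not the authors' code. It assumes a TensorFlow installation that still ships the (now deprecated) `ImageDataGenerator`. The three-channel input, the `num_classes` argument, the `softmax` activation, and the `weights` default are all assumptions, since the text does not state them. TensorFlow is imported inside the functions so the settings can be inspected without it.

```python
# Hypothetical sketch of the preprocessing and model setup; not the authors' code.

IMAGE_SIZE = (256, 256)  # resize target stated in the text

# Augmentation settings (a)-(h) from the text, written as keyword
# arguments for Keras' ImageDataGenerator.
AUGMENTATION_KWARGS = {
    "rescale": 1.0 / 255,
    "shear_range": 0.1,
    "zoom_range": 0.1,
    "brightness_range": [0.6, 1.0],
    "fill_mode": "constant",
    "cval": 0.0,
    "horizontal_flip": True,
    "vertical_flip": True,
}


def build_generator():
    """Create an augmenting image generator from the settings above."""
    from tensorflow.keras.preprocessing.image import ImageDataGenerator

    return ImageDataGenerator(**AUGMENTATION_KWARGS)


def build_model(num_classes, activation="softmax", weights=None):
    """Xception feature extractor + global average pooling + Dense head.

    num_classes, activation and weights are assumptions: the text only says
    the Dense layer matches the number of classification classes.
    """
    from tensorflow.keras import layers, models
    from tensorflow.keras.applications import Xception

    # include_top=False drops the original ImageNet classifier so that a
    # custom head can be attached. Three input channels are assumed.
    base = Xception(
        include_top=False,
        weights=weights,
        input_shape=IMAGE_SIZE + (3,),
    )
    pooled = layers.GlobalAveragePooling2D()(base.output)
    outputs = layers.Dense(num_classes, activation=activation)(pooled)
    return models.Model(inputs=base.input, outputs=outputs)
```

A generator built this way would typically be paired with `flow_from_directory(..., target_size=IMAGE_SIZE)` so that images are resized on load. For images that carry several defect classes at once, a `sigmoid` activation may be the more natural choice.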
D. Performance Evaluation
To present the analysis in this study, we use several metrics, graphs, and a confusion matrix. The results of the analysis are then used to determine the ability of the model in defect detection.
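
The section does not say which metrics are used. As an illustration only, the sketch below assumes accuracy and per-class precision and recall, all derived from a confusion matrix. The function names and the toy labels are hypothetical.

```python
def confusion_matrix(y_true, y_pred, num_classes):
    """Count predictions: rows are true classes, columns are predicted classes."""
    matrix = [[0] * num_classes for _ in range(num_classes)]
    for true_label, predicted_label in zip(y_true, y_pred):
        matrix[true_label][predicted_label] += 1
    return matrix


def accuracy(matrix):
    """Fraction of samples on the diagonal (correctly classified)."""
    total = sum(sum(row) for row in matrix)
    correct = sum(matrix[i][i] for i in range(len(matrix)))
    return correct / total if total else 0.0


def precision_recall(matrix, cls):
    """Precision and recall for one class, read off its column and row."""
    true_positive = matrix[cls][cls]
    predicted_as_cls = sum(row[cls] for row in matrix)  # column sum
    actually_cls = sum(matrix[cls])                     # row sum
    precision = true_positive / predicted_as_cls if predicted_as_cls else 0.0
    recall = true_positive / actually_cls if actually_cls else 0.0
    return precision, recall


# Toy example with three hypothetical defect classes.
y_true = [0, 0, 1, 1, 2, 2]
y_pred = [0, 1, 1, 1, 2, 0]
cm = confusion_matrix(y_true, y_pred, num_classes=3)
print(cm)                       # [[1, 1, 0], [0, 2, 0], [1, 0, 1]]
print(accuracy(cm))             # 4 of 6 correct
print(precision_recall(cm, 1))  # class 1: precision 2/3, recall 1.0
```

Reading the matrix row by row shows which true classes are confused with which predictions, something a single accuracy figure does not show.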
V. EXPERIMENTAL RESULT
VI. RESULT
REFERENCES