Pereira 2019
Pereira 2019
Authorized licensed use limited to: Auckland University of Technology. Downloaded on June 04,2020 at 07:42:54 UTC from IEEE Xplore. Restrictions apply.
Fig. 3. Sample images of dataset and its annotation.
Fig. 2. Sample images in our dataset.
segmentation procedure after appropriate image smoothing collecting process. The smartphone was set up as shown
and iterative binary pixel classification into the crack or non- in Figure 1. The illumination configuration of smartphone
crack classes. The method classified the identified cracks into camera was set as default. In this data collection task, 700
transversal, longitudinal or miscellaneous. of road images were collected then alocated 560 images for
An evaluation of Fully-CNNs for road segmentation in training set and another 120 images for validation set. The
Synthetic Aperture Radar (SAR) images was also conducted. data collection process was conducted at day time. Figure 2
In the study, the author added spatial tolerance rules to- shows some example of our dataset.
wards thin objects after carefully enhanced it sensitivity. The
proposed model successfully extracted most of the roads in B. Data Preprocessing
the test dataset. The result also reveals that although Fully- The original size of the collected images was 1920 x 1080
CNNs natively lack efficiency for road segmentation task, pixel with RGB channels. Before apply semantic segmentation
good results can be obtained if properly configured [11]. technique, all the collected image we resized to 225 x 225
Another approach is the detection of crack and pothole pixel for reducing the computational time during train the
using spatial features and texture based on deep neural network model.
architecture. In the experiment, a pre-processing method which
ensure real-time performance was proposed. The approach C. Data Normalization
performs well in some viewpoint changes, shadows, occlusion The standardization method is used to normalized all the
and background noise. [12]. collected images. It is done by first subtracting the mean from
Beside using for medical image segmentation [13], There every feature and then diving by standard deviation.
is a modified of U-Net called Wide-Range Attention Unit
(WRAU) which can be used for road segmentation. In this D. Data Annotation
study, a partially branch of dense-connection was added to
the U-Net architecture. In addition, at the cluster of nodes In a semantic segmentation task, it requires a corresponding
of partially dense – connections, the authors also employed ground truth in order to train a model. Therefore, in addition
WRAU. This method resulted a tangible boost to test accuracy. to resizing, the author also annotated the coresponding mask
We have proposed a CNN model for paved and unpaved of original images by using a data annotation tool. There are
road classification [14 ]. In the study, if the input image is three classes annotated: road, sky and pothole. Figure 3 shows
classified as paved road, then we continue to detect whether some examples of images and its coresponding masks for each
the paved road image has only pothole or not [15]. The original classes.
road images are cropped due to the presense of some outliers
before applying a deep learning technique. By applying the E. Description of Model Architecture
cropping process, the proposed model succesfully classsified We used U-Net based architecture as our neural network
whether the input images belong to either paved road or model [7]. It contains of three main paths: contracting, bottle-
pothole. neck and an expansive paths as shown in Figure 4.
In section 3, we explained in detail about our proposed All the descriptions of the model are similar to original U-
method, experimental results explained in section 4 and section Net model. Except for preserving the spatial resolution after
5 provided conclusion and future works. convolution we applied zero padding and we also doubled the
number of feature channels at the contracting path from 32
III. P ROPOSED M ETHOD at the first layer to 512 at the last layer. In addition, to map
each 32 components we also used a 1 x 1 convolution in the
This section consist of data collection, data annotation, expansive path.
description of model architecture and training methodology.
A. Data Collection F. Training Methodology
We utilized ASUS Z01RD smartphone for conducting data The training method of the model is expained in this
collection task. Some urban and rural roads are covered in subsection.
Authorized licensed use limited to: Auckland University of Technology. Downloaded on June 04,2020 at 07:42:54 UTC from IEEE Xplore. Restrictions apply.
Fig. 4. Graphic depiction of model architecture.
Authorized licensed use limited to: Auckland University of Technology. Downloaded on June 04,2020 at 07:42:54 UTC from IEEE Xplore. Restrictions apply.
Fig. 5. Model prediction results compared with the ground truth mask.
segmentation task is still small and lack of variety for building [10] Henrique Oliveira, Paulo Lobato Correia, ”Road Surface Crack Detec-
a good and reliable segmentation model which cause the model tion: Improved Segmentation with Pixel-based Refinement.” 2017 25th
European Signal Processing Conference (EUSIPCO).
failed to segment some unseen data. To solve this issue, more [11] Corentin Henry, Seyed Majid Azimi and Nina Merkle, ”Road Segmen-
annotation of corresponding mask need to be conduct in the tation in SAR Satellite Images with Deep Fully-Convolutional Neural
future. Networks.”Arxiv preprint arXiv:1802.01445v2, 2018.
[12] Sukhad Anand, Saksham Gupta, Vaibhav Darbari and Shivam Kohli,
”Crack-pot: Autonomous Road Crack and Pothole Detection.”Arxiv
VI. ACKNOWLEDGMENT preprint arXiv:1810.05107v1, 2018.
[13] Man Yuan, Zhi Liu, Fan Wang (2019) ,”Using the wide-range attention
U-Net for road segmentation”, Remote Sensing Letters, 10:5, 506-515,
The authors would like to thanks to the lecturers and staff DOI: 10.1080/2150704X.2019.1574990
of Faculty of Engineering and Science and Technology of [14] Vosco Pereira, Satoshi Tamura, Satoru Hayamizu, Hidekazu Fukai,
National University of Timor Leste and Gifu University Japan. ”Classification of Paved and Unpaved Road Image Using Convolutional
Neural Network for Road Condition Inspection System.”, International
Special thanks to Japan International Cooperation Agency Conference on Advance Informatics: Concepts, Theory and Applications
(JICA) for funding this research through the project of CADE- (ICAICTA) 2018.
FEST phase 2. [15] Vosco Pereira, Satoshi Tamura, Satoru Hayamizu, Hidekazu Fukai,
”A Deep Learning-Based Approach for Road Pothole Detection in
Timor Leste”. IEEE International Conference on Service Operations and
Logistics, and Informatics (SOLI) 2018.
R EFERENCES
[1] Burningham, Sally; Stankevich, Natalya. 2005. ”Why Road Maintenance
is Important and How to Get it Done. Transport Notes Series”; No. TRN
4. World Bank, Washington, DC. © World Bank.
[2] Jonathan Long, Evan Shelhamer, Trevor Darrell,”Fully Convo-
lutional Networks for Semantic Segmentation”, ArXiv preprint
arXiv:1411.4038v2, 2015
[3] Simon Jegou, Michal Drozdzal, David Vazquez, Adriana Romero,
Yoshua Bengio,”The One Hundred Layers Tiramisu: Fully Con-
volutional DenseNets for Semantic Segmentation”, ArXiv preprint
arXiv:1611.09326v3, 2017
[4] Yecheng Lyu, Lin Bai, Xinming Huang, ”Road Segmentation Using
CNN and Distributed LSTM”, ArXiv preprint arXiv:1808.04450v2,
2019
[5] Yecheng Lyu, Xinming Huang, ”Road Segmentation Using CNN with
GRU”, ArXiv preprint arXiv:1804.05164v1, 2018
[6] Farnoush Zohourian, Borislav Antic, Jan Siegemund, Mirko Meuter,
Josef Pauli, ”Superpixel-based Road Segmentation for Real-time Sys-
tems using CNN”, In Proceedings of the 13th International Joint Con-
ference on Computer Vision, Imaging and Computer Graphics Theory
and Applications (VISIGRAPP 2018) - Volume 5: VISAPP, pages 257-
265
[7] Olaf Ronneberger, Philipp Fischer, and Thomas Brox, ”U-Net: Convo-
lutional Networks for Biomedical Image Segmentation.”Arxiv preprint
arXiv:1801.09454v2, 2018.
[8] Ye Li, Lele Xu, Jun Rao, Lili Guo, Zhen Yan & Shan Jin, ”A Y-
Net deep learning method for road segmentation using high-resolution
visible remote sensing images.” Journal Remote Sensing Letters Volume
10, 2019 - Issue 4, Pages 381-390.
[9] Shinji Kakegawa, Haruki Matono, Hideaki Kido & Takeshi Shima,
”Road Surface Segmentation based on Vertically Local Disparity His-
togram for Stereo Camera.”Int. J. ITS Res. DOI 10.1007/s13177-017-
0140-8.
Authorized licensed use limited to: Auckland University of Technology. Downloaded on June 04,2020 at 07:42:54 UTC from IEEE Xplore. Restrictions apply.