394 - ICAECA - IEEE-Camera Ready
Abstract— Tomato is a domestic plant that is used predominantly worldwide and has low tolerance to diseases. Among the diseases that affect tomato plants, the type and cause of the disease are often unknown to the farmer. Accurate plant disease diagnosis is critical for increasing crop yield, monitoring plant health, and providing appropriate treatment at an earlier stage. Therefore, this work aims at developing a system to identify the type of disease that affects tomato leaves at an early stage. In the proposed system, Gray Level Co-occurrence Matrix (GLCM) and Local Binary Pattern (LBP) texture-based features are used to identify the type of disease by means of a Support Vector Machine (SVM) classifier. To improve the diagnostic accuracy of the disease detection system, fusion of the two texture features is performed using Principal Component Analysis (PCA) and Canonical Correlation Analysis (CCA). A publicly available tomato leaves dataset is used to evaluate the proposed system. A considerable improvement in the relevant performance metrics is observed with CCA based feature fusion. Hence, the proposed system could be used as an aid to farmers and plant breeders to identify the type of disease and provide the necessary treatment.

Keywords— Tomato, plant disease detection, texture features, plant health, Canonical correlation analysis, feature fusion

I. INTRODUCTION

Agriculture is the primary occupation that provides food and helps us to live a healthy life. Plants should be healthy in order to provide people with nutritious food. In modern days, agricultural activity along with climatic conditions affects plants, causing a multitude of diseases. The plant is therefore deprived of nutrition and the yield falls as well. In order to increase crop yield, effective plant health monitoring systems at earlier stages of plant growth are therefore necessary. Early diagnosis of plant disease is a first step towards the monitoring and management of plant health [1].

Precision farming, the modern, efficient farming practice, combines a variety of sensors with real-time data analytics to increase crop yields. Satellite imagery, tools for data processing, smart sensors, the Internet of Things (IoT), autonomous tractors and geospatial tools help precision farming to transform current agricultural practices. Drones equipped with imaging sensors capture plant images in real time and monitor plant health and stress. Depending on the feedback received from the support system, farmers take the necessary steps to manage plant health. All critical decisions in precision farming are made using sensor data. Precision farming increases the accuracy of all agricultural activities, resulting in reduced operational costs and high crop yields. Automatic disease diagnosis of plants is also one of the key features of precision agriculture. Machine learning techniques are used to develop decision support systems for early diagnosis of plant disease [2].

Tomato (Lycopersicon esculentum) is an essential vegetable in all cuisines. The nutritional value of the tomato makes it a protective food [3]. Tomato is grown worldwide on 46.16 lakh hectares (about 4.6 million hectares), with a global production of about 1279.93 lakh tonnes (about 128 million tonnes). Tomato is an important food crop and is easily susceptible to viral, bacterial and fungal diseases. As a result, crop productivity has been gradually falling across the globe over the years. Crop scouting and manual crop inspection to identify diseases are time consuming and inconsistent for farmers and plant breeders. In addition, naked-eye observation for the identification of diseases becomes cumbersome for large farmland. This calls for techniques that automate the diagnosis of plant disease so as to save time and produce more accurate and reliable results. Image processing in combination with machine learning is used to automate the detection of plant disease, owing to the advancements in imaging sensors and computational intelligence [4].

The proposed work aims to develop a system for the detection of leaf disease in tomato plants using GLCM and LBP texture-based features. Feature extraction is followed by a back-end SVM classifier with linear and non-linear kernel functions. To improve the multiclass performance, a system is developed by combining GLCM and LBP features. Furthermore, a fusion strategy based on PCA and CCA is employed between the GLCM and LBP features. The performance of the leaf disease detection system is assessed in terms of relevant metrics.

II. LITERATURE REVIEW

Considering the importance of automated diagnosis of plant disease, several studies in the literature have focused on developing a plant disease detection system using image analysis and machine learning techniques [5-15].

In [5], the authors developed an image processing-based solution to detect plant diseases. In that system, image preprocessing is followed by K-means segmentation and texture-based feature extraction. Classification is achieved using a neural network.

An image processing-based support system is developed in [6] to help farmers diagnose plant diseases early. Correlation-based features are classified by a Support Vector Machine classifier.

A survey of different image processing techniques used for plant disease detection is presented in [7]. The general steps involved in an end-to-end image processing framework for disease detection are explained in detail.

The authors in [8] reviewed the characteristics of various fungal, viral, and bacterial diseases that occur in tomato plants. Correlation- and color-based analysis was used in an attempt at disease classification.
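Multi-Class
Linear     76.29   76.41   94.07   0.76   0.70   0.76   0.25
Poly       80.42   80.49   95.10   0.80   0.75   0.81   0.38
RBF        73.26   73.36   93.31   0.73   0.66   0.73   0.16

\[ \text{Accuracy} = \frac{TP + TN}{TP + TN + FP + FN} \]

\[ \text{Sensitivity} = \frac{TP}{TP + FN} \]

where TP, TN, FP and FN denote the numbers of true positives, true negatives, false positives and false negatives.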
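As a minimal sketch of how the accuracy and sensitivity measures above, and the related specificity, precision, F1 and Matthews correlation coefficient, can be computed from binary confusion counts, consider the following. The function name `binary_metrics` and the counts are hypothetical and are not taken from the paper; the standard textbook definitions are assumed.

```python
import math


def binary_metrics(tp, tn, fp, fn):
    """Standard confusion-matrix metrics for a two-class problem."""
    accuracy = (tp + tn) / (tp + tn + fp + fn)
    sensitivity = tp / (tp + fn)   # true positive rate (recall)
    specificity = tn / (tn + fp)   # true negative rate
    precision = tp / (tp + fp)
    f1 = 2 * precision * sensitivity / (precision + sensitivity)
    mcc = (tp * tn - fp * fn) / math.sqrt(
        (tp + fp) * (tp + fn) * (tn + fp) * (tn + fn)
    )
    return {
        "accuracy": accuracy,
        "sensitivity": sensitivity,
        "specificity": specificity,
        "precision": precision,
        "f1": f1,
        "mcc": mcc,
    }


# Hypothetical counts, for illustration only (not from the paper).
metrics = binary_metrics(tp=90, tn=80, fp=20, fn=10)
for name, value in metrics.items():
    print(f"{name}: {value:.4f}")
```

For a multiclass problem, one common choice (assumed here, not confirmed by the source) is to compute these quantities per class in a one-vs-rest fashion and average them.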
SVM Type   ACC     SEN     SPC     F1     MC     PRE
Binary Class
Linear     96.68   96.50   96.86   0.96   0.93   0.96   0.93
Poly       96.16   95.00   97.38   0.96   0.92   0.97   0.92
RBF        95.40   96.00   94.76   0.95   0.90   0.95   0.90
Multi-Class
Linear     72.35   72.53   93.09   0.72   0.65   0.73   0.13
Poly       77.30   77.42   94.32   0.77   0.71   0.77   0.29
RBF        71.24   71.41   92.81   0.71   0.64   0.72   0.10
Fig. 4. Diseased image after segmentation

The metrics in Tables II and III clearly show that the performance of the multiclass (5 class) system falls behind that of the two-class system. Multiclass accuracies of 80.42% and 77.30% are obtained using GLCM and LBP features, respectively.

An improvement in image classification accuracy has been obtained by combining multiple texture features. To improve the performance of the proposed system, combining the GLCM and LBP texture features is therefore explored in this work.

SVM Type   ACC     SEN     SPC     F1     MC     PRE
Binary Class
Linear     96.93   96.00   97.91   0.96   0.93   0.97   0.93
Poly       95.40   94.50   96.34   0.95   0.90   0.96   0.90
RBF        94.88   94.50   95.29   0.94   0.89   0.95   0.89
Multi-Class
Linear     81.53   81.61   95.38   0.81   0.77   0.82   0.42
Poly       85.27   85.32   96.31   0.85   0.81   0.85   0.53
RBF        77.09   77.14   94.27   0.77   0.71   0.77   0.28
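The idea of combining GLCM and LBP texture features can be illustrated with a small pure-Python sketch. Everything below is an assumption made for illustration rather than the paper's implementation: the helper names `glcm_features` and `lbp_histogram`, the single horizontal GLCM offset, the basic 3x3 LBP operator, the choice of three GLCM statistics, and the use of plain concatenation as the combination step. The input is assumed to be a grayscale image already quantized to a few integer levels.

```python
from collections import Counter


def glcm_features(img):
    """Contrast, angular second moment and homogeneity from a symmetric,
    normalized GLCM built with a horizontal offset of one pixel."""
    counts = Counter()
    for row in img:
        for a, b in zip(row, row[1:]):
            counts[(a, b)] += 1
            counts[(b, a)] += 1  # count both directions (symmetric GLCM)
    total = sum(counts.values())
    contrast = asm = homogeneity = 0.0
    for (i, j), c in counts.items():
        p = c / total
        contrast += p * (i - j) ** 2
        asm += p * p
        homogeneity += p / (1 + abs(i - j))
    return [contrast, asm, homogeneity]


def lbp_histogram(img):
    """Normalized 256-bin histogram of basic 3x3 LBP codes
    (interior pixels only, neighbor >= center sets the bit)."""
    offsets = [(-1, -1), (-1, 0), (-1, 1), (0, 1),
               (1, 1), (1, 0), (1, -1), (0, -1)]
    hist = [0] * 256
    height, width = len(img), len(img[0])
    for r in range(1, height - 1):
        for c in range(1, width - 1):
            code = 0
            for bit, (dr, dc) in enumerate(offsets):
                if img[r + dr][c + dc] >= img[r][c]:
                    code |= 1 << bit
            hist[code] += 1
    n = (height - 2) * (width - 2)
    return [v / n for v in hist]


# Tiny hypothetical 4x4 image with four gray levels.
img = [
    [0, 0, 1, 1],
    [0, 0, 1, 1],
    [0, 2, 2, 2],
    [2, 2, 3, 3],
]

# Combined descriptor: GLCM statistics followed by the LBP histogram.
combined = glcm_features(img) + lbp_histogram(img)
print(len(combined), combined[:3])
```

In practice the two feature groups have different scales, so standardizing each feature before feeding the combined vector to an SVM is a common precaution.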
In PCA based feature fusion, the resultant fused feature set has fewer dimensions than the combined feature set before fusion. In CCA based feature fusion, correlation features are extracted between the two feature sets (GLCM and LBP) before they are combined. The correlation features in CCA reduce redundancy while improving performance [22].

It is observed from Tables V and VI that feature fusion outperforms the baseline and combined systems on the performance metrics for both binary and multiclass disease detection. The multiclass accuracy of the CCA fused system is 87.08%. CCA based feature fusion thus gives a relative increase of 8.28% over the 80.42% accuracy of the baseline system.

Specificity is an important measure for a disease diagnostic system, as a high specificity corresponds to a low number of false positives. The results show that specificity is about 92% or higher in all cases, irrespective of the class.

the proposed fusion based on CCA could be used as an effective strategy for diagnosing diseases in tomato leaves.

TABLE VI. METRICS OF COMBINED SYSTEM FUSED USING CCA

SVM Type   ACC     SEN     SPC     F1     MC     PRE
Binary Class
Linear     96.93   96.00   97.91   0.96   0.93   0.97   0.93
Poly       96.42   96.00   96.86   0.96   0.92   0.96   0.92
RBF        95.65   97.00   94.24   0.95   0.91   0.94   0.91
Multi-Class
Linear     83.45   83.53   95.86   0.83   0.79   0.83   0.48
Poly       87.08   87.15   96.77   0.87   0.83   0.87   0.59
RBF        83.85   83.92   95.96   0.83   0.79   0.83   0.49
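For reference, CCA based feature fusion of two feature sets is commonly formulated as shown below. This is the standard textbook form and is an assumption here, since the exact formulation used by the authors is not given in this section; the symbols x, y, S_xx, S_yy, S_xy, W_x and W_y are introduced only for illustration.

```latex
% x: GLCM feature vector, y: LBP feature vector (both mean-centred)
% S_xx, S_yy: within-set covariance matrices; S_xy: between-set covariance
\[
(w_x^{*}, w_y^{*}) = \arg\max_{w_x,\, w_y}
\frac{w_x^{\top} S_{xy}\, w_y}
     {\sqrt{w_x^{\top} S_{xx}\, w_x}\;\sqrt{w_y^{\top} S_{yy}\, w_y}}
\]
% Stacking the leading d pairs of directions as columns of W_x and W_y,
% a fused feature vector can be formed by concatenation or by summation:
\[
z = \begin{bmatrix} W_x^{\top} x \\ W_y^{\top} y \end{bmatrix}
\qquad \text{or} \qquad
z = W_x^{\top} x + W_y^{\top} y
\]
```

Which of the two fusion forms applies to the results reported here is not stated in this section, so both are shown.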