Plant Leaf Disease Detection Using Machine Learning
Karunya Institute of Technology and Sciences, Coimbatore, India
kethsy_cse@karunya.edu, mbharath@karunya.edu.in, kambamsanjay@karunya.edu.in
Abstract: Plant leaf disease detection is a critical task in modern agriculture to ensure better crop yield and quality. This paper presents a strategy for detecting plant leaf disease using machine learning techniques. The proposed methodology consists of three main stages, followed by classification using five different models: KNN, SVM, Decision Trees, Random Forest, and CNN. The collected images are pre-processed to eliminate unwanted features and resized to a standard size of 256 x 256 pixels. The next stage uses a pre-trained CNN model to extract pertinent features. The extracted features are then used to train the classification models. The performance of each model is assessed using several metrics to gauge its effectiveness and accuracy. The proposed methodology is expected to provide a reliable and efficient diagnosis of plant diseases, helping farmers take timely measures to prevent disease outbreaks and ensure healthy crop growth. The proposed system achieved high accuracy, low complexity, and easy identification. The experimental findings show that the suggested approach is successful in identifying common diseases. Early detection and diagnosis of crop diseases with the suggested method can result in timely treatment and higher crop yield.

Keywords—Machine Learning, Image Segmentation, Support Vector Machine, Plant leaf disease detection.

I. INTRODUCTION

Advances in machine learning in recent years have revolutionized image recognition and classification. A model is proposed for the detection and classification of plant disease using ML techniques. The model is designed to accurately identify the disease affecting a plant, enabling farmers and researchers to take corrective measures accordingly.

The suggested framework relies on ML techniques. The classification model is initially trained on a large dataset using machine learning approaches, and then its accuracy is improved by fine-tuning it on a lower dimension. CNNs are one of the most powerful tools for image segmentation and classification. The main focus of the proposed system is to classify leaf diseases accurately so that it can solve the real-world problems that farmers face in crop fields. The model is less complex and identifies plant diseases easily. The system aims to improve on the accuracy of the existing methodology by using CNNs for image segmentation and classification, and by incorporating transfer learning methods. The proposed model comprises four stages (data collection, preprocessing, segmentation, and feature extraction), culminating in a final classification stage. 80 percent of the gathered dataset is used for training and 20 percent for testing.

During the data preprocessing stage, the system receives an input image of a leaf, which is taken with a digital camera and represents the leaf's color in RGB format. Preprocessing consists of image resizing and rescaling. The collected data is partitioned into subsets with the help of the validation_split property.

In the Image Segmentation phase, the input image is divided into segments to make image analysis easier. A CNN is used for segmentation, which reduces processing time and overcomes the problem of underfitting. Aggregated data is produced as a result of the image segmentation step.

In the Feature Extraction phase, features such as size, shape, and surface texture are considered in order to extract different properties of the leaves. The gray-level co-occurrence matrix (GLCM) algorithm is used to examine surface (texture) features. After the co-occurrence matrices are generated, the necessary features are extracted for use in the subsequent classification stage.
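The GLCM texture step described above can be illustrated with a short sketch. This is a minimal example under stated assumptions, not the authors' implementation: the function names (`rescale_and_quantize`, `glcm`, `texture_features`), the choice of 8 gray levels, the one-pixel horizontal offset, and the three statistics computed are all illustrative choices that the paper does not specify.

```python
import numpy as np

def rescale_and_quantize(gray, levels=8):
    """Rescale an 8-bit grayscale image to [0, 1], then bin it into `levels` gray levels."""
    scaled = gray.astype(np.float64) / 255.0
    return np.minimum((scaled * levels).astype(np.int64), levels - 1)

def glcm(quantized, levels=8, dx=1, dy=0):
    """Normalized, symmetric co-occurrence matrix for a non-negative pixel offset (dx, dy)."""
    h, w = quantized.shape
    src = quantized[0:h - dy, 0:w - dx]   # reference pixels
    dst = quantized[dy:h, dx:w]           # neighbors at the given offset
    matrix = np.zeros((levels, levels), dtype=np.float64)
    np.add.at(matrix, (src.ravel(), dst.ravel()), 1.0)  # count each (i, j) pair
    matrix += matrix.T                    # count every pair in both directions
    return matrix / matrix.sum()

def texture_features(p):
    """A few common statistics of a normalized co-occurrence matrix p."""
    i, j = np.indices(p.shape)
    return {
        "contrast": float(np.sum(p * (i - j) ** 2)),
        "asm": float(np.sum(p ** 2)),  # angular second moment
        "homogeneity": float(np.sum(p / (1.0 + (i - j) ** 2))),
    }

if __name__ == "__main__":
    # Hypothetical stand-in for a grayscale leaf image resized to 256 x 256.
    rng = np.random.default_rng(0)
    leaf = rng.integers(0, 256, size=(256, 256), dtype=np.uint8)
    print(texture_features(glcm(rescale_and_quantize(leaf))))
```

The resulting dictionary can be flattened into a feature vector and passed to the classifiers in the next stage. In practice, several offsets and angles are usually combined rather than the single horizontal offset assumed here.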
In the Classification phase, support vector machines and other classification algorithms are used. The input images for the neural network are reduced in dimensions to reduce processing time and overcome the problem of underfitting. The model is trained for a specific number of epochs, after which the performance metrics of the classification models are evaluated and compared. The results demonstrate that the proposed model classifies plant leaf diseases with high accuracy. The proposed system can be used to classify many kinds of diseases easily, which can lead to the early detection of plant diseases and the subsequent prevention of their spread.

II. LITERATURE SURVEY

Plant diseases are a significant threat to crop productivity, and the traditional methods of manual observation and identification of diseases are time-consuming and often inefficient. With the rapid development of ML techniques, researchers are able to develop automated systems to detect and diagnose diseases. Some of the existing research is reviewed below.

One recent study was done by Reddy et al. [1], who created a method that uses deep learning models to detect plant diseases. They employed pre-trained models such as VGG16 and InceptionV3 to extract features and subsequently trained classifiers such as SVM, KNN, and Decision Tree. The outcome was an accuracy rate of 97.35%, indicating the potential of ML in identifying plant diseases.

Another study on plant disease detection was done by Panchal [2], who proposed a method using CNNs to detect leaf diseases. The proposed model achieved a high accuracy of 97.67%, demonstrating the effectiveness of deep learning in detecting leaf diseases.

De Luna [3] presented an automated system for capturing images of tomato plant leaves and applied deep-learning algorithms to categorize them as either healthy or diseased. They obtained a precision of 98.4% using the ResNet-50 architecture, indicating the feasibility of automatic systems for detecting plant diseases.

A study by Yusoff [4] proposed a system that identifies disease in Hevea leaves using the Sobel edge detection algorithm on an FPGA. The proposed system achieves 92.3% accuracy and a processing speed of 40 frames per second, demonstrating the potential of FPGA-based systems for plant disease detection.

Srikaew and collaborators [5] created a plant disease identification system using an artificial intelligence algorithm and a co-occurrence matrix. Their system attained a precision of 92.5%, showcasing the capabilities of artificial intelligence in identifying plant diseases.

Thirumalesh and colleagues [6] presented a novel method that uses a K-Nearest Neighbor (KNN) classifier to identify diseases in paddy leaves. The KNN method is a supervised machine-learning technique that is commonly used for pattern recognition applications. The suggested method detected plant diseases with a precision rate of 96%, demonstrating the promise of machine learning methods for recognizing plant diseases. This study has important implications for enhancing agricultural production, eventually leading to higher yields and better-quality crops.

Ramesh and co-authors [7] introduced a system that uses advanced machine learning methodologies, namely CNN and SVM, for detecting plant diseases. The system yielded an accuracy rate of 91%, indicating the high potential of machine learning in plant disease detection. This outcome is significant because it suggests that advanced transfer learning methodologies could significantly enhance the ability to detect and mitigate plant diseases, ultimately leading to improved crop yields and food security.

Kumar [8] proposed a system using machine learning techniques to detect leaf disease. The proposed system achieved 96.5% accuracy, demonstrating its effectiveness in detecting plant leaf disease.

Varshney and colleagues [9] suggested an approach for identifying plant diseases that uses machine learning methods such as SVM and Random Forest. The suggested method obtained a precision of 92.7%, indicating the potential of machine learning in detecting plant diseases.

Kaur and Kang [10] proposed an advanced classifier that uses an SVM model to enhance the detection process. Their approach achieved an accuracy rate of 95.33%, which supports the efficacy of machine learning in accurately detecting plant diseases. This approach can lead to improved management of plant diseases, which can ultimately lead to better crop yields, increased food production, and a more sustainable environment.

Khirade and Patil [11] introduced a system for detecting plant diseases using advanced image processing techniques. Their system demonstrated an accuracy of 93.33%. The findings shed light on the efficacy of this approach and suggest its feasibility in practical settings. With further research and development, this technology could change the way people monitor and manage plant health, offering a more precise and efficient alternative to traditional methods.

Sharif and associates [12] suggested an optimized segmentation approach, together with a feature selection approach, for identifying and categorizing pomelo diseases. The suggested method attained a precision rate of 97.27%, showcasing the prospects of machine learning in detecting plant diseases.

Chandavale et al. [13] developed a comprehensive strategy for preventing, identifying, and treating plant diseases. To identify and categorize plant diseases, the suggested method combines machine learning algorithms with image processing techniques. The authors reported a 95%
Decision Trees, Random Forest, and CNN. All the classification models are trained on the features extracted in the preceding phase, and the results are recorded. Each model is evaluated using several criteria: accuracy, precision, recall, and F1-score. The decision tree algorithm is a tree-based model that partitions the feature space into regions based on the values of the features and then assigns a class label to each region. The SVM algorithm is a binary linear classifier that maximizes the margin between the feature vectors of different classes. The Random Forest model combines numerous decision trees to improve classification performance.

IV. RESULT

The proposed methodology was implemented using the Python programming language and the TensorFlow machine learning framework. The performance metric results for each classification model are given in Table 1.

on computation time. The average computation time required to process a single image was found to be 0.003 seconds on a standard desktop computer. This demonstrates the real-time capability of the proposed system, making it suitable for practical applications in the field.

In summary, the proposed system uses machine-learning techniques to classify different plant leaf diseases based on input images. The proposed system attains 91.93% accuracy, which is higher than that achieved by previous studies. The feature importance analysis reveals that the Mean Green Intensity is the most significant feature contributing to the classification performance. Overall, the proposed system predicts disease from leaf images efficiently and accurately, thereby contributing to the development of sustainable agriculture. Some of the sample results of the proposed system regarding the classification of different diseases are shown below.
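For reference, the evaluation protocol used in this work (an 80/20 train/test split, a classifier trained on extracted features, and accuracy, precision, recall, and F1 as criteria) can be sketched as follows. This is a minimal sketch under stated assumptions and does not reproduce the paper's results: the helper names (`split_train_test`, `knn_predict`, `binary_metrics`), the value k = 3, the two-class setting, and the synthetic feature vectors are all hypothetical, and KNN stands in for any of the five models.

```python
import numpy as np

def split_train_test(features, labels, train_fraction=0.8, seed=0):
    """Shuffle the samples and split them into training and testing subsets."""
    rng = np.random.default_rng(seed)
    order = rng.permutation(len(features))
    cut = int(round(train_fraction * len(features)))
    train, test = order[:cut], order[cut:]
    return features[train], labels[train], features[test], labels[test]

def knn_predict(train_x, train_y, test_x, k=3):
    """Label each test vector by majority vote among its k nearest training vectors."""
    # Euclidean distance from every test vector to every training vector.
    dists = np.linalg.norm(test_x[:, None, :] - train_x[None, :, :], axis=2)
    nearest = np.argsort(dists, axis=1)[:, :k]
    votes = train_y[nearest]  # labels must be non-negative integers
    return np.array([np.bincount(row).argmax() for row in votes])

def binary_metrics(y_true, y_pred, positive=1):
    """Accuracy, precision, recall, and F1 for one class treated as positive."""
    tp = int(np.sum((y_pred == positive) & (y_true == positive)))
    fp = int(np.sum((y_pred == positive) & (y_true != positive)))
    fn = int(np.sum((y_pred != positive) & (y_true == positive)))
    precision = tp / (tp + fp) if tp + fp else 0.0
    recall = tp / (tp + fn) if tp + fn else 0.0
    f1 = 2 * precision * recall / (precision + recall) if precision + recall else 0.0
    return {
        "accuracy": float(np.mean(y_true == y_pred)),
        "precision": precision,
        "recall": recall,
        "f1": f1,
    }

if __name__ == "__main__":
    # Synthetic two-class feature vectors (0 = healthy, 1 = diseased), for illustration only.
    rng = np.random.default_rng(1)
    x = np.vstack([rng.normal(0.0, 1.0, (50, 4)), rng.normal(3.0, 1.0, (50, 4))])
    y = np.array([0] * 50 + [1] * 50)
    train_x, train_y, test_x, test_y = split_train_test(x, y)
    print(binary_metrics(test_y, knn_predict(train_x, train_y, test_x)))
```

For more than two classes, the same counts can be computed per class and averaged. Which averaging scheme the reported metrics use is not stated in the text.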
VII. REFERENCES
[1] P. C. Reddy, R. M. S. Chandra, P. Vadiraj, M. A. Reddy, T. R. Mahesh, and G. S. Madhuri, "Detection of Plant Leaf-based Diseases Using Machine Learning Approach," 2021 IEEE (CSITSS).