Crop Yield Prediction Using Machine Learning Algorithm
Crop Yield Prediction Using Machine Learning Algorithm
978-1-6654-1447-0/21/$31.00 2021
c IEEE 611
Authorized licensed use limited to: M S RAMAIAH INSTITUTE OF TECHNOLOGY. Downloaded on May 03,2025 at 05:52:24 UTC from IEEE Xplore. Restrictions apply.
a variety of factors, including their expectations. Yield prediction used to
meteorological, geographical, organic, be calculated by looking at a farmer's
political, and economic considerations. previous experience with a specific crop.
When there are multiple crops to raise, it can Agricultural yield is mostly determined by
be challenging for farmers, especially if they weather conditions, pests, and harvest
are unfamiliar with market values. process planning. For making judgments
According to Wikipedia, the farmer suicide about agricultural risk management, having
rate in India has fluctuated between 1.4 and accurate information about crop production
1.8 per 100,000 people over the last decade. history is critical.
In 2015, the number of farmer suicides
surpassed 8000 up from 5650 in 2014. III. Literature Survey:
The employment of technology to raise
cultivation awareness has grown Farmers can use a variety of programs to
unavoidable in recent years. Seasonal forecast crop yields based on climate
climate change is also wreaking havoc on variables. The crops were predicted using
key assets such as land, water, and air, machine learning algorithms. The random
resulting in food insecurity. In one scenario, forest technique is used to train the model
agricultural yields are continually falling for the five meteorological parameters, but
short of demand, necessitating the additional agriculture inputs such as soil
development of a smart system to address quality, pests, chemicals utilized, and so on
the issue of declining crop yields. To address are not taken into account. To build the
this issue, we suggest a system that will give random forest, the model was trained using
crop selection based on economic and 200 decision trees. The trained model's
environmental variables, allowing farmers to accuracy was tested using 10 fold cross-
get the most yield from their crops while validation.
also helping to fulfill the country's rising
demand for food supply. Machine learning is Machine learning, which is a good empirical
used in the suggested method to produce approach for category and prediction, is
predictions. The system will give crop yield some other method to crop yield estimation.
and crop selection depending on weather It defined the corn yield estimation in Iowa
attributes appropriate for the crop, allowing State the usage of 4 system learning
farmers to get the most out of their crops. strategies including RF (Random Forest),
The method provides crop output ERT (Extremely Randomized Trees), and
projections based on characteristics such as DL (Deep Learning). Also, comparisons of
rainfall, temperature, area (in hectares), the validation information amongst them
season, and so on. Crop yield forecasting is had been presented. To observe the seasonal
a significant agricultural issue. Every farmer sensitivities of the corn yields, three-length
is interested in determining how much yield companies had been set up: (1) MJJAS
will be generated and whether it will match (May to September), (2) JA (July and
612 2021 4th International Conference on Computing and Communications Technologies (ICCCT)
Authorized licensed use limited to: M S RAMAIAH INSTITUTE OF TECHNOLOGY. Downloaded on May 03,2025 at 05:52:24 UTC from IEEE Xplore. Restrictions apply.
August), and (3) OC (surest mixture of the regression, decision Tree, Random Forest,
month).In terms of the correlation and Support Vector Machine can be used.
coefficient, the DL approach had the highest
accuracies for the three-period groups. The IV. Existing System:
accuracies in the OC group were relatively
good, indicating that the best month A crop yield prediction model based on
combination can be important in statistical CNN and Geographical Index. The existing
agricultural yield modeling. model had an issue with agricultural drifts
for crop cultivation that were not compatible
R. Ghadge el [1] concluded that this work with environmental elements such as
aids in enhancing agricultural production temperature, weather, and soil condition.
rates by employing several classification BPNN was utilized to train the created CNN
methods and comparing various model that utilized spatial characteristics as
characteristics. To predict crop yield, input for error prediction. The created model
various machine learning techniques were had the advantage of being deployed on a
examined. Artificial neural networks, real-time dataset derived from legitimate
support vector machines, K-Nearest geospatial resources. However, while the
Neighbors, Decision Trees, Random forests, new model reduced relative error, it
Gradient boosted decision trees, regularised decreased crop yield forecast efficiency.
greedy forests, and the proposed CSM
technique (Crop Selection Method), which The previous model employed SVM to
aids in predicting the sequence of crops that classify crop data based on the texture,
can be considered for planning in the shape, and color of patterns on the sick
coming seasons, are among the algorithms surface since it includes a clear perception
included for comparative analysis. of the faults. A previously utilized
technology, CNN, reduced the relative
Instead of utilizing MLR (multiple linear inaccuracy as well as the crop production
regression) and RF (random forest) models, forecast. Similarly, an existing model that
Khaki al [6] suggested an ELM model based combined a time series model with a Back
on artificial intelligence for coffee yield Propagation Neural Network (BPNN) and
prediction for small farms. Different used a smaller dataset size had inferior
machine learning models were compared to performance since fewer samples were used
the ELM models. In terms of extracting for prediction. In the realm of selection
features, the author claims that ELM models stability and precision, machine learning
are more efficient than RF and MLR methods were used.
models. To predict agricultural yield, many
supervised machine learning techniques ML has several useful techniques for
such as linear regression, polynomial determining the input and output link in
yield and crop prediction. In agriculture,
2021 4th International Conference on Computing and Communications Technologies (ICCCT) 613
Authorized licensed use limited to: M S RAMAIAH INSTITUTE OF TECHNOLOGY. Downloaded on May 03,2025 at 05:52:24 UTC from IEEE Xplore. Restrictions apply.
machine approaches are utilized for yield played a key part. This study examines the
prediction, smart irrigation, crop disease many agricultural strategies that employ
prediction, crop selection, weather machine learning, as well as their benefits
forecasting, and determining the minimum and drawbacks.
support price, among other things. These
strategies will increase the production of the
fields while reducing the farmers' input
efforts. Furthermore, machine and
technology advancements were accurate
because they utilized considerable data and
V. Architecture Diagram Random Forest Algorithm
614 2021 4th International Conference on Computing and Communications Technologies (ICCCT)
Authorized licensed use limited to: M S RAMAIAH INSTITUTE OF TECHNOLOGY. Downloaded on May 03,2025 at 05:52:24 UTC from IEEE Xplore. Restrictions apply.
regression. The Random Forest algorithm 2. The vote given by each decision tree
generates decision trees on distinct data for each predicted event was then
samples, predicts data from each subset, and calculated.
then votes on which option 3. Finally, we looked at the most
We employed the Random Forest technique popular predicted outcome, which is
to achieve high accuracy, which provides the random forest algorithm's final
accuracy that predicts by model and the forecast.
actual outcome of prediction in the dataset.
In the random forest, a decision tree is
created from a sample of data, and the trees VII. Conclusion:
provide predictions for each family. The best
solution is chosen by voting, which Data cleaning and processing, missing value
improves the model's accuracy. It produces analysis, exploratory analysis, and model
the best results for the system. creation and evaluation were all part of the
analytical process. Finally, we use a machine
Pseudocode of the Proposed System: learning method to predict the crop, with
varying outcomes. This leads to some of the
1. We first randomly select the 'k's to following crop forecast insights. Because
feature out of the total 'm' feature in this system will cover the most sorts of
the model. crops, farmers will be able to learn about
2. Using the best split point the k crops that have never been farmed before
feature is chosen and node d is and will be able to see a list of all possible
calculated. crops, which will aid them in deciding
3. Using the split method, split the which crop to cultivate. Furthermore, this
nodes into daughter nodes. method takes into account previous data
4. Repeat steps 1 to 3 until several production, allowing the farmer to gain
nodes have been reached. insight into market demand and costs for
5. To make an n number of trees, repeat particular crops. The user-friendly web page
steps 1 to 4 for an n number of times. built for estimating crop yield can be
utilized by any user with their choice of the
crop by giving climate data for that location.
To perform prediction using the trained
random forest algorithm uses the below VIII. References:
pseudocode as shown in figure 1:
[1] R. Ghadge, J. Kulkarni, P. More, S.
1. We used the test features and each Nene, and R. L. Priya, “Prediction of crop
random decision tree to predict the yield using machine learning,” Int. Res. J.
output and the outcome, which was Eng. Technology, vol. 5, 2018.
then saved.
2021 4th International Conference on Computing and Communications Technologies (ICCCT) 615
Authorized licensed use limited to: M S RAMAIAH INSTITUTE OF TECHNOLOGY. Downloaded on May 03,2025 at 05:52:24 UTC from IEEE Xplore. Restrictions apply.
[2]. J.Jeong, J.Resop, N.Mueller, and the rest 2020. Data Mining Based Marketing
of the crew - Random forests used to Decision Support System Using Hybrid
forecast global and regional crop yields. Machine Learning Algorithm.
PLoS ONE in a peer-reviewed journal.
[9] “Crop yield prediction using deep neural
[3] Crop Yield Prediction Using Machine networks,” by S. Khaki and L. Wang. pp.
Learning Algorithms, Aruvansh Nigam, 621 in Frontiers in Plant Science, vol. 10,
Saksham Garg, Archit Agrawal, Parul 2019.
Agrawal,2019 Fifth International
Conference on Image Information
Processing (ICIIP), pp 125-130.
616 2021 4th International Conference on Computing and Communications Technologies (ICCCT)
Authorized licensed use limited to: M S RAMAIAH INSTITUTE OF TECHNOLOGY. Downloaded on May 03,2025 at 05:52:24 UTC from IEEE Xplore. Restrictions apply.