Diet Recommendation System
Diet Recommendation System
net/publication/376517685
CITATIONS READS
0 778
1 author:
Mohammed Shaheel
Bournemouth University
1 PUBLICATION 0 CITATIONS
SEE PROFILE
All content following this page was uploaded by Mohammed Shaheel on 15 December 2023.
IV. METHODOLOGY
The novelty in this approach is that the machine learning
model will learn based data on USDA food data base B. Data Preprocessing
where up-to-date, vendor-specific nutrition, allergen and
ingredient information of varous food items are • Normalization: Machine learning models perform
available. The idea was to run different machine learning better when all of the values are brought into a
algorithms on this data to learn what category of food single scale through the use of min-max
item would be appropriate for a particular user based on normalisation. Because the dataset contains a
values he given through the front end and predict a food variety of scale values—some are single digit
values, some features have two digest values, and
item suitable for the user.
some features have three digest values—
normalization is the process of condensing the
A. Datasets dataset to a single range of values. The min-max
scaling technique is used in this study to normalise
data that fall between 0 and 1 on the scale.
The recommendation system is based on USDA nutrition
information database. Every food item's nutrition information
is maintained in the USDA database. The USDA Foods • Data encoding is carried out following the removal
Database is available for state agencies and school districts to of inconsistent and duplicate values from the
access up-to-date, vendor-specific nutrition, allergen and dataset. Subsequently, the nominal attributes are
ingredient information for direct delivered USDA Foods for converted into numerical values. Performance will
child nutrition programs offered on the USDA FOOD be improved since machine learning models'
AVAILABLE LIST[8] .A sample of dataset being used are backend operations are carried out on numerical
shown in Fig 1 and Fig 2. values prior to being implemented using a machine
learning model. Before being given to data
encoding process, the non-numeric data included in
the database was converted to numeric data. Using
machine learning techniques, backend
computations were performed on numerical values
instead of nominal values prior to supplying input
to the suggested model..
C. Feature Selection.
For feature selection we make use of recursive feature
elimination approach. In this approach a ranking process is
introduced .Feature are ranked based on model’s coefficient
or feature’s importance approach further fits the model and
starts to eliminate each feature based on lowest ranking
through a recursive fashion. The process involve developing
a model with remaining features after repeatedly removing
least significant parts until desired number of features is
obtained.[9] RFE attempts to eliminate any dependencies or
collinearity that may exist in the model.
D.Machine Learning
1) K-means clustering is a popular unsupervised
machine learning algorithm in various applications, including
diet recommendation systems. K means clustering can be
used to put individuals who are similar together in a dataset
of users who have each disclosed information about their
dietary restrictions, allergies, and food preferences. For each
cluster, this can assist in creating customised meal plans,
recipes, and diet regimens. Each user is assigned to the closest
cluster using K-means clustering, which is based on how
similar they are.
2) Random Forest is a popular machine-learning
algorithm used in various content basedrecommendation
systems. It works especially well for tackling non-linear and
high-dimensional situations[10]. A supervised learning
algorithm that is a member of the decision tree family is
called Random Forest. To produce a more reliable and
accurate prediction model, Random Forest integrates several
decision trees, in contrast to conventional decision trees,
which only use one tree. Using Random Forest, a
recommendation system can build a model that forecasts a
user's preferences based on their prior behaviour as well as
the behaviour of users who are similar to them. Large datasets
can be analysed by the algorithm, which can then be utilised
to find trends and insights to provide tailored suggestions.
3) Decision Tree: A decision tree's primary objective is
to divide a population of data into more manageable
segments. They are supervised learning methods without
parameters that are applied to regression and classification.
The goal is to build a model that, by learning basic decision
rules deduced from the data's attributes, can forecast the value
of an output class.
4) The Naive Bayes approach, which is based on the
Bayes theorem, is an efficient supervised learning strategy for Fig. 3.Workflow of the proposed model
classification tasks. It’s a probabilistic classifier, which
means it generates predictions based on the chances of
something existing.The core idea of this classifier is based on
bayes theorem. In simple terms, a Naive Bayes classifier
assumes that the presence of a particular feature in a class is VI.MODEL EVALUATION
unrelated to the presence of any other feature. For model evaluation mainly four various assessment metrics
are used
X . ACKNOWLEDGMENT