0% found this document useful (0 votes)

19 views18 pages

Processes 11 01507 v2

This research introduces AutoML techniques for predictive maintenance in Industrial IoT, specifically focusing on fault classification of ball bearings. Two models, AutoML using PyCaret and Auto Deep Neural Network with AutoKeras, were evaluated using the CWRU bearing faults dataset, showing superior performance compared to traditional methods. The study highlights the potential of AutoML to streamline maintenance processes, reduce costs, and improve operational efficiency in industries facing challenges from unplanned downtimes.

Uploaded by

noor

We take content rights seriously. If you suspect this is your content, claim it here.

Available Formats

Download as PDF, TXT or read online on Scribd

0% found this document useful (0 votes)

19 views18 pages

Processes 11 01507 v2

Uploaded by

noor

We take content rights seriously. If you suspect this is your content, claim it here.

Available Formats

Download as PDF, TXT or read online on Scribd

You are on page 1/ 18

processes

Article
Improved Fault Classification for Predictive Maintenance in
Industrial IoT Based on AutoML: A Case Study of
Ball-Bearing Faults
Russul H. Hadi 1 , Haider N. Hady 2 , Ahmed M. Hasan 1 , Ammar Al-Jodah 3, * and Amjad J. Humaidi 1

1 Control and Systems Engineering Department, University of Technology, Baghdad 10001, Iraq;
russul.h.hadi@uotechnology.edu.iq (R.H.H.); 60163@uotechnology.edu.iq (A.M.H.);
amjad.j.humaidi@uotechnology.edu.iq (A.J.H.)
2 Production Engineering and Metallurgy, University of Technology, Baghdad 10001, Iraq;
haider.n.hady@uotechnology.edu.iq
3 School of Physics, Maths, and Computing, The University of Western Australia, Perth, WA 6907, Australia
* Correspondence: ammar.al-jodah@uwa.edu.au

Abstract: The growing complexity of data derived from Industrial Internet of Things (IIoT) systems
presents substantial challenges for traditional machine-learning techniques, which struggle to ef-
fectively manage the needs of predictive maintenance applications. Automated machine-learning
(AutoML) techniques present a promising solution by streamlining the machine-learning process,
reducing the necessity for manual hyperparameter tuning and computational resources, thereby
positioning themselves as a potentially transformative innovation in the Industry 4.0 era. This re-
search introduces two distinct models: AutoML, employing PyCaret, and Auto Deep Neural Network
(AutoDNN), utilizing AutoKeras, both aimed at accurately identifying various types of faults in
ball bearings. The proposed models were evaluated using the Case Western Reserve University
(CWRU) bearing faults dataset, and the results showed a notable performance in terms of achieving
high accuracy, recall, precision, and F1 score on the testing and validation sets. Compared to recent
studies, the proposed AutoML models demonstrated superior performance, surpassing alternative
approaches even when they utilized a larger number of features, thus highlighting the effectiveness
Citation: Hadi, R.H.; Hady, H.N.; of the proposed methodology. This research offers valuable insights for those interested in harnessing
Hasan, A.M.; Al-Jodah, A.; Humaidi, the potential of AutoML techniques in IIoT applications, with implications for industries such as
A.J. Improved Fault Classification for
manufacturing and energy. By automating the machine-learning process, AutoML models can help
Predictive Maintenance in Industrial
decrease the time and cost related to predictive maintenance, which is crucial for industries where
IoT Based on AutoML: A Case Study
unplanned downtime can lead to substantial financial losses.
of Ball-Bearing Faults. Processes 2023,
11, 1507. https://doi.org/10.3390/
Keywords: AutoML; predictive maintenance; artificial intelligence; IIoT; fault classification; CWRU
pr11051507
bearing dataset; AutoKeras; PyCaret
Academic Editor: Xiong Luo

Received: 27 March 2023

Revised: 7 May 2023
Accepted: 9 May 2023 1. Introduction
Published: 15 May 2023 The Internet of Things (IoT) has revolutionized several industries, including manu-
facturing, by enabling the integration of physical and digital systems to enhance real-time
services [1,2]. This development has paved the way for Industry 4.0, which is propelled by
IoT and artificial intelligence (AI) [3]. A key advantage of automated industrial systems
Copyright: © 2023 by the authors.
within Industry 4.0 is the substantial growth in the volume of data that can be gathered
Licensee MDPI, Basel, Switzerland.
from sensors, transceiver devices, and data storage systems [4,5]. This data can be pro-
This article is an open access article
cessed and analyzed to yield valuable insights regarding equipment performance, thus
distributed under the terms and
facilitating a transition towards predictive maintenance (PdM) [6,7]. AI techniques can
conditions of the Creative Commons
Attribution (CC BY) license (https://
be employed to automatically extract information from collected historical data, which in
creativecommons.org/licenses/by/
turn can improve maintenance procedures and boost operational sustainability [8,9]. In
4.0/). this context, PdM is gaining prominence across various industries, owing to its potential

Processes 2023, 11, 1507. https://doi.org/10.3390/pr11051507 https://www.mdpi.com/journal/processes

Processes 2023, 11, 1507 2 of 18

to decrease maintenance expenses and prolong the service life of equipment [10,11]. AI
tools such as machine learning (ML) have the potential to further improve industrial sys-
tems by enhancing decision-making capabilities in real-time scenarios [12]. ML is a field
of study that empowers computers to learn without explicit programming. It leverages
computational techniques to extract information directly from data, bypassing the need
for predefined equations or models. Therefore, the integration of AI and IoT technologies,
such as ML and PdM, can play a significant role in improving industrial systems’ overall
efficiency and sustainability in Industry 4.0 [3,7].
Rolling-element bearings (REBs) are a prime example of components that require
diligent PdM, as they are susceptible to various damages caused by the harsh operating
conditions they endure, such as high speed, heavy load, extreme temperatures, and con-
tamination. An unexpected bearing fault can lead to substantial financial losses and catas-
trophic failures, resulting in a breakdown of an entire mechanical system. Consequently,
conducting an REB fault diagnosis is essential for preventing accidents and ensuring safe
operation [13].
This paper aims to enhance the maintenance process of REBs by automating the
detection and classification of possible faults that may occur during operation. AutoML
models were proposed to provide a user-friendly methodology for non-expert users in
the manufacturing industry. Such models streamline the selection of machine-learning
algorithms or deep neural network architecture that is most suitable for a given dataset
and task. Thus, they greatly reduce the need for human intervention and expertise.
The rest paper is ordered as follows: Section 2 provides a literature review of the
related works. Section 3 introduces the categories of maintenance management and focuses
on PdM. Section 4 describes the proposed methodology. Section 5 presents an overview
of the used dataset. Section 6 discusses the experimental results. Finally, Section 7 lists
the conclusions.

2. Related Works
In recent years, several studies have utilized ML models to reduce downtime which
ultimately results in improving the efficiency of production processes. These studies can be
grouped into three main categories, which are summarized in this section. The first category
focuses on using ML for PdM, where algorithms are used to predict equipment failures
before they actually occur. The second category investigates the utilization of AutoML
techniques. AutoML automates model selection, hyperparameter tuning, and feature
engineering. This allows non-experts to build high-performance models with slight effort.
AutoML facilitates early detection of equipment failures. This early detection reduces
downtime, and improves productivity in PdM. Incorporating AutoML with IoT systems
allows predictive decision-making and real-time monitoring, which make it a crucial tool
for Industry 4.0. [14–16]. Finally, the third category investigates the application of ML in
the CWRU bearing faults dataset, which is widely used for benchmarking PdM algorithms.
Focusing on the first category, a PdM approach aimed at diagnosing critical failures
in medical equipment was proposed in [17]. The approach relied on understanding the
physics of failure, real-time IoT data collection, and ML for fault prediction. The approach
was applied to a case study of a Vitros-Immunoassay analyzer and proved to provide
significant cost savings and a short investment payback period. However, it is limited to
certain failure modes and parameters and requires sufficient data for accurate predictions.
Lee et al. [9] discussed the use of AI-based algorithms for monitoring the cutting tool
wear and spindle motor bearing failures where the support vector machine (SVM) and
artificial neural network (ANN) methods were used. Another study in [18] presented a
machine-learning approach based on random forest for the maintenance of electric motors.
The system was tested on a real industry example, and preliminary results showed high
accuracy in predicting different machine states. Nasser and Al-Khazraji [19] proposed a
hybrid convolution neural network and long short-term memory networks (CNN-LSTM)
approach for fault prediction and diagnosis. In [20], a deep learning model called causal
Processes 2023, 11, 1507 3 of 18

augmented convolution network (CaConvNet) is proposed for long-sequence time-series

prediction. While the model outperformed its counterparts in the literature, it faced
limitations such as a complex architecture that can result in extended training durations
and increased computational expenses.
In comparison to the aforementioned studies, the proposed work aims to apply Au-
toML to develop a PdM model. The approach intends to optimize the ML pipeline, in-
cluding feature selection, algorithm selection, and hyperparameter tuning, to enhance the
model’s accuracy and reduce the time and effort required to develop the model.
Numerous recent studies have emerged in the field of PdM utilizing AutoML.
Leite et al. [21] presented a model for real-time fault detection and diagnosis (RT-FDD)
in discrete manufacturing machines (DMMs), which compared 16 ML classification algo-
rithms such as Extra Trees and Random Forest. Tornede et al. [22] proposed a remaining
useful lifetime (RUL) estimation as a co-operative coevolutionary algorithm. Cinar et al. [23]
implemented a new PdM system using a set of key performance indicators (KPIs) and
metrics for enhancing performance-monitoring processes. In [24], the study explored the
potential of using AutoML on real-world data. Garouani et al. [25] presented a framework
of AutoML for researchers aiming to engage industry 4.0 with the field of smart manufac-
turing. Finally, in [26], the researchers analyzed the features of the constant current (CC)
and constant voltage (CV) phases for making the life prediction and capacity estimation of
lithium-ion batteries (LIBs).
For the third category, this work aims to improve the diagnostic process and manage
failures more effectively. To this end, AutoML approach is utilized for failure prediction
in REBs. AutoML automates the section and tunning of ML models that are most suit-
able for a given dataset and task [27]. Several ML algorithms based on extracted features
from the vibration dataset of the CWRU Laboratory were used for various fault classifi-
cations [28]. Some recent and related studies were used for comparative analysis with
this work. Wen et al. [29] developed a new deep transfer learning (DTL) method for fault
diagnosis. The work in [30] demonstrated the effectiveness of combining signal-processing
methods with ML techniques. The work presented by Sharma et al. [31] aimed to detect
and classify faults in various industries. Five different algorithms were used, and their
performance was compared on different datasets. In [32], the use of fusion models and
algorithms for multisource sensing data was investigated, and only four features were
utilized. Jian et al. [33] proposed a one-dimensional fusion neural network (OFNN) method
for the intelligent diagnosis of faults, which was applied with a wide kernel and combined
with the Dempster–Shafer evidence theory. The work in [34] utilized some ensemble-
learning algorithms, including gradient-boosting classifiers, bagging, and extra tree, as
diagnostic techniques. Wen et al. [35] presented the transfer CNN based on ResNet-50
with the depth of 51 convolutional layers. Wang et al. [36] introduced a CNN model
which constructed a signal-to-image conversion method based on fault bearing vibration
characteristics propagating along the space. Han and Jeong [37] proposed a weighted
arithmetic mean CNN ensemble model to improve the stability of CNN models. The study
in [38] showed that ML models can effectively diagnose REB faults. The study found that
the k-nearest neighbors (k-NN) and SVM classifiers performed the best, using specific
frequency-domain and time-domain features. However, the study used an imbalanced
dataset which could potentially bias the classification results. Rajput et al. [39] developed a
method called fuzzy convolution neural network (FCNN). While the method accurately
diagnosed faults in rotating parts and successfully classified different types of faults, it
faced difficulties in detecting outer-race faults due to imbalanced data.
To ensure unbiased results, the current study used a balancing technique to overcome
this challenge.
According to [40], a medium Gaussian SVM was proposed. The study utilized vi-
bration signals collected on-site, extracted features, and clustered and classified them for
motor health classification. Different Gaussian kernel functions were analyzed for their
impact on SVM performance. The study identified a limitation of the medium Gaussian
Processes 2023, 11, 1507 4 of 18

SVM, which was reduced performance on high-dimensional data and sensitivity to the
choice of hyperparameters. In this paper, it is assumed that the features extracted by
Lin [40] are available and that the feature-engineering process was already performed.
Despite recent studies primarily focusing on advancing PdM processes and integrating
smart sensors into critical instruments and machinery used in manufacturing plants, there
is still room for improvement in the journey toward the Industry 4.0 revolution in line
with the IoT concept. ML algorithms have played a crucial role in developing accurate
models for PdM. However, some challenges persist, such as high computational costs and
redundant model information, while minimizing human intervention. In response to these
issues, adopting AutoML algorithms, which embody a pipeline model that automatically
fine-tunes hyperparameters, presents a promising solution. The primary contributions of
this study are as follows:
i. Development of AutoML-based prediction algorithms (PyCaret and AutoKeras) for
application on REB fault datasets;
ii. Design of a preprocessing algorithm to enhance the prediction process’s performance;
iii. Conducting of a comparison study between the proposed prediction algorithms;
iv. Comparative analysis of the models proposed in this research against prior works to
showcase their effectiveness in addressing the same case study.

3. Predictive Maintenance
Among the different categories of maintenance management policies, PdM is recog-
nized as the most recent and holds substantial value in comparison to traditional poli-
cies [41]. According to the literature, these policies can be classified into three primary
categories based on the strategies used [42]:
• Corrective maintenance, also known as run-to-failure (R2F), is a straightforward
strategy that involves addressing equipment issues only when they cease to function,
often necessitating the replacement or repair of specific components.
• Preventive maintenance (PvM) is a scheduled maintenance strategy carried out pe-
riodically at predetermined intervals. While this approach is effective in preventing
equipment failure, it may also result in unnecessary costs for corrective maintenance.
• PdM is a strategy that involves continuous system monitoring to anticipate potential
failures using a combination of machine-learning techniques, integrity factors, engi-
neering approaches, and statistical inference methods. Zonta et al. [43] define PdM
as models that rely on historical data and domain knowledge, enabling advanced
failure anticipation using statistical or machine-learning algorithms. This approach
ultimately improves decision making related to maintenance activities and helps pre-
vent downtime. The evolution of IoT, sensing technology, and AI has facilitated a shift
in maintenance strategies from R2F to PvM, and, finally, to PdM [44].

4. Proposed Methodology
This study involves developing a PdM model with the goal of improving maintenance
strategies and minimizing the impact of equipment malfunctions in an Industry 4.0 context.
The methodology flowchart, depicted in Figure 1, illustrates the general workflow of the
study. The process begins with loading the data, followed by a data-preprocessing phase
that prepares the dataset for model application. Next, two different AutoML models,
PyCaret and AutoKeras, are employed. PyCaret is utilized to develop an efficient ML
model based on the training dataset, while AutoKeras is used to construct a deep neural
network model. During the prediction phase, the data are divided into two sets: one for
testing and the other for validation with unseen data. Finally, a comparison is made using
specific evaluation metrics to identify the best-performing model, ultimately leading to
the selection of the PdM model that excelled in the classification problem of the adopted
case study. It is important to highlight that in this work, the data-preprocessing step—
which includes data sampling, balancing, and encoding categorical data—is carried out
before training the models. The capability to perform preprocessing is possessed by both
Processes 2023, 11, 1507 5 of 18

models. However, the decision was made to have preprocessing executed separately to
ensure compatibility, maintain consistency, and, ultimately, enhance performance. Figure 2
presents a detailed overview of the modeling phase, depicting the several steps involved in
training both PyCaret and AutoKeras models.

Figure 1. General proposed methodology flowchart.

Figure 2. Detailed overview of the modeling phase.

4.1. Data Preprocessing

The dataset employed in this study comprises nine features that were derived from
the original vibration data, representing 10 categories of labels, as provided by [40]. The
dataset consists of 230 labels for each category, yielding a total of 2300 data points. The
proposed methodology involves conducting data preprocessing to prepare the dataset
for modeling. This process aims to ensure the accuracy and reliability of ML models in
predicting equipment failures. Given that the dataset already contains extracted features,
the preprocessing primarily involves the normalization, balancing, and encoding of the
categorical data. Normalization is applied to numerical features to scale them to a common
range, which is essential for ML models that rely on distance measures. Data balancing is
performed to avoid bias towards the majority class, which could result in poor performance
when detecting the minority class. Numerous studies have demonstrated that normaliza-
tion and data balancing significantly improve the performance in various applications,
including PdM (e.g., [45,46]). The categorical target variable is encoded to facilitate its use
Processes 2023, 11, 1507 6 of 18

in ML algorithms. Algorithm 1 outlines the process of preparing the dataset for modeling
by performing data preprocessing. This results in a preprocessed dataset that is ready to be
used as input for either the PyCaret or AutoKeras models.

Algorithm 1 Data Preprocessing.

Input: CWRU dataset (d), input feature columns (f), output target (t)
Output: Preprocessed dataset (pd)
1. Normalize: Feature normalization (d)
2. X ← f
3. Y← t
4. Xn ← normalize(X)
5. Balance: Dataset balancing (Xbal,Ybal)
6. Xbal,Ybal ← balance(Xn, Y)
7. Map: Mapping categorical target from 0 to 9
8. ymap ← map(Ybal)
9. Split: Splitting dataset into training, validation, and test sets (sd)
10. X_train_val, X_test, y_train_val, y_test ← (Xbal, ymap, test_size = 0.05)
11. X_train, X_val, y_train, y_val ← (X_train_val, y_train_val, test_size = 0.2)
12. Return pd ← (X_train, y_train, X_val, y_val, X_test, y_test)

4.2. AutoML (PyCaret) Model

The primary objective of this model is to identify and prevent equipment failures by
analyzing real-time data collected from the system. Utilizing the power of AutoML, the
model employs a range of ML techniques and optimization algorithms to understand and
adapt to the system’s behavior, ultimately enhancing the accuracy and efficiency of the
maintenance process. By incorporating the capacity to learn and adapt from past experi-
ences, this model seeks to minimize the time and resources required for maintenance tasks,
leading to cost savings and improved productivity. PyCaret is a machine-learning library
that requires few lines of code and makes the machine-learning processes more streamlined
by automating tasks such as data preparation, model selection, hyperparameter tuning, and
deployment. It supports various machine-learning tasks and offers a user-friendly interface
for data visualization and model interpretation [46–48]. In this research, the power of the
PyCaret library for constructing and evaluating ML models to predict equipment failures
is demonstrated. In this model, the compare function was used to evaluate a wide range of
advanced ML algorithms automatically. To assess the model’s generalization ability, the
dataset was divided into three subsets. A portion of 5% was reserved for validation to
simulate the model’s performance on unseen data. The remaining 95% was then divided
into the 80% training set and 15% testing set to ensure the model was trained on a diverse
and sufficient dataset. An automated process was employed for selecting the best algo-
rithm and tuning its hyperparameters. This process entailed evaluating various models on
the training set and choosing the best-performing one based on evaluation metrics. The
selected model was further optimized by tuning its hyperparameters using cross-validation
on the training set. Finally, the model’s performance was assessed on the test and validation
sets with the optimized hyperparameters, and the results were compared to those obtained
from the non-tuned model.

4.3. AutoDNN (AutoKeras) Model

AutoKeras automates the selection of optimal hyperparameters and network architec-
ture for a given dataset, saving significant time and effort. It also simplifies the process of
data preparation and model selection by providing built-in neural network models and
preprocessing techniques [49,50]. The model was initialized with a maximum of 15 trials
and trained for 32 epochs.
Algorithm 2 outlines the essential steps for selecting the best PdM model, which is
based on a comparison of accuracy (acc), precision (prec), recall (rec), F1 score (f1), and
confusion matrix (cm).
Processes 2023, 11, 1507 7 of 18

Algorithm 2 Best Model Selection.

Input: Preprocessed dataset (pd), AutoML and AutoDNN models
Output: Best Auto Predictive Maintenance Model (bAutoM)
1. Train and evaluate AutoML Models:
2. AutoML ← train_ AutoML (X_train, y_train)
3. AutoML _metrics ← evaluate_model (AutoML, X_val, y_val)
4. Train and evaluate AutoDNN Model
5. AutoDNN ←train_AutoDNN(X_train, y_train)
6. AutoDNN _metrics = evaluate_model(AutoDNN, X_val, y_val)
7. Model selection based on evaluation metrics (em):
8. begin
9. em←(acc, prec, rec, f1,cm)
10. Best_evaluation_metrics (best_em)
11. BAutoM←None
12. best_em←[0, 0, 0, 0, None]
13. for i in range (len(em)):
14. if AutoML metrics[i] > AutoDNN_metrics[i]
15. if AutoML_metrics[i] > best_em[i]
16. BAutoM←AutoMl
17. end if
18. end if
19. else if
20. AutoDNN_metrics[i] > best_em[i]
21. BAutoM←AutoDNN
22. end if
23. end for
24. Return BAutoM
25. end

5. Case Study
Due to rapid advancements in science and technology, electric machines are widely
used in manufacturing applications. Consequently, these machines often operate under
unfavorable conditions, such as excessive loads and humidity, necessitating maintenance
to prevent motor breakdowns. Key components to consider in the maintenance process
include stators, shafts, rotors, and bearings of rotating machines [51].
Bearings are crucial rolling elements in machines, and any changes in their health con-
ditions, such as operating under varying loads, directly impact the efficiency, performance,
lifespan, and stability of the machines [52]. Figure 3 depicts the components of an REB,
which include the inner race (IR), outer race (OR), ball, and cage [53,54].

Figure 3. REB: exploded and geometric view [54].

As illustrated in Figure 4, the test system comprises a 2 hp motor, a torque trans-

ducer/encoder, a dynamometer, and control electronics. The fault test is implemented at
the fan-end bearing to support the motor shaft. The CWRU dataset includes vibration data
collected using accelerometers attached to both the drive end and fan end of the motor
Processes 2023, 11, 1507 8 of 18

housing. In some experiments, an additional accelerometer was also attached to the motor
base plate for support. A 16-channel digital audio tape (DAT) recorder was used for data
collection [55].
Vibration signals contain valuable information about the health of the equipment. By
extracting features such as the maximum value (max: measure of the highest magnitude
of the signals), minimum value (min: measure of the lowest magnitude of the signals),
mean (measure of the central tendency of the signals), standard deviation (sd: measure
of the spread of the signals), root mean square (rms: measure of the average magnitude
of the signals), skewness (measure of the symmetry of the signals), kurtosis (measure of
the peakedness of the signals), crest factor (measure of the peak-to-peak magnitude of
the signals), and form factor (measure of the shape of the signals) from these signals, it is
possible to identify patterns that may indicate potential equipment failures. These features
provide information about the level, variability, symmetry, peakedness, and shape of the
signals, and can be used to train ML models for PdM tasks. In addition to these features,
other characteristics of the signals such as frequency, time, and waveform can also be
analyzed to improve the accuracy. The specific features that will be most effective for a
given task will depend on the characteristics of the equipment and the data available [56].

Figure 4. CWRU bearing system [55].

The dataset comprises a large collection of vibration signals from bearings subjected
to various fault conditions. In total, it contains 10,000 fault conditions, with each fault
condition represented by 230 vibration signals. The fault conditions are categorized by fault
type and severity. The fault types are inner-race faults, outer-race faults, and ball faults.
The fault severity levels are small, medium, and large. In addition to the fault conditions,
the dataset also contains normal conditions. The normal conditions represent bearings
without any faults. Table 1 provides a summary of the fault and normal conditions. The
table includes the types of faults, levels of severity, and the corresponding abbreviations
used [36].

Table 1. Fault conditions in the CWRU bearing dataset.

Fault Type Severity Description Abbreviation

Small (7 mils) IR_007_1
Inner race Medium (14 mils) Fault in the inner race of the bearing IR_014_1
Large (21 mils) IR_021_1
Small (7 mils) OR_007_6_1
Outer race Medium (14 mils) Fault in the outer race of the bearing OR_014_6_1
Large (21 mils) OR_021_6_1
Small (7 mils) Ball_007_1
Ball Medium (14 mils) Fault in the balls of the bearing Ball_014_1
Large (21 mils) Ball_021_1
Processes 2023, 11, 1507 9 of 18

The objective of this work is to develop a classification model that recognizes the
provided nine types of faults as classes. A tenth class called "Normal" is included to
represent a healthy bearing with no faults. This class serves as a reference for comparison
with the other fault classes and it has no specific fault size. The data for the Normal class
was collected from the same locations as the data for the other fault classes, which are the
drive end, the fan end, and the base [57].

6. Results and Analysis

In this section, the performance of the two proposed models is analyzed, and the
impact of hyperparameter tuning on the results is assessed using both the testing and
validation sets.

6.1. Data Preparation Process

The initial dataset was suffering from imbalances issues, as the majority of samples
belonged to the Normal class. The Random Under Sampler was applied to address this
imbalance. Afterward, the data were normalized by the Robust Scaler, which scales the
features using statistics that are robust to outliers. Finally, categorical values were mapped
to numerical values using dictionary mapping. Table 2 summarizes the steps involved in
the data-preprocessing process used in this research.

Table 2. Preprocessing operations applied on the fault classification dataset.

Preprocessing Operation Details

Data Balancing Random Under Sampler
Normalization Robust Scaler
Categorical mapping
‘IR_007_1’: 0, ‘IR_014_1’: 1, ‘IR_021_1’: 2,
Encoding ‘OR_007_6_1’: 3, ‘OR_014_6_1’: 4, ‘OR_021_6_1’: 5,
‘Ball_007_1’: 6, ‘Ball_014_1’: 7, ‘Ball_021_1’: 8,
‘Normal_1’: 9

6.2. AutoML and AutoDNN Validation Models

In the PyCaret model, the preprocessed data were used as input, and the setup function
provided by the PyCaret library was applied. The resulting configuration parameters are
illustrated in Table 3.

Table 3. AutoML (PyCaret) model configuration summary.

Description Value
Session id 8337
Target fault
Target type Multiclass
Original data shape (2185, 10)
Transformed data shape (2185, 10)
Transformed train set shape (1748, 10)
Transformed test set shape (437, 10)
Numeric features 9
Number of folds 10

The compare function was used to train and evaluate multiple machine-learning
algorithms using predefined metrics, including accuracy, recall, precision, and F1 score.
These metrics can be defined and calculated as follows [58,59]:
Accuracy is defined as the number of correct predictions made by the model over the
total number of predictions. It can be represented as:

Accuracy = (TP + TN)/(TP + TN + FP + FN) (1)

Processes 2023, 11, 1507 10 of 18

where TP (true positive) represents the number of positive cases correctly classified as
positive, while FP (false positive) refers to the number of negative cases wrongly classified
as positive. Similarly, FN (false negative) corresponds to the number of positive cases
mistakenly classified as negative, and TN (true negative) indicates the number of negative
cases correctly identified as negative.
The Recall metric measures the model’s ability to accurately identify all positive cases.
It is defined as the number of true positive predictions made by the model over the total
number of actual positive and it can be calculated by:

Recall = TP/(TP + FN) (2)

Precision measures the model’s ability in identifying positive cases correctly. It is

defined as the number of true positive predictions over the total number of positive
predictions, and it can be calculated as:

Prec.= TP/(TP + FP) (3)

Finally, the F1 score is a metric that balances the harmonic mean of recall and precision,
as follows:
F1 = 2 × (Precision × Recall)/(Precision + Recall) (4)
A confusion matrix is a table that summarizes the performance of a given machine-
learning classification model based on a set of test data. It indicates the TP, TN, FP, and FN
for each class label. The matrix is built by comparing the predicted class labels generated
by the model with the actual class labels in the test dataset. The rows represent the actual
class labels, whilst the columns represent the predicted class labels. A correctly classified
sample is a true positive or true negative, while a misclassified sample is a false positive or
false negative. The confusion matrix presents a comprehensive examination of the model’s
performance across all classes and can be applied to calculate other metrics such as accuracy,
precision, and recall.
Table 4 presents the results of the compare model function for the top five machine-
learning algorithms. Random forest (RF) achieved the best performance, followed by
gradient-boosting classifier (GBC), extra trees (ET), light gradient-boosting machine (Light-
GBM), and finally, extreme gradient boosting (XGBoost). In general, the evaluated models
achieved high accuracy, in the range of 95.94% to 96.34%. The recall scores were also high,
where all models achieved values greater than 95%. The RF model had the highest value
of 96.34%. Precision scores were from 96.12% to 96.52%, and the F1 score was the highest
for the RF model at 96.32%. For the computational time, the XGBoost model was the fastest,
with 0.0640 s training time (TT), on the other hand, the GBC model was the slowest, with
0.9490 s TT. The RF classifier was the best-performing model among those evaluated.

Table 4. Results of compare model function in PyCaret for the top five ML algorithms.

Model Accuracy (%) Recall (%) Prec. (%) F1 (%) TT (S)

RF 96.34 96.34 96.51 96.32 0.1610
GBC 96.28 96.28 96.52 96.28 0.9490
ET 96.23 96.23 96.43 96.22 0.1670
LightGBM 96.17 96.17 96.36 96.16 0.3560
XGBoost 95.94 95.94 96.12 95.92 0.0640

The next step was to fine-tune the best model (RF model) using PyCaret’s tune model
function. The auto-tuning process optimizes the hyperparameters of the model to further
improve its performance. The result of the auto-tuning process is shown in Table 5, which
displays the performance metrics of the model on each fold of the cross-validation.
Processes 2023, 11, 1507 11 of 18

Table 5. Performance metrics of auto-tuned RF algorithm.

Fold No. Accuracy (%) Recall (%) Prec. (%) F1 (%)

0 96.57 96.57 96.84 96.60
1 94.86 94.86 95.15 94.88
2 96.57 96.57 96.98 96.61
3 96.57 96.57 97.00 96.55
4 98.86 98.86 98.89 98.86
5 97.71 97.71 97.74 97.70
6 94.29 94.29 94.23 94.14
7 96.00 96.00 96.20 95.99
8 97.13 97.13 97.12 97.11
9 95.40 95.40 95.64 95.24
Mean 96.40 96.40 96.58 96.37
Std. 0.0128 0.0128 0.0126 0.0131

The tuned RF model showed a slight improvement in performance with accuracy,

recall, precision, and F1 score of 96.40%, 96.40%, 96.58%, and 96.37%, respectively. The
standard deviation values for these metrics were low, ranging from 0.0126% to 0.0131%, as
displayed in Table 5. The performance of the model on the testing set, as measured by the
predict model function, was excellent, with an accuracy, recall, and F1 score of 99.70%.
For the AutoKeras model, Figure 5 shows a histogram displaying the performance
metrics for the testing sets associated with different failure types in the proposed AutoDNN
model. The x-axis represents the different failure types, numbered from 0 to 9, and the
y-axis denotes the score, measured in terms of precision, recall, and F1 score. The plot
indicates that the F1 score for most failure types is relatively high, with scores ranging from
83% to 100%. However, the precision score for failure type 4 (OR_014_6_1) is relatively
low at 78%, and the recall score for failure type 7 (Ball_014_1) is also relatively low at 83%.
The weighted average score for the performance metrics, in addition to the total accuracy,
is 95%.

Figure 5. Performance of proposed AutoDNN model on testing sets.

The architecture of the AutoDNN model generated by AutoKeras is illustrated in

Figure 6. The model comprises an input layer that accepts data with nine features. The
input data undergoes processing via a multi-category encoding layer for data preprocessing,
followed by a normalization layer to ensure that all inputs have a consistent scale. The
processed data is then fed into two dense layers, each with 32 neurons. Batch normalization
and rectified linear unit (ReLU) activation are applied between the dense layers. The output
layer consists of a dense layer with 10 neurons, followed by a softmax activation layer
for classification.
Processes 2023, 11, 1507 12 of 18

Figure 6. Proposed AutoDNN model architecture generated by the AutoKeras model.

6.3. Model Evaluation

In Figure 7, the feature importance plot generated by the PyCaret evaluate model
function indicates that the standard deviation (sd) feature has the highest importance in
predicting fault diagnosis, followed by the root mean square (rms) and mean features.
The remaining features, including kurtosis, minimum, maximum, form factor, crest factor,

6
Processes 2023, 11, 1507 13 of 18

and skewness, have relatively lower importance. Interestingly, the skewness feature is the
least important among them. These results suggest that prioritizing the sd, rms, and mean
features may lead to better classification performance.

Figure 7. Feature importance generated by Pycaret’s evaluate model function.

The performance of the proposed AutoDNN model, as presented in Figure 8, demon-

strates superior results on validation sets compared to testing sets. The overall weighted
average score, comprising total accuracy, reaches 97%, exceeding the corresponding score
of 95% for testing sets. Upon closer examination of individual failure types, the F1 scores
are generally high, ranging from 91% to 100%. However, there are few exceptions where
the precision score for failure type 4 is relatively lower at 89%, and failure type 7 displays a
comparatively lower recall score of 88%. Notably, these scores are only marginally inferior
to those obtained for the same failure types on the testing sets.

Figure 8. Performance of proposed AutoDNN model on validation sets.

The confusion matrices in Figure 9 show that the PyCaret model performed better
before tuning. The best RF model predicted all failure types accurately, while the tuned RF
model had misclassifications for failure types 2, 4, 5, 6, 7, and 8. Despite this, the tuned RF
model still achieved a good F1 score of 96.37% on the testing set.
Processes 2023, 11, 1507 14 of 18

(a) (b)

Figure 9. Confusion matrix of the best AutoML model: (a) random forest classifier before tuning; and
(b) random forest classifier after tuning.

The confusion matrices of the AutoKeras model in Figure 10 show exceptional per-
formance, accurately predicting the majority of the classes. The testing and validation
confusion matrices were identical, indicating that the model did not overfit the training
data. For both sets, all classes except class 4 had perfect precision and recall scores. How-
ever, for class 4, the precision was 89% for the validation set and 82% for the testing set,
and the recall was 94% for the testing set.

(a) (b)

Figure 10. Confusion matrices of the AutoDNN model: (a) testing sets; and (b) validation sets.

6.4. Results Discussion

Figure 11 shows the comparison between the two proposed AutoML models on both
testing and validation sets. The first plot displays the performance on the testing set, where
the PyCaret model achieves an impressive score of 99.70% for all metrics. In contrast, the
AutoKeras model has slightly lower scores of around 95%. The second plot displays the
performance on the validation set. The best model (RF model) scores 95.60% for all metrics,
while the AutoDNN model outperforms with a score of 97%.

(a) (b)

Figure 11. Performance comparison between the two proposed models: (a) testing sets; and
(b) validation sets.

9
Processes 2023, 11, 1507 15 of 18

Table 6 presents a summary of the performance of various ML models employed for

fault classification in ball bearings, including the proposed AutoML models and traditional
machine-learning methods from previous studies. The proposed AutoML model (PyCaret)
utilizing the best machine-learning model (random forest) achieved an impressive accuracy
of 99.70% on the testing sets and 95.60% on the validation sets, using only nine features.
Comparatively, the best results achieved by other studies are those from Wang et al. [36]
with a CNN achieving 99.92% accuracy using four features, and Rajput et al. [39] with
a Fuzzy-CNN achieving 99.87% accuracy using 16 features. The CNN and Fuzzy-CNN
models may have slightly higher accuracy, but the proposed AutoML model has several
advantages that make it a more attractive choice. One of the primary benefits of AutoML
models is their ability to automate the entire machine-learning process, including data
preprocessing, model selection, and hyperparameter tuning. On the other hand, the CNN
and Fuzzy-CNN machine-learning methods demand significant manual effort, including
feature engineering and meticulous selection of layers, number of neurons, kernels, pooling
size, and activation functions.
Furthermore, the design of Fuzzy-CNN models with effective fuzzy rules and mem-
bership functions requires expert knowledge of fuzzy logic. This necessitates a significant
amount of domain knowledge of the application in question and its data. Thus, the need
for highly skilled professionals and human intervention is greatly increased. While it is
true that CNN and Fuzzy-CNN models may have slightly higher accuracy, the AutoML is
more perforable due to its ease of use and automation ability. In addition, the Fuzzy-CNN
model developed in [39] achieved a comparable accuracy level, but with a considerably
larger feature set. This indicates that the proposed AutoML model may provide a more
efficient and effective solution. By using fewer features, the AutoML model streamlines
the process, potentially leading to faster training and reduced computational resources
without sacrificing accuracy. The proposed AutoDNN model (AutoKeras) demonstrates
competitive performance, achieving an accuracy of 95.00% on the testing sets and 97.00%
on the validation sets. While this model does not surpass the CNN model in [36] or the
Fuzzy-CNN model in [39] in terms of accuracy, it still significantly outperforms the majority
of other methods presented in the table. Consequently, the AutoDNN model (AutoKeras)
offers a valuable alternative for those seeking a reliable and efficient machine-learning
solution with less effort.

Table 6. Summary of results comparison with recent relevant studies.

Author Method Accuracy (%) Number of Features

Medium Gaussian SVM 96.00
Coarse Gaussian SVM 93.60 9
Lin [40] Fine Gaussian SVM 89.60
rms 72.60
sd 71.30
back-propagation neural network (BPNN) 91.60
Haung et al. [32] (training set of 75%) radial basis function neural network (RBFNN) 83.60 4
wavelet neural network (WNN) 84.80
Wang et al. [36] CNN 99.92 4
SVM 84.70 16
Fulgencio et al. [38]
CNN 90.60 16
Rajput et al. [39] Fuzzy-CNN 99.87 16
Best ML:RF (testing sets) 99.70
Proposed AutoML model (PyCaret)
Best ML:RF (validation sets) 95.60
9
AutoDNN (testing sets) 95.00
Proposed AutoDNN model (AutoKeras)
AutoDNN (validation Sets) 97.00
Processes 2023, 11, 1507 16 of 18

7. Conclusions
This study focused on developing an automated approach for accurately classifying
different types of faults in industrial IoT ball bearings using the CWRU dataset. The
study aimed to investigate the potential of AutoML techniques for predictive maintenance
while minimizing the need for manual hyperparameter tuning. The experimental results
demonstrated that both the proposed AutoML and AutoDNN models effectively achieved
accurate fault classification. Remarkably, the top-performing AutoML model attained an
impressive 99.7% accuracy, recall, precision, and F1 score on the testing sets, with random
forest emerging as the best algorithm. However, the proposed AutoDNN model displays
better accuracy on the validation set, scoring 97% as opposed to AutoML’s 95.60%. The
study underscores the benefits of employing AutoML techniques, enabling non-experts
in the industry to handle predictive maintenance tasks more efficiently. Consequently,
AutoML offers advantages such as automation, improved accuracy, and reduced resource
requirements. Future work could explore integrating advanced feature-engineering tech-
niques and domain knowledge to further enhance the model’s performance.

Author Contributions: Conceptualization, R.H.H., H.N.H. and A.M.H.; data curation, R.H.H. and
H.N.H.; formal analysis, R.H.H. and H.N.H.; investigation, R.H.H. and H.N.H.; methodology, R.H.H.,
H.N.H. and A.M.H.; resources, R.H.H. and H.N.H.; software, R.H.H. and H.N.H.; supervision, A.J.H.
and A.A.-J.; validation, A.J.H. and A.A.-J.; visualization, R.H.H. and H.N.H.; writing—original draft,
R.H.H., H.N.H. and A.M.H.; writing—review & editing, A.J.H. and A.A.-J. All authors have read and
agreed to the published version of the manuscript.
Funding: This research received no external funding.
Institutional Review Board Statement: Not applicable.
Informed Consent Statement: Not applicable.
Data Availability Statement: Not applicable.
Conflicts of Interest: The authors declare no conflict of interest.

References
1. Nasser:, A.R.; Hasan, A.M.; Humaidi, A.J.; Alkhayyat, A.; Alzubaidi, L.; Fadhel, M.A.; Santamaría, J.; Duan, Y. IoT and Cloud
Computing in Health-Care: A New Wearable Device and Cloud-Based Deep Learning Algorithm for Monitoring of Diabetes.
Electronics 2021, 10, 2719. [CrossRef]
2. Alwan, N.F.; Al-Nuaimi, M.K. The Role and Important of Internet of Things in Building Sustainable City. Eng. Technol. J. 2018, 36,
22–29. [CrossRef]
3. Nasser, A.R.; Azar, A.T.; Humaidi, A.J.; Al-Mhdawi, A.K.; Ibraheem, I.K. Intelligent Fault Detection and Identification Approach
for Analog Electronic Circuits Based on Fuzzy Logic Classifier. Electronics 2021, 10, 2888. [CrossRef]
4. Cao, Q.; Zanni-Merk, C.; Samet, A.; Reich, C.; Beuvron, F.D.B.D.; Beckmann, A.; Giannetti, C. KSPMI: A Knowledge-based System
for Predictive Maintenance in Industry 4.0. Robot. Comput. Manuf. 2021, 74, 102281. [CrossRef]
5. Natesha, B.; Guddeti, R.M.R. Fog-based intelligent machine malfunction monitoring system for industry 4.0. IEEE Trans. Actions
Ind. Inform. 2021, 17, 7923–7932.
6. Al-Khazraji, H.; Nasser, A.R.; Hasan, A.M.; Al Mhdawi, A.K.; Al-Raweshidy, H.; Humaidi, A.J. Aircraft Engines Remaining
Useful Life Prediction Based on A Hybrid Model of Autoencoder and Deep Belief Network. IEEE Access 2022, 10, 82156–82163.
[CrossRef]
7. Cardoso, D.; Ferreira, L. Application of Predictive Maintenance Concepts Using Artificial Intelligence Tools. Appl. Sci. 2020,
11, 18. [CrossRef]
8. Turner, S.; Jassin, S.S.; Hassan, A.K. Optimizing artificial neural networks using LevyChaotic mapping on Wolf Pack optimization
algorithm for detect driving sleepiness. Iraqi J. Comput. Commun. Control Syst. Eng. (IJCCCE) 2022, 22, 128–136.
9. Lee, W.J.; Wu, H.; Yun, H.; Kim, H.; Jun, M.B.; Sutherland, J.W. Predictive Maintenance of Machine Tool Systems Using Artificial
Intelligence Techniques Applied to Machine Condition Data. Procedia CIRP 2019, 80, 506–511. [CrossRef]
10. Abood, A.M.; Nasser, A.R.; Al-Khazraji, H. Predictive Maintenance of Electromechanical Systems Using Deep Learning Algo-
rithms: Review. Ing. Syst. Inf. 2022, 27, 1009–1017. [CrossRef]
11. Ayvaz, S.; Alpay, K. Predictive maintenance system for production lines in manufacturing: A machine learning approach using
IoT data in real-time. Expert Syst. Appl. 2021, 173, 114598. [CrossRef]
12. Fadhil, H.M.; Abdullah, M.N.; Younis, M.I. A Framework for Predicting Airfare Prices Using Machine Learning. Iraqi J. Comput.
Commun. Control Syst. Eng. 2022, 22, 81–96. [CrossRef]
Processes 2023, 11, 1507 17 of 18

13. Li, Y.; Wang, X.; Si, S.; Huang, S. Entropy Based Fault Classification Using the Case Western Reserve University Data:
A Benchmark Study. IEEE Trans. Reliab. 2019, 69, 754–767. [CrossRef]
14. Zhang, W.; Yang, D.; Wang, H. Data-Driven Methods for Predictive Maintenance of Industrial Equipment: A Survey. IEEE Syst. J.
2019, 13, 2213–2227. [CrossRef]
15. Tornede, T.; Tornede, A.; Wever, M.; Mohr, F. Automl for Predictive Maintenance: One Tool to Rul Them All, in IoT Streams for
Data-Driven Predictive Maintenance and IoT, Edge, and Mobile for Embedded Machine Learning; Springer: Berlin/Heidelberg, Germany,
2020; pp. 106–118.
16. Ferreira, L.; Pilastri, A.; Romano, F.; Cortez, P. Using supervised and one-class automated machine learning for predictive
maintenance. Appl. Soft Comput. 2022, 131, 109820. [CrossRef]
17. Shamayleh, A.; Awad, M.; Farhat, J. IoT Based Predictive Maintenance Management of Medical Equipment. J. Med. Syst. 2020, 44,
1–12. [CrossRef]
18. Paolanti, M.; Romeo, L.; Felicetti, A.; Mancini, A.; Frontoni, E.; Loncarski, J. Machine Learning approach for Predictive
Maintenance in Industry 4.0. In Proceedings of the 2018 14th IEEE/ASME International Conference on Mechatronic and
Embedded Systems and Applications (MESA), Oulu, Finland, 2–4 July 2018; pp. 1–6. [CrossRef]
19. Nasser, A.; Al-Khazraji, H. A hybrid of convolutional neural network and long short-term memory network approach to
predictive maintenance. Int. J. Electr. Comput. Eng. (IJECE) 2022, 12, 721–730. [CrossRef]
20. Ayodeji, A.; Wang, Z.; Wang, W.; Qin, W.; Yang, C.; Xu, S.; Liu, X. Causal augmented ConvNet: A temporal memory dilated
convolution model for long-sequence time series prediction. ISA Trans. 2022, 123, 200–217. [CrossRef]
21. Leite, D.; Martins, A.; Rativa, D.; De Oliveira, J.F.L.; Maciel, A.M.A. An Automated Machine Learning Approach for Real-Time
Fault Detection and Diagnosis. Sensors 2022, 22, 6138. [CrossRef]
22. Tornede, T.; Tornede, A.; Wever, M.; Hüllermeier, E. Coevolution of remaining useful lifetime estimation pipelines for automated
predictive maintenance. In Proceedings of the GECCO ’21: Genetic and Evolutionary Computation Conference, Lille, France,
10–14 July 2021; pp. 368–376. [CrossRef]
23. Cinar, E.; Kalay, S.; Saricicek, I. A Predictive Maintenance System Design and Implementation for Intelligent Manufacturing.
Machines 2022, 10, 1006. [CrossRef]
24. Ferreira, L.; Pilastri, A.; Sousa, V.; Romano, F.; Cortez, P. Prediction of Maintenance Equipment Failures Using Automated
Machine Learning. In Proceedings of the International Conference on Intelligent Data Engineering and Automated Learning,
Manchester, UK, 25–27 November 2021; Springer: Berlin/Heidelberg, Germany, 2021.
25. Garouani, M.; Ahmad, A.; Bouneffa, M.; Hamlich, M.; Bourguin, G.; Lewandowski, A. Towards Meta-Learning Based Data
Analytics to Better Assist the Domain Experts in Industry 4.0. In Artificial Intelligence in Data and Big Data Processing, Proceedings of
the International Conference on Artificial Intelligence and Big Data in Digital Era, Ho Chi Minh, Vietnam, 18–19 December 2021; Springer:
Berlin/Heidelberg, Germany, 2022. [CrossRef]
26. Luo, C.; Zhang, Z.; Qiao, D.; Lai, X.; Li, Y.; Wang, S. Life Prediction under Charging Process of Lithium-Ion Batteries Based on
AutoML. Energies 2022, 15, 4594. [CrossRef]
27. Escalante, H.J.; Tu, W.-W.; Guyon, I.; Silver, D.L.; Viegas, E.; Chen, Y.; Dai, W.; Yang, Q. AutoML @ NeurIPS 2018 Challenge: Design
and Results; Springer: Berlin/Heidelberg, Germany, 2019; pp. 209–229. [CrossRef]
28. Loparo, K.; Case Western Reserve University Bearing Data Center. Bearings Vibration Data Sets; Case Western Reserve University:
Cleveland, OH, USA, 2012; pp. 22–28.
29. Wen, L.; Gao, L.; Li, X. A New Deep Transfer Learning Based on Sparse Auto-Encoder for Fault Diagnosis. IEEE Trans. Syst. Man
Cybern. Syst. 2019, 49, 136–144. [CrossRef]
30. Magar, R.; Ghule, L.; Li, J.; Zhao, Y.; Farimani, A.B. FaultNet: A Deep Convolutional Neural Network for Bearing Fault
Classification. IEEE Access 2021, 9, 25189–25199. [CrossRef]
31. Sharma, N.; Sidana, T.; Singhal, S.; Jindal, S. Predictive Maintenance: Comparative Study of Machine Learning Algorithms for
Fault Diagnosis. In Proceedings of the International Conference on Innovative Computing & Communication (ICICC), Delhi,
India, 19–20 February 2022.
32. Huang, M.; Liu, Z.; Tao, Y. Mechanical fault diagnosis and prediction in IoT based on multi-source sensing data fusion. Simul.
Model. Pract. Theory 2019, 102, 101981. [CrossRef]
33. Jian, X.; Li, W.; Guo, X.; Wang, R. Fault Diagnosis of Motor Bearings Based on a One-Dimensional Fusion Neural Network. Sensors
2019, 19, 122. [CrossRef]
34. Gangavva, C.; Mangai, J.A. An Investigation of Ensemble Learning Algorithms for Fault Diagnosis of Roller Bearing. In Advances
in Parallel Computing Algorithms, Tools and Paradigms; IOS Press: Amsterdam, The Netherlands, 2022; pp. 117–125. [CrossRef]
35. Wen, L.; Li, X.; Gao, L. A transfer convolutional neural network for fault diagnosis based on ResNet-50. Neural Comput. Appl.
2019, 32, 6111–6124. [CrossRef]
36. Wang, J.; Wang, D.; Wang, S.; Li, W.; Song, K. Fault Diagnosis of Bearings Based on Multi-Sensor Information Fusion and 2D
Convolutional Neural Network. IEEE Access 2021, 9, 23717–23725. [CrossRef]
37. Han, S.; Jeong, J. An Weighted CNN Ensemble Model with Small Amount of Data for Bearing Fault Diagnosis. Procedia Comput.
Sci. 2020, 175, 88–95. [CrossRef]
Processes 2023, 11, 1507 18 of 18

38. Cascales-Fulgencio, D.; Quiles-Cucarella, E.; García-Moreno, E. Computation and Statistical Analysis of Bearings’ Time-and
Frequency-Domain Features Enhanced Using Cepstrum Pre-Whitening: A ML-and DL-Based Classification. Appl. Sci. 2022,
12, 10882. [CrossRef]
39. Rajput, D.S.; Meena, G.; Acharya, M.; Mohbey, K.K. Fault prediction using fuzzy convolution neural network on IoT environment
with heterogeneous sensing data fusion. Meas. Sensors 2023, 26, 100701. [CrossRef]
40. Lin, S.-L. Application of Machine Learning to a Medium Gaussian Support Vector Machine in the Diagnosis of Motor Bearing
Faults. Electronics 2021, 10, 2266. [CrossRef]
41. Selcuk, S. Predictive maintenance, its implementation and latest trends. Proc. Inst. Mech. Eng. Part B J. Eng. Manuf. 2017, 231,
1670–1679. [CrossRef]
42. Carvalho, T.P.; Soares, F.A.A.M.N.; Vita, R.; Francisco, R.D.P.; Basto, J.P.; Alcalá, S.G.S. A systematic literature review of machine
learning methods applied to predictive maintenance. Comput. Ind. Eng. 2019, 137, 106024. [CrossRef]
43. Zonta, T.; da Costa, C.A.; da Rosa Righi, R.; de Lima, M.J.; da Trindade, E.S.; Li, G.P. Predictive maintenance in the Industry 4.0:
A systematic literature review. Comput. Ind. Eng. 2020, 150, 106889. [CrossRef]
44. Ran, Y.; Zhou, X.; Lin, P.; Wen, Y.; Deng, R. A survey of predictive maintenance: Systems, purposes and approaches. arXiv 2019,
preprint. arXiv:1912.07383.
45. Qian, G.; Liu, J. A comparative study of deep learning-based fault diagnosis methods for rotating machines in nuclear power
plants. Ann. Nucl. Energy 2022, 178, 109334. [CrossRef]
46. Gain, U.; Hotti, V. Low-code AutoML-augmented data pipeline–a review and experiments. J. Phys. Conf. Ser. 2021, 1828, 012015.
[CrossRef]
47. Demidova, L.A. Recurrent Neural Networks’ Configurations in the Predictive Maintenance Problems. IOP Conf. Series Mater. Sci.
Eng. 2020, 714, 012005. [CrossRef]
48. Hu, X.; Ban, Y.; Nascetti, A. Uni-Temporal Multispectral Imagery for Burned Area Mapping with Deep Learning. Remote Sens.
2021, 13, 1509. [CrossRef]
49. Jin, H.; Song, Q.; Hu, X. Auto-keras: An efficient neural architecture search system. In Proceedings of the 25th Acm Sigkdd
International Conference on Knowledge Discovery & Data Mining, New York, NY, USA, 4–8 August 2019.
50. Singh, D.; Pant, P.K.; Pant, H.; Dobhal, D.C. Robust Automated Machine Learning (AutoML) System for Early Stage Hepatic
Disease Detection. In Intelligent Data Communication Technologies and Internet of Things: Proceedings of ICICI 2020, Coimbatore, India,
27–28 August 2020; Springer: Berlin/Heidelberg, Germany, 2021.
51. Boudiaf, A.; Moussaoui, A.; Dahane, A.; Atoui, I. A Comparative Study of Various Methods of Bearing Faults Diagnosis Using
the Case Western Reserve University Data. J. Fail. Anal. Prev. 2016, 16, 271–284. [CrossRef]
52. Zhang, S.; Zhang, S.; Wang, B.; Thomas, G. Habetler Deep learning algorithms for bearing fault diagnostics—A comprehensive
review. IEEE Access 2020, 8, 29857–29881. [CrossRef]
53. Li, S.-Y.; Gu, K.-R. Smart Fault-Detection Machine for Ball-Bearing System with Chaotic Mapping Strategy. Sensors 2019, 19, 2178.
[CrossRef] [PubMed]
54. Saidi, L.; Ben Ali, J.; Fnaiech, F. Application of higher order spectral features and support vector machines for bearing faults
classification. ISA Trans. 2015, 54, 193–206. [CrossRef]
55. Liu, C.; Tan, J.; Huang, Z. Fault Diagnosis of Rolling Element Bearings Based on Adaptive Mode Extraction. Machines 2022,
10, 260. [CrossRef]
56. Neupane, D.; Seok, J. Bearing Fault Detection and Diagnosis Using Case Western Reserve University Dataset with Deep Learning
Approaches: A Review. IEEE Access 2020, 8, 93155–93178. [CrossRef]
57. Xu, Y.; Li, Z.; Wang, S.; Li, W.; Sarkodie-Gyan, T.; Feng, S. A Hybrid Deep-Learning Model for Fault Diagnosis of Rolling Bearings.
Measurement 2021, 169, 108502. [CrossRef]
58. Simeone, O. A Brief Introduction to Machine Learning for Engineers. Found. Trends®Signal Process. 2018, 12, 200–431. [CrossRef]
59. Raschka, S. Python Machine Learning; Packt Publishing Ltd.: Birmingham, UK, 2015.

Disclaimer/Publisher’s Note: The statements, opinions and data contained in all publications are solely those of the individual
author(s) and contributor(s) and not of MDPI and/or the editor(s). MDPI and/or the editor(s) disclaim responsibility for any injury to
people or property resulting from any ideas, methods, instructions or products referred to in the content.

A Systematic Literature Review of Machine Learning Methods Applied To Predictive Maintenance
No ratings yet
A Systematic Literature Review of Machine Learning Methods Applied To Predictive Maintenance
16 pages
Creation of A Machine Learning Model For The Predictive Maintenance of An Engine Equipped With A Rotating Shaft
No ratings yet
Creation of A Machine Learning Model For The Predictive Maintenance of An Engine Equipped With A Rotating Shaft
89 pages
Bai 77
No ratings yet
Bai 77
11 pages
AI EnhancedPredictiveMaintenanceSystemsforIndustrialEquipment
No ratings yet
AI EnhancedPredictiveMaintenanceSystemsforIndustrialEquipment
43 pages
AI-Powered Maintenance in The Manufacturing Industry - A Comprehensive Reviewer's Synthesis
No ratings yet
AI-Powered Maintenance in The Manufacturing Industry - A Comprehensive Reviewer's Synthesis
4 pages
2024.A Comprehensive Review of Machine Learning Techniques For Condition-Based Maintenance
No ratings yet
2024.A Comprehensive Review of Machine Learning Techniques For Condition-Based Maintenance
20 pages
Development of A Predictive Maintenance Algorithm For A Diesel Generator Using Machine Learning
No ratings yet
Development of A Predictive Maintenance Algorithm For A Diesel Generator Using Machine Learning
11 pages
Predicting Machine Failures Using Machine Learning and Deep Learning Algorithms
No ratings yet
Predicting Machine Failures Using Machine Learning and Deep Learning Algorithms
11 pages
Electronics 13 00438
No ratings yet
Electronics 13 00438
20 pages
Automobile Predictive Maintenance Using
No ratings yet
Automobile Predictive Maintenance Using
12 pages
Sensors 24 03215
No ratings yet
Sensors 24 03215
25 pages
Deep Learning Models For Predictive Maintenance: A Survey, Comparison, Challenges and Prospects
No ratings yet
Deep Learning Models For Predictive Maintenance: A Survey, Comparison, Challenges and Prospects
31 pages
Machine Learning Models For Predictive Maintenance Report Aditya Tiwari
No ratings yet
Machine Learning Models For Predictive Maintenance Report Aditya Tiwari
37 pages
From Knowledge Based To Big Data Analyti
No ratings yet
From Knowledge Based To Big Data Analyti
15 pages
Machine Learning Based Fault-Oriented Predictive Maintenance in Industry 4.0
No ratings yet
Machine Learning Based Fault-Oriented Predictive Maintenance in Industry 4.0
13 pages
Centrifugal Pump Fault Diagnosis Using A Predictive Maintenance Model
No ratings yet
Centrifugal Pump Fault Diagnosis Using A Predictive Maintenance Model
12 pages
Predictive Maintenance Enabled by Machine Learning - Use Cases and
100% (1)
Predictive Maintenance Enabled by Machine Learning - Use Cases and
21 pages
1 s2.0 S1877050923006178 Main
No ratings yet
1 s2.0 S1877050923006178 Main
6 pages
1 s2.0 S0952197624004986 Main
No ratings yet
1 s2.0 S0952197624004986 Main
15 pages
LSTM 2
No ratings yet
LSTM 2
14 pages
Reinforcement Learning For Predictive Maintenance: A Systematic Technical Review
No ratings yet
Reinforcement Learning For Predictive Maintenance: A Systematic Technical Review
63 pages
JETIR2104196
No ratings yet
JETIR2104196
5 pages
An Overview of The Application of Machin
No ratings yet
An Overview of The Application of Machin
15 pages
Research Paper 1
No ratings yet
Research Paper 1
12 pages
From Knowledge-Based To Big Data Analytic Model
No ratings yet
From Knowledge-Based To Big Data Analytic Model
16 pages
IJRPR26093
No ratings yet
IJRPR26093
6 pages
PDM FSA Predictive Maintenance Framework
No ratings yet
PDM FSA Predictive Maintenance Framework
13 pages
B.tech Project
No ratings yet
B.tech Project
14 pages
Predictive Maintenance For Industrial Equipments Using ML & DL
No ratings yet
Predictive Maintenance For Industrial Equipments Using ML & DL
6 pages
RP
No ratings yet
RP
8 pages
Machine Learning Approach Using MLP and SVM Algori
No ratings yet
Machine Learning Approach Using MLP and SVM Algori
15 pages
Assignmnet 4
No ratings yet
Assignmnet 4
8 pages
Literature Review
No ratings yet
Literature Review
5 pages
Maintenance - fINAL
No ratings yet
Maintenance - fINAL
6 pages
Nishhh Technical - Seminar - PPT
No ratings yet
Nishhh Technical - Seminar - PPT
18 pages
2020 - Application of Predictive Maintenance Concepts Using
No ratings yet
2020 - Application of Predictive Maintenance Concepts Using
18 pages
Machine Learning For Predictive Maintenance Applic
No ratings yet
Machine Learning For Predictive Maintenance Applic
11 pages
Predictive Machine Maintenance Using Deep Learning
No ratings yet
Predictive Machine Maintenance Using Deep Learning
6 pages
Using Artificial Intelligence (AI) For Monitoring and Diagnosing Electric Motor Faults Based On Vibration Signals
No ratings yet
Using Artificial Intelligence (AI) For Monitoring and Diagnosing Electric Motor Faults Based On Vibration Signals
6 pages
2020 - Machine Learning Approach To Predictive
No ratings yet
2020 - Machine Learning Approach To Predictive
10 pages
Iot MM 1
No ratings yet
Iot MM 1
22 pages
Marc 1 1 22110
No ratings yet
Marc 1 1 22110
8 pages
Machine Learning For Predictive Maintenance in Industrial Iot
No ratings yet
Machine Learning For Predictive Maintenance in Industrial Iot
12 pages
Applsci 13 04971 With Cover
No ratings yet
Applsci 13 04971 With Cover
17 pages
IJSDR2305088
No ratings yet
IJSDR2305088
4 pages
E3sconf Icies2022 01036
No ratings yet
E3sconf Icies2022 01036
5 pages
Predictive Maintenance Using Machine Learning
No ratings yet
Predictive Maintenance Using Machine Learning
3 pages
AIoT 4 Zheng TechnicalPaper KDD2020
No ratings yet
AIoT 4 Zheng TechnicalPaper KDD2020
6 pages
Mca 27 00002
No ratings yet
Mca 27 00002
21 pages
Face Mask Detection Project
0% (1)
Face Mask Detection Project
57 pages
Machine Learning in Predictive Maintenance Towards Sustainable Smart Manufacturing in Industry 4.0
No ratings yet
Machine Learning in Predictive Maintenance Towards Sustainable Smart Manufacturing in Industry 4.0
42 pages
Application of Predictive Maintenance in Manufacturing With The
No ratings yet
Application of Predictive Maintenance in Manufacturing With The
5 pages
Predictive Maintenance Unleashing Applications of Machine Learning A Comprehensive Exploration
No ratings yet
Predictive Maintenance Unleashing Applications of Machine Learning A Comprehensive Exploration
5 pages
Applied Sciences
No ratings yet
Applied Sciences
34 pages
Industrial Internet of Things-Based Predictive Maintenance For Equipment Protection
100% (1)
Industrial Internet of Things-Based Predictive Maintenance For Equipment Protection
6 pages
IJCRT2205280
No ratings yet
IJCRT2205280
4 pages
Literatura de Machine Learning PDF
No ratings yet
Literatura de Machine Learning PDF
10 pages
Atlas Medical Ultrasonografie
No ratings yet
Atlas Medical Ultrasonografie
130 pages
Predictive Maintenance in Manufacturing With AI and Data Science
No ratings yet
Predictive Maintenance in Manufacturing With AI and Data Science
3 pages
Diabetic Retinopathy Project
No ratings yet
Diabetic Retinopathy Project
23 pages
Machine Learning in Geoscience
No ratings yet
Machine Learning in Geoscience
22 pages
Machine Learning Approach For Predictive Maintenance in Industry 4.0
No ratings yet
Machine Learning Approach For Predictive Maintenance in Industry 4.0
6 pages
Free Download Data Science Curriculum - Innomatics Research Labs Hyderabad, India
No ratings yet
Free Download Data Science Curriculum - Innomatics Research Labs Hyderabad, India
14 pages
Complete Deep Learning Interview Question
No ratings yet
Complete Deep Learning Interview Question
46 pages
Cat and Dog Classification Using CNN: Project Objective
No ratings yet
Cat and Dog Classification Using CNN: Project Objective
7 pages
Deep Reinforcement Learning in Computer Vision: A Comprehensive Survey
No ratings yet
Deep Reinforcement Learning in Computer Vision: A Comprehensive Survey
103 pages
Wa0001.
No ratings yet
Wa0001.
82 pages
Agronomy 12 00365 v2
No ratings yet
Agronomy 12 00365 v2
14 pages
Data Science: & Artificial Intelligence
No ratings yet
Data Science: & Artificial Intelligence
38 pages
ISEAIA 2017 - 5th Symposium On Engineering, Artificial Intelligence and Applications
No ratings yet
ISEAIA 2017 - 5th Symposium On Engineering, Artificial Intelligence and Applications
318 pages
Ai Fitness Trainer Using Human Pose Estimation IJERTCONV11IS08017
No ratings yet
Ai Fitness Trainer Using Human Pose Estimation IJERTCONV11IS08017
8 pages
ALBUNIA Profile 2021 Min
No ratings yet
ALBUNIA Profile 2021 Min
10 pages
Botanical - Final Paper
No ratings yet
Botanical - Final Paper
5 pages
Ocean Engineering
No ratings yet
Ocean Engineering
21 pages
Energies 16 03491
No ratings yet
Energies 16 03491
17 pages
Actuators 13 00034 v2
No ratings yet
Actuators 13 00034 v2
13 pages
S P Q C N N A: Ubspace Reserving Uantum Onvolutional Eural Etwork Rchitectures
No ratings yet
S P Q C N N A: Ubspace Reserving Uantum Onvolutional Eural Etwork Rchitectures
13 pages
CS 231N Midterm Review
No ratings yet
CS 231N Midterm Review
30 pages
Precise Detection of Diabetic Retinopathy Using Adaptive Remora Optimization Algorithm With Deep Adversarial Approach
No ratings yet
Precise Detection of Diabetic Retinopathy Using Adaptive Remora Optimization Algorithm With Deep Adversarial Approach
31 pages
Deep Learning For Fractographic Classification in Metallic Materials Copy
No ratings yet
Deep Learning For Fractographic Classification in Metallic Materials Copy
24 pages
6 Lecture CNN
No ratings yet
6 Lecture CNN
45 pages
Gray Scale Image Captioning Using CNN and LSTM
No ratings yet
Gray Scale Image Captioning Using CNN and LSTM
8 pages
Deeprx: Fully Convolutional Deep Learning Receiver: Mikko Honkala, Dani Korpi, and Janne M.J. Huttunen
No ratings yet
Deeprx: Fully Convolutional Deep Learning Receiver: Mikko Honkala, Dani Korpi, and Janne M.J. Huttunen
32 pages
Design of Non-Linear Reduced and Full Order Observ
No ratings yet
Design of Non-Linear Reduced and Full Order Observ
9 pages
Plant Leaf Disease Recognition Using Random Forest KNN SVM and CNN
No ratings yet
Plant Leaf Disease Recognition Using Random Forest KNN SVM and CNN
7 pages
A Lightweight Object Grasping Network Using GhostNet
No ratings yet
A Lightweight Object Grasping Network Using GhostNet
10 pages
Swapandeep Kaur 1910941059 Research Paper 8
No ratings yet
Swapandeep Kaur 1910941059 Research Paper 8
28 pages
Yang Foreground-Background Distribution Modeling Transformer For Visual Object Tracking ICCV 2023 Paper
No ratings yet
Yang Foreground-Background Distribution Modeling Transformer For Visual Object Tracking ICCV 2023 Paper
11 pages
Structural Crack Detection From Benchmark Data Sets Using Pruned Fully Convolutional Networks
No ratings yet
Structural Crack Detection From Benchmark Data Sets Using Pruned Fully Convolutional Networks
14 pages
DMFR 20220335
No ratings yet
DMFR 20220335
22 pages
A Transfer Convolutional Neural Network For Fault Diagnosis Based On Resnet-50
No ratings yet
A Transfer Convolutional Neural Network For Fault Diagnosis Based On Resnet-50
14 pages
P&ID 19.903 Grinding 00
No ratings yet
P&ID 19.903 Grinding 00
1 page
A Security Risk Model For Online Banking System: Anshita Dhoot A. N. Nazarov Alireza Nik Aein Koupaei
No ratings yet
A Security Risk Model For Online Banking System: Anshita Dhoot A. N. Nazarov Alireza Nik Aein Koupaei
4 pages
Mastering Embedded C: The Ultimate Guide to Building Efficient Systems
From Everand
Mastering Embedded C: The Ultimate Guide to Building Efficient Systems
Robert Johnson
No ratings yet
Embedded Systems Programming with C++: Real-World Techniques
From Everand
Embedded Systems Programming with C++: Real-World Techniques
Robert Johnson
No ratings yet

Note: This service is not intended for secure transactions such as banking, social media, email, or purchasing. Use at your own risk. We assume no liability whatsoever for broken pages.

Processes 11 01507 v2

Uploaded by

Processes 11 01507 v2

Uploaded by

processes

Received: 27 March 2023

Processes 2023, 11, 1507. https://doi.org/10.3390/pr11051507 https://www.mdpi.com/journal/processes

augmented convolution network (CaConvNet) is proposed for long-sequence time-series

Figure 1. General proposed methodology flowchart.

Figure 2. Detailed overview of the modeling phase.

4.1. Data Preprocessing

Algorithm 1 Data Preprocessing.

4.2. AutoML (PyCaret) Model

4.3. AutoDNN (AutoKeras) Model

Algorithm 2 Best Model Selection.

Figure 3. REB: exploded and geometric view [54].

As illustrated in Figure 4, the test system comprises a 2 hp motor, a torque trans-

Figure 4. CWRU bearing system [55].

Table 1. Fault conditions in the CWRU bearing dataset.

Fault Type Severity Description Abbreviation

6. Results and Analysis

6.1. Data Preparation Process

Table 2. Preprocessing operations applied on the fault classification dataset.

Preprocessing Operation Details

6.2. AutoML and AutoDNN Validation Models

Table 3. AutoML (PyCaret) model configuration summary.

Accuracy = (TP + TN)/(TP + TN + FP + FN) (1)

Recall = TP/(TP + FN) (2)

Precision measures the model’s ability in identifying positive cases correctly. It is

Prec.= TP/(TP + FP) (3)

Model Accuracy (%) Recall (%) Prec. (%) F1 (%) TT (S)

Table 5. Performance metrics of auto-tuned RF algorithm.

Fold No. Accuracy (%) Recall (%) Prec. (%) F1 (%)

The tuned RF model showed a slight improvement in performance with accuracy,

Figure 5. Performance of proposed AutoDNN model on testing sets.

The architecture of the AutoDNN model generated by AutoKeras is illustrated in

Figure 6. Proposed AutoDNN model architecture generated by the AutoKeras model.

6.3. Model Evaluation

Figure 7. Feature importance generated by Pycaret’s evaluate model function.

The performance of the proposed AutoDNN model, as presented in Figure 8, demon-

Figure 8. Performance of proposed AutoDNN model on validation sets.

6.4. Results Discussion

Table 6 presents a summary of the performance of various ML models employed for

Table 6. Summary of results comparison with recent relevant studies.

Author Method Accuracy (%) Number of Features

You might also like

Pfad - The Proxy pFad of © 2024 Garber Painting. All rights reserved.