Applied Mathematics To Mechanisms and Machines
Edited by
Higinio Rubio Alonso, Alejandro Bustos Caballero,
Jesus Meneses Alonso and Enrique Soriano-Heras
Printed Edition of the Special Issue Published in Mathematics
www.mdpi.com/journal/mathematics
MDPI • Basel • Beijing • Wuhan • Barcelona • Belgrade • Manchester • Tokyo • Cluj • Tianjin
Editors
Higinio Rubio Alonso, Department of Mechanical Engineering, Universidad Carlos III de Madrid, Leganés, Spain
Alejandro Bustos Caballero, Department of Mechanics, National University of Distance Education, Madrid, Spain
Jesus Meneses Alonso, Department of Mechanical Engineering, Universidad Carlos III de Madrid, Leganés, Spain
Enrique Soriano-Heras, Department of Mechanical Engineering, Universidad Carlos III de Madrid, Leganés, Spain
Editorial Office
MDPI
St. Alban-Anlage 66
4052 Basel, Switzerland
This is a reprint of articles from the Special Issue published online in the open access journal
Mathematics (ISSN 2227-7390) (available at: https://www.mdpi.com/journal/mathematics/special
issues/Applied Mathematics to Mechanisms Machines).
For citation purposes, cite each article independently as indicated on the article page online and as
indicated below:
LastName, A.A.; LastName, B.B.; LastName, C.C. Article Title. Journal Name Year, Volume Number,
Page Range.
© 2022 by the authors. Articles in this book are Open Access and distributed under the Creative
Commons Attribution (CC BY) license, which allows users to download, copy and build upon
published articles, as long as the author and publisher are properly credited, which ensures maximum
dissemination and a wider impact of our publications.
The book as a whole is distributed by MDPI under the terms and conditions of the Creative Commons
license CC BY-NC-ND.
Contents
Oscar Altuzarra, David Manuel Solanillas, Enrique Amezua and Victor Petuya
Path Analysis for Hybrid Rigid–Flexible Mechanisms
Reprinted from: Mathematics 2021, 9, 1869, doi:10.3390/math9161869
Minh Tuan Pham, Song Huat Yeo and Tat Joo Teo
Three-Legged Compliant Parallel Mechanisms: Fundamental Design Criteria to Achieve Fully Decoupled Motion Characteristics and a State-of-the-Art Review
Reprinted from: Mathematics 2022, 10, 1414, doi:10.3390/math10091414
Jesús Alan Calderón Chavarri, Carlos Gianpaul Rincón Ruiz, Ana María Gómez Amador, Bray Jesús Martin Agreda Cardenas, Sebastián Calero Anaya, John Hugo Lozano Jauregui, Alexandr Toribio Hinostroza and Juan José Jiménez de Cisneros y Fonfría
Mathematical Analysis of a Low Cost Mechanical Ventilator Respiratory Dynamics Enhanced by a Sensor Transducer (ST) Based in Nanostructures of Anodic Aluminium Oxide (AAO)
Reprinted from: Mathematics 2022, 10, 2403, doi:10.3390/math10142403
Emilio Lechosa Urquijo, Fernando Blaya Haro, Juan David Cano-Moreno, Roberto D’Amato and Juan Antonio Juanes Méndez
Mechanical Model and FEM Simulations for Efforts on Biceps and Triceps Muscles under Vertical Load: Mathematical Formulation of Results
Reprinted from: Mathematics 2022, 10, 2441, doi:10.3390/math10142441
Leopoldo Prieto, Alejandro Quesada, Ana María Gómez Amador and Vicente Díaz
The General Dispersion Relation for the Vibration Modes of Helical Springs
Reprinted from: Mathematics 2022, 10, 2698, doi:10.3390/math10152698
Ruben Medina, Mariela Cerrada, Shuai Yang, Diego Cabrera, Edgar Estupiñan and René-Vinicio Sánchez
Fault Classification in a Reciprocating Compressor and a Centrifugal Pump Using Non-Linear Entropy Features
Reprinted from: Mathematics 2022, 10, 3033, doi:10.3390/math10173033
Ester Olmeda, Enrique Roberto Carrillo Li, Jorge Rodríguez Hernández and Vicente Díaz
Lateral Dynamic Simulation of a Bus under Variable Conditions of Camber and Curvature Radius
Reprinted from: Mathematics 2022, 10, 3081, doi:10.3390/math10173081
Preface to “Applied Mathematics to Mechanisms and Machines”
The science of mechanisms and machines has relied on mathematics from the very moment
of its birth. The much-celebrated Renaissance figure Leonardo da Vinci already pointed out how
necessary mathematics is for the advancement of science, stating that “No human investigation can
be called real science if it cannot be demonstrated mathematically”. In fact, the first steps in the
development of mechanisms occurred hand in hand with geometry. At first, graphic methods, of
undoubted pedagogical value, provided results for the motion of a mechanism only at a given instant,
so determining its kinematics and dynamics along the whole trajectory was an arduous task;
analytical methods, in turn, were limited to solving simple cases. With the advent and
development of computing, numerical methods have made their way into all areas of knowledge
and technology. In recent decades, these methods have made possible a rapid development of
the analysis, optimization and synthesis of mechanical systems, mechanisms and machines in the
academic, scientific and industrial fields.
The 16 chapters of this book (all of them articles accepted and published in the Special Issue
“Applied Mathematics to Mechanisms and Machines” of MDPI’s Mathematics journal) cover a
wide variety of topics, including the synthesis, design and optimization of mechanisms, robotics,
automotive, Maintenance 4.0, machine vibrations, control, biomechanics and medical devices, among
others, that combine mechanisms and machine science with mathematics in synergy.
The Guest Editors of the Special Issue are very grateful to the authors of these works for their
high quality contributions, as well as to their reviewers for the sharpness and insightfulness of their
comments. We also want to sincerely thank the administrative staff of MDPI publications for their
invaluable support throughout the process.
Higinio Rubio Alonso, Alejandro Bustos Caballero, Jesus Meneses Alonso, and Enrique
Soriano-Heras
Editors
mathematics
Article
Hybrid Optimization Based Mathematical Procedure for
Dimensional Synthesis of Slider-Crank Linkage
Alfonso Hernández 1 , Aitor Muñoyerro 2 , Mónica Urízar 1, * and Enrique Amezua 1
1 Faculty of Engineering in Bilbao, University of the Basque Country (UPV/EHU), Plaza Ingeniero Torres
Quevedo, 48013 Bilbao, Spain; a.hernandez@ehu.es (A.H.); enrique.amezua@ehu.es (E.A.)
2 SENER Aeroespacial, Avda. de Zugazarte 56, 48992 Getxo, Spain; aitor.munoyerro@aeroespacial.sener (A.M.)
* Correspondence: monica.urizar@ehu.es (M.U.)
Abstract: In this paper, an optimization procedure for path generation synthesis of the slider-crank
mechanism will be presented. The proposed approach is based on a hybrid strategy, mixing local and
global optimization techniques. Regarding the local optimization scheme, based on the null gradient
condition, a novel methodology to solve the resulting non-linear equations is developed. The solving
procedure consists of decoupling two subsystems of equations which can be solved separately and
following an iterative process. In relation to the global technique, a multi-start method based on a
genetic algorithm is implemented. The fitness function incorporated in the genetic algorithm will take
as arguments the set of dimensional parameters of the slider-crank mechanism. Several illustrative
examples will prove the validity of the proposed optimization methodology, in some cases achieving
an even better result compared to mechanisms with a higher number of dimensional parameters,
such as the four-bar mechanism or Watt’s mechanism.
developed by using exact differentiation that allows for establishing the position of the
instant center of rotation and the centrode.
A frequently employed dimensional optimization procedure consists of minimizing an
error function, formulated as the sum of squared differences between the points of the
discretized prescribed path and those of the path actually generated. The minimization
can be carried out with different methods, which fall into two main groups, local and
global, both discussed below. On the
other hand, there are also publications focused not so much on studying the mathematical
optimization techniques, but on proposing new ways of describing the output generated
by the mechanism. This may result in a more advantageous definition for the optimization
error function or for the elaboration of atlases and databases. For the path generation
problems discussed in this paper, there are different approaches to describing the
trajectories; probably the most typical ones are based on Fourier series [16] or the Haar
wavelet transform [17]. Similarly, reference [18] describes a unified theory of the harmonic
characteristic parameter method for mechanism synthesis. Apart from the dimensional
synthesis, other publications focus on the phase that precedes it, i.e., structural synthesis,
the first step in the conceptual design of mechanisms. In this sense, there can be found
some proposals of automatic algorithms intended for the structural synthesis of robots and
closed-loop mechanisms [19].
In relation to the mathematical optimization techniques for dimensional synthesis,
the most effective and widely used local methods consist of applying the null gradient
condition, which leads to a non-linear system of equations. This system often includes
some passive variables that cannot be eliminated. To solve it, the function is linearized and
an iterative method, such as Gauss–Newton, is used, starting from an approximate initial
solution provided by the designer. The original reference for this type of method is a paper
published in 1966 by Chi-Yeh [20], which was dedicated to the four-bar linkage. From then
on, several papers related to dimensional synthesis of this mechanism by means of gradient
methods have been published, exploring different ways to improve the effectiveness
of optimization. As part of these alternative approaches, in [21] the authors proposed
modifying the set of variables to be optimized, considering the nodal coordinates instead
of the usual dimensional parameters, thereby allowing the elimination of some constraints
that were present in the original problem. It is also noteworthy that other publications
focus on reformulating the error function, such as [22,23]. The authors of those works
proposed to minimize the strain energy originated when the mechanism is forced to run
exactly through the prescribed trajectory. On the other hand, some authors have chosen to
estimate the error by avoiding its sensitivity to translation and rotation effects, such as [24],
where a system of relative coordinates between precision points is used. Following the
same idea, the authors of [25] proposed performing a prior, independent phase to optimize the
translation, rotation, and scaling parameters. In addition to the different alternatives
for characterizing the design parameters of the mechanism and estimating the resulting error,
a relevant aspect of achieving good performance in gradient methods is to calculate the
partial derivatives exactly, avoiding numerical differentiation, which increases
the computational cost and lowers efficiency. Given the interest in solving
this problem, reference [26] presents a general method for calculating the exact partial
derivatives from the loop equations previously identified by the designer.
Despite the large number of publications devoted to optimal dimensional
synthesis by means of gradient methods, and to the improvement of their performance,
none of them overcomes the main limitation of these methods: they are highly sensitive
to the chosen starting approximation, since they are local in nature and hence converge
to the nearest minimum, which will not necessarily be the global optimum.
To overcome this drawback, global methods make it possible to
explore the entire space where solutions can be found. Metaheuristic methods are the most
common ones, and they have been covered in several references. These include genetic
algorithms [27–29], differential evolution [30–32], ant search [33], krill herd algorithm [34],
Mathematics 2021, 9, 1581
imperialist competitive algorithm [35], or neural networks [36]. Nevertheless, the weakness
of heuristic methods in comparison with gradient methods is their higher computational
cost and lower convergence rate. Furthermore, there is no guarantee that they will
converge to a minimum, either local or global.
Hybrid optimization algorithms, such as [37], gain greater strength when a global
method generating seeds for starting mechanisms is combined with a good local method.
Normally, hybrid methods start by running a global method to obtain one or several designs
that will later be used as initial approximations in a local method to quickly converge to
the nearest relative minimum. In this paper, as described below, a hybrid optimization
approach is proposed.
In the case of the slider-crank mechanism analyzed in this paper, thanks to the
simplicity of its kinematics, it is possible to express the synthesis variables directly as
a function of the dimensional and input parameters, thus completely eliminating
the passive variables. A novel aspect of the proposed approach is the way in which the
resulting system of equations is solved. Considering the more general case of unprescribed
timing synthesis, the final system of equations associated with the null gradient condition
can be divided into two subsystems with different characteristics. As will be explained,
the procedure described in this article allows each subsystem to be solved separately
within an iterative process that connects them together. This makes it possible, in some
particularly simple cases, such as the two-parameter slider-crank, to solve all the equations
analytically, while in more complex cases, numerical methods must necessarily be adopted.
The optimum solution reached will be a relative minimum of the error function and will be
influenced by the initial approximation used to solve the system of equations numerically.
This paper attempts to reduce this dependency by running the local optimization
algorithm from different starting points previously selected by a genetic algorithm. The
starting points could also be generated by another type of heuristic method,
or through a sweeping process that generates random points within the entire search
space. However, the latter technique would not concentrate on the most promising regions as the
genetic algorithm does; it would only generate a wide grid of different starting points.
Therefore, the genetic algorithm is the preferred choice.
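The multi-start idea described above can be sketched in a few lines of Python. This is only an illustration under stated assumptions, not the authors' implementation: uniformly random seeds stand in for the genetic algorithm, a derivative-free coordinate search stands in for the null-gradient solver, and `toy_error`, `local_search` and `multi_start` are hypothetical names introduced here.

```python
import random


def toy_error(p):
    """Hypothetical error surface with two valleys (near x = -2 and x = +2)."""
    x, y = p
    return (x * x - 4.0) ** 2 + (y - 1.0) ** 2 + 0.5 * x


def local_search(f, start, step=0.5, tol=1e-8, max_iter=10_000):
    """Coordinate search (stand-in local method): ends in a nearby local minimum."""
    point, best = list(start), f(start)
    for _ in range(max_iter):
        improved = False
        for k in range(len(point)):
            for delta in (step, -step):
                trial = list(point)
                trial[k] += delta
                value = f(trial)
                if value < best:
                    point, best, improved = trial, value, True
        if not improved:
            step *= 0.5  # refine the step once no neighbour is better
            if step < tol:
                break
    return point, best


def multi_start(f, bounds, n_starts=20, seed=0):
    """Run the local search from several random seeds and keep the best result."""
    rng = random.Random(seed)
    results = []
    for _ in range(n_starts):
        start = [rng.uniform(lo, hi) for lo, hi in bounds]
        results.append(local_search(f, start))
    return min(results, key=lambda r: r[1])


best_point, best_value = multi_start(toy_error, [(-3.0, 3.0), (-3.0, 3.0)])
```

A single local run started in the right-hand valley would stop at the poorer minimum near x = 2. Repeating the run from many seeds is what lets the procedure reach the better valley, which is the role the paper assigns to the genetic algorithm.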
It is important to highlight that the procedure described in this paper seeks to obtain
reliable solutions, not only from a mathematical standpoint, but also from a practical point of
view. For this reason, the paper explains how to avoid the circuit defect in the slider-crank
mechanism, relying on the concept of branch index. This concept was introduced in
reference [21] to analyze the kinematics of the four-bar hinged linkage. The incorporation
of design constraints will be addressed by means of penalty functions included in the error
function to be minimized. This is essential to impose the Grashof criterion and thus ensure
that the input element is able to fully rotate (crank input).
Finally, the effectiveness of the proposed methodology will be illustrated through
different examples. The final solutions obtained in this paper are as accurate as the ones
reached in other papers when solving the same problem by using more complex designs,
such as four-bar or Watt linkages.
The main novelties and highlights of this paper are:
• Deduction of the equations required for the optimal dimensional synthesis of the slider-
crank mechanism, which constitutes an alternative to the hinged four-bar linkage
usually used in the literature to solve this type of problem.
• Proposal of an original methodology to solve the non-linear system of equations
resulting from the null gradient condition, based on the decoupling of two subsystems
of equations. It facilitates the resolution of the system and, in some cases, allows
all the solutions to be obtained analytically.
• Integration of the local optimization methodology within a hybrid optimization
method, which uses a genetic algorithm to search for the best starting approximations.
The fitness function has been adapted to solve not only the prescribed timing problem,
but also unprescribed timing.
$$\sin\theta = \frac{a_5 - a_1 \sin\varphi}{a_2} \tag{3}$$
Using the loop equations to solve the passive variables s (Equation (2)) and θ
(Equations (3) and (4)), and substituting them in Equations (5) and (6), the following
expressions are obtained for the synthesis variables, referring to the local system O’X’Y’:
$$x = a_1 \cos\varphi + a_3 K \sqrt{1 - \left(\frac{a_5 - a_1 \sin\varphi}{a_2}\right)^{2}} - \frac{a_4 \, (a_5 - a_1 \sin\varphi)}{a_2} \tag{7}$$
$$y = \frac{(a_3 - a_2)(a_5 - a_1 \sin\varphi)}{a_2} + a_5 + a_4 K \sqrt{1 - \left(\frac{a_5 - a_1 \sin\varphi}{a_2}\right)^{2}} \tag{8}$$
The two possible trajectories of point P associated with the different configurations
of the coupler element, commonly known as branches, can be connected or unconnected,
resulting in a unique circuit (a unicursal curve), or two circuits (a bicursal curve). In this
work, only designs where the crank input is able to perform a full 360° rotation are considered,
meaning that the Grashof criterion must be fulfilled. Therefore, the two possible branches
will be two unconnected circuits. To avoid branch defects, all the selected points must have
the same value of K. This value will be the one corresponding to the branch that yields a
minimum error with respect to the desired path.
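The role of the branch index K and of the full-rotation requirement can be illustrated with a short Python sketch. It assumes, as the square-root terms in Eqs. (7) and (8) suggest, that cos θ = K·sqrt(1 − sin²θ) with sin θ given by Eq. (3). The function names are hypothetical, and the rotation test is simply the condition that Eq. (3) yields |sin θ| ≤ 1 for every crank angle, that is, |a1| + |a5| ≤ |a2|.

```python
import math


def passive_angle_terms(a1, a2, a5, phi, K):
    """Return (sin_theta, cos_theta) from Eq. (3) for branch index K = +1 or -1.

    Raises ValueError if the mechanism cannot be assembled at this crank angle.
    """
    if K not in (1, -1):
        raise ValueError("branch index K must be +1 or -1")
    sin_theta = (a5 - a1 * math.sin(phi)) / a2
    radicand = 1.0 - sin_theta ** 2
    if radicand < 0.0:
        raise ValueError("no assembly: |sin(theta)| > 1")
    # The same K must be used for every synthesis point to stay on one branch.
    return sin_theta, K * math.sqrt(radicand)


def crank_can_fully_rotate(a1, a2, a5):
    """True if Eq. (3) gives |sin(theta)| <= 1 for every crank angle phi."""
    return abs(a1) + abs(a5) <= abs(a2)
```

For instance, the Example 1 optimum reported in Table 2 (a1 = 1.292, a2 = 3.277, a5 = 1.970) satisfies this condition, since 1.292 + 1.970 = 3.262 ≤ 3.277.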
In the most general case, represented in Figure 3, the local reference system O’X’Y’ has
a rotation relative to the global system OXY, defined by the parameter a6 , and a translation
in the plane defined by the parameters a7 and a8 .
The equations that express the synthesis variables in the global reference system are
the following:
$$x_i = x'_i \cos(a_6) - y'_i \sin(a_6) + a_8 \tag{9}$$
$$y_i = x'_i \sin(a_6) + y'_i \cos(a_6) + a_7 \tag{10}$$
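The change of reference system in Eqs. (9) and (10) is a planar rotation followed by a translation. A minimal Python sketch, with a hypothetical function name and the translation parameters assigned exactly as printed in the two equations (a8 in the x equation, a7 in the y equation), could read:

```python
import math


def local_to_global(x_local, y_local, a6, a7, a8):
    """Rotate local coordinates by a6 and translate, following Eqs. (9) and (10) as printed."""
    x_global = x_local * math.cos(a6) - y_local * math.sin(a6) + a8
    y_global = x_local * math.sin(a6) + y_local * math.cos(a6) + a7
    return x_global, y_global


# A point on the local X' axis, rotated by 90 degrees and then shifted.
xg, yg = local_to_global(1.0, 0.0, math.pi / 2, 2.0, 3.0)
```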
Now that all the variables involved in the synthesis problem have been defined, the
following classification can be established:
• Dimensional variables: a1 , a2 , . . . , a8 . These are variables that define the lengths of
the bars and the translation or rotation parameters of the studied mechanism.
• Input variable: ϕ. This is an independent variable corresponding to the degree of
freedom of the mechanism under study.
• Passive variables: θ, s. These are not independent variables, but rather depend on the
input and the dimensional parameters.
• Output variables or synthesis variables: x, y. These correspond to the coordinates
of the coupler point P. In the case of path generation synthesis, these are indeed the
synthesis variables.
$$E = \sum_{i=1}^{N} \left[ \left(x_i - x_i^d\right)^2 + \left(y_i - y_i^d\right)^2 \right] \tag{11}$$
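As an illustration of Eq. (11), the error is the sum of squared distances between each generated point and its prescribed counterpart. The following Python sketch uses a hypothetical function name and represents both paths as lists of (x, y) pairs:

```python
def path_error(generated, prescribed):
    """Sum of squared distances between generated and prescribed points, Eq. (11)."""
    if len(generated) != len(prescribed):
        raise ValueError("both paths must contain the same number of points")
    return sum(
        (x - xd) ** 2 + (y - yd) ** 2
        for (x, y), (xd, yd) in zip(generated, prescribed)
    )


# Two synthesis positions: the first misses by 1 in y, the second by 2 in y.
example_error = path_error([(0.0, 0.0), (1.0, 1.0)], [(0.0, 1.0), (1.0, 3.0)])
```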
The error between prescribed and generated trajectory must be minimized to obtain
the optimal mechanism. There are two options for carrying out this minimization. In the
modality known as prescribed timing, only the dimensional parameters are optimized,
which requires solving the system shown in Equation (12). In this case, the input parameters
ϕi are not variables to be optimized, but constant (prescribed) values. In the modality
known as unprescribed timing, however, both the dimensional parameters a j
and the set of input parameters ϕi are optimized, which requires solving the systems given by
Equations (12) and (13). This last option is more complex, but its potential to obtain precise
solutions is greater, since the values of the input parameters are not restricted.
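$$\frac{\partial E}{\partial a_j} = 0 \;\rightarrow\; \sum_{i=1}^{N} \left[ \left(x_i - x_i^d\right) \frac{\partial x_i}{\partial a_j} + \left(y_i - y_i^d\right) \frac{\partial y_i}{\partial a_j} \right] = 0 \quad \forall j = 1, 2, \ldots, n \tag{12}$$
$$\frac{\partial E}{\partial \varphi_i} = 0 \;\rightarrow\; \left(x_i - x_i^d\right) \frac{\partial x_i}{\partial \varphi_i} + \left(y_i - y_i^d\right) \frac{\partial y_i}{\partial \varphi_i} = 0 \quad \forall i = 1, 2, \ldots, N \tag{13}$$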
From the 4 possible values of ti ( ϕi ), the one that gives the minimum error is the
selected one.
For a design with 3 dimensional parameters, Equation (13) becomes a polynomial of
degree 10. Even so, it is easy to obtain the 10 roots and detect the correct one proceeding as
in the previous case. However, with 4 or more dimensional parameters, it is no longer as
easy to determine its corresponding univariate polynomial, nor is it really worth it. It is
more practical to operate as explained next.
Equation (13) is solved numerically, starting from an initial approximation, and
arriving at a unique solution for the parameter ϕi. To guarantee that the globally optimal
value of ϕi is obtained, the initial approximation is chosen
as follows: each summand of Equation (11), which represents the error made in each synthesis
position i, is evaluated as a function of the input parameter ϕi along the discretized domain
[0, 2π). In this way, a graph similar to the blue curve represented in Figure 4 is obtained,
where two minima of the error function appear, the one indicated on the right being the
one with the lowest value. This value of ϕi is taken as the starting approximation to
solve Equation (13) (in our case, using the MATLAB fsolve command).
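The scan over the discretized domain can be sketched as follows. This is a hedged illustration rather than the authors' code: `best_start_angle` is a hypothetical name, `trajectory` is any callable returning the generated point (x, y) for a given ϕ, and the ellipse used in the usage line is only a stand-in curve, not a slider-crank coupler curve.

```python
import math


def best_start_angle(trajectory, target, samples=360):
    """Scan phi over [0, 2*pi) and return the sample closest to the target point."""
    xd, yd = target
    best_phi, best_err = 0.0, float("inf")
    for k in range(samples):
        phi = 2.0 * math.pi * k / samples
        x, y = trajectory(phi)
        err = (x - xd) ** 2 + (y - yd) ** 2  # one summand of Eq. (11)
        if err < best_err:
            best_phi, best_err = phi, err
    return best_phi, best_err


def ellipse(phi):
    """Stand-in closed curve (assumption), used only to exercise the scan."""
    return 2.0 * math.cos(phi), math.sin(phi)


phi0, err0 = best_start_angle(ellipse, (0.0, 1.2))
```

The returned angle is only as accurate as the grid spacing. In the procedure described above it would then seed a root finder applied to Equation (13), which refines it to the exact local minimizer.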
To illustrate this concept, in Figure 5 the prescribed point for a synthesis position i
and the trajectory generated by the mechanism of dimensions a j in the current iteration
are indicated. The effect of evaluating the summand of the error function (11) and choosing
the absolute minimum is equivalent to traversing the generated trajectory and selecting
the point on it (the black point) closest to the prescribed one (red point). In this example,
the black point indicated in Figure 5 corresponds to the absolute minimum, ϕ = 5.88 rad,
which is the one previously indicated in Figure 4.
• Second phase:
In this phase, the values ϕi obtained in the previous phase are treated as
constants, and the unknowns (a1 , a2 , . . . , a8 ) are calculated from the subsystem given by
Equation (12).
As an example, in the simplified particular case of the slider-crank mechanism with
2 parameters, Equation (12) becomes the following linear system:
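$$\begin{bmatrix} 4 \sum_{i=1}^{N} \cos^2\varphi_i & \sum_{i=1}^{N} \sin 2\varphi_i \\ \sum_{i=1}^{N} \sin 2\varphi_i & N \end{bmatrix} \begin{Bmatrix} a_1 \\ a_4 \end{Bmatrix} = \begin{Bmatrix} 2 \sum_{i=1}^{N} \cos\varphi_i \cdot x_i^d \\ \sum_{i=1}^{N} \left( \sin\varphi_i \cdot x_i^d + \cos\varphi_i \cdot y_i^d \right) \end{Bmatrix} \tag{15}$$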
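Because Eq. (15) is a 2×2 linear system, it can be solved in closed form once the input angles are fixed. The Python sketch below builds the coefficients exactly as they appear in Eq. (15) and applies Cramer's rule. The helper `two_parameter_point` is an assumption made here for illustration: the model x = 2·a1·cos ϕ + a4·sin ϕ, y = a4·cos ϕ is chosen because its normal equations reproduce Eq. (15), and it is not stated explicitly in this excerpt. All function names are hypothetical.

```python
import math


def two_parameter_point(a1, a4, phi):
    """Hypothetical model (assumption): x = 2*a1*cos(phi) + a4*sin(phi), y = a4*cos(phi)."""
    return 2.0 * a1 * math.cos(phi) + a4 * math.sin(phi), a4 * math.cos(phi)


def solve_two_parameter(phis, targets):
    """Solve the linear system of Eq. (15) for (a1, a4) by Cramer's rule."""
    m11 = 4.0 * sum(math.cos(p) ** 2 for p in phis)
    m12 = sum(math.sin(2.0 * p) for p in phis)
    m22 = float(len(phis))
    b1 = 2.0 * sum(math.cos(p) * xd for p, (xd, _) in zip(phis, targets))
    b2 = sum(math.sin(p) * xd + math.cos(p) * yd for p, (xd, yd) in zip(phis, targets))
    det = m11 * m22 - m12 * m12
    if abs(det) < 1e-12:
        raise ValueError("singular system: the input angles do not determine a1 and a4")
    a1 = (b1 * m22 - m12 * b2) / det
    a4 = (m11 * b2 - m12 * b1) / det
    return a1, a4


# Points generated by the assumed model are recovered exactly (up to rounding).
angles = [0.1, 0.9, 1.7, 2.5, 3.3]
points = [two_parameter_point(1.5, 0.7, p) for p in angles]
a1_fit, a4_fit = solve_two_parameter(angles, points)
```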
However, in the design cases with 3 or more dimensional parameters, the subsystem
of equations turns out to be non-linear, making it necessary to apply numerical solving
In general, the optimization problem with inequality constraints can be formulated
as minimizing the error subject to
$$g_k(a_1, \ldots, a_n) \geq 0 \quad \forall k = 1, 2, \ldots, l$$
When an inequality constraint, such as Equation (16), is included in the process, the
error function yields
$$E = \sum_{i=1}^{N} \left(x_i - x_i^d\right)^2 + \sum_{i=1}^{N} \left(y_i - y_i^d\right)^2 + \lambda \cdot \left( \sum_{j=1}^{m} a_j C_j - C_0 \right)^{2} \tag{17}$$
If Equation (16) is not fulfilled during the iterations of the optimization
process, it is necessary to amplify the value of the error to invalidate the corresponding
design a j . This can be done by means of the λ parameter, which acts as a penalty factor,
proceeding as follows:
$$\lambda = \begin{cases} \text{predefined value} & \text{if } \sum_{j=1}^{m} a_j C_j - C_0 < 0 \\ 0 & \text{if } \sum_{j=1}^{m} a_j C_j - C_0 \geq 0 \end{cases}$$
If, on the other hand, the condition to be imposed is an equality constraint, then λ
will always be a certain predefined value. In this paper, after reviewing similar works in
the literature and conducting several trials with different values, the value λ = 50 was
established, insofar as it provides good results for all the analyzed design cases.
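The penalty scheme of Eq. (17) can be sketched in Python as follows. This is an illustrative reading of the text, not the authors' code: the function name and argument layout are hypothetical, the constraint is the linear form Σ a_j·C_j − C_0 that appears in Eq. (17), and the default λ = 50 is the value quoted above.

```python
def penalized_error(generated, prescribed, a, C, C0, lam=50.0, equality=False):
    """Eq. (17): squared path error plus a quadratic penalty on a linear constraint."""
    base = sum(
        (x - xd) ** 2 + (y - yd) ** 2
        for (x, y), (xd, yd) in zip(generated, prescribed)
    )
    g = sum(aj * cj for aj, cj in zip(a, C)) - C0
    # Inequality g >= 0: penalize only when violated. Equality: always penalize.
    penalty_active = equality or g < 0.0
    return base + (lam * g ** 2 if penalty_active else 0.0)


path = [(0.0, 0.0)]
violated = penalized_error(path, path, [1.0, 2.0], [1.0, -1.0], 0.0)   # g = -1
satisfied = penalized_error(path, path, [1.0, 2.0], [1.0, 1.0], 0.0)   # g = +3
as_equality = penalized_error(path, path, [1.0, 2.0], [1.0, 1.0], 0.0, equality=True)
```

With a perfect path fit, the three calls isolate the penalty term: a violated inequality adds λ·g², a satisfied one adds nothing, and an equality constraint is penalized whenever g differs from zero.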
3. Results
The following examples will demonstrate the validity of the proposed methodology,
showing the results of each design case.
The eight dimensional parameters and the 16 input parameters (unprescribed timing)
will be optimized, that is, the most complete case possible. Following the previously
described technique, this example is solved using the proposed multi-start method, starting
from 100 different starting mechanisms. The conditions of the crank-input (based on
Grashof’s criterion) and a maximum 3:1 ratio between the longest and shortest bar are
imposed by means of penalty functions.
i    x_i^d    y_i^d
1 3.2500 1.7010
2 3.6294 1.6510
3 3.9330 1.8840
4 4.0995 2.1724
5 4.2665 2.4616
6 4.4330 2.7500
7 4.4829 3.1294
8 4.2500 3.4330
9 4.0000 3.5000
10 3.7143 3.5000
11 3.4286 3.5000
12 3.1429 3.5000
13 2.8571 3.5000
14 2.5714 3.5000
15 2.2857 3.5000
16 2.0000 3.5000
The best solution, shown in Figure 8, leads to an error equal to 0.0073. The correspond-
ing design parameters are included in Table 2. In the Supplementary Materials, a video
showing the motion of this optimum mechanism can be found.
To illustrate the influence that the starting point has on the final solution obtained,
Figure 9 shows, by way of example, the optimal mechanism obtained from another starting
point, which does not reach a solution as good as the one shown above, so it would be
excluded.
Parameter  Value     Input  Value    Input  Value
a1         1.292     ϕ1     4.877    ϕ9     2.992
a2         3.277     ϕ2     4.568    ϕ10    2.804
a3         1.292     ϕ3     4.409    ϕ11    2.634
a4         3.875     ϕ4     4.267    ϕ12    2.476
a5         1.970     ϕ5     4.104    ϕ13    2.323
a6         3.090     ϕ6     3.905    ϕ14    2.171
a7         1.356     ϕ7     3.579    ϕ15    2.014
a8         −0.583    ϕ8     3.192    ϕ16    1.846
Figure 9. Example 1: comparison to other multi-start solution (multi-start no. 10), E = 0.0227.
The authors used this same example in a recent work, reference [38], applying
another optimization method to the four-bar mechanism. It should be taken into account
that the four-bar mechanism has an additional dimensional parameter with respect to the
slider-crank and thus, in principle, a greater potential to fit the prescribed trajectories.
Nevertheless, comparing the two optimal mechanisms (the error in [38] with the four-bar
was 0.0089, against 0.0073 here), the errors are of almost the same magnitude, and the
current one is even slightly smaller.
Figure 10. Example 2: optimal solution for: (a) four-bar and (b) slider-crank mechanism.
Parameter  Value     Input  Value    Input  Value     Input  Value    Input  Value
a1         2.309     ϕ1     4.703    ϕ9     −0.237    ϕ17    2.715    ϕ25    4.451
a2         48.819    ϕ2     4.92     ϕ10    0.049     ϕ18    2.933
a3         −3.304    ϕ3     5.041    ϕ11    0.672     ϕ19    3.17
a4         24.498    ϕ4     5.181    ϕ12    1.331     ϕ20    3.483
a5         46.315    ϕ5     5.362    ϕ13    1.698     ϕ21    3.762
a6         4.726     ϕ6     5.528    ϕ14    1.996     ϕ22    3.752
a7         −18.02    ϕ7     5.745    ϕ15    2.281     ϕ23    4.091
a8         15.293    ϕ8     5.875    ϕ16    2.494     ϕ24    4.265
Figure 11. (a) Watt’s mechanism optimal solution from [41]; (b) optimal solution for slider-crank
mechanism.
Parameter  Value     Input  Value    Input  Value
a1         6.719     ϕ1     5.580    ϕ9     3.502
a2         15.635    ϕ2     5.349    ϕ10    3.101
a3         5.217     ϕ3     5.116    ϕ11    2.708
a4         8.737     ϕ4     4.918    ϕ12    2.549
a5         −6.710    ϕ5     4.703
a6         −2.924    ϕ6     4.508
a7         42.513    ϕ7     4.124
a8         35.584    ϕ8     3.848
4. Discussion
In this paper, a hybrid optimization procedure has been presented to address the
dimensional synthesis of path generation using the slider-crank mechanism. On the one
hand, a novel approach has been proposed to solve the non-linear equations corresponding
to the null gradient condition; it is based on the decoupling of two subsystems of
equations and simplifies the resolution procedure without loss of precision.
On the other hand, the multi-start method gives the local optimization method a global
character by carrying out a prior sweep of the whole
space of the dimensional variables to find the most promising starting approximations.
To carry out this search, a genetic algorithm is used where only the eight dimensional
parameters are involved, thanks to the fact that the fitness function has been programmed
to select the optimal input parameters based on the dimensions.
In addition, it should be noted that this methodology is aimed at achieving designs
that are valid from a practical point of view. Therefore, the non-existence of branch defects
is ensured, and different design restrictions can be incorporated in a simple way through the
use of penalty functions. In this way, it can be ensured that the mechanism has a crank
input, does not exceed a maximum ratio between bar lengths, or meets other additional
design requirements at the discretion of the designer.
Finally, the effectiveness of the method has been proved by verifying, through various
examples, that the slider-crank mechanism achieves solutions with a precision
comparable to that of other one-degree-of-freedom mechanisms that have a greater number of
dimensional parameters, such as the four-bar or Watt’s mechanism, with the additional
advantage of having simpler kinematics.
References
1. Erdman, A.G.; Sandor, G.N. Mechanism Design: Analysis and Synthesis, 4th ed.; Pearson: London, UK, 2001.
2. Wampler, C.; Morgan, A.P.; Sommese, A.J. Complete Solution of the Nine-Point Path Synthesis Problem for Four-Bar Linkages. J.
Mech. Des. 1992, 114, 153–159. [CrossRef]
3. Lee, W.-T.; Russell, K. Developments in quantitative dimensional synthesis (1970–present): Four-bar motion generation. Inverse
Probl. Sci. Eng. 2017, 26, 133–148. [CrossRef]
4. Lee, W.-T.; Russell, K. Developments in quantitative dimensional synthesis (1970–present): Four-bar path and function generation.
Inverse Probl. Sci. Eng. 2018, 26, 1280–1304. [CrossRef]
5. Alizade, R.I.; Mohan Rao, A.V.; Sandor, G.N. Optimum Synthesis of Four-Bar and Offset Slider-Crank Planar and Spatial
Mechanisms Using the Penalty Function Approach with Inequality and Equality Constraints. J. Eng. Ind. 1975, 97, 785–790.
[CrossRef]
6. Rao, A.C. Optimum synthesis of a slider-crank mechanism using geometric programming. Int. J. Numer. Methods Eng. 1980, 15,
1595–1602. [CrossRef]
7. Plecnik, M.M.; McCarthy, J.M. Five position synthesis of a slider-crank function generator. In Proceedings of the ASME
International Conference IDETC/CIE 2011, Washington, DC, USA, 28–31 August 2011; pp. 317–324.
8. Almandeel, A.; Murray, A.P.; Myszka, D.H.; Stumph, H.E. A Function Generation Synthesis Methodology for All Defect-Free
Slider-Crank Solutions for Four Precision Points. J. Mech. Robot. 2015, 7, 031020–031021. [CrossRef]
9. Liniecki, A. Synthesis of a slider-crank mechanism with consideration of dynamic effects. J. Mech. 1970, 5, 337–349. [CrossRef]
10. Davidson, J.K. Analysis and synthesis of a slider-crank mechanism with a flexibly-attached slider. J. Mech. 1970, 5, 239–247.
[CrossRef]
11. Zhou, H.; Ting, K.-L. Adjustable slider–crank linkages for multiple path generation. Mech. Mach. Theory 2002, 37, 499–509.
[CrossRef]
12. Russell, K.; Sodhi, R. On the Design of Slider-Crank Mechanisms. Part I: Multi-Phase Motion Generation. Mech. Mach. Theory
2005, 40, 285–299. [CrossRef]
13. Russell, K.; Sodhi, R.S. On the design of slider-crank mechanisms. Part II: Multi-phase path and function generation. Mech. Mach.
Theory 2005, 40, 301–317. [CrossRef]
14. Zhou, H. Dimensional synthesis of adjustable path generation linkages using the optimal slider adjustment. Mech. Mach. Theory
2009, 44, 1866–1876. [CrossRef]
15. Sancibrian, R.; Sarabia, E.G.; Sedano, A.; Blanco, J.M. A general method for the optimal synthesis of mechanisms using prescribed
instant center positions. Appl. Math. Model. 2016, 40, 2206–2222. [CrossRef]
16. Sun, J.; Chu, J. Fourier series representation of the coupler curves of spatial linkages. Appl. Math. Model. 2010, 34, 1396–1403.
[CrossRef]
17. Liu, W.; Sun, J.; Zhang, B.; Chu, J. Wavelet feature parameters representations of open planar curves. Appl. Math. Model. 2018, 57,
614–624. [CrossRef]
18. Jianwei, S.; Jinkui, C.; Baoyu, S. A unified model of harmonic characteristic parameter method for dimensional synthesis of
linkage mechanism. Appl. Math. Model. 2012, 36, 6001–6010. [CrossRef]
19. Ding, H.; Huang, P.; Zi, B.; Kecskeméthy, A. Automatic synthesis of kinematic structures of mechanisms and robots especially for
those with complex structures. Appl. Math. Model. 2012, 36, 6122–6131. [CrossRef]
20. Chi-Yeh, H. A general method for the optimum design of mechanisms. J. Mech. 1967, 1, 301–313. [CrossRef]
21. Angeles, J.; Alivizatos, A.; Akhras, A. An unconstrained nonlinear least-square method of optimization of RRRR planar path
generators. Mech. Mach. Theory 1988, 23, 343–353. [CrossRef]
22. Avilés, R.; Navalpotro, S.; Amezua, E.; Hernández, A. An Energy-Based General Method for the Optimum Synthesis of Mechanisms.
J. Mech. Des. 1994, 116, 127–136. [CrossRef]
23. Vallejo, J.; Avilés, R.; Hernández, A.; Amezua, E. Nonlinear optimization of planar linkages for kinematic syntheses. Mech. Mach.
Theory 1995, 30, 501–518. [CrossRef]
24. Sancibrian, R.; Viadero, F.; García, P.; Fernández, A. Gradient-based optimization of path synthesis problems in planar mechanisms.
Mech. Mach. Theory 2004, 39, 839–856. [CrossRef]
25. Sancibrian, R.; De Juan, A.; Sedano, A.; Iglesias, M.; García, P.; Viadero, F.; Fernandez, A. Optimal Dimensional Synthesis of
Linkages Using Exact Jacobian Determination in the SQP Algorithm. Mech. Based Des. Struct. Mach. 2012, 40, 469–486. [CrossRef]
26. Mariappan, J.; Krishnamurty, S. A generalized exact gradient method for mechanism synthesis. Mech. Mach. Theory 1996, 31,
413–421. [CrossRef]
27. Cabrera, J.; Simon, A.; Prado, M. Optimal synthesis of mechanisms with genetic algorithms. Mech. Mach. Theory 2002, 37,
1165–1177. [CrossRef]
28. Acharyya, S.; Mandal, M. Performance of EAs for four-bar linkage synthesis. Mech. Mach. Theory 2009, 44, 1784–1794. [CrossRef]
29. Buśkiewicz, J.; Starosta, R.; Walczak, T. On the application of the curve curvature in path synthesis. Mech. Mach. Theory 2009, 44,
1223–1239. [CrossRef]
30. Kafash, S.H.; Nahvi, A. Optimal synthesis of four-bar path generator linkages using Circular Proximity Function. Mech. Mach.
Theory 2017, 115, 18–34. [CrossRef]
31. Gogate, G.R.; Matekar, S.B. Optimum synthesis of motion generating four-bar mechanisms using alternate error functions. Mech.
Mach. Theory 2012, 54, 41–61. [CrossRef]
32. Bulatović, R.R.; Ðord̄ević, S.R. Control of the optimum synthesis process of a four-bar linkage whose point on the working
member generates the given path. Appl. Math. Comput. 2011, 217, 9765–9778. [CrossRef]
33. Xiao, R.; Tao, Z. A Swarm Intelligence Approach to Path Synthesis of Mechanism. In Proceedings of the Ninth International
Conference on Computer Aided Design and Computer Graphics (CAD-CG’05), Hong Kong, China, 7–10 December 2005; Institute
of Electrical and Electronics Engineers (IEEE): Piscataway, NJ, USA, 2005; pp. 451–456.
34. Bulatovic, R.R.; Miodragovic, G.; Boskovic, M.S. Modified Krill Herd (MKH) algorithm and its application in dimensional
synthesis of a four-bar linkage. Mech. Mach. Theory 2016, 95, 1–21. [CrossRef]
35. Ebrahimi, S.; Payvandy, P. Efficient constrained synthesis of path generating four-bar mechanisms based on the heuristic
optimization algorithms. Mech. Mach. Theory 2015, 85, 189–204. [CrossRef]
36. Vasiliu, A.; Yannou, B. Dimensional synthesis of planar mechanisms using neural networks: Application to path generator
linkages. Mech. Mach. Theory 2001, 36, 299–310. [CrossRef]
37. Sedano, A.; Sancibrian, R.; De-Juan, A.; Viadero, F.; Egaña, F. Hybrid Optimization Approach for the Design of Mechanisms
Using a New Error Estimator. Math. Probl. Eng. 2012, 2012, 1–20. [CrossRef]
38. Hernández, A.; Muñoyerro, A.; Urízar, M.; Amezua, E. Comprehensive approach for the dimensional synthesis of a four-bar
linkage based on path assessment and reformulating the error function. Mech. Mach. Theory 2021, 156, 104126. [CrossRef]
39. Laribi, M.A.; Mlika, A.; Romdhane, L.; Zeghloul, S. A combined genetic algorithm-fuzzy logic method (GA-FL) in mechanism
synthesis. Mech. Mach. Theory 2004, 39, 717–735. [CrossRef]
40. Smaili, A.; Diab, N. Optimum synthesis of hybrid-task mechanisms using ant-gradient search method. Mech. Mach. Theory 2007,
42, 115–130. [CrossRef]
41. Fernández-Bustos, I.; Aguirrebeitia, J.; Avilés, R.; Angulo, C. Kinematical synthesis of 1-dof mechanisms using finite elements
and genetic algorithms. Finite Elements Anal. Des. 2005, 41, 1441–1463. [CrossRef]
mathematics
Article
Path Analysis for Hybrid Rigid–Flexible Mechanisms
Oscar Altuzarra, David Manuel Solanillas *, Enrique Amezua and Victor Petuya
Department of Mechanical Engineering, University of the Basque Country UPV/EHU, 48013 Bilbao, Spain;
oscar.altuzarra@ehu.eus (O.A.); enrique.amezua@ehu.eus (E.A.); victor.petuya@ehu.eus (V.P.)
* Correspondence: david.m.solanillas.frances@gmail.com or davidmanuel.solanillas@ehu.eus
Abstract: Hybrid rigid–flexible mechanisms are a type of compliant mechanism that combines rigid
and flexible elements; their mobility is due to rigid-body joints and the relative flexibility
of bendable rods. Two of the modeling methods for flexible rods are the Cosserat rod model and its
simplification, the Kirchhoff rod model. Both lead to a system of differential equations that
must be solved in conjunction with the boundary constraints of the rod, resulting in a boundary value
problem (BVP). In this work, two methods to solve this BVP are applied to analyze the influence of
external loads on the movement of hybrid compliant mechanisms. First, a shooting method (SM)
is used to directly integrate the shape of the flexible rod and the forces that appear in it. Then, an
integration with elliptic integrals (EI) is carried out to solve the workspace of the compliant element,
considering its buckling mode. Applying both methods, an algorithm is developed that obtains the
locus of all possible trajectories of the mechanism's coupler point and detects buckling mode changes.
This algorithm also allows calculating all possible circuits of the mechanism. Thus, the
performance of this method within the path analysis of mechanisms is demonstrated.
Keywords: hybrid compliant mechanisms; path analysis; numerical methods; elliptic integrals; kinematics
Citation: Altuzarra, O.; Solanillas, D.M.; Amezua, E.; Petuya, V. Path Analysis for Hybrid Rigid-Flexible Mechanisms. Mathematics 2021, 9, 1869. https://doi.org/10.3390/math9161869
Academic Editors: Higinio Rubio Alonso, Alejandro Bustos Caballero, Jesus Meneses Alonso and Enrique Soriano-Heras
Received: 8 July 2021
Accepted: 2 August 2021
Published: 6 August 2021
Publisher's Note: MDPI stays neutral with regard to jurisdictional claims in published maps and institutional affiliations.
Copyright: © 2021 by the authors. Licensee MDPI, Basel, Switzerland. This article is an open access article distributed under the terms and conditions of the Creative Commons Attribution (CC BY) license (https://creativecommons.org/licenses/by/4.0/).
1. Introduction
A compliant body is one whose motion depends on its geometry, its material properties
and the location and magnitude of the applied forces. If a body of this kind belongs to a
mechanism, it is known as a compliant element. When a mechanism is fully composed of
compliant elements, e.g., slender rods, it is named a compliant mechanism, and it gains
all or part of its mobility thanks to the relative flexibility of those compliant elements. If,
instead, the mechanism combines rigid and compliant elements, it is called a hybrid
compliant or hybrid rigid–flexible mechanism [1].
Regarding applications, such compliant (and, especially, hybrid-compliant) mechanisms
are an alternative to robotic systems of rigid bodies [2], or even cable robots [3], used in tasks
where there is a human–machine interface. Additionally, such hybrid-flexible mechanisms
are a plausible alternative to sub-systems subjected to impacts, such as four-link legs in
bio-inspired mobile multipod robots [4]. Finally, their elasticity, if appropriately tuned, can
play a beneficial role in the dynamic characteristics of systems used in high-speed link
motions, reducing the need for balancing [5].
For the complete kinematic characterization of a linkage with rigid bars, all the circuits
of the mechanism should be obtained [6]. Each circuit is the set of all possible orientations of
links that can be realized without disconnecting any of the joints [7]. If the linkage needs
to be disassembled to move from one position to another, these positions lie on different
circuits. The circuits of a mechanism do not depend on the input link chosen. The circuit
analysis for the Watt- and Stephenson-like six-bar mechanisms is presented in [8,9]. It
should be noted that, in the case of compliant mechanisms, the different circuits are obtained
not by disassembling any link but by deforming a flexible rod through an external load.
To solve the movement of these hybrid rigid–flexible mechanisms, it is necessary to
obtain the change of shape of the compliant elements and the loads that these bodies withstand.
There are a few modeling methods to analyze flexible rods, such as the Chained Beam-
Constraint Model (CBCM) [10] or Pseudo-Rigid-Body Model (PRBM) [11,12]. Another model
is the Cosserat rod model, which produces the equations that describe the deformed shape
of a slender flexible element and the loads that appear in it [13,14]. This modeling method
leads to a set of differential equations that, in conjunction with the boundary constraints of
the compliant element, takes the form of a boundary value problem (BVP). This BVP can
be solved with the aid of a shooting method (SM) [15], which combines a direct integration
method, such as 4th order Runge–Kutta (R-K), and a numerical minimization method, e.g.,
Newton–Raphson (N–R), which refines the solutions given by the direct integration.
If the extension of the neutral line and the transverse shear strain are neglected in the
Cosserat rod model, the Kirchhoff model is obtained. The equations of this new model
can also be solved by applying elliptic integrals (EI) that make use of the two parameters
known as the modulus of the function (k) and the amplitude of the elliptic integral (φ) [16].
When a compliant element deforms, points of null moment can appear along its length.
These points of null moment are those where a change of curvature is produced, and they
are also known as inflection points [17,18]. Hence, the slender rod deforms changing
its curvature in each of these points, leading to what is known as buckling modes. These
buckling modes are ordered according to the number of inflection points. For instance, if a
rod has only one of these points, it is in its first buckling mode. If the same rod has two
points, it is in its second buckling mode, and so on. It is worth noting that the higher the
buckling mode, the more deformation energy is needed for it to appear. For this reason, the
compliant elements tend to deform in lower buckling modes, as they require less energy.
These buckling modes can be determined using EI, which can also be used to obtain the
rod workspace in each buckling mode [19].
As mentioned above, the motion of the compliant bodies, and hence that of the
compliant mechanisms, depends on the applied forces. Therefore, this paper makes use of
the SM and EI to assess the influence of the loads on the movement of a compliant mechanism. In
addition, the work developed here shows how these methods allow obtaining the locus of
all possible paths of the mechanism's coupler under any applied load.
A mechanism similar to that found in [20] was used to carry out the work in this paper,
and it is based on a mechanism of [21] (ch. 12). The mechanisms differ in the boundary
constraints of the compliant element. In [20], a clamped-clamped rod is used,
whereas in this paper a clamped-hinged rod is analyzed.
Thus, in the following section, an introduction to the general equations of flexible rods
and the different integration methods is given. Moreover, a way to obtain the workspace
of a slender rod using EI is also depicted here. In the third section, a closed-loop hybrid
mechanism is proposed, and it is used to analyze the load-dependent paths after explaining
its general equations. Finally, the potential of the developed algorithm to detect the
buckling mode changes of flexible rods and the possible circuits of the mechanism is
also shown.
The derivation of the mathematical structure of this section draws on Antman's work [13]
(chs. 4 and 8) on nonlinear elasticity. Nevertheless, the nomenclature used here is that
used in the robotics community, in line with the work of Caleb [22].
The framework needed to describe the shape of a slender rod in space includes three
main aspects that, coupled, generate the nonlinear system of differential equations that has
to be solved in order to obtain the relationship between force and deformation. These are
as follows: a kinematic definition of the deformed rod, material constitutive laws and static
equilibrium equations.
2.1.1. Kinematics
A three-dimensional parametric Cartesian curve $p(s) \in \mathbb{R}^3$ that links the centroids
of each transverse section, together with the orthonormal rotation matrix $R(s) \in SO(3)$ that
orients a local frame attached to the section, is used to define the deformed shape of a rod.
Here, $SO(3)$ is the special orthogonal group in three dimensions,
$SO(3) = \{R \in \mathbb{R}^{3 \times 3} \mid R^T R = I,\ \det(R) = 1\}$. The principal axes of each
section are usually named the x- and y-axes. The z-axis is perpendicular to the cross-section and
tangent to the deformed shape; see Figure 1. A scalar reference arc-length parameter $s$ is used to
position and orient each cross-section. This parameter lies within the finite interval $s \in [0, L]$,
where $L$ is the length of the rod in the initial state. Because the sections are assumed not to
distort, the whole deformation of the flexible bar can be described by a mapping from $s$ to
a homogeneous rigid-body transformation, $T(s) \in SE(3)$ ($SE(3)$ is the special Euclidean
group in three dimensions), with $p(s) \in \mathbb{R}^3$ and $R(s) \in SO(3)$:
$$T(s) = \begin{bmatrix} R(s) & p(s) \\ 0^T & 1 \end{bmatrix} \tag{1}$$
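As a small illustration of the definitions above, the following sketch builds one element of SO(3), checks the two defining conditions, and assembles the homogeneous transformation of (1). It is only an assumed example: the helper names (`rot_z`, `is_rotation`, `homogeneous`) and the numerical values are hypothetical and do not come from the paper.

```python
import math

def rot_z(theta):
    """Rotation about the z-axis, used here as a simple element of SO(3)."""
    c, s = math.cos(theta), math.sin(theta)
    return [[c, -s, 0.0], [s, c, 0.0], [0.0, 0.0, 1.0]]

def matmul(a, b):
    """Product of two matrices stored as nested lists."""
    return [[sum(a[i][k] * b[k][j] for k in range(len(b)))
             for j in range(len(b[0]))] for i in range(len(a))]

def transpose(a):
    return [list(row) for row in zip(*a)]

def det3(m):
    return (m[0][0] * (m[1][1] * m[2][2] - m[1][2] * m[2][1])
            - m[0][1] * (m[1][0] * m[2][2] - m[1][2] * m[2][0])
            + m[0][2] * (m[1][0] * m[2][1] - m[1][1] * m[2][0]))

def is_rotation(r, tol=1e-12):
    """Check the SO(3) conditions R^T R = I and det(R) = 1."""
    rtr = matmul(transpose(r), r)
    orthonormal = all(abs(rtr[i][j] - (1.0 if i == j else 0.0)) < tol
                      for i in range(3) for j in range(3))
    return orthonormal and abs(det3(r) - 1.0) < tol

def homogeneous(r, p):
    """Assemble T = [[R, p], [0^T, 1]] as a 4x4 nested list, as in (1)."""
    return [r[i] + [p[i]] for i in range(3)] + [[0.0, 0.0, 0.0, 1.0]]

# Hypothetical cross-section pose: 30 degrees about z, centroid at (0.1, 0.2, 0).
R = rot_z(math.radians(30.0))
T = homogeneous(R, [0.1, 0.2, 0.0])
```

Nested lists are used instead of an array library only to keep the sketch free of dependencies.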
Figure 1. A rod deformed in space, along with frames used.
For the unloaded pre-curved state of the rod, the magnitudes described above are
designated with the subscript ◦ .
A priori, R◦(s) could be assigned arbitrarily. However, one can establish conventions
governing the assignment of reference orientations such that the mapping from R◦(s)
to R(s) has an easily interpretable meaning in terms of material strains. As mentioned
above, we have chosen to assign the reference orientation such that the z-axis of the local
frame is tangent to the reference curve (see Figure 1). Then, the
following applies:
$$R_\circ(s)\, e_3 = \frac{dp_\circ / ds}{\lVert dp_\circ / ds \rVert} = \frac{p_\circ'}{\lVert p_\circ' \rVert} \tag{2}$$
where $e_1$, $e_2$ and $e_3$ are used for the standard basis vectors of the local frame: $[1\ 0\ 0]^T$,
$[0\ 1\ 0]^T$ and $[0\ 0\ 1]^T$, respectively. The prime superscript ($'$) is used to indicate differentiation with
respect to the arc length $s$.
Now, let us consider the derivatives of p(s) and R(s) with respect to s. The position
and orientation evolve along the arc length according to the rates of change, linear v(s) ∈ R3
and angular u(s) ∈ R3 . These rates of change are defined in the local frame, and obtained
from the above mentioned derivatives, defined in the global frame, with the use of R(s) T
as follows:
$$v(s) = R(s)^T \frac{dp(s)}{ds} = R(s)^T p'(s), \qquad U(s) = R(s)^T R'(s) \tag{3}$$
Because $U(s)$ is defined through three independent values, $u_x(s)$, $u_y(s)$ and $u_z(s)$, it
can be expressed with a vector $u(s) = [u_x(s)\ u_y(s)\ u_z(s)]^T$ such that $u = U^{\vee}$ and $U = \hat{u}$,
where the $\vee$ operator denotes conversion of an element of so(3) (so(3) is the Lie algebra of
SO(3)) to its corresponding element in $\mathbb{R}^3$. The inverse operation, denoted by the hat $\hat{\ }$, maps $\mathbb{R}^3$ to
so(3), so that $(\hat{u})^{\vee} = u$. In the following, $u(s)$ is used whenever a vector form is needed to
express $U$.
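The two operators can be illustrated with a short sketch. The function names `hat` and `vee` are hypothetical labels for the operators described above, and the skew-symmetric layout used is the usual one, for which $\hat{u}\,w = u \times w$.

```python
def hat(u):
    """Map a vector of R^3 to the corresponding skew-symmetric matrix in so(3)."""
    ux, uy, uz = u
    return [[0.0, -uz, uy],
            [uz, 0.0, -ux],
            [-uy, ux, 0.0]]

def vee(U):
    """Inverse of hat: extract the vector from a skew-symmetric matrix."""
    return [U[2][1], U[0][2], U[1][0]]

def cross(a, b):
    """Cross product a x b, used to check that hat(a) applied to b gives a x b."""
    return [a[1] * b[2] - a[2] * b[1],
            a[2] * b[0] - a[0] * b[2],
            a[0] * b[1] - a[1] * b[0]]

def apply(M, w):
    """Matrix-vector product for a 3x3 nested list."""
    return [sum(M[i][j] * w[j] for j in range(3)) for i in range(3)]

u = [1.0, 2.0, 3.0]   # arbitrary test vector
w = [-0.5, 0.25, 4.0] # arbitrary test vector
```

With these definitions, `vee(hat(u))` returns `u`, and `apply(hat(u), w)` coincides with `cross(u, w)`.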
When v(s) and u(s) are those of the reference, i.e., the unloaded shape of the rod, they
are designated as v◦ and u◦. Because of the convention for the rotation matrix stated in (2),
an initially straight rod has v◦ = [0 0 1]^T and u◦ = 0.
The material strain can be defined by the variations Δv(s) = v(s) − v◦(s) and
Δu(s) = u(s) − u◦(s). The local elongation is represented by Δvz: because v◦,z = 1 under the
convention in (2), values of vz above 1 (Δvz > 0) correspond to extension, values below 1
(Δvz < 0) to compression, and vz = 1 (Δvz = 0) means that the length of the curve has not
changed. The shear deformation along the local axes is described by Δvx and Δvy. The x and y
components of Δu(s), Δux and Δuy, measure the bending about the local x- and y-axes, while
Δuz measures the torsion.
magnitudes in the local frame. It is worth noting that the subscript SE indicates shear and
extension, and BT, bending and torsion.
with
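$$K_{SE} = \begin{bmatrix} GA(s) & 0 & 0 \\ 0 & GA(s) & 0 \\ 0 & 0 & EA(s) \end{bmatrix} \tag{7}$$
$$K_{BT} = \begin{bmatrix} EI_x(s) & 0 & 0 \\ 0 & EI_y(s) & 0 \\ 0 & 0 & GJ_z(s) \end{bmatrix} \tag{8}$$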
where A is the area of cross section, E is Young’s modulus, G is the shear modulus, Ix and
Iy are the second moment of area of the rod cross section about the principal axes x and y,
and Jz is the polar moment about the local axis z. The linear constitutive law applied here
is commonly used in mechanics for the small strain range. This happens for most metals
and for polymers in the range of small strain.
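A minimal sketch of how the two stiffness matrices could be assembled for a uniform circular cross-section is given below. The section formulas ($A = \pi r^2$, $I_x = I_y = \pi r^4/4$, $J_z = \pi r^4/2$) and the isotropic relation $G = E/(2(1+\nu))$ are standard results rather than statements from the paper, and the material values are placeholders, not the data of the rod studied here.

```python
import math

def circular_section_stiffness(E, nu, radius):
    """Return (K_SE, K_BT) of (7) and (8) for a solid circular cross-section.

    Assumes an isotropic material, so that G = E / (2 (1 + nu)).
    """
    G = E / (2.0 * (1.0 + nu))
    A = math.pi * radius ** 2          # cross-section area
    I = math.pi * radius ** 4 / 4.0    # second moment of area (Ix = Iy)
    J = 2.0 * I                        # polar moment about the local z-axis
    K_se = [[G * A, 0.0, 0.0],
            [0.0, G * A, 0.0],
            [0.0, 0.0, E * A]]
    K_bt = [[E * I, 0.0, 0.0],
            [0.0, E * I, 0.0],
            [0.0, 0.0, G * J]]
    return K_se, K_bt

# Placeholder values (hypothetical): E in Pa, Poisson ratio, radius in m.
E_demo, nu_demo, r_demo = 60e9, 0.3, 0.5e-3
K_SE, K_BT = circular_section_stiffness(E_demo, nu_demo, r_demo)
```

For a circular section the two bending stiffnesses coincide, and the ratio of torsional to bending stiffness reduces to $1/(1+\nu)$.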
The rotation matrix is used to change n(s) and m(s) from the local axes to the fixed
global axes.
Expanding the magnitudes in s + ds with the Taylor series until the element of first
order, simplifying terms, and dividing by ds gives the following:
$$\frac{dn(s)}{ds} + f(s) = 0 \tag{12}$$
On the other hand, a balance of moments, expanding variables in s + ds with the
Taylor series until the element of first order, simplifying terms, and taking into account (12),
yields the following:
$$\frac{dm(s)}{ds} + \frac{dp(s)}{ds} \times n + l(s) = 0 \tag{13}$$
So, the nonlinear ordinary differential equations for the equilibrium of a Cosserat rod
describing the evolution of the internal force n(s) and moment m(s) along arc length are
as follows:
$$n'(s) + f(s) = 0 \tag{14}$$
$$m'(s) + p'(s) \times n(s) + l(s) = 0 \tag{15}$$
where $u = K_{BT}^{-1} R^T m + u_\circ$.
Figure 2. Deformed shape of a clamped-hinged planar rod under load R at extreme.
The rotation matrix that orients each cross-section depends only on the angle
$\theta = \theta(s)$:
$$R(s) = \begin{bmatrix} \cos\theta & -\sin\theta & 0 \\ \sin\theta & \cos\theta & 0 \\ 0 & 0 & 1 \end{bmatrix} \tag{17}$$
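As a brief check of the planar kinematics, the rotation matrix (17) can be substituted into the definition $U = R^T R'$ of (3). The worked step below is a sketch added for clarity; it uses only (3) and (17).

```latex
\begin{aligned}
R'(s) &= \theta'
\begin{bmatrix} -\sin\theta & -\cos\theta & 0 \\ \cos\theta & -\sin\theta & 0 \\ 0 & 0 & 0 \end{bmatrix},
\\[4pt]
U(s) = R^T R' &= \theta'
\begin{bmatrix} 0 & -1 & 0 \\ 1 & 0 & 0 \\ 0 & 0 & 0 \end{bmatrix}
\quad\Longrightarrow\quad
u(s) = U^{\vee} = \begin{bmatrix} 0 & 0 & \theta' \end{bmatrix}^T .
\end{aligned}
```

Only the z component of $u$ is non-zero, and it equals the rate of change of the section angle along the arc length.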
This makes sense, as there is no bending moment with respect to the local y-axis and
no torsion (expressed through u x ).
Taking (20) into (16) considering this time v = v◦ = e1 , and taking n and m as
state variables, the expression of the system of nonlinear ordinary differential equations
in (16) yields the following:
$$\begin{Bmatrix} \dfrac{dx}{ds} \\[4pt] \dfrac{dy}{ds} \\[4pt] \dfrac{d\theta}{ds} \\[4pt] \dfrac{dn_x}{ds} \\[4pt] \dfrac{dn_y}{ds} \\[4pt] \dfrac{dm_z}{ds} \end{Bmatrix} = \begin{Bmatrix} \cos\theta \\ \sin\theta \\ \dfrac{m_z}{EI} + u_{\circ,z} \\ -f_x \\ -f_y \\ n_x \sin\theta - n_y \cos\theta - l_z \end{Bmatrix} \tag{21}$$
If no distributed force and moment (f and l) are applied along the rod, we obtain
the following:
$$\begin{Bmatrix} \dfrac{dx}{ds} \\[4pt] \dfrac{dy}{ds} \\[4pt] \dfrac{d\theta}{ds} \\[4pt] \dfrac{dn_x}{ds} \\[4pt] \dfrac{dn_y}{ds} \\[4pt] \dfrac{dm_z}{ds} \end{Bmatrix} = \begin{Bmatrix} \cos\theta \\ \sin\theta \\ \dfrac{m_z}{EI} \\ 0 \\ 0 \\ n_x \sin\theta - n_y \cos\theta \end{Bmatrix} \tag{22}$$
Hence, in this particular case, the internal force n is constant. Such a rod with no
distributed loads is quite common in the applications.
The system of differential equations in (22) can be solved by using a direct integration
method, such as the 4th order R-K, or a method based on elliptic integrals.
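The direct-integration route can be sketched as follows. This is an assumed minimal implementation of a classical 4th order R-K march over the system (22), with the state ordered as $[x, y, \theta, n_x, n_y, m_z]$; the function names, the step count and the test values are hypothetical choices, not taken from the paper.

```python
import math

def rod_rhs(state, EI):
    """Right-hand side of (22) for the state [x, y, theta, nx, ny, mz]."""
    x, y, theta, nx, ny, mz = state
    return [math.cos(theta),
            math.sin(theta),
            mz / EI,
            0.0,   # nx is constant when there is no distributed force
            0.0,   # ny is constant when there is no distributed force
            nx * math.sin(theta) - ny * math.cos(theta)]

def rk4_integrate(state0, EI, length, steps=200):
    """March the state from s = 0 to s = length with the classical RK4 scheme."""
    h = length / steps
    y = list(state0)
    for _ in range(steps):
        k1 = rod_rhs(y, EI)
        k2 = rod_rhs([a + 0.5 * h * b for a, b in zip(y, k1)], EI)
        k3 = rod_rhs([a + 0.5 * h * b for a, b in zip(y, k2)], EI)
        k4 = rod_rhs([a + h * b for a, b in zip(y, k3)], EI)
        y = [a + h / 6.0 * (b + 2.0 * c + 2.0 * d + e)
             for a, b, c, d, e in zip(y, k1, k2, k3, k4)]
    return y

# Pure-moment check (hypothetical values): the rod bends into a circular arc
# of curvature kappa = mz / EI, for which the tip position is known in closed form.
EI_demo, L_demo, m_demo = 2.0, 1.0, 1.0
tip = rk4_integrate([0.0, 0.0, 0.0, 0.0, 0.0, m_demo], EI_demo, L_demo)
kappa = m_demo / EI_demo
```

With no internal force, the moment stays constant and the computed tip should match $x = \sin(\kappa L)/\kappa$, $y = (1 - \cos(\kappa L))/\kappa$ and $\theta = \kappa L$, which gives a simple way to validate the integrator before using it inside an iterative scheme.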
Focusing on the FKP of a rod that is clamped at the proximal end s = 0 and
subjected to a known force $n_{ext}$ and moment $m_{ext}$ at the distal end s = L (see Figure 2),
the rod undergoes a deformation that depends on these boundary conditions and on the
geometric and mechanical characterization of the rod.
If the effect of gravity is not taken into account, there is no distributed force along the
length. Therefore, the shape and the load status of the rod are defined by (22).
This takes the form of a BVP, where the independent variable is s in [0, L], and the
vector of dependent variables, y, and the non-linear function, f(s, y), are the following:
$$y = \begin{Bmatrix} x \\ y \\ \theta \\ n_x \\ n_y \\ m_z \end{Bmatrix}, \qquad f(s, y) = \begin{Bmatrix} \cos\theta \\ \sin\theta \\ \dfrac{m_z}{EI} \\ 0 \\ 0 \\ n_x \sin\theta - n_y \cos\theta \end{Bmatrix} \tag{23}$$
The boundary conditions are of the kinematic kind at s = 0 and of the load kind at s = L.
Regarding the kinematic variables, the position and orientation are known at s = 0
and unknown at s = L. As for the internal forces, because no distributed force is
considered, their value does not change along the length of the rod. At s = 0, because of
static equilibrium, the internal force takes the same value as the external load, $n_{ext}$,
applied at s = L, but in the opposite direction (although the internal forces could be taken out
of the integration, in this work they are included because they are needed to solve the FKP of the
whole mechanism). With respect to the internal moment, its value is known at s = L (in
fact, $m(L) = m_{ext}$) and unknown at s = 0.
Then, y at both ends are the following:
$$y(0) = \begin{Bmatrix} x(0) \\ y(0) \\ \theta(0) \\ n_x(0) \\ n_y(0) \\ m_z(0) \end{Bmatrix} = \begin{Bmatrix} x_0 \\ y_0 \\ \theta_0 \\ ? \\ ? \\ ? \end{Bmatrix}, \qquad y(L) = \begin{Bmatrix} x(L) \\ y(L) \\ \theta(L) \\ n_x(L) \\ n_y(L) \\ m_z(L) \end{Bmatrix} = \begin{Bmatrix} ? \\ ? \\ ? \\ n_{ext,x} \\ n_{ext,y} \\ m_{ext} \end{Bmatrix} \tag{24}$$
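The split of known and unknown entries in (24) can be turned into a small shooting sketch for a single rod. Because the internal force is constant and equal to its value at s = L, the only quantity to be guessed at s = 0 is $m_z(0)$, and the target is $m_z(L) = m_{ext}$. The code below is an assumed illustration: it places the clamp at the origin with $\theta_0 = 0$, and it uses a secant update in place of the Newton–Raphson refinement mentioned in the text, only to keep the example derivative-free. All names and values are hypothetical.

```python
import math

def integrate_rod(m0, nx, ny, EI, length, steps=200):
    """RK4 integration of (22) from a clamp at the origin with theta0 = 0.

    The reduced state is [x, y, theta, mz]; nx and ny stay constant.
    """
    def rhs(y):
        return [math.cos(y[2]), math.sin(y[2]), y[3] / EI,
                nx * math.sin(y[2]) - ny * math.cos(y[2])]
    h = length / steps
    y = [0.0, 0.0, 0.0, m0]
    for _ in range(steps):
        k1 = rhs(y)
        k2 = rhs([a + 0.5 * h * b for a, b in zip(y, k1)])
        k3 = rhs([a + 0.5 * h * b for a, b in zip(y, k2)])
        k4 = rhs([a + h * b for a, b in zip(y, k3)])
        y = [a + h / 6.0 * (b + 2.0 * c + 2.0 * d + e)
             for a, b, c, d, e in zip(y, k1, k2, k3, k4)]
    return y

def shoot_base_moment(nx, ny, m_ext, EI, length,
                      guesses=(0.0, 0.1), tol=1e-10, max_iter=50):
    """Secant iteration on mz(0) until mz(L) matches m_ext."""
    def residual(m0):
        return integrate_rod(m0, nx, ny, EI, length)[3] - m_ext
    a, b = guesses
    fa, fb = residual(a), residual(b)
    for _ in range(max_iter):
        if abs(fb) < tol:
            return b
        if fb == fa:
            break  # flat secant: no further progress is possible
        a, b, fa = b, b - fb * (b - a) / (fb - fa), fb
        fb = residual(b)
    raise RuntimeError("shooting did not converge")

# Hypothetical cantilever: small transverse tip force, free of tip moment.
P, EI_demo, L_demo = 0.01, 1.0, 1.0
m0 = shoot_base_moment(0.0, P, 0.0, EI_demo, L_demo)
tip = integrate_rod(m0, 0.0, P, EI_demo, L_demo)
```

For such a small load the computed tip deflection should be close to the linear beam value $P L^3 / (3EI)$, which offers a quick plausibility check of the sign conventions assumed in the sketch.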
$$\frac{dy}{ds} = f(s, y), \qquad y(s_0) = y_0 \tag{25}$$
where y is a vector that contains the unknown variables at each s, the function f is given, s is
the independent variable, and the subscript 0 indicates initial values.
n x = R cos ψ (26)
ny = R sin ψ (27)
For constant E and I, with a stress-free reference straight rod, considering the well-
known Bernoulli–Euler law, and upon substitution of (26) and (27) into (22), we obtain
the following:
$$\frac{d^2\theta}{ds^2} = \frac{1}{EI}\left(R\cos\psi\sin\theta - R\sin\psi\cos\theta\right) = \frac{R}{EI}\sin(\theta - \psi) \tag{28}$$
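The intermediate steps behind this result can be spelled out as follows. The sketch uses only the third and sixth rows of (22), the decomposition (26) and (27), and the sine difference identity.

```latex
\begin{aligned}
\frac{d\theta}{ds} &= \frac{m_z}{EI}
\;\;\Longrightarrow\;\;
\frac{d^2\theta}{ds^2} = \frac{1}{EI}\,\frac{dm_z}{ds}
= \frac{1}{EI}\left(n_x \sin\theta - n_y \cos\theta\right)
&& \text{(constant } EI\text{)} \\[4pt]
&= \frac{R}{EI}\left(\cos\psi \sin\theta - \sin\psi \cos\theta\right)
&& \text{(using } n_x = R\cos\psi,\; n_y = R\sin\psi\text{)} \\[4pt]
&= \frac{R}{EI}\,\sin(\theta - \psi)
&& \text{(since } \sin(\theta-\psi) = \sin\theta\cos\psi - \cos\theta\sin\psi\text{)}
\end{aligned}
```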
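Therefore, the system of differential equations (22) can be simplified to the following:
$$\begin{Bmatrix} \dfrac{dx}{ds} \\[4pt] \dfrac{dy}{ds} \\[4pt] \dfrac{d^2\theta}{ds^2} \end{Bmatrix} = \begin{Bmatrix} \cos\theta \\ \sin\theta \\ \dfrac{R}{EI}\sin(\theta - \psi) \end{Bmatrix} \tag{29}$$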
Elliptic integrals are the classical mathematical tool to solve (29) because of their rapid
computation. The following equations are used in this work to solve for the coordinates of
an arbitrary point along the beam, and they are found in [21] (ch. 4).
$$x(\phi_i) = -\sqrt{\frac{EI}{R}}\cos\psi\,\big[2E(k, \phi_i) - 2E(k, \phi_1) - F(k, \phi_i) + F(k, \phi_1)\big] - \sqrt{\frac{EI}{R}}\,2k\sin\psi\,\big[\cos\phi_i - \cos\phi_1\big] \tag{30}$$
$$y(\phi_i) = -\sqrt{\frac{EI}{R}}\sin\psi\,\big[2E(k, \phi_i) - 2E(k, \phi_1) - F(k, \phi_i) + F(k, \phi_1)\big] + \sqrt{\frac{EI}{R}}\,2k\cos\psi\,\big[\cos\phi_i - \cos\phi_1\big] \tag{31}$$
where the functions $F(k, \phi)$ and $E(k, \phi)$ are the incomplete elliptic integrals of the first and
second kind, respectively. Moreover, the following holds:
$$\sqrt{\frac{EI}{R}} = \frac{L}{F(k, \phi_2) - F(k, \phi_1)} \tag{32}$$
Thus, the value of the end-tip force R in terms of k and φ can be obtained from (32) as:
$$\sqrt{R} = \frac{\sqrt{EI}}{L}\,\big[F(k, \phi_2) - F(k, \phi_1)\big] \tag{33}$$
In addition, the bending moments along the rod are given by the following:
$$m_i = 2k\sqrt{R\,EI}\,\cos\phi_i \tag{34}$$
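Equations (30) to (34) can be evaluated numerically with a few lines of code. The sketch below is an assumed illustration: the incomplete elliptic integrals are computed with a composite Simpson rule rather than a library routine (note that some libraries, such as SciPy, take the parameter $m = k^2$ instead of the modulus $k$), the amplitudes $\phi_1$ and $\phi_2$ are treated as given inputs, and all function names and numerical values are hypothetical.

```python
import math

def _simpson(f, a, b, n=2000):
    """Composite Simpson rule with an even number n of sub-intervals."""
    h = (b - a) / n
    total = f(a) + f(b)
    for i in range(1, n):
        total += (4.0 if i % 2 else 2.0) * f(a + i * h)
    return total * h / 3.0

def ellip_F(k, phi):
    """Incomplete elliptic integral of the first kind, modulus k (|k| < 1)."""
    return _simpson(lambda t: 1.0 / math.sqrt(1.0 - (k * math.sin(t)) ** 2), 0.0, phi)

def ellip_E(k, phi):
    """Incomplete elliptic integral of the second kind, modulus k (|k| < 1)."""
    return _simpson(lambda t: math.sqrt(1.0 - (k * math.sin(t)) ** 2), 0.0, phi)

def tip_force(k, phi1, phi2, EI, L):
    """End-tip force R from (33)."""
    return EI * ((ellip_F(k, phi2) - ellip_F(k, phi1)) / L) ** 2

def rod_point(k, psi, phi1, phi_i, EI, R):
    """Coordinates (30) and (31) of the rod point with amplitude phi_i."""
    c = math.sqrt(EI / R)
    bracket = (2.0 * ellip_E(k, phi_i) - 2.0 * ellip_E(k, phi1)
               - ellip_F(k, phi_i) + ellip_F(k, phi1))
    dcos = math.cos(phi_i) - math.cos(phi1)
    x = -c * math.cos(psi) * bracket - c * 2.0 * k * math.sin(psi) * dcos
    y = -c * math.sin(psi) * bracket + c * 2.0 * k * math.cos(psi) * dcos
    return x, y

def bending_moment(k, phi_i, EI, R):
    """Bending moment (34) at the point with amplitude phi_i."""
    return 2.0 * k * math.sqrt(R * EI) * math.cos(phi_i)

# Hypothetical inputs: modulus, force angle, clamp amplitude and tip amplitude.
k_demo, psi_demo, phi1_demo, phi2_demo = 0.5, 0.2, 0.3, math.pi / 2.0
EI_demo, L_demo = 1.0, 1.0
R_demo = tip_force(k_demo, phi1_demo, phi2_demo, EI_demo, L_demo)
x_tip, y_tip = rod_point(k_demo, psi_demo, phi1_demo, phi2_demo, EI_demo, R_demo)
```

Choosing $\phi_2 = \pi/2$ in the usage lines makes $\cos\phi_2 = 0$, so the moment returned by `bending_moment` vanishes at the tip, which is the situation of a hinged end.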
Hence, (29) is integrated to yield (30), (31), and (33), i.e., a parametric system that
uniquely defines a deformed rod in terms of parameters k and ψ for known boundary
conditions and a given buckling mode.
A pinned end-tip implies that no moment is acting at that point. Applying this condition
to (34), we obtain the following:
$$0 = 2k\sqrt{R\,EI}\,\cos\phi_2 \tag{37}$$
This means that the end-tip is a point of null curvature and, hence, an inflection point of the
deflected curve. Additionally, the second part of the equation implies the following:
Figure 3. (a) Given values of k and ψ. (b) Deformed shapes for buckling modes 1 and 2 of a
clamped-hinged rod.
Figure 4. (a) Deformed shape for buckling mode 1. (b) Deformed shape for buckling mode 2. Both
correspond to a clamped-pinned rod for a given end-tip position.
Figure 5. Rod's end-tip workspace for buckling mode 1. (a) Positive k. (b) Negative k.
The curves plotted in Figure 5 were calculated for the first buckling mode of the
rod. To obtain the workspace for other buckling modes, it is necessary to execute the
calculation again, changing φ2 in (38), for the desired mode. For instance, Figure 6a shows
the workspace for positive k and buckling mode 2 of the rod, and Figure 6b represents that
for negative k and the same mode.
Figure 6. Rod's end-tip workspace for buckling mode 2. (a) Positive k. (b) Negative k.
As for the orange lines in Figures 5 and 6, they come from the calculation of the rod's
end-tip position for the upper limit of the range of k and varying ψ. The green lines are
calculated with the lower limit of the interval of k and varying ψ. To close the workspace
area, the purple lines are plotted. They correspond to the calculation with the upper limit
of the interval of ψ and varying k. In contrast, the cyan lines derive from the computation
with the lower limit of the interval of ψ and varying k.
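The four boundary curves described above can be generated with a simple parameter sweep. The sketch below assumes a hypothetical callable `tip(k, psi)` that returns the end-tip position for one buckling mode, for instance an evaluation of (30) and (31) at the tip; the toy mapping used in the usage lines is only a stand-in to exercise the function and has no physical meaning.

```python
import math

def linspace(a, b, n):
    """n evenly spaced values from a to b, both included."""
    return [a + (b - a) * i / (n - 1) for i in range(n)]

def workspace_boundary(tip, k_range, psi_range, n=50):
    """Return the four curves that close the end-tip workspace.

    tip       -- callable (k, psi) -> (x, y); assumed to be supplied by the user
    k_range   -- (lower, upper) limits of k
    psi_range -- (lower, upper) limits of psi
    """
    k_lo, k_hi = k_range
    psi_lo, psi_hi = psi_range
    ks = linspace(k_lo, k_hi, n)
    psis = linspace(psi_lo, psi_hi, n)
    return {
        "k_upper": [tip(k_hi, psi) for psi in psis],   # orange lines in the figures
        "k_lower": [tip(k_lo, psi) for psi in psis],   # green lines
        "psi_upper": [tip(k, psi_hi) for k in ks],     # purple lines
        "psi_lower": [tip(k, psi_lo) for k in ks],     # cyan lines
    }

# Stand-in mapping (not a rod model): a polar map, used only to test the sweep.
toy_tip = lambda k, psi: (k * math.cos(psi), k * math.sin(psi))
curves = workspace_boundary(toy_tip, (0.2, 0.9), (0.0, math.pi / 2.0), n=20)
```

Each curve shares its end points with two of the others, so plotting the four lists in sequence closes the workspace area.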
The dimensional data and the description of each bar are given in Table 1. Each angle
shown in Figure 7 is explained in Table 2.
Angle   Description
α       Input angle, which positions the crank
θ1      Orientation of the rigid coupler BC
θ2      Relative angle between BC and EF segments; it has a constant value of 90° throughout the cycle
The flexible segment CD is a slender rod, made of nitinol, with a circular cross-section;
its properties are shown in Table 3.
where $m^*_{z,C}$ is the bending moment at C, $\sum M_{z_B}$ is the sum of all moments about B produced
by the external loads, and the superscript * indicates that the variables are not the exact
solutions; instead, they are the approximate solutions obtained numerically in each step of
the SM.
The first two components of (39) are restrictions of the loads kind, and state that the
moments about points B and C must be null because in those points there are hinges. The
other restriction is of the geometric kind and imposes that the distance between points B
and C is the length of the rigid coupler BC.
To find the solution of the mechanism's position, the SM needs two inputs: the
residue function (39) and a vector of guess values, $guess = [n_x^{guess}\ n_y^{guess}\ m_z^{guess}]^T$, to start
the integration of the rod CD. These guess values are needed to solve the BVP shown in
Section 2.3. The 4th order R-K method explained in that section can be particularized for
the case of the rod CD by setting $m_{ext}$ equal to 0, since there is a hinge at point C.
For the first α step, these guess variables can take arbitrary values. Nevertheless,
previous calculations were made in this work to obtain guess values that improve the
convergence of the SM. For the next α steps, the vector of guess values is composed of the
$[n_x\ n_y\ m_z]^T$ values of the solution of the previous α step.
With these two inputs to the SM, the process to obtain the position of point C and the
shape of rod CD is as follows:
1. For each guess, apply 4th order R-K to obtain $y^* = [x_C^*\ y_C^*\ \theta_C^*\ n_{x,C}^*\ n_{y,C}^*\ m_{z,C}^*]^T$; see
Section 2.3.
2. Evaluate the residue (39). To do this, $x_C^*$ and $y_C^*$ should be used to calculate $\theta_1^*$, and
$\theta_1^*$, $n_{x,C}^*$ and $n_{y,C}^*$ to solve $\sum M_{z_B}$. The value of $m_{z,C}^*$ from the direct integration of the rod CD is
directly compared with 0.
3. If all terms are below a tolerance, exit: the final solution is obtained.
4. If not, calculate the variation of the residue with the guess: $J = \dfrac{d\,residue}{d\,guess}$.
5. Modify the guess values: $guess_{i+1} = guess_i - J^{-1}\,residue$.
6. Start the loop again.
The results are obtained when the three components of the residue are below a tolerance.
With this, the position of the mechanism is obtained for the current α step, and the results
are the guess values for the next α step.
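The loop above can be sketched generically as follows. This is an assumed illustration, not the authors' implementation: the residue is passed in as a callable (in the mechanism it would wrap the R-K integration of rod CD and the three conditions of the residue function), the Jacobian is approximated by forward differences, and the small linear system is solved by Gaussian elimination. All function names and the toy residues in the usage lines are hypothetical.

```python
def solve_linear(a, b):
    """Solve a x = b by Gaussian elimination with partial pivoting."""
    n = len(b)
    m = [list(a[i]) + [b[i]] for i in range(n)]
    for col in range(n):
        piv = max(range(col, n), key=lambda r: abs(m[r][col]))
        if abs(m[piv][col]) < 1e-14:
            raise ValueError("singular Jacobian")
        m[col], m[piv] = m[piv], m[col]
        for r in range(col + 1, n):
            factor = m[r][col] / m[col][col]
            for c in range(col, n + 1):
                m[r][c] -= factor * m[col][c]
    x = [0.0] * n
    for i in reversed(range(n)):
        x[i] = (m[i][n] - sum(m[i][j] * x[j] for j in range(i + 1, n))) / m[i][i]
    return x

def newton_shooting(residue, guess, tol=1e-10, max_iter=50, eps=1e-7):
    """Steps 1 to 6: iterate guess <- guess - J^-1 residue until all terms < tol."""
    g = list(guess)
    for _ in range(max_iter):
        r = residue(g)                      # steps 1 and 2
        if max(abs(ri) for ri in r) < tol:  # step 3
            return g
        cols = []                           # step 4: forward-difference Jacobian
        for j in range(len(g)):
            gp = list(g)
            gp[j] += eps
            rp = residue(gp)
            cols.append([(rp[i] - r[i]) / eps for i in range(len(r))])
        J = [[cols[j][i] for j in range(len(g))] for i in range(len(r))]
        step = solve_linear(J, r)           # step 5
        g = [gi - si for gi, si in zip(g, step)]
    raise RuntimeError("shooting did not converge")

def sweep(residue_at, alphas, first_guess):
    """Continuation over the input angle: each solution seeds the next step."""
    guess, path = list(first_guess), []
    for alpha in alphas:
        guess = newton_shooting(lambda g: residue_at(alpha, g), guess)
        path.append(guess)
    return path

# Toy residues (no physical meaning), used only to exercise the solver.
sol = newton_shooting(lambda g: [g[0] ** 2 + g[1] ** 2 - 4.0, g[0] - g[1], g[2] - 1.0],
                      [1.0, 1.0, 0.0])
path = sweep(lambda a, g: [g[0] - a, g[1] - 2.0 * a, g[2] - a * a],
             [0.0, 0.5, 1.0], [0.0, 0.0, 0.0])
```

Seeding each α step with the previous solution keeps the iteration on the same solution branch as long as consecutive poses are close to each other.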
Figure 8. Trajectory of point F after a complete turn of the crank AB and under different load cases.
Figure 9. Trajectories for different load cases. (a) External constant moments. (b) External
constant forces.
that, in the case of this mechanism, there is always a point of null moment, i.e., null
curvature, in the distal end of the slender rod CD because of the hinge.
3. Once the buckling mode is known, solve the workspace of the rod CD, applying the
concepts of Section 2.4.2, and plot it.
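The detection of the buckling mode from the integrated moment can be sketched as a count of sign changes. The code below is an assumed illustration: it takes the moment sampled along the rod, counts interior sign changes as points of null moment, and adds one for the hinged tip, where the moment reaches zero without necessarily changing sign. The function names, the tolerance and the sample moment distributions are hypothetical.

```python
import math

def count_sign_changes(values, tol=1e-9):
    """Number of sign changes in a sampled sequence, skipping near-zero samples."""
    signs = [1 if v > 0 else -1 for v in values if abs(v) > tol]
    return sum(1 for a, b in zip(signs, signs[1:]) if a != b)

def buckling_mode(moments, hinged_tip=True, tol=1e-9):
    """Buckling mode taken as the number of null-moment (inflection) points.

    Interior points are found through sign changes of the sampled moment;
    a hinged tip contributes one further null-moment point.
    """
    interior = count_sign_changes(moments, tol)
    return interior + (1 if hinged_tip else 0)

# Hypothetical moment samples along s in [0, 1], both vanishing at the tip.
s_values = [i / 100.0 for i in range(101)]
mode_one_like = [math.cos(0.5 * math.pi * s) for s in s_values]
mode_two_like = [math.cos(1.5 * math.pi * s) for s in s_values]
```

Under these assumptions, `buckling_mode(mode_one_like)` gives 1 and `buckling_mode(mode_two_like)` gives 2; a change of the returned value between two consecutive α steps flags a buckling mode change.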
An example of the application of this algorithm is shown in Figures 10–15 (an animation
is attached in the Supplementary Materials: Video S1), in which a complete turn of
the crank AB is simulated while an external horizontal force of 5 N is applied at point F.
The points of null moment of the rod CD are plotted in red. The
flexible rod CD is working in its first buckling mode when the cycle starts (see Figure 10).
Moreover, the k parameter of the EI that allows calculating the workspace is positive. The rod
CD works in this mode (see Figure 11) until α reaches 159 degrees, where its
buckling mode changes from 1 to 2 (see Figure 12). In addition, k changes from positive
to negative values. The rod CD remains in its second buckling mode (see Figure 13)
until the input α reaches 285 degrees (see Figure 14). From this pose on, the rod
CD works in its first buckling mode, with negative k, until the crank AB completes the
turn (see Figure 15).
Figure 12. Cycle with buckling mode changes. Mode 1 to Mode 2 change.
Figure 13. Cycle with buckling mode changes. α = 235°.
Figure 14. Cycle with buckling mode changes. Mode 2 to Mode 1 change.
Figure 15. Cycle with buckling mode changes. Final position.
If the trajectory of point F is plotted against the moment of the external loading about
B for this load case, Figure 16 is obtained. It can be seen that the path is contained in the
locus presented in Section 3.2.
It is worth noting that a cut appears on the surface for moments less than −2 Nm. This
highlights the potential of the method to obtain the locus in which trajectories can lie for
different load cases. Looking at the shaded surface of Figure 16, it can be seen
that load cases that generate moments about B less than −2 Nm cannot produce complete
closed trajectories of the point F.
Thus, it is demonstrated that the SM proposed in Section 3.1 can detect and solve
buckling mode changes as long as the trajectory of the end-effector point is contained in its
corresponding locus.
Figure 16. Trajectory for an external applied load of 5 N in the positive x-axis.
Figure 17. First mechanism circuit under external loading of Fext,x = −4 N, Fext,y = −4 N, and
Mext,z = 4 Nm. Initial position.
Figure 18. First mechanism circuit under external loading of Fext,x = −4 N, Fext,y = −4 N, and
Mext,z = 4 Nm. Final position and trajectory.
Figure 19. Second mechanism circuit under external loading of Fext,x = −4 N, Fext,y = −4 N, and
Mext,z = 4 Nm. Initial position.
Figure 20. Second mechanism circuit under external loading of Fext,x = −4 N, Fext,y = −4 N, and
Mext,z = 4 Nm. Final position and trajectory.
Figure 21. Third mechanism circuit under external loading of Fext,x = −4 N, Fext,y = −4 N, and
Mext,z = 4 Nm. Initial position.
Figure 22. Third mechanism circuit under external loading of Fext,x = −4 N, Fext,y = −4 N, and
Mext,z = 4 Nm. Final position and trajectory.
[Flowchart of the algorithm; figure residue, with some symbols and the caption lost in extraction.
Recoverable content:
Inputs: initial unloaded position; Fext, Mext.
Circuit search: integrate the rod at points along a circumference with centre in B; find the
points of that circumference where there are sign changes; refine the potential solutions with
a SM similar to that of Section 3.1 but with a buckling-mode condition.
Inputs for each step: the angle; the initial position of the circuit for the first step; the
residue tolerance.
Step loop: check "Is residue < tolerance?"; search for sign changes of the internal moment of
the rod; solve and plot the workspace of the rod applying EI and the concepts of Section 2.4.2.
Decisions: "Is the final step?"; "Is the last circuit?".]
4. Discussion
In this paper, the performance of the SM that combines the 4th order R-K method
with a numerical minimization method, i.e., N-R scheme, to solve the position of hybrid
compliant mechanisms is shown. This method not only allows solving the FKP, but also
allows predicting the locus of all possible paths of the mechanism’s coupler under different
load cases. In this manner, this method can be used in the synthesis process of mechanisms
to know whether the concerned mechanism can describe the desired trajectory supporting
the external loading.
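The combination of a 4th order R-K integrator with an N-R correction can be illustrated on a much simpler problem than the mechanism studied here. The sketch below is only an assumption-laden illustration, not the authors' implementation: it takes a clamped-free planar rod with a transverse tip load in non-dimensional form (load = P L²/EI, unit length), so that θ'' = −load·cos θ with θ(0) = 0 and θ'(L) = 0. The function names `integrate_rod` and `shoot`, the step count, and the finite-difference derivative used in the N-R update are all hypothetical choices.

```python
import math

def integrate_rod(kappa0, load, length=1.0, steps=200):
    """RK4 integration of theta'' = -load*cos(theta) from the clamped end
    (theta = 0, theta' = kappa0). Returns (theta, theta') at the free end."""
    ds = length / steps
    theta, kappa = 0.0, kappa0

    def rhs(th, ka):
        # First-order system: d(theta)/ds = kappa, d(kappa)/ds = -load*cos(theta)
        return ka, -load * math.cos(th)

    for _ in range(steps):
        k1 = rhs(theta, kappa)
        k2 = rhs(theta + 0.5 * ds * k1[0], kappa + 0.5 * ds * k1[1])
        k3 = rhs(theta + 0.5 * ds * k2[0], kappa + 0.5 * ds * k2[1])
        k4 = rhs(theta + ds * k3[0], kappa + ds * k3[1])
        theta += ds * (k1[0] + 2 * k2[0] + 2 * k3[0] + k4[0]) / 6.0
        kappa += ds * (k1[1] + 2 * k2[1] + 2 * k3[1] + k4[1]) / 6.0
    return theta, kappa

def shoot(load, guess=0.0, tol=1e-10, max_iter=20):
    """Newton-Raphson on the unknown initial curvature kappa0 so that the
    free-end residue theta'(L) vanishes (moment-free tip)."""
    kappa0 = guess
    for _ in range(max_iter):
        _, residue = integrate_rod(kappa0, load)
        if abs(residue) < tol:
            return kappa0
        eps = 1e-6  # finite-difference step for the residue derivative
        _, residue_eps = integrate_rod(kappa0 + eps, load)
        slope = (residue_eps - residue) / eps
        kappa0 -= residue / slope
    raise RuntimeError("shooting method did not converge")
```

For a small load such as `shoot(0.1)`, the tip rotation returned by `integrate_rod` should be close to the linear beam value load/2, which gives a quick sanity check of the sketch.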
Furthermore, the SM is combined with EI to obtain the workspace of the
flexible rod, which allows changes of buckling mode to be detected.
Future work could look for an algorithm able to detect abrupt changes or loops in
the deformed shape of the compliant element. With this information, a highly
detailed knowledge of the mechanism motion can be acquired. All this would allow
designers to foresee the problems that could appear in the
prototyping phase.
Further research can be conducted on the multiplicity of circuits of this kind of
mechanism, applying the method proposed here. An ambitious goal could be an algorithm to
obtain the loci of all trajectories for different load cases and all the mechanism circuits since,
as can be seen in Figures 17–22, different circuits lead to different trajectory loci. The
influence of external loading on these circuits should also be studied.
5. Conclusions
The contribution of this paper is an algorithm that combines two numerical methods
(a shooting method and elliptic integrals) to solve the movement of a hybrid rigid–flexible
mechanism, taking into account the influence of the external load in that movement. The
external load can affect the coupler trajectory by changing the deformed shape of the
slender rod since, depending on the load, the flexible rod can work in different buckling
modes, or even the whole mechanism can operate in different circuits. For this reason, the
proposed algorithm is also developed to detect the circuit and buckling mode in which the
mechanism works.
With this work, the potential of this method to study the behavior of compliant
mechanisms is demonstrated. Its potential is such that it can be used, for instance, in the
modeling process of humanoid robots, such as the one that appears in [23].
Supplementary Materials: The following are available online at
https://www.mdpi.com/2227-7390/9/16/1869/s1: Video S1: Buckling mode change; Video S2: First
mechanism circuit; Video S3: Second mechanism circuit; Video S4: Third mechanism circuit.
Author Contributions: Conceptualization, O.A., D.M.S., E.A. and V.P.; methodology, O.A., D.M.S.,
E.A. and V.P.; software, D.M.S.; validation, O.A., D.M.S., E.A. and V.P.; formal analysis, O.A. and
D.M.S.; investigation, O.A., D.M.S., E.A. and V.P.; resources, O.A., E.A. and V.P.; data curation, O.A.
and D.M.S.; writing—original draft preparation, O.A. and D.M.S.; writing—review and editing, O.A.
and D.M.S.; visualization, O.A.; supervision, O.A., E.A. and V.P.; project administration, O.A. and
V.P.; funding acquisition, O.A. and V.P. All authors have read and agreed to the published version of
the manuscript.
Funding: This work was possible thanks to the funding of Project DPI2015-64450-R and PID2020-
116176GB-100 (MINECO/FEDER, UE), and Project Ref. IT949-16 (Departamento de Educación,
Política Lingüística y Cultura from the regional Basque Government).
Institutional Review Board Statement: Not applicable.
Informed Consent Statement: Not applicable.
Data Availability Statement: Not applicable.
Acknowledgments: The authors wish to acknowledge the financial support received from the
Spanish government through the Ministerio de Economía y Competitividad (Project DPI2015-64450-
R and PID2020-116176GB-100 (MINECO/FEDER, UE)), and the support for the research group
through Project Ref. IT949-16, provided by the Departamento de Educación, Política Lingüística y
Cultura from the regional Basque Government.
Conflicts of Interest: The authors declare no conflict of interest.
Abbreviations
The following abbreviations are used in this manuscript:
References
1. IFToMM Terminology—Compliant Mechanisms. Available online: http://www.iftomm-terminology.antonkb.nl/2057/frames.
html (accessed on 24 June 2021).
2. Görgülü, İ.; Can Dede, M.İ.; Carbone, G. Experimental Structural Stiffness Analysis of a Surgical Haptic Master Device
Manipulator. J. Med. Devices 2021, 15, 011110. [CrossRef]
3. Hamida, I.B.; Laribi, M.A.; Mlika, A.; Romdhane, L.; Zeghloul, S.; Carbone, G. On the Optimal Design of LAWEX for a Safe
Upper Arm Rehabilitation Exercising. In Mechanism Design for Robotics; MEDER 2021; Zeghloul, S., Laribi, M.A., Arsicault, M.,
Eds.; Mechanisms and Machine Science; Springer: Cham, Switzerland, 2021; Volume 103, pp. 313–321. [CrossRef]
4. Velázquez, R.; Garzón-Castro, C.L.; Acevedo, M.; Orvañanos-Guerrero, M.T.; Ghavifekr, A.A. Design and Characterization of a
Miniature Bio-Inspired Mobile Robot. In Proceedings of the 2021 12th International Symposium on Advanced Topics in Electrical
Engineering (ATEE), Bucharest, Romania, 25–27 March 2021; pp. 1–5. [CrossRef]
5. Orvañanos-Guerrero, M.T.; Acevedo, M.; Sánchez, C.N.; Giannoccaro, N.I.; Visconti, P.; Velázquez, R. Efficient Balancing
Optimization of a Simplified Slider-Crank Mechanism. In Proceedings of the 2020 IEEE ANDESCON, Quito, Ecuador, 13–16 October 2020;
pp. 1–6. [CrossRef]
6. Myszka, D.H.; Murray, A.P.; Wampler, C.W. Mechanism branches, turning curves and critical points. In Proceedings of
the IDETC/CIE 2012 ASME 2012 International Design Engineering Technical Conferences & Computers and Information in
Engineering Conference, Chicago, IL, USA, 12–15 August 2012; pp. 1513–1525. [CrossRef]
7. Chase, T.; Mirth, J. Circuits and Branches of Single-Degree-of-Freedom Planar Linkages. ASME J. Mech. Des. 1993, 115, 223–230.
[CrossRef]
8. Mirth, J.; Chase, T. Circuit Analysis of Watt Chain Six-Bar Mechanisms. ASME J. Mech. Des. 1993, 115, 214–222. [CrossRef]
9. Watanabe, K.; Katoh, H. Identification of Motion Domains of Planar Six-Link Mechanisms of the Stephenson-Type. Mech. Mach.
Theory 2004, 39, 1081–1099. [CrossRef]
10. Ma, F.; Chen, G. Modeling Large Planar Deflections of Flexible Beams in Compliant Mechanisms Using Chained
Beam-Constraint-Model. J. Mech. Robot. 2016, 8, 021018. [CrossRef]
11. Kuo, C.H.; Chen, Y.C.; Pan, T.Y. Continuum Kinematics of a Planar Dual-Backbone Robot Based on Pseudo-Rigid-Body Model:
Formulation, Accuracy, and Efficiency. In Proceedings of the ASME 2017 International Design Engineering Technical Conferences
and Computers and Information in Engineering Conference, Cleveland, OH, USA, 6–9 August 2017; ASME: New York, NY, USA,
2017. [CrossRef]
12. Midha, A.; Bapat, S.G.; Mavanthoor, A.; Chinta, V. Analysis of a Fixed-Guided Compliant Beam With an Inflection Point Using
the Pseudo-Rigid-Body Model Concept. J. Mech. Robot. 2015, 7, 031007. [CrossRef]
13. Antman, S.S. Nonlinear Problems of Elasticity, 2nd ed.; Springer: New York, NY, USA, 2005; Volume 107. [CrossRef]
14. Rucker, D.C.; Webster, R.J., III. Statics and dynamics of continuum robots with general tendon routing and external loading. IEEE
Trans. Robot. 2011, 27, 1033–1044. [CrossRef]
15. Black, C.B.; Till, J.; Rucker, D.C. Parallel Continuum Robots: Modeling, Analysis, and Actuation-Based Force Sensing. IEEE Trans.
Robot. 2018, 34, 29–47. [CrossRef]
16. Altuzarra, O.; Caballero, D.; Campa, F.J.; Pinto, C. Position analysis in planar parallel continuum mechanisms. Mech. Mach.
Theory 2019, 132, 13–29. [CrossRef]
17. Gere, J.M.; Timoshenko, S.P. Mechanics of Materials, 4th ed.; PWS Publishing Company: Boston, MA, USA, 1997.
18. Kimball, C.; Tsai, L.W. Modeling of Flexural Beams Subjected to Arbitrary End Loads. J. Mech. Des. 2002, 124, 223–235. [CrossRef]
19. Holst, G.L.; Teichert, G.H.; Jensen, B.D. Modeling and Experiments of Buckling Modes and Deflection of Fixed-Guided Beams in
Compliant Mechanisms. J. Mech. Des. 2011, 133, 051002. [CrossRef]
20. Zhang, A.; Chen, G. A Comprehensive Elliptic Integral Solution to the Large Deflection Problems of Thin Beams in Compliant
Mechanisms. J. Mech. Robot. 2013, 5, 021006. [CrossRef]
21. Howell, L.L.; Magleby, S.P.; Olsen, B.M. Handbook of Compliant Mechanisms; John Wiley & Sons, Ltd.: Hoboken, NJ, USA, 2013.
[CrossRef]
22. Rucker, D.C.; Jones, B.A.; Webster, R.J. A geometrically exact model for externally loaded concentric-tube continuum robots. IEEE
Trans. Robot. Publ. IEEE Robot. Autom. Soc. 2010, 26, 769–780. [CrossRef] [PubMed]
23. Cafolla, D.; Ceccarelli, M. Design and FEM analysis of a novel humanoid torso. Multibody Mechatron Syst. Mech. Mach. Sci. 2015,
25, 477–488. [CrossRef]
mathematics
Article
Adaptive Levenberg–Marquardt Algorithm: A New
Optimization Strategy for Levenberg–Marquardt
Neural Networks
Zhiqi Yan, Shisheng Zhong *, Lin Lin and Zhiquan Cui *
Abstract: Engineering data are often highly nonlinear and contain high-frequency noise, so the
Levenberg–Marquardt (LM) algorithm may not converge when a neural network optimized by the
algorithm is trained with engineering data. In this work, we analyzed the reasons for the poor
convergence commonly associated with LM neural networks. Specifically, the effects of
different activation functions such as Sigmoid, Tanh, Rectified Linear Unit (ReLU) and Parametric
Rectified Linear Unit (PReLU) were evaluated on the general performance of LM neural networks,
and special values of LM neural network parameters were found that could make the LM algorithm
converge poorly. We proposed an adaptive LM (AdaLM) algorithm to solve the problem of the LM
algorithm. The algorithm coordinates the descent direction and the descent step by the iteration
number, which can prevent falling into the local minimum value and avoid the influence of the
parameter state of LM neural networks. We compared the AdaLM algorithm with the traditional LM
algorithm and its variants in terms of accuracy and speed in the context of testing common datasets
and aero-engine data, and the results verified the effectiveness of the AdaLM algorithm.
Keywords: Levenberg–Marquardt algorithm; convergence; neural networks; local minima; optimization
Citation: Yan, Z.; Zhong, S.; Lin, L.; Cui, Z. Adaptive Levenberg–Marquardt Algorithm: A New
Optimization Strategy for Levenberg–Marquardt Neural Networks. Mathematics 2021, 9, 2176.
https://doi.org/10.3390/math9172176
Academic Editors: Higinio Rubio Alonso, Alejandro Bustos Caballero, Jesus Meneses Alonso and
Enrique Soriano-Heras
Received: 16 August 2021
Accepted: 1 September 2021
Published: 6 September 2021
Publisher’s Note: MDPI stays neutral with regard to jurisdictional claims in published maps and
institutional affiliations.
Copyright: © 2021 by the authors. Licensee MDPI, Basel, Switzerland. This article is an open access
article distributed under the terms and conditions of the Creative Commons Attribution (CC BY)
license (https://creativecommons.org/licenses/by/4.0/).
1. Introduction
When applied to real-world data interspersed with high nonlinearity and high-frequency
noise, LM neural networks have irreplaceable advantages. They reduce the requirement of
computing resources and guarantee high convergence speed, making their performance superior to
other existing regression methods. The LM neural network is an artificial neural network with
an LM optimizer. The LM algorithm is a second-order method for solving general least squares
problems. As an optimizer, the LM algorithm has wide engineering applications in LM neural
networks because of its fast convergence and small memory occupation [1–3].
However, the disadvantage of the LM algorithm is that LM neural networks may diverge
or converge to a bad local minimum. The paper [4] reported that the LM algorithm may
become stuck and fail to decrease the cost function. For a nonconvex optimization problem,
such as neural network optimization, there can be many local minima. The cost function may end
up in a “bad” local minimum when it is difficult to guarantee the nonsingularity of the Jacobian
matrix and Lipschitz continuity [5]. LM neural networks show higher errors than regular
neural networks in practical engineering applications [6].
Most recent work has tried to help the LM algorithm find one of the “good” local minima by
combining it with other algorithms for better engineering applications, because the modern
neural network community generally accepts that searching for a global minimum is often an
unnecessary endeavor. For example, the genetic algorithm (GA) can be used to optimize the initial
weights of LM neural networks [7,8], and Bayesian estimation can be used to calibrate model
variables. In Refs. [9,10], the wavelet method was used to preprocess data and the LM neural
network method was used to analyze them. In Ref. [11], the least squares support vector machine
(LSSVM) and LM were combined into a hybrid model. These hybrid methods have been shown to be
effective by experiments.
Several other research groups have looked into improving the LM algorithm. Yang [12]
presented a high-order LM method that has biquadratic convergence. Chen [13] presented
a fourth-order method called the accelerated modified LM method. Derakhshandeh [14]
presented a new three-step LM (TSLM) algorithm based on fuzzy logic theory (FLT). In
Ref. [15], an adaptive Levenberg–Marquardt (LM) algorithm-based echo state network was
proposed by adding a new adaptive damping term to the LM algorithm. These works
proved the validity of the modified algorithms by a trust region technique.
However, hybrid methods entail a lot of computation, which weakens the advantage
of the LM algorithm. The problem with the improved algorithms is that their performance
improvement is conditional. They prove to be effective
only under specific prerequisites:
(1). F(w) and f(w) are nonlinearly continuous and differentiable. Here f(w) is a neural
network model, w is a parameter of the neural network model, and F(w) is the cost
function of the neural network. In Refs. [12–15], the global optimization ability of
the proposed algorithms was verified for nonlinearly continuous and differentiable F(w) and
f(w). However, the activation functions in neural networks are not
necessarily continuous and differentiable, so the above references are only valid in
their own setting and do not prove that global optimization can be achieved in
the neural network model.
(2). J = ∂F(w)/∂w. J is a Jacobian matrix. The Jacobian matrix in Refs. [16–18] was obtained
by directly solving the system of nonlinear equations. The calculation results were
relatively accurate, but the calculation process was very complex. In the neural
networks, the Jacobian matrix participating in the proof process was obtained by
back-propagation rather than by the derivation of the objective function. Therefore,
the theory of the above references may be not suitable for the field of neural networks.
Instead, a new adaptive LM algorithm for neural networks is proposed in this paper.
By analyzing the output performance of several activation functions, we explain in detail the
specific factors that cause the original LM algorithm to drive the cost function into bad local
minima. In view of these factors, the new algorithm makes up for the deficiency
of the LM algorithm and trains a network efficiently.
The remaining parts are organized as follows. Section 2 introduces the LM algorithm.
Section 3 proves that the original LM algorithm can fall into local
minima in neural networks. Section 4 gives the optimization of the LM algorithm. Section 5
verifies the theory through experiments, and the conclusions are drawn in
the last section.
Definition 1. Given a neural network model f(w), the cost function is the least squares problem:
where w represents neural network parameters and ylabel represents label data.
Definition 3. There is a method to evaluate the accuracy of model L, called the gain ratio q:
$$ q = \frac{F(w) - F(w + h)}{L(0) - L(h)} \tag{3} $$
$$ h = -\left(J^{T} J + \mu I\right)^{-1} J r \tag{4} $$
where J is the Jacobian matrix, r is the cost function error, and μ is the damping parameter.
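The gain ratio can be made concrete with a small numerical sketch. The code below is an illustration under stated assumptions rather than the authors' implementation: it uses a one-parameter network with a sigmoid activation, a scalar weight w and bias b, reads the product in Equation (4) as g = Jᵀr, and takes L(h) as the quadratic model F(w) + h·g + ½·h²·JᵀJ. All function names are hypothetical.

```python
import math

def sigmoid(z):
    return 1.0 / (1.0 + math.exp(-z))

def residuals(w, b, xs, ys):
    # One residual per sample: sigma(w*x + b) - y_label
    return [sigmoid(w * x + b) - y for x, y in zip(xs, ys)]

def jacobian(w, b, xs):
    # d/dw sigma(w*x + b) = s*(1 - s)*x for the sigmoid
    rows = []
    for x in xs:
        s = sigmoid(w * x + b)
        rows.append(s * (1.0 - s) * x)
    return rows

def cost(r):
    # Least squares cost F = 1/2 * sum of squared residuals
    return 0.5 * sum(ri * ri for ri in r)

def lm_step_and_gain(w, b, xs, ys, mu):
    """Return the damped step h and the gain ratio q for a scalar weight."""
    r = residuals(w, b, xs, ys)
    J = jacobian(w, b, xs)
    JtJ = sum(j * j for j in J)
    g = sum(j * ri for j, ri in zip(J, r))        # g = J^T r (assumed reading)
    h = -g / (JtJ + mu)                           # scalar form of the LM step
    predicted = -h * g - 0.5 * h * h * JtJ        # L(0) - L(h)
    actual = cost(r) - cost(residuals(w + h, b, xs, ys))  # F(w) - F(w + h)
    q = actual / predicted if predicted > 0 else float("nan")
    return h, q
```

A value of q close to 1 indicates that the quadratic model L predicts the actual decrease well, while q < 0 means that the step increased the cost.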
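$$
\begin{aligned}
L(0) - L(h) &= -h^{T} J^{T} f - \tfrac{1}{2} h^{T} J^{T} J h \\
&= -\tfrac{1}{2} h^{T} \left[ 2g + \left( J^{T} J + \mu I - \mu I \right) h \right] \\
&= \tfrac{1}{2} h^{T} (\mu h - g), \qquad \text{where } g = J^{T} f \\
h^{T} h > 0, \quad -h^{T} g &= -h^{T} \left( J^{T} f \right)
= -\left[ -\left( J^{T} J + \mu I \right)^{-1} g \right]^{T} g \\
&= g^{T} \left[ \left( J^{T} J + \mu I \right)^{-1} \right]^{T} g > 0 \\
L(0) - L(h) &= \tfrac{1}{2} h^{T} (\mu h - g) > 0
\end{aligned}
\tag{5}
$$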
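The step from the bracketed expression to ½hᵀ(μh − g) in Equation (5) is not spelled out. A short worked version follows, assuming that h satisfies (JᵀJ + μI)h = −g with g = Jᵀf, which is a reading of Equation (4) and not something stated explicitly in the text:

```latex
\begin{aligned}
\left(J^{T}J + \mu I\right)h = -g
&\;\Longrightarrow\; \left(J^{T}J + \mu I - \mu I\right)h = -g - \mu h \\
&\;\Longrightarrow\; -\tfrac{1}{2}h^{T}\left[2g + \left(-g - \mu h\right)\right]
 = -\tfrac{1}{2}h^{T}\left(g - \mu h\right)
 = \tfrac{1}{2}h^{T}\left(\mu h - g\right).
\end{aligned}
```

Under this reading both μhᵀh and −hᵀg are positive for μ > 0 and g ≠ 0, so the predicted decrease L(0) − L(h) is positive.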
parameters are adjusted, we can prove that LM neural networks may diverge or converge
to poor local minima.
The proof has three steps. First, given the inequality f(w) + J(w) < r(w²),
the conditions satisfying the inequality are analyzed. Second, we prove that “∃f(w), when
f(w) + J(w) < r(w²), in Equation (3), the gain ratio q < 0”, which yields the conditions
leading to q < 0. Third, based on the above analysis, we discuss the conditions under which
LM neural networks may diverge or converge to a worse local minimum.
Definition 5. According to the structure of the neural network, the residual function is as follows:
$$ f(w) = \sigma(wx + b) - y_{label} \tag{6} $$
where σ is an activation function, x is the input of the neural network, the weight of the neural
network is w, and the bias is b.
Definition 6. The first-order Taylor expansion for the output function f(w + h) of the neural
network at h is:
$$ f(w + h) = f(w) + J(w)h + r(w^{2}) \tag{7} $$
where r(w²) is the remainder.
Lemma 1. ∃{w,b,h}, when the absolute values of training data x and label data y of the neural
network are large enough, f(w) + J(w) < r(w²).
Thus, to prove f(w) + J(w) < r(w²), it suffices to prove Equation (9). By Equation
(6), we obtain:
$$ f(w + h) = \sigma(wx + hx + b) - y_{label}, \qquad J(w) = \sigma'(wx + b) \tag{10} $$
Substituting Equation (6) and Equation (10) into Equation (9) gives:
$$ w' = wx + b, \qquad h' = hx \tag{14} $$
Then:
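$$
\begin{aligned}
\sigma(w') &= \sigma(wx + b) \\
\sigma(w' + h') &= \sigma(wx + hx + b) \\
\sigma'(w')h' &= \sigma'(wx + b)\,\frac{\partial w'}{\partial w}\,h = \sigma'(wx + b)\,x\,h = \sigma'(wx + b)\,h' \\
G(w', h') &= \sigma(w' + h') - 2\sigma(w') - 2\sigma'(w')h' + y_{label} \\
&= \sigma(wx + hx + b) - 2\sigma(wx + b) - 2\sigma'(wx + b)h' + y_{label} = G(w, h)
\end{aligned}
\tag{15}
$$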
The frequently used activation functions include the sigmoid function, tanh function,
ReLU function, and PReLU function. Discussion:
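(1). When σ is sigmoid:
$$
\begin{aligned}
\sigma(w' + h') &= \frac{1}{1 + e^{-w' - h'}} \\
\sigma(w') &= \frac{1}{1 + e^{-w'}} \\
\sigma'(w') &= \frac{e^{-w'}}{1 + e^{-w'}}
\end{aligned}
\tag{16}
$$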
Figure 1. Regardless of the value of w′, the g(w′,h′) curve monotonically decreases and intersects the
x-axis at the point (0.5, 0). This means that when the conditions (σ = sigmoid, h′ ≤ 0.5, and ylabel > 0)
are met, the algorithm has the possibility of divergence.
From Figure 1, it can be seen that the g(w′,h′) curve is monotonically decreasing and
crosses zero at the point (h′ = 0.5, g(w′,h′) = 0); when h′ ≤ 0.5, g(w′,h′) ≥ 0. That is:
$$ G(w', h') = \frac{2}{1 + e^{2(w' + h')}} - \frac{4}{1 + e^{2w'}} - \frac{4h'}{2 + e^{2w'} + e^{-2w'}} - 1 + y_{label} \tag{21} $$
Because the value of ylabel is arbitrary, only the first three terms of the function G(w′,h′)
are considered, and they are expressed by g(w′,h′):
The graph of g(w′,h′) in Equation (21), drawn with OCTAVE, is shown in Figure 2:
Figure 2. When σ = tanh, the curve g(w′,h′) monotonically decreases: (a) when h′ < 0.3 and w′ < 0,
lim (w′→∞) g(w′,h′) = −3; (b) when h′ < −0.3 and w′ > 0.5, lim (w′→∞) g(w′,h′) = −1. When the
conditions (σ = tanh, h′ < 0.3, w′ < 0, and ylabel > 3) are met, the algorithm has the possibility
of divergence.
In Figure 2, the curve g(w′,h′) is also monotonically decreasing. When h′ < 0.3,
g(w′,h′) ≥ −3, and so:
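$$
\begin{aligned}
\sigma(w' + h') &= \max(0, w' + h') \\
\sigma(w') &= \max(0, w') \\
\sigma'(w') &= \max(0, 1)
\end{aligned}
\tag{24}
$$
then, G(w′,h′) is:
$$ G(w', h') = \max\left(0, w' + h'\right) - \max\left(0, 2w'\right) - \max(0, 2h') + y_{label} \tag{25} $$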
Because the value of ylabel is arbitrary, only the first three terms of the function
G(w′,h′) are considered, and they are expressed by g(w′,h′). The graph of g(w′,h′) is shown in
Figure 3:
Figure 3. The curve g(w′,h′) at σ = ReLU: (a) w′ > 0; (b) w′ < 0. When the conditions (σ = ReLU,
h′ < 0.3, w′ < 0, and ylabel > 0) are met, the algorithm has the possibility of divergence.
Figure 4. The curve g(w′,h′) at σ = PReLU: (a) w′ > 0; (b) w′ < 0. When the conditions (σ = PReLU,
h′ < 0, w′ < 0, and ylabel > 0) are met, the algorithm has the possibility of divergence.
Lemma 2. ∃ f(w), when f(w) + J(w) < r(w²), in Equation (3), the gain ratio q < 0.
Proof. The cost function is F(w + h) = ½‖f(w + h)‖². The first-order Taylor expansion for
F(w + h) at w is:
$$ F(w + h) = F(w) + h^{T} J^{T} f + \tfrac{1}{2} h^{T} J^{T} J h + R(w) = L(h) + R(w) \tag{28} $$
As f(w) + J(w) < r(w²), the r(w²) term of the residual function f(w + h) after Taylor’s first-
order expansion cannot be omitted. Therefore, there exists f(w) that makes it impossible to
omit the R(w) term of Taylor’s first-order expansion for F(w + h). This means: L(w) < R(w).
Lemma 3. In the case of Lemma 1 and Lemma 2, the LM algorithm causes cost functions to fall
into “bad” local minima.
Proof. In the kth iteration, we have qk < 0 in the case of Lemma 2. According to the
original LM algorithm, when qk < 0 the trust region is reduced, that is:
$$
\begin{aligned}
v &= v \times 2 \\
\mu &= \mu \times v \\
h &= -\left( J^{T} J + \mu I \right)^{-1} J r
\end{aligned}
\tag{31}
$$
At this point, the absolute value of step h decreases, and its sign remains unchanged.
There are situations in which q is negative for every h on the positive or on the
negative half-axis; h then approaches zero indefinitely and the cost function cannot decrease.
In that case:
$$ q < 0, \quad v \to +\infty, \quad h \to 0 \tag{32} $$
According to Equation (14), h′ = hx, we have:
$$ q < 0, \quad v \to +\infty, \quad \frac{h'}{x} \to 0 \tag{33} $$
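The stalling behaviour described by Equations (31) to (33) can be reproduced numerically. The sketch below is a hypothetical illustration for a scalar weight: it assumes starting values μ = 1 and v = 2 (neither is given in the text), keeps the gradient g and the curvature JᵀJ fixed, and pretends that every step is rejected (q < 0), so that only the damping update acts.

```python
def rejected_steps(g, JtJ, mu=1.0, v=2.0, n=6):
    """Step sizes produced by n consecutive rejected iterations (q < 0).

    g and JtJ are the scalar gradient and curvature at the current weight;
    the weight does not move because each step is rejected.
    """
    steps = []
    for _ in range(n):
        v *= 2.0                         # v = v x 2
        mu *= v                          # mu = mu x v
        steps.append(-g / (JtJ + mu))    # scalar form of the damped step
    return steps
```

With `rejected_steps(1.0, 0.5)` every step keeps the same sign while its magnitude shrinks very quickly, which is the situation h → 0 with no decrease in the cost function.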
We analyze the common activation functions one by one to find out whether this
situation can occur:
For the activation function σ = sigmoid, when ylabel > 1, the area covered by the
function is shown in Figure 5a. It can be seen that g(w′,h′) remains positive when h′ is
at any point on the negative half of the x-axis. In the case of Lemma 1 and Lemma 2,
“g(w′,h′) remains positive” means “q remains negative.” According to Equation
(14), h′ = hx, so when ylabel > 1 and hx < 0, q remains negative, the LM algorithm will
stop running, and the LM neural network will diverge or converge poorly.
Similarly, for the activation function σ = tanh, when w′ ≤ 0 and ylabel > 3, the area
covered by the function is shown in Figure 5b. When w′ > 0 and ylabel > 1, the area covered
by the function is shown in Figure 5c. Thus, when ylabel > 3, x > 0, and hx < 0, the LM
neural network will diverge or converge poorly.
Similarly, for the activation function σ = ReLU, when w′ < 0 and ylabel > 0, the area
covered by the function is shown in Figure 5d. When w′ < 0, ylabel > 0, x > 0, and hx < 0, the
LM neural network will diverge or converge poorly.
Finally, for the activation function σ = PReLU, when w′ < 0 and ylabel > 0, the area
covered by the function is shown in Figure 5e. When ylabel > 0 and w′ < 0, the LM neural
network will diverge or converge poorly.
Figure 5. The gray part is the coverage area of g(w′,h′): (a) The gray area covered by the g(w′,h′)
function when σ = sigmoid and ylabel > 1. If h′ ≤ 0.5, g(w′,h′) is always positive; (b) The gray area
covered by the g(w′,h′) function when σ = tanh, ylabel > 3, and w′ ≤ 0. If h′ ≤ 0.3, g(w′,h′) is always
positive; (c) The gray area covered by the g(w′,h′) function when σ = tanh, ylabel > 1, and w′ > 0.
If h′ ≤ 0, g(w′,h′) is always positive; (d) The gray area covered by the g(w′,h′) function when
σ = ReLU, ylabel > 0, and w′ < 0. If h′ ≤ 0, g(w′,h′) is always positive; (e) The gray area covered by
the g(w′,h′) function when σ = PReLU, ylabel > 0, and w′ < 0. g(w′,h′) is always positive.
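$$
\begin{aligned}
A &= J^{T} J \\
g &= J^{T} e \\
h_{k} &= -\frac{1}{1 + e^{k - 10}} \left( A + \mu I \right)^{-1} g - \frac{1}{1 + e^{10 - k}} \frac{g}{\lVert g \rVert}
\end{aligned}
\tag{34}
$$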
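Equation (34) blends two directions with weights that depend only on the iteration number k. The sketch below shows one possible reading, with two loudly stated assumptions: the last fraction is taken to be the normalised gradient g/‖g‖ (the norm bars are not legible in this copy), and the LM direction (A + μI)⁻¹g is assumed to be computed elsewhere by a linear solver and passed in as `lm_dir`. The function names and the `k_switch` parameter are hypothetical.

```python
import math

def adalm_weights(k, k_switch=10):
    """Blend weights at iteration k: (weight of LM direction, weight of gradient)."""
    w_lm = 1.0 / (1.0 + math.exp(k - k_switch))
    w_gd = 1.0 / (1.0 + math.exp(k_switch - k))
    return w_lm, w_gd

def adalm_step(k, lm_dir, g):
    """h_k = -w_lm * lm_dir - w_gd * g / ||g||  (assumed reading of Equation (34)).

    lm_dir is the precomputed vector (A + mu*I)^-1 g; g is the gradient J^T e.
    """
    w_lm, w_gd = adalm_weights(k)
    norm = math.sqrt(sum(gi * gi for gi in g))
    if norm == 0.0:
        return [0.0 for _ in g]  # stationary point: no step
    return [-w_lm * d - w_gd * gi / norm for d, gi in zip(lm_dir, g)]
```

Under this reading the two weights always sum to 1: early iterations (k well below 10) are dominated by the LM direction, and later iterations by a unit-length gradient step.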
5. Case Studies
5.1. Experimental Preparation
To evaluate the performance of AdaLM, it was compared with
the original LM algorithm and with other popular LM variants, including the high-order
Levenberg–Marquardt (HLM) and three-step Levenberg–Marquardt (TSLM) algorithms. Each
algorithm is described in Table 1. In this paper, each algorithm was combined with a neural
network to build a prediction model, and the performance of the algorithm was
assessed by testing the prediction ability of the model.
Table 1. Description table of other LM family algorithms participating in the comparative test.
As for the neural network model, it has been proven that the three-layer neural
network can simulate any complex nonlinear mapping if there are enough neurons in the
hidden layer [21–23]. Therefore, the neural network models involved in this paper were
all three-layer neural networks. We found that all the models involved in this paper will
converge before 50 steps of iteration. Thus, we set the maximum number of iterations of
the model to 50. In this study, the mean absolute error (MAE) was used to give the accuracy
of the experimental results. MAE is the average value of absolute error, which can reflect
well the actual situation of the predicted value error. Its formula is as follows:
$$ \mathrm{MAE}(f(x), y) = \frac{1}{n} \sum_{i=1}^{n} \left| f(x_{i}) - y_{i} \right| \tag{38} $$
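As a minimal sketch, Equation (38) can be computed as follows. The function name `mae` and the input validation are illustrative choices, not part of the paper.

```python
def mae(predictions, labels):
    """Mean absolute error between predicted values f(x_i) and labels y_i."""
    if len(predictions) != len(labels) or not labels:
        raise ValueError("inputs must be non-empty and of equal length")
    return sum(abs(p - y) for p, y in zip(predictions, labels)) / len(labels)
```

For example, predictions [1, 2, 3] against labels [1, 4, 0] give absolute errors 0, 2, and 3, so the MAE is 5/3.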
The number of nodes in the model affects the quality of the neural network. The input
and output nodes of the model need to be set according to the requirements of the task.
The number of hidden nodes of the model can be calculated according to the empirical
formula given in [24]:
$$ m = \sqrt{n \times l} \tag{39} $$
where m is the number of hidden layer nodes, n is the number of input layer nodes, and l is
the number of output layer nodes.
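Equation (39) generally yields a non-integer value, and the text does not say how it is converted to a node count. The sketch below assumes rounding to the nearest integer; both that rule and the name `hidden_nodes` are assumptions.

```python
import math

def hidden_nodes(n_inputs, n_outputs):
    """Empirical hidden-layer size m = sqrt(n * l), rounded to the nearest integer
    (the rounding rule is an assumption)."""
    return round(math.sqrt(n_inputs * n_outputs))
```

With one output node, `hidden_nodes(9, 1)` gives 3 hidden nodes.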
The neural network model is shown in Figure 6. The model uses the square difference
formula to calculate the error between the prediction data and the label data.
To test the engineering applicability of each algorithm, a real prediction
task on aero-engine performance parameters was used. Fuel
flow data of an aero-engine were selected as training data. The fuel flow data contain
considerable noise due to environmental conditions, operating conditions,
flight tasks, maintenance measures, etc. [25]. Figure 7 shows fuel flow data collected from
the CFM56-5B engine, which record the fuel consumption rate of the A320 aircraft in pounds per
hour.
Figure 7 shows that the data were highly nonlinear with many outliers, and that the
data sample size was small. Therefore, this dataset can reveal the robustness
and convergence of the algorithm. The specific implementation steps of the data prediction
task for the fuel flow of the aeroengine are as follows:
Step 1: Obtain fuel flow data of the aeroengine. Set the total amount of data to N.
Use normalization and data smoothing to preprocess the fuel flow data of the aeroengine,
and then obtain the data X = (x1 , x2 , x3 , . . . . . . , xN ). The purpose of normalization is to
avoid numerical problems such as overflow and underflow, reducing the impact of model
initialization, and improving the prediction accuracy. The purpose of data smoothing is to
reduce the influence of noise and outliers on the model. The data smoothing method is
given by Equation (40):
$$ \mathrm{MAE}(f(x), y) = \frac{1}{n} \sum_{i=1}^{n} \left| f(x_{i}) - y_{i} \right| \tag{40} $$
Step 2: Organize data into datasets. In this step, the dataset is generated according
to the number of input layer nodes n of neural network, and the determination method
of n is given in step 5. The fuel flow data are divided into subsequences with length of n
as the input data of the model and the adjacent element of the subsequence as the label
data. Because this task is a one-step forecast, the label data dimension is 1. Input data X
and label data Y are, respectively:
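$$
\begin{aligned}
\{x_{1}, x_{2}, \ldots, x_{n}\} &\to \{x_{n+1}\} \\
\{x_{2}, x_{3}, \ldots, x_{n+1}\} &\to \{x_{n+2}\} \\
&\;\;\vdots \\
\{x_{i}, x_{i+1}, \ldots, x_{i+n-1}\} &\to \{x_{i+n}\} \\
&\;\;\vdots \\
\{x_{N-n}, x_{N-n+1}, \ldots, x_{N-1}\} &\to \{x_{N}\}
\end{aligned}
\tag{41}
$$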
Step 3: Split the dataset {X,Y} into training set {X train , Y train } and test set {X test , Y test }.
According to experience, the first 95% of the data are used as training sets. The rest are the
test set.
Step 4: Build the neural network model and use the training set {X train , Y train } to train
the model. The model uses the square difference formula to calculate the error between the
prediction data and the label data.
Step 5: Determine the number of nodes in the neural network model to make the
model optimal. The enumeration method is used to determine the input node n of the
neural network, with the range of 5−12. In detail, steps 2 to 4 are looped 8 times, and the
input node n of the neural network is set to 5 to 12 in step 2 in each loop. The number
of hidden layer nodes is determined by Equation (39), and the number of output nodes
is 1. The MAE error in Equation (38) is used to describe the prediction accuracy of the
experimental results. The number of input nodes and the corresponding model MAE are
shown in Figure 8.
Figure 8 shows that when the number of input nodes n is 9, the output errors of
LM, HLM, TSLM, and AdaLM all reach their minimum.
Step 6: Test the model. Feed the test set X test into the model to obtain f (X test ).
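Steps 2 and 3 above (building the one-step-ahead dataset of Equation (41) and splitting it 95%/5% in time order) can be sketched as follows. The function names and the truncation used to place the split point are assumptions; normalization and smoothing from Step 1 are not included.

```python
def make_windows(series, n):
    """Sliding windows of length n as inputs, the next element as the label."""
    inputs = [series[i:i + n] for i in range(len(series) - n)]
    labels = [series[i + n] for i in range(len(series) - n)]
    return inputs, labels

def split_dataset(inputs, labels, train_fraction=0.95):
    """Keep the first train_fraction of the samples for training, the rest for testing."""
    cut = int(len(inputs) * train_fraction)  # truncation is an assumed convention
    return (inputs[:cut], labels[:cut]), (inputs[cut:], labels[cut:])
```

A series of N points yields N − n samples, so 20 points with n = 5 give 15 samples, of which 14 go to the training set and 1 to the test set.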
Table 2 shows that the traditional LM model diverges when Tanh, ReLU, and PReLU
are used as activation functions, which proves that the LM problem exists. HLM is a higher-
order descent method. The HLM model could not converge when the activation functions
were Tanh and PReLU, which indicates that over-high-order descent may produce negative
effects. The TSLM model is an improved version of HLM. In the test, TSLM converged
with four activation functions. The AdaLM algorithm proposed in this paper could change
the descent direction in time and has good stability, so it achieved convergence.
Figure 9. The prediction effect of each algorithm: (a) LM; (b) HLM; (c) TSLM; (d) AdaLM.
In order to quantify the prediction effect of each algorithm, the time consumption in
50 iterations and the prediction MAE of algorithms are shown in Table 3.
From Table 3, the neural network model with the original LM algorithm was trained
in 140 s. The data predicted by the model after training were basically consistent with the
original data, with an error of 41.4847. This shows that the original LM algorithm could
basically make the neural network predict the fuel consumption data of the aeroengine,
but the training was slow because the algorithm fell into a “bad” local minimum.
The neural network model with HLM algorithm completed the training in 181 s. The
data predicted by the model after training fit the original data with an error of 38.2276. The
HLM algorithm took the longest time to train in the prediction of the fuel consumption data
of an aeroengine. It added unnecessary complexity and did not
improve the performance.
The neural network model of the TSLM algorithm was trained in 173 s. The data
predicted by the trained model fit the original data with an error of 42.6715. The TSLM
algorithm had the highest error in this task, and the complexity of the algorithm affects the
prediction performance.
The neural network model with the AdaLM algorithm proposed in this paper was
trained in 62 s, and the data predicted by the trained model fit the original data with an
error of only 32.9175, which indicates that the AdaLM algorithm combined high
accuracy with high speed in predicting the fuel consumption data of the aeroengine, with
the least time consumption and the lowest error.
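The comparison above can be made more concrete by expressing each algorithm's training time and MAE relative to the original LM algorithm. The short calculation below uses only the four time and error values quoted in the text; the dictionary layout and variable names are illustrative.

```python
# (training time in s, prediction MAE) as reported in the text for Table 3
results = {
    "LM": (140.0, 41.4847),
    "HLM": (181.0, 38.2276),
    "TSLM": (173.0, 42.6715),
    "AdaLM": (62.0, 32.9175),
}

base_time, base_mae = results["LM"]

# Relative reduction with respect to LM; a negative value means an increase.
relative = {
    name: (1.0 - t / base_time, 1.0 - m / base_mae)
    for name, (t, m) in results.items()
}

for name, (time_cut, mae_cut) in relative.items():
    print(f"{name:6s} time reduction {time_cut:+.1%}  MAE reduction {mae_cut:+.1%}")
```

With these figures, AdaLM is the only variant that reduces both quantities relative to LM.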
6. Conclusions
This paper pointed out that LM neural networks may converge poorly or diverge
when the training data are large and the weights are not selected properly. This finding can
guide researchers who still wish to use this model toward appropriate strategies, such as
preprocessing the training data and intervening in the weights at the beginning of training,
to avoid problems with LM neural networks. Furthermore, this work
can expand the application scope of LM neural networks and allow them to
perform better in more situations.
This study proposed a new solution to the problem of LM neural networks: the
AdaLM algorithm. The algorithm adds the weight term and error direction on the original
algorithm, which makes the algorithm close to the steepest descent method at the beginning
to preliminarily adjust the weight of the neural network. Then, the AdaLM algorithm
gradually approaches the LM algorithm, which improves the stability based on inheriting
the efficient optimization ability of LM. This paper compared the performance of traditional
LM, HLM, TSLM, and AdaLM algorithms. We found that the performance improvement
of the higher-order algorithm was very limited. Among many high-order algorithms, the
third-order algorithm had the better performance. The AdaLM algorithm could not only
make the neural network converge to a “good” local minimum, but also greatly shorten
the operation time without losing the prediction accuracy.
The application of the AdaLM algorithm focused on a small sample and highly non-
linear engineering data. It was especially effective for the analysis and prediction of engine
performance parameters in the aviation field. In future research, we will collect engine
data from more dimensions to comprehensively evaluate AdaLM’s ability of analysis. The
application scenarios of AdaLM will also be further explored.
Author Contributions: Writing—original draft preparation, Z.Y., S.Z., Z.C.; writing—review and
editing, L.L., Z.C.; funding acquisition, Z.C. All authors have read and agreed to the published version
of the manuscript.
Funding: This research was funded by the National Natural Science Foundation of China, grant
number U1733201.
Institutional Review Board Statement: Not applicable.
Informed Consent Statement: Not applicable.
Data Availability Statement: Not applicable.
Conflicts of Interest: The authors declare no conflict of interest.
References
1. Luo, G.; Zou, L.; Wang, Z.; Lv, C.; Ou, J.; Huang, Y. A novel kinematic parameters calibration method for industrial robot based
on Levenberg-Marquardt and Differential Evolution hybrid algorithm. Robot. Comput. Integr. Manuf. 2021, 71, 102165. [CrossRef]
2. Kumar, S.S.; Sowmya, R.; Shankar, B.M.; Lingaraj, N.; Sivakumar, S. Analysis of Connected Word Recognition systems using
Levenberg Marquardt Algorithm for cockpit control in unmanned aircrafts. Mater. Today Proc. 2021, 37, 1813–1819. [CrossRef]
3. Mahmoudabadi, Z.S.; Rashidi, A.; Yousefi, M. Synthesis of 2D-Porous MoS2 as a Nanocatalyst for Oxidative Desulfurization of
Sour Gas Condensate: Process Parameters Optimization Based on the Levenberg–Marquardt Algorithm. J. Environ. Chem. Eng.
2021, 9, 105200. [CrossRef]
4. Transtrum, M.K.; Machta, B.B.; Sethna, J.P. Why are nonlinear fits to data so challenging? Phys. Rev. Lett. 2010, 104, 060201.
[CrossRef]
5. Amini, K.; Rostami, F. A modified two steps Levenberg–Marquardt method for nonlinear equations. J. Comput. Appl. Math. 2015,
288, 341–350. [CrossRef]
6. Kim, M.; Cha, J.; Lee, E.; Pham, V.H.; Lee, S.; Theera-Umpon, N. Simplified Neural Network Model Design with Sensitivity
Analysis and Electricity Consumption Prediction in a Commercial Building. Energies 2019, 12, 1201. [CrossRef]
7. Zhao, L.; Otoo, C.O.A. Stability and Complexity of a Novel Three-Dimensional Environmental Quality Dynamic Evolution
System. Complexity 2019, 2019, 3941920. [CrossRef]
8. Zhou, W.; Liu, D.; Hong, T. Application of GA-LM-BP Neural Network in Fault Prediction of Drying Furnace Equipment. Matec.
Web Conf. 2018, 232, 01041. [CrossRef]
9. Jia, P.; Zhang, P. Type Identification of Coal Mining Face Based on Wavelet Packet Decomposition and LM-BP. In Proceedings
of the 2018 IEEE 9th International Conference on Software Engineering and Service Science (ICSESS), Beijing, China, 23–25
November 2018.
10. Hua, L.; Bo, L.; Tong, L.; Wang, M.; Fu, H.; Guo, R. Angular Acceleration Sensor Fault Diagnosis Based on LM-BP Neural
Network. In Proceedings of the 37th Chinese Control Conference, Wuhan, China, 25–27 July 2018; pp. 6028–6032.
11. Hossein, A.M.; Nazari, M.A.; Madah, R.G.H.; BehshadShafii, M.; Ahmadi, M.A. Thermal conductivity ratio prediction of
Al2O3/water nanofluid by applying connectionist methods. Colloids Surf. A Physicochem. Eng. Asp. 2018, 541, 154–164.
12. Yang, X. A higher-order Levenberg–Marquardt method for nonlinear equations. Appl. Math. Comput. 2013, 219, 10682–10694.
[CrossRef]
13. Chen, L. A high-order modified Levenberg–Marquardt method for systems of nonlinear equations with fourth-order convergence.
Appl. Math. Comput. 2016, 285, 79–93. [CrossRef]
14. Derakhshandeh, S.Y.; Pourbagher, R.; Kargar, A. A novel fuzzy logic Levenberg-Marquardt method to solve the ill-conditioned
power flow problem. Int. J. Electr. Power Energy Syst. 2018, 99, 299–308. [CrossRef]
15. Qiao, J.; Wang, L.; Yang, C.; Gu, K. Adaptive levenberg-marquardt algorithm based echo state network for chaotic time series
prediction. IEEE Access 2018, 6, 10720–10732. [CrossRef]
16. Ma, C.; Tang, J. The quadratic convergence of a smoothing Levenberg–Marquardt method for nonlinear complementarity problem.
Appl. Math. Comput. 2008, 197, 566–581. [CrossRef]
17. Du, S.Q.; Gao, Y. Global convergence property of modified Levenberg-Marquardt methods for nonsmooth equations. Appl. Math.
2011, 56, 481. [CrossRef]
18. Zhou, W. On the convergence of the modified Levenberg–Marquardt method with a non-monotone second order Armijo type
line search. J. Comput. Appl. Math. 2013, 239, 152–161. [CrossRef]
19. Moré, J.J. The Levenberg-Marquardt algorithm: Implementation and theory. In Numerical Analysis; Springer: Berlin/Heidelberg,
Germany, 1978; pp. 105–116.
20. Madsen, K.; Nielsen, H.B.; Tingleff, O. Methods for Non-Linear Least Squares Problems, 2nd ed.; Technical University of Denmark:
Lyngby, Denmark, 2004.
21. Zhang, Z.; Ma, X.; Yang, Y. Bounds on the number of hidden neurons in three-layer binary neural networks. Neural Netw. 2003,
16, 995–1002. [CrossRef]
22. Liang, X.; Chen, R.C. A unified mathematical form for removing neurons based on orthogonal projection and crosswise
propagation. Neural Comput. Appl. 2010, 19, 445–457. [CrossRef]
23. Chua, C.G.; Goh, A.T.C. A hybrid Bayesian back-propagation neural network approach to multivariate modelling. Int. J. Numer.
Anal. Methods Geomech. 2003, 27, 651–667. [CrossRef]
24. Sequin, C.H.; Clay, R.D. Fault tolerance in artificial neural networks. In Proceedings of the 1990 IJCNN International Joint
Conference on Neural Networks, San Diego, CA, USA, 17–21 June 1990; pp. 703–708.
25. Li, Y.G.; Nilkitsaranont, P. Gas turbine performance prognostic for condition-based maintenance. Appl. Energy 2009, 86, 2152–2161.
[CrossRef]
mathematics
Article
Three-Legged Compliant Parallel Mechanisms: Fundamental
Design Criteria to Achieve Fully Decoupled Motion
Characteristics and a State-of-the-Art Review
Minh Tuan Pham 1, *, Song Huat Yeo 2 and Tat Joo Teo 3
Abstract: A three-legged compliant parallel mechanism (3L-CPM) achieves fully decoupled motions
when its theoretical 6 × 6 stiffness/compliance matrix is a diagonal matrix, which only contains
diagonal components, while all non-diagonal components are zeros. Because the motion decoupling
capability of 3L-CPMs is essential in the precision engineering field, this paper presents the fundamental
criteria for designing 3L-CPMs with fully decoupled motions, regardless of degrees-of-freedom
and the types of flexure element. The 6 × 6 stiffness matrix of a general 3L-CPM is derived based on
the orientation of each flexure element, e.g., thin/slender beam and notch hinge, etc., and its relative
position to the moving platform. Based on an analytical solution, several requirements for the flexure
elements were identified and needed to be satisfied in order to design a 3L-CPM with a diagonal
stiffness/compliance matrix. In addition, the developed design criteria were used to analyze the
decoupled-motion capability of some existing 3L-CPM designs and shown to provide insight into the
motion characteristics of any 3L-CPM.
Keywords: three-legged parallel mechanism; compliant mechanism; flexure-based mechanism;
flexure; compliant joint; decoupled motion; coupled motion; stiffness; compliance
MSC: 70-10
Citation: Pham, M.T.; Yeo, S.H.; Teo, T.J. Three-Legged Compliant Parallel Mechanisms: Fundamental
Design Criteria to Achieve Fully Decoupled Motion Characteristics and a State-of-the-Art Review.
Mathematics 2022, 10, 1414. https://doi.org/10.3390/math10091414
Academic Editors: Higinio Rubio Alonso, Alejandro Bustos Caballero, Jesus Meneses Alonso and
Enrique Soriano-Heras
positioning stages, due to its closed-loop and compact architecture. As a result, numerous
three-legged compliant parallel mechanisms (3L-CPMs) with different numbers of degrees-
of-freedom (DOF) have been developed over the past two decades, e.g., the 3-DOF in-plane
motions (X-Y-θ Z ) 3L-CPMs [18–20,22,27,29,35,53,55], the 3-DOF out-of-plane motions (X-Y-
Z) [24,25] and (θ X -θ Y -Z) [5–7] 3L-CPMs, and the 6-DOF motions 3L-CPMs [28,31,42,43,56].
These 3L-CPMs all adopted a three-legged parallel kinematic configuration as a base archi-
tecture. Due to their popularity, this paper focuses on the 3L-CPM with detailed studies on
its motion decoupling capability.
In general, a 3L-CPM consists of three legs, wherein each leg is formed by either one
or a series of flexure elements, e.g., thin/slender beams or notch hinges, etc., connected
together. Hence, each leg is partially compliant if a rigid link is used to connect two
flexure elements, or fully compliant if there is no rigid link between two flexure
elements. Depending on the structure of the leg, 3L-CPMs can be classified into two types,
i.e., single flexure serial chain and double reflecting flexure serial chains in a leg. For
a 3L-CPM or any compliant mechanism, DOF represents the number of possible output
motions that the moving platform can deliver, i.e., three translation motions along and three
rotation motions about the respective X, Y, and Z axes. In an ideal case, the output motions
have to be fully decoupled, i.e., delivering the desired DOF in the actuating directions
and without any parasitic motion in the non-actuating directions. Based on Hooke’s Law,
the motion property (coupled or decoupled) of a compliant mechanism is governed by a
6 × 6 stiffness matrix where the diagonal components represent the stiffness characteristics
of all six possible actuation directions, while the non-diagonal components are responsible
for the off-axes (or non-actuating) stiffness characteristics.
Past works in the literature have shown that the motion properties of many existing 3L-
CPMs were generally neglected [18–22,24,25,27–29,31,33,35,41–43,54]. Based on the derived
6 × 6 stiffness/compliance matrices, only a few recent 3L-CPMs demonstrated decoupled
motions [7,52,55,56], while most 3L-CPMs could only deliver coupled motions [5,6]. The
main reason is that the existing 3L-CPMs were synthesized with the aim of achieving the
desired DOF. As a result, they were able to deliver the motions in the desired actuating
directions, but they also produced undesired parasitic motions in the non-actuating di-
rections. More recent efforts have mainly focused on synthesizing 3L-CPMs with a high
ratio between the non-actuating stiffness and actuating stiffness [6,7,53,55], so as to keep
the undesired parasitic motions to a very small percentage as compared to the actuating
motions. In addition, several design criteria for achieving 3L-CPMs with decoupled motion
capability were recently presented in Reference [7]. Such criteria were obtained by substitut-
ing a number of discrete parameters (orientation and position) of flexure elements into the
mathematical model of a 3L-CPM used in a specific structural optimization method [7,56].
Thus, these criteria are not general and cannot be applied in different design methods.
Because motion decoupling is an important performance indicator for any positioning
system, the criteria that can be used to design decoupled-motion 3L-CPMs regardless of
design method and DOF are essential. This paper presents the fundamentals for designing
any 3L-CPMs with fully decoupled motion characteristic. This includes several design
criteria that need to be fulfilled in order to completely eliminate parasitic motions. The
findings that arise from this work suggest that parametric features of flexure elements,
such as the orientation and relative position to the end effector, will have a direct impact
on the performance of any 3L-CPM in terms of the DOF, the constrained motions, and the
parasitic motions.
The remainder of this paper is organized as follows: Section 2 describes the stiffness
modeling of a typical 3L-CPM and the stiffness property of each leg. The criteria of flexure
elements to design a 3L-CPM with fully decoupled motion are presented in Section 3, and
a special case of 3L-CPMs having two reflecting flexure chains in a leg is discussed in
Section 4. Section 5 presents a review on the decoupled-motion capability of an existing
3L-CPM, and Section 6 provides discussions about the findings in this work. Lastly, some
conclusions are offered in Section 7.
The stiffness property of a 3L-CPM is governed by the stiffness of the legs and the
moving stage (end-effector). The stiffness matrix of the leg along the Y axis is represented
by Kl with respect to (w.r.t.) the local frame, i.e., at point E, as shown in Figure 1. With D as
the vector that represents the distance between the local frame of the leg and the global
frame of the 3L-CPM, the stiffness matrix of the entire 3L-CPM, Km , is expressed as follows:
$$K^m = \sum_{i=1}^{3} J_i^l R_i^l K^l \left(R_i^l\right)^{-1} \left(J_i^l\right)^T \tag{1}$$
where i = 1, 2, and 3 denotes the three legs in the CPM, as illustrated in Figure 1; $J_i^l$ is the
translation matrix from the local frame of the ith leg to the global frame; and $R_i^l$ is the
rotation matrix about the Z axis of the ith leg. As the three legs are symmetrical and 120° apart,
$R_i^l$ and $J_i^l$ are written as follows:
$$R_i^l = \begin{bmatrix} R_z(\theta_i) & 0 \\ 0 & R_z(\theta_i) \end{bmatrix} \quad \text{where} \quad R_z(\theta_i) = \begin{bmatrix} \cos\theta_i & -\sin\theta_i & 0 \\ \sin\theta_i & \cos\theta_i & 0 \\ 0 & 0 & 1 \end{bmatrix} \tag{2}$$
$$J_i^l = \begin{bmatrix} I & 0 \\ T_i^l & I \end{bmatrix} \quad \text{where} \quad T_i^l = \begin{bmatrix} 0 & D_{zi} & -D_{yi} \\ -D_{zi} & 0 & D_{xi} \\ D_{yi} & -D_{xi} & 0 \end{bmatrix} \tag{3}$$
In Equation (2), the values of $\theta_1$, $\theta_2$, and $\theta_3$ are 0°, 120°, and 240°, respectively. In
Equation (3), $D_{xi}$, $D_{yi}$, and $D_{zi}$ are the three components of $D_i$ and represent the projections of
the distance from each local frame to the global frame onto the X, Y, and Z axes, respectively.
Note that Dzi = 0, since the X Y plane of the local frames lies on the same plane as the XY
plane of the global frame.
Here, the stiffness of a general leg, Kl , is represented as follows:
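$$K^l = \begin{bmatrix}
k^l_{11} & & & & & \\
k^l_{21} & k^l_{22} & & & \text{SYM} & \\
k^l_{31} & k^l_{32} & k^l_{33} & & & \\
k^l_{41} & k^l_{42} & k^l_{43} & k^l_{44} & & \\
k^l_{51} & k^l_{52} & k^l_{53} & k^l_{54} & k^l_{55} & \\
k^l_{61} & k^l_{62} & k^l_{63} & k^l_{64} & k^l_{65} & k^l_{66}
\end{bmatrix} \tag{4}$$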
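With d denoting the size of the end effector, as illustrated in Figure 1, the expressions of the
non-zero components within the stiffness matrix of Equation (5) are expressed as follows:
$$\begin{aligned}
k^m_{11} &= k^m_{22} = \tfrac{3}{2}\left(k^l_{11} + k^l_{22}\right) \\
k^m_{33} &= 3k^l_{33} \\
k^m_{44} &= k^m_{55} = \tfrac{3}{2}\left(d^2 k^l_{33} - 2d\,k^l_{43} + k^l_{44} + k^l_{55}\right) \\
k^m_{66} &= 3\left(d^2 k^l_{11} + 2d\,k^l_{61} + k^l_{66}\right) \\
k^m_{41} &= k^m_{52} = -\tfrac{3}{2}\left(d\,k^l_{31} - k^l_{41} - k^l_{52}\right) \\
k^m_{51} &= -k^m_{42} = \tfrac{3}{2}\left(k^l_{51} + d\,k^l_{32} - k^l_{42}\right) \\
k^m_{63} &= 3\left(d\,k^l_{31} + k^l_{63}\right)
\end{aligned} \tag{6}$$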
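As a numerical cross-check, the sum in Equation (1) can be evaluated for an arbitrary symmetric leg stiffness matrix and compared with the closed-form entries of Equation (6). The sketch below is illustrative only: it assumes that the local frame of the first leg is offset from the global frame by D = (0, d, 0) and that the offsets of the other two legs follow by rotating this vector by 120° and 240°. This choice is an assumption, made because it reproduces Equation (6); the random test matrix and all function names are hypothetical.

```python
import numpy as np

def rot_z(theta):
    """3 x 3 rotation about the Z axis, Rz(theta) in Equation (2)."""
    c, s = np.cos(theta), np.sin(theta)
    return np.array([[c, -s, 0.0], [s, c, 0.0], [0.0, 0.0, 1.0]])

def leg_rotation(theta):
    """6 x 6 block-diagonal rotation matrix of Equation (2)."""
    R = np.zeros((6, 6))
    R[:3, :3] = R[3:, 3:] = rot_z(theta)
    return R

def leg_translation(D):
    """6 x 6 translation matrix of Equation (3) for the offset vector D."""
    Dx, Dy, Dz = D
    J = np.eye(6)
    J[3:, :3] = [[0.0, Dz, -Dy], [-Dz, 0.0, Dx], [Dy, -Dx, 0.0]]
    return J

def mechanism_stiffness(K_leg, d):
    """Equation (1): sum of the three transformed leg stiffness matrices."""
    K_m = np.zeros((6, 6))
    for theta in np.deg2rad([0.0, 120.0, 240.0]):
        D = rot_z(theta) @ np.array([0.0, d, 0.0])  # assumed leg offset
        J, R = leg_translation(D), leg_rotation(theta)
        K_m += J @ R @ K_leg @ R.T @ J.T            # R^{-1} equals R^T
    return K_m

def closed_form(K_leg, d):
    """Entries of Equation (6), keyed by 1-based (row, column)."""
    k = lambda u, v: K_leg[u - 1, v - 1]
    return {
        (1, 1): 1.5 * (k(1, 1) + k(2, 2)),
        (3, 3): 3.0 * k(3, 3),
        (4, 4): 1.5 * (d**2 * k(3, 3) - 2 * d * k(4, 3) + k(4, 4) + k(5, 5)),
        (6, 6): 3.0 * (d**2 * k(1, 1) + 2 * d * k(6, 1) + k(6, 6)),
        (4, 1): -1.5 * (d * k(3, 1) - k(4, 1) - k(5, 2)),
        (5, 1): 1.5 * (k(5, 1) + d * k(3, 2) - k(4, 2)),
        (6, 3): 3.0 * (d * k(3, 1) + k(6, 3)),
    }

rng = np.random.default_rng(1)
A = rng.standard_normal((6, 6))
K_leg = A @ A.T + 6.0 * np.eye(6)   # arbitrary symmetric positive-definite matrix
d = 0.05
K_m = mechanism_stiffness(K_leg, d)
max_error = max(abs(K_m[u - 1, v - 1] - val) for (u, v), val in closed_form(K_leg, d).items())
```

The paired entries of Equation (6) (for example, the 11 and 22 components) can be read off `K_m` in the same way.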
the presented mathematical models can be used to synthesize a 3L-CPM with the aim
of achieving fully decoupled motion capability. This special case is to make those seven
components within Kl be zeros, as shown in Equation (8).
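$$K^l = \begin{bmatrix}
k^l_{11} & & & & & \\
k^l_{21} & k^l_{22} & & & \text{SYM} & \\
0 & 0 & k^l_{33} & & & \\
0 & 0 & k^l_{43} & k^l_{44} & & \\
0 & 0 & k^l_{53} & k^l_{54} & k^l_{55} & \\
k^l_{61} & k^l_{62} & 0 & k^l_{64} & k^l_{65} & k^l_{66}
\end{bmatrix} \tag{8}$$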
In this work, the two popular-choice flexure elements that are used to synthesize the
compliant mechanisms are the beam type and the notch type, as shown in Figure 2. Both
elements have thin features which permit elastic bending in a specific direction.
Figure 2. Original orientation of the flexure elements (solid lines): (a) beam type and (b) notch type,
with the local frames, X”Y”Z”, attached at the free end and the arbitrary orientation of the flexure
elements (dotted-lines) about these local frames.
Assuming that each leg is formed by a serial chain of flexure elements and rigid
links, where a rigid link has infinite stiffness (non-compliance), the compliance
of each leg ($C^l$) is governed by the compliance of each flexure element, $C_j^e$, expressed
as follows:
$$C^l = \sum_{j=1}^{n} J_j^e R_j^e C_j^e \left(R_j^e\right)^{-1} \left(J_j^e\right)^T \tag{9}$$
where n denotes the number of flexure elements, and $R_j^e$ and $J_j^e$ are the rotation matrix and
translation matrix of the jth flexure element, respectively. Referring to Reference [57], the
compliance matrix of each original flexure element, $C_j^e$, with respect to the local frame, as
illustrated in Figure 2, is defined as follows:
$$C_j^e = \begin{bmatrix}
c^e_{11} & & & & & \\
0 & c^e_{22} & & & \text{SYM} & \\
0 & 0 & c^e_{33} & & & \\
0 & 0 & 0 & c^e_{44} & & \\
0 & 0 & c^e_{53} & 0 & c^e_{55} & \\
0 & c^e_{62} & 0 & 0 & 0 & c^e_{66}
\end{bmatrix} \tag{10}$$
Equation (10) is applicable for both the beam type and notch type flexure elements, as
illustrated in Figure 2 [57]. In addition, the geometry of each flexure element type can vary
without changing the form of the compliant matrix expressed in Equation (10). Several
kinds of flexure elements which have a similar form of compliance matrix are presented in
Appendix A. Note that the X”Y” plane of the local frame of each flexure element (Figure 2)
lies on the parallel plane with the XY plane of the global frame, as illustrated in Figure 1.
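By referring to Equation (9), we see that the rotation matrix, $R_j^e$, is a 6 × 6 matrix and is
defined as the multiplication of the rotation matrices about the X”, Y”, and Z” axes ($R^e_{xj}$,
$R^e_{yj}$, and $R^e_{zj}$). Hence, it is written as follows:
$$R_j^e = \begin{bmatrix} R^e_{zj} R^e_{yj} R^e_{xj} & 0 \\ 0 & R^e_{zj} R^e_{yj} R^e_{xj} \end{bmatrix} \quad \text{where}$$
$$R^e_{zj} = \begin{bmatrix} \cos\gamma_j & -\sin\gamma_j & 0 \\ \sin\gamma_j & \cos\gamma_j & 0 \\ 0 & 0 & 1 \end{bmatrix}, \quad
R^e_{yj} = \begin{bmatrix} \cos\beta_j & 0 & \sin\beta_j \\ 0 & 1 & 0 \\ -\sin\beta_j & 0 & \cos\beta_j \end{bmatrix}, \quad
R^e_{xj} = \begin{bmatrix} 1 & 0 & 0 \\ 0 & \cos\alpha_j & -\sin\alpha_j \\ 0 & \sin\alpha_j & \cos\alpha_j \end{bmatrix} \tag{11}$$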
Here, $\alpha_j$, $\beta_j$, and $\gamma_j$ represent the rotation angles about the X”, Y”, and Z” axes,
respectively. The geometries of a flexure element before and after orientation are also
illustrated in Figure 2. As for the 6 × 6 translation matrix, $J_j^e$, it represents the projected
distances onto the three axes ($r_{xj}$, $r_{yj}$, and $r_{zj}$) from the jth flexure element to the local frame
of the leg that are indicated by vector $r_j$, as shown in Figure 1, written as follows:
$$J_j^e = \begin{bmatrix} I & r_j^e \\ 0 & I \end{bmatrix} \quad \text{where} \quad r_j^e = \begin{bmatrix} 0 & r_{zj} & -r_{yj} \\ -r_{zj} & 0 & r_{xj} \\ r_{yj} & -r_{xj} & 0 \end{bmatrix} \tag{12}$$
Using Equations (9)–(12), the compliance matrix of a leg, $C^l$, can be obtained, and the
stiffness matrix of each leg is given as $K^l = \left(C^l\right)^{-1}$.
As mentioned earlier, the stiffness matrix of each leg must follow the exact form shown
in Equation (8), and this requirement applies to its corresponding compliance matrix too.
The detailed derivation of the compliance matrix of each leg with the aim of achieving
that requirement is presented in Appendix B. To summarize the results obtained from
Appendix B, one condition which allows for the compliance matrix of a leg to become the
exact same form as Equation (8) is for seven components within the compliance matrix
of a leg to be zeros, i.e., $c^l_{31} = c^l_{32} = c^l_{41} = c^l_{42} = c^l_{51} = c^l_{52} = c^l_{63} = 0$. This condition
offers simplicity during the design stage and can be used as the standard approach to
synthesize 3L-CPMs with the aim of achieving fully decoupled motion capability. However,
it also introduces a multiple-solutions problem to solve the corresponding components
within the stiffness matrix, $K^l$. Among a number of possible solutions, $k^l_{64} = k^l_{65} = 0$ is a
unique solution that is used to fulfill the condition in this work. By adopting this unique
solution, both the stiffness matrix and the compliance matrix of a leg will have the same
form as expressed in Equation (13), and the expression of each component within $K^l$ is
given in Appendix C.
$$\underbrace{\begin{bmatrix}
k^l_{11} & & & & & \\
k^l_{21} & k^l_{22} & & & \text{SYM} & \\
0 & 0 & k^l_{33} & & & \\
0 & 0 & k^l_{43} & k^l_{44} & & \\
0 & 0 & k^l_{53} & k^l_{54} & k^l_{55} & \\
k^l_{61} & k^l_{62} & 0 & 0 & 0 & k^l_{66}
\end{bmatrix}}_{K^l}
=
\underbrace{\begin{bmatrix}
c^l_{11} & & & & & \\
c^l_{21} & c^l_{22} & & & \text{SYM} & \\
0 & 0 & c^l_{33} & & & \\
0 & 0 & c^l_{43} & c^l_{44} & & \\
0 & 0 & c^l_{53} & c^l_{54} & c^l_{55} & \\
c^l_{61} & c^l_{62} & 0 & 0 & 0 & c^l_{66}
\end{bmatrix}^{-1}}_{\left(C^l\right)^{-1}} \tag{13}$$
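One way to see why the stiffness and compliance matrices can share the form of Equation (13) is that its zero pattern splits the six coordinates into two groups, {1, 2, 6} and {3, 4, 5}, with no coupling between the groups, and a matrix with that block structure has an inverse with the same block structure. The sketch below illustrates this with a randomly generated matrix; the numbers carry no physical meaning, and the helper names are hypothetical.

```python
import numpy as np

# 1-based (row, column) positions that are zero in Equation (13).
ZERO_ENTRIES = [(3, 1), (3, 2), (4, 1), (4, 2), (5, 1), (5, 2), (6, 3), (6, 4), (6, 5)]

def random_decoupled_compliance(rng):
    """Random symmetric positive-definite matrix with the zero pattern of Equation (13)."""
    C = np.zeros((6, 6))
    for idx in ([0, 1, 5], [2, 3, 4]):          # coordinate groups {1, 2, 6} and {3, 4, 5}
        A = rng.standard_normal((3, 3))
        C[np.ix_(idx, idx)] = A @ A.T + 3.0 * np.eye(3)
    return C

def has_eq13_form(M, tol=1e-9):
    """True if all nine entries listed in ZERO_ENTRIES vanish (up to tol)."""
    return all(abs(M[u - 1, v - 1]) < tol for u, v in ZERO_ENTRIES)

rng = np.random.default_rng(0)
C_leg = random_decoupled_compliance(rng)
K_leg = np.linalg.inv(C_leg)
print(has_eq13_form(C_leg), has_eq13_form(K_leg))
```

Both checks should report that the nine entries vanish, for the compliance matrix and for its inverse.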
where $\bar{C}_j^e$ indicates the compliance matrix of the jth oriented flexure element referring to
the local frame X′Y′Z′ attached to the free end of the leg, as illustrated in Figure 1. In order
to achieve fully decoupled motion, the nine non-diagonal components ($c^l_{31}$, $c^l_{41}$, $c^l_{51}$, $c^l_{32}$, $c^l_{42}$,
$c^l_{52}$, $c^l_{63}$, $c^l_{64}$, and $c^l_{65}$) of the compliance matrix, $C^l$, must be zeros, as shown in Equation (13).
As $C^l$ is the sum of n sub-components, $\bar{C}_j^e$, there could be numerous solutions, because all
components within $\bar{C}_j^e$ can have arbitrary values in general cases. In this work, a special
case where all $\bar{C}_j^e$ have the same form ($\bar{C}^e$) is considered and yields the following:
$$\bar{C}^e = \begin{bmatrix}
\bar{c}^e_{11} & & & & & \\
\bar{c}^e_{21} & \bar{c}^e_{22} & & & \text{SYM} & \\
\bar{c}^e_{31} = 0 & \bar{c}^e_{32} = 0 & \bar{c}^e_{33} & & & \\
\bar{c}^e_{41} = 0 & \bar{c}^e_{42} = 0 & \bar{c}^e_{43} & \bar{c}^e_{44} & & \\
\bar{c}^e_{51} = 0 & \bar{c}^e_{52} = 0 & \bar{c}^e_{53} & \bar{c}^e_{54} & \bar{c}^e_{55} & \\
\bar{c}^e_{61} & \bar{c}^e_{62} & \bar{c}^e_{63} = 0 & \bar{c}^e_{64} = 0 & \bar{c}^e_{65} = 0 & \bar{c}^e_{66}
\end{bmatrix} \tag{15}$$
Note that $\bar{C}^e$ in Equation (15) is different from $C^e$ in Equation (10), since
$\bar{C}^e = J^e R^e C^e \left(R^e\right)^{-1} \left(J^e\right)^T$, as expressed in Equation (14). $R^e$ and $J^e$ are similar to
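$$\begin{aligned}
\bar{c}^e_{32} ={}& -c^e_{11}\cos\beta\sin\beta\sin\gamma - c^e_{44}\cos\beta\,(r_z\cos\beta\cos\gamma + r_x\sin\beta)(r_y\cos\gamma - r_x\sin\gamma) \\
&+ \cos\beta\sin\alpha\,\{\cos\alpha\,[c^e_{62} r_x\cos\beta + \cos\gamma\,(c^e_{22} - c^e_{62} r_z\sin\beta)] + \sin\alpha\,(-c^e_{62} r_z + c^e_{22}\sin\beta)\sin\gamma\} \\
&+ \cos\alpha\cos\beta\,\{c^e_{53} r_x\cos\beta\sin\alpha - \cos\gamma\sin\alpha\,(c^e_{33} + c^e_{53} r_z\sin\beta) + \cos\alpha\,(c^e_{53} r_z + c^e_{33}\sin\beta)\sin\gamma\} \\
&+ \{\cos\alpha\,[c^e_{66} r_x\cos\beta + \cos\gamma\,(c^e_{62} - c^e_{66} r_z\sin\beta)] + \sin\alpha\,(-c^e_{66} r_z + c^e_{62}\sin\beta)\sin\gamma\} \\
&\quad\cdot [\cos\gamma\,(r_x\sin\alpha + r_y\cos\alpha\sin\beta) + (r_y\sin\alpha - r_x\cos\alpha\sin\beta)\sin\gamma] \\
&+ \{c^e_{55} r_x\cos\beta\sin\alpha - \cos\gamma\sin\alpha\,(c^e_{53} + c^e_{55} r_z\sin\beta) + \cos\alpha\,(c^e_{55} r_z + c^e_{53}\sin\beta)\sin\gamma\} \\
&\quad\cdot [\sin\alpha\sin\beta\,(r_y\cos\gamma - r_x\sin\gamma) - \cos\alpha\,(r_x\cos\gamma + r_y\sin\gamma)]
\end{aligned} \tag{19}$$
$$\begin{aligned}
\bar{c}^e_{42} ={}& -c^e_{44}\cos\beta\cos\gamma\,(r_z\cos\beta\cos\gamma + r_x\sin\beta) \\
&+ (\cos\alpha\cos\gamma\sin\beta + \sin\alpha\sin\gamma) \\
&\quad\cdot \{\cos\alpha\,[c^e_{66} r_x\cos\beta + \cos\gamma\,(c^e_{62} - c^e_{66} r_z\sin\beta)] + \sin\alpha\,(-c^e_{66} r_z + c^e_{62}\sin\beta)\sin\gamma\} \\
&+ (\cos\gamma\sin\alpha\sin\beta - \cos\alpha\sin\gamma) \\
&\quad\cdot \{c^e_{55} r_x\cos\beta\sin\alpha - \cos\gamma\sin\alpha\,(c^e_{53} + c^e_{55} r_z\sin\beta) + \cos\alpha\,(c^e_{55} r_z + c^e_{53}\sin\beta)\sin\gamma\}
\end{aligned} \tag{20}$$
$$\begin{aligned}
\bar{c}^e_{52} ={}& -c^e_{44}\cos\beta\,(r_z\cos\beta\cos\gamma + r_x\sin\beta)\sin\gamma \\
&+ (-\cos\gamma\sin\alpha + \cos\alpha\sin\beta\sin\gamma) \\
&\quad\cdot \{\cos\alpha\,[c^e_{66} r_x\cos\beta + \cos\gamma\,(c^e_{62} - c^e_{66} r_z\sin\beta)] + \sin\alpha\,(-c^e_{66} r_z + c^e_{62}\sin\beta)\sin\gamma\} \\
&+ (\cos\alpha\cos\gamma + \sin\alpha\sin\beta\sin\gamma) \\
&\quad\cdot \{c^e_{55} r_x\cos\beta\sin\alpha - \cos\gamma\sin\alpha\,(c^e_{53} + c^e_{55} r_z\sin\beta) + \cos\alpha\,(c^e_{55} r_z + c^e_{53}\sin\beta)\sin\gamma\}
\end{aligned} \tag{21}$$
$$\begin{aligned}
\bar{c}^e_{63} ={}& \tfrac{1}{2}\cos\beta\,\{(c^e_{62} + c^e_{53})\cos\beta\sin 2\alpha \\
&+ [2c^e_{44} - c^e_{55} - c^e_{66} + (c^e_{55} - c^e_{66})\cos 2\alpha]\sin\beta\,(-r_y\cos\gamma + r_x\sin\gamma) \\
&- (c^e_{55} - c^e_{66})\sin 2\alpha\,(r_x\cos\gamma + r_y\sin\gamma)\}
\end{aligned} \tag{22}$$
$$\bar{c}^e_{64} = \cos\beta\,[\cos\gamma\,(-c^e_{44} + c^e_{66}\cos^2\alpha + c^e_{55}\sin^2\alpha)\sin\beta + (-c^e_{55} + c^e_{66})\cos\alpha\sin\alpha\sin\gamma] \tag{23}$$
$$\bar{c}^e_{65} = \cos\beta\,[(c^e_{55} - c^e_{66})\cos\alpha\cos\gamma\sin\alpha + c^e_{66}\cos^2\alpha\sin\beta\sin\gamma + (-c^e_{44} + c^e_{55}\sin^2\alpha)\sin\beta\sin\gamma] \tag{24}$$
This set of equations can be solved by considering Equation (22) first, because β is the
only dominant angular variable, and the results are given as follows:
$$\bar{c}^e_{63} = 0 \;\Leftrightarrow\; \left[\begin{array}{l} \beta = 90^\circ \\ \alpha = 0^\circ \text{ and } \beta = 0^\circ, 180^\circ \\ \alpha = 90^\circ \text{ and } \beta = 0^\circ, 180^\circ \end{array}\right. ;\quad \forall \gamma, r_x, r_y, r_z \tag{25}$$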
Here, the rotation angle about the X” axis, α, varies from 0° to 90°, because of the
symmetrical structure of the flexure elements, as illustrated in Figure 2. Equation (25)
shows that there are three possible cases for $\bar{c}^e_{63} = 0$, with the four remaining variables (γ,
$r_x$, $r_y$, and $r_z$) being arbitrary values.
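The closed-form expression of Equation (22) can be checked numerically against the matrix product $J^e R^e C^e (R^e)^{-1} (J^e)^T$ built from Equations (10)–(12). The following sketch uses arbitrary, non-physical values for the compliance components, angles, and offsets; the dictionary `c` and all function names are hypothetical.

```python
import numpy as np

def rotation(alpha, beta, gamma):
    """Rz(gamma) @ Ry(beta) @ Rx(alpha), the 3 x 3 block of Equation (11)."""
    ca, sa = np.cos(alpha), np.sin(alpha)
    cb, sb = np.cos(beta), np.sin(beta)
    cg, sg = np.cos(gamma), np.sin(gamma)
    Rx = np.array([[1, 0, 0], [0, ca, -sa], [0, sa, ca]])
    Ry = np.array([[cb, 0, sb], [0, 1, 0], [-sb, 0, cb]])
    Rz = np.array([[cg, -sg, 0], [sg, cg, 0], [0, 0, 1]])
    return Rz @ Ry @ Rx

def oriented_compliance(c, angles, r):
    """Oriented element compliance from Equations (10)-(12)."""
    C_e = np.diag([c[11], c[22], c[33], c[44], c[55], c[66]])
    C_e[4, 2] = C_e[2, 4] = c[53]
    C_e[5, 1] = C_e[1, 5] = c[62]
    R = np.zeros((6, 6))
    R[:3, :3] = R[3:, 3:] = rotation(*angles)
    rx, ry, rz = r
    J = np.eye(6)
    J[:3, 3:] = [[0, rz, -ry], [-rz, 0, rx], [ry, -rx, 0]]
    return J @ R @ C_e @ R.T @ J.T            # R^{-1} equals R^T

def c63_closed_form(c, angles, r):
    """Right-hand side of Equation (22)."""
    a, b, g = angles
    rx, ry, _ = r
    return 0.5 * np.cos(b) * (
        (c[62] + c[53]) * np.cos(b) * np.sin(2 * a)
        + (2 * c[44] - c[55] - c[66] + (c[55] - c[66]) * np.cos(2 * a))
        * np.sin(b) * (-ry * np.cos(g) + rx * np.sin(g))
        - (c[55] - c[66]) * np.sin(2 * a) * (rx * np.cos(g) + ry * np.sin(g))
    )

# Arbitrary test values (not physical data).
c = {11: 1.0, 22: 2.0, 33: 3.0, 44: 4.0, 55: 5.0, 66: 6.0, 53: 0.5, 62: -0.7}
angles, r = (0.3, 0.7, 1.1), (0.2, -0.4, 0.6)

numeric = oriented_compliance(c, angles, r)[5, 2]
analytic = c63_closed_form(c, angles, r)
at_beta_90 = oriented_compliance(c, (0.3, np.pi / 2, 1.1), r)[5, 2]
```

The same comparison can be repeated for the other components by reading different entries of the matrix returned by `oriented_compliance`.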
First, the case with β = 90◦ is considered. With every component within the compli-
ance matrix of the flexure element having a specific value, as shown in Equation (10), six
Equations, from (16) to (21), are always different from zero with any value of γ, r x , ry , and
rz . Hence, β = 90◦ is not a feasible solution.
Next, consider the second case with α = 0° and β = 0°, 180°; the following results can
be obtained:
$$\begin{aligned}
\bar{c}^e_{31} &= \pm r_z\,[\mp c^e_{55} r_x\cos^2\gamma \mp c^e_{44} r_x\sin^2\gamma + \cos\gamma\,(c^e_{53} + (\pm c^e_{44} \mp c^e_{55})\, r_y\sin\gamma)] \\
\bar{c}^e_{41} &= (c^e_{44} - c^e_{55})\, r_z\cos\gamma\sin\gamma \\
\bar{c}^e_{51} &= r_z\,(c^e_{55}\cos^2\gamma + c^e_{44}\sin^2\gamma) \\
\bar{c}^e_{32} &= \pm r_z\,[\mp c^e_{44} r_y\cos^2\gamma + (\pm c^e_{44} \mp c^e_{55})\, r_x\cos\gamma\sin\gamma + \sin\gamma\,(c^e_{53} \mp c^e_{55} r_y\sin\gamma)] \\
\bar{c}^e_{42} &= -r_z\,(c^e_{44}\cos^2\gamma + c^e_{55}\sin^2\gamma) \\
\bar{c}^e_{52} &= (-c^e_{44} + c^e_{55})\, r_z\cos\gamma\sin\gamma \\
\bar{c}^e_{64} &= \bar{c}^e_{65} = 0
\end{aligned} \tag{26}$$
Here, the upper signs of “±” and “∓” in Equation (26) represent the case of α = 0°
and β = 0°, while the lower signs represent the case of α = 0° and β = 180°. To solve
Equation (26), $\bar{c}^e_{51}$ is first considered to be equal to zero; a unique solution, $r_z = 0$, can be
obtained, and γ can be any value. With $r_z = 0$, all the remaining equations in Equation
(26) will also be equal to zeros. Hence, $r_z = 0$, α = 0°, β = 0° or 180° are solutions used to
obtain a 3L-CPM with any DOF and decoupled motions.
Similarly, for the case with α = 90° and β = 0° or 180°, it can be shown that this is
also a possible solution. In summary, the two feasible solutions found from Equation (25)
with $r_z = 0$ are as follows:
$$\left[\begin{array}{l} \alpha = 0^\circ \text{ and } \beta = 0^\circ, 180^\circ \\ \alpha = 90^\circ \text{ and } \beta = 0^\circ, 180^\circ \end{array}\right. ;\quad \forall \gamma, r_x, r_y \tag{27}$$
Equation (27) provides the design criteria for the orientations and positions of the
flexure elements that need to be satisfied to design a 3L-CPM with fully decoupled motion
capability. Figure 3 illustrates the desired orientation of the flexure elements about the X”
and Y” axes, with various orientations about the Z” axis. With flexure elements in a leg
having these orientations and distributing in the X Y plane (rz = 0), a 3L-CPM is able to
achieve fully decoupled motions.
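The design criteria of Equation (27) can be illustrated at the level of a whole leg by summing several oriented elements as in Equation (9) and inspecting the nine components that must vanish. The sketch below is a numerical illustration under assumed, non-physical values: three identical elements with arbitrary γ, $r_x$, and $r_y$ are placed once in the plane ($r_z$ = 0) and once with a single element shifted out of the plane. All names and numbers are hypothetical.

```python
import numpy as np

NINE = [(3, 1), (4, 1), (5, 1), (3, 2), (4, 2), (5, 2), (6, 3), (6, 4), (6, 5)]

def rotation(alpha, beta, gamma):
    """Rz(gamma) @ Ry(beta) @ Rx(alpha), the 3 x 3 block of Equation (11)."""
    ca, sa = np.cos(alpha), np.sin(alpha)
    cb, sb = np.cos(beta), np.sin(beta)
    cg, sg = np.cos(gamma), np.sin(gamma)
    Rx = np.array([[1, 0, 0], [0, ca, -sa], [0, sa, ca]])
    Ry = np.array([[cb, 0, sb], [0, 1, 0], [-sb, 0, cb]])
    Rz = np.array([[cg, -sg, 0], [sg, cg, 0], [0, 0, 1]])
    return Rz @ Ry @ Rx

def oriented_compliance(C_e, angles, r):
    """One term of the sum in Equation (9)."""
    R = np.zeros((6, 6))
    R[:3, :3] = R[3:, 3:] = rotation(*angles)
    rx, ry, rz = r
    J = np.eye(6)
    J[:3, 3:] = [[0, rz, -ry], [-rz, 0, rx], [ry, -rx, 0]]
    return J @ R @ C_e @ R.T @ J.T

def leg_compliance(C_e, elements):
    """Equation (9): sum over (angles, offset) pairs."""
    return sum(oriented_compliance(C_e, ang, r) for ang, r in elements)

def max_coupling(M):
    """Largest magnitude among the nine components that must be zero."""
    return max(abs(M[u - 1, v - 1]) for u, v in NINE)

# Element compliance with the form of Equation (10); values are arbitrary.
C_e = np.diag([1.0, 2.0, 3.0, 4.0, 5.0, 6.0])
C_e[4, 2] = C_e[2, 4] = 0.5
C_e[5, 1] = C_e[1, 5] = -0.7

in_plane = [
    ((0.0, 0.0, 0.4), (0.10, 0.20, 0.0)),
    ((np.pi / 2, np.pi, 1.3), (-0.30, 0.05, 0.0)),
    ((0.0, np.pi, -0.8), (0.25, -0.15, 0.0)),
]
out_of_plane = [((0.0, 0.0, 0.4), (0.10, 0.20, 0.1))] + in_plane[1:]

C_in = leg_compliance(C_e, in_plane)
K_in = np.linalg.inv(C_in)
C_out = leg_compliance(C_e, out_of_plane)
print(max_coupling(C_in), max_coupling(K_in), max_coupling(C_out))
```

In this example, the in-plane leg keeps the form of Equation (13) for both compliance and stiffness, whereas a non-zero $r_z$ on a single element is enough to reintroduce coupling terms.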
Figure 3. Orientations of the flexure elements in fully decoupled motion 3L-CPMs: (a) beam-type and
(b) notch-type with α = 0◦ , β = 0◦ or 180◦ , and rz = 0, respectively. (c) Beam-type and (d) notch-type
with α = 90◦ , β = 0◦ , 180◦ , and rz = 0 respectively.
Figure 4. Construction of a 3L-CPM leg containing two reflecting serial flexure chains.
From the literature [6,7,28,31,42,43,53,55,56,58], we can see that the double flexure
chains are either on the same plane or have an offset distance of 2Δ along the Z axis, as
shown in Figure 4. The stiffness matrix of each leg is expressed as follows:
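$$K^l = \left(\sum_{j=1}^{n} J_{(+\Delta)} C^{sc} J_{(+\Delta)}^T\right)^{-1} + \left(\sum_{j=1}^{n} J_{(-\Delta)} M C^{sc} M^T J_{(-\Delta)}^T\right)^{-1} \tag{28}$$
where M is the reflection matrix about the Y′Z′ plane given in Equation (29); and $J_{(+\Delta)}$ and
$J_{(-\Delta)}$ represent the offset matrices used to shift the original flexure chain and the reflecting
flexure chain along the Z′-axis distances of +Δ and −Δ, respectively, as given in Equation (30).
$$M = \begin{bmatrix}
-1 & 0 & 0 & 0 & 0 & 0 \\
0 & 1 & 0 & 0 & 0 & 0 \\
0 & 0 & 1 & 0 & 0 & 0 \\
0 & 0 & 0 & -1 & 0 & 0 \\
0 & 0 & 0 & 0 & 1 & 0 \\
0 & 0 & 0 & 0 & 0 & 1
\end{bmatrix} \tag{29}$$
$$J_{(\pm\Delta)} = \begin{bmatrix}
1 & 0 & 0 & 0 & 0 & 0 \\
0 & 1 & 0 & 0 & 0 & 0 \\
0 & 0 & 1 & 0 & 0 & 0 \\
0 & \pm\Delta & 0 & 1 & 0 & 0 \\
\mp\Delta & 0 & 0 & 0 & 1 & 0 \\
0 & 0 & 0 & 0 & 0 & 1
\end{bmatrix} \tag{30}$$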
Here, $C^{sc}$ is the compliance matrix of a serial flexure chain that can be calculated
by Equation (9). The results from Section 3 are used to analyze the stiffness property of
3L-CPMs with two reflecting flexure chains. After substituting Equations (9), (14), (29) and
(30) into Equation (28), the results show that the decoupled motion capability can only be
achieved when the offset distance 2Δ = 0 (or Δ = 0). The offset distance can be considered
as the translation component along the Z′ axis of each flexure element ($r_{zj}$), which can lead
to coupled motions, as mentioned before. Most importantly, to achieve fully decoupled output
motion, $C^{sc}$ must be in the following form:
$$C^{sc} = \begin{bmatrix}
c^{sc}_{11} & & & & & \\
c^{sc}_{21} & c^{sc}_{22} & & & \text{SYM} & \\
0 & 0 & c^{sc}_{33} & & & \\
0 & 0 & c^{sc}_{43} & c^{sc}_{44} & & \\
0 & 0 & c^{sc}_{53} & c^{sc}_{54} & c^{sc}_{55} & \\
c^{sc}_{61} & c^{sc}_{62} & 0 & 0 & 0 & c^{sc}_{66}
\end{bmatrix} \tag{31}$$
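The role of the offset Δ can be illustrated numerically with Equations (28)–(30). The sketch below assumes a randomly generated $C^{sc}$ that has the form of Equation (31), and it reads Equation (28) with a single $C^{sc}$ per bracket, i.e., the sum over the flexure elements is taken to be already contained in $C^{sc}$ through Equation (9). The values are not physical, and the function names are hypothetical.

```python
import numpy as np

NINE = [(3, 1), (4, 1), (5, 1), (3, 2), (4, 2), (5, 2), (6, 3), (6, 4), (6, 5)]
M = np.diag([-1.0, 1.0, 1.0, -1.0, 1.0, 1.0])      # reflection matrix, Equation (29)

def offset_matrix(delta):
    """Offset matrix of Equation (30); pass a negative delta for the reflected chain."""
    J = np.eye(6)
    J[3, 1] = delta
    J[4, 0] = -delta
    return J

def leg_stiffness(C_sc, delta):
    """Equation (28) with one serial-chain compliance C_sc per bracket (assumption)."""
    J_p, J_m = offset_matrix(delta), offset_matrix(-delta)
    C_original = J_p @ C_sc @ J_p.T
    C_reflected = J_m @ M @ C_sc @ M.T @ J_m.T
    return np.linalg.inv(C_original) + np.linalg.inv(C_reflected)

def max_coupling(K):
    """Largest magnitude among the nine components that must be zero in Equation (13)."""
    return max(abs(K[u - 1, v - 1]) for u, v in NINE)

# Random symmetric positive-definite C_sc with the zero pattern of Equation (31).
rng = np.random.default_rng(2)
C_sc = np.zeros((6, 6))
for idx in ([0, 1, 5], [2, 3, 4]):
    A = rng.standard_normal((3, 3))
    C_sc[np.ix_(idx, idx)] = A @ A.T + 3.0 * np.eye(3)

K_no_offset = leg_stiffness(C_sc, 0.0)
K_offset = leg_stiffness(C_sc, 0.1)
print(max_coupling(K_no_offset), max_coupling(K_offset))
```

In this example, the nine components vanish for Δ = 0 and reappear once the two chains are offset, in line with the statement above.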
Figure 5. (a) Schematic diagram of 3RRR-CPM and (b) micro-motion stage developed based
on 3RRR-CPM.
Figure 6. (a) Schematic diagram of 3PRR-CPM and (b) a positioning stage developed based
on 3PRR-CPM.
where $J_j^e$ and $R_j^e$ are obtained by using Equations (12) and (11), respectively. The result of
Equation (32) is shown in Equation (33), and the detailed expression of each component
is given in Appendix D. It is observed that the compliance matrix of each leg within the
3RRR-CPMs and 3PRR-CPMs is similar to the form expressed in Equation (13). Hence,
this observation suggests that the 3RRR and 3PRR configurations are able to deliver fully
decoupled motion. This performance indicator also highlighted why both configurations
are popular designs for developing state-of-the-art 3L-CPMs.
$$C^l = \begin{bmatrix}
c^l_{11} & & & & & \\
c^l_{21} & c^l_{22} & & & \text{SYM} & \\
0 & 0 & c^l_{33} & & & \\
0 & 0 & c^l_{43} & c^l_{44} & & \\
0 & 0 & c^l_{53} & c^l_{54} & c^l_{55} & \\
c^l_{61} & c^l_{62} & 0 & 0 & 0 & c^l_{66}
\end{bmatrix} \tag{33}$$
Figure 7. Structure of (a) 3PPS CPM [62] and (b) 3RPS CPM modeled based on the design presented
in [21].
Based on the results obtained from Equation (16) to Equation (24), the components,
i.e., $\bar{c}^{e_j}_{31}$, $\bar{c}^{e_j}_{41}$, $\bar{c}^{e_j}_{51}$, $\bar{c}^{e_j}_{32}$, $\bar{c}^{e_j}_{42}$, and $\bar{c}^{e_j}_{52}$, are non-zeros because of the existence of $r_{zj}$. Thus,
the corresponding components in $C^l$ are also non-zeros. Therefore, the legs’ compliance
matrices within the 3PPS- and 3RPS-CPMs do not satisfy Equation (13). As a result, both
the 3-DOF 3PPS- and 3RPS-CPMs will generate coupled motions. This coupled motion
property can be observed from Equation (35), which is the 6 × 6 stiffness matrix of the 3PPS-
CPM (Figure 7a) taken from Reference [5]. Having those five non-diagonal components
indicates that the developed 3PPS-CPM cannot deliver fully decoupled motion.
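\[
K = \begin{bmatrix}
2.57 \times 10^{5} & & & & & \\
0 & 2.57 \times 10^{5} & & & \text{SYM} & \\
0 & 0 & 3.99 \times 10^{5} & & & \\
5.47 \times 10^{1} & 5.18 \times 10^{3} & 0 & 9.51 \times 10^{2} & & \\
-5.18 \times 10^{3} & 5.47 \times 10^{1} & 0 & 0 & 9.51 \times 10^{2} & \\
0 & 0 & -5.63 \times 10^{1} & 0 & 0 & 1.72 \times 10^{3}
\end{bmatrix} \tag{35}
\]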
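As an illustration, the coupling terms in Equation (35) can be listed programmatically. The sketch below stores the lower triangle of K exactly as printed above; the helper name `nonzero_off_diagonals` is hypothetical.

```python
# Lower triangle of the stiffness matrix K in Equation (35), row by row.
K_LOWER = [
    [2.57e5],
    [0.0, 2.57e5],
    [0.0, 0.0, 3.99e5],
    [5.47e1, 5.18e3, 0.0, 9.51e2],
    [-5.18e3, 5.47e1, 0.0, 0.0, 9.51e2],
    [0.0, 0.0, -5.63e1, 0.0, 0.0, 1.72e3],
]


def nonzero_off_diagonals(lower):
    """Return (row, col, value) for every non-zero entry below the diagonal (1-based)."""
    found = []
    for i, row in enumerate(lower):
        for j, value in enumerate(row[:i]):  # row[:i] excludes the diagonal entry
            if value != 0.0:
                found.append((i + 1, j + 1, value))
    return found


coupling = nonzero_off_diagonals(K_LOWER)
for row, col, value in coupling:
    print(f"k{row}{col} = {value:.3g}")
```

The list contains the five non-diagonal components k41, k42, k51, k52, and k63 referred to in the text.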
Figure 8. Prototypes of 3PRPR-CPMs with (a) X-Y-Z motions modeled based on the design presented
in [24] and (b) θX-θY-Z motions modeled based on the design presented in [17], respectively.
Based on Figures 2b, 8a, and A1b, the two P-joints can have arbitrary rotation angles
(γ1 and γ3) about their Z″ axes, while the two R-joints have two rotations, i.e., about the
X″ axes with an angle of 90° (α2 = α4 = 90°) and about the Z″ axes with an angle of 90°
(γ2 = γ4 = 90°). All flexure elements have no rotation about their Y″ axes (β1 = β2 = β3 = β4
= 0). In addition, as the first three flexure elements (e1, e2, and e3) are located at specific
distances from the local frame of the leg X′Y′Z′, the distance vectors are $r_1 = (r_{x1}, r_{y1}, r_{z1})$,
where $c_{uv}^{P}$ and $c_{uv}^{S}$ (with u, v = 1, 2, . . . , 6) are the components within the compliance
matrices of the P-joints and S-joints, respectively.
Figure 9. Six-DOF 3L-CPM with two reflecting PSS chains in a leg modeled based on the design
presented in [28].
In this design, the two symmetrical PSS serial chains have no offset distance. Hence,
the offset matrices, J(±Δ), as shown in Equation (30), become identity matrices. By using
Equations (9), (11) and (12) with the defined parameters, the compliance matrix of a serial
flexure chain in the proposed leg, $C^{sc}$, is written in Equation (39), and the result of each
component is given in Appendix F.
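\[
C^{sc} = \begin{bmatrix}
c_{11}^{sc} & & & & & \\
c_{21}^{sc} & c_{22}^{sc} & & & \text{SYM} & \\
c_{31}^{sc} & c_{32}^{sc} & c_{33}^{sc} & & & \\
c_{41}^{sc} & c_{42}^{sc} & c_{43}^{sc} & c_{44}^{sc} & & \\
c_{51}^{sc} & c_{52}^{sc} & c_{53}^{sc} & c_{54}^{sc} & c_{55}^{sc} & \\
c_{61}^{sc} & c_{62}^{sc} & 0 & 0 & 0 & c_{66}^{sc}
\end{bmatrix} \tag{39}
\]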
Based on the obtained $C^{sc}$, the stiffness matrix of each leg, $K^{l}$, can be calculated by
Equation (28). The form of $C^{sc}$ in this design differs from the condition
shown in Equation (31), as the flexure elements are not located in the XY plane ($r_{zj} \neq 0$)
and the P-joint has a rotation about the Y″ axis ($\beta_1 \neq 0$). Therefore, even though the offset distance
between the two serial flexure chains in each leg is zero (Δ = 0), the output motions of this
6-DOF 3L-CPM are coupled, based on the results obtained in Section 4. Using a similar
analysis, the 6-DOF 3L-CPM presented in Reference [31] also generates coupled motions.
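\[
C^{e_1} = \begin{bmatrix}
c_{11}^{e_1} & & & & & \\
0 & c_{22}^{e_1} & & & \text{SYM} & \\
0 & 0 & c_{33}^{e_1} & & & \\
0 & 0 & 0 & c_{44}^{e_1} & & \\
0 & 0 & c_{53}^{e_1} & 0 & c_{55}^{e_1} & \\
0 & c_{62}^{e_1} & 0 & 0 & 0 & c_{66}^{e_1}
\end{bmatrix}, \text{ and }
C^{e_2} = \begin{bmatrix}
c_{11}^{e_2} & & & & & \\
0 & c_{22}^{e_2} & & & \text{SYM} & \\
0 & 0 & c_{33}^{e_2} & & & \\
0 & 0 & 0 & c_{44}^{e_2} & & \\
0 & 0 & c_{53}^{e_2} & 0 & c_{55}^{e_2} & \\
0 & c_{62}^{e_2} & 0 & 0 & 0 & c_{66}^{e_2}
\end{bmatrix} \tag{40}
\]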
Figure 10. Six-DOF 3L-CPM modeled based on the design presented in [42].
Based on Equations (9), (11) and (12) and the defined parameters, the obtained compliance
matrix of the serial flexure chain shown in Figure 10, $C^{sc}$, is written as follows:
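\[
C^{sc} = \begin{bmatrix}
c_{11}^{e_1} + c_{22}^{e_2} + c_{66}^{e_1} r_{y1}^{2} & & & & & \\
-\left(c_{62}^{e_1} + c_{66}^{e_1} r_{x1}\right) r_{y1} & c_{22}^{e_1} + c_{11}^{e_2} + r_{x1}\left(2c_{62}^{e_1} + c_{66}^{e_1} r_{x1}\right) & & & \text{SYM} & \\
0 & 0 & c_{33}^{e_1} + c_{33}^{e_2} - 2c_{53}^{e_1} r_{x1} + c_{55}^{e_1} r_{x1}^{2} + c_{44}^{e_1} r_{y1}^{2} & & & \\
0 & 0 & -c_{53}^{e_2} + c_{44}^{e_1} r_{y1} & c_{44}^{e_1} + c_{55}^{e_2} & & \\
0 & 0 & c_{53}^{e_1} - c_{55}^{e_1} r_{x1} & 0 & c_{55}^{e_1} + c_{44}^{e_2} & \\
-c_{62}^{e_2} - c_{66}^{e_1} r_{y1} & c_{62}^{e_1} + c_{66}^{e_1} r_{x1} & 0 & 0 & 0 & c_{66}^{e_1} + c_{66}^{e_2}
\end{bmatrix} \tag{41}
\]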
It is observed that the compliance matrix presented in Equation (41) matches the form
expressed in Equation (31), and all flexure elements are distributed in the XY plane (Δ = 0,
so the J(±Δ) become identity matrices). Note that $c_{54}^{sc}$ is zero and the nine essential
non-diagonal components ($c_{31}^{sc}$, $c_{41}^{sc}$, $c_{51}^{sc}$, $c_{32}^{sc}$, $c_{42}^{sc}$, $c_{52}^{sc}$, $c_{63}^{sc}$, $c_{64}^{sc}$, and $c_{65}^{sc}$) are also zero, which
satisfies the special case proposed in this work. As a result, this design is able
to produce fully decoupled motion. This can be demonstrated by the diagonal
matrix that results from applying Equations (1) and (28) to analyze the stiffness property of the
entire 3L-CPM. Several micro-scale 6-DOF 3L-CPMs developed based on the same
concept were proposed in Reference [43], and a variant of this design was presented in
Reference [58].
Figure 12. Three-DOF (X-Y-θZ) 3L-CPM designed by using the optimization method: (a) structure of
a leg and (b) prototype built based on the design presented in [53].
From Figure 12a, we can see that the 3L-CPM has two reflecting flexure chains with no
offset distance, so, according to the results presented in Section 4, its output motions
are decoupled. However, Equation (43) suggests that this 3L-CPM will generate coupled
motions, since there are two non-diagonal components. This result can be explained by
the stiffness modeling used in Reference [53]: the compliance matrices of the leg were
calculated at the local frame X′Y′Z′ in the middle plane, while the stiffness matrix of the
entire 3L-CPM was derived at the global frame XYZ (located at the center of the end effector)
in the top plane, as illustrated in Figure 12b. This introduces the translation components $D_{zj}$
along the Z axis in the translation matrices of the legs, as demonstrated in Equation (3), and
creates the off-axis stiffness components within the final stiffness matrix. Similar designs
presented in Reference [55] will also generate decoupled motions. If the global frame is
attached at the middle plane of the end effector instead, the non-diagonal stiffness components will be
eliminated, and the 3L-CPM will have a fully decoupled motion characteristic.
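The effect of the reference-frame offset can be illustrated numerically. The sketch below is an assumption-laden illustration rather than the formulation of Reference [53] or Equation (3): it assumes a twist ordering (x, y, z, θx, θy, θz), a small-displacement rigid-body relation u_A = u_B + θ × r, and hypothetical stiffness values and offset D_z. Under those assumptions, a stiffness matrix that is diagonal at frame A acquires exactly two non-diagonal components when expressed at a frame B shifted along Z.

```python
def matmul(A, B):
    """Plain-Python matrix product."""
    return [[sum(A[i][k] * B[k][j] for k in range(len(B)))
             for j in range(len(B[0]))] for i in range(len(A))]


def transpose(A):
    return [list(col) for col in zip(*A)]


def shift_stiffness(K_a, r):
    """Express K_a (defined at frame A) at frame B; r is A's origin seen from B."""
    rx, ry, rz = r
    skew = [[0.0, -rz, ry], [rz, 0.0, -rx], [-ry, rx, 0.0]]
    J = [[1.0 if i == j else 0.0 for j in range(6)] for i in range(6)]
    for i in range(3):
        for j in range(3):
            J[i][3 + j] = -skew[i][j]  # u_A = u_B + theta x r = u_B - [r]x theta
    return matmul(transpose(J), matmul(K_a, J))  # equal strain energy in both frames


# Hypothetical diagonal stiffness at the middle plane and a hypothetical offset D_z.
diag = [2.0e5, 2.0e5, 4.0e5, 1.0e3, 1.0e3, 2.0e3]
K_mid = [[diag[i] if i == j else 0.0 for j in range(6)] for i in range(6)]
D_z = 0.01
K_top = shift_stiffness(K_mid, (0.0, 0.0, -D_z))  # frame B lies D_z above frame A

off_diag = [(i + 1, j + 1, K_top[i][j]) for i in range(6) for j in range(i)
            if abs(K_top[i][j]) > 1e-9]
print(off_diag)
```

With these assumptions the two off-axis terms are k42 = k_y D_z and k51 = −k_x D_z, and they vanish when D_z = 0, which is the situation described for a global frame placed at the middle plane.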
Figure 13. Three-DOF (θX-θY-Z) 3L-CPMs designed by the structural optimization method (a) with offset
distance and (b) without offset distance between two flexure chains in a leg [63].
For the 3L-CPM with Δ ≠ 0, as shown in Figure 13a, the results presented in Reference [6]
agree with the findings described in Section 4: its compliance matrix, $C_a$,
has five non-diagonal components, as derived theoretically in Equation (44). For the other
3L-CPM, with Δ = 0, as shown in Figure 13b [7], the compliance matrix, $C_b$, has only diagonal
components, as demonstrated theoretically in Equation (45). In summary, the motion
property of 3L-CPMs having two reflecting flexure chains in a leg was demonstrated. The
3L-CPM having an offset distance in the Z axis between two flexure chains will generate
coupled motions, while the other, having both flexure chains located in the same plane, will
generate decoupled motions.
6. Discussion
In order to deliver fully decoupled motion, a 3L-CPM must have a 6 × 6 diagonal
stiffness/compliance matrix in which all non-diagonal components are zero. Due to the
property of the parallel architecture, the stiffness matrix of a 3L-CPM can be calculated
from the stiffness matrices of its legs. However, the stiffness matrix of each leg cannot
be derived directly, since a leg can be constructed from one or two serial chains; each chain is formed by
a number of flexure elements and rigid links, and, thus, its characteristic is defined by its
compliance. Because of the difficulty of converting between the stiffness and compliance
matrices, the motion property of existing 3L-CPMs has not been analyzed analytically. To overcome
this limitation and, most importantly, to design a 3L-CPM with a fully decoupled
motion characteristic, the conditions for the compliance matrix of the single serial flexure
chain, i.e., Equation (13), and for the compliance matrix of the double serial flexure chains,
i.e., Equation (31), are provided in this work. A short review of various state-of-the-art
3L-CPMs presented in Section 5 showed that the conditions on the compliance matrix can be
used to identify the motion property of these 3L-CPMs. In order to satisfy these conditions,
the analytical results show that every flexure element within each leg must be located in
the global XY plane with only two orientations (0° and 90°) about its local X″ axis, as
illustrated in Figure 3, and the offset distance along the Z axis between two serial flexure
chains, as shown in Figure 4, must be zero (Δ = 0). In other words, these design criteria
can be used to synthesize a 3L-CPM that aims to achieve fully decoupled motion capability.
Moreover, the findings in this work are applicable to any synthesis method, e.g., the traditional
pseudo-rigid-body model, constraint-based methods, and topology/structural optimization
methods. Consequently, these design criteria and conditions for the compliance matrices of
the flexure chains can be used as fundamental design guidelines for the synthesis of
3L-CPMs with a desired motion property, which can be either decoupled or coupled.
As the 3L-CPM plays an important role in precise motion systems, the motion-decoupling
capability needs to be clearly defined in the design process to make the control
simpler and the output motions more accurate. Based on the literature, several
compliant systems capable of producing precise motions with a simple control
method, owing to the defined motion characteristics of their compliant structures, have been
developed. In particular, the 3-DOF spatial-motion (θX-θY-Z) manipulator [6,7] and the
flexure-based electromagnetic nano-positioning actuator [3] are able to produce a large
workspace with high resolutions, using simple open-loop control systems. Moreover,
3L-CPMs with decoupled motions can also be applied to design micro-fabrication systems,
e.g., the flexural spindle head in a micro drilling machine tool [11,12], the motion stage in a
micro milling system [13], and the flexural stage used to adjust the mirror angles in advanced
three-dimensional fabrication methods [14,15]. In addition, the benefits offered by decoupled
3L-CPMs have recently been exploited in biomedical applications, such as the flexural
micro-dissection device [10]. Since the application range of decoupled-motion 3L-CPMs
has been increasing, the fundamental criteria for synthesizing 3L-CPMs
with fully decoupled motions and the motion properties of the existing designs presented
in this paper provide an important background for developing advanced flexure-based systems.
7. Conclusions
This paper presented the fundamental design criteria for synthesizing any 3L-CPM
with fully decoupled motion capability regardless of the targeted DOF. The stiffness
characteristics of a 3L-CPM were analytically modeled. The derived criteria suggested that the
flexure elements in each leg must be distributed in the same plane as the end effector
of the 3L-CPM in order to fulfill the decoupled-motion requirements. In the case where
each leg contains two parallel reflecting flexure chains, such requirements are valid if both
flexure chains are located in the same plane with no offset distance. To demonstrate the
effectiveness of the design criteria and conditions obtained from this work, several
state-of-the-art 3L-CPMs were analyzed for their stiffness characteristics and compared with
these criteria. The presented cases show that the proposed design criteria can be applied to
accurately determine the motion characteristics of any 3L-CPM through the analysis of its
stiffness/compliance matrix; only 3L-CPMs having diagonal stiffness/compliance matrices
are able to achieve fully decoupled motions. Findings from this work can be used to define
the motion property of any form of 3L-CPM during the design process.
In this paper, only some special solutions were considered to make the non-diagonal
components within the stiffness/compliance matrix of a 3L-CPM equal to zero; there could
be other solutions that need to be explored. Future work will focus on investigating more
general design criteria for synthesizing any CPM with the desired DOF and motion property.
as spherical joint (Figure A1a) or prismatic joint (linear spring), as shown in Figure A1b.
The notch of spherical joint can have a circular or square cross-sectional area, while the
linear spring can be constructed by four notch hinges or a pair of cantilever beams.
Figure A1. Flexure elements: (a) spherical joint and (b) linear spring.
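\[ c_{31}^{l} = \frac{1}{\xi}\left(k_{22}^{l}k_{61}^{l} - k_{21}^{l}k_{62}^{l}\right)\left[k_{64}^{l}\left(k_{53}^{l}k_{54}^{l} - k_{43}^{l}k_{55}^{l}\right) + k_{65}^{l}\left(-k_{44}^{l}k_{53}^{l} + k_{43}^{l}k_{54}^{l}\right)\right] \tag{A2} \]
\[ c_{41}^{l} = -\frac{1}{\xi}\left(k_{22}^{l}k_{61}^{l} - k_{21}^{l}k_{62}^{l}\right)\left[k_{64}^{l}\left((k_{53}^{l})^{2} - k_{33}^{l}k_{55}^{l}\right) + k_{65}^{l}\left(-k_{43}^{l}k_{53}^{l} + k_{33}^{l}k_{54}^{l}\right)\right] \tag{A3} \]
\[ c_{51}^{l} = -\frac{1}{\xi}\left(k_{22}^{l}k_{61}^{l} - k_{21}^{l}k_{62}^{l}\right)\left[k_{64}^{l}\left(-k_{43}^{l}k_{53}^{l} + k_{33}^{l}k_{54}^{l}\right) + k_{65}^{l}\left((k_{43}^{l})^{2} - k_{33}^{l}k_{44}^{l}\right)\right] \tag{A4} \]
\[ c_{32}^{l} = -\frac{1}{\xi}\left(k_{21}^{l}k_{61}^{l} - k_{11}^{l}k_{62}^{l}\right)\left[k_{64}^{l}\left(k_{53}^{l}k_{54}^{l} - k_{43}^{l}k_{55}^{l}\right) + k_{65}^{l}\left(-k_{44}^{l}k_{53}^{l} + k_{43}^{l}k_{54}^{l}\right)\right] \tag{A5} \]
\[ c_{42}^{l} = \frac{1}{\xi}\left(k_{21}^{l}k_{61}^{l} - k_{11}^{l}k_{62}^{l}\right)\left[k_{64}^{l}\left((k_{53}^{l})^{2} - k_{33}^{l}k_{55}^{l}\right) + k_{65}^{l}\left(-k_{43}^{l}k_{53}^{l} + k_{33}^{l}k_{54}^{l}\right)\right] \tag{A6} \]
\[ c_{52}^{l} = \frac{1}{\xi}\left(k_{21}^{l}k_{61}^{l} - k_{11}^{l}k_{62}^{l}\right)\left[k_{64}^{l}\left(-k_{43}^{l}k_{53}^{l} + k_{33}^{l}k_{54}^{l}\right) + k_{65}^{l}\left((k_{43}^{l})^{2} - k_{33}^{l}k_{44}^{l}\right)\right] \tag{A7} \]
\[ c_{63}^{l} = \frac{1}{\xi}\left((k_{21}^{l})^{2} - k_{11}^{l}k_{22}^{l}\right)\left[k_{64}^{l}\left(k_{53}^{l}k_{54}^{l} - k_{43}^{l}k_{55}^{l}\right) + k_{65}^{l}\left(-k_{44}^{l}k_{53}^{l} + k_{43}^{l}k_{54}^{l}\right)\right] \tag{A8} \]
\[ c_{64}^{l} = -\frac{1}{\xi}\left((k_{21}^{l})^{2} - k_{11}^{l}k_{22}^{l}\right)\left[k_{64}^{l}\left((k_{53}^{l})^{2} - k_{33}^{l}k_{55}^{l}\right) + k_{65}^{l}\left(-k_{43}^{l}k_{53}^{l} + k_{33}^{l}k_{54}^{l}\right)\right] \tag{A9} \]
\[ c_{65}^{l} = -\frac{1}{\xi}\left((k_{21}^{l})^{2} - k_{11}^{l}k_{22}^{l}\right)\left[k_{64}^{l}\left(-k_{43}^{l}k_{53}^{l} + k_{33}^{l}k_{54}^{l}\right) + k_{65}^{l}\left((k_{43}^{l})^{2} - k_{33}^{l}k_{44}^{l}\right)\right] \tag{A10} \]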
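\[
\begin{aligned}
c_{21}^{l} = \frac{1}{\xi}\Big\{ & -k_{21}^{l}(k_{53}^{l})^{2}(k_{64}^{l})^{2} + 2k_{43}^{l}k_{53}^{l}\left(k_{54}^{l}k_{61}^{l}k_{62}^{l} + k_{21}^{l}k_{64}^{l}k_{65}^{l} - k_{21}^{l}k_{54}^{l}k_{66}^{l}\right) \\
& - (k_{43}^{l})^{2}\left(k_{55}^{l}k_{61}^{l}k_{62}^{l} + k_{21}^{l}(k_{65}^{l})^{2} - k_{21}^{l}k_{55}^{l}k_{66}^{l}\right) \\
& + k_{33}^{l}\left[k_{21}^{l}k_{55}^{l}(k_{64}^{l})^{2} + k_{54}^{l}\left(-k_{54}^{l}k_{61}^{l}k_{62}^{l} - k_{21}^{l}k_{64}^{l}k_{65}^{l} + k_{21}^{l}k_{54}^{l}k_{66}^{l}\right)\right] \\
& + k_{44}^{l}\left[(k_{53}^{l})^{2}\left(-k_{61}^{l}k_{62}^{l} + k_{21}^{l}k_{66}^{l}\right) + k_{33}^{l}\left(k_{55}^{l}k_{61}^{l}k_{62}^{l} + k_{21}^{l}(k_{65}^{l})^{2} - k_{21}^{l}k_{55}^{l}k_{66}^{l}\right)\right] \Big\}
\end{aligned} \tag{A11}
\]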
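\[ c_{61}^{l} = \frac{1}{\xi}\left[-2k_{43}^{l}k_{53}^{l}k_{54}^{l} + k_{33}^{l}(k_{54}^{l})^{2} + (k_{43}^{l})^{2}k_{55}^{l} + k_{44}^{l}\left((k_{53}^{l})^{2} - k_{33}^{l}k_{55}^{l}\right)\right]\left(k_{22}^{l}k_{61}^{l} - k_{21}^{l}k_{62}^{l}\right) \tag{A12} \]
\[ c_{62}^{l} = -\frac{1}{\xi}\left[-2k_{43}^{l}k_{53}^{l}k_{54}^{l} + k_{33}^{l}(k_{54}^{l})^{2} + (k_{43}^{l})^{2}k_{55}^{l} + k_{44}^{l}\left((k_{53}^{l})^{2} - k_{33}^{l}k_{55}^{l}\right)\right]\left(k_{21}^{l}k_{61}^{l} - k_{11}^{l}k_{62}^{l}\right) \tag{A13} \]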
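\[
\begin{aligned}
c_{43}^{l} = \frac{1}{\xi}\Big\{ & 2k_{21}^{l}\left(k_{53}^{l}k_{54}^{l} - k_{43}^{l}k_{55}^{l}\right)k_{61}^{l}k_{62}^{l} + k_{11}^{l}\left(-k_{53}^{l}k_{54}^{l} - k_{43}^{l}k_{55}^{l}\right)(k_{62}^{l})^{2} \\
& + (k_{21}^{l})^{2}\left(k_{53}^{l}k_{64}^{l}k_{65}^{l} - k_{43}^{l}(k_{65}^{l})^{2} - k_{53}^{l}k_{54}^{l}k_{66}^{l} + k_{43}^{l}k_{55}^{l}k_{66}^{l}\right) \\
& + k_{22}^{l}\left[-k_{53}^{l}\left(k_{54}^{l}(k_{61}^{l})^{2} + k_{11}^{l}k_{64}^{l}k_{65}^{l} - k_{11}^{l}k_{54}^{l}k_{66}^{l}\right) + k_{43}^{l}\left(k_{55}^{l}(k_{61}^{l})^{2} + k_{11}^{l}(k_{65}^{l})^{2} - k_{11}^{l}k_{55}^{l}k_{66}^{l}\right)\right] \Big\}
\end{aligned} \tag{A14}
\]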
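\[
\begin{aligned}
c_{53}^{l} = \frac{1}{\xi}\Big\{ & 2k_{21}^{l}\left(k_{43}^{l}k_{53}^{l} - k_{33}^{l}k_{54}^{l}\right)k_{61}^{l}k_{62}^{l} + k_{11}^{l}\left(-k_{43}^{l}k_{53}^{l} + k_{33}^{l}k_{54}^{l}\right)(k_{62}^{l})^{2} \\
& - (k_{21}^{l})^{2}\left(k_{33}^{l}k_{64}^{l}k_{65}^{l} + k_{43}^{l}k_{53}^{l}k_{66}^{l} - k_{33}^{l}k_{54}^{l}k_{66}^{l}\right) \\
& + k_{22}^{l}\left[k_{43}^{l}k_{53}^{l}\left(-(k_{61}^{l})^{2} + k_{11}^{l}k_{66}^{l}\right) + k_{33}^{l}\left(k_{54}^{l}(k_{61}^{l})^{2} + k_{11}^{l}k_{64}^{l}k_{65}^{l} - k_{11}^{l}k_{54}^{l}k_{66}^{l}\right)\right] \Big\}
\end{aligned} \tag{A15}
\]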
The form of $C^{l}$ needs to be specified as a standard for the design process of decoupled-
motion 3L-CPMs. It is observed that the expressions of the seven compliance components
($c_{31}^{l}$, $c_{32}^{l}$, $c_{41}^{l}$, $c_{42}^{l}$, $c_{51}^{l}$, $c_{52}^{l}$, and $c_{63}^{l}$) corresponding to the seven zero components in the
stiffness matrix have similar forms, as shown in Equations (A2) to (A8). In this paper,
these seven compliance components are required to be zero, so that the
leg's compliance matrix has the same form as its stiffness matrix. This special form
offers simplicity during the design process and can be used as the standard to define the
decoupled-motion capability of various 3L-CPMs. The requirements to make the seven
compliance components equal to zero are written in Equation (A17).
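\[
\begin{cases}
k_{22}^{l}k_{61}^{l} - k_{21}^{l}k_{62}^{l} = 0 \\
k_{21}^{l}k_{61}^{l} - k_{11}^{l}k_{62}^{l} = 0 \\
(k_{21}^{l})^{2} - k_{11}^{l}k_{22}^{l} = 0
\end{cases}
\quad \text{or} \quad
\begin{cases}
k_{64}^{l}\left(k_{53}^{l}k_{54}^{l} - k_{43}^{l}k_{55}^{l}\right) + k_{65}^{l}\left(-k_{44}^{l}k_{53}^{l} + k_{43}^{l}k_{54}^{l}\right) = 0 \\
k_{64}^{l}\left((k_{53}^{l})^{2} - k_{33}^{l}k_{55}^{l}\right) + k_{65}^{l}\left(-k_{43}^{l}k_{53}^{l} + k_{33}^{l}k_{54}^{l}\right) = 0 \\
k_{64}^{l}\left(-k_{43}^{l}k_{53}^{l} + k_{33}^{l}k_{54}^{l}\right) + k_{65}^{l}\left((k_{43}^{l})^{2} - k_{33}^{l}k_{44}^{l}\right) = 0
\end{cases} \tag{A17}
\]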
As the diagonal components in the stiffness matrix are always non-zero, while the
non-diagonal components can be zero or non-zero, the non-diagonal components are
considered as unknowns, and the diagonal ones are parameters. In the first set of equations,
one of the first two equations can be redundant. The solutions to the first set of equations
are as follows:
\[
\begin{cases}
k_{21}^{l} = \pm\sqrt{k_{11}^{l}k_{22}^{l}} \\
k_{61}^{l} = \pm\sqrt{\dfrac{k_{11}^{l}}{k_{22}^{l}}}\; k_{62}^{l}
\end{cases} \tag{A18}
\]
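As a short worked check (a sketch added for illustration, assuming positive diagonal stiffness components and the same sign choice in both lines of Equation (A18)), the third and first equations of the first set give the two expressions in (A18), and the second equation is then satisfied automatically:

```latex
\begin{align*}
(k_{21}^{l})^{2} - k_{11}^{l}k_{22}^{l} = 0
  \;&\Rightarrow\; k_{21}^{l} = \pm\sqrt{k_{11}^{l}k_{22}^{l}},\\
k_{22}^{l}k_{61}^{l} - k_{21}^{l}k_{62}^{l} = 0
  \;&\Rightarrow\; k_{61}^{l} = \frac{k_{21}^{l}}{k_{22}^{l}}\,k_{62}^{l}
     = \pm\sqrt{\frac{k_{11}^{l}}{k_{22}^{l}}}\;k_{62}^{l},\\
k_{21}^{l}k_{61}^{l} - k_{11}^{l}k_{62}^{l}
  &= \frac{(k_{21}^{l})^{2}}{k_{22}^{l}}\,k_{62}^{l} - k_{11}^{l}k_{62}^{l} = 0 .
\end{align*}
```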
The second set of equations contains three equations with five unknowns, so that there
could be many solutions. Here, two simple solutions are proposed, and their answers are
given as follows:
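\[
\begin{cases} k_{64}^{l} = 0 \\ k_{65}^{l} = 0 \end{cases}
\ \text{or}\
\begin{cases}
k_{53}^{l}k_{54}^{l} - k_{43}^{l}k_{55}^{l} = 0 \\
-k_{44}^{l}k_{53}^{l} + k_{43}^{l}k_{54}^{l} = 0 \\
(k_{53}^{l})^{2} - k_{33}^{l}k_{55}^{l} = 0 \\
-k_{43}^{l}k_{53}^{l} + k_{33}^{l}k_{54}^{l} = 0 \\
(k_{43}^{l})^{2} - k_{33}^{l}k_{44}^{l} = 0
\end{cases}
\;\Leftrightarrow\;
\begin{cases} k_{64}^{l} = 0 \\ k_{65}^{l} = 0 \end{cases}
\ \text{or}\
\begin{cases}
k_{43}^{l} = \pm\sqrt{k_{33}^{l}k_{44}^{l}} \\
k_{53}^{l} = \pm\sqrt{k_{33}^{l}k_{55}^{l}} \\
k_{54}^{l} = \pm\sqrt{k_{44}^{l}k_{55}^{l}}
\end{cases} \tag{A19}
\]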
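\[ k_{21}^{l} = \frac{-c_{61}^{l}c_{62}^{l} + c_{21}^{l}c_{66}^{l}}{-2c_{21}^{l}c_{61}^{l}c_{62}^{l} + c_{11}^{l}(c_{62}^{l})^{2} + (c_{21}^{l})^{2}c_{66}^{l} + c_{22}^{l}\left((c_{61}^{l})^{2} - c_{11}^{l}c_{66}^{l}\right)} \tag{A21} \]
\[ k_{61}^{l} = \frac{c_{22}^{l}c_{61}^{l} - c_{21}^{l}c_{62}^{l}}{c_{22}^{l}(c_{61}^{l})^{2} - 2c_{21}^{l}c_{61}^{l}c_{62}^{l} + c_{11}^{l}(c_{62}^{l})^{2} + (c_{21}^{l})^{2}c_{66}^{l} - c_{11}^{l}c_{22}^{l}c_{66}^{l}} \tag{A22} \]
\[ k_{22}^{l} = \frac{(c_{61}^{l})^{2} - c_{11}^{l}c_{66}^{l}}{c_{22}^{l}(c_{61}^{l})^{2} - 2c_{21}^{l}c_{61}^{l}c_{62}^{l} + c_{11}^{l}(c_{62}^{l})^{2} + (c_{21}^{l})^{2}c_{66}^{l} - c_{11}^{l}c_{22}^{l}c_{66}^{l}} \tag{A23} \]
\[ k_{62}^{l} = \frac{-c_{21}^{l}c_{61}^{l} + c_{11}^{l}c_{62}^{l}}{-2c_{21}^{l}c_{61}^{l}c_{62}^{l} + c_{11}^{l}(c_{62}^{l})^{2} + (c_{21}^{l})^{2}c_{66}^{l} + c_{22}^{l}\left((c_{61}^{l})^{2} - c_{11}^{l}c_{66}^{l}\right)} \tag{A24} \]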
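\[ k_{33}^{l} = \frac{(c_{54}^{l})^{2} - c_{44}^{l}c_{55}^{l}}{c_{44}^{l}(c_{53}^{l})^{2} - 2c_{43}^{l}c_{53}^{l}c_{54}^{l} + c_{33}^{l}(c_{54}^{l})^{2} + (c_{43}^{l})^{2}c_{55}^{l} - c_{33}^{l}c_{44}^{l}c_{55}^{l}} \tag{A25} \]
\[ k_{43}^{l} = \frac{-c_{53}^{l}c_{54}^{l} + c_{43}^{l}c_{55}^{l}}{-2c_{43}^{l}c_{53}^{l}c_{54}^{l} + c_{33}^{l}(c_{54}^{l})^{2} + (c_{43}^{l})^{2}c_{55}^{l} + c_{44}^{l}\left((c_{53}^{l})^{2} - c_{33}^{l}c_{55}^{l}\right)} \tag{A26} \]
\[ k_{53}^{l} = \frac{c_{44}^{l}c_{53}^{l} - c_{43}^{l}c_{54}^{l}}{c_{44}^{l}(c_{53}^{l})^{2} - 2c_{43}^{l}c_{53}^{l}c_{54}^{l} + c_{33}^{l}(c_{54}^{l})^{2} + (c_{43}^{l})^{2}c_{55}^{l} - c_{33}^{l}c_{44}^{l}c_{55}^{l}} \tag{A27} \]
\[ k_{44}^{l} = \frac{(c_{53}^{l})^{2} - c_{33}^{l}c_{55}^{l}}{c_{44}^{l}(c_{53}^{l})^{2} - 2c_{43}^{l}c_{53}^{l}c_{54}^{l} + c_{33}^{l}(c_{54}^{l})^{2} + (c_{43}^{l})^{2}c_{55}^{l} - c_{33}^{l}c_{44}^{l}c_{55}^{l}} \tag{A28} \]
\[ k_{54}^{l} = \frac{-c_{43}^{l}c_{53}^{l} + c_{33}^{l}c_{54}^{l}}{-2c_{43}^{l}c_{53}^{l}c_{54}^{l} + c_{33}^{l}(c_{54}^{l})^{2} + (c_{43}^{l})^{2}c_{55}^{l} + c_{44}^{l}\left((c_{53}^{l})^{2} - c_{33}^{l}c_{55}^{l}\right)} \tag{A29} \]
\[ k_{55}^{l} = \frac{(c_{43}^{l})^{2} - c_{33}^{l}c_{44}^{l}}{c_{44}^{l}(c_{53}^{l})^{2} - 2c_{43}^{l}c_{53}^{l}c_{54}^{l} + c_{33}^{l}(c_{54}^{l})^{2} + (c_{43}^{l})^{2}c_{55}^{l} - c_{33}^{l}c_{44}^{l}c_{55}^{l}} \tag{A30} \]
\[ k_{66}^{l} = \frac{(c_{21}^{l})^{2} - c_{11}^{l}c_{22}^{l}}{c_{22}^{l}(c_{61}^{l})^{2} - 2c_{21}^{l}c_{61}^{l}c_{62}^{l} + c_{11}^{l}(c_{62}^{l})^{2} + (c_{21}^{l})^{2}c_{66}^{l} - c_{11}^{l}c_{22}^{l}c_{66}^{l}} \tag{A31} \]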
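The closed-form expressions above can be spot-checked numerically. The sketch below assumes a compliance matrix of the decoupled form of Equation (33), for which the components with indices 1, 2, and 6 form an independent 3 × 3 block; the compliance values are hypothetical. It evaluates Equations (A21) to (A24) and (A31) and verifies that the corresponding rows of the stiffness block, multiplied by the compliance block, reproduce rows of the identity matrix.

```python
# Hypothetical compliance components for the index set {1, 2, 6}.
c11, c21, c22, c61, c62, c66 = 4.0, 0.5, 3.0, 0.2, -0.3, 2.0

# Common denominator of (A22), (A23), (A31); (A21) and (A24) use the same value regrouped.
den = (c22 * c61**2 - 2 * c21 * c61 * c62 + c11 * c62**2
       + c21**2 * c66 - c11 * c22 * c66)

k21 = (-c61 * c62 + c21 * c66) / den   # Equation (A21)
k61 = (c22 * c61 - c21 * c62) / den    # Equation (A22)
k22 = (c61**2 - c11 * c66) / den       # Equation (A23)
k62 = (-c21 * c61 + c11 * c62) / den   # Equation (A24)
k66 = (c21**2 - c11 * c22) / den       # Equation (A31)

# Compliance block ordered as (1, 2, 6).
C = [[c11, c21, c61],
     [c21, c22, c62],
     [c61, c62, c66]]

# Rows 2 and 6 of the stiffness block times C should give (0, 1, 0) and (0, 0, 1).
row2 = [k21 * C[0][j] + k22 * C[1][j] + k62 * C[2][j] for j in range(3)]
row6 = [k61 * C[0][j] + k62 * C[1][j] + k66 * C[2][j] for j in range(3)]
print([round(v, 12) for v in row2], [round(v, 12) for v in row6])
```

The same check applies to the block with indices 3, 4, and 5 using Equations (A25) to (A30).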
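\[ c_{11}^{l} = \frac{1}{2}\sum_{j=1}^{3}\left[c_{11}^{e_j} + c_{22}^{e_j} + 2c_{66}^{e_j} r_{yj}^{2} + \left(c_{11}^{e_j} - c_{22}^{e_j}\right)\cos 2\gamma_j + 4c_{62}^{e_j} r_{yj}\sin\gamma_j\right] \tag{A32} \]
\[ c_{21}^{l} = \sum_{j=1}^{3}\left\{-c_{66}^{e_j} r_{xj} r_{yj} - c_{62}^{e_j} r_{xj}\sin\gamma_j + \cos\gamma_j\left[-c_{62}^{e_j} r_{yj} + \left(c_{11}^{e_j} - c_{22}^{e_j}\right)\sin\gamma_j\right]\right\} \tag{A33} \]
\[ c_{61}^{l} = \sum_{j=1}^{3}\left(-c_{66}^{e_j} r_{yj} - c_{62}^{e_j}\sin\gamma_j\right) \tag{A34} \]
\[ c_{22}^{l} = \frac{1}{2}\sum_{j=1}^{3}\left[c_{11}^{e_j} + c_{22}^{e_j} + 2c_{66}^{e_j} r_{xj}^{2} + 4c_{62}^{e_j} r_{xj}\cos\gamma_j + \left(-c_{11}^{e_j} + c_{22}^{e_j}\right)\cos 2\gamma_j\right] \tag{A35} \]
\[ c_{62}^{l} = \sum_{j=1}^{3}\left(c_{66}^{e_j} r_{xj} + c_{62}^{e_j}\cos\gamma_j\right) \tag{A36} \]
\[
\begin{aligned}
c_{33}^{l} = \frac{1}{2}\sum_{j=1}^{3}\Big\{ & 2c_{33}^{e_j} + \left(c_{44}^{e_j} + c_{55}^{e_j}\right)\left(r_{xj}^{2} + r_{yj}^{2}\right) - \left(c_{44}^{e_j} - c_{55}^{e_j}\right)\left(r_{xj} - r_{yj}\right)\left(r_{xj} + r_{yj}\right)\cos 2\gamma_j \\
& - 4c_{53}^{e_j} r_{yj}\sin\gamma_j - 4r_{xj}\cos\gamma_j\left[c_{53}^{e_j} + \left(c_{44}^{e_j} - c_{55}^{e_j}\right) r_{yj}\sin\gamma_j\right]\Big\}
\end{aligned} \tag{A37}
\]
\[ c_{43}^{l} = \frac{1}{2}\sum_{j=1}^{3}\left[\left(c_{44}^{e_j} + c_{55}^{e_j}\right) r_{yj} - 2c_{53}^{e_j}\sin\gamma_j + \left(c_{44}^{e_j} - c_{55}^{e_j}\right)\left(r_{yj}\cos 2\gamma_j - r_{xj}\sin 2\gamma_j\right)\right] \tag{A38} \]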
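\[ c_{53}^{l} = \frac{1}{2}\sum_{j=1}^{3}\left[-\left(c_{44}^{e_j} + c_{55}^{e_j}\right) r_{xj} + 2c_{53}^{e_j}\cos\gamma_j + \left(c_{44}^{e_j} - c_{55}^{e_j}\right)\left(r_{xj}\cos 2\gamma_j + r_{yj}\sin 2\gamma_j\right)\right] \tag{A39} \]
\[ c_{44}^{l} = \frac{1}{2}\sum_{j=1}^{3}\left[c_{44}^{e_j} + c_{55}^{e_j} + \left(c_{44}^{e_j} - c_{55}^{e_j}\right)\cos 2\gamma_j\right] \tag{A40} \]
\[ c_{54}^{l} = \sum_{j=1}^{3}\left(c_{44}^{e_j} - c_{55}^{e_j}\right)\cos\gamma_j\sin\gamma_j \tag{A41} \]
\[ c_{55}^{l} = \frac{1}{2}\sum_{j=1}^{3}\left[c_{44}^{e_j} + c_{55}^{e_j} + \left(-c_{44}^{e_j} + c_{55}^{e_j}\right)\cos 2\gamma_j\right] \tag{A42} \]
\[ c_{66}^{l} = \sum_{j=1}^{3} c_{66}^{e_j} \tag{A43} \]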
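The rotational terms in Equations (A40) to (A42) have the structure of a planar rotation of the element's bending-compliance block. The following sketch checks this for a single element, assuming that the element's own $c_{54}$ is zero (as in the element matrices of Equation (40)) and that the rotation by γ about the Z″ axis is represented by a standard 2 × 2 rotation matrix; the function names and numerical values are hypothetical.

```python
import math


def rotate_block(c44, c55, gamma):
    """Return R * diag(c44, c55) * R^T for a planar rotation by gamma (radians)."""
    cg, sg = math.cos(gamma), math.sin(gamma)
    R = [[cg, -sg], [sg, cg]]
    D = [[c44, 0.0], [0.0, c55]]
    RD = [[sum(R[i][k] * D[k][j] for k in range(2)) for j in range(2)]
          for i in range(2)]
    return [[sum(RD[i][k] * R[j][k] for k in range(2)) for j in range(2)]
            for i in range(2)]


def closed_form(c44, c55, gamma):
    """Single-element terms of Equations (A40), (A41), and (A42)."""
    t44 = 0.5 * (c44 + c55 + (c44 - c55) * math.cos(2 * gamma))
    t54 = (c44 - c55) * math.cos(gamma) * math.sin(gamma)
    t55 = 0.5 * (c44 + c55 + (-c44 + c55) * math.cos(2 * gamma))
    return t44, t54, t55


# Hypothetical element compliances and rotation angle.
c44_e, c55_e, gamma = 3.0e-3, 7.5e-4, 0.7
rotated = rotate_block(c44_e, c55_e, gamma)
t44, t54, t55 = closed_form(c44_e, c55_e, gamma)
print(abs(rotated[0][0] - t44), abs(rotated[1][0] - t54), abs(rotated[1][1] - t55))
```

Summing these single-element terms over the three elements of a chain gives the leg-level components, and the coupling term $c_{54}^{l}$ vanishes for an element whenever γ is a multiple of 90° or its two bending compliances are equal.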
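\[
\begin{aligned}
c_{21}^{l} = {} & r_{x2}\left(c_{53}^{R} - c_{55}^{R} r_{y2}\right) - c_{66}^{P}\left(r_{x1} r_{y1} + r_{x3} r_{y3}\right) + \cos\gamma_1\left[-c_{62}^{P} r_{y1} + \left(c_{11}^{P} - c_{22}^{P} + \left(-c_{44}^{P} + c_{55}^{P}\right) r_{z1}^{2}\right)\sin\gamma_1\right] \\
& - c_{62}^{P}\left(r_{x1}\sin\gamma_1 + r_{x3}\sin\gamma_3\right) + \cos\gamma_3\left[-c_{62}^{P} r_{y3} + \left(c_{11}^{P} - c_{22}^{P} + \left(-c_{44}^{P} + c_{55}^{P}\right) r_{z3}^{2}\right)\sin\gamma_3\right]
\end{aligned} \tag{A45}
\]
\[
\begin{aligned}
c_{31}^{l} = {} & -c_{44}^{R} r_{x2} r_{z2} - c_{55}^{P} r_{x1} r_{z1}\cos^{2}\gamma_1 - c_{55}^{P} r_{x3} r_{z3}\cos^{2}\gamma_3 + r_{z1}\cos\gamma_1\left[c_{53}^{P} + \left(c_{44}^{P} - c_{55}^{P}\right) r_{y1}\sin\gamma_1\right] \\
& + r_{z3}\cos\gamma_3\left[c_{53}^{P} + \left(c_{44}^{P} - c_{55}^{P}\right) r_{y3}\sin\gamma_3\right] - c_{44}^{P}\left(r_{x1} r_{z1}\sin^{2}\gamma_1 + r_{x3} r_{z3}\sin^{2}\gamma_3\right)
\end{aligned} \tag{A46}
\]
\[ c_{41}^{l} = \left(c_{44}^{P} - c_{55}^{P}\right)\left(r_{z1}\cos\gamma_1\sin\gamma_1 + r_{z3}\cos\gamma_3\sin\gamma_3\right) \tag{A47} \]
\[ c_{51}^{l} = c_{44}^{R} r_{z2} + c_{55}^{P} r_{z1}\cos^{2}\gamma_1 + c_{55}^{P} r_{z3}\cos^{2}\gamma_3 + c_{44}^{P} r_{z1}\sin^{2}\gamma_1 + c_{44}^{P} r_{z3}\sin^{2}\gamma_3 \tag{A48} \]
\[ c_{61}^{l} = 2c_{53}^{R} - c_{55}^{R} r_{y2} - c_{66}^{P}\left(r_{y1} + r_{y3}\right) - c_{62}^{P}\left(\sin\gamma_1 + \sin\gamma_3\right) \tag{A49} \]
\[
\begin{aligned}
c_{22}^{l} = {} & 2c_{11}^{R} + c_{55}^{R} r_{x2}^{2} + c_{66}^{P}\left(r_{x1}^{2} + r_{x3}^{2}\right) + c_{66}^{R} r_{z2}^{2} + 2c_{62}^{P} r_{x1}\cos\gamma_1 + \left(c_{22}^{P} + c_{44}^{P} r_{z1}^{2}\right)\cos^{2}\gamma_1 + 2c_{62}^{P} r_{x3}\cos\gamma_3 \\
& + \left(c_{22}^{P} + c_{44}^{P} r_{z3}^{2}\right)\cos^{2}\gamma_3 + \left(c_{11}^{P} + c_{55}^{P} r_{z1}^{2}\right)\sin^{2}\gamma_1 + \left(c_{11}^{P} + c_{55}^{P} r_{z3}^{2}\right)\sin^{2}\gamma_3
\end{aligned} \tag{A50}
\]
\[
\begin{aligned}
c_{32}^{l} = {} & -\left(c_{62}^{R} + c_{66}^{R} r_{y2}\right) r_{z2} - c_{44}^{P} r_{y1} r_{z1}\cos^{2}\gamma_1 + \left(c_{44}^{P} - c_{55}^{P}\right) r_{x1} r_{z1}\cos\gamma_1\sin\gamma_1 + r_{z1}\sin\gamma_1\left(c_{53}^{P} - c_{55}^{P} r_{y1}\sin\gamma_1\right) \\
& + r_{z3}\left[-c_{44}^{P} r_{y3}\cos^{2}\gamma_3 + \left(c_{44}^{P} - c_{55}^{P}\right) r_{x3}\cos\gamma_3\sin\gamma_3 + \sin\gamma_3\left(c_{53}^{P} - c_{55}^{P} r_{y3}\sin\gamma_3\right)\right]
\end{aligned} \tag{A51}
\]
\[ c_{42}^{l} = -c_{66}^{R} r_{z2} - c_{44}^{P} r_{z1}\cos^{2}\gamma_1 - c_{44}^{P} r_{z3}\cos^{2}\gamma_3 - c_{55}^{P}\left(r_{z1}\sin^{2}\gamma_1 + r_{z3}\sin^{2}\gamma_3\right) \tag{A52} \]
\[ c_{52}^{l} = -\left(c_{44}^{P} - c_{55}^{P}\right)\left(r_{z1}\cos\gamma_1\sin\gamma_1 + r_{z3}\cos\gamma_3\sin\gamma_3\right) \tag{A53} \]
\[ c_{62}^{l} = c_{55}^{R} r_{x2} + c_{66}^{P}\left(r_{x1} + r_{x3}\right) + c_{62}^{P}\left(\cos\gamma_1 + \cos\gamma_3\right) \tag{A54} \]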
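\[
\begin{aligned}
c_{33}^{l} = {} & 2c_{33}^{P} + 2c_{22}^{R} + c_{44}^{R} r_{x2}^{2} + r_{y2}\left(c_{62}^{R} + c_{66}^{R} r_{y2}\right) + c_{44}^{P}\left(r_{y1}\cos\gamma_1 - r_{x1}\sin\gamma_1\right)^{2} - c_{53}^{P}\left(r_{x1}\cos\gamma_1 + r_{y1}\sin\gamma_1\right) \\
& + \left(r_{x1}\cos\gamma_1 + r_{y1}\sin\gamma_1\right)\left(-c_{53}^{P} + c_{55}^{P} r_{x1}\cos\gamma_1 + c_{55}^{P} r_{y1}\sin\gamma_1\right) + c_{44}^{P}\left(r_{y3}\cos\gamma_3 - r_{x3}\sin\gamma_3\right)^{2} \\
& - c_{53}^{P}\left(r_{x3}\cos\gamma_3 + r_{y3}\sin\gamma_3\right) + \left(r_{x3}\cos\gamma_3 + r_{y3}\sin\gamma_3\right)\left(-c_{53}^{P} + c_{55}^{P} r_{x3}\cos\gamma_3 + c_{55}^{P} r_{y3}\sin\gamma_3\right)
\end{aligned} \tag{A55}
\]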
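\[
\begin{aligned}
c_{43}^{l} = {} & 2c_{62}^{R} + c_{66}^{R} r_{y2} + c_{44}^{P} r_{y1}\cos^{2}\gamma_1 + c_{44}^{P} r_{y3}\cos^{2}\gamma_3 + \left(-c_{44}^{P} + c_{55}^{P}\right) r_{x1}\cos\gamma_1\sin\gamma_1 + c_{55}^{P} r_{y1}\sin^{2}\gamma_1 \\
& + \left(-c_{44}^{P} + c_{55}^{P}\right) r_{x3}\cos\gamma_3\sin\gamma_3 + c_{55}^{P} r_{y3}\sin^{2}\gamma_3 - c_{53}^{P}\left(\sin\gamma_1 + \sin\gamma_3\right)
\end{aligned} \tag{A56}
\]
\[
\begin{aligned}
c_{53}^{l} = {} & -c_{44}^{R} r_{x2} - c_{55}^{P} r_{x1}\cos^{2}\gamma_1 - c_{55}^{P} r_{x3}\cos^{2}\gamma_3 + \cos\gamma_1\left[c_{53}^{P} + \left(c_{44}^{P} - c_{55}^{P}\right) r_{y1}\sin\gamma_1\right] \\
& + \cos\gamma_3\left[c_{53}^{P} + \left(c_{44}^{P} - c_{55}^{P}\right) r_{y3}\sin\gamma_3\right] - c_{44}^{P}\left(r_{x1}\sin^{2}\gamma_1 + r_{x3}\sin^{2}\gamma_3\right)
\end{aligned} \tag{A57}
\]
\[ c_{44}^{l} = 2c_{66}^{R} + c_{44}^{P}\left(\cos^{2}\gamma_1 + \cos^{2}\gamma_3\right) + c_{55}^{P}\left(\sin^{2}\gamma_1 + \sin^{2}\gamma_3\right) \tag{A58} \]
\[ c_{54}^{l} = \left(c_{44}^{P} - c_{55}^{P}\right)\left(\cos\gamma_1\sin\gamma_1 + \cos\gamma_3\sin\gamma_3\right) \tag{A59} \]
\[ c_{55}^{l} = 2c_{44}^{R} + c_{55}^{P}\left(\cos^{2}\gamma_1 + \cos^{2}\gamma_3\right) + c_{44}^{P}\left(\sin^{2}\gamma_1 + \sin^{2}\gamma_3\right) \tag{A60} \]
\[ c_{66}^{l} = 2\left(c_{66}^{P} + c_{55}^{R}\right) \tag{A61} \]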
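\[ c_{21}^{sc} = \frac{1}{2}\left[2c_{11}^{S} - 2c_{22}^{S} + 2c_{53}^{P} r_{x1} - \sqrt{2}\,c_{62}^{S}\left(r_{x2} + r_{x3}\right) - \left(c_{44}^{S} - c_{55}^{S}\right)\left(r_{z2}^{2} + r_{z3}^{2}\right)\right] \tag{A63} \]
\[ c_{31}^{sc} = \frac{1}{2}\left[-2c_{44}^{P} r_{x1} r_{z1} + \sqrt{2}\,c_{53}^{S}\left(r_{z2} + r_{z3}\right) - \left(c_{44}^{S} + c_{55}^{S}\right)\left(r_{x2} r_{z2} + r_{x3} r_{z3}\right)\right] \tag{A64} \]
\[ c_{41}^{sc} = \frac{1}{2}\left(c_{44}^{S} - c_{55}^{S}\right)\left(r_{z2} + r_{z3}\right) \tag{A65} \]
\[ c_{51}^{sc} = \frac{1}{2}\left[2c_{44}^{P} r_{z1} + \left(c_{44}^{S} + c_{55}^{S}\right)\left(r_{z2} + r_{z3}\right)\right] \tag{A66} \]
\[ c_{61}^{sc} = c_{53}^{P} - \sqrt{2}\,c_{62}^{S} \tag{A67} \]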
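\[ c_{22}^{sc} = c_{11}^{P} + c_{11}^{S} + \frac{1}{2}\left[2c_{22}^{S} + 2c_{55}^{P} r_{x1}^{2} + 2\sqrt{2}\,c_{62}^{S}\left(r_{x2} + r_{x3}\right) + 2c_{66}^{S}\left(r_{x2}^{2} + r_{x3}^{2}\right) + 2c_{66}^{P} r_{z1}^{2} + \left(c_{44}^{S} + c_{55}^{S}\right)\left(r_{z2}^{2} + r_{z3}^{2}\right)\right] \tag{A68} \]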
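\[ c_{32}^{sc} = -c_{62}^{P} r_{z1} + \frac{1}{2}\left[\sqrt{2}\,c_{53}^{S}\left(r_{z2} + r_{z3}\right) + \left(c_{44}^{S} - c_{55}^{S}\right)\left(r_{x2} r_{z2} + r_{x3} r_{z3}\right)\right] \tag{A69} \]
\[ c_{42}^{sc} = \frac{1}{2}\left[-2c_{66}^{P} r_{z1} - \left(c_{44}^{S} + c_{55}^{S}\right)\left(r_{z2} + r_{z3}\right)\right] \tag{A70} \]
\[ c_{52}^{sc} = -\frac{1}{2}\left(c_{44}^{S} - c_{55}^{S}\right)\left(r_{z2} + r_{z3}\right) \tag{A71} \]
\[ c_{62}^{sc} = \sqrt{2}\,c_{62}^{S} + c_{55}^{P} r_{x1} + c_{66}^{S}\left(r_{x2} + r_{x3}\right) \tag{A72} \]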
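\[ c_{33}^{sc} = \frac{1}{2}\left[2c_{22}^{P} + 4c_{33}^{S} + 2c_{44}^{P} r_{x1}^{2} - 2\sqrt{2}\,c_{53}^{S}\left(r_{x2} + r_{x3}\right) + \left(c_{44}^{S} + c_{55}^{S}\right)\left(r_{x2}^{2} + r_{x3}^{2}\right)\right] \tag{A73} \]
\[ c_{43}^{sc} = \frac{1}{2}\left[2c_{62}^{P} - 2\sqrt{2}\,c_{53}^{S} - \left(c_{44}^{S} - c_{55}^{S}\right)\left(r_{x2} + r_{x3}\right)\right] \tag{A74} \]
\[ c_{53}^{sc} = \frac{1}{2}\left[2\sqrt{2}\,c_{53}^{S} - 2c_{44}^{P} r_{x1} - \left(c_{44}^{S} + c_{55}^{S}\right)\left(r_{x2} + r_{x3}\right)\right] \tag{A75} \]
\[ c_{44}^{sc} = c_{66}^{P} + c_{44}^{S} + c_{55}^{S} \tag{A76} \]
\[ c_{54}^{sc} = c_{44}^{S} - c_{55}^{S} \tag{A77} \]
\[ c_{55}^{sc} = c_{44}^{P} + c_{44}^{S} + c_{55}^{S} \tag{A78} \]
\[ c_{66}^{sc} = c_{55}^{P} + 2c_{66}^{S} \tag{A79} \]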
References
1. Wang, D.H.; Yang, Q.; Dong, H.M. A Monolithic Compliant Piezoelectric-Driven Microgripper: Design, Modeling, and Testing.
IEEE/ASME Trans. Mechatron. 2013, 18, 138–147. [CrossRef]
2. Wang, F.; Liang, C.; Tian, Y.; Zhao, X.; Zhang, D. Design and Control of a Compliant Microgripper with a Large Amplification
Ratio for High-Speed Micro Manipulation. IEEE/ASME Trans. Mechatron. 2016, 21, 1262–1271. [CrossRef]
3. Teo, T.J.; Yang, G.; Chen, I.M. A flexure-based electromagnetic nanopositioning actuator with predictable and re-configurable
open-loop positioning resolution. Precis. Eng. 2015, 40, 249–260. [CrossRef]
4. Teo, T.J.; Bui, V.P.; Yang, G.; Chen, I.M. Millimeters-Stroke Nanopositioning Actuator with High Positioning and Thermal Stability.
IEEE/ASME Trans. Mechatron. 2015, 20, 2813–2823. [CrossRef]
5. Teo, T.J.; Chen, I.M.; Yang, G. A large deflection and high payload flexure-based parallel manipulator for UV nanoimprint
lithography: Part II. Stiffness modeling and performance evaluation. Precis. Eng. 2014, 38, 872–884. [CrossRef]
6. Pham, M.T.; Teo, T.J.; Yeo, S.H.; Wang, P.; Nai, M.L.S. A 3D-printed Ti-6Al-4V 3-DOF compliant parallel mechanism for high
precision manipulation. IEEE/ASME Trans. Mechatron. 2017, 22, 2359–2368. [CrossRef]
7. Pham, M.T.; Yeo, S.H.; Teo, T.J.; Wang, P.; Nai, M.L.S. Design and Optimization of a Three Degrees-of-Freedom Spatial Motion
Compliant Parallel Mechanism with Fully Decoupled Motion Characteristics. J. Mech. Robot. 2019, 11, 051010. [CrossRef]
8. Teo, T.J.; Yang, G.; Chen, I.M. Compliant Manipulators. In Handbook of Manufacturing Engineering and Technology; Springer:
London, UK, 2014; pp. 2229–2300.
9. Thomas, T.L.; Kalpathy Venkiteswaran, V.; Ananthasuresh, G.K.; Misra, S. Surgical Applications of Compliant Mechanisms: A
Review. J. Mech. Robot. 2021, 13, 020801. [CrossRef]
10. Huang, H.; Pan, Y.; Pang, Y.; Shen, H.; Gao, X.; Zhu, Y.; Chen, L.; Sun, L. Piezoelectric Ultrasonic Biological Microdissection
Device Based on a Novel Flexure Mechanism for Suppressing Vibration. Micromachines 2021, 12, 196. [CrossRef]
11. Shinde, S.M.; Lekurwale, R.R. Synthesising of flexural spindle head micro drilling machine tool in PLM environment. Int. J.
Virtual Technol. Multimed. 2021, 1, 246–264. [CrossRef]
12. Shinde, S.M.; Lekurwale, R.R. Radial stiffness computation of single Archimedes spiral plane supporting spring loaded in flexural
mechanism mounted in spindle head of micro drilling machine tool. Mech. Based Des. Struct. Mach. 2022, 1–21. [CrossRef]
13. Lv, B.; Lin, B.; Cao, Z.; Li, B.; Wang, G. A parallel 3-DOF micro-nano motion stage for vibration-assisted milling. Mech. Mach.
Theory 2022, 173, 104854. [CrossRef]
14. Gandhi, P.; Bhole, K. 3D Microfabrication Using Bulk Lithography. In Proceedings of the ASME 2011 International Mechanical
Engineering Congress and Exposition, Denver, CO, USA, 11–17 November 2011; pp. 393–399.
15. Gandhi, P.; Deshmukh, S.; Ramtekkar, R.; Bhole, K.; Baraki, A. “On-Axis” Linear Focused Spot Scanning Microstereolithography
System: Optomechatronic Design, Analysis and Development. J. Adv. Manuf. Syst. 2013, 12, 43–68. [CrossRef]
16. Tanikawa, T.; Arai, T.; Koyachi, N. Development of small-sized 3 DOF finger module in micro hand for micro manipulation. In
Proceedings of the 1999 IEEE/RSJ International Conference on Intelligent Robots and Systems, Kyongju, Korea, 17–21 October
1999; pp. 876–881.
17. Tanikawa, T.; Ukiana, M.; Morita, K.; Koseki, Y.; Ohba, K.; Fujii, K.; Arai, T. Design of 3-DOF parallel mechanism with thin plate
for micro finger module in micro manipulation. In Proceedings of the IEEE/RSJ International Conference on Intelligent Robots
and Systems, Lausanne, Switzerland, 30 September–4 October 2002; pp. 1778–1783.
18. Yi, B.-J.; Chung, G.B.; Na, H.Y.; Kim, W.K.; Suh, I.H. Design and experiment of a 3-DOF parallel micromechanism utilizing flexure
hinges. IEEE Trans. Robot. Autom. 2003, 19, 604–612. [CrossRef]
19. Pham, H.-H.; Chen, I.-M.; Yeh, H.-C. Micro-motion selective-actuation XYZ flexure parallel mechanism: Design and modeling. J.
Micromechatronics 2003, 3, 51–73. [CrossRef]
20. Lu, T.-F.; Handley, D.C.; Yong, Y.K.; Eales, C. A three-DOF compliant micromotion stage with flexure hinges. Ind. Robot. Int. J.
2004, 31, 355–361. [CrossRef]
21. Chao, D.; Zong, G.; Liu, R. Design of a 6-DOF compliant manipulator based on serial-parallel architecture. In Proceedings of the
2005 IEEE/ASME International Conference on Advanced Intelligent Mechatronics, Monterey, CA, USA, 24–28 July 2005; pp.
765–770.
22. Pham, H.-H.; Chen, I.M. Stiffness modeling of flexure parallel mechanism. Precis. Eng. 2005, 29, 467–478. [CrossRef]
23. Choi, Y.-J.; Sreenivasan, S.V.; Choi, B.J. Kinematic design of large displacement precision XY positioning stage by using cross strip
flexure joints and over-constrained mechanism. Mech. Mach. Theory 2008, 43, 724–737. [CrossRef]
24. Yao, Q.; Dong, J.; Ferreira, P.M. A novel parallel-kinematics mechanisms for integrated, multi-axis nanopositioning: Part 1.
Kinematics and design for fabrication. Precis. Eng. 2008, 32, 7–19. [CrossRef]
25. Dong, J.; Yao, Q.; Ferreira, P.M. A novel parallel-kinematics mechanism for integrated, multi-axis nanopositioning: Part 2:
Dynamics, control and performance analysis. Precis. Eng. 2008, 32, 20–33. [CrossRef]
26. Dong, W.; Sun, L.; Du, Z. Stiffness research on a high-precision, large-workspace parallel mechanism with compliant joints. Precis.
Eng. 2008, 32, 222–231. [CrossRef]
27. Wang, H.; Zhang, X. Input coupling analysis and optimal design of a 3-DOF compliant micro-positioning stage. Mech. Mach.
Theory 2008, 43, 400–410. [CrossRef]
28. Wu, T.L.; Chen, J.H.; Chang, S.H. A six-DOF prismatic-spherical-spherical parallel compliant nanopositioner. IEEE Trans. Ultrason.
Ferroelectr. Freq. Control. 2008, 55, 2544–2551. [CrossRef] [PubMed]
29. Yong, Y.K.; Lu, T.-F. Kinetostatic modeling of 3-RRR compliant micro-motion stages with flexure hinges. Mech. Mach. Theory 2009,
44, 1156–1175. [CrossRef]
30. Li, Y.; Xu, Q. A Totally Decoupled Piezo-Driven XYZ Flexure Parallel Micropositioning Stage for Micro/Nanomanipulation. IEEE
Trans. Autom. Sci. Eng. 2011, 8, 265–279. [CrossRef]
31. Liang, Q.; Zhang, D.; Chi, Z.; Song, Q.; Ge, Y.; Ge, Y. Six-DOF micro-manipulator based on compliant parallel mechanism with
integrated force sensor. Robot. Comput.-Integr. Manuf. 2011, 27, 124–134. [CrossRef]
32. Yang, G.; Teo, T.J.; Chen, I.M.; Lin, W. Analysis and design of a 3-DOF flexure-based zero-torsion parallel manipulator for
nano-alignment applications. In Proceedings of the 2011 IEEE International Conference on Robotics and Automation (ICRA),
Shanghai, China, 9–13 May 2011; pp. 2751–2756.
33. Yun, Y.; Li, Y. Optimal design of a 3-PUPU parallel robot with compliant hinges for micromanipulation in a cubic workspace.
Robot. Comput.-Integr. Manuf. 2011, 27, 977–985. [CrossRef]
34. Li, Y.; Huang, J.; Tang, H. A Compliant Parallel XY Micromotion Stage with Complete Kinematic Decoupling. IEEE Trans. Autom.
Sci. Eng. 2012, 9, 538–553. [CrossRef]
35. Kim, H.-Y.; Ahn, D.-H.; Gweon, D.-G. Development of a novel 3-degrees of freedom flexure based positioning system. Rev. Sci.
Instrum. 2012, 83, 055114. [CrossRef]
36. Xiao, S.; Li, Y. Optimal Design, Fabrication, and Control of an XY Micropositioning Stage Driven by Electromagnetic Actuators.
IEEE Trans. Ind. Electron. 2013, 60, 4613–4626. [CrossRef]
37. Bhagat, U.; Shirinzadeh, B.; Clark, L.; Chea, P.; Qin, Y.; Tian, Y.; Zhang, D. Design and analysis of a novel flexure-based 3-DOF
mechanism. Mech. Mach. Theory 2014, 74, 173–187. [CrossRef]
38. Clark, L.; Shirinzadeh, B.; Zhong, Y.; Tian, Y.; Zhang, D. Design and analysis of a compact flexure-based precision pure rotation
stage without actuator redundancy. Mech. Mach. Theory 2016, 105, 129–144. [CrossRef]
39. Hao, G.; Yu, J. Design, modelling and analysis of a completely-decoupled XY compliant parallel manipulator. Mech. Mach. Theory
2016, 102, 179–195. [CrossRef]
40. Li, Y.; Wu, Z. Design, analysis and simulation of a novel 3-DOF translational micromanipulator based on the PRB model. Mech.
Mach. Theory 2016, 100, 235–258. [CrossRef]
41. Wang, R.; Zhang, X. A planar 3-DOF nanopositioning platform with large magnification. Precis. Eng. 2016, 46, 221–231. [CrossRef]
42. Culpepper, M.L.; Anderson, G. Design of a low-cost nano-manipulator which utilizes a monolithic, spatial compliant mechanism.
Precis. Eng. 2004, 28, 469–482. [CrossRef]
43. Chen, S.-C.; Culpepper, M.L. Design of a six-axis micro-scale nanopositioner—μHexFlex. Precis. Eng. 2006, 30, 314–324. [CrossRef]
44. Awtar, S.; Slocum, A.H. Constraint-Based Design of Parallel Kinematic XY Flexure Mechanisms. J. Mech. Des. 2006, 129, 816–830.
[CrossRef]
45. Hopkins, J.B.; Culpepper, M.L. Synthesis of multi-degree of freedom, parallel flexure system concepts via Freedom and Constraint
Topology (FACT)—Part I: Principles. Precis. Eng. 2010, 34, 259–270. [CrossRef]
46. Hopkins, J.B.; Culpepper, M.L. Synthesis of multi-degree of freedom, parallel flexure system concepts via freedom and constraint
topology (FACT). Part II: Practice. Precis. Eng. 2010, 34, 271–278. [CrossRef]
47. Hopkins, J.B.; Culpepper, M.L. Synthesis of precision serial flexure systems using freedom and constraint topologies (FACT).
Precis. Eng. 2011, 35, 638–649. [CrossRef]
48. Awtar, S.; Ustick, J.; Sen, S. An XYZ Parallel-Kinematic Flexure Mechanism with Geometrically Decoupled Degrees of Freedom. J.
Mech. Robot. 2012, 5, 015001. [CrossRef]
Mathematics 2022, 10, 1414
49. Hopkins, J.B.; Panas, R.M. Design of flexure-based precision transmission mechanisms using screw theory. Precis. Eng. 2013, 37,
299–307. [CrossRef]
50. Li, H.; Hao, G. A constraint and position identification (CPI) approach for the synthesis of decoupled spatial translational
compliant parallel manipulators. Mech. Mach. Theory 2015, 90, 59–83. [CrossRef]
51. Li, H.; Hao, G.; Kavanagh, R. A New XYZ Compliant Parallel Mechanism for Micro-/Nano-Manipulation: Design and Analysis.
Micromachines 2016, 7, 23. [CrossRef]
52. Hao, G. Design and analysis of symmetric and compact 2R1T (in-plane 3-DOC) flexure parallel mechanisms. Mech. Sci. 2017, 8,
1–9. [CrossRef]
53. Lum, G.Z.; Teo, T.J.; Yeo, S.H.; Yang, G.; Sitti, M. Structural optimization for flexure-based parallel mechanisms—Towards
achieving optimal dynamic and stiffness properties. Precis. Eng. 2015, 42, 195–207. [CrossRef]
54. Jin, M.; Zhang, X. A new topology optimization method for planar compliant parallel mechanisms. Mech. Mach. Theory 2016, 95,
42–58. [CrossRef]
55. Lum, G.Z.; Pham, M.T.; Teo, T.J.; Yang, G.; Yeo, S.H.; Sitti, M. An XY & thetaZ flexure mechanism with optimal stiffness properties.
In Proceedings of the 2017 IEEE International Conference on Advanced Intelligent Mechatronics (AIM), Munich, Germany, 3–7
July 2017; pp. 1103–1110.
56. Pham, M.T.; Yeo, S.H.; Teo, T.J.; Wang, P.; Nai, M.L.S. A Decoupled 6-DOF Compliant Parallel Mechanism with Optimized
Dynamic Characteristics Using Cellular Structure. Machines 2021, 9, 5. [CrossRef]
57. Lobontiu, N. Compliant Mechanisms: Design of Flexure Hinges; CRC Press: Boca Raton, FL, USA, 2010.
58. Akbari, S.; Pirbodaghi, T. Precision positioning using a novel six axes compliant nano-manipulator. Microsyst. Technol. 2017, 23,
2499–2507. [CrossRef]
59. de Jong, B.R.; Brouwer, D.M.; de Boer, M.J.; Jansen, H.V.; Soemers, H.M.; Krijnen, G.J. Design and Fabrication of a Planar
Three-DOFs MEMS-Based Manipulator. J. Microelectromech. Syst. 2010, 19, 1116–1130. [CrossRef]
60. Mukhopadhyay, D.; Dong, J.; Pengwang, E.; Ferreira, P. A SOI-MEMS-based 3-DOF planar parallel-kinematics nanopositioning
stage. Sens. Actuators A Phys. 2008, 147, 340–351. [CrossRef]
61. Lee, K.M.; Arjunan, S. A three-degrees-of-freedom micromotion in-parallel actuated manipulator. IEEE Trans. Robot. Autom. 1991,
7, 634–641. [CrossRef]
62. Teo, T.J. Flexure-Based Electromagnetic Parallel-Kinematics Manipulator System. Ph.D. Thesis, Nanyang Technological University,
Singapore, 2009.
63. Pham, M.T. Design and 3D Printing of Compliant Mechanisms. Ph.D. Thesis, Nanyang Technological University, Singapore, 2019.
Article
Predictive Suspension Algorithm for Land Vehicles over
Deterministic Topography
Alejandro Bustos *, Jesus Meneses, Higinio Rubio and Enrique Soriano-Heras
Department of Mechanical Engineering, Universidad Carlos III de Madrid, 28911 Leganes, Spain;
meneses@ing.uc3m.es (J.M.); hrubio@ing.uc3m.es (H.R.); esoriano@ing.uc3m.es (E.S.-H.)
* Correspondence: albustos@ing.uc3m.es
Abstract: A good suspension system is mandatory for ensuring stability, comfort and safety in land
vehicles; therefore, advanced semi and fully active suspension systems have been developed along
with their associated management strategies to overcome the limitations of passive suspensions.
This paper presents a suspension algorithm for land vehicles traveling through a deterministic
topography. The kinematics of a half-vehicle model and the algorithm are implemented in Simulink.
The algorithm’s inputs are the measurements provided by a position scanner located on the front
wheel of the vehicle. Based on this input, the algorithm reconstructs the topography in real-time and
sends the corresponding command to an actuator located on the rear wheel to compensate for the
irregularities of the terrain. The actuation is governed by the parameter “D”, which represents the
distance over which the algorithm averages the height of the terrain. Two ground profiles were tested
and sensitivity analysis of the parameter “D” was performed. Results show that larger values of “D”
usually yield less vibration on the actuated mass, but this value also depends on the irregularities of
the terrain.
Mathematics 2022, 10, 1467
simulating the vehicle and the response of the algorithm in two types of terrain. The fourth
and last section presents the conclusions of this work.
$$E = \frac{1}{2} m_1 v_1^2 + \frac{1}{2} I_1 \omega_1^2 + m_1 g\, y_1 + \frac{1}{2} m_2 v_2^2 + \frac{1}{2} I_2 \omega_2^2 + m_2 g\, y_2 \tag{1}$$
where E is the mechanical energy, m1 and m2 are the masses of the wheels, I1 and I2 are
their moments of inertia, v1 and v2 are the linear velocities of the wheels, ω1 and ω2 are
the angular speeds of the wheels, y1 and y2 are the vertical positions of the wheels and g is
the gravitational acceleration.
By assuming pure rolling motion and the wheels as solid disks, Equation (1) can be
simplified; therefore, the initial energy of the system is obtained from Equation (2), where
the subscript 0 means “initial conditions”.
$$E_0 = \frac{1}{2}\cdot 2 m v_0^2 + \frac{1}{4}\cdot 2 m v_0^2 + 2 m g\, y_0 = \frac{3}{2} m v_0^2 + 2 m g\, y_0 \tag{2}$$
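The factor 1/4 in Equation (2) can be checked with a short worked step. Assuming a solid disk of mass m and radius r that rolls without slipping (the symbol r for the wheel radius is used here for illustration), the moment of inertia is $I = \frac{1}{2} m r^2$ and $\omega = v / r$, so the rotational and total kinetic energy of one wheel are

$$\frac{1}{2} I \omega^2 = \frac{1}{2}\left(\frac{1}{2} m r^2\right)\left(\frac{v}{r}\right)^2 = \frac{1}{4} m v^2, \qquad \frac{1}{2} m v^2 + \frac{1}{4} m v^2 = \frac{3}{4} m v^2 .$$

For two identical wheels starting at the same speed $v_0$, the kinetic part is $2 \cdot \frac{3}{4} m v_0^2 = \frac{3}{2} m v_0^2$. With the wheel mass of 60 kg and the initial horizontal velocity of 3 m/s listed in Table 1, this gives $\frac{3}{2} \cdot 60 \cdot 3^2 = 810$ J.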
The distance between the front and rear wheels (that is, the wheelbase) L is known,
so the rear wheel must lie on a circle of radius L centered on the front wheel.
The position of the rear wheel is obtained by computing the intersection of this
circle with the trajectory described by the ground function y = f(x). Hence, the
solution of the system given by Equation (3) yields the position of the rear wheel.
$$\begin{cases} (x_2 - x_1)^2 + (y_2 - y_1)^2 = L^2 \\ y = f(x) \end{cases} \tag{3}$$
This system of equations is solved by iteration, as the ground function can be any type
of function, even a non-analytical one. The flowchart describing the iteration process is shown
in Figure 1.
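As a rough illustration of how Equation (3) can be solved numerically, the sketch below uses a bisection search. This is an assumed stand-in, not necessarily the iteration of Figure 1; the function name, the search bracket and the fixed iteration count are hypothetical choices.

```python
import math


def rear_wheel_position(f, x1, L, iterations=60):
    """Return (x2, y2) on the ground y = f(x) at distance L behind (x1, f(x1)).

    Bisection on g(x2) = (x2 - x1)^2 + (f(x2) - f(x1))^2 - L^2 over [x1 - L, x1].
    Assumes f is continuous; on rough ground several intersections may exist
    and only one of them is returned.
    """
    y1 = f(x1)

    def g(x2):
        return (x2 - x1) ** 2 + (f(x2) - y1) ** 2 - L ** 2

    # g(lo) >= 0 because the horizontal gap alone equals L; g(hi) = -L^2 < 0.
    lo, hi = x1 - L, x1
    for _ in range(iterations):
        mid = 0.5 * (lo + hi)
        if g(mid) >= 0.0:
            lo = mid
        else:
            hi = mid
    x2 = 0.5 * (lo + hi)
    return x2, f(x2)
```

On flat ground the result reduces to x2 = x1 − L, which is a convenient sanity check before trying an uneven profile.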
The velocity of both wheels is computed from the conservation of energy, the time
derivative of the constraint equation and the ground function, obtaining the equation
system built up by Equations (4)–(6).
$$E_0 - (m_1 g\, y_1 + m_2 g\, y_2) = \frac{3}{4} m_1 v_1^2 + \frac{3}{4} m_2 v_2^2 \tag{4}$$
$$2(\dot{x}_2 - \dot{x}_1)(x_2 - x_1) + 2(\dot{y}_2 - \dot{y}_1)(y_2 - y_1) = 0 \tag{5}$$
$$\dot{y} = \frac{d f(x)}{dt} \tag{6}$$
Figure 1. Flowchart for computing the position of the rear wheel.
To solve this system more efficiently, polar coordinates are used. This implies
defining the pitch angle θ as in Equation (7).
$$\theta = \pi + \arctan\frac{y_1 - y_2}{x_1 - x_2} \tag{7}$$
The combination of the conservation of energy with the time derivatives of the ground
function and the constraint equation in polar coordinates yields Equations (9)–(12), which
allow all the components of the speed of both wheels to be computed.
As before, this system of equations is solved by iteration following the flowchart
in Figure 2.
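$$E_0 - (m_1 g\, y_1 + m_2 g\, y_2) = \frac{3}{4} m_1 v_1^2 + \frac{3}{4} m_2 v_2^2 = \frac{3}{4} m_1 \left(\dot{x}_1^2 + \dot{y}_1^2\right) + \frac{3}{4} m_2 \left(\dot{x}_2^2 + \dot{y}_2^2\right) \tag{9}$$
$$\dot{y}_1 = \frac{d f(x_1)}{dt} \tag{10}$$
$$\dot{x}_2 = \dot{x}_1 - L\,\dot{\theta}\,\sin\theta \tag{11}$$
$$\dot{y}_2 = \dot{y}_1 + L\,\dot{\theta}\,\cos\theta \tag{12}$$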
Two different ground topographies and several values of the parameter D were
proposed for testing the algorithm. The first topography (profile 1) simulates a wiggly
surface and is defined by Equation (13)—where A1 is the amplitude of the wave and λ1 is
the wavelength, both in meters.
$$y = A_1 \sin\left(\frac{2\pi}{\lambda_1} x\right) \tag{13}$$
The model parameters and initial conditions used in the simulations of both topography
profiles are summarized in Table 1. The input values of both topography profiles are
listed in Table 2, together with the computed values of variables A0 and a.
Table 1. Model parameters and initial conditions.
Parameter                       Value
Simulation time                 10 s
Horizontal velocity at t = 0    3 m/s
Vertical velocity at t = 0      0 m/s
Wheel diameter                  1 m
Wheel mass                      60 kg
Wheelbase                       3 m

Table 2. Input and computed values of the topography profiles.
Profile 1                Profile 2
Parameter   Value        Parameter   Value
A1          0.05 m       A1          0.05 m
λ1          4 m          λ1          2 m
                         A2          −0.25 m
                         λ2          10 m
                         A0          0 m
                         a           0 m
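$$\dot{y}_n = \dot{y}_{n-1} + \ddot{y}_{n-1}\,\Delta t + \frac{1}{2}\dddot{y}_n\,\Delta t^2 = \dot{y}_{n-1} + \frac{1}{2}\left(\ddot{y}_{n-1} + \ddot{y}_n\right)\Delta t \tag{18}$$
$$y_n = y_{n-1} + \dot{y}_{n-1}\,\Delta t + \frac{1}{2}\ddot{y}_{n-1}\,\Delta t^2 + \frac{1}{6}\dddot{y}_n\,\Delta t^3 = y_{n-1} + \dot{y}_{n-1}\,\Delta t + \frac{1}{6}\left(2\ddot{y}_{n-1} + \ddot{y}_n\right)\Delta t^2 \tag{19}$$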
The corresponding velocity and position errors grow with time, nΔt, and with
time squared, (nΔt)², respectively. This is not a serious problem, because
the algorithm does not use the absolute height but its variation over a certain distance,
“D”. In other words, the database of values obtained from Equations (18) and (19) is
refreshed dynamically, so that n, and therefore the time interval nΔt, does not increase
indefinitely.
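The integration scheme of Equations (18) and (19) can be sketched in a few lines of Python. The function names are hypothetical. The jerk over one step is taken as the finite difference of consecutive acceleration readings, which is what the second equality in each equation implies.

```python
def integrate_step(y_prev, v_prev, a_prev, a_now, dt):
    """Advance vertical position and velocity by one sample.

    Uses the right-hand sides of Equations (18) and (19), i.e. the jerk over
    the step is approximated as (a_now - a_prev) / dt.
    """
    v_now = v_prev + 0.5 * (a_prev + a_now) * dt
    y_now = y_prev + v_prev * dt + (2.0 * a_prev + a_now) * dt ** 2 / 6.0
    return y_now, v_now


def integrate_record(accelerations, dt, y0=0.0, v0=0.0):
    """Integrate a sequence of acceleration samples into positions and velocities."""
    ys, vs = [y0], [v0]
    for a_prev, a_now in zip(accelerations, accelerations[1:]):
        y, v = integrate_step(ys[-1], vs[-1], a_prev, a_now, dt)
        ys.append(y)
        vs.append(v)
    return ys, vs
```

Because the scheme assumes a linearly varying acceleration within each step, it reproduces a signal such as a(t) = 6t exactly up to rounding; with real accelerometer noise the drift described above still applies.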
Figure 3. Two kinds of terrain defects: (a) peak defects and (b) smooth or corrugated terrain.
The vertical actuator (between the actuated or rear wheel and the sprung mass) has
to maintain the latter at the objective height. In this approach, the objective height is
the moving average of the calculated height over the rolling distance 2D.
The larger this distance, the smoother the objective function, but also the larger the error
accumulated in calculating the height from the accelerometer reading. Thus, parameter
“D” should be adjusted according to the kind of roughness the topography has (with the
sole requirement that it must not exceed the distance between wheels).
For the proposed algorithm, it is important to know exactly the position of both wheels
(which are supposed to roll without slipping) on the ground. This is achieved by recording
the corresponding angles, $\varphi_n^{pro}$ and $\varphi_n^{act}$ (the angles turned by the probe wheel and the
actuated wheel, respectively), both set to zero at the initial position (n = 0). The model is sketched
in Figure 4 at the initial position and at the time t = nΔt. The actual trajectories followed
by the center of the wheels and the trajectory the end of the actuator should follow are
also represented.
Figure 4. Model of the vehicle sketched at the initial position (t = 0) and at the time t = nΔt, where the
angles are indicated. Dotted and dashed curves represent respectively the actual trajectory followed
by the center of the wheels and that the actuated mass should follow.
At time t = nΔt, the vertical acceleration of the probe wheel, $\ddot{y}_n$, and the angles turned
by the wheels are measured. Then, the vertical velocity, $\dot{y}_n$, and the vertical position, $y_n$, are calculated
by using Equations (18) and (19), respectively. All these values are recorded in a database
in the way indicated in Table 3.
Once enough records have been acquired, the algorithm is ready to provide the current
length to be taken by the actuator. The objective height is calculated as the moving average
centered on the record corresponding to the actuated wheel and over a sufficient number
of records; speaking in terms of rolling distances, the moving average is calculated over a
rolling distance of 2D. This is achieved by identifying the records m, i and f that, respectively,
define the current position for the actuated wheel and the first and last of the records to
average over. In other words, at t = mΔt the probe wheel was just on the same point of the
trajectory that the actuated wheel is at t = nΔt (i.e., at present). In Figure 5, the model at
moments corresponding to records m, n, i and f is sketched.
Figure 5. Sketch of the model at the key moments for the algorithm: at the initial position (dashed
line in (up) and (middle) diagrams); when the record m is being registered (up); when the record n
is being registered (middle); when the record i is being registered (dotted line, (down)); when the
record f is being registered (dash line, (down)).
From Figure 5, if both wheels are considered to have the same radius, r, the following
rules for identifying the numbers m, i and f are deduced:
m is such that:
$$\varphi_m^{pro} \approx \varphi_n^{act} - \frac{L}{r} \tag{20}$$
i is such that:
$$\varphi_i^{pro} \approx \varphi_m^{pro} - \frac{D}{r} \approx \varphi_n^{act} - \frac{L + D}{r} \tag{21}$$
f is such that:
$$\varphi_f^{pro} \approx \varphi_m^{pro} + \frac{D}{r} \approx \varphi_n^{act} - \frac{L - D}{r} \tag{22}$$
Here “≈” means “is the nearest value to” (note that $\{\varphi_j^{pro}\}$ is a discrete set of values,
so in general none of these equalities can be satisfied exactly). We have also assumed
that the first section of the trajectory is flat and horizontal, with a minimum length
equal to the wheelbase, L.
Once the records i, m and f have been identified, the algorithm calculates the objective
height (see Figure 6) as:
$$y_{obj} = \frac{1}{f - i + 1}\sum_{j=i}^{f} y_j + h \tag{23}$$
where h is the initial objective height (i.e., the actuator initial length, see Figures 4 and 6).
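A minimal Python sketch of this step is given below, assuming the database is held as two parallel lists (probe-wheel angles and calculated heights); that layout and the function names are hypothetical. The indexes i and f are located relative to record m, following the first relation in each of the rules for i and f.

```python
def nearest_index(values, target):
    """Index of the entry of `values` closest to `target` (the "nearest value" rule)."""
    return min(range(len(values)), key=lambda j: abs(values[j] - target))


def objective_height(phi_pro, y, phi_act_n, L, D, r, h):
    """Return (i, m, f, y_obj) for the current actuated-wheel angle phi_act_n.

    phi_pro[j] and y[j] are the probe-wheel angle and calculated height of record j.
    """
    m = nearest_index(phi_pro, phi_act_n - L / r)    # Equation (20)
    i = nearest_index(phi_pro, phi_pro[m] - D / r)   # Equation (21)
    f = nearest_index(phi_pro, phi_pro[m] + D / r)   # Equation (22)
    y_obj = sum(y[i:f + 1]) / (f - i + 1) + h        # Equation (23)
    return i, m, f, y_obj
```

The exhaustive nearest-value search is kept for clarity; it scans the whole list on every call, which is acceptable only because the database is pruned at each step.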
Figure 6. Heights that are involved in the calculus of the actuated distance.
Finally, the database is refreshed by deleting the records with subscripts less than i
and decreasing the remaining subscripts by i units:
if j ≥ i, j ← j − i
In this way, the database does not grow indefinitely, and only the records to be used
by the algorithm at the current time of calculation remain.
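The refresh can be sketched as follows, assuming the records are stored in a Python list whose position is the record subscript (a hypothetical layout, not stated in the text).

```python
def refresh(records, i, m, f):
    """Delete records with subscript below i and shift the remaining subscripts by i."""
    del records[:i]            # the record formerly at subscript j >= i is now at j - i
    return 0, m - i, f - i     # subscripts i, m and f after the shift
```

After the call, the oldest record still needed by the moving average sits at subscript 0.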
In summary, the algorithm can be considered as consisting of three tasks:
1. Reading (angles and vertical acceleration) and computing vertical velocity and position.
2. Identifying the indexes i, m and f.
3. Obtaining the actuated distance and refreshing the database.
To understand how the algorithm performs the second task, suppose the database
at time nΔt is as shown in Table 4. For a given angle turned by the actuated wheel, $\varphi_n^{act}$,
the algorithm computes the theoretical angle turned by the probe wheel at the three key
moments iΔt, mΔt and fΔt ($\varphi_n^{act} - \frac{L+D}{r}$, $\varphi_n^{act} - \frac{L}{r}$ and $\varphi_n^{act} - \frac{L-D}{r}$, respectively), using
Equations (20)–(22). Then, it searches the database for the recorded angles closest to
those computed and determines the indexes i, m and f. As the angle turned by the probe wheel
does not decrease with n (hence i ≤ m ≤ f), the search can be performed sequentially.
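One way to exploit this ordering is a forward scan that starts each search where the previous one ended. The sketch below is an assumption about how such a sequential search could look, with hypothetical function names. It takes the target angles relative to the theoretical angle at record m and requires each starting index to be no later than the record being sought.

```python
def advance_to_nearest(phi_pro, start, target):
    """Scan forward from `start` to the record whose angle is nearest to `target`.

    Assumes phi_pro is non-decreasing and that `start` is not past the nearest
    record, so the scan can stop as soon as the distance begins to grow.
    """
    j = start
    while (j + 1 < len(phi_pro)
           and abs(phi_pro[j + 1] - target) <= abs(phi_pro[j] - target)):
        j += 1
    return j


def find_indexes(phi_pro, phi_act_n, L, D, r):
    """Return (i, m, f) with one forward pass over the database."""
    phi_m = phi_act_n - L / r                        # theoretical angle at record m
    i = advance_to_nearest(phi_pro, 0, phi_m - D / r)
    m = advance_to_nearest(phi_pro, i, phi_m)
    f = advance_to_nearest(phi_pro, m, phi_m + D / r)
    return i, m, f
```

Each record is visited at most once per call, so the cost is linear in the number of stored records.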
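Record   Probe wheel angle     Actuated wheel angle
i        $\varphi_i^{pro}$     $\varphi_i^{act}$
m        $\varphi_m^{pro}$     $\varphi_m^{act}$
f        $\varphi_f^{pro}$     $\varphi_f^{act}$
n        $\varphi_n^{pro}$     $\varphi_n^{act}$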
3. Results
This section presents the results obtained after testing the predictive suspension
algorithm over the two ground topographies defined in Section 2.1. The first topography
(profile 1) simulates a wiggly surface; the second topography is a combination of two
sinusoidal irregularities. The parameters of both topographies are summarized in Table 2.
As is described in Section 2.2, the algorithm’s key parameter to define the motion of
the sprung or actuated mass is the distance “D”. This parameter is set from D = 0 m (no
actuation over the sprung mass) to D = 3 m (the vehicle’s wheelbase) in steps of 0.5 m.
Hence, seven values of “D” were tested.
Additionally, the parameter “D” is normalized to the wheelbase of the vehicle by
defining a new variable d = D/L, where L is the wheelbase. As the value of “D” is
limited by the wheelbase (it is not possible to obtain information on positions not yet
traveled by the front wheel), the value of “d” always lies between 0 and 1.
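The seven tested values and their normalized counterparts can be listed with a short script. Exact fractions are used so that values such as d = 2/3 and d = 5/6 appear as written in the text; the variable names are illustrative.

```python
from fractions import Fraction

L = Fraction(3)                                  # wheelbase in metres (Table 1)
D_values = [Fraction(k, 2) for k in range(7)]    # D = 0, 0.5, ..., 3 m
d_values = [D / L for D in D_values]             # normalized parameter d = D / L

for D, d in zip(D_values, d_values):
    print(f"D = {float(D):.1f} m  ->  d = {d}")
```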
Figure 9. Kinematics of the vehicle when traveling through the ground profile 1.
Figure 10. The motion of the actuator in the first ground profile.
Figure 11. Displacement of the actuated mass in the first ground profile.
Figure 12. Acceleration spectra of the actuated mass in the first ground profile tested.
The displacement of the actuator placed between the rear wheel and the sprung
mass is plotted in Figure 10. As can be seen, the actuator does not move when the
normalized parameter “d” is 0. As this parameter increases,
the actuator moves to compensate for the irregularities of the ground, which can even produce
a wave 180° out of phase with the ground profile. Lower values of parameter “d” yield
smaller displacements of the actuator, although they also generate some discontinuities. Larger
values of parameter “d” reproduce the out-of-phase trajectory more faithfully, but the
amplitude of the movement is larger too. The actuator always works in extension (positive
displacement) at the beginning of these simulations due to the initial elevation of the
front wheel.
Figure 11 shows the path followed by the center of the rear wheel and the position of
the actuated mass for a better comparison of the followed trajectory and the actuation of
the suspension system. It is observed that the movement of the sprung mass is the opposite
of the rear wheel when the parameter “d” is larger than 2/3, that is, when the distance “D”
over which the moving average is computed is larger than 2 m. In this situation, when the
rear wheel moves upwards, the sprung mass is moved downwards, and vice versa. For
values of parameter “D” below 2 m (d < 2/3), the sprung mass moves vertically in the same
direction as the rear wheel, but this movement is limited.
To establish the best value of the normalized parameter “d” among
those tested for ground profile 1, the spectra of the vertical accelerations experienced
by the sprung mass are computed. The results are plotted in Figure 12. The
graph shows that the largest values of the parameter “d” yield the lowest
accelerations at low frequencies; however, an undesirable vibration appears at high
frequencies. Therefore, according to this criterion, the best value is the one that achieves the
best compromise between low- and high-frequency vibrations.
To simplify the search for the optimum value of parameter “d”, Table 5 summarizes
the maximum amplitude and the RMS of the spectra shown in Figure 12. Combining the
spectra and the data in the table, it can be concluded that the best value for parameter “d”
under the conditions of this simulation is d = 2/3, which corresponds to a distance D = 2 m.
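The text does not state how the spectra or their RMS values are computed, so the NumPy sketch below is only one plausible definition: a one-sided amplitude spectrum obtained with the FFT, its maximum, and the root mean square of its bins. The function name and the scaling convention are assumptions.

```python
import numpy as np


def spectrum_metrics(acc, fs):
    """One-sided amplitude spectrum of `acc` with its peak and RMS values.

    acc: vertical acceleration samples (m/s^2); fs: sampling frequency (Hz).
    Returns (freqs, amplitude, max_amplitude, rms_of_spectrum).
    """
    acc = np.asarray(acc, dtype=float)
    n = acc.size
    amp = np.abs(np.fft.rfft(acc)) / n
    amp[1:] *= 2.0               # fold negative frequencies into the one-sided spectrum
    if n % 2 == 0:
        amp[-1] /= 2.0           # the Nyquist bin has no negative-frequency partner
    freqs = np.fft.rfftfreq(n, d=1.0 / fs)
    return freqs, amp, amp.max(), np.sqrt(np.mean(amp ** 2))
```

With this scaling, a pure sinusoid whose frequency falls exactly on a bin shows a peak equal to its amplitude, which makes spectra for different values of “d” directly comparable.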
Table 5. Characterization of the acceleration spectra in the first ground profile tested.
Figure 13. Kinematics of the vehicle when traveling through the second ground profile.
Figure 14. The motion of the actuator in the second ground profile.
Figure 15. Displacement of the actuated mass in the second ground profile tested.
Figure 16. Acceleration spectra of the actuated mass in the second ground profile tested.
The PSD of the vertical accelerations experienced by the sprung mass over the
second ground profile is computed to establish the best value of the parameter “d”
among those tested. The results are plotted in Figure 16. As in
the previous subsection, there is a large peak at low frequencies and some undesirable
accelerations at high frequencies; however, in this set of simulations, a larger value of the
parameter “d” does not mean lower acceleration amplitudes at low frequencies. In fact,
larger acceleration amplitudes are obtained with d = 1 than with d = 5/6, which indicates
that moving the sprung mass in advance of the terrain elevation is not as beneficial as
might be expected.
Once again, to simplify the search for the optimum value of parameter “d”, the
maximum amplitude and the RMS of the spectra are summarized in Table 6. Combining the
spectra and the numbers in the table, it can be concluded that the best value for parameter
“d” under the conditions of this simulation is d = 5/6, as it achieves the lowest maximum
amplitude and RMS values. The normalized parameter d = 5/6 corresponds to a distance
“D” of 2.5 m.
Table 6. Characterization of the acceleration spectra in the second ground profile tested.
4. Conclusions
This paper presents a predictive suspension algorithm for land vehicles traveling
through a deterministic topography. Prior to the virtual implementation of the algorithm,
the kinematics of a two-wheel 2D (1/2) vehicle were proposed and solved. The algorithm
inputs are the measurements given by a position scanner installed in the first wheel of the
vehicle. From these inputs, the algorithm reconstructs the terrain and averages its height
over a distance “D”. Then, it generates the command for moving the actuator installed in
the rear wheel and connected to the sprung mass. Both the algorithm and the kinematic
model of the 1/2 vehicle were implemented in Simulink.
The algorithm was tested on two terrains with different profiles. Furthermore,
sensitivity analyses were carried out to establish the best value of the parameter
“D” for each profile. The performance of the algorithm is quantified for several values
of the parameter “D”, using the spectral analysis of the accelerations experienced by the
sprung mass.
The results show that, in general, increasing the parameter “D” reduces the motion of
the sprung/actuated mass and, hence, improves comfort; however, there is an optimum
value of the parameter “D” for each ground profile, and this value does not necessarily
coincide with the highest value of “D”. In fact, the optimum “D” value depends on
the terrain irregularities and their characteristic length. For the first ground profile, the
optimum value of the parameter “D” is 2 m (d = 2/3), which is half the wavelength of the
unevenness of the terrain. The optimum value for the second terrain is D = 2.5 m (d = 5/6).
The value of the “D” parameter is limited by the wheelbase, since it is not possible to
obtain information on positions not yet traveled by the front wheel. In fact, the maximum
value of the “D” parameter must be slightly less than the wheelbase, as the data processing
and calculation time of the algorithm must be taken into account. This is not of major
significance at the current stage of the research, as the Simulink implementation
computes the algorithm in real-time, but it should be taken into account when developing
the physical device to be installed in a light vehicle for real testing.
Finally, it can be concluded that the algorithm works correctly and decreases the
movement of the sprung/actuated mass when the vehicle travels through a deterministic
topography. The next steps in the research would be the development of a physical device
that executes the proposed predictive algorithm and its implementation in a light vehicle.
Author Contributions: Individual contributions are as follows: original concept: J.M.; mathematical
development: J.M. and A.B.; implementation, simulation and draft preparation: A.B.; review, editing,
validation, and formal analysis: E.S.-H., H.R. and J.M. All authors have read and agreed to the
published version of the manuscript.
Funding: The research work described in this paper is part of the R&D and Innovation projects MC4.0
PID2020-116984RB-C21 and MC4.0 PID2020-116984RB-C22 supported by the MCIN/AEI/10.13039/
501100011033.
References
1. Reimpell, J.; Stoll, H.; Betzler, J.W. The Automotive Chassis: Engineering Principles, 2nd ed.; Butterworth Heinemann: Oxford, UK,
2001; ISBN 978-0-7506-5054-0.
2. Theunissen, J.; Tota, A.; Gruber, P.; Dhaens, M.; Sorniotti, A. Preview-Based Techniques for Vehicle Suspension Control: A
State-of-the-Art Review. Annu. Rev. Control 2021, 51, 206–235. [CrossRef]
3. Luo, R.; Shi, H.; Guo, J.; Huang, L.; Wang, J. A Nonlinear Rubber Spring Model for the Dynamics Simulation of a High-Speed
Train. Veh. Syst. Dyn. 2020, 58, 1367–1384. [CrossRef]
4. Anubi, O.M.; Patel, D.R.; Crane, C.D., III. A New Variable Stiffness Suspension System: Passive Case. Mech. Sci. 2013, 4, 139–151.
[CrossRef]
5. Kalyan Raj, A.H.; Padmanabhan, C. A New Passive Non-Linear Damper for Automobiles. Proc. Inst. Mech. Eng. Part J.
Automob. Eng. 2009, 223, 1435–1443. [CrossRef]
6. Solomon, U.; Padmanabhan, C. Hydro-Gas Suspension System for a Tracked Vehicle: Modeling and Analysis. J. Terramech. 2011,
48, 125–137. [CrossRef]
7. Fu, B.; Giossi, R.L.; Persson, R.; Stichel, S.; Bruni, S.; Goodall, R. Active Suspension in Railway Vehicles: A Literature Survey.
Railw. Eng. Sci. 2020, 28, 3–35. [CrossRef]
8. Anubi, O.M.; Crane, C.A. New Semiactive Variable Stiffness Suspension System Using Combined Skyhook and Nonlinear Energy
Sink-Based Controllers. IEEE Trans. Control Syst. Technol. 2015, 23, 937–947. [CrossRef]
9. Sun, S.; Deng, H.; Li, W. Variable Stiffness and Damping Suspension System for Train. In Proceedings of the SPIE Volume 9057,
Active and Passive Smart Structures and Integrated Systems 2014, San Diego, CA, USA, 9–13 March 2014; p. 90570P.
10. Faraj, R.; Graczykowski, C.; Holnicki-Szulc, J. Adaptable Pneumatic Shock Absorber. J. Vib. Control 2019, 25, 711–721. [CrossRef]
11. Spencer, B.F.; Dyke, S.J.; Sain, M.K.; Carlson, J.D. Phenomenological Model for Magnetorheological Dampers. J. Eng. Mech. 1997,
123, 230–238. [CrossRef]
12. Gavin, H.P.; Hanson, R.D.; Filisko, F.E. Electrorheological Dampers, Part I: Analysis and Design. J. Appl. Mech. 1996, 63, 669–675.
[CrossRef]
13. Dixon, J. The Shock Absorber Handbook; John Wiley & Sons, Ltd.: Hoboken, NJ, USA, 2008; ISBN 978-0-470-51642-3.
14. Mei, T.X.; Zaeim, A.; Li, H. Control of Railway Wheelsets—A Semi-Active Approach. In Advances in Dynamics of Vehicles on
Roads and Tracks; Klomp, M., Bruzelius, F., Nielsen, J., Hillemyr, A., Eds.; Lecture Notes in Mechanical Engineering; Springer
International Publishing: Cham, Switzerland, 2020; pp. 16–23. ISBN 978-3-030-38076-2.
15. Yang, J.; Ning, D.; Sun, S.S.; Zheng, J.; Lu, H.; Nakano, M.; Zhang, S.; Du, H.; Li, W.H. A Semi-Active Suspension Using a
Magnetorheological Damper with Nonlinear Negative-Stiffness Component. Mech. Syst. Signal Process. 2021, 147, 107071.
[CrossRef]
16. Tudon-Martinez, J.C.; Hernandez-Alcantara, D.; Amezquita-Brooks, L.; Morales-Menendez, R.; Lozoya-Santos, J.D.J.; Aquines, O.
Magneto-Rheological Dampers—Model Influence on the Semi-Active Suspension Performance. Smart Mater. Struct. 2019, 28,
105030. [CrossRef]
17. Soliman, A.; Kaldas, M. Semi-Active Suspension Systems from Research to Mass-Market—A Review. J. Low Freq. Noise Vib. Act.
Control 2021, 40, 1005–1023. [CrossRef]
18. Karnopp, D. Permanent Magnet Linear Motors Used as Variable Mechanical Dampers for Vehicle Suspensions. Veh. Syst. Dyn.
1989, 18, 187–200. [CrossRef]
19. Smith, M.C. Synthesis of Mechanical Networks: The Inerter. IEEE Trans. Autom. Control 2002, 47, 1648–1662. [CrossRef]
20. Lewis, T.D.; Jiang, J.Z.; Neild, S.A.; Gong, C.; Iwnicki, S.D. Using an Inerter-Based Suspension to Improve Both Passenger Comfort
and Track Wear in Railway Vehicles. Veh. Syst. Dyn. 2019, 58, 472–493. [CrossRef]
21. Kawamoto, Y.; Suda, Y.; Inoue, H.; Kondo, T. Electro-Mechanical Suspension System Considering Energy Consumption and
Vehicle Manoeuvre. Veh. Syst. Dyn. 2008, 46, 1053–1063. [CrossRef]
22. Klimenko, Y.I.; Batishchev, D.V.; Pavlenko, A.; Grinchenkov, V.P. Design of a Linear Electromechanical Actuator with an Active
Vehicle Suspension System. Russ. Electr. Eng. 2015, 86, 588–593. [CrossRef]
23. Kilicaslan, S. Control of Active Suspension System Considering Nonlinear Actuator Dynamics. Nonlinear Dyn. 2018, 91, 1383–1394.
[CrossRef]
24. Goodall, R.; Freudenthaler, G.; Dixon, R. Hydraulic Actuation Technology for Full- and Semi-Active Railway Suspensions. Veh.
Syst. Dyn. 2014, 52, 1642–1657. [CrossRef]
25. Goodall, R. Tilting Trains and Beyond-the Future for Active Railway Suspensions. 1. Improving Passenger Comfort. Comput.
Control Eng. J. 1999, 10, 153–160. [CrossRef]
26. Carballeira, J.; Baeza, L.; Rovira, A.; García, E. Technical Characteristics and Dynamic Modelling of Talgo Trains. Veh. Syst. Dyn.
2008, 46, 301–316. [CrossRef]
27. Jamil, I.A.A.; Moghavvemi, M. Optimization of PID Controller Tuning Method Using Evolutionary Algorithms. In Proceedings
of the 2021 Innovations in Power and Advanced Computing Technologies (i-PACT), Kuala Lumpur, Malaysia, 27–29 November
2021; pp. 1–7.
28. Di Gialleonardo, E.; Facchinetti, A.; Bruni, S. Control of an Integrated Lateral and Roll Suspension for a High-Speed Railway
Vehicle. Veh. Syst. Dyn. 2022, 1–27. [CrossRef]
29. Rodriguez-Guevara, D.; Favela-Contreras, A.; Beltran-Carbajal, F.; Sotelo, D.; Sotelo, C. Active Suspension Control Using an
MPC-LQR-LPV Controller with Attraction Sets and Quadratic Stability Conditions. Mathematics 2021, 9, 2533. [CrossRef]
30. Palanisamy, S.; Karuppan, S. Fuzzy Control of Active Suspension System. J. Vibroeng. 2016, 18, 3197–3204. [CrossRef]
31. Díaz-Choque, C.S.; Félix-Herrán, L.C.; Ramírez-Mendoza, R.A. Optimal Skyhook and Groundhook Control for Semiactive
Suspension: A Comprehensive Methodology. Shock Vib. 2021, 2021, 8084343. [CrossRef]
32. Boada, B.L.; Boada, M.J.L.; Vargas-Melendez, L.; Diaz, V. A Robust Observer Based on H ∞ Filtering with Parameter Uncertainties
Combined with Neural Networks for Estimation of Vehicle Roll Angle. Mech. Syst. Signal Process. 2018, 99, 611–623. [CrossRef]
Article
A Novel Denoising Method for Retaining Data Characteristics
Brought from Washing Aeroengines
Zhiqi Yan 1 , Ming Zu 2 , Zhiquan Cui 3, * and Shisheng Zhong 4
Abstract: Airlines evaluate the energy-saving and emission-reduction effect of washing aeroengines
by analyzing the exhaust gas temperature margin (EGTM) data of the aeroengines so as to formulate
a reasonable washing schedule. The noise in EGTM data must be reduced because it interferes
with the analysis. EGTM data show several step changes after the aeroengine is cleaned. These
step changes increase the difficulty of denoising because they tend to be smoothed out in the process.
A denoising method for aeroengine data based on a hybrid model is proposed to meet the needs
of accurately evaluating the washing effect. Specifically, the aeroengine data are first decomposed
into several components by time and frequency. The amplitude of the component containing the
most noise is amplified, and Gaussian noise is added to generate noise-amplified data. Second, a
Gated Recurrent Unit Autoencoder (GAE) model is proposed to capture engine data features. The
GAE is trained to reconstruct the original data from the noise-amplified data to develop its noise
reduction ability. The experimental results show that, compared with the current popular algorithms,
the proposed denoising method can achieve a better denoising effect, retaining the key characteristics
of the aeroengine data.
Keywords: autoencoder; aeroengine; denoising; GRU
Citation: Yan, Z.; Zu, M.; Cui, Z.; Zhong, S. A Novel Denoising Method for Retaining Data Characteristics
Brought from Washing Aeroengines. Mathematics 2022, 10, 1485. https://doi.org/10.3390/math10091485
the difficulty of denoising. After washing the aircraft engine, the data will experience a step
change. Traditional denoising methods often treat the step changes in data as noise and
smooth them out, making it difficult for the data to accurately reveal the washing effect,
adding difficulties for subsequent evaluation. Therefore, a new denoising model is needed
to eliminate the noise of the engine itself and retain the impact of cleaning on the data, as
shown in Figure 1.
Figure 1. Demand for denoising aeroengine data. (a) Raw engine data; (b) Denoising effect of
traditional methods; (c) Denoising required for aircraft engine.
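The smoothing problem can be illustrated with a small NumPy sketch. A centered moving average stands in here for a generic traditional smoother; it is not one of the specific methods reviewed in this paper, and the signal below is synthetic, not real EGTM data.

```python
import numpy as np


def moving_average(x, window):
    """Centered moving average (odd `window`); edges are padded with the end values."""
    pad = window // 2
    padded = np.pad(np.asarray(x, dtype=float), pad, mode="edge")
    kernel = np.ones(window) / window
    return np.convolve(padded, kernel, mode="valid")


# Synthetic example: a 10-unit step (a stand-in for a washing event) plus Gaussian noise.
rng = np.random.default_rng(0)
raw = np.concatenate([np.full(50, 20.0), np.full(50, 30.0)]) + rng.normal(0.0, 1.0, 100)
smooth = moving_average(raw, 11)
```

On a noise-free step of height 10, an 11-sample window spreads the jump over 11 samples, so no two consecutive smoothed values differ by more than 10/11. The noise is reduced, but the sharp step that marks the washing event is lost.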
Many noise reduction methods have been proposed, including empirical mode
decomposition (EMD), wavelet threshold denoising, and filtering. Xue Feng et al. [2] decomposed
the data with EMD after adding Gaussian noise to the data and successfully separated the
noise from the signal by searching for the dominant noise component using the continuous
mean square error criterion. Lu et al. [3] randomly shuffled the high-frequency noise part of
the data and then decomposed the data with EMD to achieve noise reduction. Kai et al. [4]
denoised complex images with a non-sampling wavelet transform method. Mohammad
Saleh Sadooghi et al. [5] denoised the compressor vibration signal of aeroengines with the
wavelet threshold denoising method. Because the effect of wavelet threshold denoising
depends on the threshold and the threshold function, the authors evaluated 84 combinations of
threshold and threshold function to find the most suitable one for denoising
engine compressor vibration signals. P. Maragos et al. [6] proposed morphological filters,
which are filters composed of basic operations of mathematical morphology. Morphological
filters can selectively suppress image noise. M. H. Sedaaghi et al. [7] proposed mediated
morphological filters by combining morphological filters with median filtering and classical
gray scale morphological operators. Mediated morphological filters are effective at
eliminating salt and pepper noise in images. Yang Chao et al. [8] denoised temperature
data with a two-way Kalman filter method. They improved the computational efficiency
by simplifying the Kalman filter algorithm through precalculation of the filter coefficients.
The above methods show some disadvantages. The EMD algorithm must calculate
the dividing point between noise components and key characteristics; the wavelet threshold
denoising algorithm requires the threshold and threshold function to be chosen manually. These
two algorithms are inefficient, and their results are only approximate. The
Kalman filter is suitable for linear systems rather than highly nonlinear systems such as
aeroengines. None of these three methods is adaptive: researchers must select
or set model parameters according to data characteristics [9]. Therefore, these methods are
not suitable for aeroengine data with non-uniform data distribution, hardly meeting the
needs of accurately evaluating the washing effect.
Autoencoders are adaptive artificial intelligence algorithms that are frequently used as
noise reduction models. Denoising Autoencoders (DAE) were first proposed by Bourlard
and Kamp [10] to extract features from raw data. The idea of the denoising autoencoder is to make
the autoencoder reconstruct the original data from the noise-added data to obtain its noise
reduction ability. Vincent et al. [11] enhanced the robustness of the model by setting a
given proportion of the autoencoder inputs to zero. Song Hui et al. [12] used seismic data to
verify the above algorithm and found that it can filter out random noise of
strong intensity, but its efficiency is low. According to the characteristics of
underwater heterogeneous information data, Wang et al. [13] combined a three-layer sparse
autoencoder with two-layer convolution to build a model with strong noise reduction ability.
Peng et al. [14] added convolutional layers to the autoencoder, which made the autoencoder
more robust with a smaller reconstruction error. However, this reduces the signal
amplitude. Hui et al. [15] constructed a convolutional autoencoder to filter seismic data with
a low signal-to-noise ratio and achieved desirable results. Alexander Kensert et al. [16]
denoised chromatograms to a completely or almost completely noise-free state with a
deep convolutional autoencoder.
However, the DAE also smooths the edge features of the data while denoising. Although
the literature [11,13] preserves the characteristics of the data to the greatest extent,
the premise is that the noise of underwater heterogeneous information data and seismic
data comes from nature and conforms to a Gaussian distribution. The noise of
aeroengine data comes from the engine system rather than nature, and no reference
documents that engine noise conforms to a Gaussian distribution. Therefore,
the existing denoising methods based on autoencoders can eliminate conventional noise
but struggle to eliminate the specific noise of the aeroengine. These methods are not
suitable for engine data and hardly meet the need to accurately evaluate the washing effect.
To meet the needs of accurately evaluating the washing effect, a denoising method
for aeroengine data based on a hybrid model is proposed. This method can filter out the
noise of the aeroengine data, retaining the edge features of the data. The method first
splits the data by washing time, and then decomposes the data into several components by
frequency. Second, the method finds the component that contains the most noise, amplifies
its magnitude, and adds Gaussian noise to compose the noise-amplified data. Finally, the
method inputs the noise-amplified data together with the original data into the proposed
autoencoders, training the model’s ability to recover the detailed information of the engine
data from the noise-amplified data. Figure 2 shows the principle of the aeroengine data
denoising method based on the hybrid model.
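The construction of the noise-amplified input can be illustrated with a minimal sketch. It is not the authors' implementation: the function name `make_noise_amplified`, the amplification factor `gain`, and the noise level `sigma` are hypothetical, since the text does not state the values used.

```python
import random

def make_noise_amplified(segment, noisy_component, gain=2.0, sigma=0.1, seed=0):
    """Return a noise-amplified copy of `segment`.

    segment         : raw values of one split segment X(i)
    noisy_component : the component of X(i) judged to contain the most noise
    gain            : hypothetical amplification factor for that component
    sigma           : hypothetical standard deviation of the added Gaussian noise
    """
    if len(segment) != len(noisy_component):
        raise ValueError("segment and component must have the same length")
    rng = random.Random(seed)
    # Scaling the component by `gain` adds (gain - 1) extra copies of it.
    return [
        x + (gain - 1.0) * c + rng.gauss(0.0, sigma)
        for x, c in zip(segment, noisy_component)
    ]

segment = [10.0, 10.5, 9.8, 10.2]
component = [0.1, -0.2, 0.3, -0.1]
noisy_input = make_noise_amplified(segment, component)
# (noisy_input, segment) would form one (input, target) training pair.
```

Pairing the corrupted input with the untouched segment as the target is what lets the autoencoder learn to remove the engine-specific noise while keeping the step change.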
The rest of the paper is organized as follows. The second part presents the decomposition method
of the aeroengine data and the identification method of the data frequency band containing
the most noise, and proposes a GAE model to denoise aeroengine data. The third part first
introduces the source of the engine data and then gives the identification results of
the data frequency bands containing the most noise and the determination results of the
hyperparameters of the GAE model. Finally, this part tests the noise reduction effect of
the EMD model, the DAE model, and the GAE model on aeroengine data. The fourth part
summarizes the superiority of the proposed aeroengine data denoising method through
the analysis of the above noise reduction testing results.
[Figure 2. Principle of the aeroengine data denoising method based on the hybrid model. Flowchart steps: aeroengine data; split by washing record; decompose into several components according to frequency; identify the component that contains the most noise; amplify the component that contains the most noise; add Gaussian noise; the noise-amplified data and the split data are input to the autoencoder for training; denoising model.]
2.2. Identification Method of Aeroengine Data Components Containing Noise Based on EMD
and DTW
This section covers splitting the data, decomposing the data, and identifying noise.
The washing time is used as the basis for splitting the engine data. The engine data
in this study come from OEM factories and are recorded with the flight cycle as the
observation time, so the aeroengine data are time series data. The engine washing records
are tables that record the time, location, engine type, and aircraft type when the engine was
washed. Only the washing time is used in this study. The data are denoted as $X = \{x_1, x_2, x_3, \ldots\}$.
If the number of washings is $n - 1$, the washing records can be defined as $T_{washing} = \{t_1, t_2, \ldots, t_{n-1}\}$.
$T_{washing}$ splits the data $X$ into $n$ segments. Let $X(i)$ denote the $i$-th segment,
which is given by Equation (1).
$$X(i) = \{x_{t_{i-1}+1},\ x_{t_{i-1}+2},\ \cdots,\ x_{t_i}\}, \quad i = 1, 2, \cdots, n \qquad (1)$$
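The splitting rule of Equation (1) can be sketched in a few lines. The function name `split_by_washing` is hypothetical, and the outer boundaries (0 before the first wash and the length of the data after the last wash) are assumptions, because the text does not define them explicitly.

```python
def split_by_washing(data, wash_times):
    """Split `data` into len(wash_times) + 1 consecutive segments.

    wash_times holds the 1-based cycle index t_k of the last sample before
    each wash, in increasing order. Using 0 and len(data) as the outer
    boundaries is an assumption of this sketch.
    """
    bounds = [0] + list(wash_times) + [len(data)]
    if any(a >= b for a, b in zip(bounds, bounds[1:])):
        raise ValueError("washing times must be strictly increasing and inside the data")
    # Segment i covers x_{t_{i-1}+1} .. x_{t_i}, i.e. data[t_{i-1}:t_i] with 0-based slices.
    return [data[a:b] for a, b in zip(bounds, bounds[1:])]

segments = split_by_washing(list(range(1, 11)), [3, 7])
# segments == [[1, 2, 3], [4, 5, 6, 7], [8, 9, 10]]
```

With two washing times the ten samples fall into three segments, matching the n − 1 washings and n segments of the text.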
[Figure 3. Flowchart with two panels. Panel "The proposed method to identify the data components that contain the most noise": collect data; data splitting (split data X according to washing record); data decomposition based on EMD; noise identification based on DTW. Panel "The proposed denoising method": the component containing the most noise and Gaussian noise are added to the data; GAE autoencoder; denoised data segment; combination; denoised data.]
Then X(i) is decomposed in frequency with the EMD algorithm. The EMD algorithm
is a signal analysis algorithm, which decomposes the signal according to the timescale
of the aeroengine data without setting any basis functions in advance. Aeroengine data
can be divided into several “intrinsic mode functions” (IMF) by the EMD method without
deviating from the time domain. The aeroengine data can be expressed as the trend
component and the sum of several IMF functions.
The steps of EMD decomposing the data segment X(i) obtained by dividing the
aeroengine data are as follows:
Step 1: Find all the extreme points of the aeroengine data, use the spline curve to
connect all the maximum points into the upper envelope, and connect all the minimum
points to the lower envelope.
Step 2: Calculate the average value m(i) of the upper and lower envelopes and the
engine data IMF component c(i) according to Equation (2).
Norden E. Huang [17] put forward the concept of the IMF and defined two conditions
for an IMF: 1. Over the whole time range, the number of local extreme points and the number of
zero-crossing points differ by at most one; 2. The mean value of the upper and lower envelopes
is zero. If c(i) does not satisfy these two conditions, repeat steps 1 and 2 until they are
satisfied, and then go to step 3.
Step 3: Separate c(i) from the aeroengine data X(i) to obtain X(i) - c(i). Repeat steps 1 to 3
on the remainder until X(i) - c(i) becomes a monotonic sequence. This monotonic remainder is
the residual r(i), defined by Equation (3).
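Steps 1 and 2 can be sketched as a single sifting pass. This is a simplified illustration, not the authors' implementation: it uses piecewise-linear envelopes in place of spline envelopes to stay dependency-free, it pins both envelopes to the end points of the data, and it assumes the standard EMD relations m = (upper + lower) / 2 and c = X − m. The names `_envelope` and `sift_once` are hypothetical.

```python
def _envelope(x, idx):
    """Piecewise-linear envelope through the points (i, x[i]) for i in idx."""
    knots = sorted(set([0] + idx + [len(x) - 1]))  # pin both ends to the data
    env = []
    for a, b in zip(knots, knots[1:]):
        for i in range(a, b):
            t = (i - a) / (b - a)
            env.append(x[a] + t * (x[b] - x[a]))
    env.append(x[knots[-1]])
    return env

def sift_once(x):
    """One pass of Steps 1 and 2: return (mean envelope m, candidate c = x - m)."""
    inner = range(1, len(x) - 1)
    maxima = [i for i in inner if x[i - 1] < x[i] >= x[i + 1]]
    minima = [i for i in inner if x[i - 1] > x[i] <= x[i + 1]]
    if not maxima or not minima:
        return None  # too few extrema: x is treated as the residual
    upper = _envelope(x, maxima)
    lower = _envelope(x, minima)
    m = [(u + l) / 2.0 for u, l in zip(upper, lower)]
    c = [xi - mi for xi, mi in zip(x, m)]
    return m, c

m, c = sift_once([0.0, 1.0, 0.0, -1.0, 0.0, 1.0, 0.0])
```

A full decomposition would repeat this pass on c until the two IMF conditions hold, subtract the accepted IMF from the data, and continue on the remainder until `sift_once` returns `None`.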
Real engine data can be seen as a superposition of ideal data and noise data caused by
minor faults. To identify the characteristic noise components of aeroengines, four kinds
of fault data were prepared in this study, including compressor fault data, fan fault data,
high-pressure turbine fault data, and low-pressure turbine fault data. The distance of the
fault data from each data component is calculated to find the data component with the
most noise.
However, the components of the engine data differ in length from the four types of fault
data, and there is no linear correspondence between them. There are also phenomena
such as amplitude scaling and linear drift between data [18]. Using the Euclidean distance
directly to represent the similarity between each component of the engine data and
the four types of fault data may cause information flooding
to a certain extent, resulting in an unrealistic distance between each component of the
engine data and the four types of fault data. Therefore, a distance calculation method that
is compatible with various data lengths is required.
The dynamic time warping (DTW) algorithm measures the distance between two time series
of different lengths. DTW is widely used in the field of speech recognition, and it is also
suitable for comparing two similar aeroengine data sets. In this study, DTW is used to
calculate the minimum distance between the four types of fault data and c(i) or r(i).
The process of the DTW algorithm is shown in Figure 4. First, a matrix grid is created
whose length and width are the length of the engine data component and
the length of the fault data, respectively. Each element of the matrix is the Euclidean distance
d between the data at the corresponding positions of the engine data component and the fault
data. Then, taking the lower-left corner (1,1) of the matrix grid as the starting point, the path
moves to whichever of the right and upper neighbors holds the smaller
element, until it reaches the upper-right corner of the matrix grid. Finally,
the matrix elements along the path are accumulated. This
accumulated value is the distance between the engine data component and the fault data.
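A minimal sketch of a DTW distance follows. It is an illustration rather than the implementation used in the study: it applies the standard dynamic-programming recurrence, which also allows diagonal moves and returns the minimum accumulated distance over all monotone paths, whereas the description above walks greedily to the smaller of the right and upper neighbors. The function name `dtw_distance` and the use of the absolute difference as the pointwise distance are assumptions.

```python
import math

def dtw_distance(a, b):
    """Classic dynamic-programming DTW distance between sequences a and b."""
    n, m = len(a), len(b)
    if n == 0 or m == 0:
        return math.inf
    # cost[i][j]: minimal accumulated distance aligning a[:i] with b[:j]
    cost = [[math.inf] * (m + 1) for _ in range(n + 1)]
    cost[0][0] = 0.0
    for i in range(1, n + 1):
        for j in range(1, m + 1):
            d = abs(a[i - 1] - b[j - 1])  # Euclidean distance for scalar samples
            cost[i][j] = d + min(cost[i - 1][j], cost[i][j - 1], cost[i - 1][j - 1])
    return cost[n][m]

same_shape = dtw_distance([1.0, 2.0, 3.0], [1.0, 2.0, 2.0, 3.0])  # 0.0
```

The two sequences in the usage line have different lengths but the same shape, so the warped distance is zero, which is the property that makes DTW usable for components and fault data of unequal length.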
Figure 4. Principle of DTW algorithm.
Define the distance as dis. The distance of c(i) or r(i) and the four kinds of fault data
are given in Table 2, as are symbols and explanations.
In Table 2, the distance between c(i) and all fault data can be given by Equation (4).
$$dis_{c(i)} = \sum \left( dis_{c(i),FAN},\ dis_{c(i),COMP},\ dis_{c(i),HPT},\ dis_{c(i),LPT} \right) \qquad (4)$$
The distance between r(i) and all fault data can be given by Equation (5).
$$dis_{r(i)} = \sum \left( dis_{r(i),FAN},\ dis_{r(i),COMP},\ dis_{r(i),HPT},\ dis_{r(i),LPT} \right) \qquad (5)$$
The smaller the distance, the more noise it contains. The engine data component
corresponding to the smallest distance is selected as the noise component of the aeroengine.
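The selection rule of Equations (4) and (5) can be sketched as follows. The component names and the distance values are illustrative only and are not taken from the paper's tables; `total_distance` and `pick_noise_component` are hypothetical helper names.

```python
import math

FAULT_TYPES = ("FAN", "COMP", "HPT", "LPT")

def total_distance(dist_by_fault):
    """Sum the four per-fault DTW distances, as in Equations (4) and (5)."""
    return sum(dist_by_fault[f] for f in FAULT_TYPES)

def pick_noise_component(distances):
    """Return the component name with the smallest total distance.

    `distances` maps a component name to a dict of per-fault distances.
    """
    return min(distances, key=lambda name: total_distance(distances[name]))

# Illustrative values only.
example = {
    "imf": {"FAN": 2.0, "COMP": 3.0, "HPT": 1.5, "LPT": 2.5},
    "residual": {"FAN": math.inf, "COMP": math.inf, "HPT": math.inf, "LPT": math.inf},
}
noise_component = pick_noise_component(example)  # "imf"
```

An infinite distance simply never wins the comparison, so a component that cannot be aligned with any fault data is never selected as the noise component.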
2.3. Gated Recurrent Unit Autoencoder (GAE): A Proposed Denoising Autoencoder Model for
Aeroengine Data
DAE is a promising method for data denoising, which can be used for aeroengine data
denoising. It is a feature extractor with a denoising function whose purpose is to convert
noisy aeroengine data into clean aeroengine data.
In addition, the aeroengine data is a time series, and each element in the sequence
is affected by all the previous elements, which requires the model to be able to learn the
influence from the historical accumulation of the aeroengine data. Real engine data is
difficult to obtain, and the limited amount of data has difficulty supporting large-scale
models. This requires the model to have a simple structure, and the GRU (Gated Recurrent
Unit) model meets the requirements. The GRU was proposed by Cho et al. [19]. GRUs have
fewer parameters, making training faster and requiring less data [20].
Therefore, in this paper, the Gated Recurrent Unit Autoencoder (GAE) model is
proposed as a denoising module for aeroengine data by combining the autoencoder with
the GRU. The structure of the GAE model is shown in Figure 5. The model includes
encoders and decoders with special structures. The input end of the encoder and the output
end of the decoder are both GRU modules, and a three-layer autoencoder is used as the
connection in the middle. The three layers of the autoencoder are marked as h1 , h2 , and
h3 . In the coding stage, the aeroengine data is continuously input to the GRU and the
characteristic data is output through h1 and h2 . In the decoding stage, the feature data is
input into the h3 layer, and the denoised data is output through the GRU module.
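[Figure 5. Structure of the GAE model. The component containing the most noise and Gaussian noise are added to the input X(i); the encoder and the decoder are each built from GRU modules and are connected through the layers h1, h2, and h3.]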
In Figure 5, the input data is the aeroengine data with its own noise amplified and
Gaussian noise added, expressed as X(i)noise; the label data is the raw aeroengine
data, expressed as X(i). The GRU in Figure 5 has two gates: an update gate and a
reset gate. The basic structure of the GRU is shown in Figure 6.
In Figure 6, the definitions are as follows: z is the update gate, r is the reset gate, h is
the current state, and $h_{t-1}$ is the previous state. The update gate z controls the
amount of previous state information that is brought into the current state. The reset
gate r controls how much information from the previous state is written to the current
candidate set. The smaller the reset gate, the less information from the previous state is
written.
where Wr is the weight from the input and the hidden layer at the previous moment to the
reset gate r; Wz is the weight from the input and the hidden layer at the previous moment
to the update gate z; Wh is the weight from the input and the hidden layer at the previous
moment to the candidate state $\tilde{h}$; Wy
is the weight from the hidden layer to the output layer.
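The role of the two gates can be illustrated with a scalar GRU step. This is a sketch under stated assumptions, not the paper's equations: it follows the standard formulation of Cho et al., uses scalars instead of weight matrices, adopts the convention that z weights the previous state (in line with the description of the update gate above), and omits the output projection Wy. The function `gru_step` and the parameter names in `p` are hypothetical.

```python
import math

def sigmoid(v):
    return 1.0 / (1.0 + math.exp(-v))

def gru_step(x, h_prev, p):
    """One scalar GRU step; `p` holds hypothetical weights and biases.

    Each gate sees both the input x and the previous state h_prev.
    """
    r = sigmoid(p["w_r_x"] * x + p["w_r_h"] * h_prev + p["b_r"])  # reset gate
    z = sigmoid(p["w_z_x"] * x + p["w_z_h"] * h_prev + p["b_z"])  # update gate
    # Candidate state: the reset gate scales how much of h_prev is written in.
    h_cand = math.tanh(p["w_h_x"] * x + p["w_h_h"] * (r * h_prev) + p["b_h"])
    # Assumed convention: z weights the previous state, (1 - z) the candidate.
    return z * h_prev + (1.0 - z) * h_cand

keys = ["w_r_x", "w_r_h", "b_r", "w_z_x", "w_z_h", "b_z", "w_h_x", "w_h_h", "b_h"]
zero_params = dict.fromkeys(keys, 0.0)
h_next = gru_step(1.0, 2.0, zero_params)  # 1.0
```

With all parameters at zero both gates equal 0.5 and the candidate state is 0, so the new state is exactly half of the previous one, which makes the blending role of z easy to see.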
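[Figure 6. Basic structure of the GRU: the previous state and the input pass through the reset gate r and the update gate z (sigmoid activations) and a tanh unit that produces the candidate state, yielding the current state h.]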
The h2 layer’s output of the denoising autoencoder model is the feature code of the
aeroengine data. The encoding process is that the encoding function maps high-dimensional
aeroengine data vectors to low-dimensional feature vectors. The encoder function is the
activation function of the h2 layer of the autoencoder. The activation function is defined as
the sigmoid function. The feature vector output by the h2 layer is co. Then the aeroengine
data encoding process of the h2 layer is given by Equation (7).
$$co = \frac{1}{1 + e^{-w_{hid} \cdot y - b_{hid}}} \qquad (7)$$
where whid is the h2 layer weight matrix; bhid is the h2 layer bias vector.
The decoding process maps the feature vector co to the reconstructed aeroengine data.
The activation function of the h3 layer in the decoder is defined as a linear function, and the
feature vector output by the h3 layer is deco, which is given by Equation (8).
where wout is the h3 layer weight matrix; bout is the h3 layer bias vector.
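The encoding and decoding maps can be sketched with plain lists. The encoder follows Equation (7); the decoder form, w_out · co + b_out, is an assumption based on the statement that the h3 layer uses a linear activation with weight matrix wout and bias vector bout. The helper names `dense`, `encode`, and `decode`, and the small weights in the usage lines, are hypothetical.

```python
import math

def dense(weights, bias, vec):
    """Affine map: `weights` is a list of rows, one row per output node."""
    return [sum(w * v for w, v in zip(row, vec)) + b for row, b in zip(weights, bias)]

def encode(y, w_hid, b_hid):
    """h2 layer: sigmoid(w_hid . y + b_hid), as in Equation (7)."""
    return [1.0 / (1.0 + math.exp(-a)) for a in dense(w_hid, b_hid, y)]

def decode(co, w_out, b_out):
    """h3 layer with a linear activation: w_out . co + b_out (assumed form)."""
    return dense(w_out, b_out, co)

# Hypothetical weights: two inputs compressed to one feature, then expanded again.
co = encode([0.0, 0.0], w_hid=[[1.0, -1.0]], b_hid=[0.0])     # [0.5]
deco = decode(co, w_out=[[2.0], [4.0]], b_out=[1.0, 0.0])     # [2.0, 2.0]
```

The bounded sigmoid keeps the feature code in (0, 1), while the linear decoder leaves the reconstructed values free to take the range of the engine data.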
Finally, deco is output by the GRU module as denoised data, which is defined as
X(i)denoise . The loss function in this model is the mean absolute error (MAE). Define the
number of output nodes of the GAE model as nout , then MAE is given by Equation (9).
$$MAE\left(X(i)_{denoise}, X(i)_{noise}\right) = \frac{1}{n_{out}} \sum_{j=1}^{n_{out}} \left(X(i)_{denoise} - X(i)_{noise}\right)^2 \qquad (9)$$
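The text names the loss as the mean absolute error, while Equation (9) as printed squares the difference, which corresponds to a mean squared error. The sketch below shows both so the distinction is explicit; which of the two was actually used in training is not stated here, and the function names are hypothetical.

```python
def mean_absolute_error(pred, target):
    """Average of |pred_j - target_j| over the output nodes."""
    if len(pred) != len(target) or not pred:
        raise ValueError("inputs must be non-empty and of equal length")
    return sum(abs(p - t) for p, t in zip(pred, target)) / len(pred)

def mean_squared_error(pred, target):
    """Average of (pred_j - target_j)^2, the form printed in Equation (9)."""
    if len(pred) != len(target) or not pred:
        raise ValueError("inputs must be non-empty and of equal length")
    return sum((p - t) ** 2 for p, t in zip(pred, target)) / len(pred)

mae = mean_absolute_error([1.0, 2.0, 4.0], [1.0, 3.0, 2.0])  # 1.0
mse = mean_squared_error([1.0, 2.0, 4.0], [1.0, 3.0, 2.0])
```

The absolute error weights every deviation linearly, whereas the squared form penalizes large deviations more strongly, so the choice affects how sharply a step change is reconstructed.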
3. Experiment
All functions are written in Python 3.5 on a Windows system, with TensorFlow, an
open source machine learning platform, as the framework. The
hardware platforms involved in the calculation are an Intel(R) Core(TM) i3-9100F CPU @ 3.60 GHz
and an NVIDIA GeForce GTX TITAN X GPU.
The data involved in this paper is aeroengine exhaust gas temperature margin (EGTM) data.
Airlines take EGTM as a reference for aeroengine performance. EGTM refers to the difference
between the red line value of engine exhaust temperature and the exhaust temperature when
the engine takes off at full thrust. When the engine is washed, there are step changes in the EGTM data.
During denoising, these step changes may be recognized as noise and smoothed. Therefore,
selecting EGTM data as the noise reduction object can better verify the performance of
the proposed method.
Since EGTM data is an important indicator for evaluating the efficiency of engine washing,
the data used in this paper is EGTM data that includes the cycle numbers recorded for each
water wash. The EGTM data is collected at
the outlet of the low-pressure turbine of the engine, and Figure 7 shows the measurement point
of the exhaust temperature of the aeroengine.
EGTM data is real data provided by the OEM. In this study, the flight cycle was used
as a time unit to record the aeroengine data from takeoff to landing. Therefore, aeroengine
data is time series data. In practice, the airline provided three materials for an aeroengine,
including OEM data in Table 3, fault records in Table 4, and washing records in Table 5.
The data required for the study are spread across three tables. The data to be denoised
are collected as follows: according to the aircraft registration number (ID) and
water washing date in Table 5, the matching IDs in the OEM data in Table 3 are located to collect all
the corresponding EGTM data. The fault data are collected as follows: according to the
ID in the fault record in Table 4, the matching ID in the OEM data in Table 3 is located to collect all
the corresponding EGTM data. Figure 8 shows the aeroengine EGTM data. Figure 9 shows
all types of fault data.
Figure 8. The engine data involved in this paper (EGTM versus flight cycles, with washing times marked).
During the on-wing period, the engine was washed a total of nine times. Taking the
washing times as split points, the aeroengine data is split into eight segments. The first
six segments are used as the training set, and the last two segments are used as the testing
set. The eight segments are numbered sequentially to obtain Table 6.
Figure 9. The fault data involved in this paper (EGTM versus flight cycles). (a) EGTM data for compressor failure; (b) EGTM
data for fan failure; (c) EGTM data for high pressure turbine failure; (d) EGTM data for low pressure
turbine failure.
Figure 10. Components of the six-segment aeroengine training data (EGTM versus flight cycles). (a) No. 1 data residual
component; (b) No. 1 data IMF component; (c) No. 2 data residual component; (d) No. 2 data IMF
component; (e) No. 3 data residual component; (f) No. 3 data IMF component; (g) No. 4 data residual
component; (h) No. 4 data IMF component; (i) No. 5 data residual component; (j) No. 5 data IMF
component; (k) No. 6 data residual component; (l) No. 6 data IMF component.
In Tables 7 and 8, the distances between the residual components of No. 2, 3, and 6 in
the training data and the four types of fault data are all “∞”, indicating that these three sets
of residual component data hardly contain noise components. The distances between the
residual components of No. 1, 4, and 5 and the four kinds of fault data are all within 15,
while the distances between the IMF components of these three groups and the four kinds
of fault data are smaller than those of the residual components, which shows that more
noise is included in the IMF component of the aeroengine EGTM data. Therefore, the IMF
component is selected as the data component containing the most noise. The data
components that contain the most noise in the testing data (No. 7 and No. 8) are separated, as shown
in Figure 11.
Figure 11. Components that contain the most noise in the testing data. (a) Data No. 7 contains the
most noise data component; (b) Data No. 8 contains the most noise data component.
Figure 12. Relationship between the number of nodes in the h1 layer and reconstruction error (x-axis: nin, 15 to 25; y-axis: reconstruction error, 0.035 to 0.06).
Table 9. Table of the nodes number in the h1 layer and reconstruction error data.
From the data given in Table 9, the average reconstruction error is the lowest when
nin is 20 and slightly higher when nin is 19. However, the
standard deviation when nin is 20 is larger than that when nin is 19, being 0.0397 and 0.0272,
respectively. The minimum error when nin is 20 is smaller than that when nin is 19, while
the maximum error when nin is 20 is greater than that when nin is 19. Table 9 therefore shows that
the output is unstable when nin is 20, although the average reconstruction error is small,
whereas the output is stable when nin is 19, although the average reconstruction error is
slightly larger. Therefore, nin is determined to be 19 in this paper.
Similarly, to minimize reconstruction error, nhid must be determined. At present,
researchers have developed empirical formulas [21] for nhid , which help narrow the search
for optimal nodes. The empirical formula is given by Equation (10).
$$n_{hid} = \sqrt{n_{in} \times n_{out}} \qquad (10)$$
According to Equation (10), it can be roughly determined that nhid is around 19. The
optimal nhid is searched by the enumeration method in the interval of 15–25. After the
experiment is carried out 10 times, nhid and the reconstruction error are plotted in Figure 13.
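The search procedure can be sketched as follows. The sketch assumes that nout equals nin (19), which is consistent with the estimate of about 19 but is not stated explicitly, and it takes a hypothetical callback `evaluate` that trains a model with a given nhid and returns its reconstruction error. The function names are illustrative.

```python
import math
import statistics

def empirical_hidden_nodes(n_in, n_out):
    """Starting point from Equation (10): sqrt(n_in * n_out), rounded."""
    return round(math.sqrt(n_in * n_out))

def search_hidden_nodes(evaluate, candidates, repeats=10):
    """Run `evaluate(n_hid)` `repeats` times per candidate and summarise.

    `evaluate` is a hypothetical callback that trains a model with n_hid
    hidden nodes and returns its reconstruction error.
    """
    summary = {}
    for n_hid in candidates:
        errors = [evaluate(n_hid) for _ in range(repeats)]
        summary[n_hid] = (statistics.mean(errors), statistics.stdev(errors))
    best = min(summary, key=lambda n: summary[n][0])  # lowest mean error
    return best, summary

centre = empirical_hidden_nodes(19, 19)  # 19, assuming n_out equals n_in
candidates = range(15, 26)               # the interval 15-25 from the text
```

Keeping the standard deviation next to the mean matters because the selection also weighs the stability of the output, as was done when choosing nin.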
As can be seen from Figure 13, when nhid is 21, the reconstruction error is the lowest.
To analyze the results in more detail, in Table 10, the mean reconstruction error, standard
deviation, minimum error, and maximum error of 10 experiments are given. The above
four types of data in Table 10 all retain three significant figures.
[Figure 13. Relationship between nhid and reconstruction error (x-axis: nhid; y-axis: reconstruction error).]
From the data given in Table 10, both the mean reconstruction error and the standard
deviation are the lowest when nhid is 21. This means that the model's
output is stable at this setting. Therefore, the most suitable nhid is determined to be 21.
Figure 14. Comparison of reconstruction accuracy of three noise reduction models (GAE, DAE, and EMD; reconstruction error versus data number).
Tables 11 and 12 show that the MAE error of GAE training data is (0.0864 − 0.0747)/
0.0864 × 100% = 13.54% lower than that of DAE and 78.72% lower than that of EMD; the
MAE error of GAE test data is 3.11% lower than that of DAE and 76.80% lower than that
of EMD.
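The percentage quoted for the training data can be checked directly. The helper name `relative_reduction` is hypothetical; the two error values are the ones given in the text for DAE and GAE.

```python
def relative_reduction(baseline, value):
    """Percentage by which `value` is lower than `baseline`."""
    return (baseline - value) / baseline * 100.0

gae_vs_dae = relative_reduction(0.0864, 0.0747)
print(f"{gae_vs_dae:.2f}%")  # 13.54%
```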
The MAE error of GAE is relatively smaller than that of DAE, which reflects that
the GRU in GAE can learn the time series relationship of aeroengine data so as to better
reconstruct pure data. The combination of the GRU module and the DAE model effec-
tively improves the noise reduction effect, making the MAE error smaller than that of the
DAE model.
The MAE error of GAE is much smaller than that of EMD. The reason for this is that
the algorithm of EMD is susceptible to noise interference. The algorithm of EMD separates
the noise signal by calculating the envelopes of the extrema of the data. Noise makes the
extremum of the data change, and the envelopes follow. The changes of the envelope also
make the EMD algorithm unstable, resulting in a large error.
To summarize, Tables 11 and 12 reflect that the noise reduction effect of the DAE model
is slightly better than that of the EMD model, and that the mean reconstruction error of the GAE
model is smaller than that of the DAE model. The data reconstruction ability of the GAE
model is stronger than that of the DAE model and the EMD model. This indicates that the
GAE model can preserve the characteristics of the aeroengine data better.
3.3.2. The Effectiveness of the Proposed Noise Reduction Method Based on Hybrid Model
Is Verified
Figure 15 shows the reconstruction curves of DAE, EMD, and the proposed model on
the EGTM testing data of an aeroengine. In the figure, the original data are marked with
black curves, the denoised data of the proposed model are marked with green dotted lines,
the denoised data of the DAE model are marked with light blue, and the denoised data of
the EMD model are marked with dark blue.
Figure 15. Comparison of noise reduction effects of three noise reduction models (EGTM versus flight cycles; legend: original data, EMD, the proposed model, DAE, washing time).
The red vertical line in Figure 15 represents the time of washing the aeroengine. Due
to the washing, there is a step change in the original EGTM data. DAE, EMD, and the
proposed method perform differently under the influence of washing. The DAE
method significantly smoothed the step change of EGTM, which shows that the DAE model has
an underfitting problem in training. The EMD method separates the step change as noise
from the EGTM, thus smoothing the step change of the EGTM.
In Figure 15, the proposed model reconstructs the EGTM sequence and expresses
the abrupt change well. The denoised data of the proposed model is closer to the
original data than that of DAE and EMD, which shows that the denoised data of the proposed
model is more suitable for the subsequent analysis of washing effects. To express the
noise reduction effect of DAE, EMD, and the proposed model on
aeroengine data more accurately, the EGTM step value after aeroengine washing is investigated.
The influence of the denoising model on the subsequent evaluation of the washing
effect can be assessed by calculating the step size after washing, which is a
direct expression of the washing effect. The post-washing step size of
the denoised data is therefore an important indicator of whether the denoising model
will affect the evaluation of the washing effect. Since the data after washing increases
gradually rather than suddenly, the step size calculated in the paper is the difference
between the average values of the 10 data points before and after washing. Step sizes are
calculated starting from the No. 2 data.
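The step size defined above can be sketched as a short function. The name `step_size` and the convention that `wash_index` is the 0-based index of the first sample recorded after washing are assumptions; the window of 10 points comes from the text.

```python
def step_size(data, wash_index, window=10):
    """Mean of the `window` points after the wash minus the mean of the
    `window` points before it.

    `wash_index` is the 0-based index of the first sample recorded after
    washing (an assumed convention).
    """
    before = data[max(0, wash_index - window):wash_index]
    after = data[wash_index:wash_index + window]
    if not before or not after:
        raise ValueError("not enough data on one side of the wash")
    return sum(after) / len(after) - sum(before) / len(before)

egtm = [1.0] * 10 + [3.0] * 10   # illustrative values with a step of 2 at the wash
step = step_size(egtm, 10)        # 2.0
```

Applying the same function to the original and to the denoised data gives directly comparable step sizes, so any smoothing of the step by a denoising model shows up as a smaller value.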
Figure 16 visually shows the step sizes of the No. 2–8 raw data and the step sizes of the
denoised data of DAE, EMD, and the proposed model. Table 13 gives the step sizes of
each piece of training and testing data.
Since No. 1 is unwashed engine data, the step size is recorded from the No. 2 data.
Figure 16. The step sizes from data of all models (step size of the EGTM versus data number; legend: original data, the proposed model, DAE, EMD).
Table 14 gives the variance between the step sizes of the original data and the step sizes of the denoised
data of DAE, EMD, and the proposed model.
It can be seen from Table 14 that EMD has a large variance in both the testing set and
the training set. The variances of the DAE in the training set and testing set are much
smaller than those of the EMD, but they are still large. The variances of the proposed model
are extremely small in both the testing set and the training set. The step sizes calculated
from the denoised data of the proposed model are the closest to the original data. Therefore,
the proposed model has the least impact on the analysis of subsequent washing effects. It
is more suitable for aeroengine data noise reduction than other models.
Table 14. Step value variance between noise reduction data and original data.
4. Conclusions
To improve prediction accuracy, a denoising method for aeroengine data based on
a hybrid model is proposed in this paper. The method first amplifies the noise part of
the data and then adds Gaussian noise to the data as the input of the autoencoder. The
autoencoder is trained to reconstruct the original data from the amplified noise data so that
it can perform targeted noise reduction. In the paper, the proposed model is
compared with EMD and DAE, and the comparison shows that the proposed model can effectively
denoise the data and retain the step-change characteristics after aeroengine washing.
The autoencoder involved in the hybrid-model-based aeroengine data denoising
method is the GAE model proposed in this paper. The GAE model is composed of three
fully connected layers connecting two GRU modules. The model is good at working with
time series data. In tests with real aeroengine data, the reconstruction error of the GAE
model is the smallest compared with EMD and DAE, preserving the data features to
the greatest extent.
The model proposed in this paper has an ideal effect on the denoising of EGTM data
after aeroengine washing. This model is applicable for denoising various data with
sudden changes, such as the gas path data of aeroengines or gas turbines after maintenance.
In the future, we plan to collect more real data to improve our methods.
Author Contributions: Methodology, M.Z.; writing—original draft preparation, Z.Y., Z.C., and S.Z.;
writing—review and editing, Z.C.; funding acquisition, M.Z. and Z.C. All authors have read and
agreed to the published version of the manuscript.
Funding: This research was funded by the National Natural Science Foundation of China, grant
number U2133202; and the National Natural Science Foundation of China, grant number 51975157.
Institutional Review Board Statement: Not applicable.
Informed Consent Statement: Not applicable.
Data Availability Statement: Not applicable.
Conflicts of Interest: The authors declare no conflict of interest.
References
1. Chen, D.; Sun, J. Fuel and emission reduction assessment for civil aircraft engine fleet on-wing washing. Transp. Res. Part D
Transp. Environ. 2018, 65, 324–331. [CrossRef]
2. Xue, F.; Sun, X.; Dong, Z.; Yang, H.; Wang, H. Research on Data Denoising Algorithm Based on EEMD. Mech. Eng. Autom. 2021, 5,
9–11.
3. Lu, T.; Qian, W.; He, X.; Le, Y.; Huang, J. An improved EMD noise reduction method based on noise statistical characteristics. Bull.
Surv. Mapp. 2020, 11, 71–75.
4. Hu, K.; Cheng, Q.; Li, B.; Gao, X. The complex data denoising in MR images based on the directional extension for the undecimated
wavelet transform. Biomed. Signal Process. Control 2018, 39, 336–350. [CrossRef]
5. Sadooghi, M.S.; Khadem, S.E. A new performance evaluation scheme for jet engine vibration signal denoising. Mech. Syst. Signal
Process. 2016, 76, 201–212. [CrossRef]
6. Maragos, P.; Schafer, R.W. Morphological Filters. Part 1. Their Set-Theoretic Analysis and Relations to Linear Shift-Invariant
Filters. IEEE Trans. Acoust. Speech Signal Process. 1987, 35, 1153–1169.
7. Sedaaghi, M.H.; Daj, R.; Khosravi, M. Mediated morphological filters. In Proceedings of the 2001 International Conference on
Image Processing (Cat. No.01CH37205), Thessaloniki, Greece, 7–10 October 2001; pp. 692–695.
8. Yang, C.; Li, J.; Yang, W.; Yang, W. Denoising Method for Temperature Log Data Based on A Kalman Filter. Well Logging Technol.
2020, 2, 168–171.
9. Li, Y.; Wang, C.; Tian, Y.; Wang, S. Parameter-shared variational auto-encoding adversarial network for desert seismic data
denoising in Northwest China. J. Appl. Geophys. 2021, 11, 104428. [CrossRef]
10. Bourlard, H.; Kamp, Y. Auto-association by multilayer perceptrons and singular value decomposition. Biol. Cybern. 1988, 4,
291–294. [CrossRef] [PubMed]
11. Vincent, P.; Larochelle, H.; Bengio, Y.; Manzagol, P.A. Extracting and Composing Robust Features with Denoising Autoencoders.
In Proceedings of the 25th international conference on Machine Learning, Helsinki, Finland, 5–9 June 2008.
12. Song, H.; Gao, Y.; Chen, W.; Zhang, X. Seismic noise suppression based on convolutional denoising autoencoder. Oil Geophys.
Prospect. 2020, 6, 1210–1219.
13. Wang, X.; Zhao, Y.; Teng, X.; Sun, W. A stacked convolutional sparse denoising autoencoder model for underwater heterogeneous
information data. Appl. Acoust. 2020, 167, 107391. [CrossRef]
14. Peng, F.; Gao, Y. BPSK Signal Denoise Based on Convolution Auto-Encoder Network. Inf. Commun. 2020, 8, 41–44.
15. Song, H.; Gao, Y.; Chen, W.; Xue, Y.J.; Zhang, H.; Zhang, X. Seismic random noise suppression using deep convolutional
autoencoder neural network. J. Appl. Geophys. 2020, 178, 104071. [CrossRef]
16. Kensert, A.; Collaerts, G.; Efthymiadis, K.; Van Broeck, P.; Desmet, G.; Cabooter, D. Deep convolutional autoencoder for the
simultaneous removal of baseline noise and baseline drift in chromatograms. J. Chromatogr. A 2021, 1, 462093. [CrossRef]
[PubMed]
17. Wu, Z.; Huang, N.E. A study of the characteristics of white noise using the empirical mode decomposition method. Proc. R. Soc.
London. Ser. A Math. Phys. Eng. Sci. 2004, 460, 1597–1611. [CrossRef]
18. Park, S.; Chu, W.W.; Yoon, J.; Won, J. Similarity Search of Time-Warped Subsequences via a Suffix Tree. Inf. Syst. 2003, 7, 867–883.
[CrossRef]
19. Chung, J.; Gulcehre, C.; Cho, K.H.; Bengio, Y. Empirical Evaluation of Gated Recurrent Neural Networks on Sequence Modeling.
arXiv 2014, arXiv:1412.3555.
20. Olah, C. Understanding LSTM Networks. 2015. Available online: http://colah.github.io/posts/2015-08-Understanding-LSTMs
(accessed on 9 March 2021).
21. Sequin, C.H.; Clay, R.D. Fault tolerance in artificial neural networks. In Proceedings of the 1990 IJCNN International Joint
Conference on Neural Networks, San Diego, CA, USA, 17–21 June 1990; pp. 703–708.
Article
Adaptive Rejection of a Sinusoidal Disturbance with Unknown
Frequency in a Flexible Rotor with Lubricated Journal Bearings
Gerardo Amato 1 , Roberto D’Amato 2,3, * and Alessandro Ruggiero 4
1 Nonlinear and Adaptive Controls Laboratory, Department of Electronic Engineering, University of Rome
“Tor Vergata”, Via del Politecnico, 00133 Rome, Italy; amato@ing.uniroma2.it
2 Escuela Técnica Superior de Ingeniería y Diseño Industrial, Universidad Politécnica de Madrid,
Ronda de Valencia, 28012 Madrid, Spain
3 Structural Materials Research Center (CIME-UPM), Departamento de Ciencia de Materiales ETS de Ingenieros
de Caminos, Canales y Puertos, C/Profesor Aranguren S/N, 28040 Madrid, Spain
4 Department of Industrial Engineering, University of Salerno, Via Giovanni Paolo II, 84084 Fisciano, Italy;
ruggiero@unisa.it
* Correspondence: r.damato@upm.es
which consists of a rotor (journal) that rotates inside a bearing with a diameter slightly
larger than that of the rotor, and the fluid film exists in the small space between the two
(meatus) [3]. In unsteady dynamical behavior, however, fluid film bearings can present
oscillating behavior (typically elliptical orbits) due to destabilizing cross-coupling forces
caused by the nonlinear fluid dynamic phenomena in the oil film [2,5]. To obtain acceptable
operating conditions, it is necessary to analyze the rotor and the journal bearing simultaneously,
as a coupled system, in the case of a flexible rotor. These types of dynamic systems exhibit a particular
type of self-excited vibration due to fluid dynamics phenomena in the oil film [2], known as
oil whirl and oil whip, which is characterized by subsynchronous precessional motions [6,7].
These vibrations appear when the subsynchronous vortex frequency reaches the natural
frequency of the system [8] and are typically characterized by high vibration amplitudes.
In recent years, research has shown that oil whirl phenomena are also generated when
the journal bearing runs on micropolar lubricant [9–11]. Many authors have studied the
effect of lubricant contamination [12] and the nonlinear behavior of oil-film main bearings
in rotating machines [13]. Furthermore, in order to avoid instability phenomena, the
performance of the bearing has been analyzed, studied and simulated considering factors such
as misalignment [14], elasticity of the bearing liner [15], dynamic conditions and coupled
surface roughness [16].
Mathematics 2022, 10, 1703
dynamic effects and with lubricant film cavitation conditions, an input–output feedback
linearizing controller can be implemented. The proposed control law is adaptive with
respect to the control and disturbance parameters (i.e., phase and frequency of the sinusoidal
disturbance, the latter corresponding to the rotor angular speed).
Authors                           Year   Source   Section Content
Poritsky                          1953   [8]      PS
Stanway and Burrows               1981   [22]     LR
Burrows and Sahinkaya             1983   [19]     LR
Lund                              1987   [2]      PS
Vance                             1988   [30]     PCS
Muszynska                         1988   [6]      PS
D’Agostino et al.                 2001   [7]      PS
Zhou and Shi                      2001   [21]     LR
Hamrock and Schmid                2004   [3]      PS
Sukumaran Nair and Prabhakaran    2004   [15]     PS
Das et al.                        2005   [14]     PS
Ypma                              2006   [36]     PCS
Prabhakaran Nair et al.           2007   [16]     PS
Lei and Palazzolo                 2008   [25]     LR
Carnevale and Astolfi             2008   [32]     PCS
Friswell et al.                   2010   [1]      PS
Hoad et al.                       2010   [34]     PCS
Vania et al.                      2012   [13]     PS
Avramov and Borysiuk              2012   [31]     PCS
Harika et al.                     2013   [12]     PS
Marino and Tomei                  2016   [27]     LR
Carnevale                         2016   [33]     PCS
Salazar and Santos                2017   [4]      PS
Zheng et al.                      2017   [23]     LR
Ruggiero et al.                   2018   [20]     LR, PCS
Ballnus et al.                    2018   [35]     PCS
Das and Guha                      2019   [10]     PS
Bhattacharjee et al.              2019   [11]     PS
D’Amato et al.                    2019   [29]     PCS
Marko et al.                      2020   [26]     LR
Kumar and Tiwari                  2020   [28]     LR
Chen et al.                       2021   [5]      PS
Tripathy and Bhattacharyya        2022   [9]      PS
Liu et al.                        2022   [18]     LR, PCS
Marko et al.                      2022   [17]     LR, PCS
D’Amato et al.                    2022   [24]     PCS

PS: Problem Statement
• flexible rotor with lubricated journal bearings;
• fluid dynamics;
• self-excited vibrations due to fluid dynamics;

LR: Literature Review
• flexible rotor bearings control;
• active rejection of vibrational phenomena;
• notch filtering;
• observer-based control;

PCS: Proposed Control Strategy
• adaptive noise cancellation-rejection;
• immersion and invariance;
• method comparison;
• numerical analysis methods;
where: $x = (x_c, y_c)^T$ and $z_j = (x_j, y_j)^T$ denote the disc center and journal center coordinates,
respectively; $d_w = [0, W/m]^T$ is a constant load term and $\vec{u}_x$ is the control input. The
vector of sinusoidal disturbances acting on the two coordinate dynamics is denoted by
$d_e(t, \omega) = u[\sin(\omega t + \varphi_{01}), -\cos(\omega t + \varphi_{02})]^T$ ($\varphi_{01,2}$ unknown initial phases), so that

$$\dot{d}_e(t, \omega) = \Omega \cdot d_e(t, \omega) \;\Rightarrow\; \dot{d}_e(t, \omega) = \begin{bmatrix} 0 & -\omega \\ \omega & 0 \end{bmatrix} \cdot \begin{bmatrix} u \sin(\omega t + \varphi_{01}) \\ -u \cos(\omega t + \varphi_{02}) \end{bmatrix} \qquad (2)$$
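The exosystem relation in Equation (2) can be checked numerically. The following is a minimal sketch, assuming a single common initial phase for both components and illustrative values for the amplitude and frequency (the function names and numbers are hypothetical, not taken from the paper). It compares a central finite-difference derivative of $d_e$ with the product $\Omega \cdot d_e$.

```python
import numpy as np

def disturbance(t, omega, u, phi):
    """d_e(t) = u * [sin(omega t + phi), -cos(omega t + phi)]^T (one common phase assumed)."""
    return u * np.array([np.sin(omega * t + phi), -np.cos(omega * t + phi)])

def exosystem_rhs(d, omega):
    """Right-hand side Omega @ d_e, with Omega = [[0, -omega], [omega, 0]]."""
    Omega = np.array([[0.0, -omega], [omega, 0.0]])
    return Omega @ d

# Illustrative values only (assumptions, not the paper's parameter set)
omega, u, phi = 2985.0, 1.0e-3, 0.7
t, h = 1.0e-3, 1.0e-8

# Central finite difference of d_e versus the exosystem model
d_dot_fd = (disturbance(t + h, omega, u, phi) - disturbance(t - h, omega, u, phi)) / (2.0 * h)
d_dot_model = exosystem_rhs(disturbance(t, omega, u, phi), omega)

# Mismatch, normalized by the derivative amplitude u * omega
max_rel_err = np.max(np.abs(d_dot_fd - d_dot_model)) / (u * omega)
```

With equal phases the two expressions agree up to the finite-difference error, which is what makes $\Omega$ a valid generator of the sinusoidal disturbance.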
The dynamics of the acceleration noise in Equation (2) has the canonical parametrization
($\Omega$) of a disturbance with ellipsoidal trajectories, whose dimensions depend on the values
of the system parameters, as found by the numerical simulations of the analytical model
Equation (1a) presented in [24,29]. Here, $\omega$ is the frequency of the disturbance and
corresponds to the operating rotation speed of the flexible rotor. The two matrices $\Lambda^{-1}$ and $\Sigma$ are
defined as follows
* , −1
$ % −1
$ % zx j −2ζπ (i−1) ey j −y j ex j ex j ey j
l −1 t, z j , e, i i · Λ t, z j , e, i = i + (1 − i ) · 8λ ·
zy j +2ζπ (i−1) ex j xj ey j −y j xj
(3)
$ % − μ10 −ω ζk πi (i−1) 2λ t0
Σ t, z j , e, ω, i + (1 − i ) · k ω ke
ω ζk πi (i−1)
t0
− μ10 − ke 2λ
k ω
where: $e = \sqrt{x_j^2 + y_j^2}$ denotes the orbit eccentricity; $i \in \{1, 2\}$ is the summary index,
which allows us to switch the model representation between uncavitated ($i = 1$) and cavitated
($i = 2$) operating conditions, respectively. In [1], it was shown, analytically and by numerical
simulations, that the dynamical time-variant matrix $\Lambda$ is always nonsingular with a high
determinant.
To reduce Equation (1a) into Equations (1b)–(3), the following analytical positions are
introduced in [24]
2πζ (1+2ε2 )
3
z= (1− ε2 )
ζ= 1
3/2 μ0 = μR L
c
2(1− ε2 )
σ= e
1/2 λ = σζ (4)
( c2 − e2 )
ρ0
ρ0 = 2RLp0 ε= e
c t0 = μ0
$\hat{\varphi}(t) = [\hat{\varphi}_1, \hat{\varphi}_2]^T$ is the phase estimate vector, which is responsible for matching at steady
state the initial conditions of the noise ($\varphi_0 = [\varphi_0, \varphi_0]^T = [\varphi_{01}, \varphi_{02}]^T$) in (2), provided
that the amplitude ($u$) and frequency ($\omega$) are known; $\phi_0 = \hat{\varphi}_i(0) = \mathrm{rand}(-2\pi, 2\pi)$ is a
random value chosen as the (arbitrary) initial value for the phase estimation. According
to [24], $\omega^2 \tilde{d}_e = \ddot{\tilde{x}} + k_1 \dot{\tilde{x}} + k_2 \tilde{x}$ is a known term, which has been modeled in Equation (6) in
terms of residual spurious dynamics, under the assumption that the asymptotic control
$\hat{u}_{x0}$ (linked to the estimation of the displacement, Equation (7)) has already converged to its
target value. Equation (6) is written according to a linear parametrization of the estimation
error $\tilde{d}_e(t, \omega) = d_e - \hat{d}_e$ [37]: it appears as the product of the regressor $\Phi(t, \omega)$ and the parameter
estimation error, namely $\tilde{d}_e = \Phi^T(t, \omega) \cdot \tilde{\varphi}(t)$ (see [24]).
The control gains $k_1$, $k_2$, $k_{\varphi 1}$, $k_{\varphi 2}$ and $c_\eta$ are found by trial-and-error procedures and are
reported in Table 2.
The phase estimator part of Equation (6) is devised because the main problem in sinusoidal
noise rejection is the knowledge of the initial disturbance phase [27,38], so it acts as a
dynamic Phase-Locked Loop (PLL [39]). The displacement estimation part of Equation (7)
represents a reduced-order observer of the dynamical subsystem $(\dot{x}_c, \dot{y}_c, x_j, y_j)$ of the overall
system Equation (1b). It is required because of the technical difficulty of installing a sensor to
measure the center position of the journal. For the same reason, even if an
analytical value of $z_j$ were computed from Equation (1b), the uncertainty on
the initial condition $z_{j0} = z_j(0)$ may still have a fundamental role in the vibration attenuation
at steady state.
Remark 1. It is noteworthy that only the disc center position ($x$), speed ($\dot{x}$) and acceleration ($\ddot{x}$)
measurements are required and fed back for control closed-loop adaptation Equation (7), as well as
for frequency estimation (Figure 2). ♦
estimator (FE). This module processes the analytical output $y_{an}$ of the closed-loop system
Equations (1a)–(7) in loop with the online frequency estimate $\hat{\omega}(t)$. Finally, the FE
module feeds back to the control loop the value of the estimated frequency $\omega_{est}$, which is used
in the ANC-TC module as an input.
Remark 2. The following computation is performed for a single scalar component $d_{ei}$ only, since the
analyses of the two components are analogous and the two disturbance components are shifted by
$\pi/2$, so that the estimation is shared between the two components [24]. By convention, let the first
component $y = d_{e1} = u \sin(\omega t + \varphi_0)$ be used. ♦
Consistently with the notation used in Equations (5)–(7) (see [24]), by calling ˆ and
ς̂ the online state and parameter estimates of the disturbance, respectively, the following
injected scalar estimation errors [18] are defined as follows
& '
. .
z (y) = k y + ˆ k2 + ς − = β (y) + (α ) ˆ − ;zς (y) = y ˆ + { ς̂ − ς } = β ς (y, ˆ ) + ς+ (9)
where k > 0 is an arbitrary control parameter reported in Table 2. The two errors [z , zς ] in
= ˆ − and ς+ = ς̂ − ς. They contain
Equation (9) differ from the classical error definition +
$ %
two injection terms [β (y), β ς (y, ˆ )] acting as a dynamical correction; α = k2 + ς > 0
acts a kind of scaling. The Equation (9) is written as follows
By setting at zero the estimation errors z (y), zς (y) in Equations (9) and (10), it follows that
est = k2 + ς est ˆ + k y;ς est = ς̂ + y ,
ˆ (11)
$ %
where, est (0) = k2 + ς est (0) ˆ (0) + k y(0) and ς est (0) = ς̂(0) + y(0)ˆ (0).
The two asymptotic estimates [est , ς est ] are found as with a PI (proportional integral)
law with respect to the measured output y. Indeed, both (α ) ˆ and ς̂ turn out to be an
integration process, α denotes the gain of the integral part, while both the two injection
terms represent the proportional part. To prove this point, derive in time the errors [z , zς ](y)
in Equation (9). From Equation (8) it follows that
. . . . .
z (y) = (α )ˆ + ςy + k ;zς (y) = ς̂ + ˆ + y ,
ˆ (12)
The term Δ(y, ,ˆ ς̂) is a function of known signals. By setting at zero the quantities
between braces in both Equations (13a) and (13b), the two scalar adaptation laws for
(,
ˆ ς̂)(t) are found as follows
. . . !
ˆ = −y − k ,ˆ ς̂ = − y ˆ + k2 ˆ 2 + k yˆ + ˆ 2 ς̂ + y ˆ 3 , (14)
where, from Equations (6) and (8), ˆ (0) = u ω0 cos(ω0 t + ϕ̂i (0)) and ς̂(0) = ω02 .
Consequently, Equations (13a) and (13b) become
.
z −k 0 z
. = · . (15)
zς −ˆ −ˆ 2 zς
The first-row of Equation (15) makes that z → 0 exponentially. However, this is not
sufficient to construct an estimation of the state variable since the overall convergence
of the estimation is linked to the convergence of all the error dynamics [z , zς ]. However,
! . .
ˆ ς̂ = [0, 0],
once the two adaptation laws have reached their steady-state values, namely ,
the asymptotic estimates for [est , ς est ] = [, ω 2 ] are given by Equation (11) (see the proof in
Section 3). Note that ω0 may be the rough initial estimate of the operating rotor frequency,
taken from the rotor actuator datasheet as the steady-state operation value. As mentioned
above, the estimates of Equation (11) are two PI, with integral parts (α ) ˆ and ς̂, as may be
inferred from Equation (14).
noise, which is a source of uncertainty. Then, the ANC-FE control Equations (8)–(14) with
measured output noise ($y(t, \omega)$) cannot be directly implemented in closed loop. For this
purpose, the value of the output noise $y$, which is used as a known quantity in the ANC-FE
Equations (8)–(14), is reconstructed analytically from Equation (1b). By using position,
speed and acceleration measurements of the disc center coordinates $(\ddot{x}, \dot{x}, x)^T$ and an
analytical chasing of the journal center displacement ($z_J(\omega_0)$), the analytical output noise
($y_{an}$) is computed from Equations (1b)–(3) as follows
$$y_{an}(t, z_j, \hat{\omega}) \triangleq -\left\{ \ddot{x}(t) + \frac{k}{m}\left( x - z_j(t, \hat{\omega}(t)) \right) + d_w - \vec{u}_x \right\} \qquad (16)$$
. t $
(
% $ %
)
1
z j (t, ω̂ (t), i )z j0 + ke3 l −1 τ, z j , e, i Σ τ, z j , e, ω̂ (τ ), i z j (τ, ω̂ (τ )) + x (τ ) dτ (17)
0 μ0
where: $z_{j0}$ is the arbitrary initial condition of the analytical chasing $z_j(t, \hat{\omega}(t), i)$ of the
journal displacement ($z_j$); $z_j(t, \omega, i)$ is a function of the frequency through the matrix
$\Sigma(t, z_j, e, \omega, i)$ in Equation (3); $e$ is the eccentricity computed in the analytical chasing
coordinates ($z_j$). From Equations (1b) and (16), it follows

$$y_{an}(t, z_j, \hat{\omega}) = \begin{bmatrix} \vdots \\ y_{an_i} \\ \vdots \end{bmatrix} = \ddot{d}_e(t, \omega) + \frac{k}{m}\left( -z_j(t, \omega) + z_j(t, \hat{\omega}(t)) \right) = \ddot{d}_e(t, \omega) + \delta_y(t, z_{j0}, \omega, \hat{\omega}(t)) \equiv y_{an}(t, z_j, \omega, \hat{\omega}) \qquad (18)$$

where, from (8), $\ddot{d}_{ei}(\omega, t) \propto d_{ei} = y$, so that $y_{an_i} \propto \ddot{d}_{ei}(\omega, t)$. The constant proportional
factor is the negated square of the unknown frequency, $-\omega^2$, which, from (11), is absorbed in the
tuning procedure of $k$. Then, $y_{an}$ and the bounded disturbance $d_e(t, \omega)$ are consistent
since, eventually, the two sinusoidal vibrations have the same frequency ($\omega$). Indeed, from
Equation (18), $y_{an}(t, z_j, \omega, \hat{\omega})$ is a function of $\omega$. We now state the main assumption in
this paper.
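The proportionality between a sinusoid and its second derivative, with factor $-\omega^2$, is what makes the frequency observable from a single scalar signal. The sketch below illustrates only this relation; it is not the immersion-and-invariance estimator of the paper. It assumes a uniformly sampled, noise-free sinusoid, approximates the second derivative by central differences, and fits $\omega^2$ in the least-squares sense. The function name and all numerical values are hypothetical.

```python
import numpy as np

def frequency_from_samples(y, dt):
    """Least-squares fit of omega in y'' = -omega^2 * y from uniformly sampled y."""
    # Central second difference approximates y'' at the interior samples
    y_ddot = (y[2:] - 2.0 * y[1:-1] + y[:-2]) / dt**2
    y_mid = y[1:-1]
    omega_sq = -np.dot(y_ddot, y_mid) / np.dot(y_mid, y_mid)
    return np.sqrt(omega_sq)

# Illustrative values only (assumptions, not the paper's simulation setup)
omega_true, u, phi0, dt = 2985.0, 1.0e-3, 0.4, 1.0e-5
t = np.arange(0.0, 0.05, dt)
y = u * np.sin(omega_true * t + phi0)

omega_hat = frequency_from_samples(y, dt)
```

The small residual bias of this batch estimate comes from the finite-difference approximation and shrinks with the sampling step; an online estimator such as the one in Equations (8)–(14) avoids differentiating the signal twice.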
Assumption 1. The analytical chasing error $\delta_y(t, z_{j0}, \omega, \hat{\omega}(t))$ in Equation (18) is bounded and
small enough that $y_{an}(t, z_j, \hat{\omega})$ is assumed at the same frequency of $\dot{d}_e(t, \omega) = \Omega\, d_e(t, \omega)$.
From Equations (8) and (16)–(18), according to Remark 2, and under Assumption 1,
the asymptotic adaptation laws Equations (11)–(14) are updated considering the
computation for only one component $y_{an_i}$, as follows
$ %
est = k2 + ς est ˆ + k y ani ; !
. . .
ς est = ς̂ + y ani ;
ˆ ˆ = −y ani − k ;
ˆ ς̂ = − y ani ˆ + k2 ˆ 2 + k y ani ˆ + ˆ 2 ς̂ + y ani ˆ 3 ,
#
ωest = lim ω̂ (t) = lim ς est (t) (19)
t→∞ t→∞
$ %
with est (0) = k2 + ς est (0) ˆ (0) + k y ani (0); ς est (0) = ς̂(0) + y ani (0)ˆ (0);
ˆ (0) = u ω0 cos(ω0 t + ϕ̂i (0)) ; ς̂(0) = ω02 .
Equations (16), (17) and (19) constitute the frequency estimator (FE) equations.
Equation (15) holds as a function of $y_{an_i}$.
By absorbing Equations (16), (17) and (19) in the ANC-TC Equations (5)–(7), the
ANC-FE control, Equations (19)–(20c) with analytical output noise ($y_{an}(t, z_j, \hat{\omega})$), is
implementable in closed loop as follows
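$$\begin{cases} \hat{u}_x \triangleq \hat{u}_{x0} + \ddot{\hat{d}}_e(t, \omega_{est}) \\ \hat{u}_{x0} \triangleq \dfrac{k}{m} x + d_w + \hat{v}_x \\ \hat{v}_x \triangleq -\dfrac{k}{m} \hat{z}_j + \ddot{x}_r - k_1 \dot{\tilde{x}} - k_2 \tilde{x} \end{cases} \qquad (20a)$$

$$\begin{cases} \ddot{\hat{d}}_e(t, \omega_{est}) \triangleq \omega_{est}^2 \begin{bmatrix} -u \sin(\omega_{est} t + \hat{\varphi}(t)) \\ u \cos(\omega_{est} t + \hat{\varphi}(t)) \end{bmatrix} \\ \hat{\varphi}(t, \omega_{est}) \triangleq \displaystyle\int_0^t \begin{bmatrix} k_{\varphi 1} & 0 \\ 0 & k_{\varphi 2} \end{bmatrix} \Phi(\tau, \omega_{est}) \cdot \left( \ddot{\tilde{x}} + k_1 \dot{\tilde{x}} + k_2 \tilde{x} \right) d\tau \\ \Phi(t, \omega_{est}) \triangleq u \begin{bmatrix} \cos(\omega_{est} t + \phi_0) & 0 \\ 0 & \sin(\omega_{est} t + \phi_0) \end{bmatrix} \end{cases} \qquad (20b)$$

$$\begin{cases} \hat{z}_j \triangleq -\dfrac{m}{k} \left( \hat{\eta} + c_\eta \cdot \dot{x} \right) \\ \dot{\hat{\eta}} \triangleq -c_\eta \cdot \hat{v}_x + c_\eta \cdot \hat{\eta} + c_\eta^2 \cdot \dot{x} \end{cases} \;\Rightarrow\; \hat{\eta}(0) = 0, \quad \hat{z}_j(0) = -\dfrac{m}{k} \cdot c_\eta \cdot \dot{x}(0), \qquad (20c)$$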
where, in Equations (20a)–(20c), the asymptotic estimate $\omega_{est} = \hat{\omega}(t)|_{t \to \infty}$ (Equation (19))
replaces the constant $\omega$ as the constant frequency parameter in the phase estimator
(compare Equations (5)–(7) with Equations (20a)–(20c)).
At steady state, when all the estimated terms have converged (i.e.,
$\hat{v}_x \to v_x$, $\hat{z}_j \to z_j$, $\hat{d}_e \to d_e$), the closed-loop system, Equations (1b)–(4) and (19)–(20c),
is regulated by $\hat{u}_x \to u_x$ around the reference $x_r$ (Figure 2).
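To see why the phase estimate matters for cancellation, the first row of Equation (20b) can be evaluated for different phase errors. The following is a minimal sketch, assuming a known frequency, a single scalar phase shared by both components, and illustrative numerical values (all names and numbers are hypothetical). For a phase error $\Delta\varphi$, the norm of the uncancelled acceleration is $2 u \omega^2 |\sin(\Delta\varphi / 2)|$, which vanishes only when the phase is matched.

```python
import numpy as np

def dist_accel(t, omega, u, phase):
    """omega^2 * [-u sin(omega t + phase), u cos(omega t + phase)]^T, first row of Eq. (20b)."""
    arg = omega * t + phase
    return omega**2 * np.array([-u * np.sin(arg), u * np.cos(arg)])

def residual_norm(t, omega, u, phi0, phi_hat):
    """Norm of the disturbance acceleration left uncancelled by the estimate."""
    return np.linalg.norm(dist_accel(t, omega, u, phi0) - dist_accel(t, omega, u, phi_hat))

# Illustrative values only (assumptions)
omega, u, phi0, t = 2985.0, 1.0e-3, 0.3, 0.0123
phase_errors = np.array([0.0, 0.1, 1.0])

residuals = np.array([residual_norm(t, omega, u, phi0, phi0 - e) for e in phase_errors])
# Closed-form residual for a pure phase mismatch
expected = 2.0 * u * omega**2 * np.abs(np.sin(phase_errors / 2.0))
```

Because the residual scales with $\omega^2$, even a small phase mismatch leaves a large uncancelled acceleration at high rotor speeds, which is consistent with the emphasis placed on the phase estimator.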
Remark 3. The idea of frequency estimation, namely Equations (8)–(15), is inspired by the immersion
and invariance (I&I) approach and is borrowed from the algorithm proposed in [32]. Nevertheless,
the novelty of applying this technique to a rotordynamic model Equation (1a) [24,30] consists of a
further manipulation. Equations (16)–(19) have been introduced with the aim of reconstructing
analytically ($y_{an}$) an unavailable signal, the output noise ($y$). The signal chasing Equation (17)
is possible by virtue of the closed-form formulation, Equations (1b)–(4), presented in [24], which
allows a reliable numerical extraction of the coordinates of the journal disc center to be performed,
with a very high determinant value of the matrix $\Lambda$, by integration of Equation (1b).
In particular, the system Equation (8) of dimension 3 [ς, , y ani ] is immersed in a wider space
ˆ y ani ]. Then, the observer problem is attracted by the surface at z = 0
with 5 dimensions [ς, ς̂, , ,
and zς = 0, where it remains at steady-state, so that the resultant invariant locus is a manifold
(Figure 3). ♦
Remark 4. It is worth highlighting that Equation (20c) is identical to Equation (7); that is, the
reduced-order observer, which estimates the $z_j$ displacement, acts independently of the other
control blocks. The FE controller designed in this paper (Equations (16)–(19)), based on the analytical
output $y_{an}$, acts as a plug-in estimation block downstream of the ANC-TC Equations (5)–(7)
proposed in [24]. This property is exploited in the control convergence proof. ♦
Remark 5. The ANC-FE control, Equations (19) and (20a)–(20c), is a general-purpose control with
respect to the actual operating condition of the lubricated rotordynamics, namely cavitated/uncavitated. Indeed,
according to the notation in Equations (1b)–(4), the matrices $\Lambda^{-1}(t, z_j, e, i)$ and $\Sigma(t, z_j, e, \hat{\omega}(t), i)$ in
Equation (17) are functions of the parameter $i$. ♦
Proposition 1. Consider the lubricated rotordynamic model Equations (1b)–(4) and $y = d_{ei}(t, \omega)$
Equation (8) as the sinusoidal output noise from the exosystem (2), with unknown constant frequency
($\omega$) and initial phase ($\varphi_0$). Consider the reference dynamics $\vec{x}_r = [x_{cr}, y_{cr}]$ for the disc center
coordinates, with $\tilde{x} = (x - x_r)$.
From Equations (9) and (16)–(20c), ϕ + = ( ϕ0 − ϕ̂(t)), + z j = z j − ẑ j , z (t, y ani ) =
$ $ 2 % % $ %
−est + k + ς est ˆ + k y ani and zς (t, y ani ) = −ω̂ 2 (t) + ς̂ + y ani ˆ .
Under the Assumption 1, the û x control in Equations (20a)–(20c) makes that for the closed-loop
system Equations (1b)–(4) and (19)–(20c):
i. the equilibrium point (z (y ani ), ˆ (y ani )zς (y ani )) = (0, 0) is asymptotically stable;
$ %
ii. est = k2 + ς est ˆ + k y ani . is an asymptotic estimate of the analytical output noise deriva-
. /
tive y ani and ωest = lim ω̂ (t) = ς̂ + y ani ˆ is an asymptotic estimate of the rotor
t→∞
operating
$ % frequency ω;
iii. ϕ z j , x+ = (0, 0, 0) is exponentially stable (thesis from [24]).
+, +
3.2. Proof
Let V1 (t, z , zς ) be a suitable Lyapunov function
z2 z2ς
V1 (t, z , zς ) = + ≥ 0. (21)
2 k 2
.. . . . . $.% .
V 1 (t, z , zς ) = −2z z − zˆ ς z − ˆ zς z − z ˆ ς z − ˆ 2 z2ς − ˆ 2 z2ς
$ % (23)
= 2k z2 − zς z (−y ani − k ˆ ) − z ˆ − $ ˆ − ˆ 2 zς %+ k z
z ˆ ς z
−2z
ˆ 2ς (−y ani − k ˆ ) − 2ˆ 2 zς −z ˆ − ˆ 2 zς .
..
The V 1 is bounded since under the Assumption 1 the analytical output noise y ani
is bounded and the estimate ˆ (s) = [−y ani (s)/(s + k )] is a passive filter of a bounded
.
signal, so it is bounded and also its derivative ˆ = (−y ani − k ˆ ) is bounded in turn.
. .
Then, B.2.1 Barbalat’s Lemma [40] applies on V 1 , so that lim V 1 (t) = 0. Consequently,
t→∞
ˆ ς ) → (0, 0) globally asymptotically.
(z , z $ % Now, according to [32] and from
Equations (10), (13) and (19), est = k2 + ς est ˆ + k y ani is an
/asymptotic estimate of ana-
.
lytical output noise derivative y ani , while ωest = ω̂ (t)t→∞ = ς̂ + y ani ˆ is an asymptotic
estimate of the rotor operating frequency ω. This proves the theses (i)–(ii).
The rest of the proof is inherited from [24].
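Let $V_2(t, \tilde{\varphi})$ and $V_3(t, \tilde{\eta})$ be introduced as two suitable Lyapunov functions

$$V_2(t, \tilde{\varphi}) = \frac{1}{2} \tilde{\varphi}^T \cdot \tilde{\varphi} \geq 0, \qquad (24)$$

$$V_3(t, \tilde{\eta}) = \frac{1}{2} \tilde{\eta}^T \cdot \tilde{\eta} \geq 0. \qquad (25)$$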
Deriving $V_2(t)$ Equation (24) in time, from Equations (6) and (20b) it follows

$$\dot{V}_2(t) = \tilde{\varphi}^T \dot{\tilde{\varphi}} = -\tilde{\varphi}^T \begin{bmatrix} k_{\varphi 1} & 0 \\ 0 & k_{\varphi 2} \end{bmatrix} \Phi(t, \omega_{est})\, \omega^2 \tilde{d}_e(t, \omega_{est}) = -\tilde{d}_e^T\, \omega^2 k_\varphi I_{(2 \times 2)}\, \tilde{d}_e \leq 0, \qquad (26)$$

where: $k_{\varphi 1} = k_{\varphi 2} = k_\varphi > 0$; $I_{(2 \times 2)}$ is the 2-by-2 identity matrix; the linear parametrization of
the estimation error in Equation (6) has been considered, i.e., $\tilde{d}_e(t, \omega_{est}) = \Phi^T(t, \omega_{est})\, \tilde{\varphi}(t, \omega_{est})$
(where $\omega_{est}$ has been considered constant at steady state so that it is replaced for the actual
constant $\omega$). Due to $\dot{V}_2 \leq 0$, $V_2(t)$ does not increase in $t \in (0, \infty)$. Then, $V_2$ is bounded
and so $\tilde{\varphi}$ is bounded. By time-deriving $V_2$ again, under the assumption of bounded $\Phi$, $\dot{\Phi}$,
Barbalat’s Lemma in [40] applies on $\dot{V}_2$. Then, $\lim_{t \to \infty} \dot{V}_2(t) = 0$ and $\tilde{d}_e \to 0$; consequently,
$\tilde{\varphi} \to 0$ asymptotically. To prove the exponential convergence of the $\tilde{\varphi}(t, \omega_{est})$ estimation error, the
Persistency of Excitation condition [40] is required, provided that $\exists\, T, k_T \in \mathbb{R}^+$ such that

$$\int_t^{t+T} \Phi(\tau, \omega_{est})\, \Phi^T(\tau, \omega_{est})\, d\tau \geq k_T I_{(2 \times 2)} > 0, \quad \forall t \geq 0, \qquad (27)$$

which is always trivially fulfilled, given the regressor definition in Equations (6) and (20b).
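The Persistency of Excitation condition in Equation (27) can be illustrated numerically for the diagonal regressor of Equation (20b). The following is a minimal sketch, assuming the window $T$ equals one disturbance period and using illustrative values for $u$, $\omega_{est}$ and the initial phase (function names and numbers are hypothetical). Over one period, the integral of $\Phi \Phi^T$ equals $u^2 (T/2)\, I_{(2 \times 2)}$, so any $k_T \leq u^2 T / 2$ satisfies the bound.

```python
import numpy as np

def excitation_gramian(t0, T, omega, u, phi_init, n=2001):
    """Trapezoidal approximation of the integral of Phi Phi^T over [t0, t0 + T].

    Phi(t) = u * diag(cos(omega t + phi_init), sin(omega t + phi_init)),
    so Phi Phi^T is diagonal and only its two diagonal entries are integrated.
    """
    taus = np.linspace(t0, t0 + T, n)
    arg = omega * taus + phi_init
    diag_entries = u**2 * np.stack([np.cos(arg) ** 2, np.sin(arg) ** 2])
    dt = taus[1] - taus[0]
    integral = dt * (diag_entries[:, 0] / 2.0
                     + diag_entries[:, 1:-1].sum(axis=1)
                     + diag_entries[:, -1] / 2.0)
    return np.diag(integral)

# Illustrative values only (assumptions)
omega, u, phi_init = 2985.0, 1.0e-3, 4.0
T = 2.0 * np.pi / omega          # one disturbance period
G = excitation_gramian(0.37, T, omega, u, phi_init)
min_eig = np.linalg.eigvalsh(G).min()
```

The smallest eigenvalue does not depend on the starting time of the window, which is the uniformity in $t$ that Equation (27) requires.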
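Now, consider, from Equations (7) and (20c), the following definitions of $\tilde{\eta}$, $\dot{\tilde{\eta}}$, $\tilde{v}_x$, $\tilde{u}_x$

$$\begin{aligned} -\frac{k}{m}\left( z_j - \hat{z}_j \right) &= \left( c_\eta \dot{x} + \eta \right) - \left( c_\eta \dot{x} + \hat{\eta} \right) \triangleq \tilde{\eta} \\ \dot{\eta} - \dot{\hat{\eta}} &= (-c v_x + c \eta) - (-c \hat{v}_x + c \hat{\eta}) = -c \tilde{v}_x + c \tilde{\eta} \triangleq \dot{\tilde{\eta}} \\ v_x - \hat{v}_x &= (u_x - \hat{u}_x) + \omega^2 \tilde{d}_e = \tilde{u}_x + \omega^2 \tilde{d}_e \triangleq \tilde{v}_x \\ -\frac{k}{m} \tilde{z}_j - \omega^2 \tilde{d}_e &\triangleq \tilde{u}_x \end{aligned} \qquad (28)$$

with $c_\eta < 0$ (see [24] for detailed derivation of Equation (28)). Deriving $V_3(t)$ Equation (25)
in time, from (28) it follows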
. . $ % 2
V 3 (t) = η+T · η+ = c · η+T η+ + (−c) · η+T u+x + ω 2 d+e ≤ 2c · η+2 + − 2c u+x + ω 2 d+e
$ % $ c%
= 2c · η+2 + − 2c ω 2 d+e 2 + − − k+ z j − ω 2 d+e
2 + (− c ) − k + 2 + 2+
m z j − ω de ω de
$ % $ % 2 k m (29)
c 2
≤ · η+ + − ω de −
c 2 + 2 c
− + 2 +
z j + ω de − + k
z j + ω de + 2ω 2 d+e
2 +
2 2 2 m m
= −2c ω 2 d+e 2 − 2cη+ω 2 d+e ≤ cη+2 − c ω 4 d+e 2 .
Equation (29) is obtained considering the triangular inequalities η+T u+x + ω 2 d+e ≤
2
2
2 η
1 +2 + u+x + ω 2 d+e 2 and − mk +
z j + (−ω 2 d+e ) ≤ − mk +
z j + − ω 2 d+e ; then,
the square of ξ (t) η+ + ω 2 d+e has been reconstructed, so that the inequalities
η+ω 2 d+e ≤ − 12 η+2 + ω 2 d+e 2 have been used.
Following this, $\tilde{d}_e(t)$ is bounded on $[0, \infty)$ so that, according to Lemma A.1 in [41], any
globally exponentially convergent observer for the modal disturbance $\tilde{d}_e(t)$ guarantees the
exponential convergence for $\tilde{\eta} \to 0$ and, consequently, for $\tilde{z}_j \to 0$ and $\tilde{u}_x = (u_x - \hat{u}_x) \to 0$
(see Equation (28)). This is enforced by the global exponential zero convergence of $\tilde{d}_e(t)$, as
has been proved in Equations (26) and (27).
Eventually, by replacing in Equation (1a) the control $\hat{u}_x$ Equation (20a) ($\hat{u}_x$ for $u_x$), it
follows that for $\tilde{u}_x \to 0$

$$\ddot{\tilde{x}} = -k_1 \dot{\tilde{x}} - k_2 \tilde{x} + \frac{k}{m} \tilde{z}_j - \ddot{\tilde{d}}_e = -k_1 \dot{\tilde{x}} - k_2 \tilde{x} - \tilde{u}_x = -k_1 \dot{\tilde{x}} - k_2 \tilde{x} \qquad (30)$$

is a Hurwitz system dynamics. Hence, also $\tilde{x} \to 0$ globally exponentially. This proves the
thesis (iii).
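The Hurwitz property of the limiting error dynamics in Equation (30) can be verified directly from the eigenvalues of its companion matrix. The following is a minimal sketch, assuming the state ordering $(\tilde{x}, \dot{\tilde{x}})$ and the value 2000 listed for $k_1$ and $k_2$ among the control gains (the function name is hypothetical).

```python
import numpy as np

def error_dynamics_matrix(k1, k2):
    """Companion matrix of x~'' = -k1 * x~' - k2 * x~ with state (x~, x~')."""
    return np.array([[0.0, 1.0],
                     [-k2, -k1]])

# Gains as listed for k1, k2 in the control-gain table
k1 = k2 = 2000.0
eigenvalues = np.linalg.eigvals(error_dynamics_matrix(k1, k2))

# Hurwitz: every eigenvalue has a strictly negative real part
is_hurwitz = bool(np.all(eigenvalues.real < 0.0))
```

For a second-order polynomial $s^2 + k_1 s + k_2$, positivity of both gains is already sufficient for the Hurwitz property, so the numerical check mainly shows how far the slowest pole lies from the imaginary axis.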
Remark 6. It has been proved that, under Assumption 1, the convergence of the asymptotic
frequency estimate FE in Equation (19), with its additional inner loop ($\hat{\omega}(t)$, $y_{an}(t, z_j, \omega, \hat{\omega})$)
in which the FE is processed, is standalone with respect to the rest of the control loop: it is a
plug-in estimation block, according to Remark 2. In this case, once $\omega_{est} \to \omega$, at steady state $\omega_{est}$
is considered as an internal parameter for the control loop. Then, the frequency estimation phase
is a warm-up process for the ANC-TC Equations (5)–(7) [34,35], which through Equation (19) is
transformed into Equations (20a)–(20c). ♦
Remark 7. From [24,32], as mentioned at the beginning of Section 2.2.1, the closed-loop system
from the input disturbance $d_e(t)$ to the output $x(t)$, including the phase estimator
Equation (20b), the journal displacement observer Equation (20c) and the asymptotic frequency
estimator Equation (19), results in a regulation loop unifying the parameter and state estimation
problems as reduced-order observer-based problems, with adaptive notch filtering
characteristics [24]. ♦
4. Results
4.1. Numerical Simulation Setup and Method Description
The proposed control is implemented in the Simulink MATLAB environment with
the parameter set as shown in Table 2 and with the control parameters reported in Table 3.
The results are presented in four simulation cases for cavitated and uncavitated operating
conditions with measured and analytical output noise.
The subscript “i” in the notation of the analytical output noise $y_{an_i}$ is omitted in the
graphs for editing convenience. Hence, $y_{an}$ in all figures refers to the single
component $y_{an_i}$, according to Equation (16) and Remark 2.
The signals $\ddot{d}_{ei}(t, \omega)$ and $z_j$ in the figures represent the actual sinusoidal disturbance
and the journal center vector coordinates, respectively, whose values are computed from
the simulated model differential equations. These signals, ($\ddot{d}_{ei}(t, \omega)$, $z_j$), would not be
available in practice, so their analytical chasing in simulation is reported only for the sake
of comparison with respect to the analytical signals.
Control Gains                        Values    Initial Conditions           Values
$c_\eta$ [s$^{-1}$]                  −400      $\hat{x}_j(0)$               0
$k_1$, $k_2$                         2000      $\hat{y}_j(0)$               0
$k_{\varphi 1}$                      50.66     $\hat{\varphi}(0)$ (rad)     [4, 4]
$k_{\varphi 2}$                      50.66     $\omega_0$ (rad/s)           2985
$k$                                  7.5       $z_{J0}$ (m)                 [0, −2 × 10$^{-5}$]
Bias and initialization issues can distort the estimated performance measures
obtained from a simulation. Several techniques surveyed in [34] can be used to
estimate online the length of the warm-up phase in the output data collection of a
simulation model, to understand, automatically, when this preliminary phase ends and when
the actual numerical simulation starts giving a reliable response. The same issue is tackled
in Bayesian parameter estimation [35], where the parameters of the model under study
are usually derived from the available data using optimization and sampling methods.
It is shown that a warm-up phase occurs (the asymptotic frequency estimation
phase) before the closed-loop ANC control can start its exponential hooking of the
exosystem output disturbance signal. In practice, such a length cannot be quantified, but
for the proposed technique, the warm-up phase length estimation is not required.
The simulations are designed to validate numerically the theoretical analytical
proof reported in Section 3. In practice, the settling time and trajectory
behavior of the phase estimate can be forecast, since its trend is exponential,
while this is not possible for the frequency estimate, which converges only asymptotically.
Hence, to highlight the differences in the convergence behaviors of the estimation errors,
further numerical evidence is provided. The Inexact Newton method [36] is applied to the
frequency estimation error and phase estimation error successions, while the convergence of
the overall disturbance error succession is considered only in time since it follows from
the previous two errors. The iterative Newton method is applied to the error successions in
both cavitated and uncavitated conditions, and only in the case of analytical output noise,
which is more interesting in practice.
To analyze and evaluate the order of convergence of the estimation errors, the
following point series are defined as the difference between the estimation dynamics terms
and the constant target values of each series ($\omega$ and $\varphi_0$). Denoting the successions of
frequency and phase estimation with $\omega_{est,y_{an}}\big|_n$ and $\hat{\varphi}_{-,y_{an}}\big|_n$, respectively, sampled at
discrete time instants $n$ (1 sample every 5 ms), the following

$$e_{\omega_n} = \omega_{est,y_{an}}\big|_n - \omega, \qquad e_{\varphi_n} = \hat{\varphi}_{-,y_{an}}\big|_n - \varphi_0 \qquad (31)$$

represent the successions of the frequency and phase estimation errors, respectively. Considering
the absolute value of the ratio between the $\{n+1\}$-element of each error succession
and the $\{n\}$-element raised to the power $p_{\omega,\varphi}$, as long as the limits of these ratios are constant for
$n \to \infty$, namely

$$\lim_{n \to \infty} \frac{\left| \omega_{est,y_{an}}\big|_{n+1} - \omega \right|}{\left| \omega_{est,y_{an}}\big|_n - \omega \right|^{p_\omega}} = \text{constant}, \qquad \lim_{n \to \infty} \frac{\left| \hat{\varphi}_{-,y_{an}}\big|_{n+1} - \varphi_0 \right|}{\left| \hat{\varphi}_{-,y_{an}}\big|_n - \varphi_0 \right|^{p_\varphi}} = \text{constant} \qquad (32)$$
parametrized by the coefficient pair $(k_{\omega,\varphi}, p_{\omega,\varphi})$. In Equation (33): $p_{\omega,\varphi}$ represents the
order of convergence for each error succession, Equations (31) and (32), while $k_{\omega,\varphi}$ is a
proportional factor, denoting the exponential convergence time-constant (when $p_{\omega,\varphi} = 1$).
The following numerical simulations are proposed to: (1) evaluate the convergence
performance of disturbance estimation errors in time; (2) verify that the initial frequency
estimation warm-up phase length is acceptable in practice in a real application (namely, the
asymptotic convergence occurs at $t < \infty$). In all four simulation scenarios, an initial
frequency estimation error of 5% and an initial phase estimation error of about 25% (about 70 deg)
are considered.
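The pair $(k, p)$ describing an error succession can also be recovered by a simple regression. The sketch below is not the Inexact Newton procedure used in the paper: it swaps in an ordinary least-squares fit in log-log coordinates, applied to a synthetic, linearly convergent sequence, and all names and values are hypothetical. It relies on the fact that $|e_{n+1}| = k\,|e_n|^p$ becomes the straight line $\log|e_{n+1}| = \log k + p \log|e_n|$.

```python
import numpy as np

def fit_convergence_order(errors):
    """Fit |e_{n+1}| = k * |e_n|**p by linear regression in log-log coordinates."""
    e = np.abs(np.asarray(errors, dtype=float))
    log_prev, log_next = np.log(e[:-1]), np.log(e[1:])
    p, log_k = np.polyfit(log_prev, log_next, 1)   # slope = p, intercept = log(k)
    return np.exp(log_k), p

# Synthetic error succession with geometric decay (assumption, for illustration)
e0, rate = 0.05, 0.9
errors = e0 * rate ** np.arange(60)

k_fit, p_fit = fit_convergence_order(errors)
```

A fitted order close to one with a factor below one indicates exponential (linear-order) convergence, which is the behavior expected for the phase estimate after the warm-up phase; real simulation data would first need zero or sign-changing samples handled before taking logarithms.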
Figure 7 shows the comparison (in offset view) of phase estimation performance
in time, in the four simulation cases. The ( ϕ̂unc,yan , ϕ̂unc,y ) represent the phase estimate
for the uncavitated case, with analytical and measured output noise, respectively; the
( ϕ̂cav,yan , ϕ̂cav,y ) represent the phase estimate for the cavitated case, with analytical and
measured output noise, respectively. In the graph, the warm-up phase edges, and time
interval labels, are marked-up in the four cases for illustration purposes.
Figure 7. Offset view of phase estimation in the four simulation cases, with warm-up phase edges.
Figure 8 shows the scatter plot of the $(k, p)$-parametrized successions of frequency
estimation error $e_{\omega_{n+1}} = k_\omega |e_{\omega_n}|^{p_\omega}$ (Figure 8A,B) and phase estimation error
$e_{\varphi_{n+1}} = k_\varphi |e_{\varphi_n}|^{p_\varphi}$ (Figure 8C,D), following Equations (31)–(33). Both cavitated and
uncavitated conditions with analytical output noise $y_{an}$ are considered. The timeseries in
Figure 8A,B are plotted considering the overall simulation time data. To evaluate whether
the convergent behavior of the phase estimates is exponential starting from the end of the
warm-up phase (according to Proposition 1), the timeseries in Figure 8C,D are plotted on the two
restricted simulation time intervals starting from 8.41 s and 1.59 s for the uncavitated and
cavitated cases, respectively, as shown in Figure 7.
The text boxes in Figure 8 report the fitting data statistics obtained applying the
iterative Inexact Newton method on the frequency and phase estimation error timeseries,
in cavitated and uncavitated conditions with analytical output noise. Such statistics are
used to evaluate the order of the convergence pω,ϕ in the four cases. The arrows indicate
the direction of the succession point distribution from eωn=0 to eωn→∞ , and from e ϕn=0 to
e ϕn→∞ , as defined in Equation (31).
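As an illustration of how an order of convergence can be read off an error timeseries, the following minimal sketch assumes the model $|e_{n+1}| = k |e_n|^{p}$ and estimates (k, p) by ordinary least squares on the log-log form $\log|e_{n+1}| = \log k + p \log|e_n|$. This is a simple stand-in chosen for illustration, not the Inexact Newton fit used for Figure 8; the function name and the synthetic error sequence are hypothetical.

```python
import math


def fit_convergence_order(errors):
    """Least-squares fit of log|e[n+1]| = log k + p * log|e[n]|; returns (k, p)."""
    xs, ys = [], []
    # Pair consecutive non-zero errors and move to log scale.
    for e_now, e_next in zip(errors[:-1], errors[1:]):
        if e_now != 0.0 and e_next != 0.0:
            xs.append(math.log(abs(e_now)))
            ys.append(math.log(abs(e_next)))
    n = len(xs)
    mean_x = sum(xs) / n
    mean_y = sum(ys) / n
    sxx = sum((x - mean_x) ** 2 for x in xs)
    sxy = sum((x - mean_x) * (y - mean_y) for x, y in zip(xs, ys))
    p = sxy / sxx                        # slope: order of convergence
    k = math.exp(mean_y - p * mean_x)    # intercept: proportional factor
    return k, p


# Synthetic, purely illustrative error sequence with e[n+1] = 0.9 * e[n].
errors = [0.05 * 0.9 ** n for n in range(200)]
k_fit, p_fit = fit_convergence_order(errors)
```

A fitted p close to 1 together with k < 1 corresponds to linear convergence of the succession, that is, an exponential decay of the error in time.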
Figure 9 shows the difference between the scalar analytical noise $y_{an}$ and the actual
disturbance component $\ddot{d}_{e_i}(t, \omega)$ for the uncavitated case (Figure 9A) and for the cavitated
case (Figure 9C). For ease of graph interpretation, given the high frequency of the signals,
a restricted time window of 0–0.0012 s was arbitrarily chosen out of the overall simulation time
of 12 s, in order to highlight the detail of the comparison over a few periods.
Figure 8. Convergence of frequency estimation error timeseries (A,B) and phase estimation error
timeseries (C,D) for uncavitated and cavitated condition, with analytical output noise.
Figure 9. Scalar analytical output noise $y_{an}$ and actual disturbance component $\ddot{d}_{e_i}(t, \omega)$ for uncavitated
and cavitated conditions (A,C); “i” component of the actual journal displacement $z_{j_i}$ and its analytical
reconstruction $z_{J_i}$ with the scaled difference $\delta_{y_i}$ for uncavitated and cavitated conditions (B,D).
In Figure 9B,D, the behavior of only one component of the journal displacement
coordinate vector, $z_{j_i}$, is shown in time. It is compared with the same component “i” of its
analytically reconstructed value $z_{J_i}$. Both Figure 9B,D contain a subplot showing the scaled
difference $\delta_{y_i} = -\frac{k}{m}\left(z_{j_i} - z_{J_i}\right)$, where $\delta_{y_i}$ is the acceleration injected error referred to
only one component of the vector $\delta_y\left(t, z_{j0}, \omega, \hat{\omega}(t)\right)$, as defined in Equation (18).
Figure 10 shows the journal displacement observation errors $\tilde{z}_j$, as defined according
to Equations (20c) and (28), in the four simulation cases. The signals $z_j = [x_j, y_j]^T$ are
not available in practice, but they are shown only for comparison with their corresponding
observation $\hat{z}_j = [\hat{x}_j, \hat{y}_j]^T$.
Figure 11. Hooking of the sinusoidal disturbance $\ddot{d}_{e_i}(t, \omega)$ by its estimate $\hat{\ddot{d}}_{e_i}(t, \omega_{est})$ in the case of
uncavitated condition with analytical disturbance ($y_{an}$).
Figure 12 illustrates the sinusoidal disturbance estimation errors $\tilde{\ddot{d}}_{e_i}(t, \omega)$ in the four
simulation cases.
Figure 12. Sinusoidal disturbance estimation errors $\tilde{\ddot{d}}_{e_i}(t, \omega)$ are reported for uncavitated and
cavitated conditions with analytical output noise (A,C) and for uncavitated and cavitated conditions
with measured output noise (B,D).
In Figure 13, the control vector û x = [û x,1 , û x,2 ] T (Equation (20a)) is plotted in the four
simulation cases. A zoomed plot, referred to by the dashed rectangle, highlights the detail
of the last transient variation in the control before the steady-state behavior.
Figure 13. Control vector û x = [û x,1 , û x,2 ] T for uncavitated and cavitated conditions with analytical
output noise (A,C) and for uncavitated and cavitated conditions with measured output noise (B,D).
Figure 14. Disturbance attenuation of disc center coordinates (xc , yc ) for uncavitated and cavitated
conditions with analytical output noise (A,C) and for uncavitated and cavitated conditions with
measured output noise (B,D).
is limited to a few tenths of a millimeter with respect to the ideal case with measured
output (Figure 10B,D). The exponential hooking of the disturbance $\ddot{d}_{e_i}(t, \omega)$ by the adaptive
noise cancellation control $\hat{\ddot{d}}_{e_i}(t, \omega_{est})$ (Equation (20b)) is illustrated in Figure 11. For the sake
of brevity, only the worst case (uncavitated condition with analytical output noise) is
reported; analogous results are obtained in the other three simulation cases. The preliminary
phase of the overall disturbance estimation is dictated by the asymptotical convergence
of the frequency estimate $\omega_{est}$. From that instant, the same closed-loop behavior of [24]
is recovered, and the last segment of the disturbance convergence is exponential.
The disturbance adaptation error $\tilde{\ddot{d}}_{e_i}(t, \omega)$, reported in Figure 12, recovers
the observer behavior (Figure 10) in the four cases. Moreover, the behavior and performance of the control
input (Figure 13) recover the behaviors of the displacement observation error (Figure 10)
and the sinusoidal estimation errors (Figure 11) in the four cases. A zoomed plot is reported
to highlight the sinusoidal trend of the two control vector components, oscillating at the
same frequency as the rejected disturbance (as shown also in [24]). Figure 13 shows that,
in each of the four cases, the control amplitude stabilizes at the same time at which
all the estimation errors converge. Comparing the rotor center coordinate stabilization
in the four cases, Figure 14 shows that the performance degradation in terms of vibration
amplitude, for analytical output noise with respect to the (ideal) measured output noise
case, is limited to a few tenths of a millimeter.
Overall, better estimation performance is obtained in the cavitated condition than in
the uncavitated one. The main figures of merit considered for the proposed adaptive
closed-loop control are the rotor vibration attenuation time (about 8.5 s in the worst case) and
the residual vibration amplitude (on the order of $10^{-1}$ mm), both of which are acceptable
values for a practical application. Similar results, in terms of estimation error convergence
time (on the order of seconds), are obtained in [17,18], which presented tests of an experimental
setup on thin steel strips [17] (order of millimeters) and an in silico validation on a
reversible cold strip rolling mill [18].
5.2. Conclusions
The aim of this study was to provide a mathematical model for a novel adaptive
noise cancellation (ANC) technique designed to stabilize a flexible rotor shaft supported by
two hydrodynamic full (short) journal bearings and affected by a sinusoidal disturbance
output noise with unknown frequency. The disturbance frequency corresponds to the rotor
operating angular speed (ω), which is driven by an external actuator, so that uncertainties
may arise in the actual frequency value due to fluctuations of the actuation operating point.
The adaptation with respect to the frequency estimation (FE) generalizes the ANC to the
novel ANC-FE.
As the main novelty of this study, inspired by immersion and invariance (I&I)
techniques, an asymptotical frequency estimation (FE) module is designed as a combination
of state observer and asymptotical parameter estimation. The FE operation represents
the warm-up phase of the overall adaptive noise cancellation control. The FE module is
externalized as an additive plug-in block, which processes the analytical reconstruction of
the output data downstream of the ANC closed-loop system (Figure 2).
The mathematical structural proof of the ANC-FE control theoretical formulation,
Proposition 1, is provided under Assumption 1, which requires that the analytical output
noise reconstruction is a signal at the same frequency as the disturbance. It follows that the
frequency estimation convergence is asymptotical, while the disturbance phase estimation and
the rotor center coordinate stabilization are exponential.
The mathematical results have been validated in silico by numerical
simulations performed in four scenarios: cavitated and uncavitated conditions with analytical
and measured output noise. Moreover, a data fitting analysis with the Inexact Newton
method (with 95% confidence) is performed on the frequency and phase estimation error
point series, in order to also validate Proposition 1 numerically, demonstrating that the
phase estimate succession approximates the exponential behavior better than the frequency
estimate succession. The orders of convergence obtained from the frequency estimation
timeseries are $p_{\omega,unc} = 0.9983$ and $p_{\omega,cav} = 1.005$; the order of convergence obtained from
the phase estimation timeseries is $p_\phi = 1$ for both uncavitated and cavitated conditions.
Considering the analytical output noise as the case of practical interest, the settling time of
disturbance rejection, and hence of the rotor center coordinate stabilization, is about 76% shorter
in the cavitated than in the uncavitated condition (2 s and 8.5 s, respectively).
Simulation shows that the analytical output noise is very close to the measured output
noise, supporting Assumption 1. The warm-up phase length is limited to less than 10 s,
which is acceptable as an initialization process duration in a real application. In future
investigations, the possible application of the proposed control technique to other complex
dynamical systems can be considered.
Author Contributions: Conceptualization, G.A., R.D. and A.R.; methodology, G.A. and R.D.;
software, G.A.; validation, G.A. and R.D.; formal analysis, G.A., R.D. and A.R.; investigation, G.A., R.D.
and A.R.; data curation, G.A.; writing—original draft preparation, G.A. and R.D.; writing—review
and editing, G.A., R.D. and A.R.; visualization, G.A. and R.D.; supervision, A.R. All authors have
read and agreed to the published version of the manuscript.
Funding: This research is funded by FARB 2019 University of Salerno (Italy).
Institutional Review Board Statement: Not applicable.
Informed Consent Statement: Not applicable.
Conflicts of Interest: The authors declare no conflict of interest.
References
1. Friswell, M.I.; Penny, J.E.T.; Garvey, S.D.; Lees, A.W. Dynamics of Rotating Machines; Cambridge University Press: Cambridge, UK,
2010; ISBN 9780511780509.
2. Lund, J.W. Review of the Concept of Dynamic Coefficients for Fluid Film Journal Bearings. J. Tribol. ASME US 1987, 109, 37–41.
[CrossRef]
3. Hamrock, B.J.; Schmid, S.R. Fundamental of Fluid Film Lubrication, 2nd ed.; CRC Press: Boca Raton, FL, USA, 2004; ISBN 0824753712.
4. Salazar, J.G.; Santos, I.F. Active tilting-pad journal bearings supporting flexible rotors: Part I—The hybrid lubrication. Tribol. Int.
2017, 107, 94–105. [CrossRef]
5. Chen, Y.; Yang, R.; Sugita, N.; Mao, J.; Shinshi, T. Identification of bearing dynamic parameters and unbalanced forces in a flexible
rotor system supported by oil-film bearings and active magnetic devices. Actuators 2021, 10, 216. [CrossRef]
6. Muszynska, A. Stability of whirl and whip in rotor/bearing systems. J. Sound Vib. 1988, 127, 49–64. [CrossRef]
7. D’Agostino, V.; Guida, D.; Ruggiero, A.; Senatore, A. An analytical study of the fluid film force in finite-length journal bearings.
Part I. Lubr. Sci. 2001, 13, 329–340. [CrossRef]
8. Poritsky, H. Contribution to the theory of oil whip. Trans. ASME 1953, 75, 1153–1161.
9. Tripathy, D.; Bhattacharyya, K. Analysis of a Hydrodynamic Journal Bearing of Circular Cross Section Lubricated by a
Magnetomicropolar Fluid. In Proceedings of the Lecture Notes in Mechanical Engineering; Springer: Singapore, 2022; pp. 1495–1502.
10. Das, S.; Guha, S.K. Non-linear stability analysis of micropolar fluid lubricated journal bearings with turbulent effect. Ind. Lubr.
Tribol. 2019, 71, 31–39. [CrossRef]
11. Bhattacharjee, B.; Chakraborti, P.; Choudhuri, K. Evaluation of the performance characteristics of double-layered porous
micropolar fluid lubricated journal bearing. Tribol. Int. 2019, 138, 415–423. [CrossRef]
12. Harika, E.; Bouyer, J.; Fillon, M.; Hélène, M. Measurements of lubrication characteristics of a tilting pad thrust bearing disturbed
by a water-contaminated lubricant. Proc. Inst. Mech. Eng. Part J. J. Eng. Tribol. 2013, 227, 16–25. [CrossRef]
13. Vania, A.; Pennacchi, P.; Chatterton, S. Dynamic Effects Caused by the Non-Linear Behavior of Oil-Film Journal Bearings in
Rotating Machines. In Proceedings of the Volume 7: Structures and Dynamics, Parts A and B; ASME: New York, NY, USA, 2012; p. 657.
14. Das, S.; Guha, S.K.; Chattopadhyay, A.K. Linear stability analysis of hydrodynamic journal bearings under micropolar lubrication.
Tribol. Int. 2005, 38, 500–507. [CrossRef]
15. Sukumaran Nair, V.P.; Prabhakaran Nair, K. Finite element analysis of elastohydrodynamic circular journal bearing with
micropolar lubricants. Finite Elem. Anal. Des. 2004, 41, 75–89. [CrossRef]
16. Prabhakaran Nair, K.; Sukumaran Nair, V.P.; Jayadas, N.H. Static and dynamic analysis of elastohydrodynamic elliptical journal
bearing with micropolar lubricant. Tribol. Int. 2007, 40, 297–305. [CrossRef]
17. Marko, L.; Saxinger, M.; Steinboeck, A.; Kugi, A. Cancellation of unknown multi-harmonic disturbances in multivariable flexible
mechanical structures. Automatica 2022, 137, 110123. [CrossRef]
18. Liu, L.; Shao, N.; Deng, R.; Ding, S. Immersion and invariance adaptive decentralized control for the speed and tension system of
the reversible cold strip rolling mill. Int. J. Adapt. Control Signal Process. 2022, 36, 785–801. [CrossRef]
19. Burrows, C.R.; Sahinkaya, M.N. Vibration control of multi-mode rotor-bearing systems. Proc. R. Soc. Lond. A 1983, 386, 77–94.
20. Ruggiero, A.; D’Amato, R.; Magliano, E.; Kozak, D. Dynamical simulations of a flexible rotor in cylindrical uncavitated and
cavitated lubricated journal bearings. Lubricants 2018, 6, 40. [CrossRef]
21. Zhou, S.; Shi, J. Active balancing and vibration control of rotating machinery: A survey. Shock Vib. Dig. 2001, 33, 361–371.
[CrossRef]
22. Stanway, R.; Burrows, C.R. Active vibration control of a flexible rotor on flexibly-mounted journal bearings. J. Dyn. Syst. Meas.
Control 1981, 103, 383–388. [CrossRef]
23. Zheng, S.; Li, H.; Peng, C.; Wang, Y. Experimental Investigations of Resonance Vibration Control for Noncollocated AMB Flexible
Rotor Systems. IEEE Trans. Ind. Electron. 2017, 64, 2226–2235. [CrossRef]
24. D’Amato, R.; Amato, G.; Wang, C.; Ruggiero, A. A Novel Tracking Control Strategy with Adaptive Noise Cancellation for Flexible
Rotor Trajectories in Lubricated Bearings. IEEE ASME Trans. Mechatron. 2022, 27, 753–765. [CrossRef]
25. Lei, S.; Palazzolo, A. Control of flexible rotor systems with active magnetic bearings. J. Sound Vib. 2008, 314, 19–38. [CrossRef]
26. Marko, L.; Saxinger, M.; Steinboeck, A.; Kemmetmüller, W.; Kugi, A. Frequency-adaptive cancellation of harmonic disturbances
at non-measurable positions of steel strips. Mechatronics 2020, 71, 102423. [CrossRef]
27. Marino, R.; Tomei, P. Adaptive disturbance rejection for unknown stable linear systems. Trans. Inst. Meas. Control 2016, 38,
640–647. [CrossRef]
28. Kumar, P.; Tiwari, R. Dynamic analysis and identification of unbalance and misalignment in a rigid rotor with two offset discs
levitated by active magnetic bearings: A novel trial misalignment approach. Propuls. Power Res. 2020, 10, 58–82. [CrossRef]
29. D’Amato, R.; Amato, G.; Ruggiero, A. Adaptive Noise Cancellation-Based Tracking Control for a Flexible Rotor in Lubricated
Journal Bearings. In Proceedings of the 2019 23rd International Conference on Mechatronics Technology (ICMT), Salerno, Italy,
23–26 October 2019; pp. 1–5. [CrossRef]
30. Vance, J.M. Rotordynamics of Turbomachinery; Wiley: Hoboken, NJ, USA, 1988; ISBN 0471802581.
31. Avramov, K.V.; Borysiuk, O.V. Nonlinear dynamics of one disk asymmetrical rotor supported by two journal bearings. Nonlinear
Dyn. 2012, 67, 1201–1219. [CrossRef]
32. Carnevale, D.; Astolfi, A. A minimal dimension observer for global frequency estimation. Proc. Am. Control Conf. 2008, 2,
5236–5241. [CrossRef]
33. Carnevale, D. Robust hybrid estimation and rejection of multi-frequency signals. Int. J. Adapt. Control Signal Process. 2016, 30,
1649–1673. [CrossRef]
34. Hoad, K.; Robinson, S.; Davies, R. Automating warm-up length estimation. J. Oper. Res. Soc. 2010, 61, 1389–1403. [CrossRef]
35. Ballnus, B.; Schaper, S.; Theis, F.J.; Hasenauer, J. Bayesian parameter estimation for biochemical reaction networks using
region-based adaptive parallel tempering. Bioinformatics 2018, 34, i494–i501. [CrossRef]
36. Ypma, T.J. Local Convergence of Inexact Newton Methods. SIAM J. Numer. Anal. 2006, 21, 583–590. [CrossRef]
37. Sastry, S.; Bodson, M. Adaptive Control—Stability, Convergence, and Robustness; Prentice-Hall: Hoboken, NJ, USA, 1989; p. 201.
38. Marino, R.; Tomei, P. Output Regulation for Unknown Stable Linear Systems. IEEE Trans. Autom. Control 2015, 60, 2213–2218.
[CrossRef]
39. Sun, X.; Member, S.; Wu, M.; Yin, C.; Wang, S.; Tian, X. Multiple-Iteration Search Sensorless Control for Linear Motor in Vehicle
Regenerative Suspension. IEEE Trans. Transp. Electrif. 2021, 7, 1628–1637. [CrossRef]
40. Marino, R.; Tomei, P. Nonlinear Control Design: Geometric, Adaptive and Robust; Prentice Hall: London, UK, 1995; Volume 1.
41. Marino, R.; Tomei, P.; Verrelli, C.M. Induction Motor Control Design; Springer Science & Business Media: Berlin/Heidelberg,
Germany, 2010; ISBN 1849962847.
mathematics
Article
Complete Balancing of the Six-Bar Mechanism Using Fully
Cartesian Coordinates and Multiobjective Differential
Evolution Optimization
María T. Orvañanos-Guerrero 1 , Mario Acevedo 2 , Claudia N. Sánchez 1 , Daniel U. Campos-Delgado 3 , Amir
Aminzadeh Ghavifekr 4 , Paolo Visconti 1,5 and Ramiro Velázquez 1, *
Abstract: The high-speed operation of unbalanced machines may cause vibrations that lead to noise,
wear, and fatigue that will eventually limit their efficiency and operating life. To restrain such
vibrations, a complete balancing must be performed. This paper presents the complete balancing
optimization of a six-bar mechanism with the use of counterweights. A novel method based on fully
Cartesian coordinates (FCC) is proposed to represent such a balanced mechanism. A multiobjective
optimization problem was solved using the Differential Evolution (DE) algorithm to minimize the
shaking force (ShF) and the shaking moment (ShM) and thus balance the system. The Pareto front is
used to determine the best solutions according to three optimization criteria: only the ShF, only the
ShM, and both the ShF and ShM. The dimensions of the counterweights are further fine-tuned with
an analysis of their partial derivatives, volumes, and area–thickness relations. Numerical results show
that the ShF and ShM can be reduced by 76.82% and 77.21%, respectively, when importance is given
to either of them and by 45.69% and 46.81%, respectively, when equal importance is given to both. A
comparison of these results with others previously reported in the literature shows that the use of
FCC in conjunction with DE is a suitable methodology for the complete balancing of mechanisms.
Keywords: six-bar mechanism; dynamic balancing; fully Cartesian coordinates; multiobjective
optimization; differential evolution; Pareto front
MSC: 91B03; 70B15
Citation: Orvañanos-Guerrero, M.T.; Acevedo, M.; Sánchez, C.N.; Campos-Delgado, D.U.; Ghavifekr,
A.A.; Visconti, P.; Velázquez, R. Complete Balancing of the Six-Bar Mechanism Using Fully Cartesian
Coordinates and Multiobjective Differential Evolution Optimization. Mathematics 2022, 10, 1830.
https://doi.org/10.3390/math10111830
Academic Editors: Higinio Rubio Alonso, Alejandro Bustos Caballero, Jesus Meneses Alonso and Enrique
Soriano-Heras
Received: 23 April 2022
Accepted: 21 May 2022
Published: 26 May 2022
The force and the moment that occur at the mechanism’s fixed frame are typically
known as shaking force (ShF) and shaking moment (ShM), respectively. Thus, a traditional
but still open challenge in machine theory is how to effectively remove or minimize the
dynamic reactions, i.e., the ShF and ShM, derived from the mechanism’s motion. This
process is known as complete (or dynamic) balancing.
The dynamic balancing of mechanisms has been widely studied in mechanical
engineering. Some recent historical reviews addressing the topic can be found in [1,4–6].
Among the first publications, we can find the work of Fischer introducing in 1902 a
method called principal vectors [7]. This method allows for the balancing of the ShF by
analyzing each of the links of the mechanism and determining the points, called principal
points, in which the static balancing can be reached. The work of Fischer provided the
basis for the methods studying the motion of the centers of mass (CoM) of the links in a
mechanism. The method of principal vectors was subsequently used by Goryachkin [8],
Yudin [9], and Kreutzinger [10].
Another early method proposed for the dynamic balancing of mechanisms was the
static substitution of masses. Its aim is to statically replace the mass of the coupler by
concentrated masses, thus transforming the problem of mechanism balancing into a simpler
problem of balancing the rotating links. The works of Maxwell [11], Smith [12], and
Talbourdet [13] are based on this method.
The 1920s were marked by a special interest in the balancing of engines [14,15] and
machines related to agriculture [16]. The Lanchester balancer stands out among these
works. It is still being used for balancing four-stroke engines.
During the 1940s, some methods based on function approximation were proposed to
achieve partial equilibrium. The work of Gheronimus [17] is a representative example. In
this work, the balancing conditions were formulated by minimizing the root mean square
(RMS) or the maximum values of the ShM. The duplicate mechanism method [18] was also
proposed in this decade. It achieves a complete balancing with the addition of axial and
mirror symmetric mechanism duplicates.
The balancing methods based on harmonic analysis appeared in the 1960s with the
use of the crank-slider mechanism in internal combustion engines [19,20]. Such methods
reduce the ShM by balancing certain harmonics of both the ShF and ShM. To carry out
this process, the unbalanced forces and moments are divided into Fourier series and then
analyzed in parts.
In 1969, Berkof and Lowen [21] proposed a new solution for dynamic balancing using
a method called linearly independent vectors. This method consists of formulating an
equation that uses a vector representation to describe the position of the total CoM of the
mechanism, in conjunction with the equation representing the closed kinematic chain. A
system of equations containing linearly independent vectors is then obtained, from which
the balancing conditions of the mechanism are found by reducing the time-dependent
coefficients to zero. This method was subsequently explored in [22,23].
In the 1970s, the dynamic balancing theory achieved significant advances. Berkof [24]
formulated the first method for complete balancing using counterweights and counter-
inertias to eliminate the undesired forces and moments of the moving links, respectively. It
then became clear that a complete balancing of mechanisms is feasible at the expense of
considering complex design modifications and an inevitable increase of the total mass and
volume of the mechanism.
Partial balancing methods were then proposed, seeking to keep the
added mass reasonably small. In [25], Wiederrich and Roth formulated simple general
conditions for determining the inertia properties of a four-bar mechanism and achieved
its partial balancing. Dresig examined the partial balancing conditions for several 2D
mechanisms with six- and eight-bar linkages [26].
Later approaches involved the use of planetary [27] and auto-balanced systems [28].
More recent ones propose the use of the instantaneous dynamic balancing conditions [29],
the trajectory planning of the CoM of the mechanism [30], and the reformulation of the
balancing conditions following a Taylor-based approach [29]. Finally, the most recent and
novel balancing method encompasses the flexibility of the links of the mechanism [31].
We focus our attention on Berkof’s counterweight approach. After the partial balancing
methods, optimization techniques were explored in search of a tradeoff between the
added mass and the reduction of the ShF and ShM.
In 1998, Segla presented the static balancing optimization of a robotic mechanism [32].
The balancing conditions were first obtained and then a basic genetic algorithm
programmed in Fortran was used for the optimization.
Currently, the most popular optimization techniques for mechanical problems are
based on metaheuristic methods: evolutionary [33], differential evolution [34], genetic [35],
and Firefly [36] algorithms. These proposals were designed to find heuristics (i.e., partial
solutions) that may provide sufficiently good tradeoffs for the dynamic balancing problem.
To our knowledge, all previous works addressing optimization techniques for the
dynamic balancing of mechanisms make use of Cartesian coordinates (CC) to obtain
the expressions of the dynamic reactions. A major drawback of CC is that they involve
trigonometric functions that lead to complex mathematical expressions that are
computationally burdensome.
Our research approach differs from the existing literature in that it presents an
alternative to CC: fully Cartesian coordinates (FCC), which are also called natural coordinates [37].
By using FCC, the dynamic reactions at the fixed frame can be formulated by means of
equations of less complexity (no angular variables). Then, the resulting ShF and ShM
expressions are used to optimize the dynamic balancing of mechanisms through the use
of counterweights.
Our previous work [38,39] successfully reported the dynamic balancing of a four-bar
mechanism and its optimization with Projected Gradient Descent. In this paper, we address
the dynamic balancing of a more complex system, a six-bar mechanism, and explore the
differential evolution (DE) algorithm as our optimization method.
Parallel mechanisms are increasingly being used in robotic applications [40,41]. The
six-bar mechanism is a typical parallel manipulator. This single degree-of-freedom planar
linkage is typically used as a variable-speed transmission mechanism where the input crank
rotates at constant speed and the output link works as an overrunning clutch mounted on
the output shaft [42].
The rest of the paper is organized as follows: In Section 2, the FCC-based mass-matrix
definition for the six-bar mechanism with counterweights is introduced together with
the expressions of the ShF and ShM. In Section 3, the details of the multiobjective DE
optimization are presented. In Section 4, a numerical example is presented to illustrate the
proposed approach. In Section 5, results are discussed and compared to others previously
reported in the literature. Finally, Section 6 concludes, summarizing the main contributions
and giving the future work perspectives of this research.
2. Mechanical Analysis
2.1. FCC-Based Definition of the Mass Matrix of an Element Defined by Three Basic Points
This section details the method used to obtain the mass matrix M3P of elements
defined by three basic points using FCC and the concept of virtual work.
Consider an element defined by three basic points i, j, and k, as shown in Figure 1.
The element is located in a global coordinate system ( x, y) and in a local coordinate system
( x̄, ȳ) with its origin at point i and the x̄ axis directed toward point j.
The location of any point P in this element is defined by a vector r in the global
reference system and a vector r̄ in the local coordinate system. In this way, r can be
expressed according to Equation (1).
$$ r = r_i + A\bar{r} \tag{1} $$
where A is the rotation matrix. Since the element is rigid, the local position vector r̄
remains constant regardless of the element’s motion. Thus, the position of point P can be
defined according to Equation (2).
$$ r = r_i + c_1 (r_j - r_i) + c_2 (r_k - r_i) \tag{2} $$
where c1 and c2 are the components of vector r̄ in the local coordinate system. The
components of vector r can be expressed in matrix form as shown in Equation (3).
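$$ r = \begin{Bmatrix} x \\ y \end{Bmatrix} = \begin{bmatrix} 1 - c_1 - c_2 & 0 & c_1 & 0 & c_2 & 0 \\ 0 & 1 - c_1 - c_2 & 0 & c_1 & 0 & c_2 \end{bmatrix} \begin{Bmatrix} x_i \\ y_i \\ x_j \\ y_j \\ x_k \\ y_k \end{Bmatrix} = Cq \tag{3} $$
$$ \dot{r} = C\dot{q} \tag{4} $$
$$ \ddot{r} = C\ddot{q} \tag{5} $$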
Coefficients c1 and c2 in matrix C can be expressed in terms of the coordinates of
points i, j, and k in the local reference frame according to Equation (6).
Vector c contains the coefficients c1 and c2 , and matrix X̄ has the components of vectors
r¯j and r¯k in its columns (Equation (8)).
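$$ \bar{X} = [\,\bar{r}_j \mid \bar{r}_k\,] = \begin{bmatrix} \bar{x}_j & \bar{x}_k \\ \bar{y}_j & \bar{y}_k \end{bmatrix} = \begin{bmatrix} l_{ij} & K_x \\ 0 & K_y \end{bmatrix} \tag{8} $$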
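The interpolation of Equation (3) can be checked numerically. The sketch below assumes the relation $c = \bar{X}^{-1}\bar{r}$, which is the one used later in Equation (22), and verifies that Cq reproduces the position obtained from Equation (1). The element dimensions, its pose, and all function names are hypothetical values chosen only for illustration.

```python
import math


def local_coefficients(l_ij, k_x, k_y, x_bar, y_bar):
    """Solve X_bar c = r_bar for c = (c1, c2), with X_bar = [[l_ij, k_x], [0, k_y]]."""
    c2 = y_bar / k_y
    c1 = (x_bar - k_x * c2) / l_ij
    return c1, c2


def c_matrix(c1, c2):
    """2x6 interpolation matrix C of Equation (3)."""
    a = 1.0 - c1 - c2
    return [[a, 0.0, c1, 0.0, c2, 0.0],
            [0.0, a, 0.0, c1, 0.0, c2]]


def mat_vec(mat, vec):
    return [sum(m * v for m, v in zip(row, vec)) for row in mat]


# Hypothetical element: l_ij = 2, point k at local (0.5, 1.5), rotated by 30 degrees.
l_ij, k_x, k_y = 2.0, 0.5, 1.5
theta = math.radians(30.0)
cos_t, sin_t = math.cos(theta), math.sin(theta)
r_i = (1.0, -2.0)


def to_global(x_bar, y_bar):
    """Equation (1): r = r_i + A r_bar, with A a planar rotation matrix."""
    return (r_i[0] + cos_t * x_bar - sin_t * y_bar,
            r_i[1] + sin_t * x_bar + cos_t * y_bar)


r_j = to_global(l_ij, 0.0)
r_k = to_global(k_x, k_y)
q = [r_i[0], r_i[1], r_j[0], r_j[1], r_k[0], r_k[1]]

p_local = (0.8, 0.4)                     # arbitrary point P in local coordinates
c1, c2 = local_coefficients(l_ij, k_x, k_y, *p_local)
r_from_cq = mat_vec(c_matrix(c1, c2), q)  # position from Equation (3)
r_direct = to_global(*p_local)            # position from Equation (1)
```

Because c1 and c2 depend only on the local geometry, the same matrix C applies to every pose of the element, which is why Equations (4) and (5) follow by differentiating q alone.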
Now, it is possible to define the virtual work W ∗ generated by the inertial forces
(Equation (9)).
$$ W^* = -\rho \int_\Omega \dot{r}^{*T} \ddot{r} \, d\Omega \tag{9} $$
where ρ is the density of the element’s material. Substituting Equations (4) and (5) into
Equation (9) yields the virtual work in terms of the basic point coordinates (Equation (10)):
$$ W^* = -\rho \int_\Omega \dot{q}^{*T} C^T C \ddot{q} \, d\Omega \tag{10} $$
Since vectors q̇∗T and q̈ are independent of Ω, they can be taken out of the integral as
expressed by Equation (11).
$$ W^* = -\dot{q}^{*T} \left( \rho \int_\Omega C^T C \, d\Omega \right) \ddot{q} \tag{11} $$
On the other hand, taking into account the definition of virtual work proposed in [43]
(Equation (12)) and comparing it to Equation (11), the mass matrix M3P can be expressed
by Equation (13).
$$ W^* = -\dot{q}^{*T} M \ddot{q} \tag{12} $$
$$ M_{3P} = \rho \int_\Omega C^T C \, d\Omega \tag{13} $$
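A discrete analogue of Equation (13) helps to see what the mass matrix encodes. The sketch below replaces the integral with a sum over a few lumped masses (hypothetical data, as are all names) and checks that the kinetic energy $\frac{1}{2}\dot{q}^T M_{3P} \dot{q}$ equals the energy summed directly over the lumped masses for a rigid planar motion. The local frame is taken aligned with the global one to keep the example short.

```python
def coefficients(l_ij, k_x, k_y, x_bar, y_bar):
    """Solve X_bar c = r_bar with X_bar = [[l_ij, k_x], [0, k_y]] (Equation (8))."""
    c2 = y_bar / k_y
    c1 = (x_bar - k_x * c2) / l_ij
    return c1, c2


def mass_matrix_3p(l_ij, k_x, k_y, point_masses):
    """Discrete analogue of Equation (13): M = sum_p m_p * C_p^T C_p (6x6)."""
    mass = [[0.0] * 6 for _ in range(6)]
    for m_p, x_bar, y_bar in point_masses:
        c1, c2 = coefficients(l_ij, k_x, k_y, x_bar, y_bar)
        weights = (1.0 - c1 - c2, c1, c2)  # contributions of points i, j, k
        for a in range(3):
            for b in range(3):
                for axis in range(2):      # x and y share the same scalar weight
                    mass[2 * a + axis][2 * b + axis] += m_p * weights[a] * weights[b]
    return mass


# Hypothetical data: (mass, x_bar, y_bar) of each lumped mass in the local frame.
L_IJ, K_X, K_Y = 2.0, 0.5, 1.5
POINT_MASSES = [(1.0, 0.2, 0.1), (2.0, 1.0, 0.5), (0.5, 1.5, 1.0), (1.5, 0.4, 1.2)]
M3P = mass_matrix_3p(L_IJ, K_X, K_Y, POINT_MASSES)

# Rigid planar motion: v(x, y) = (vx - omega * y, vy + omega * x), measured from point i.
VX, VY, OMEGA = 0.3, -0.7, 2.0


def velocity(x_bar, y_bar):
    return (VX - OMEGA * y_bar, VY + OMEGA * x_bar)


q_dot = [*velocity(0.0, 0.0), *velocity(L_IJ, 0.0), *velocity(K_X, K_Y)]

kinetic_from_m = 0.5 * sum(q_dot[r] * M3P[r][c] * q_dot[c]
                           for r in range(6) for c in range(6))
kinetic_direct = 0.5 * sum(m_p * (velocity(x, y)[0] ** 2 + velocity(x, y)[1] ** 2)
                           for m_p, x, y in POINT_MASSES)
# The x-x blocks sum to the total mass because the three weights add up to one.
total_mass = sum(M3P[2 * a][2 * b] for a in range(3) for b in range(3))
```

The matrix is built from local quantities only, so it stays constant while the element moves; only q and its derivatives change with time.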
$$ c_h = c_1^2 \tag{18} $$
$$ c_i = c_1 c_2 \tag{19} $$
$$ c_j = c_2^2 \tag{20} $$
Note that Equation (14) involves solving the integrals of Equations (21)–(23).
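$$ \int_\Omega \rho \, d\Omega = m \tag{21} $$
$$ \int_\Omega \rho \, c \, d\Omega = \int_\Omega \rho \, \bar{X}^{-1} \bar{r} \, d\Omega = \int_\Omega \rho \begin{bmatrix} \frac{1}{l_{ij}} & -\frac{K_x}{K_y l_{ij}} \\ 0 & \frac{1}{K_y} \end{bmatrix} \begin{Bmatrix} \bar{x} \\ \bar{y} \end{Bmatrix} d\Omega = \int_\Omega \rho \begin{Bmatrix} \frac{\bar{x}}{l_{ij}} - \frac{\bar{y} K_x}{K_y l_{ij}} \\ \frac{\bar{y}}{K_y} \end{Bmatrix} d\Omega = \begin{Bmatrix} \frac{m \bar{x}_g}{l_{ij}} - \frac{m \bar{y}_g K_x}{K_y l_{ij}} \\ \frac{m \bar{y}_g}{K_y} \end{Bmatrix} \tag{22} $$
$$ \int_\Omega \rho \, c c^T \, d\Omega = \bar{X}^{-1} \left( \int_\Omega \rho \, \bar{r} \bar{r}^T \, d\Omega \right) \bar{X}^{-T} = \bar{X}^{-1} \left( \int_\Omega \rho \begin{bmatrix} \bar{x}^2 & \bar{x}\bar{y} \\ \bar{x}\bar{y} & \bar{y}^2 \end{bmatrix} d\Omega \right) \bar{X}^{-T} = \bar{X}^{-1} \begin{bmatrix} I_y & I_{xy} \\ I_{xy} & I_x \end{bmatrix} \bar{X}^{-T} = \begin{bmatrix} \frac{I_x K_x^2}{K_y^2 l_{ij}^2} - \frac{2 I_{xy} K_x}{K_y l_{ij}^2} + \frac{I_y}{l_{ij}^2} & -\frac{I_x K_x}{K_y^2 l_{ij}} + \frac{I_{xy}}{K_y l_{ij}} \\ -\frac{I_x K_x}{K_y^2 l_{ij}} + \frac{I_{xy}}{K_y l_{ij}} & \frac{I_x}{K_y^2} \end{bmatrix} \tag{23} $$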
where m is the total mass of the element, $\bar{r}_g = (\bar{x}_g, \bar{y}_g)$ represents the local coordinates of the center of
gravity, and Ix , Iy , and Ixy are the moments and products of inertia with respect to the local
coordinates with origin at the basic point i.
By substituting the integrals of Equations (21)–(23) into Equation (14), we finally obtain
the mass matrix M3P (Equation (24)).
$$ M_{3P} = \begin{bmatrix} e & 0 & f & 0 & g & 0 \\ 0 & e & 0 & f & 0 & g \\ f & 0 & h & 0 & i & 0 \\ 0 & f & 0 & h & 0 & i \\ g & 0 & i & 0 & j & 0 \\ 0 & g & 0 & i & 0 & j \end{bmatrix} \tag{24} $$
with:
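$$ g = \frac{I_x K_x}{K_y^2 l_{ij}} - \frac{I_x}{K_y^2} - \frac{I_{xy}}{K_y l_{ij}} + \frac{m \bar{y}_g}{K_y} \tag{27} $$
$$ h = \frac{I_x K_x^2}{K_y^2 l_{ij}^2} - \frac{2 I_{xy} K_x}{K_y l_{ij}^2} + \frac{I_y}{l_{ij}^2} \tag{28} $$
$$ i = -\frac{I_x K_x}{K_y^2 l_{ij}} + \frac{I_{xy}}{K_y l_{ij}} \tag{29} $$
$$ j = \frac{I_x}{K_y^2} \tag{30} $$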
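The closed-form terms of Equations (27)–(30) can be cross-checked against their integral definitions. The sketch below is only a consistency check under stated assumptions: the body is a hypothetical set of lumped masses, $I_x$ and $I_y$ are taken as the second moments $\sum m\,\bar{y}^2$ and $\sum m\,\bar{x}^2$ (the convention read off Equation (23)), and g is compared with the sum of $m\,(1 - c_1 - c_2)\,c_2$, which is how that entry arises from $C^T C$.

```python
def entries_g_h_i_j(m, y_g, i_x, i_y, i_xy, l_ij, k_x, k_y):
    """Terms g, h, i, j of the mass matrix, Equations (27)-(30)."""
    g = (i_x * k_x / (k_y ** 2 * l_ij) - i_x / k_y ** 2
         - i_xy / (k_y * l_ij) + m * y_g / k_y)
    h = (i_x * k_x ** 2 / (k_y ** 2 * l_ij ** 2)
         - 2.0 * i_xy * k_x / (k_y * l_ij ** 2) + i_y / l_ij ** 2)
    i = -i_x * k_x / (k_y ** 2 * l_ij) + i_xy / (k_y * l_ij)
    j = i_x / k_y ** 2
    return g, h, i, j


# Hypothetical lumped masses (mass, x_bar, y_bar) and element geometry.
POINTS = [(1.0, 0.2, 0.1), (2.0, 1.0, 0.5), (0.5, 1.5, 1.0), (1.5, 0.4, 1.2)]
L_IJ, K_X, K_Y = 2.0, 0.5, 1.5

mass = sum(m for m, _, _ in POINTS)
y_g = sum(m * y for m, _, y in POINTS) / mass
i_x = sum(m * y * y for m, _, y in POINTS)    # second moment about the local x axis
i_y = sum(m * x * x for m, x, _ in POINTS)    # second moment about the local y axis
i_xy = sum(m * x * y for m, x, y in POINTS)   # product of inertia

closed_form = entries_g_h_i_j(mass, y_g, i_x, i_y, i_xy, L_IJ, K_X, K_Y)

# Direct evaluation from the coefficients c = X_bar^{-1} r_bar of each lumped mass.
direct = [0.0, 0.0, 0.0, 0.0]
for m, x, y in POINTS:
    c2 = y / K_Y
    c1 = (x - K_X * c2) / L_IJ
    direct[0] += m * (1.0 - c1 - c2) * c2   # g
    direct[1] += m * c1 * c1                # h, cf. Equation (18)
    direct[2] += m * c1 * c2                # i, cf. Equation (19)
    direct[3] += m * c2 * c2                # j, cf. Equation (20)
```

Agreement between the two evaluations indicates that the four terms depend only on the mass, the center of gravity, and the inertia properties of the element, so no integration is needed once these are known.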
For this mechanism, it is possible to define a vector q representing the positions of its
basic points (Equation (31)).
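$$ q = [A_x \; A_y \; B_x \; B_y \; C_x \; C_y \; D_x \; D_y \; E_x \; E_y \; F_x \; F_y \; G_x \; G_y]^T \tag{31} $$
$$ \dot{q} = [V_{Ax} \; V_{Ay} \; V_{Bx} \; V_{By} \; V_{Cx} \; V_{Cy} \; V_{Dx} \; V_{Dy} \; V_{Ex} \; V_{Ey} \; V_{Fx} \; V_{Fy} \; V_{Gx} \; V_{Gy}]^T \tag{32} $$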
counterweight is located at the local coordinates (xcn , ycn ) for 1 ≤ n ≤ 5. Their thickness is
defined by tcn for 1 ≤ n ≤ 5.
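$$ I_{x_{cn}} = \frac{1}{4} m_{cn} (x_{cn}^2 + y_{cn}^2) + m_{cn} y_{cn}^2 = \frac{1}{4} m_{cn} (x_{cn}^2 + 5 y_{cn}^2) = \frac{1}{4} \rho \pi t_{cn} (x_{cn}^2 + y_{cn}^2)(x_{cn}^2 + 5 y_{cn}^2) \tag{35} $$
$$ I_{y_{cn}} = \frac{1}{4} m_{cn} (x_{cn}^2 + y_{cn}^2) + m_{cn} x_{cn}^2 = \frac{1}{4} m_{cn} (5 x_{cn}^2 + y_{cn}^2) = \frac{1}{4} \rho \pi t_{cn} (x_{cn}^2 + y_{cn}^2)(5 x_{cn}^2 + y_{cn}^2) \tag{36} $$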
The polar moment of inertia of each counterweight Izcn with respect to the local
coordinate system origin can be defined by Equation (37).
$$I_{zcn} = I_{xcn} + I_{ycn} = \frac{3}{2} m_{cn} (x_{cn}^2 + y_{cn}^2) = \frac{3}{2} \rho \pi t_{cn} (x_{cn}^2 + y_{cn}^2)^2 \tag{37}$$
Similarly, the product of inertia of each counterweight with respect to the local coordi-
nate system origin can be calculated with Equation (38).
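Equations (35)–(37) can be evaluated with a few lines of code. The sketch below assumes, as the equalities in Equation (35) imply, that each counterweight is a disc of thickness t centred at (xc, yc) whose radius equals the distance from its centre to the local origin, so that its mass is ρπt(xc² + yc²). The function name and the density value in the usage line are hypothetical.

```python
import math


def counterweight_inertia(rho, t, xc, yc):
    """Mass and moments of inertia of one disc counterweight about the local origin."""
    r2 = xc**2 + yc**2                       # squared radius = squared offset
    m = rho * math.pi * t * r2               # mass implied by Equation (35)
    Ix = 0.25 * m * (xc**2 + 5.0 * yc**2)    # Equation (35)
    Iy = 0.25 * m * (5.0 * xc**2 + yc**2)    # Equation (36)
    Iz = Ix + Iy                             # Equation (37)
    return m, Ix, Iy, Iz


# Hypothetical steel-like density in kg/m^3; geometry in metres.
m, Ix, Iy, Iz = counterweight_inertia(7800.0, 0.02, 0.03, -0.04)
```

The polar moment returned by the function equals 3/2 of the mass times the squared radius, in agreement with Equation (37).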
Given that the mechanism consists of seven basic points, each of them represented by
one (x, y) coordinate, the resulting mass matrix will consist of 14 columns and 14 rows.
Equation (39) shows the mass matrix M representing the six-bar mechanism with
counterweights.
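$$\mathbf{M} = \left[\begin{array}{cccccccccccccc}
a_2 & 0 & 0 & 0 & b_2 & c_2 & 0 & 0 & 0 & 0 & 0 & 0 & 0 & 0 \\
0 & a_2 & 0 & 0 & -c_2 & b_2 & 0 & 0 & 0 & 0 & 0 & 0 & 0 & 0 \\
0 & 0 & e_3 & 0 & 0 & 0 & 0 & 0 & g_3 & 0 & f_3 & 0 & 0 & 0 \\
0 & 0 & 0 & e_3 & 0 & 0 & 0 & 0 & 0 & g_3 & 0 & f_3 & 0 & 0 \\
b_2 & c_2 & 0 & 0 & d_2 + e_1 & 0 & g_1 & 0 & f_1 & 0 & 0 & 0 & 0 & 0 \\
-c_2 & b_2 & 0 & 0 & 0 & d_2 + e_1 & 0 & g_1 & 0 & f_1 & 0 & 0 & 0 & 0 \\
0 & 0 & 0 & 0 & g_1 & 0 & a_4 + j_1 & 0 & i_1 & 0 & 0 & 0 & b_4 & c_4 \\
0 & 0 & 0 & 0 & 0 & g_1 & 0 & a_4 + j_1 & 0 & i_1 & 0 & 0 & -c_4 & b_4 \\
0 & 0 & g_3 & 0 & f_1 & 0 & i_1 & 0 & h_1 + i_3 & 0 & i_3 & 0 & 0 & 0 \\
0 & 0 & 0 & g_3 & 0 & f_1 & 0 & i_1 & 0 & h_1 + i_3 & 0 & i_3 & 0 & 0 \\
0 & 0 & f_3 & 0 & 0 & 0 & 0 & 0 & i_3 & 0 & a_5 + h_3 & 0 & b_5 & c_5 \\
0 & 0 & 0 & f_3 & 0 & 0 & 0 & 0 & 0 & i_3 & 0 & a_5 + h_3 & -c_5 & b_5 \\
0 & 0 & 0 & 0 & 0 & 0 & b_4 & c_4 & 0 & 0 & b_5 & c_5 & d_4 + d_5 & 0 \\
0 & 0 & 0 & 0 & 0 & 0 & -c_4 & b_4 & 0 & 0 & -c_5 & b_5 & 0 & d_4 + d_5
\end{array}\right] \tag{39}$$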
where an , bn , . . . , jn are the terms of the different mass matrices for linkages n = 1, . . . , 5 (see
Appendix A).
where B (Equation (41)) is a matrix formed by identity matrices matching the number of
basic points in the mechanism.
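$$\mathbf{B} = \begin{bmatrix} 1 & 0 & 1 & 0 & 1 & 0 & 1 & 0 & 1 & 0 & 1 & 0 & 1 & 0 \\ 0 & 1 & 0 & 1 & 0 & 1 & 0 & 1 & 0 & 1 & 0 & 1 & 0 & 1 \end{bmatrix}^T \tag{41}$$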
By solving Equation (40) and considering that the velocity of the fixed points is always
zero (VAx = 0, VAy = 0, VBx = 0, VBy = 0), the expressions of the linear momentum (Li
and Lj) can be obtained.
The ShFi and ShFj of the mechanism can be computed by differentiating Li and Lj
(Equation (40)) with respect to time (the expressions are not included in the paper because
of their length). To guarantee the equilibrium, the result of these derivatives must be
constant (normally zero) over the analyzed period of time.
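The step from the mass matrix to the shaking force can be sketched numerically. The snippet below assumes that the linear momentum takes the usual natural-coordinates form L = Bᵀ M q̇, with B as in Equation (41); Equation (40) is not reproduced in this excerpt, so this form is an assumption. The time derivative is approximated by central differences over sampled values, which is an illustrative substitute for the closed-form expressions mentioned in the text. All function names are hypothetical.

```python
def linear_momentum(M, qdot):
    """Return (Li, Lj) = B^T M qdot for interleaved (x, y) coordinates."""
    n = len(qdot)
    Mq = [sum(M[r][c] * qdot[c] for c in range(n)) for r in range(n)]
    # B^T adds up the x-rows (even indices) and the y-rows (odd indices).
    return sum(Mq[0::2]), sum(Mq[1::2])


def central_difference(samples, dt):
    """Approximate the time derivative at the interior sample points."""
    return [(samples[k + 1] - samples[k - 1]) / (2.0 * dt)
            for k in range(1, len(samples) - 1)]


# Toy check with two lumped masses (1 kg and 2 kg), not the six-bar matrix.
M_toy = [[1.0, 0.0, 0.0, 0.0],
         [0.0, 1.0, 0.0, 0.0],
         [0.0, 0.0, 2.0, 0.0],
         [0.0, 0.0, 0.0, 2.0]]
Li, Lj = linear_momentum(M_toy, [1.0, 0.0, 0.0, 3.0])
```

For the toy matrix the result reduces to the sum of mass times velocity of each point, which is the expected linear momentum of two particles.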
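$$ShM = \frac{d\mathbf{H}}{dt} = \mathbf{r}\mathbf{M}\left(\frac{d(\dot{\mathbf{q}})}{dt}\right) + \left(\frac{d\mathbf{r}}{dt}\right)\mathbf{M}\dot{\mathbf{q}} \tag{44}$$
$$ShM = \frac{d\mathbf{H}}{dt} = \mathbf{r}\mathbf{M}\ddot{\mathbf{q}} + \dot{\mathbf{r}}\mathbf{M}\dot{\mathbf{q}} \tag{45}$$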
with:
To guarantee the dynamic equilibrium of the mechanism, the ShM must be constant,
i.e., the time-derivative of H (Equation (45)) must be zero.
The ShM of the mechanism is finally obtained by solving Equation (45) and considering
VAx = 0, VAy = 0, VBx = 0 and VBy = 0 (the equation is not included in the paper because of
its length).
3. Optimization
3.1. Objective Function
Two dimensionless balancing indexes β i , containing the motion parameters (q, q̇, and
q̈) of the six-bar mechanism, can be used to define the optimization’s objective function:
β ShF and β ShM .
β ShF (Equation (47)) is defined by the RMS value of the ShF reaction of the opti-
mized mechanism (rms(o ShF )) with respect to the RMS value of the original mechanism
(rms(ShF )), which are both considered over a time period T.
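$$\beta_{ShF} = \frac{\mathrm{rms}({}^{o}ShF)}{\mathrm{rms}(ShF)} = \frac{\sqrt{\dfrac{1}{N}\sum_{k=1}^{N}\left({}^{o}ShF_{ik}^{2} + {}^{o}ShF_{jk}^{2}\right)}}{\sqrt{\dfrac{1}{N}\sum_{k=1}^{N}\left(ShF_{ik}^{2} + ShF_{jk}^{2}\right)}} \tag{47}$$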
β ShM (Equation (48)) can be calculated in a similar way. Nevertheless, the reaction
produced by ShM must also be considered.
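$$\beta_{ShM} = \frac{\mathrm{rms}({}^{o}ShM)}{\mathrm{rms}(ShM)} = \sqrt{\frac{\sum_{k=1}^{N} {}^{o}ShM_{k}^{2}}{\sum_{k=1}^{N} ShM_{k}^{2}}} \tag{48}$$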
where o ShM is the shaking moment of the optimized mechanism and ShM is a constant
representing the shaking moment of the unbalanced mechanism.
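The two balancing indexes can be computed from sampled reactions as sketched below. The sketch assumes that both indexes are ratios of RMS values over the same N samples, as Equation (47) states for the ShF and as the phrase "in a similar way" suggests for the ShM. The function and argument names are hypothetical.

```python
import math


def rms(values):
    """Root mean square of a sequence of samples."""
    return math.sqrt(sum(v * v for v in values) / len(values))


def beta_shf(opt_fi, opt_fj, ref_fi, ref_fj):
    """Ratio of RMS shaking-force magnitudes, optimized over original."""
    opt = math.sqrt(sum(a * a + b * b for a, b in zip(opt_fi, opt_fj)) / len(opt_fi))
    ref = math.sqrt(sum(a * a + b * b for a, b in zip(ref_fi, ref_fj)) / len(ref_fi))
    return opt / ref


def beta_shm(opt_m, ref_m):
    """Ratio of RMS shaking moments (assumed RMS form), optimized over original."""
    return rms(opt_m) / rms(ref_m)
```

A value below 1 indicates that the optimized mechanism produces a smaller reaction than the original one; a value of 0.5, for example, corresponds to a 50% reduction.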
A multiobjective optimization problem emerges as it is desired to minimize both β ShF
and β ShM considering the variables boundaries (i.e., the physical limits for the locations of
the CoM (xcn and ycn ) and the thickness (tcn ) of each counterweight). To solve this problem,
a linear combination of the objectives is performed as proposed in Equation (49).
where γ is a scalar value that assigns the importance to each optimization objective. Thus,
the 15 variables to be optimized are: xc1 , yc1 , tc1 , xc2 , yc2 , tc2 , xc3 , yc3 , tc3 , xc4 , yc4 , tc4 , xc5 , yc5 ,
and tc5 . The boundaries for optimization are defined according to Equation (50).
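$$\begin{aligned} x_{cn}^{\min} &\le x_{cn} \le x_{cn}^{\max} \\ y_{cn}^{\min} &\le y_{cn} \le y_{cn}^{\max} \\ t_{cn}^{\min} &\le t_{cn} \le t_{cn}^{\max} \end{aligned} \tag{50}$$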
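A weighted-sum scalarization with box constraints can be sketched as follows. Equation (49) is not reproduced in this excerpt, so the convex combination below, and in particular the choice of letting γ weight β_ShM and (1 − γ) weight β_ShF, is an assumption made only for illustration. The helper names are hypothetical.

```python
def scalarized_objective(beta_shm, beta_shf, gamma):
    """Convex combination of the two indexes (ASSUMED form of the objective)."""
    if not 0.0 <= gamma <= 1.0:
        raise ValueError("gamma is expected to lie in [0, 1]")
    return gamma * beta_shm + (1.0 - gamma) * beta_shf


def within_bounds(x, lower, upper):
    """Check the box constraints of Equation (50) for every variable."""
    return all(lo <= v <= hi for v, lo, hi in zip(x, lower, upper))
```

Sweeping γ from 0 to 1 and solving one single-objective problem per value is one common way to trace a set of trade-off solutions from such a scalarization.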
3.2. Algorithm
Once the objective function has been proposed, an optimization method can be applied.
In this paper, we explore Differential Evolution (DE) [44].
Being an evolutionary algorithm, DE uses approaches inspired by the theory of evolution.
It optimizes a problem by maintaining a population of candidate solutions and creating
new candidate solutions from the ones that obtained the best scores. Thus, new generations
tend to improve on the previous ones. Recently, DE has been successfully applied to
disruptive fields such as the oil market [45] and genome studies [46].
The DE algorithm proposed for the complete balancing optimization of the six-bar
mechanism is presented in Algorithm 1. It was programmed in Python.
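For readers unfamiliar with DE, the following is a minimal, generic sketch of the classic DE/rand/1/bin scheme with box constraints. It is not the authors' Algorithm 1; the parameter names (`F`, `CR`, `pop_size`), their default values, and the clipping of trial vectors to the bounds are illustrative assumptions, and the sphere function stands in for the balancing objective.

```python
import random


def differential_evolution(func, bounds, pop_size=20, F=0.5, CR=0.9,
                           generations=200, seed=0):
    """Minimize func over a box using a basic DE/rand/1/bin loop."""
    rng = random.Random(seed)
    dim = len(bounds)
    pop = [[rng.uniform(lo, hi) for lo, hi in bounds] for _ in range(pop_size)]
    scores = [func(x) for x in pop]
    for _ in range(generations):
        new_pop, new_scores = [], []
        for k in range(pop_size):
            # Three distinct members, all different from the target k.
            a, b, c = rng.sample([p for p in range(pop_size) if p != k], 3)
            forced = rng.randrange(dim)  # at least one mutated component
            trial = []
            for d in range(dim):
                if d == forced or rng.random() < CR:
                    v = pop[a][d] + F * (pop[b][d] - pop[c][d])
                    lo, hi = bounds[d]
                    v = min(max(v, lo), hi)  # clip to the box
                else:
                    v = pop[k][d]
                trial.append(v)
            s = func(trial)
            # Greedy selection: keep the trial only if it is not worse.
            if s <= scores[k]:
                new_pop.append(trial)
                new_scores.append(s)
            else:
                new_pop.append(pop[k])
                new_scores.append(scores[k])
        pop, scores = new_pop, new_scores
    best = min(range(pop_size), key=lambda p: scores[p])
    return pop[best], scores[best]


def sphere(x):
    """Toy objective with its minimum at the origin."""
    return sum(v * v for v in x)


best_x, best_f = differential_evolution(sphere, [(-1.0, 1.0)] * 3, seed=1)
```

In the balancing problem, `func` would evaluate the scalarized objective for a vector of 15 counterweight variables and `bounds` would hold the limits of Equation (50).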
4. Results
This section presents the numerical results of the dynamic balancing of the six-bar
mechanism with FCC and DE optimization. To better visualize the influence of the
counterweights on the procedure, a Pareto front analysis is proposed. This tool allows us to
restrict our attention to the set of best solutions and eases the decision-making process.
Table 2. Parameters of the six-bar mechanism used in the example. Those indicated with a '-' are not
necessary for the numerical analysis.

Parameter                  Link 1        Link 2        Link 3        Link 4        Link 5
Mass m_bn [kg]             0.6935        0.1022        0.9636        0.1825        0.1679
Length l_n [m]             0.19          0.14          0.13416408    0.25          0.23
Inertia I_xbn [kg m2]      0.00116161    -             0.00622646    -             -
Inertia I_ybn [kg m2]      0.00556534    -             0.00657336    -             -
Inertia I_zbn [kg m2]      -             0.00066856    -             0.00380360    0.00296204
Inertia I_xybn [kg m2]     0.00167596    -             0.00522914    -             -
CoM x_bn [m]               0.08          0.07          0.07751702    0.125         0.115
CoM y_bn [m]               0.03333333    0.0           0.06559133    0.0           0.0
K_x [m]                    0.05          -             0.09838699    -             -
K_y [m]                    0.1           -             0.196677398   -             -
A motor located at point A is responsible for actuating the mechanism. Its rotating
speed has been fixed at 500 rpm. Using direct kinematics, it is possible to obtain a sample
of the positions, velocities, and accelerations at each of the basic points of the mechanism.
By replacing all known parameters in the equations of the balancing indexes
(Equations (47) and (48)), it is possible to define the objective function (Equation (49)).
According to the mechanical characteristics of this particular example, the boundaries
considered for the optimization are shown in Equation (51):
In Figure 4b, the dark points represent the Pareto front while the light ones represent
the dominated solutions.
Figure 4. Pareto analysis of the optimization objectives: β ShM and β ShF . (a) Optimized β ShM and
β ShF according to γ. (b) Pareto Front of the optimized objectives.
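Separating the Pareto front from the dominated solutions, as done in Figure 4b, can be sketched with a simple pairwise dominance test. In the snippet below, the first three points are the (β_ShM, β_ShF) pairs of the three solutions reported in this section; the fourth point is a hypothetical dominated solution added only to exercise the filter. The function names are illustrative.

```python
def pareto_front(points):
    """Return the non-dominated points when both coordinates are minimized."""
    front = []
    for p in points:
        dominated = any(
            q[0] <= p[0] and q[1] <= p[1] and (q[0] < p[0] or q[1] < p[1])
            for q in points
        )
        if not dominated:
            front.append(p)
    return front


def improvement_percent(beta):
    """Reduction with respect to the original mechanism, in percent."""
    return 100.0 * (1.0 - beta)


solutions = [
    (0.235917108, 0.932850297),  # priority on the ShM
    (0.924195224, 0.270900009),  # priority on the ShF
    (0.580111266, 0.558041831),  # equal importance
    (0.95, 0.95),                # hypothetical dominated point
]
front = pareto_front(solutions)
```

With these values the filter keeps the three reported solutions and discards the hypothetical one, and `improvement_percent` recovers the percentage reductions quoted for each index.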
Among all the solutions found in the Pareto front, it is possible to select the one that
is the most appropriate according to the desired type of balancing. To exemplify this
statement, let us consider three different solutions from the Pareto front.
In the first one, a greater importance is given to balancing the ShM (β ShM = 0.235917108,
β ShF = 0.932850297). In the second one, a greater importance is given to balancing the ShF
(β ShM = 0.924195224, β ShF = 0.270900009). Finally, in the third solution, the same importance
is given to balancing both the ShF and ShM (β ShM = 0.580111266, β ShF = 0.558041831). These
solutions are detailed below.
1. In the first solution, a greater importance is given to balancing the ShM. This is
achieved by choosing the minimum value of index β ShM (β ShM = 0.235917108), which
allows us to obtain an improvement of 76.4% without considering any improvement
of the ShF (β ShF = 0.932850297). The following variable values correspond to this
solution:
xc1 = −0.02611622 yc1 = −0.033186164 tc1 = 0.021370695
xc2 = −0.06 yc2 = −0.012314794 tc2 = 0.039459546
xc3 = −0.06 yc3 = 0.016752092 tc3 = 0.039814955
xc4 = −0.002045161 yc4 = −0.000151565 tc4 = 0.005
xc5 = 0.001614171 yc5 = 0.00639164 tc5 = 0.005106848
2. The second chosen solution in the Pareto front is the one with the minimum value
of β ShF (β ShF = 0.270900009), which achieves an improvement of 72.91% in balancing
the ShF. This choice assigns no importance to the balancing of the ShM (β ShM =
0.924195224). This solution yields the following variable values:
xc1 = −0.049437705 yc1 = −0.04279353 tc1 = 0.0074319
xc2 = −0.06 yc2 = −0.001485936 tc2 = 0.038953174
xc3 = −0.050813691 yc3 = −0.017724236 tc3 = 0.039660979
xc4 = 0.001669535 yc4 = 0.005996916 tc4 = 0.005
xc5 = 0.000387207 yc5 = 0.013090811 tc5 = 0.005
3. The third chosen solution is the one in the Pareto front where both indexes are
optimized (β ShM = 0.580111266, β ShF = 0.558041831). By using this solution, the ShM
is reduced by 41.99% and the ShF is reduced by 44.2%. It corresponds to the following
variable values:
Figure 6. Box plots of partial derivatives of the objective function with respect to each optimization
variable when using five counterweights.
On the other hand, the values of the partial derivatives of variables t2 and t3 tend to
be less than zero, which means that, if possible, it would be convenient to further extend
the boundaries of the optimization of these variables to values greater than 0.04 m.
Note that the information obtained from the Pareto front of the partial derivatives of
the objective function is extremely useful to make decisions concerning the boundaries of
the optimization, allowing us to foresee pertinent changes in the linkages as far as their
mechanical conditions allow it.
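The reasoning behind the boundary analysis can be illustrated with a small sketch: if a variable sits at its upper bound and the partial derivative of the objective with respect to it is negative, increasing that bound would allow the objective to decrease further. The code below approximates the partial derivatives by central differences and flags such variables. The toy objective, the tolerance and the function names are hypothetical; the central difference evaluates the objective slightly outside the bound, which is acceptable only when the objective is defined there.

```python
def partial_derivatives(func, x, h=1e-6):
    """Central-difference estimate of the gradient of func at x."""
    grads = []
    for k in range(len(x)):
        xp, xm = list(x), list(x)
        xp[k] += h
        xm[k] -= h
        grads.append((func(xp) - func(xm)) / (2.0 * h))
    return grads


def bounds_worth_extending(x, grads, lower, upper, tol=1e-9):
    """List (index, side) pairs where relaxing a bound could lower the objective."""
    suggestions = []
    for k, (xk, gk) in enumerate(zip(x, grads)):
        if abs(xk - upper[k]) <= tol and gk < 0.0:
            suggestions.append((k, "upper"))
        elif abs(xk - lower[k]) <= tol and gk > 0.0:
            suggestions.append((k, "lower"))
    return suggestions


def toy_objective(x):
    """Toy function whose minimum in x[0] lies beyond the upper bound 0.04."""
    return (x[0] - 0.05) ** 2 + (x[1] - 0.01) ** 2


x_opt = [0.04, 0.01]
g = partial_derivatives(toy_objective, x_opt)
flags = bounds_worth_extending(x_opt, g, [0.0, 0.0], [0.04, 0.04])
```

In this toy case only the first variable is flagged, mirroring the situation described for the thicknesses that remain at their upper limit.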
Figure 7a shows the histograms of the volumes of the counterweights obtained from
the different optimization results. Together with the analysis of the counterweights’ area
and thickness shown in Figure 7b, it is possible to conclude that counterweight 4 can be
eliminated: it has a very small volume compared to the other counterweights, and its
thickness and area are practically negligible.
In addition, for both counterweights 2 and 3, it can be appreciated that their thickness
tends to remain at the upper limit of the optimization, thus confirming the result of the
partial derivatives analysis: if the mechanical characteristics of the mechanism allow it, it
would be advisable to extend the upper limit of their thicknesses.
Figure 7. Dimensional analysis for the five counterweights balancing the six-bar mechanism. (a) Volumes
of the five counterweights. (b) Relationship between the area and the thickness for the five
counterweights.
4.3. Implementation
The optimization method and the dimensional analysis conclude with the design of a
solution that effectively reduces the dynamic reactions of the six-bar mechanism.
Let us consider the third case where both ShF and ShM are optimized. Figure 8
shows the proposed implementation. As suggested by the analysis, counterweight 4 has
been eliminated. Note that counterweight 5 is not visible because it exhibits very small
dimensions in relation to the mechanism. Yet, it has been considered for the dynamic
balancing of the mechanism.
Figure 8. Conceptual implementation of the third solution from the Pareto front for the six-bar
mechanism using five counterweights Ci .
Figure 9. Pareto front comparison of the optimization objectives β ShM and β ShF according to the
number of counterweights.
Table 3 summarizes the results obtained. The best results are highlighted in blue.
Again, the best choice depends on the desired balancing: only the ShF, only the ShM,
or both.
Table 3. Summary of the optimization results for the six-bar mechanism balancing.
5. Discussion
The implementation of well-designed counterweights allows us to reduce the ShF and
ShM by 76.82% and 77.21%, respectively, when importance is given to either of them and
by 45.69% and 46.81%, respectively, when equal importance is given to both of them.
Table 4 compares our balancing results with others previously reported in the literature.
Note that the comparison is limited to the cases where either the ShF or the ShM is
optimized, as not all the studies report the joint optimization. Table 4 includes other types
of mechanisms such as the crank-slider and the four-bar mechanism, which are indeed
simpler structures and thus easier to optimize. Six-bar mechanisms and their balancing are
more complex and less frequently found in the literature because, as explained in Section 2,
they exhibit a higher number of links with some defined by three basic points.
Table 4. Comparison of balancing results for several types of mechanisms and optimization methods.
Note that the use of FCC to define the mass matrix of a mechanism and thus obtain the
expressions representing the dynamic reactions in its base, in conjunction with an
optimization algorithm, is a suitable methodology for the complete balancing of mechanisms. Our
previous work on crank-slider [51] and four-bar [38] mechanisms confirms the efficiency of
the proposed approach. The results obtained surpass those of approaches using Cartesian
coordinates (CC).
The DE optimization method was successfully applied to solve the balancing problem of
a six-bar mechanism. DE is a simple yet robust approach to address multiobjective problems.
Comparison between Pareto fronts has proven to be a useful tool for better visualizing
the impact of each counterweight on the dynamic balancing of the mechanism.
The effectiveness of analyzing the boundaries by means of box plots of the partial
derivatives of the variables to be optimized, together with the histograms of volumes and
the relations between area and thickness, was also demonstrated. These tools ease the
visualization of possible improvements on the counterweights and allow us to make useful
decisions on their implementation.
6. Conclusions
This paper has presented a novel approach for the complete (or dynamic) balanc-
ing of mechanisms: the use of fully Cartesian coordinates (FCC) in conjunction with an
optimization method such as Differential Evolution (DE).
Among the main contributions of this paper is the development of the two-dimensional
mass matrix for elements defined by three basic points. To our knowledge, this matrix has
not been proposed so far in the literature and it can be applied to a vast number of more
complex mechanisms that use this type of linkage. By using FCC, this work has demonstrated
that even for the most complex mechanisms, it is possible to obtain relatively simple non-
trigonometric equations that define the ShF and ShM and to further optimize their dynamic
balancing with algorithms such as DE with very good results.
As future work, we expect to continue exploring FCC together with other
metaheuristic methods to optimize more complex mechanisms in two and three dimensions.
Our approach promises to be highly efficient for optimizing the balancing conditions of
even more complex mechanisms once their mass matrices have been defined.
Author Contributions: Conceptualization, M.A. and M.T.O.-G.; methodology, M.T.O.-G. and C.N.S.;
simulation, M.T.O.-G.; validation, D.U.C.-D., P.V. and R.V.; formal analysis, M.A., R.V. and A.A.G.;
investigation, M.T.O.-G., M.A. and R.V.; writing—original draft preparation, M.T.O.-G.; writing—
review and editing, R.V. and P.V.; visualization, D.U.C.-D. and A.A.G.; supervision, M.A. and R.V. All
authors have read and agreed to the published version of the manuscript.
Funding: This research received no external funding.
Institutional Review Board Statement: Not applicable.
Informed Consent Statement: Not applicable.
Conflicts of Interest: The authors declare no conflict of interest.
Abbreviations
The following abbreviations are used in this manuscript:
CC Cartesian Coordinates
FCC Fully Cartesian Coordinates
ShF Shaking Force
ShM Shaking Moment
CoM Center of Mass
DE Differential Evolution
considering the contribution of both the link and the counterweight. Thus, for each n
linkage, Equations (A1)–(A7) can be written.
Figure A1. Linkages defined by three basic points. (a) Linkage 1. (b) Linkage 3.
By using the mass matrix M3P (Equation (24)), the terms of the mass matrix for linkages
n = 1 and n = 3 can be defined according to Equations (A8)–(A13).
Ixbn K2xn 2I Kxn I 2Ixybn Kxn 2Ixybn Iybn πK2xn ρcn tcn xcn
4
en = 2 2
− xbn
2
+ xbn
2
− 2
+ + 2 + 2 2
Kyn ln Kyn ln Kyn Kyn ln Kyn ln ln 4Kyn ln
3πK2xn ρcn tcn xcn
2 5πK2xn ρcn tcn y4cn 2Kxn mbn ybn 2πKxn ρcn
+ 2 l2
y2cn + 2 l2
+ + 2
tcn xcn ycn
2Kyn n 4Kyn n Kyn ln Kyn ln
2πKxn ρcn 2πKxn ρcn 2πKxn ρcn πKxn ρcn tcn xcn
4
+ tcn y3cn − 2
3
tcn xcn ycn − 2
tcn xcn y3cn − 2
Kyn ln Kyn ln Kyn ln 2Kyn ln
3πKxn ρcn 5πKxn ρcn tcn y4cn 2m2bn xbn
− 2 2
tcn xcn ycn − −
2
Kyn ln 2
2Kyn ln ln mbn + πln ρcn tcn xcn
2 + πl ρ t y2
n cn cn cn
2πmbn ρcn tcn xcn y2cn 2πmbn ρcn tcn ybn y2cn
− −
ln mbn + πln ρcn tcn xcn + πln ρcn tcn ycn
2 2 Kyn mbn + πKyn ρcn tcn xcn
2 + πK ρ t y2
yn cn cn cn
2πmbn ρcn tcn y3cn 2π 2 ρ2cn t2cn xcn
5
− −
Kyn mbn + πKyn ρcn tcn xcn + πKyn ρcn tcn ycn
2 2 ln mbn + πln ρcn tcn xcn + πln ρcn tcn y2cn
2
Ixbn K2xn I Kxn 2Ixybn Kxn Ixybn Iybn πK2xn ρcn tcn xcn
4
fn = − 2 2
+ xbn2 + 2
− − 2 − 2 2
Kyn ln Kyn ln Kyn ln Kyn ln ln 4Kyn ln
3πK2xn ρcn tcn xcn
2 5πK2xn ρcn tcn y4cn Kxn mbn ybn πKxn ρcn tcn 2
− 2 l2
y2cn − 2 l2
− − xcn ycn
2Kyn n 4K yn n K yn l n Kyn ln
πKxn ρcn tcn 3 2πKxn ρcn 2πKxn ρcn πKxn ρcn tcn xcn
4
− ycn + 3
tcn xcn ycn + tcn xcn y3cn + (A9)
Kyn ln Kyn ln2 Kyn ln2 2 l
4Kyn n
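$$b_n = \frac{1}{l_n}\left[ m_{bn} x_{bn} + \pi \rho_{cn} t_{cn} x_{cn} \left( x_{cn}^2 + y_{cn}^2 \right) \right] - \frac{1}{l_n^2}\left[ I_{bn} + \pi \rho_{cn} t_{cn} x_{cn}^2 \left( x_{cn}^2 + y_{cn}^2 \right) + \pi \rho_{cn} t_{cn} y_{cn}^2 \left( x_{cn}^2 + y_{cn}^2 \right) + 0.5 \pi \rho_{cn} t_{cn} \left( x_{cn}^2 + y_{cn}^2 \right)^2 \right] \tag{A15}$$

$$c_n = \left[ -m_{bn} - \pi \rho_{cn} t_{cn} \left( x_{cn}^2 + y_{cn}^2 \right) \right] \frac{1}{l_n \left( m_{bn} + \pi \rho_{cn} t_{cn} \left( x_{cn}^2 + y_{cn}^2 \right) \right)} \left[ m_{bn} y_{bn} + \pi \rho_{cn} t_{cn} y_{cn} \left( x_{cn}^2 + y_{cn}^2 \right) \right] \tag{A16}$$

$$d_n = \frac{1}{l_n^2}\left[ I_{bn} + \pi \rho_{cn} t_{cn} x_{cn}^2 \left( x_{cn}^2 + y_{cn}^2 \right) + \pi \rho_{cn} t_{cn} y_{cn}^2 \left( x_{cn}^2 + y_{cn}^2 \right) + 0.5 \pi \rho_{cn} t_{cn} \left( x_{cn}^2 + y_{cn}^2 \right)^2 \right] \tag{A17}$$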
References
1. Arakelian, V.; Briot, S. Balancing of Linkages and Robot Manipulators. Mechanisms and Machine Science; Springer International
Publishing: Cham, Switzerland, 2015; Volume 27.
2. Uicker, J.J.; Pennock, G.R.; Shigley, J.E. Theory of Machines and Mechanisms, 5th ed.; Oxford University Press: New York, NY,
USA, 2016.
3. Waldron, K.J.; Kinzel, G.L. Kinematics, Dynamics, and Design of Machinery, 2nd ed.; John Wiley: Hoboken, NJ, USA, 2004.
4. Arakelian, V.; Dahan, M.; Smith, M.A. Historical review of the evolution of the theory on balancing of mechanisms. In Symposium
on History of Machines and Mechanisms Proceedings HMM 2000; Ceccarelli, M., Ed.; Springer: Dordrecht, The Netherlands, 2000;
pp. 291–300.
5. Arakelian, V. Inertia forces and moments balancing in robot manipulators: A review. Adv. Robot. 2017, 31, 717–726. [CrossRef]
6. Wei, B.; Zhang, D. A review of dynamic balancing for robotic mechanisms. Robotica 2021, 39, 55–71. [CrossRef]
7. Fisher, O. Über die reduzierten Systeme und die Hauptpunkte der Glieder eines Gelenkmechanismus und ihre Bedeutung für die
technische Mechanik. Z. Für Angew. Math. Und Phys. 1902, 47, 429–466.
8. Goryachkin, V.P. The forces of inertia and their balancing. In Collection of Scientific Works; Kolos: Moscow, Russia, 1914; pp. 283–418.
9. Yudin, V. The Balancing of Machines and Their Stability; Edition of Academy of Red Army: Moscow, Russia, 1941; 124p.
10. Kreutzinger, R. Über die bewegung des Schwerpunktes beim Kurbelgetriebe. Getriebetechnik 1942, 10, 397–398.
11. Maxwell, R.L. Kinematics and Dynamics of Machinery, 1st ed.; Prentice Hall: Englewood Cliff, NJ, USA, 1960.
12. Smith, M.R.; Maundert, L. Inertia forces in a four-bar linkage. J. Mech. Eng. Sci. 1967, 9, 218–225. [CrossRef]
13. Talbourdet, G.L.; Shepler, P.R. Mathematical solution of 4-bar linkages-IV. Balancing of linkages. Mach. Des. 1941, 13, 73–77.
14. Lanchester, F.W. Engine Balancing. Inst. Automob. Eng. 1914, 8, 195–271. [CrossRef]
15. Root, R.E. Dynamics of Engine and Shaft; John Wiley: New York, NY, USA, 1932.
16. Artobolevsky, I.I.; Edelshtein, B.V. Methods of Inertia Calculation for Mechanisms of Agricultural Machines; Selkhozizdate: Moscow,
Russia, 1935.
17. Gheronimus, Y.L. An approximate method of calculating a counterweight for the balancing of vertical inertia forces. Mechanisms
1968, 3, 283–288. [CrossRef]
18. Doucet, E. Équilibrage dynamique des moteurs en ligne. Tech. Automob. Arienne 1946, 37, 30–31.
19. Emod, I.; Jurek, A. Massenausgleich am Kurbelgetriebe von Sechszylinder-viertakt-V-motoren mit 6 Kurbeln und 60
Zylinderwinkeln. Period. Polytech. Mech. Eng. 1967, 3–4, 205–211.
20. Semenov, M.V. The synthesis of partly balanced plane mechanisms. Mechanisms 1968, 3, 339–353. [CrossRef]
21. Berkof, R.S.; Lowen, G.G. A new method for complete force balancing simple linkages. J. Eng. Ind. 1969, 91B, 21–26. [CrossRef]
22. Smith, M.R. Dynamic analysis and balancing of linkages with interactive computer graphics. Comput. Aided Des. 1975, 7, 15–19.
[CrossRef]
23. Tepper, F.R.; Lowen, G.G. General theorems concerning full force balancing of planar linkages by internal mass redistribution. J.
Eng. Ind. 1972, 94, 789–796. [CrossRef]
24. Berkof, R.S. Complete force and moment balancing of inline four-bar linkages. Mech. Mach. Theory 1973, 8, 397–410. [CrossRef]
25. Wiederrich, J.L.; Roth, B. Momentum balancing of four-bar linkages. J. Manuf. Sci. Eng. 1976, 4, 1289–1295. [CrossRef]
26. Dresig, H.; Jacobi, P. Vollständiger trägheitskraftausgleich von ebenen koppelgetrieben durch anbringen eines zweischlages.
Maschinenbautechnik 1974, 23, 5–8.
27. Feng, G. Complete Shaking Force and Shaking Moment balancing of four types of six-bar linkages. Mech. Mach. Theory 1989, 24,
275–287. [CrossRef]
28. Kochev, I.S. Active balancing of the frame Shaking Moment in high speed planar machines. Mech. Mach. Theory 1992, 27, 53–58.
[CrossRef]
29. de Jong, J.J.; van Dijk, J.; Herder, J.L. A screw based methodology for instantaneous dynamic balance. Mech. Mach. Theory 2019,
141, 267–282. [CrossRef]
30. Acevedo, M.; Orvañanos-Guerrero, M.T.; Velázquez, R.; Arakelian, V. An alternative method for Shaking Force balancing of the
3RRR PPM through acceleration control of the center of mass. Appl. Sci. 2020, 10, 1351. [CrossRef]
31. Meijaard, J.P.; van der Wijk, V. Dynamic balancing of mechanisms with flexible links. Mech. Mach. Theory 2022, 172, 104784.
[CrossRef]
32. Segla, S.; Kalker-Kalman, C.M.; Schwab, A.L. Statical balancing of a robot mechanism with the aid of a genetic algorithm. Mech.
Mach. Theory 1998, 33, 163–174. [CrossRef]
33. Farmani, M.; Jaamialahmadi, A.; Babaie, M. Multiobjective optimization for force and moment balance of a four-bar linkage using
evolutionary algorithms. J. Mech. Sci. Technol. 2011, 25, 2971–2977. [CrossRef]
34. Zamuda, A.; Brest, J.; Boskovic, B.; Zumer, V. Differential evolution for multiobjective optimization with self adaptation. In
Proceedings of the 2007 IEEE Congress on Evolutionary Computation, Singapore, 25–28 September 2007.
35. Erkaya, S. Investigation of balancing problem for a planar mechanism using genetic algorithm. J. Mech. Sci. Technol. 2013, 27,
2153–2160. [CrossRef]
36. Bošković, M.; Šalinić, S.; Bulatović, R.; Miodragović, G. Multiobjective optimization for dynamic balancing of four-bar mechanism.
In Proceedings of the 6th International Congress of Serbian Society of Mechanics, Mountain Tara, Serbia, 19–21 June 2017.
37. García de Jalón, J. Twenty-five years of natural coordinates. Multibody Syst. Dyn. 2007, 18, 15–33. [CrossRef]
38. Orvañanos-Guerrero, M.T.; Sánchez, C.N.; Rivera, M.; Acevedo, M.; Velázquez, R. Gradient descent-based optimization method
of a four-bar mechanism using Fully Cartesian coordinates. Appl. Sci. 2019, 9, 4115. [CrossRef]
39. Acevedo, M.; Orvañanos-Guerrero, M.T.; Velázquez, R.; Haro, E. Optimum balancing of the four-bar linkage using Fully Cartesian
coordinates. IEEE Lat. Am. Trans. 2019, 17, 983–990. [CrossRef]
40. Bourbonnais, F.; Bigras, P.; Bonev, I.A. Minimum-time trajectory planning and control of a pick-and-place five-bar parallel robot.
IEEE/ASME Trans. Mechatron. 2015, 20, 740–749. [CrossRef]
41. Wang, D.; Wang, L.; Wu, J.; Ye, H. An experimental study on the dynamics calibration of a 3-DOF parallel tool head. IEEE/ASME
Trans. Mechatron. 2019, 24, 2931–2941.
42. Pennock, G.R.; Israr, A. Kinematic analysis and synthesis of an adjustable six-bar linkage. Mech. Mach. Theory 2009, 44, 306–323.
[CrossRef]
43. García de Jalón, J.; Bayo, E. Kinematic and Dynamic Simulation of Multibody Systems: The Real-Time Challenge; Springer: New York,
NY, USA, 1994.
44. Storn, R.; Price, K. Differential evolution-a simple and efficient heuristic for global optimization over continuous spaces. J. Glob.
Optim. 1997, 11, 341–359. [CrossRef]
45. Das, A.K.; Mishra, D.; Das, K.; Mallick, P.K.; Kumar, S.; Zymbler, M.; El-Sayed, H. Prophesying the short-term dynamics of
the crude oil future price by adopting the survival of the fittest principle of improved grey optimization and extreme learning
machine. Mathematics 2022, 10, 1121. [CrossRef]
46. Álvarez Gutiérrez, D.; Sánchez Lasheras, F.; Martín Sánchez, V.; Suárez Gómez, S.L.; Moreno, V.; Moratalla-Navarro, F.; Molina de
la Torre, A.J. A new algorithm for multivariate genome wide association studies based on differential evolution and extreme
learning machines. Mathematics 2022, 10, 1024. [CrossRef]
47. Bergstra, J.; Bengio, Y. Random search for hyper-parameter optimization. J. Mach. Learn. Res. 2012, 13, 281–305.
48. Etesami, G.; Felezi, M.E.; Nariman-Zadeh, N. Pareto optimal multi-objective dynamical balancing of a slider-crank mechanism
using differential evolution algorithm. Int. J. Automot. Eng. 2019, 9, 3021–3032.
49. Chaudhary, K.; Chaudhary, H. Optimal design of planar slider-crank mechanism using teaching-learning-based optimization
algorithm. J. Mech. Sci. Technol. 2015, 29, 5189–5198. [CrossRef]
50. Chaudhary, K.; Chaudhary, H. Optimum balancing of slider-crank mechanism using equimomental system of point-masses.
Procedia Technol. 2014, 14, 35–42. [CrossRef]
51. Orvañanos-Guerrero, M.T.; Acevedo, M.; Sánchez, C.N.; Giannoccaro, N.I.; Visconti, P.; Velázquez, R. Efficient balancing
optimization of a simplified slider-crank mechanism. In Proceedings of the 2020 IEEE ANDESCON, Quito, Ecuador, 13–16
October 2020.
52. Chaudhary, K.; Chaudhary, H. Shape optimization of dynamically balanced planar four-bar mechanism. Procedia Comput. Sci.
2015, 57, 519–526. [CrossRef]
53. Belleri, B.K.; Kerur, S.B. Balancing of planar six-bar mechanism with genetic algorithm. J. Mech. Energy Eng. 2020, 4, 303–308.
[CrossRef]
54. Hernández, E.; Velázquez, R.; Macías-Quijas, R.; Pissaloux, E.; Giannoccaro, N.I.; Lay-Ekuakille, A. Kinematic computations for
small-size humanoid robot KUBO. ARPN J. Eng. Appl. Sci. 2017, 12, 7311–7320.
55. Xu, K.; Liu, H.; Zhu, X.; Song, Y. Kinematic analysis of a novel planar six-bar bionic leg. Mech. Mach. Sci. 2019, 73, 13–21.
56. Velázquez, R.; Garzón-Castro, C.L.; Acevedo, M.; Orvañanos-Guerrero, M.T.; Ghavifekr, A.A. Design and characterization of a
miniature bio-inspired mobile robot. In Proceedings of the 2021 12th International Symposium on Advanced Topics in Electrical
Engineering, Bucharest, Romania, 25–27 March 2021.
57. Shao, Y.; Xiang, Z.; Liu, H.; Li, L. Conceptual design and dimensional synthesis of cam-linkage mechanisms for gait rehabilitation.
Mech. Mach. Theory 2016, 104, 31–42. [CrossRef]
mathematics
Article
Singularities of Serial Robots: Identification and Distance
Computation Using Geometric Algebra
Isiah Zaplana 1, *, Hugo Hadfield 2 and Joan Lasenby 2
Abstract: The singularities of serial robotic manipulators are those configurations in which the robot
loses the ability to move in at least one direction. Hence, their identification is fundamental to enhance
the performance of current control and motion planning strategies. While classical approaches entail
the computation of the determinant of either a 6 × n or n × n matrix for an n-degrees-of-freedom
serial robot, this work addresses a novel singularity identification method based on modelling the
twists defined by the joint axes of the robot as vectors of the six-dimensional and three-dimensional
geometric algebras. In particular, it consists of identifying which configurations cause the exterior
product of these twists to vanish. In addition, since rotors represent rotations in geometric algebra,
once these singularities have been identified, a distance function is defined in the configuration
space C , such that its restriction to the set of singular configurations S allows us to compute the
distance of any configuration to a given singularity. This distance function is used to enhance how
the singularities are handled in three different scenarios, namely, motion planning, motion control
and bilateral teleoperation.
Keywords: serial robotic manipulators; singularity identification; geometric algebra; rotor group;
distance to a singularity
MSC: 15A66; 15A75; 70B15; 70Q05
Citation: Zaplana, I.; Hadfield, H.; Lasenby, J. Singularities of Serial Robots: Identification and Distance
Computation Using Geometric Algebra. Mathematics 2022, 10, 2068. https://doi.org/10.3390/math10122068
with
$$T_n^0 = \begin{bmatrix} R & p \\ 0 & 1 \end{bmatrix}, \tag{2}$$
where R is a rotation matrix that describes the end-effector orientation with respect to the
world frame, while p is a position vector describing the end-effector position with respect
to the world frame. This description is equivalent to the one provided by f , known as
the kinematic function of the serial robot. Thus, f (q) = x, where x denotes the vector
describing the end-effector pose and q = (q1 , . . . , qn ) denotes the vector whose components
are the joint variables, also known as the configuration of the robot. Clearly, either qi = θi if
joint i is revolute or qi = di if joint i is prismatic.
Figure 1. A frame {o, x, y, z} is attached to each joint of the serial robot to describe its relative position
and orientation.
Differentiating the kinematic relation defined by f with respect to time, we obtain another
relation:
$$\dot{x} = J(q)\,\dot{q}, \tag{3}$$
where ẋ denotes the end-effector velocity vector, q̇ the vector of the joint velocities and J
the Jacobian matrix of f. If J = [J1 · · · Jn], then each column Ji can also be computed as:
$$J_i = \begin{cases} \begin{bmatrix} z_i \times (o_n - o_i) \\ z_i \end{bmatrix} & \text{if joint } i \text{ is revolute}, \\[2ex] \begin{bmatrix} z_i \\ 0 \end{bmatrix} & \text{if joint } i \text{ is prismatic}. \end{cases} \tag{4}$$
Definition 1. Given a serial robot with n DoF, a singularity or kinematic singularity is a configu-
ration q ∈ C satisfying ρ( J (q)) < min{n, 6}, where ρ(·) denotes the rank of the matrix argument.
The set of all singular configurations is a subset of C that is usually denoted by S and known as the
singular set.
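Equation (4) and Definition 1 can be illustrated on a small example. The sketch below builds the Jacobian columns of a planar arm with three revolute joints (a hypothetical 3R robot with all joint axes along z, chosen here only for illustration) and evaluates the determinant of the three rows that are not identically zero (v_x, v_y, ω_z). For this arm n = 3, so the configuration is singular exactly when that determinant vanishes. The link lengths and function names are assumptions.

```python
import math


def cross(a, b):
    """Cross product of two 3-vectors given as lists."""
    return [a[1] * b[2] - a[2] * b[1],
            a[2] * b[0] - a[0] * b[2],
            a[0] * b[1] - a[1] * b[0]]


def jacobian_column(z_i, o_i, o_n, revolute=True):
    """Column J_i of Equation (4): linear part first, angular part second."""
    if revolute:
        lever = [o_n[k] - o_i[k] for k in range(3)]
        return cross(z_i, lever) + list(z_i)
    return list(z_i) + [0.0, 0.0, 0.0]


def planar_3r_jacobian(q, lengths=(1.0, 1.0, 0.5)):
    """6 x 3 Jacobian of a planar 3R arm whose joint axes point along z."""
    z = [0.0, 0.0, 1.0]
    origins = [[0.0, 0.0, 0.0]]
    angle = 0.0
    for theta, length in zip(q, lengths):
        angle += theta
        last = origins[-1]
        origins.append([last[0] + length * math.cos(angle),
                        last[1] + length * math.sin(angle), 0.0])
    o_n = origins[-1]  # end-effector origin
    cols = [jacobian_column(z, origins[i], o_n) for i in range(3)]
    return [[cols[c][r] for c in range(3)] for r in range(6)]


def det3(A):
    """Determinant of a 3 x 3 matrix."""
    return (A[0][0] * (A[1][1] * A[2][2] - A[1][2] * A[2][1])
            - A[0][1] * (A[1][0] * A[2][2] - A[1][2] * A[2][0])
            + A[0][2] * (A[1][0] * A[2][1] - A[1][1] * A[2][0]))


def planar_determinant(q):
    """Determinant of the (v_x, v_y, w_z) rows, the only nonzero rows here."""
    J = planar_3r_jacobian(q)
    return det3([J[0], J[1], J[5]])
```

For this particular arm the determinant reduces to l1 l2 sin(θ2), so the stretched and folded configurations of the first two links (θ2 = 0 or π) are the singular ones.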
Using the relation (3), it is easy to see that if q ∈ C is a singularity of a given serial
robot, then the following two statements hold:
• The robot loses at least one degree of freedom or, equivalently, its end-effector cannot
be translated along or rotated around at least one Cartesian direction.
• Finite linear and angular velocities of the end-effector may require infinite joint velocities.
In addition, Gottlieb [1] and Hollerbach [2] have independently proven that any serial
manipulator with n > 2 DoF has singularities. The identification of such singularities is
achieved by solving the following non-linear equation:
if it is redundant.
In general, if the serial robot possesses at least one revolute joint, several coefficients
of the Jacobian matrix are non-linear expressions, and thus, neither Equation (5) nor
Equation (6) are easy to formulate and solve. However, for manipulators with a spherical
wrist, a simplification can be made. For these robots, the axes of their last three joints
intersect at a common point, known as the wrist center point, or are parallel (the intersection
point and, hence, the wrist center point, is the point at the infinity). Since the origin of
the frame attached to the end-effector can be placed at the wrist center point, a zero block
appears in J (q) by definition (see Equation (4)). Hence:
$$J(q) = \begin{bmatrix} J_{11}(q) & 0 \\ J_{21}(q) & J_{22}(q) \end{bmatrix}, \tag{7}$$
where J11 (q), J21 (q) are blocks of order 3 × (n − 3), J22 (q) is a block of order 3 × 3 and 0
denotes a block of order 3 × 3, whose entries are all zero. Now, Equation (5) is simplified to:
$$\det(J(q)) = \det(J_{11}(q))\,\det(J_{22}(q)) = 0, \tag{8}$$
from which the singularities can be obtained as solutions of either det( J11 (q)) = 0 or
det( J22 (q)) = 0. These two equations allow us to decouple the singularities into position
and orientation singularities as follows:
• Position singularities PS = {q ∈ C : det( J11 (q)) = 0};
• Orientation singularities OS = {q ∈ C : det( J22 (q)) = 0}.
Similarly, we can make the same decoupling for redundant robots:
• Position singularities PS = {q ∈ C : rank( J11 (q)) < 3};
• Orientation singularities OS = {q ∈ C : det( J22 (q)) = 0}.
Remark 1. The Jacobian matrix J (q) is represented with respect to the world frame (usually located
at the base of the robot). However, sometimes it is useful to represent J (q) in a different frame B . To
do so, the following identity is used:
where:
$$B = \begin{bmatrix} R_0^{B} & 0 \\ 0 & R_0^{B} \end{bmatrix}, \tag{10}$$
with $R_0^{B} = (R_B^{0})^{T}$ and where $R_B^{0}$ denotes the rotation matrix that relates the orientation of B with
respect to the orientation of the world frame.
where 0 ≤ σ1 ≤ · · · ≤ σn are the singular values of J (q). Alternatively, we can also define
the condition number of J (q), wc = σn /σ1 . Clearly, the former is close to zero when the
configuration is close to a singularity, while the value of the latter increases as the robot
approaches a singular configuration. Although there are several approaches based on
the use of such indices [27,28], none of them define a distance function, and as stated in [29],
they do not provide a realistic measure of how close a singularity is, only whether it is
close or not. On the other hand, Yao et al. [30] propose a different index of closeness to
singularities for planar parallel robots based on the volume of the workspace. Although
interesting, it is still not a distance function, and it cannot be easily applied to serial
robots. Similarly, Nawratil [31] defines a distance function for parallel manipulators of the
Stewart–Gough type. However, it measures how close a given pose of the end-effector is
to a singular pose (i.e., the pose associated with a singular configuration). Hence, such a
distance function is not defined in the configuration space C but in the operational space
X. Finally, Bu [32] defines an angle between the velocity vector associated with one of the
joints and the manifold generated by the others. Again, such an angle acts as a measure of
closeness but not as a distance function, and thus, it does not provide a realistic measure of
how close a singularity is.
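To illustrate how such indices behave, the sketch below evaluates the condition number wc = σn/σ1 and, as an assumption about the first index (whose defining equation is not reproduced here), the product of the singular values. The planar two-link arm with unit link lengths is a hypothetical example, not a robot from the paper.

```python
import math


def jacobian_2r(q1, q2, l1=1.0, l2=1.0):
    """Position Jacobian of a planar arm with two revolute joints."""
    s1, c1 = math.sin(q1), math.cos(q1)
    s12, c12 = math.sin(q1 + q2), math.cos(q1 + q2)
    return ((-l1 * s1 - l2 * s12, -l2 * s12),
            (l1 * c1 + l2 * c12, l2 * c12))


def singular_values_2x2(J):
    """Singular values (smallest, largest) from the eigenvalues of J^T J."""
    (a, b), (c, d) = J
    trace = a * a + b * b + c * c + d * d      # trace of J^T J
    det_sq = (a * d - b * c) ** 2              # determinant of J^T J
    disc = math.sqrt(max(trace * trace - 4.0 * det_sq, 0.0))
    s_min = math.sqrt(max((trace - disc) / 2.0, 0.0))
    s_max = math.sqrt((trace + disc) / 2.0)
    return s_min, s_max


def indices(q1, q2):
    """Return (product of singular values, condition number)."""
    s_min, s_max = singular_values_2x2(jacobian_2r(q1, q2))
    w = s_min * s_max
    wc = s_max / s_min if s_min > 0.0 else math.inf
    return w, wc


w_far, wc_far = indices(0.3, math.pi / 2)   # elbow bent: far from singular
w_sing, wc_sing = indices(0.3, 0.0)         # arm stretched: singular
```

Both numbers flag the stretched arm, but neither says how far a configuration is from it in the configuration space, which is the gap the distance function addresses.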
The rest of the paper is organized as follows: Section 2 presents an overview of
geometric algebra that will be useful for understanding the proposed contribution. In
Section 3, the novel singularity identification approach and the simplification for serial
robots with a spherical wrist are fully developed, while the novel distance function is
constructed in Section 4. The application of these results to the Kuka LWR 4+, a redundant
serial robot with a spherical wrist, is given in Section 5. Section 6 lists three different
applications where both the singularity identification and the novel distance function can
be applied in order to illustrate their utility. Finally, the conclusions are given in Section 7.
The new element defined by the exterior product is called a bivector, and it is defined
to have grade two. By extension, the outer product of a bivector with a vector is known as
a trivector, is denoted by x1 ∧ x2 ∧ x3 and is defined to have grade three. Trivectors can be
seen as the oriented volume obtained by sweeping the bivector x1 ∧ x2 along x3 .
x1 ∧ x2 ∧ · · · ∧ x k (12)
denotes a k-blade, i.e., an element of grade k. Linear combinations of k-blades are known as
k-vectors, while linear combinations of k-vectors (for different k) are known as multivectors.
In his work [33], Clifford extends the exterior product by adding a scalar product
between vectors, the inner product. He defines the geometric product (also known as the
Clifford product) as follows:
x1 x2 = x1 · x2 + x1 ∧ x2 ( x1 , x2 ∈ Rn ). (13)
Thus, the geometric product between two vectors has two components: the scalar compo-
nent given by the inner product and the bivector component given by the exterior product.
Clearly, it also inherits the associativity and bilinearity of the exterior product.
When applied to an orthonormal basis $B = \{e_1, \dots, e_n\}$ of $\mathbb{R}^n$, the geometric product
acts as follows:
$$e_i e_j = \begin{cases} 1 & \text{for } i = j, \\ e_i \wedge e_j & \text{for } i \neq j. \end{cases} \tag{14}$$
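A compact way to experiment with Equations (13) and (14) is to store a multivector as a dictionary from basis blades to coefficients. The sketch below assumes a Euclidean orthonormal basis and encodes each blade as a bitmask (bit i set means that e_{i+1} is a factor); this encoding and the function names are illustrative choices, not part of the paper.

```python
def blade_sign(a, b):
    """Sign obtained when the product of blades a and b is put in canonical order."""
    a >>= 1
    swaps = 0
    while a:
        swaps += bin(a & b).count("1")
        a >>= 1
    return -1 if swaps & 1 else 1


def gp(x, y):
    """Geometric product of multivectors stored as {bitmask: coefficient}."""
    out = {}
    for a, ca in x.items():
        for b, cb in y.items():
            k = a ^ b                       # repeated basis vectors square to 1
            out[k] = out.get(k, 0.0) + blade_sign(a, b) * ca * cb
    return {k: v for k, v in out.items() if abs(v) > 1e-12}


e1, e2 = {1: 1.0}, {2: 1.0}
e1e1 = gp(e1, e1)                           # scalar 1, first case of (14)
e1e2 = gp(e1, e2)                           # bivector e1 ^ e2, second case of (14)
e2e1 = gp(e2, e1)                           # minus the same bivector

# Equation (13): the product of two vectors is a scalar plus a bivector.
x1 = {1: 1.0, 2: 2.0}                       # e1 + 2 e2
x2 = {1: 3.0, 2: 4.0}                       # 3 e1 + 4 e2
product = gp(x1, x2)
```

In `product`, key 0 holds the inner product x1 · x2 and key 3 (binary 11) holds the coefficient of e1 ∧ e2.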
Thus, for each 0 ≤ k ≤ n, the set of k-vectors is spanned by:
k = 0: $\{1\}$ (scalars).
k = 1: $\{e_1, \dots, e_n\}$ (vectors).
k = 2: $\{e_i \wedge e_j\}_{1 \le i < j \le n}$ (bivectors).
k = 3: $\{e_i \wedge e_j \wedge e_k\}_{1 \le i < j < k \le n}$ (trivectors).
...
k = r: $\{e_{i_1} \wedge \cdots \wedge e_{i_r}\}_{1 \le i_1 < \cdots < i_r \le n}$ (r-vectors).
...
k = n: $\{e_1 \wedge \cdots \wedge e_n\}$ (pseudoscalar).
Then, for each 0 ≤ k ≤ n, there are exactly C (n, k ) generators for the set of k-
vectors, and thus, the set of k-vectors defines a vector space with basis Bk = {ei1 ∧ · · · ∧
eik }1≤i1 <···<ik ≤n and dimension C (n, k).
Definition 3. Let Rn denote the real vector space of dimension n. Then, the vector space spanned
by the basis
$$B = \{e_{i_1} \wedge \cdots \wedge e_{i_r}\}_{\substack{1 \le i_1 < \cdots < i_r \le n \\ 0 \le r \le n}} \tag{15}$$
endowed with the geometric product defined in (13) is an algebra over $\mathbb{R}$ known as the geometric
algebra (GA) of $\mathbb{R}^n$. Such an algebra is denoted by $G_n$ and has the dimension
$C(n, 0) + C(n, 1) + \cdots + C(n, n) = 2^n$.
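The dimension count in Definition 3 can be checked by enumerating the index sets of the basis blades. This is a small sketch; `basis_blades` is a hypothetical helper name.

```python
from itertools import combinations
from math import comb


def basis_blades(n):
    """Index tuples (i1 < ... < ir) of the basis blades of G_n, grouped by grade r."""
    return {r: list(combinations(range(1, n + 1), r)) for r in range(n + 1)}


blades = basis_blades(3)
counts = [len(blades[r]) for r in range(4)]     # blades per grade
expected = [comb(3, r) for r in range(4)]       # C(3, r)
dimension = sum(counts)                         # 2**3
```

For n = 3 the enumeration reproduces the eight elements of the spatial geometric algebra: one scalar, three vectors, three bivectors and the pseudoscalar.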
Remark 2. Since the grading structure of multivectors is a property associated with the exterior
product, the elements of Gn can still be called k-blades, k-vectors and multivectors.
$$A = \langle A \rangle_0 + \langle A \rangle_1 + \cdots + \langle A \rangle_n. \tag{16}$$
Hence, the set of all k-vectors for a given 1 ≤ k ≤ n is a vector subspace of $G_n$ denoted
by $\langle G_n \rangle_k$ and spanned by $B_k = \{e_{i_1} \wedge \cdots \wedge e_{i_k}\}_{1 \le i_1 < \cdots < i_k \le n}$.
The multivector representation (16) is very useful in defining another important
operator in Gn . This linear operator is known as the reversion operator and is denoted by
the superscript ∼. The reversion is defined over the geometric product of m vectors as:
$$(a_1 \cdots a_m)^{\sim} = a_m \cdots a_1. \tag{17}$$
due to the anticommutativity of the exterior product. Finally, since reversion is a linear
operator, the reverse of an arbitrary multivector is:
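$$\tilde{A} = \widetilde{\langle A \rangle}_0 + \cdots + \widetilde{\langle A \rangle}_n = \langle A \rangle_0 + \langle A \rangle_1 - \langle A \rangle_2 + \cdots + (-1)^{\frac{n(n-1)}{2}} \langle A \rangle_n. \tag{19}$$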
Notice that the reversion operator corresponds simply to matrix transposition when a
matrix representation of the n-dimensional algebra is considered.
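The sign pattern of Equation (19) is easy to tabulate. In this sketch a multivector is assumed to be stored grade by grade, as a list whose k-th entry holds the coefficients of its grade-k part; that storage layout is an illustrative choice.

```python
def reverse_sign(k):
    """Sign (-1)^(k(k-1)/2) picked up by a grade-k part under reversion."""
    return -1 if (k * (k - 1) // 2) % 2 else 1


def reverse(grades):
    """Reverse a multivector given as a list of per-grade coefficient lists."""
    return [[reverse_sign(k) * c for c in part] for k, part in enumerate(grades)]


signs = [reverse_sign(k) for k in range(6)]
# Scalar 1, vector (2, 0, 0), bivector (0, 3, 0), trivector (4,) in G_3.
reversed_A = reverse([[1.0], [2.0, 0.0, 0.0], [0.0, 3.0, 0.0], [4.0]])
```

Scalars and vectors are unchanged, while bivectors and trivectors change sign, and the pattern repeats with period four in the grade.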
Finally, another operator of great interest is the dual operator. Every grade-n element
of Gn is of the form α(e1 ∧ · · · ∧ en ) for a scalar α ∈ R. For each α ∈ R, α(e1 ∧ · · · ∧ en ) is
known as the volume element Eα of Gn , while the generator e1 ∧ · · · ∧ en is known as the
pseudoscalar of Gn and is usually denoted by I. Pseudoscalars allow us to define the dual
operator, whose action over a k-vector Ak is:
$$A_k^{*} = I A_k, \tag{20}$$
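$$\begin{aligned}
\exp(\alpha B) &= 1 + \alpha B - \frac{\alpha^2}{2} - \frac{\alpha^3 B}{3!} + \cdots \\
&= \left(1 - \frac{\alpha^2}{2} + \cdots\right) + B\left(\alpha - \frac{\alpha^3}{3!} + \cdots\right) \\
&= \cos(\alpha) + B \sin(\alpha).
\end{aligned} \tag{22}$$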
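Equation (22) can be checked numerically by summing the series directly. The sketch assumes that B is a unit bivector with B² = −1, so that every element a + bB can be stored as the pair (a, b); the truncation at 30 terms is an arbitrary choice.

```python
import math


def mul(x, y):
    """Product of a + bB and c + dB, where B is a unit bivector with B*B = -1."""
    a, b = x
    c, d = y
    return (a * c - b * d, a * d + b * c)


def exp_bivector(alpha, terms=30):
    """Partial sum of the power series of exp(alpha * B)."""
    total = (1.0, 0.0)
    term = (1.0, 0.0)
    for k in range(1, terms):
        term = mul(term, (0.0, alpha / k))      # (alpha B)^k / k!
        total = (total[0] + term[0], total[1] + term[1])
    return total


scalar_part, bivector_part = exp_bivector(0.8)
```

The scalar part converges to cos(α) and the bivector coefficient to sin(α), in agreement with the last line of Equation (22).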
Equation (22) indicates that exp(αB ) could be related to rotations. Indeed, we have
the following result.
$$X' = R X \tilde{R}. \tag{23}$$
$$R = \frac{1 + e^1 f_1 + \cdots + e^n f_n}{\left| 1 + e^1 f_1 + \cdots + e^n f_n \right|}, \tag{24}$$
$$\mathcal{R} = \{ R \in \langle G_n \rangle_0 + \langle G_n \rangle_2 : R\tilde{R} = 1 \}. \tag{25}$$
Therefore, the product of two different rotors R1 and R2 also encodes a rotation. In
particular, it is the rotation resulting from the composition of the rotations encoded by
R1 and R2 , respectively. In addition, the second property states that R provides a double
covering of the rotation group.
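Both properties can be observed on a small example. The sketch below uses a dictionary-based geometric product for a Euclidean orthonormal basis (blades encoded as bitmasks) and the sandwich product R X R̃; the sign convention chosen for the rotor of a rotation in the e1e2 plane is an assumption.

```python
import math


def blade_sign(a, b):
    """Sign obtained when the product of blades a and b is put in canonical order."""
    a >>= 1
    swaps = 0
    while a:
        swaps += bin(a & b).count("1")
        a >>= 1
    return -1 if swaps & 1 else 1


def gp(x, y):
    """Geometric product of multivectors stored as {bitmask: coefficient}."""
    out = {}
    for a, ca in x.items():
        for b, cb in y.items():
            out[a ^ b] = out.get(a ^ b, 0.0) + blade_sign(a, b) * ca * cb
    return {k: v for k, v in out.items() if abs(v) > 1e-12}


def reverse(x):
    """Reversion: grades 2 and 3 (mod 4) change sign."""
    return {k: (-c if (bin(k).count("1") // 2) % 2 else c) for k, c in x.items()}


def rotor(theta):
    """Rotor of a rotation by theta in the e1e2 plane (assumed sign convention)."""
    return {0: math.cos(theta / 2), 3: -math.sin(theta / 2)}


def rotate(R, x):
    return gp(gp(R, x), reverse(R))


e1 = {1: 1.0}
composed = gp(rotor(math.pi / 3), rotor(math.pi / 6))   # total angle pi / 2
image = rotate(composed, e1)                             # e1 is sent to e2
minus = {k: -c for k, c in composed.items()}
image_minus = rotate(minus, e1)                          # -R encodes the same rotation
```

The product of the two rotors rotates e1 by the sum of the two angles, and negating the rotor leaves the image unchanged, which is the double covering mentioned above.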
Finally, one of the most important geometric algebras is the spatial geometric algebra
G3 , whose basis is:
{1, e1 , e2 , e3 , e12 , e13 , e23 , I }, (26)
where {e1 , e2 , e3 } is an orthonormal basis of R3 and eij = ei ∧ e j .
Theorem 1 (Chasles, 1830 [43]). Every rigid motion f ∈ SE(3) can be realized as a rotation
around an axis followed (preceded) by a translation along the same axis.
Remark 3. For infinitesimal motions, if θ ≠ 0, then the pitch is defined as $h = \dot{d}/\dot{\theta}$.
Hence, every rigid motion is a screw motion. Particular cases of screw motions are pure
rotations (pure translations), where the translation (rotation) is the identity or, equivalently,
h = 0 (h = ∞). In addition, every screw motion can be characterized by the triple (ℓ, h, q),
where q denotes the magnitude of the motion. If h ≠ ∞, then θ = q and d = hθ, while if
h = ∞, then θ = 0 and d = q. We call this triple the screw associated with the screw motion,
and we denote it using $.
Proposition 2. Given a screw $ = (ℓ, h, q) with screw axis ℓ, pitch h and magnitude q, there
exists a twist ξ such that the rigid motion it generates is the screw motion associated with $.
Proposition 2 states a correspondence between twists and screws that is useful for
our purposes. In particular, if p is a point on ℓ and v is its direction unit vector, then
ℓ = { p + vλ : λ ∈ R} and we have that:
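$$\begin{aligned}
\xi &= \theta \begin{bmatrix} v \\ p \times v + h v \end{bmatrix} && \text{for a general screw motion}, \\
\xi &= \theta \begin{bmatrix} v \\ p \times v \end{bmatrix} && \text{for a pure rotation}, \\
\xi &= d \begin{bmatrix} 0 \\ v \end{bmatrix} && \text{for a pure translation}.
\end{aligned} \tag{28}$$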
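The three cases of Equation (28) can be collected in a single function. This is a sketch with hypothetical argument names; the twist is returned as a 6-tuple with the direction part first and the moment part second, following the ordering used in Equation (28).

```python
def cross(a, b):
    """Cross product of two 3-vectors given as tuples."""
    return (a[1] * b[2] - a[2] * b[1],
            a[2] * b[0] - a[0] * b[2],
            a[0] * b[1] - a[1] * b[0])


def twist(p, v, h=0.0, magnitude=1.0, translation=False):
    """Twist of Equation (28) for the screw axis {p + v*lambda}.

    h = 0 gives a pure rotation; translation=True gives a pure translation,
    for which `magnitude` plays the role of d instead of theta.
    """
    if translation:
        return tuple(magnitude * c for c in (0.0, 0.0, 0.0) + tuple(v))
    moment = cross(p, v)
    linear = tuple(m + h * vi for m, vi in zip(moment, v))
    return tuple(magnitude * c for c in tuple(v) + linear)


# Hypothetical axis: parallel to z through the point (1, 0, 0).
pure_rotation = twist((1.0, 0.0, 0.0), (0.0, 0.0, 1.0))
general_screw = twist((1.0, 0.0, 0.0), (0.0, 0.0, 1.0), h=0.5, magnitude=2.0)
pure_translation = twist((1.0, 0.0, 0.0), (0.0, 0.0, 1.0), magnitude=3.0,
                         translation=True)
```

Calling the function with `magnitude=1.0` returns the unit twist associated with the same axis and pitch.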
A twist ξ associated with a magnitude 1 screw $ is said to be a unit twist. Hence, any
twist ξ can be seen as a unit twist multiplied by the magnitude of the associated screw:
$$\xi = \theta\, \xi_U = \theta \begin{bmatrix} v \\ p \times v + h v \end{bmatrix}, \tag{29}$$
where $\xi_U$ is a unit twist. Clearly, ξ is associated with $ = (ℓ, h, q), while $\xi_U$ is associated
with $ = (ℓ, h, 1).
Proposition 3. Let us consider a rigid body performing a screw motion represented by the screw
$ = (ℓ, h, q(t)), where the magnitude q(t) is a time-dependent variable. Its velocity during the
screw motion is given by the associated twist ξ, where, now, the pitch is defined as in Remark 3.
In particular:
$$\begin{aligned}
\xi &= \dot{\theta}(t) \begin{bmatrix} v \\ p \times v + h v \end{bmatrix} && \text{if } \theta \neq 0, \\
\xi &= \dot{d}(t) \begin{bmatrix} 0 \\ v \end{bmatrix} && \text{if } \theta = 0.
\end{aligned} \tag{30}$$
Now, let us consider a serial robot with n DoF, where ω, v denote the angular and
linear velocity vectors of its end-effector. If Equation (3) is expanded, the following is
obtained:
$$\begin{bmatrix} v \\ \omega \end{bmatrix} = J_1(q)\,\dot{q}_1 + \cdots + J_n(q)\,\dot{q}_n, \tag{31}$$
where Ji denotes the i-th column of the Jacobian matrix J. Notice that the right side of
Equation (31) can be seen as the addition of the twists associated with the joints of the robot,
where q̇i plays the role of the twist amplitude and where the linear and angular parts are
interchanged. However, for the sake of formality, let us consider the unit twist ξ i associated
with the i-th joint of the robot (Since, from now on, we are going to work exclusively with
unit twists, the subindex U is omitted for simplicity). Then:
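$$\xi_i(q)\,\dot{q}_i = \begin{cases}
\begin{bmatrix} z_i \\ z_i \times (o_n - o_i) \end{bmatrix} \dot{q}_i & \text{if joint } i \text{ is revolute}, \\[2ex]
\begin{bmatrix} 0 \\ z_i \end{bmatrix} \dot{q}_i & \text{if joint } i \text{ is prismatic},
\end{cases} \tag{32}$$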
where, as stated in the introduction, zi is the direction vector of the joint axis, on (oi ) is the
origin of the frame attached to the end-effector (i-th joint) and q̇i = θ̇i if joint i is revolute
and q̇i = d˙i if joint i is prismatic.
Remark 4. The unit twists ξ i (q) defined in Equation (32) are represented with respect to the world
frame, not with respect to the local frame attached to the previous joint. If the unit twists are defined
with respect to a local frame, we need to use the adjoint transformation to represent them with
respect to the world frame. In particular, ξ i (q) = Ad f ξ i (q), where Ad f : R6 → R6 is the adjoint
transformation associated with the rigid motion f , i.e., the rigid motion transforming the reference
frame to the local frame in which the twist is initially represented.
where, again, ω, v denote the angular and linear velocity vectors of the robot’s end-effector and
q̇ = (q̇1 , . . . , q̇n ).
The main advantage of the screw-based Jacobian matrix defined in Equation (33) is that
it allows a geometrical identification of the singularities. Moreover, if an approach based
on geometric algebra is used, an intuitive geometrical and computer-friendly algebraic
identification of the singularities is possible. For that purpose, let us consider the geometric
algebra G6 , where, for every i = 1, . . . , n, the unit twist ξ i (q) can be modelled as a vector.
Indeed, we make the identification ξ i (q) = [ξ i1 · · · ξ i6 ] T with the vector x = ξ i1 e1 + · · · +
ξ i6 e6 ∈ G6 , where e1 , . . . , e6 are the basis vectors of G6 .
The following gives the main result of this section.
Theorem 3. Let ξ i (q) denote the unit twist defined by the i-th joint expressed as a vector of
G6 . Then:
ξ 1 (q) ∧ · · · ∧ ξ 6 (q) = det([ξ 1 (q) · · · ξ 6 (q)])e1 ∧ · · · ∧ e6 . (34)
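The identity (34) can be verified numerically on arbitrary vectors. In this sketch the outer product is accumulated blade by blade (blades encoded as bitmasks over an orthonormal basis) and compared with a determinant computed from the Leibniz formula; the six vectors are made-up numbers, not twists of an actual robot.

```python
from itertools import permutations


def blade_sign(a, b):
    """Sign obtained when the product of blades a and b is put in canonical order."""
    a >>= 1
    swaps = 0
    while a:
        swaps += bin(a & b).count("1")
        a >>= 1
    return -1 if swaps & 1 else 1


def wedge(x, y):
    """Outer product of multivectors stored as {bitmask: coefficient}."""
    out = {}
    for a, ca in x.items():
        for b, cb in y.items():
            if a & b:                      # shared basis vector: the term vanishes
                continue
            out[a | b] = out.get(a | b, 0.0) + blade_sign(a, b) * ca * cb
    return out


def outer_product(vectors):
    result = {0: 1.0}
    for v in vectors:
        result = wedge(result, {1 << i: float(c) for i, c in enumerate(v)})
    return result


def det_leibniz(vectors):
    n = len(vectors)
    total = 0.0
    for perm in permutations(range(n)):
        inversions = sum(perm[i] > perm[j] for i in range(n) for j in range(i + 1, n))
        term = -1.0 if inversions % 2 else 1.0
        for row, col in enumerate(perm):
            term *= vectors[row][col]
        total += term
    return total


xi = [(1, 0, 2, 0, 0, 1), (0, 1, 0, 3, 0, 0), (2, 0, 1, 0, 1, 0),
      (0, 0, 0, 1, 0, 2), (1, 1, 0, 0, 1, 0), (0, 2, 0, 0, 0, 1)]
pseudo = outer_product(xi).get(63, 0.0)          # coefficient of e1 ^ ... ^ e6
determinant = det_leibniz(xi)

dependent = xi[:5] + [tuple(a + b for a, b in zip(xi[0], xi[1]))]
pseudo_dependent = outer_product(dependent).get(63, 0.0)
```

When one vector is a linear combination of the others, the coefficient of the pseudoscalar vanishes, which is the situation exploited in Corollary 1.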
Theorem 4 can be easily deduced from Equation (4.143) in [36] (p. 108):
F ( I ) = det( F ) I, (36)
Corollary 1. Given a serial robot with 6 DoF and associated unit twists ξ 1 (q), . . . , ξ 6 (q), then
q ∈ S if, and only if, ξ 1 (q) ∧ · · · ∧ ξ 6 (q) = 0.
Proof. Taking the dual of Equation (34), the following identity is obtained:
and, therefore, the singularities of the serial robot are those configurations q ∈ C verify-
ing that:
(ξ 1 (q) ∧ · · · ∧ ξ 6 (q))∗ = 0. (38)
Now, since for a given non-zero multivector M ∈ Gn , M∗ = 0 if, and only if, M = 0,
Equation (38) can be simplified to:
Remark 5. Notice that what has also been proven in Corollary 1 is that the outer product of n
vectors of Gn is zero if, and only if, the n vectors are linearly dependent.
S = { q ∈ C : ξ 1 ( q ) ∧ · · · ∧ ξ 6 ( q ) = 0}. (40)
Remark 6. What Theorem 3 states is that, for instance, if two unit twists ξ 1 and ξ 2 satisfy
ξ 1 ∧ ξ 2 = 0, then they represent the same twist, and hence, they generate the same screw motion.
This means that if such a screw motion is a pure translation, then the translational axes are either
parallel or coincident, while if the screw motion is a pure rotation, the rotational axes are coincident
(Since the twists contain the term (zi × (o6 − oi )) for i = 1, 2, they cannot be parallel). Regarding
the kinematic singularities of serial robots, this implies that two prismatic joints whose axes are
either parallel or coincident give rise to a singularity and, equivalently, that two revolute joints
whose axes are coincident give rise to a singularity. This is, in fact, in agreement with what is
known about kinematic singularities since two parallel revolute joint axes do not give rise to a
singularity. Obviously, the same geometrical interpretation can be made for three, four or more unit
twists satisfying that their outer product is zero.
With respect to redundant serial robots, it is clear that, for n > 6, ξ 1 (q) ∧ · · · ∧ ξ n (q) = 0
for any q ∈ C . Hence, Corollary 1 on its own does not allow us to characterize the
singularities of redundant robots. However, this problem can be easily overcome by
studying all the possible combinations of six unit twists in {ξ 1 (q), . . . , ξ n (q)}. We denote
the set of all combinations of six elements that can be drawn from {1, . . . , n} by S. Clearly,
S has $C(n, 6) = \binom{n}{6}$ elements of the form $\{i_1, \dots, i_6\}$, where $1 \le i_1 < \cdots < i_6 \le n$ and
$1 \le i \le C(n, 6)$.
Theorem 5. Given a serial robot with n DoF and associated unit twists ξ 1 (q), . . . , ξ n (q), then
q ∈ S if, and only if, for each 1 ≤ i ≤ C (n, 6):
where (1) uses Equations (37) and (2) uses the fact that all the minors of order 6 of the
matrix [ξ 1 (q) · · · ξ n (q)] have null determinants. Clearly, ρ([ξ 1 (q) · · · ξ n (q)]) < 6 if, and
only if, ρ( J (q)) < 6, which, in turn, is equivalent to q ∈ S (by Definition 1).
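The test for redundant robots can be sketched as a loop over all combinations of six twists. Here the vanishing of each outer product is checked through the equivalent determinant (see Theorem 3), computed by Gaussian elimination; the tolerance and the example twists are assumptions made for illustration.

```python
from itertools import combinations


def det(rows):
    """Determinant by Gaussian elimination with partial pivoting."""
    m = [list(map(float, r)) for r in rows]
    n = len(m)
    d = 1.0
    for c in range(n):
        p = max(range(c, n), key=lambda r: abs(m[r][c]))
        if abs(m[p][c]) < 1e-12:
            return 0.0
        if p != c:
            m[c], m[p] = m[p], m[c]
            d = -d
        d *= m[c][c]
        for r in range(c + 1, n):
            f = m[r][c] / m[c][c]
            for k in range(c, n):
                m[r][k] -= f * m[c][k]
    return d


def is_singular(twists, tol=1e-9):
    """True if every combination of six twists is linearly dependent."""
    k = len(twists[0])
    return all(abs(det([twists[i] for i in idx])) < tol
               for idx in combinations(range(len(twists)), k))


basis = [tuple(1.0 if i == j else 0.0 for j in range(6)) for i in range(6)]
regular = basis + [(1.0, 1.0, 0.0, 0.0, 0.0, 0.0)]                # rank 6
deficient = basis[:5] + [(1.0, 1.0, 0.0, 0.0, 0.0, 0.0),
                         (0.0, 0.0, 1.0, 0.0, 0.0, 0.0)]          # rank 5
n_combinations = len(list(combinations(range(7), 6)))             # C(7, 6)
```

For n = 7 only seven combinations have to be examined, and the configuration is singular only when all of them fail.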
Evaluating either Equation (39) for non-redundant robots or Equation (41) for redundant
ones is computationally more efficient than evaluating either det( J (q)) = 0 or
det( J (q) J T (q)) = 0, because the operations needed to obtain the expressions (39) or (41)
have a lower complexity than those needed to obtain the determinants.
The outer product of n vectors of $G_n$ behaves like the addition and product
of real numbers, and hence, it has complexity $O(n) + O(n^2)$, while the determinant has
complexity $O(n^3)$ or $O(n^4)$ depending on the algorithm used. In addition, for redundant
robots, there are two main operations: the product between $J(q)$ and $J^T(q)$ and the
determinant of the product matrix. This implies that, for this case, the complexity increases to
$O(n^3) + O(n^4)$.
Theorem 6. Given a serial robot with n DoF and a spherical wrist, if either zi × (on − oi ) or zi
are denoted by si for i = 1, . . . , n − 3, then:
• q ∈ C is a position singularity if, and only if, si1 (q) ∧ si2 (q) ∧ si3 (q) = 0 for each 1 ≤
i ≤ C (n − 3, 3), where {i1 , i2 , i3 } is the i-th combination of three elements drawn from
{1, . . . , n − 3}.
Proof. The proof is completely analogous to the proof of Corollary 1 and Theorem 5.
Remark 7. Since the last three joint axes either intersect at a single point or are parallel, there is
only one orientation singularity, namely, when these three joint axes are coplanar. This can also be
easily deduced from Equation (45). A schematic representation of such singularity, also called wrist
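singularity, is depicted in Figure 2.
[Figure 2. Wrist singularity: joint axes $z_{n-2}$, $z_{n-1}$, $z_n$ with joint angles $\theta_{n-2}$, $\theta_{n-1}$, $\theta_n$.]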
4. Distance to Singularities
Let q1 , q2 ∈ C be two arbitrary configurations of a serial robot with n DoF and let ξ 1 , . . . , ξ n
be the unit twists associated with its joints. Then, there exist R1 (q1 , q2 ), . . . , Rn (q1 , q2 ), where,
for each 1 ≤ i ≤ n, Ri (q1 , q2 ) is a configuration-dependent rotor in the six-dimensional
geometric algebra G6 such that (Figure 3):
$$\xi_i(q_2) = R_i(q_1, q_2)\, \xi_i(q_1)\, \tilde{R}_i(q_1, q_2). \tag{46}$$
The reason why these rotors exist is simple: unit twists are modelled as vectors in
G6 and there always exists a rotor relating any pair of vectors in any geometric algebra
Gn . In particular, there is always a rotor relating the same unit twist ξ in two different
configurations q1 , q2 .
The notation chosen for these rotors expresses a configuration dependence that is not
a functional dependency, i.e., there is no analytical expression for these rotors with q as
a variable.
Now, it is clear that $R_{i_j}(q, q_s) = 1$ if, and only if, $q = q_s$ for every j = 1, . . . , r. Since,
for each j, $R_{i_j}(q, q_s)$ does not define a function of q, a distance function cannot be
defined from these rotors. Nevertheless, a measure of how close a given configuration q is
to a singularity can be set as:
$$q \approx q_s \iff R_{i_j}(q) \approx 1 \ \text{for every } j = 1, \dots, r. \tag{48}$$
Example 1. Let qs ∈ S be a singularity of a serial robot that only involves the second and third
joints. Then, for any configuration q ∈ C , there exist R2 (q, qs ) and R3 (q, qs ) such that:
$$\begin{aligned}
\xi_2(q_s) &= R_2(q, q_s)\, \xi_2(q)\, \tilde{R}_2(q, q_s), \\
\xi_3(q_s) &= R_3(q, q_s)\, \xi_3(q)\, \tilde{R}_3(q, q_s).
\end{aligned} \tag{49}$$
These rotors can be constructed in many different ways. The easiest way consists of
considering, for each 1 ≤ i ≤ n, the frame {i } attached to joint i and constructed from ξ i .
This three-dimensional frame varies with the configuration q. Hence, for two different
configurations q1 and q2 , there are two frames {i } attached to joint i. As shown in Section 2,
we can recover the three-dimensional rotor that transforms one of the frames into the other.
Since each frame {i } depends continuously on the configuration q, the rotor Ri (q) is a
continuous function defined as follows:
$$R_i : \mathcal{C} \to \mathcal{R}, \qquad q \mapsto R_i(q). \tag{52}$$
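Lemma 1. For any given multivector $X \in G_n$, $\langle X \tilde{X} \rangle_0 \in \mathbb{R}^+ = [0, +\infty)$.
Then:
$$X \tilde{X} = \sum_{i=0}^{n} \sum_{j=0}^{n} \langle X \rangle_i \langle \tilde{X} \rangle_j. \tag{55}$$
Now, note that, for each i = 1, . . . , n, $\langle X \rangle_i$ is an i-vector, i.e., it only contains terms of
grade i. The geometric product of two k-vectors (with different k) is stated as follows [36]
(p. 103):
$$A_r B_s = \langle A_r B_s \rangle_{|r-s|} + \langle A_r B_s \rangle_{|r-s|+2} + \cdots + \langle A_r B_s \rangle_{r+s}, \tag{56}$$
Therefore, it is clear that $\langle \langle X \rangle_i \langle \tilde{X} \rangle_j \rangle_0 = 0$ for $i \neq j$. Thus:
$$\langle X \tilde{X} \rangle_0 = \sum_{i=0}^{n} \langle \langle X \rangle_i \langle \tilde{X} \rangle_i \rangle_0. \tag{57}$$
$$\langle X \rangle_i = \sum_{j=1}^{C(n,i)} \alpha_j(i)\, e_{j_1} \cdots e_{j_i}, \tag{58}$$
where, for every $1 \le j \le C(n, i) = \binom{n}{i}$, $\alpha_j(i) \in \mathbb{R}$ and $e_{j_1} \cdots e_{j_i}$ are the base elements of
$\langle G_n \rangle_i$. Therefore:
$$\langle \tilde{X} \rangle_i = \sum_{j=1}^{C(n,i)} \alpha_j(i)\, e_{j_i} \cdots e_{j_1} \tag{59}$$
and, thus:
$$\begin{aligned}
\langle X \rangle_i \langle \tilde{X} \rangle_i &= \sum_{j=1}^{C(n,i)} \alpha_j(i)\, e_{j_1} \cdots e_{j_i} \sum_{j=1}^{C(n,i)} \alpha_j(i)\, e_{j_i} \cdots e_{j_1} \\
&= \sum_{j=1}^{C(n,i)} \sum_{k=1}^{C(n,i)} \alpha_j(i)\, \alpha_k(i)\, e_{j_1} \cdots e_{j_i} e_{k_i} \cdots e_{k_1},
\end{aligned} \tag{60}$$
where, clearly, $\langle e_{j_1} \cdots e_{j_i} e_{k_i} \cdots e_{k_1} \rangle_0 = \delta_{jk}$ with $\delta_{jk}$ the Kronecker delta. Then:
$$\langle \langle X \rangle_i \langle \tilde{X} \rangle_i \rangle_0 = \sum_{j=1}^{C(n,i)} \alpha_j^2, \tag{61}$$
which, for every 1 ≤ i ≤ n, is a non-negative scalar. This implies that the sum of Equation (57) is
also a non-negative scalar.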
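The content of Lemma 1 can be checked on a concrete multivector: the scalar part of the product of X with its reverse equals the sum of the squared coefficients of X. The sketch assumes a Euclidean orthonormal basis and stores multivectors of G3 as dictionaries from blade bitmasks to coefficients; the sample coefficients are arbitrary.

```python
def blade_sign(a, b):
    """Sign obtained when the product of blades a and b is put in canonical order."""
    a >>= 1
    swaps = 0
    while a:
        swaps += bin(a & b).count("1")
        a >>= 1
    return -1 if swaps & 1 else 1


def gp(x, y):
    """Geometric product of multivectors stored as {bitmask: coefficient}."""
    out = {}
    for a, ca in x.items():
        for b, cb in y.items():
            out[a ^ b] = out.get(a ^ b, 0.0) + blade_sign(a, b) * ca * cb
    return out


def reverse(x):
    """Reversion: grades 2 and 3 (mod 4) change sign."""
    return {k: (-c if (bin(k).count("1") // 2) % 2 else c) for k, c in x.items()}


# X = 1 + 2 e1 - 3 e1^e2 + 0.5 e1^e2^e3 in G_3.
X = {0: 1.0, 1: 2.0, 3: -3.0, 7: 0.5}
scalar_part = gp(X, reverse(X)).get(0, 0.0)
sum_of_squares = sum(c * c for c in X.values())
```

Because the scalar part reduces to a sum of squares, its square root behaves like a Euclidean norm on the coefficients, which is the idea developed in Proposition 4.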
Lemma 2. Given three strictly positive real numbers $a_1, a_2, a_3 \in \mathbb{R}^+ \setminus \{0\}$, the following
properties hold:
• $\sqrt{a_1 + a_2 - a_3} \le \sqrt{a_1} + \sqrt{a_2}$;
• $\sqrt{a_1 + a_2 + a_3} \le \sqrt{a_1} + \sqrt{a_2}$ if, and only if, $a_3 \le 2\sqrt{a_1 a_2}$.
Proposition 4. The function $\|\cdot\| : G_n \to \mathbb{R}^+$ defined by the identity $\|X\|^2 = \langle X \tilde{X} \rangle_0$ is a norm
in $G_n$, i.e.,
(i) $\|X\| \ge 0$ for all $X \in G_n$. In particular, $\|X\| = 0$ if, and only if, $X = 0$.
(ii) $\|\lambda X\| = |\lambda| \|X\|$ for all $X \in G_n$ and $\lambda \in \mathbb{R}$.
(iii) $\|X + Y\| \le \|X\| + \|Y\|$ for all $X, Y \in G_n$ (usually known as the triangle inequality).
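Proof.
(i) Given a multivector X, the identity $\|X\|^2 = \langle X \tilde{X} \rangle_0$ is equivalent to:
$$\|X\| = \pm \sqrt{\langle X \tilde{X} \rangle_0}. \tag{62}$$
Thus, it is clear by Lemma 1 that the positive branch of Equation (62) is well defined
and that $\|X\| \ge 0$. In particular, if $\|X\| = 0$, then:
$$\sqrt{\langle X \tilde{X} \rangle_0} = 0 \implies \langle X \tilde{X} \rangle_0 = 0 \implies \sum_{i=0}^{n} \langle \langle X \rangle_i \langle \tilde{X} \rangle_i \rangle_0 = 0, \tag{63}$$
where all the terms of the last equation are non-negative by Lemma 1 and, thus, all of them
are equal to zero. Now, note that each addend is the geometric product of an i-vector
with its reverse. Therefore, if such product is zero, the corresponding i-vector must be
zero. Since all the terms are zero, all the i-vectors that form X are zero, and thus, X
is zero.
(ii) If $\lambda \in \mathbb{R}$ and $X \in G_n$, then:
$$\|\lambda X\| = \sqrt{\langle (\lambda X)(\lambda \tilde{X}) \rangle_0} = \sqrt{\langle \lambda^2 X \tilde{X} \rangle_0} \overset{(1)}{=} \sqrt{\lambda^2 \langle X \tilde{X} \rangle_0} = |\lambda| \sqrt{\langle X \tilde{X} \rangle_0} = |\lambda| \|X\|, \tag{64}$$
where (1) uses the linearity of the grade-0 projection operator (as stated in Section 2).
(iii) Given two different multivectors X and Y, they can be expanded as linear combinations
of the basis elements of $G_n$ as follows:
$$X = \sum_{i=0}^{2^n} \alpha_i\, e_{j_1} \cdots e_{j_i}, \qquad Y = \sum_{i=0}^{2^n} \beta_i\, e_{j_1} \cdots e_{j_i}, \tag{65}$$
and hence:
$$\|X + Y\| = \sqrt{\langle (X + Y)(X + Y)^{\sim} \rangle_0} \overset{(1)}{=} \sqrt{\sum_{i=0}^{2^n} (\alpha_i + \beta_i)^2}
= \sqrt{\underbrace{\sum_{i=0}^{2^n} \alpha_i^2}_{A} + \underbrace{\sum_{i=0}^{2^n} \beta_i^2}_{B} + \underbrace{2 \sum_{i=0}^{2^n} \alpha_i \beta_i}_{C}}, \tag{67}$$
where (1) uses Lemma 1, while A, B and C are just notations given to simplify the
different manipulations. Since A, B > 0 (if either A or B is equal to zero, then either
X = 0 or Y = 0, which makes the condition $\|X + Y\| \le \|X\| + \|Y\|$ trivial):
$$\sqrt{A + B + C} \overset{(1)}{\le} \sqrt{A} + \sqrt{B} = \sqrt{\sum_{i=0}^{2^n} \alpha_i^2} + \sqrt{\sum_{i=0}^{2^n} \beta_i^2} = \|X\| + \|Y\|, \tag{68}$$
where (1) uses Lemma 2, which requires $C \le 2\sqrt{AB}$. To verify this condition, note that:
$$\begin{aligned}
AB &= \sum_{i=0}^{2^n} \alpha_i^2 \sum_{i=0}^{2^n} \beta_i^2 = \sum_{i=0}^{2^n} \sum_{j=0}^{2^n} \alpha_i^2 \beta_j^2 = \sum_{i=0}^{2^n} \alpha_i^2 \beta_i^2 + \sum_{i=0}^{2^n} \sum_{\substack{j=0 \\ j \neq i}}^{2^n} \alpha_i^2 \beta_j^2, \\
\left(\frac{C}{2}\right)^2 &= \left( \sum_{i=0}^{2^n} \alpha_i \beta_i \right)^2 = \sum_{i=0}^{2^n} \alpha_i^2 \beta_i^2 + \sum_{i=0}^{2^n} \sum_{\substack{j=0 \\ j \neq i}}^{2^n} \alpha_i \beta_i \alpha_j \beta_j.
\end{aligned} \tag{69}$$
Therefore, the condition holds whenever:
$$\begin{aligned}
0 &\le \sum_{i=0}^{2^n} \sum_{\substack{j=0 \\ j \neq i}}^{2^n} \alpha_i^2 \beta_j^2 - \sum_{i=0}^{2^n} \sum_{\substack{j=0 \\ j \neq i}}^{2^n} \alpha_i \beta_i \alpha_j \beta_j \\
&= \frac{1}{2} \sum_{i=0}^{2^n} \sum_{\substack{j=0 \\ j \neq i}}^{2^n} \alpha_i^2 \beta_j^2 + \frac{1}{2} \sum_{i=0}^{2^n} \sum_{\substack{j=0 \\ j \neq i}}^{2^n} \alpha_i^2 \beta_j^2 - \sum_{i=0}^{2^n} \sum_{\substack{j=0 \\ j \neq i}}^{2^n} \alpha_i \beta_i \alpha_j \beta_j \\
&= \frac{1}{2} \sum_{i=0}^{2^n} \sum_{\substack{j=0 \\ j \neq i}}^{2^n} (\alpha_i \beta_j - \alpha_j \beta_i)^2.
\end{aligned} \tag{72}$$
Since this last inequality is always true, the triangle inequality is also true.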
Proof. The proof is straightforward and uses the fact that $\|\cdot\|$ is a norm. Given two
different rotors $R_1$ and $R_2$:
(i) $D(R_1, R_2) = \|R_1 - R_2\| \ge 0$. In particular:
$$D(R_1, R_2) = 0 \iff \|R_1 - R_2\| = 0 \overset{(1)}{\iff} R_1 - R_2 = 0 \iff R_1 = R_2, \tag{73}$$
$$D(R_1, R_3) = \|R_1 - R_3\| = \|R_1 - R_2 + R_2 - R_3\| \overset{(1)}{\le} \|R_1 - R_2\| + \|R_2 - R_3\| = D(R_1, R_2) + D(R_2, R_3), \tag{75}$$
As stated before, the end-effector pose of a serial robot and the pose of each one of
its joints are described by the configuration-dependent rotors R(q) and Ri (q), respectively.
Thus, one can be tempted to extend the distance function D to C as follows:
$$D : \mathcal{C} \times \mathcal{C} \to \mathbb{R}^+, \qquad D(q_1, q_2) = \|R(q_1) - R(q_2)\|. \tag{76}$$
This function verifies all the requirements of a distance function with the exception of:
D (q1 , q2 ) = 0 ⇐⇒ q1 = q2 . (77)
The reason is simple: a given pose of the end-effector can be associated with up to
16 different configurations if the serial robot is non-redundant and an infinite number if
it is redundant. In particular, this means that R(q1 ) = R(q2 ) with q1 ≠ q2 . However, this
problem can be overcome as follows:
• For each joint i, denote by Ci the configuration space of the subchain formed by the
first i joints. It is clear that, if the robot has n degrees of freedom, Ci ⊂ C for every
1 ≤ i ≤ n. Then, the following set of functions can be defined:
$$D_i : \mathcal{C}_i \times \mathcal{C}_i \to \mathbb{R}^+, \qquad D_i(q_1, q_2) = \|R_i(q_1) - R_i(q_2)\|, \tag{78}$$
where, as stated before, Ri is the rotor that describes the pose of joint i. Again, these
functions are not distance functions for the same reason as D (Equation (76)) is not a
distance function.
• The function:
$$D : \mathcal{C} \times \mathcal{C} \to [0, +\infty), \qquad D(q_1, q_2) = D_1(q_{11}, q_{21}) + \cdots + D_n(q_{1n}, q_{2n}), \tag{79}$$
where $q_{1i}$ ($q_{2i}$) denotes the first i coordinates of the configuration vector $q_1$ ($q_2$), defines
a distance function in C .
Proof. Since, for each 1 ≤ i ≤ n, Di satisfies the requirements (ii ) and (iii ) of a
distance function, it is clear that D also satisfies them. In addition, Di (q1i , q2i ) ≥ 0
for each 1 ≤ i ≤ n and q1i , q2i ∈ Ci . Therefore, D (q1 , q2 ) ≥ 0 for arbitrary q1 , q2 ∈ C .
Finally, if D (q1 , q2 ) = 0, then, since every term of Equation (79) is a non-negative scalar, it can
be deduced that Di (q1i , q2i ) = 0 for every 1 ≤ i ≤ n. Thus, q1 and q2 not only have
the same end-effector pose but also the same pose for each of the joints, which clearly
implies that q1 = q2 .
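The construction of Equation (79) can be sketched on a toy model. The fragment below assumes a planar chain of revolute joints, where the rotor of the i-th joint frame only encodes the accumulated rotation in the e1e2 plane (positions are ignored), and it uses the Euclidean norm of the rotor coefficients, as suggested by Lemma 1. The model and the function names are illustrative assumptions, not the rotors of an actual robot.

```python
import math


def joint_rotors(q):
    """Rotor (scalar part, e1^e2 part) of each joint frame of a planar chain."""
    rotors, angle = [], 0.0
    for qi in q:
        angle += qi                                   # depends on the first i joints
        rotors.append((math.cos(angle / 2), -math.sin(angle / 2)))
    return rotors


def distance(q1, q2):
    """Sum over the joints of ||R_i(q1) - R_i(q2)||, in the spirit of Equation (79)."""
    return sum(math.hypot(a[0] - b[0], a[1] - b[1])
               for a, b in zip(joint_rotors(q1), joint_rotors(q2)))


qa = (0.1, 0.4, -0.3)
qb = (0.5, -0.2, 0.7)
qc = (-0.6, 0.9, 0.2)
d_ab, d_ba = distance(qa, qb), distance(qb, qa)
d_ac, d_cb = distance(qa, qc), distance(qc, qb)
```

Summing over all joint frames, rather than using only the end-effector rotor, is what separates configurations that share the same end-effector pose.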
This distance function can be restricted to S just by considering the joints involved in
a given singularity qs .
Definition 5. Let qs ∈ S be a singularity of a serial robot that involves joints i1 , . . . , ir . Then, the
function D : C × S → R+ is defined by the expression:
where, for each i1 ≤ k ≤ ir , Dk is the function defined in (78) and is a distance function in C .
[Figure: joint frames with axes $x_i$, $z_i$ and joint angles $\theta_i$.]
With respect to the position singularities, the following system of C (4, 3) = 4 equations
should be solved:
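$$\left.\begin{aligned}
(z_1 \times (o_7 - o_1)) \wedge (z_2 \times (o_7 - o_2)) \wedge (z_3 \times (o_7 - o_3)) &= 0 \\
(z_1 \times (o_7 - o_1)) \wedge (z_2 \times (o_7 - o_2)) \wedge (z_4 \times (o_7 - o_4)) &= 0 \\
(z_1 \times (o_7 - o_1)) \wedge (z_3 \times (o_7 - o_3)) \wedge (z_4 \times (o_7 - o_4)) &= 0 \\
(z_2 \times (o_7 - o_2)) \wedge (z_3 \times (o_7 - o_3)) \wedge (z_4 \times (o_7 - o_4)) &= 0
\end{aligned}\right\} \tag{81}$$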
$$\begin{aligned}
z_1 \times (o_7 - o_1) &= \begin{bmatrix}
-400 c_2 s_1 - 390 s_4 (c_1 s_3 + c_3 s_1 s_2) - 390 c_2 c_4 s_1 \\
400 c_1 c_2 - 390 s_4 (s_1 s_3 - c_1 c_3 s_2) + 390 c_1 c_2 c_4 \\
0
\end{bmatrix}, \\
z_2 \times (o_7 - o_2) &= \begin{bmatrix}
-c_1 (400 s_2 + 390 c_4 s_2 - 390 c_2 c_3 s_4) \\
-s_1 (400 s_2 + 390 c_4 s_2 - 390 c_2 c_3 s_4) \\
A_1
\end{bmatrix}, \\
z_3 \times (o_7 - o_3) &= \begin{bmatrix}
c_2 s_1 (390 c_4 s_2 - 390 c_2 c_3 s_4) - s_2 (390 s_4 (c_1 s_3 + c_3 s_1 s_2) + 390 c_2 c_4 s_1) \\
-s_2 (390 s_4 (s_1 s_3 - c_1 c_3 s_2) - 390 c_1 c_2 c_4) - c_1 c_2 (390 c_4 s_2 - 390 c_2 c_3 s_4) \\
c_1 c_2 (390 s_4 (c_1 s_3 + c_3 s_1 s_2) + 390 c_2 c_4 s_1) + c_2 s_1 (390 s_4 (s_1 s_3 - c_1 c_3 s_2) - 390 c_1 c_2 c_4)
\end{bmatrix}, \\
z_4 \times (o_7 - o_4) &= \begin{bmatrix}
(390 c_4 s_2 - 390 c_2 c_3 s_4)(c_1 c_3 - s_1 s_2 s_3) - c_2 s_3 (390 s_4 (c_1 s_3 + c_3 s_1 s_2) + 390 c_2 c_4 s_1) \\
(390 c_4 s_2 - 390 c_2 c_3 s_4)(c_3 s_1 + c_1 s_2 s_3) - c_2 s_3 (390 s_4 (s_1 s_3 - c_1 c_3 s_2) - 390 c_1 c_2 c_4) \\
A_2
\end{bmatrix},
\end{aligned} \tag{82}$$
where
and where ci = cos(θi ) and si = sin(θi ). However, in order to simplify these expressions,
the system of Equations (81) is expressed with respect to the frame attached to the fourth
joint of the Kuka LWR 4+. To do so, a relation analogous to relation (9) is applied. Here,
instead of pre-multiplying by the corresponding rotation matrix, the system of Equations (81)
is multiplied by the three-dimensional rotor R that performs the rotation between the frame
attached to the end-effector and the frame attached to the fourth joint. For instance, the
first equation of the system (81) becomes:
$$R\, (z_1 \times (o_7 - o_1)) \wedge (z_2 \times (o_7 - o_2)) \wedge (z_3 \times (o_7 - o_3))\, \tilde{R} = 0, \tag{83}$$
which, using the geometric covariance property for rotors introduced in Section 2, becomes:
$$R (z_1 \times (o_7 - o_1)) \tilde{R} \wedge R (z_2 \times (o_7 - o_2)) \tilde{R} \wedge R (z_3 \times (o_7 - o_3)) \tilde{R} = 0. \tag{84}$$
where
z5 ∧ z6 ∧ z7 = 0. (88)
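$$\begin{aligned}
& e_2 \wedge (-s_5 e_1 - c_5 e_3) \wedge (c_5 s_6 e_1 + c_6 e_2 - s_5 s_6 e_3) \\
&= (-s_5\, e_2 \wedge e_1 - c_5\, e_2 \wedge e_3) \wedge (c_5 s_6 e_1 + c_6 e_2 - s_5 s_6 e_3) \\
&\overset{(1)}{=} -s_5^2 s_6\, e_1 \wedge e_2 \wedge e_3 - c_5^2 s_6\, e_1 \wedge e_2 \wedge e_3 = -s_6\, e_1 \wedge e_2 \wedge e_3 = 0,
\end{aligned} \tag{89}$$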
where (1) uses the anticommutativity of the outer product. Clearly, the last expression of
Equation (89) is zero if, and only if, s6 = 0 or, equivalently, if, and only if, q6 = 0. Thus, the
Kuka LWR 4+ only has one orientation singularity (the wrist singularity, as explained in
Remark 7).
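The simplification in Equation (89) can be confirmed numerically. In three dimensions, the coefficient of e1 ∧ e2 ∧ e3 in the outer product of three vectors equals their scalar triple product, so the sketch below evaluates that product for the three axis vectors appearing in Equation (89); the function names are illustrative.

```python
import math


def cross(a, b):
    """Cross product of two 3-vectors given as tuples."""
    return (a[1] * b[2] - a[2] * b[1],
            a[2] * b[0] - a[0] * b[2],
            a[0] * b[1] - a[1] * b[0])


def triple(a, b, c):
    """Scalar triple product a . (b x c): coefficient of e1^e2^e3 in a^b^c."""
    bc = cross(b, c)
    return a[0] * bc[0] + a[1] * bc[1] + a[2] * bc[2]


def wrist_wedge(q5, q6):
    """Coefficient of e1^e2^e3 in z5 ^ z6 ^ z7, with the vectors of Equation (89)."""
    s5, c5 = math.sin(q5), math.cos(q5)
    s6, c6 = math.sin(q6), math.cos(q6)
    z5 = (0.0, 1.0, 0.0)
    z6 = (-s5, 0.0, -c5)
    z7 = (c5 * s6, c6, -s5 * s6)
    return triple(z5, z6, z7)


generic = wrist_wedge(0.7, 0.3)       # expected to equal -sin(0.3)
singular = wrist_wedge(1.2, 0.0)      # vanishes when s6 = 0
```

The value depends only on the sixth joint variable, in agreement with the last expression of Equation (89).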
Finally, the distance function defined in Definition 5 can be applied to any of the
already obtained singular configurations. Let us consider, for instance, the position sin-
gularity q4 = 0. Then, the distance between an arbitrary configuration q ∈ C and this
singularity is given by the expression:
where qs denotes the singular configuration q4 = 0 and R4 is the rotor defining the pose of
the fourth joint of the Kuka LWR 4+.
In particular, $R_4$ can be found as explained in Section 2. Indeed, if $\{e_1, e_2, e_3\}$ denotes
the orthogonal basis defined by the world frame and $\{f_1, f_2, f_3\}$ (respectively, $\{f_1', f_2', f_3'\}$),
the orthogonal basis defined by the frame attached to the fourth joint under the effect of
configuration q (respectively, singular configuration $q_s$), then:
$$R_4(q) = \frac{1 + e^1 f_1 + e^2 f_2 + e^3 f_3}{\left| 1 + e^1 f_1 + e^2 f_2 + e^3 f_3 \right|}, \qquad
R_4(q_s) = \frac{1 + e^1 f_1' + e^2 f_2' + e^3 f_3'}{\left| 1 + e^1 f_1' + e^2 f_2' + e^3 f_3' \right|}, \tag{91}$$
where $\{e^1, e^2, e^3\}$ is the reciprocal frame of $\{e_1, e_2, e_3\}$. Since $\{e_1, e_2, e_3\}$ is also an
orthonormal set of vectors, such a reciprocal frame is:
$$e^1 = e_1, \qquad e^2 = e_2, \qquad e^3 = e_3. \tag{92}$$
$$R_4(q) = \frac{1 + e_1 f_1 + e_2 f_2 + e_3 f_3}{\left| 1 + e_1 f_1 + e_2 f_2 + e_3 f_3 \right|}, \qquad
R_4(q_s) = \frac{1 + e_1 f_1' + e_2 f_2' + e_3 f_3'}{\left| 1 + e_1 f_1' + e_2 f_2' + e_3 f_3' \right|}. \tag{93}$$
$$R_4(q) = \frac{a_1 + a_2\, e_1 \wedge e_2 + a_3\, e_1 \wedge e_3 + a_4\, e_2 \wedge e_3}{\sqrt{a_1^2 + a_2^2 + a_3^2 + a_4^2}}, \qquad
R_4(q_s) = \frac{b_1 + b_2\, e_1 \wedge e_2 + b_3\, e_1 \wedge e_3 + b_4\, e_2 \wedge e_3}{\sqrt{b_1^2 + b_2^2 + b_3^2 + b_4^2}}, \tag{94}$$
$$\begin{aligned}
a_1 &= c_2 s_3 + c_4 s_1 s_3 - c_4 c_3 c_1 s_2 + s_3 s_4 c_1 + s_4 s_1 s_2 c_3 + s_4 c_1 c_2 + c_2 c_4 s_1, \\
a_2 &= c_2 s_1 s_4 - c_4 c_1 s_3 - c_4 c_3 s_1 s_2 - c_1 c_2 c_4 + s_4 s_1 s_3 - s_4 s_2 c_1 c_3, \\
a_3 &= s_2 s_4 + c_2 c_3 c_4 + c_3 s_1 + c_1 s_2 s_3, \\
a_4 &= c_4 s_2 - c_2 c_3 s_4 - c_1 c_3 + s_1 s_2 s_3, \\
b_1 &= c_2 s_3 + s_1 s_3 - c_3 c_1 s_2 + c_2 s_1, \\
b_2 &= -c_1 s_3 - c_3 s_1 s_2 - c_1 c_2, \\
b_3 &= c_2 c_3 + c_3 s_1 + c_1 s_2 s_3, \\
b_4 &= s_2 - c_1 c_3 + s_1 s_2 s_3.
\end{aligned} \tag{95}$$
Therefore, by Proposition 4 and the decomposition used in the Proof of Lemma 1, the
distance of an arbitrary configuration q to the position singularity q4 = 0 is given by:
$$D(q, q_s) = \sqrt{(\bar{a}_1 - \bar{b}_1)^2 + (\bar{a}_2 - \bar{b}_2)^2 + (\bar{a}_3 - \bar{b}_3)^2 + (\bar{a}_4 - \bar{b}_4)^2}, \tag{96}$$
where
$$\bar{a}_i = \frac{a_i}{\sqrt{a_1^2 + a_2^2 + a_3^2 + a_4^2}} \quad \text{and} \quad \bar{b}_i = \frac{b_i}{\sqrt{b_1^2 + b_2^2 + b_3^2 + b_4^2}}. \tag{97}$$
6. Handling of Singularities
Once the set of singular configurations S has been identified, several methods can be
applied to handle the singularities. The detailed treatment of this topic is beyond the scope
of this work. However, in order to show the possibilities of the distance function proposed
in Section 4, we comment on three different situations, namely, motion planning, motion
control and bilateral teleoperation. In each one of these situations, the distance function
defined in Definition 5 plays an important role for handling the singularities.
where M (q) denotes the inertia matrix of the robot, C (q, q̇) denotes the matrix of
Coriolis and centrifugal forces and g(q) denotes the gravity vector.
• An output, i.e., the vector of torques τ, that is sent to the robot to perform the de-
sired motion;
• The robot executes the motion and updates the vectors q and q̇;
• The robot sends such updated vectors to the controller (also known as the feedback
of the system).
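[Figure: block diagram of the control scheme. Blocks: Controller (input qd, output τ) and Robot (output q, q̇, fed back to the controller). Inside the controller: test D(qd, qs) > D0; if yes, qd is passed to the dynamical model, which outputs τ; if no, qd = qd + D0 qd.]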
7. Conclusions
This paper proposes a novel singularity identification method for arbitrary serial robots based on the six-dimensional geometric algebra $G_6$. For non-redundant serial robots, we take the six unit twists $\xi_1, \dots, \xi_6$ associated with the joints and model them as vectors of $G_6$. Hence, the problem reduces to finding the configurations that cause the exterior product $\xi_1(q) \wedge \cdots \wedge \xi_6(q)$ to vanish since, as proven in Corollary 1, $\xi_1(q) \wedge \cdots \wedge \xi_6(q) = 0$ if, and only if, $q \in S$. Analogously, for a redundant robot with $n$ DoF, we consider the $C(n, 6)$ different combinations of six unit twists taken from $\{\xi_1, \dots, \xi_n\}$ and find the configurations that cause all the exterior products of the form $\xi_{j_1}(q) \wedge \cdots \wedge \xi_{j_6}(q)$, for $1 \le j \le C(n, 6)$, to vanish.
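The test described above can be sketched in code. This is an illustrative sketch only, not the authors' implementation: it assumes that each unit twist, already evaluated at a configuration q, is given as a tuple of six numbers, and it stores a multivector as a dictionary that maps sorted tuples of basis indices to coefficients. The function names and the tolerance `tol` are hypothetical.

```python
from itertools import combinations


def wedge_with_vector(blade, vector):
    """Exterior product of a multivector with a vector.

    `blade` maps sorted index tuples (basis blades) to coefficients.
    """
    result = {}
    for indices, coeff in blade.items():
        for k, v in enumerate(vector):
            if v == 0 or k in indices:
                continue  # e_k ^ e_k = 0, so repeated indices drop out
            # e_k is appended on the right; moving it to its sorted
            # position flips the sign once per larger index it passes.
            swaps = sum(1 for i in indices if i > k)
            key = tuple(sorted(indices + (k,)))
            sign = -1.0 if swaps % 2 else 1.0
            result[key] = result.get(key, 0.0) + sign * coeff * v
    return result


def wedge(vectors):
    """Exterior product v1 ^ v2 ^ ... of a sequence of vectors."""
    blade = {(): 1.0}
    for v in vectors:
        blade = wedge_with_vector(blade, v)
    return blade


def is_singular(twists, tol=1e-9):
    """True if every exterior product of six of the twists vanishes."""
    for subset in combinations(twists, 6):
        if any(abs(c) > tol for c in wedge(subset).values()):
            return False
    return True
```

For six twists, `combinations` yields a single subset, which is the non-redundant case; for n > 6 it enumerates the C(n, 6) subsets mentioned in the text.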
For serial robots with a spherical wrist, a simplification is possible. For these manipulators, the singularities are of two types: position singularities and orientation singularities. The former are identified as the configurations that cause the exterior products $s_{i_1}(q) \wedge s_{i_2}(q) \wedge s_{i_3}(q)$ to vanish for $1 \le i \le C(n-3, 3)$, where $s_{i_j}$ is the linear velocity component of the unit twist $\xi_{i_j}$ and is modelled as a vector of $G_3$, while the latter are identified as the configurations that cause the exterior product $z_{n-2}(q) \wedge z_{n-1}(q) \wedge z_n(q)$ to vanish, where $z_i$ is the $i$-th joint axis and, again, is modelled as a vector of $G_3$. Thus, the simplification consists of evaluating the exterior product of three vectors in $G_3$ instead of six vectors in $G_6$.
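To make the three-vector case concrete, the following standard identity may help. It is a reminder written in component notation introduced here only for illustration, with $a = a_1 e_1 + a_2 e_2 + a_3 e_3$ (and similarly for $b$ and $c$) in an orthonormal basis of $G_3$.

```latex
a \wedge b \wedge c
  = \det\begin{pmatrix} a_1 & b_1 & c_1 \\ a_2 & b_2 & c_2 \\ a_3 & b_3 & c_3 \end{pmatrix}
    e_1 \wedge e_2 \wedge e_3
  = \bigl(a \cdot (b \times c)\bigr)\, e_1 \wedge e_2 \wedge e_3
```

The product therefore has a single coefficient, and it vanishes exactly when the three vectors are linearly dependent.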
Once the singularities are identified, a distance function is defined in Definition 5 such that its restriction to the singular set $S$ is also a distance function, which allows us to check how far an arbitrary configuration $q$ is from a singularity. This distance function exploits the fact that between any two vectors $x, y \in G_n$, there always exists a rotor $R$ such that $y = R x \tilde{R}$.
The advantages of the strategy introduced in this work are clear. First, it is a computer-
friendly approach that avoids the computation of the determinant of an order 6 × n (for
non-redundant robots) or n × n (for redundant robots) matrix and the Jacobian matrix J. In
addition, the novel distance function defined in Definition 5 can be used to improve the
performance of current control schemes or motion planning algorithms, which, as seen in
the introduction, is still a hot research topic in robotics.
Author Contributions: Conceptualization, I.Z. and J.L.; methodology, I.Z.; investigation, I.Z. and
H.H.; writing—original draft preparation, I.Z. and H.H.; writing—review and editing, I.Z., H.H. and
J.L.; supervision, J.L. All authors have read and agreed to the published version of the manuscript.
Funding: This research received no external funding.
209
Mathematics 2022, 10, 2068
References
1. Gottlieb, D. Robots and Topology. In Proceedings of the IEEE International Conference on Robotics and Automation (ICRA), San
Francisco, CA, USA, 7–10 April 1986; pp. 1689–1691.
2. Hollerbach, J. Optimum kinematic design for a seven degree of freedom manipulator. In Robotics Research: The Second International
Symposium; Hanafusa, H., Inoue, H., Eds.; MIT Press: Cambridge, MA, USA, 1985; pp. 215–222.
3. Carmichael, M.; Khonasty, R.; Aldini, S.; Liu, D. Human Preferences in Using Damping to Manage Singularities During Physical
Human-Robot Collaboration. In Proceedings of the IEEE International Conference on Robotics and Automation (ICRA), Paris,
France, 31 May–31 August 2020; pp. 10184–10190.
4. Lopez-Franco, C.; Diaz, D.; Hernandez-Barragan, J.; Arana-Daniel, N.; Lopez-Franco, M. A Metaheuristic Optimization Approach
for Trajectory Tracking of Robot Manipulators. Mathematics 2022, 10, 1051. [CrossRef]
5. Thananjeyan, B.; Tanwani, A.; Ji, J.; Fer, D.; Patel, V.; Krishnan, S.; Goldberg, K. Optimizing Robot-Assisted Surgery Suture Plans
to Avoid Joint Limits and Singularities. In Proceedings of the International Symposium on Medical Robotics (ISMR), Atlanta, GA,
USA, 3–5 April 2019; pp. 1–7.
6. Dupac, M. Smooth trajectory generation for rotating extensible manipulators. Math. Methods Appl. Sci. 2018, 41, 2281–2286.
[CrossRef]
7. Wang, X.; Zhang, D.; Zhao, C.; Zhang, H.; Yan, H. Singularity analysis and treatment for a 7R 6-DOF painting robot with
non-spherical wrist. Mech. Mach. Theory 2018, 126, 92–107. [CrossRef]
8. Ratajczak, J.; Tchoń, K. Normal forms and singularities of non-holonomic robotic systems: A study of free-floating space robots.
Syst. Control. Lett. 2020, 138, 104661. [CrossRef]
9. Almarkhi, A.; Maciejewski, A. Singularity Analysis for Redundant Manipulators of Arbitrary Kinematic Structure. In Proceedings
of the 16th International Conference on Informatics in Control, Automation and Robotics (ICINCO), Prague, Czech Republic,
29–31 July 2019; pp. 42–49.
10. Sharifi, H.; Black, W. Identification Algorithm to Determine the Trajectory of Robots with Singularities. arXiv 2019,
arXiv:1911.06632.
11. Zhu, G.; Wei, S.; Zhang, Y.; Liao, Q. A Novel Geometric Modeling and Calculation Method for Forward Displacement Analysis of
6-3 Stewart Platforms. Mathematics 2021, 9, 442. [CrossRef]
12. Hadfield, H.; Wei, L.; Lasenby, J. The forward and inverse kinematics of a Delta Robot. In Advances in Computer Graphics;
Magnenat-Thalmann, N., Stephanidis, C., Wu, E., Thalmann, D., Sheng, B., Kim, J., Papagiannakis, G., Gavrilova, M., Eds.;
Springer International Publishing: Berlin/Heidelberg, Germany, 2020; pp. 447–458.
13. Thiruvengadam, S.; Tan, J.; Miller, K. A Generalised Quaternion and Clifford Algebra Based Mathematical Methodology to Effect
Multi-stage Reassembling Transformations in Parallel Robots. Adv. Appl. Clifford Algebr. 2021, 31, 39. [CrossRef]
14. Breuils, S.; Tachibana, K.; Hitzer, E. New Applications of Clifford’s Geometric Algebra. Adv. Appl. Clifford Algebr. 2022, 32, 1–17.
[CrossRef]
15. Hitzer, E.; Lavor, C.; Hildenbrand, D. Current survey of Clifford geometric algebra applications. Math. Methods Appl. Sci. 2022, in
press . [CrossRef]
16. Corrochano, E.; Sobczyk, G. Applications of Lie algebras and the algebra of incidence. In Geometric Algebra with Applications in
Science and Engineering; Corrochano, E., Sobczyk, G., Eds.; Birkhäuser Boston: Boston, MA, USA, 2001; pp. 252–277.
17. Kanaan, D.; Wenger, P.; Caro, S.; Chablat, D. Singularity analysis of lower mobility parallel manipulators using Grassmann–Cayley
algebra. IEEE Trans. Robot. 2009, 25, 995–1004. [CrossRef]
18. Tanev, T. Singularity analysis of a 4-DOF parallel manipulator using geometric algebra. In Advances in Robot Kinematics:
Mechanisms and Motion; Lennarčič, J., Roth, B., Eds.; Springer: Dordrecht, The Netherlands, 2006; pp. 275–284.
19. Chai, X.; Xiang, J. Mobility Analysis of Limited-Degrees-of-Freedom Parallel Mechanisms in the Framework of Geometric
Algebra. ASME J. Mech. Robot. 2016, 8, 41005.
20. Yao, H.; Chen, Q.; Chai, X.; Li, Q. Singularity analysis of 3-RPR parallel manipulators using geometric algebra. Adv. Appl. Clifford
Algebr. 2017, 27, 2097–2113. [CrossRef]
21. Chai, X.; Li, Q. Analytical mobility analysis of Bennett linkage using geometric algebra. Adv. Appl. Clifford Algebr. 2017,
27, 2083–2095. [CrossRef]
22. Ma, J.; Chen, Q.; Yao, H.; Chai, X.; Li, Q. Singularity analysis of the 3/6 Stewart parallel manipulator using geometric algebra.
Math. Methods Appl. Sci. 2018, 41, 2494–2506. [CrossRef]
23. Huo, X.; Sun, T.; Song, Y. A geometric algebra approach to determine motion/constraint, mobility and singularity of parallel
mechanism. Mech. Mach. Theory 2017, 116, 273–293. [CrossRef]
210
Mathematics 2022, 10, 2068
24. Chai, X.; Li, Q.; Ye, W. Mobility analysis of overconstrained parallel mechanism using Grassmann-Cayley algebra. Appl. Math.
Model. 2017, 51, 643–654. [CrossRef]
25. Yang, S.; Li, Y. Classification and analysis of constraint singularities for parallel mechanisms using differential manifolds. Appl.
Math. Model. 2020, 77, 469–477. [CrossRef]
26. Kim, J.; Jeong, J.; Park, J. Inverse kinematics and geometric singularity analysis of a 3-SPS/S redundant motion mechanism using
conformal geometric algebra. Mech. Mach. Theory 2015, 90, 23–36. [CrossRef]
27. Huo, L.; Baron, L. The joint-limits and singularity avoidance in robotic welding. Ind. Robot. 2008, 35, 456–464. [CrossRef]
28. Yahya, S.; Moghavvemi, M.; Mohamed, H. Singularity avoidance of a six degree of freedom three dimensional redundant planar
manipulator. Comput. Math. Appl. 2012, 64, 856–868. [CrossRef]
29. Siciliano, B.; Sciavicco, L.; Villani, L.; Oriolo, G. Robotics: Modelling, Planning and Control; Springer Publishing Company:
Berlin/Heidelberg, Germany, 2008.
30. Yao, H.; Li, Q.; Chen, Q.; Chai, X. Measuring the closeness to singularities of a planar parallel manipulator using geometric
algebra. Appl. Math. Model. 2018, 57, 192–205. [CrossRef]
31. Nawratil, G. Singularity distance for parallel manipulators of Stewart Gough type. In Advances in Mechanism and Machine Science;
Uhl, T., Ed.; Springer International Publishing: Berlin/Heidelberg, Germany, 2019; pp. 259–268.
32. Bu, W. Closeness to singularities of robotic manipulators measured by characteristic angles. Robotica 2016, 34, 2105–2115.
[CrossRef]
33. Clifford, W.; Smith, H.; Tucker, R. Mathematical Papers by William Kingdon Clifford-Edited; Macmillan: London, UK, 1882.
34. Grassmann, H. Ausdehnungslehre-Extension Theory (English Re-Edition); American Mathematical Society: Providence, RI, USA, 2000.
35. Hamilton, W.R. Elements of Quaternions; Longmans, Green: London, UK, 1866.
36. Doran, C.; Lasenby, A. Geometric Algebra for Physicists; Cambridge University Press: Cambridge, UK, 2003.
37. Dorst, L.; Fontijne, D.; Mann, S. Geometric Algebra for Computer Science: An Object-Oriented Approach to Geometry; Morgan Kaufmann
Publishers Inc.: Burlington, MA, USA, 2007.
38. Perwass, C. Geometric Algebra with Applications in Engineering; Springer Publishing Company, Incorporated: Berlin/Heidelberg,
Germany, 2009.
39. Hildenbrand, D. Foundations of Geometric Algebra Computing; Springer Publishing Company, Incorporated: Berlin/Heidelberg,
Germany, 2012.
40. Lavor, C.; Xambó-Descamps, S.; Zaplana, I. A Geometric Algebra Invitation to Space-Time Physics, Robotics and Molecular Geometry;
SRMA/Springerbriefs; Springer: Berlin/Heidelberg, Germany, 2018.
41. Murray, R.; Li, Z.; Shankar-Sastry, S. A Mathematical Introduction to Robotic Manipulation; CRC Press: Boca Raton, FL, USA, 1994.
42. Davidson, J.; Hunt, K. Robots and Screw Theory: Applications of Kinematics and Statics to Robotics; Oxford University Press: Oxford,
UK, 2004.
43. Chasles, M. Note sur les propriétés générales du système de deux corps semblables entr’eux. Bull. Sci. Math. Astron. Phys.
Chemiques 1830, 14, 321–326.
44. Tsai, L. Robot Analysis: The Mechanics of Serial and Parallel Manipulators; John Wiley and Sons: Hoboken, NJ, USA, 1999.
45. Sangwine, S.; Hitzer, E. Clifford Multivector Toolbox (for MATLAB). Adv. Appl. Clifford Algebr. 2017, 27, 539–558. [CrossRef]
46. Zaplana, I.; Claret, J.; Basanez, L. Kinematic analysis of redundant robotic manipulators: applications to Kuka LWR 4+ and ABB
Yumi. Rev. Iberoam. Autom. Inform. Ind. 2018, 15, 192–202. [CrossRef]
47. Basañez, L.; Suárez, R. Teleoperation. In Springer Handbook of Automation; Nof, S., Ed.; Springer: Berlin/Heidelberg, Germany,
2009; pp. 449–468.
211
mathematics
Article
Mathematical Analysis of a Low Cost Mechanical Ventilator
Respiratory Dynamics Enhanced by a Sensor Transducer (ST)
Based in Nanostructures of Anodic Aluminium Oxide (AAO)
Jesús Alan Calderón Chavarri 1,2, *, Carlos Gianpaul Rincón Ruiz 2 , Ana María Gómez Amador 3 ,
Bray Jesús Martin Agreda Cardenas 2 , Sebastián Calero Anaya 2 , John Hugo Lozano Jauregui 2,4 ,
Alexandr Toribio Hinostroza 4 and Juan José Jiménez de Cisneros y Fonfría 2
1 Angewandte Nanophysik, Institut Für Physik, Technische Universität Ilmenau, 98693 Ilmenau, Germany
2 Engineering Department, Pontificia Universidad Católica del Perú, Lima 15088, Peru;
rinconr.carlos@pucp.pe (C.G.R.R.); b.agreda@pucp.edu.pe (B.J.M.A.C.); sebastian.calero@pucp.pe (S.C.A.);
john.lozanoj@pucp.edu.pe (J.H.L.J.); juanjose.cisneros@pucp.pe (J.J.J.d.C.y.F.)
3 Department of Mechanical Engineering, Universidad Carlos III de Madrid, 28911 Madrid, Spain;
amgomez@ing.uc3m.es
4 Mechatronic Department, Northern (Arctic) Federal University named after M.V. Lomonosov,
Arkhangelsk 163002, Russia; alexandrtoribio@mail.ru
* Correspondence: alan.calderon@pucp.edu.pe
Abstract: Mechanical ventilation systems require a device for measuring the air flow provided to a patient in order to monitor and ensure the correct quantity of air supplied to the patient; this device is the air flow sensor. At the beginning of the COVID-19 pandemic, flow sensors were not available in Peru because of the international supply shortage. In this context, a novel air flow sensor based on an orifice plate and an intelligent transducer was developed to form an integrated device. The proposed design was focused on simple manufacturing requirements for mass production in a developing country. CAD and CAE techniques were used in the design stage, and a mathematical model of the device was proposed and calibrated experimentally for the measured data transduction. The device was tested in its real working conditions and was therefore implemented in a breathing circuit connected to a low-cost mechanical ventilation system. Results indicate that the designed air flow sensor/transducer is a low-cost complete medical device for mechanical ventilators that is able to provide all the ventilation parameters by an equivalent electrical signal to directly display the following factors: air flow, pressure and volume over time. The evaluation of the designed sensor transducer was performed according to sundry transducer parameters such as geometrical parameters, material parameters and adaptive coefficients in the main transduction algorithm; in effect, the variety of the described results was achieved thanks to the faster response time and robustness provided by transducers of nanostructures based on Anodic Aluminum Oxide (AAO), which enhanced the designed sensor/transducer (ST) during operation in intricate geographic places, such as the Andes mountains of Peru.
Keywords: air flow medical sensor; emergency air flow sensor; low-cost air flow sensor; nanostructures; COVID-19
MSC: 65K10
Citation: Chavarri, J.A.C.; Ruiz, C.G.R.; Gómez Amador, A.M.; Cardenas, B.J.M.A.; Anaya, S.C.; Lozano Jauregui, J.H.; Hinostroza, A.T.; Jiménez de Cisneros y Fonfría, J.J. Mathematical Analysis of a Low Cost Mechanical Ventilator Respiratory Dynamics Enhanced by a Sensor Transducer (ST) Based in Nanostructures of Anodic Aluminium Oxide (AAO). Mathematics 2022, 10, 2403. https://doi.org/10.3390/math10142403
Academic Editor: James M. Buick
Received: 8 April 2022
Accepted: 19 June 2022
Published: 8 July 2022
Publisher’s Note: MDPI stays neutral with regard to jurisdictional claims in published maps and institutional affiliations.
time [1]. In this sense, a flow sensor is a device used for air flow measurement, and its
working concept is commonly based on an induced pressure drop across an orifice which
can be measured and related to the air flow value in a breathing circuit over time [2].
Design considerations for air flow measurement in industrial applications are well
established in the steady-state condition [3]; nevertheless, the air flow provided by the
developed low-cost mechanical ventilator has a particular periodic waveform in a transient
condition. Typical medical air flow sensors measure the pressure drop using static pressure
taps connected to a transducer. However, this carries a risk of leaking contaminated air
from the patient, which is hazardous in the current pandemic context. To counter
this, modern medical air flow sensors are connected to electrical transducers that generate
electrical values analogous to the static pressure values [4]. Connecting the sensor to
an electrical transducer is also important because the electrical equivalents of the measured
variables can be stored in the main control system of the ventilator. In addition, the risk of
spreading any kind of virus is reduced, something that cannot be achieved with typical
airflow connectors. Furthermore, air flow sensors cannot provide air pressure and volume;
these and other parameters that require monitoring are measured using an additional sensor
and calculated at the processing stage.
In this study, a novel air flow sensor/transducer was proposed as an integrated design
that is able to measure the air flow in a breathing circuit in the time domain and estimate
the air pressure and volume parameters with the mathematical model developed and
calibrated experimentally for a specific low-cost mechanical ventilator based on cams [1].
This proposed medical device directly provides the equivalent electrical signals of air
pressure, volume and flow after a transduction stage required due to the nonlinearity of
the air flow and pressure drop [5].
During the COVID-19 pandemic, many low-cost mechanical ventilation systems based
on the compression of an airbag were developed [6], but they needed to measure
the air flow rate delivered to the patient [7]. However, air flow sensors were not easy to
source in Peruvian markets due to the international supply shortage during this period
of the pandemic. Moreover, the design and fabrication for this medical application require
certain considerations, such as those proposed in this research.
controller for data treatment in order to determine and display the pressure, volume and
flow over time using an electronic screen [10].
[Figure: methodology flowchart. Blocks under "Method": sensor transducer design; CFD analysis; computational simulation and experimental validation. Blocks under "Experimental study and design improvement": prototype and experimental study; experimental characterization (steady state, dynamic response, response to a mechanical ventilator); model calibration.]
Figure 4 shows the compact air flow sensor developed for the integrated design, which
is composed of an upstream and downstream pipe welded to an orifice plate. Dimensions
are established based on the analytic and experimental results, which guarantee the pressure
difference measurement over time for this particular medical application with a mechanical
ventilation system OxygenIP.PE, whose nominal working conditions correspond to a
transient flow [11].
Figure 4. Compact air flow sensor developed for the air flow sensor/transducer.
To obtain the pressure, airflow and volume equations, the Stokes equations were
analyzed using Equation (1), in which $f$ is the function that results from the airflow,
and $\mu$ and $D$ are the average velocity and the dispersion coefficient, respectively [13,14].
$$\frac{\partial f}{\partial t} + \mu \frac{\partial f}{\partial x} = D \frac{\partial^2 f}{\partial x^2} \tag{1}$$
This expression can be reduced to the Bernoulli model, and with the analysis of the
energy balance, Equation (2) can be obtained [13], in which $\rho$ is the fluid density, $p_1$
and $p_2$ are the initial and final reference pressures along the fluid flow path, and $v_1$ and
$v_2$ are the corresponding fluid flow speeds.
$$\frac{1}{2} \rho v_1^2 + p_1 = \frac{1}{2} \rho v_2^2 + p_2 \tag{2}$$
By conducting a static pressure analysis, it is possible to obtain Equation (3), where $\Phi$ is
the volumetric flow rate [13], $A_1$ and $A_2$ are the cross-sectional areas, and $p_1$ and $p_2$ are the
initial and final static pressures, whose difference is the pressure difference $\Delta P$.
$$\Phi = A_2 \sqrt{\frac{1}{1 - \left(\frac{A_2}{A_1}\right)^2} \, \frac{2 \Delta P}{\rho}} \tag{3}$$
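The step from Equation (2) to Equation (3) can be made explicit as follows. This short derivation is a sketch that assumes incompressible flow, so that the continuity relation $A_1 v_1 = A_2 v_2 = \Phi$ holds; continuity is not written out in the text and is introduced here as an assumption, with $\Delta P = p_1 - p_2$.

```latex
\begin{aligned}
\tfrac{1}{2}\rho v_2^2 - \tfrac{1}{2}\rho v_1^2 &= p_1 - p_2 = \Delta P, \qquad v_1 = \frac{A_2}{A_1}\, v_2 \\
\tfrac{1}{2}\rho v_2^2 \left[1 - \left(\frac{A_2}{A_1}\right)^2\right] &= \Delta P \\
v_2 &= \sqrt{\frac{2\,\Delta P}{\rho\left[1 - \left(A_2/A_1\right)^2\right]}} \\
\Phi = A_2 v_2 &= A_2 \sqrt{\frac{1}{1 - \left(A_2/A_1\right)^2}\,\frac{2\,\Delta P}{\rho}}
\end{aligned}
```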
From here, introducing the discharge coefficient $C_d$ gives Equation (4) [13].
$$\Phi = C_d A_2 \sqrt{\frac{1}{1 - \left(\frac{A_2}{A_1}\right)^2} \, \frac{2 \Delta P}{\rho}} \tag{4}$$
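As a numerical illustration of the orifice relation with a discharge coefficient, the sketch below evaluates the flow rate for a given pressure difference. It assumes SI units and that the fluid density ρ of Equation (2) appears under the square root. The pipe diameter of 22 mm, the air density of 1.2 kg/m³ and the discharge coefficient of 0.6 are hypothetical illustration values, not data from this study, and the function names are also hypothetical.

```python
import math


def circle_area(diameter):
    """Cross-sectional area of a circular section (diameter in metres)."""
    return math.pi * diameter ** 2 / 4.0


def orifice_flow(delta_p, area_pipe, area_orifice, density,
                 discharge_coefficient=1.0):
    """Volumetric flow rate (m^3/s) through an orifice.

    delta_p: pressure difference p1 - p2 in Pa (must be non-negative).
    area_pipe, area_orifice: A1 and A2 in m^2, with A2 < A1.
    density: fluid density in kg/m^3.
    """
    if delta_p < 0.0:
        raise ValueError("delta_p must be non-negative")
    if not 0.0 < area_orifice < area_pipe:
        raise ValueError("expected 0 < area_orifice < area_pipe")
    ratio = area_orifice / area_pipe
    velocity = math.sqrt(2.0 * delta_p / (density * (1.0 - ratio ** 2)))
    return discharge_coefficient * area_orifice * velocity


# Hypothetical example: 22 mm pipe, 10 mm orifice, air at 1.2 kg/m^3.
A1 = circle_area(0.022)
A2 = circle_area(0.010)
flow_m3_s = orifice_flow(100.0, A1, A2, density=1.2, discharge_coefficient=0.6)
flow_l_min = flow_m3_s * 60000.0  # 1 m^3/s = 60000 L/min
```

With a discharge coefficient of 1, the function reproduces the ideal Bernoulli result, so feeding the returned flow back into Equation (2) recovers the original pressure difference.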
Finally, a theoretical model can be derived from the expected breathing curve, as shown
by Equation (5) [13], in which $R_i$ contains the geometrical and material information of the fluid
flow sensor. Equation (5) relates the volumetric flow rate $\Phi$ to the
pressure difference $\Delta P$, and $\tau$ is the response time.
$$\Phi = \frac{\Delta P}{R_i} \left(1 - e^{-t/\tau}\right) \tag{5}$$
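The first-order model of Equation (5) can be evaluated directly. The sketch below assumes a decaying exponential, exp(-t/τ), so that the flow settles at ΔP/R_i for large t; the function name and the example numbers used in the comments are hypothetical.

```python
import math


def flow_response(t, delta_p, resistance, tau):
    """Flow predicted by the first-order model at time t.

    delta_p: pressure difference (constant step input).
    resistance: the constant R_i of the sensor.
    tau: response time of the sensor/transducer (same unit as t).
    """
    if resistance <= 0.0 or tau <= 0.0:
        raise ValueError("resistance and tau must be positive")
    return (delta_p / resistance) * (1.0 - math.exp(-t / tau))


# At t = tau the response reaches 1 - 1/e (about 63 %) of its final value,
# and for t much larger than tau it approaches delta_p / resistance.
```

This is the usual way to read the response time from a measured step response: τ is the time at which about 63 % of the steady-state flow is reached.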
$$\frac{dx(t)}{dt} = f(x(t), u(t), \theta) \tag{6}$$
$$y(t) = h(x(t), u(t), \theta) \tag{7}$$
Equations (6) and (7) above give the mathematical model of the system (sensor/transducer).
It is also necessary to find the optimal signal (input variable) that provides a predicted
response. Hence, Equation (8) depends on the set-point trajectory $R_s$, where $r(k_i)$ is the
set point at sample time $k_i$.
The cost function $J$ of Equation (9) depends on the matrices $R_s$ and $\Delta U$. The optimal predicted solution $\Delta U$ for a desired value in the system is the one that minimizes $J$, which weighs the deviation of the predicted response $Y$ from the set-point trajectory $R_s$ against the size of $\Delta U$. The procedure to calculate $\Delta U$ is given by Equations (10)–(14); furthermore, $R$ is a diagonal matrix used to tune the desired optimal response.
$$J = (R_s - Y)^T (R_s - Y) + \Delta U^T R \, \Delta U \tag{9}$$
On the other hand, the optimal predicted response $Y$ depends on the matrices $F$ and $\phi$ and on the state vector $X(k_i)$, as shown by Equation (10).
$$Y = F X(k_i) + \phi \, \Delta U \tag{10}$$
Moreover, the optimal estimation was analyzed using the cost function $J$, which takes the form of Equation (11) after substituting Equation (10) into Equation (9).
$$J = (R_s - F X(k_i))^T (R_s - F X(k_i)) - 2 \Delta U^T \phi^T (R_s - F X(k_i)) + \Delta U^T \left(\phi^T \phi + R\right) \Delta U \tag{11}$$
To find the minimal error, Equation (11) was differentiated with respect to $\Delta U$, which gives Equation (12).
$$\frac{\partial J}{\partial \Delta U} = -2 \phi^T (R_s - F X(k_i)) + 2 \left(\phi^T \phi + R\right) \Delta U \tag{12}$$
Therefore, the minimal value of $J$ is reached when the condition of Equation (13) holds.
$$\frac{\partial J}{\partial \Delta U} = 0 \tag{13}$$
This means that the optimal $\Delta U$ is given by Equation (14), which helps to obtain the optimal and predicted $Y$.
$$\Delta U = \left(\phi^T \phi + R\right)^{-1} \phi^T (R_s - F X(k_i)) \tag{14}$$
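The closed-form solution of Equation (14) can be computed by solving a small linear system rather than forming the inverse explicitly. The following dependency-free sketch illustrates this; it is not the authors' transduction algorithm. Matrices are plain lists of rows, the matrices F, ϕ (`Phi`) and R are assumed to be supplied by the user, and every function name is hypothetical.

```python
def transpose(m):
    """Transpose of a matrix stored as a list of rows."""
    return [list(row) for row in zip(*m)]


def matmul(a, b):
    """Matrix product a @ b for matrices stored as lists of rows."""
    return [[sum(x * y for x, y in zip(row, col)) for col in zip(*b)]
            for row in a]


def solve(a, b):
    """Solve a x = b by Gaussian elimination with partial pivoting."""
    n = len(a)
    aug = [[float(v) for v in a[i]] + [float(b[i])] for i in range(n)]
    for col in range(n):
        pivot = max(range(col, n), key=lambda r: abs(aug[r][col]))
        if abs(aug[pivot][col]) < 1e-12:
            raise ValueError("matrix is singular to working precision")
        aug[col], aug[pivot] = aug[pivot], aug[col]
        for r in range(col + 1, n):
            factor = aug[r][col] / aug[col][col]
            for c in range(col, n + 1):
                aug[r][c] -= factor * aug[col][c]
    x = [0.0] * n
    for i in reversed(range(n)):
        tail = sum(aug[i][j] * x[j] for j in range(i + 1, n))
        x[i] = (aug[i][n] - tail) / aug[i][i]
    return x


def optimal_delta_u(F, Phi, R, Rs, x):
    """Delta U = (Phi^T Phi + R)^-1 Phi^T (Rs - F x), as in Equation (14)."""
    Fx = [sum(f * xi for f, xi in zip(row, x)) for row in F]
    error = [r - fx for r, fx in zip(Rs, Fx)]          # Rs - F X(k_i)
    Phi_t = transpose(Phi)
    hessian = [[h + w for h, w in zip(h_row, r_row)]   # Phi^T Phi + R
               for h_row, r_row in zip(matmul(Phi_t, Phi), R)]
    rhs = [sum(p * e for p, e in zip(row, error)) for row in Phi_t]
    return solve(hessian, rhs)
```

For a scalar problem with F = ϕ = 1, set point 1 and state 0.25, the function returns 0.75 when R = 0 and 0.375 when R = 1, which shows how a larger R damps the control increment.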
On the other hand, the matrix $\phi$ carries information about the sensor material, namely
the thin film that is subject to the air pressure. The geometry (nanostructures) of the
thin film can also be stored in this matrix. Therefore, Equation (15)
correlates the optimal predictive solution of the transduction with
the adaptive coefficients of the matrix, which retain information on the material and
geometrical characteristics of the designed sensor.
Furthermore, the periodic responses of the sensor/transducer are described by Equation (15) through a transfer function analysis, which makes it possible to identify the first-order behavior in the steady state. Here, $C_s$ is the first-order response variable, $N_s$ is the input excitation signal, $t$ is the response time and $K$ is the proportional gain.
$$\frac{C_s}{N_s} = \frac{K}{t s + 1} \tag{15}$$
In the time domain, Equation (15) reduces to Equation (16), which corresponds to the model of Equation (5) in the steady state. In this context, the excitation signal is $\Delta P$, the response variable is the fluid flow $\Phi$, and $R$ is the constant derived from the first-order solution obtained by substituting Equation (14) into Equation (10). Hence, it was possible to correlate stability parameters with the geometrical and material properties of the designed sensor.
$$\Phi = \frac{\Delta P}{R} \tag{16}$$
For this reason, the transducer converts the signal received from the sensor, according to the measured value of the physical variable, into an equivalent signal (for example, the pressure difference is transduced into its electrical equivalent, that is, from “cmH2O” to “mV”). In this context, Figure 9 represents the static curve given by the section “A, B, C” in the coordinate system “X1, Y1”. The static curve helps to identify the linear response regions and, therefore, to choose between working in linear regions and using non-linear mathematical models. It should be noted that, depending on the response time of the system (the system is considered as a research plant), information can be lost in the linearization process because of the non-linear behavior of the system; nevertheless, many authors suggest working in linear zones despite this risk. In addition, information may also be lost when the signal is discretized, depending on the response time of the system and the sampling time (an important parameter when discretizing a signal).
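The interplay between sampling time and response time can be illustrated with the first-order model of Equation (15). The sketch below uses the standard zero-order-hold discretization, which assumes that the input is held constant between samples; this choice, the function names and the example values are assumptions made here for illustration and are not taken from the study.

```python
import math


def discretize_first_order(gain, time_constant, sample_time):
    """Coefficients (a, b) of y[k+1] = a * y[k] + b * u[k].

    Exact discretization of K / (t s + 1) for a piecewise-constant input.
    """
    if time_constant <= 0.0 or sample_time <= 0.0:
        raise ValueError("time_constant and sample_time must be positive")
    a = math.exp(-sample_time / time_constant)
    return a, gain * (1.0 - a)


def simulate(gain, time_constant, sample_time, inputs, y0=0.0):
    """Response samples y[1], y[2], ... to the input samples u[0], u[1], ..."""
    a, b = discretize_first_order(gain, time_constant, sample_time)
    y = y0
    outputs = []
    for u in inputs:
        y = a * y + b * u
        outputs.append(y)
    return outputs


# Hypothetical example: gain 2, time constant 0.5 s, sampled every 0.1 s.
step_response = simulate(2.0, 0.5, 0.1, [1.0] * 10)
```

When the sampling time is much larger than the time constant, the coefficient a is close to zero and the transient is no longer visible in the samples, which is one way the information loss mentioned above shows up in practice.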
Based on the conditions described in the previous paragraph, a transducer is necessary to transform the pressure difference into its electrical equivalent (in mV) in order to obtain the value of the air flow and also the displaced air volume. Theoretically, Navier–Stokes models provide the relationship between the three physical variables (pressure, flow and volume), although the numerical solutions of the differential equations are also prone to loss of information, depending on the numerical approximation methods used. However, the disadvantage of a completely theoretical correlation is that disturbances are frequently not modelled, which is solved by either approximation modelling or by correlation with experimental data.
Therefore, the transients of the plant’s dynamic system (while it is first-order) for the
sections represented by d0, d1; d2, d3; d4, d…, which have similar slopes only in the linear
trends “AB” or “BC”, are located in the coordinates “X2, Y2”. In such sections, the response
times of the plant (the designed sensor/transducer) can be obtained.
Moreover, for each section d1, d2; d3, d4; and so on, the characteristics
of overshoot, settling time, damping and the other parameters that indicate the stability of the
designed system can be maintained.
The static and dynamic curves are interpreted algebraically as models of differential
equations (polynomial models). Consequently, the correlation between the theoretical
modelling and the modelling based on the experimental data analysis yields the final
model of the designed sensor/transducer system, which implies a response from the Experimental
Model (EM) and the Theoretical Model (TM), obtained from linear (linearized) or completely
non-linear (NLM) models.
Indeed, the sensor/transducer system, denoted “ST”, receives a mechanical signal “Ms1” (a fluid mechanical signal, in this case the pressure difference) and transforms it into the electrical equivalent “Es” or into a response signal “Ms2” (a fluid mechanics variable, such as the volume or airflow equivalent signal). This conversion (transduction) requires characterization and calibration steps, which are summarized in the scheme of Figure 10. From the analysis of the static measurement data, it is possible to recognize the linear and nonlinear operations in the ST system. This information is important for the design of the algorithm that provides the final information of the ST measurement, namely the breathing variables of pressure, volume and airflow.
3. Experimental Section
3.1. Orifice Plate Diameter Study
The working concept of the orifice plate is to generate a pressure difference that
is related to the flow passing through it [1]. In this sense, different diameters could be
considered; however, an optimum range of diameters balances a pressure difference that is
high enough to be used in the transduction stage against one that is
low enough not to produce a significant air flow or pressure drop in the breathing
circuit.
Figure 12 (subfigures a–c) shows the variable orifice-plate flow sensor, which is
made of two pipes joined together with a ferrule clamp that acts as a quick-release
mechanism. Downstream of the ferrule clamp, an orifice plate is welded to the flange. In
this study, five parts were fabricated with different diameters: 4, 6, 8, 10 and 12 mm. The distances of
the pressure taps to the orifice plate were established according to the design considerations
for an orifice plate air flow sensor [3]. However, the distance of the downstream pressure
tap could not be made less than half the radius because of welding limitations imposed by the geometry.
Figure 13. Schematic representation of experimental setup for steady state air flow.
Figure 14. Scheme of experimental setup for the sensor transducer connected to a Mechanical
Ventilator.
Figure 15. Setup of the sensor/transducer device connected to OxygenIP.PE Mechanical ventilator.
4. Results
4.1. CFD Simulation Results
4.1.1. Steady State Results
The following results were obtained from the steady-state simulations for a 100 L/min
airflow through a 10 mm diameter orifice. Figure 16a shows how the velocity increased as the
area was reduced (orifice throat), which follows from the conservation of
the flow rate (mass conservation under the incompressibility assumption). On the other hand,
Figure 16b shows how the static pressure decreased in response to the
increase in velocity, which follows from the conservation of momentum.
It must also be noted that the behavior of these contours matched the results presented by
Karthik [19].
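The velocity increase at the throat can be checked with a back-of-the-envelope continuity calculation. The sketch below computes the mean velocity for a given volumetric flow and diameter, assuming incompressible flow and a uniform velocity profile; it is an order-of-magnitude check and does not reproduce the CFD results. The function name is hypothetical.

```python
import math


def mean_velocity(flow_l_per_min, diameter_mm):
    """Mean velocity (m/s) of a flow through a circular section."""
    flow_m3_per_s = flow_l_per_min / 60000.0  # 1 L/min = 1e-3 m^3 per 60 s
    area_m2 = math.pi * (diameter_mm / 1000.0) ** 2 / 4.0
    return flow_m3_per_s / area_m2


# Mean throat velocity for the simulated case: 100 L/min through 10 mm.
throat_velocity = mean_velocity(100.0, 10.0)
```

For 100 L/min through a 10 mm orifice this gives a mean velocity of roughly 21 m/s, and halving the diameter at the same flow rate multiplies the mean velocity by four.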
since Funk et al. [20] concluded in their work that steady state approximations can be used
in transient situations with excellent or valid results for most engineering applications.
Figure 16. (a) Velocity Contours for a 100 L/min airflow, 10 mm diameter orifice; (b) Static pressure
contours for a 100 L/min airflow, 10 mm diameter orifice.
Figure 17. (a) Airflow profile and DP profile over time; (b) Steady vs. transient state results for a
10 mm diameter orifice.
Figure 18. Flow and Flow ST for diameter 4 mm (Inlet pressure: 5, 10, 20 mBar).
The pressure difference curves (in mV) are plotted together in Figure 19. The “ST” curve
provides information related to the pressure measured by the sensor/transducer in mV.
Figure 19. Pressure measured by ST in mV, for diameter 4 mm (Inlet pressure: 5, 10, 20 mBar).
The pressure difference curves are plotted together next in Figure 20. Each curve
provides information of the pressure measured by the sensor/transducer in Pascal.
Figure 20. Pressure measured by ST (Pa), for diameter 4 mm (Inlet pressure: 5, 10, 20 mBar).
The tests were run for the 8 mm diameter sensor/transducer and a steady-state inlet
pressure of 10 mBar, 20 mBar and 30 mBar was provided by an air compressor, for which
the plotted airflow curves are shown in Figure 21. The “Fluke” curve provides information
on the airflow measured by the “Fluke” analyzer, and the “ST” curve shows the airflow as
the response of the designed sensor/transducer ST.
Figure 21. Flow and Flow ST for diameter 8 mm (Inlet pressure: 5, 10, 30 mBar).
The transduced pressure difference curves in mV are shown in Figure 22. Each curve
provides information on the pressure measured by the sensor/transducer in mV.
Figure 22. Pressure measured by ST (mV), for diameter 8 mm (Inlet pressure: 5, 10, 20 mBar).
On the other hand, the pressure difference curves are shown in Figure 23. Each curve
provides information on the pressure measured by the sensor/transducer in Pascal.
Figure 23. Pressure measured by ST (Pa), for diameter 8 mm (Inlet pressure: 5, 10, 20 mBar).
Figure 24. Airflow comparison in open loop and closed loop for the ST evaluation.
Figure 25 shows the pressure difference measured by the designed ST for a closed
loop (upper subfigure) and open loop (lower subfigure). The ST system can detect changes
or tendencies to maintain a steady state as a consequence of an open or closed loop of the
tester prototype; therefore, it was possible to measure this pressure difference with the
designed ST.
Figure 25. Pressure difference comparison in open loop and closed loop for the ST evaluation.
provide a faster response in the presence of transient changes (red curves for airflow,
pressure difference and volume).
This can be explained by the geometrical characteristics of the ST and its nonlinear
behavior for diameters from 4 mm to 6 mm; however, the ST system can provide a better
response in the presence of disturbances or transient changes when the orifice diameter is
8, 10 or 12 mm. The reference corresponds to the measurements of the Fluke
equipment (blue curve for airflow, pressure difference and volume).
For the 8 mm diameter sensor/transducer and the mechanical ventilator excitation
signals, the ventilation curves are shown in Figure 27. In every subfigure, the curves
from the medical gas flow meter Fluke VT 650 are shown in blue for low-frequency operation
and in red for high-frequency operation, and the ST curves are shown in green for
low-frequency operation and in yellow for high-frequency operation; subfigures I–III
give the volume, pressure and airflow. During the 2 min of measurement, the ST
demonstrated a fast and robust sensor
behavior (for a diameter of 8 mm) under the dynamical evaluation made with the
low-cost cam-based mechanical ventilator OxygenIP.PE.
Figure 27. The dynamical behavior of the sensor/transducer connected to the low-cost
cam-based mechanical ventilator OxygenIP.PE.
Figure 28 shows the airflow curves for the 8 mm diameter sensor/transducer and
the mechanical ventilator excitation signals. In every subfigure, the Fluke curves (blue
color) provide information on the airflow measured by the medical gas flow meter Fluke
VT 650 and the ST curves (red color) represent the airflow measured by the designed
sensor/transducer.
Figure 28. The dynamical behavior of the sensor/transducer connected to an air compressor in
the airflow working range between 75 L/min and 105 L/min.
During the 1 min of measurement, the ST showed a fast and robust sensor behavior
under the dynamical evaluation made using an air compressor in the working range
from 75 L/min to 105 L/min.
5. Discussion
5.1. Data Interpretation Analysis
In order to interpret the experimental data with the designed algorithm, it was necessary
to establish a correlation with the theoretical model to enhance the adaptive parameters
of the polynomial model analyzed in Section 2.5.
Therefore, the experimental analysis of the designed polynomial model was correlated
with Equation (18) in the theoretical model, as this equation gives the pressure difference “ΔP”
as a function of the airflow “Φ”, the geometrical parameters “A2, A1” and the fluid density “ρ”.
$$\Delta P = \left(\frac{\Phi}{A_2}\right)^{2} \frac{\rho}{2}\left[1-\left(\frac{A_2}{A_1}\right)^{2}\right] \qquad (17)$$
The theoretical model yields only a quadratic relationship between the airflow and
the pressure difference; as “A2” decreases, the pressure difference and the flow velocity
through the orifice tend to increase.
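The quadratic dependence in Equation (17) can be illustrated with a minimal sketch. The air density (1.2 kg/m³) and the 22 mm pipe diameter are assumptions chosen only for illustration, and the function name is hypothetical; no discharge coefficient or other correction from the experimental calibration is included.

```python
import math


def orifice_pressure_drop(flow_l_min, orifice_d_mm, pipe_d_mm, rho=1.2):
    """Pressure difference in Pa from Equation (17).

    dP = (Phi / A2)^2 * (rho / 2) * (1 - (A2 / A1)^2)
    rho and pipe_d_mm are illustrative assumptions, not values from the paper.
    """
    phi = flow_l_min / 1000.0 / 60.0                     # L/min -> m^3/s
    a1 = math.pi * (pipe_d_mm / 1000.0) ** 2 / 4.0       # pipe cross-section, m^2
    a2 = math.pi * (orifice_d_mm / 1000.0) ** 2 / 4.0    # orifice cross-section, m^2
    return (phi / a2) ** 2 * (rho / 2.0) * (1.0 - (a2 / a1) ** 2)


if __name__ == "__main__":
    for diameter in (4, 8, 10):
        dp = orifice_pressure_drop(100.0, diameter, 22.0)
        print(f"{diameter} mm orifice at 100 L/min: {dp:.1f} Pa")
```

Doubling the flow multiplies the computed pressure difference by four, and reducing the orifice diameter at a fixed flow increases it, which is the behavior described above.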
Since the airflow for the breathing analysis had an “increasing and decreasing behav-
ior”, to obtain an appropriate measurement with the designed ST, the static and dynamic
behavior of the sensor was studied using an adaptive analysis of the polynomial model.
Therefore, the static behavior was provided as a dependence of “airflow, pressure
difference, and diameter”. Figure 29 depicts a flow and two points crossing its axis, for
which P1, V1, Y1 and P2, V2, Y2 are the pressure, flow speed, and position at points 1 and
2, respectively.
The static curve gives the relation between two variables in order to identify the linear
and nonlinear ranges of work; therefore, it is possible to understand its dynamic behavior.
The dynamical analysis of the airflow attempts to obtain the linear response of the
system so that the ST can make estimations of the physical variables volume and pressure
as a consequence of the pressure difference, which was achieved by the correlation between
the experimental and theoretical analyses described in previous sections. Therefore, the
curves “A, B, and C” represent the airflow curves that were expected to be achieved in the
characterization of the interpretation data with the ST, as depicted in Figure 30.
Figure 30. Schematic representation of the airflow dynamic and static behavior.
Two types of studies were performed, one for the static responses of the ST system,
and another for the dynamic ones. The first was conducted using a tester prototype with a
compressor as an air source, which is depicted in Figure 31 and was used for the static and
dynamic tests. The pressure difference measured by the sensor was calibrated in order to
provide the same response as the air flow meter Fluke. Finally, the other group of static
and dynamic tests was performed with the mechanical ventilator as the air source. Hence,
the pressure difference registered provided information to understand the dynamics of the
breathing variables (pressure, volume and airflow) in order to understand the behavior of
the sensor/transducer as part of the ventilation circuit.
The designed ST depends on the diameter and pressure difference; for this reason, it
was necessary to choose which transducer to use to obtain the electrical equivalent of the
pressure difference. Finally, the evaluated alternatives were as follows: transducer MAS1,
the integrated circuit MDPDX10P, the transducer constructed with two small microphones
sourced from the ARDUINO company, and a transducer based on nanostructures. Different
alternatives are shown in Figure 31.
The pressure difference measured by the designed ST was central to obtaining the
airflow after a transduction process. With the MDPX and MAS1, the air pressure difference
signal was converted into its electrical equivalent, an airflow signal in volts.
With the two microphones, the electrical transduction was performed by measuring decibels,
correlating them with pressure and converting the response to volts. The electrical equivalence
values help to analyze the statistics of the physical variables “air pressure, volume and
airflow” through the designed polynomial correlation algorithms between the
transduction signals (depicted as “Algor Ans1”, “Algor Ans2”, and “Algor Ans3” in Figure 31).
However, in order to enhance the robustness and response time of
the designed ST, the samples that receive the pressure difference were replaced by samples
based on nanostructures of AAO, depicted as S1 and S2 in Figure 31, which also send the
transduction information through the algorithm depicted as “Algor Ans4”.
The best estimation of the breathing physical variables “airflow, pressure, and volume”
was chosen by comparing the pressure difference transduction techniques studied in
the previous sections. The microphone-based transducer obtained estimated variables
with a maximum error of 1% in a steady state, while the MAS1-based transducer estimated
variables with a 0.95% error in a steady state. In turn, the MDPX-based transducer presented
a steady-state error of around 2%. However, the microphone-based transducer
was around 70% cheaper than the most expensive transducer, the MAS1. For this reason,
the microphone-based transducer was selected as the main transducer for the designed ST.
When performing the main analysis of the transducer algorithm, we sought to identify
the adaptive coefficients from the experimental data obtained by the transducer and the
polynomial identification estimate using pressure difference “DP”, the pressure “P”, the
airflow “F”, and the volume “V”. If the adaptive estimate tracked the
reference variable with an error of less than 1% in the steady state, then the ST provided
information on the breathing variables, as depicted in Figure 32.
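The acceptance criterion above (steady-state error below 1% against the reference) can be sketched as follows. This is not the authors' polynomial model: it assumes the simplest one-coefficient form suggested by Equation (17), DP = c·F², and uses synthetic data. The function names, the value of `true_c` and the perturbations are hypothetical.

```python
def fit_quadratic_gain(flows, dps):
    """Least-squares coefficient c for DP = c * F**2 (closed form)."""
    num = sum(f * f * dp for f, dp in zip(flows, dps))
    den = sum(f ** 4 for f in flows)
    return num / den


def estimate_flow(dp, c):
    """Invert DP = c * F**2 for a non-negative flow estimate."""
    return (max(dp, 0.0) / c) ** 0.5


def max_relative_error(reference, estimate):
    """Largest relative deviation of the estimate from the reference."""
    return max(abs(e - r) / abs(r) for r, e in zip(reference, estimate))


# Synthetic calibration data (made-up values, not measurements from the paper).
ref_flows = [20.0, 40.0, 60.0, 80.0, 100.0]        # reference flow meter, L/min
true_c = 0.035                                      # Pa per (L/min)^2, hypothetical
noise = [0.002, -0.003, 0.001, -0.002, 0.003]       # small relative perturbations
dps = [true_c * f * f * (1.0 + n) for f, n in zip(ref_flows, noise)]

c_hat = fit_quadratic_gain(ref_flows, dps)
est_flows = [estimate_flow(dp, c_hat) for dp in dps]
err = max_relative_error(ref_flows, est_flows)
print(f"c = {c_hat:.5f}, max relative error = {100 * err:.3f} %")
print("accepted" if err < 0.01 else "rejected")
```

A higher-order polynomial with adaptive coefficients would replace `fit_quadratic_gain`, but the comparison against the reference and the 1% threshold would keep the same shape.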
Therefore, it was necessary to define the input variables in order to correlate the
estimated responses under the dependencies of the reference variable. Figure 33 shows
the two input variables of the system from the data interpretation analysis. The first is
the pivot displacement in sexagesimal degrees (red curve in subfigure III of
Figure 33). The pressure difference in its electrical equivalent (mV) was considered as a
second input or excitation variable (blue curve in subfigure I of Figure 33). Moreover,
the pressure difference measured by the nanostructure-based sensor, shown in
subfigure II, revealed that this sensor maintains a rapid response time and
robustness under disturbances of the pivot (green curve in subfigure II of Figure 33).
On the other hand, the ST response obtained from the
electrical transduction of the pressure difference into airflow values in millivolts, for
cams of extra-small, medium, and extra-large sizes, is shown in
subfigures I–III of Figure 34, in which blue indicates high frequency and red
represents low frequency. With this information, it is possible to identify a linear range of
work.
The ventilator acts as an intermediary for the signals received from the rotor
position sensors (RS1 (AS5047) and RS2), which are used to measure the rotor speed of the
motor and to control it through a driver (DR1); the driver has its own control system, with
its own actuator and sensor, and supplies the electrical current to the motor (M).
For this reason, the controller (as central control unit) also requires the signal
from a rotor position sensor to measure the angular displacement of the pivot and so provide
an estimation of the air volume, as well as the signal from the airflow sensor (FS1), which
has its own transduction algorithm, to provide information to medical doctors through the
touchscreen panel, as depicted in Figure 35.
Figure 35. Block diagram of the control system for online operation.
The ST achieved good performance for the diameter of 8 mm owing to its optimal
response, in terms of appropriate magnitudes and tracking, for the mechanical ventilator
curves. In the “data interpretation analysis” the ST with an orifice diameter of 10 mm
was used. Despite the nonlinear response, the airflow was obtained because of the good
performance of the ST algorithm in transducing the pressure difference signal to the airflow
variable for a larger range of work (low and high speed of the ventilator).
The range of work of the test was limited to a high frequency of 20 Hz and a low
frequency of 15 Hz. The transduction was achieved with the microphone sensors used to
obtain the pressure difference, as shown in Figure 36I, where the blue curve corresponds
to the system working at high speed and the red curve to low-speed
operation. Figure 36I shows that at high speeds it was possible to obtain an increased
amplitude (50 percent). Both signals were obtained in millivolts, and the transducer
(small microphones) showed greater sensitivity at higher speeds.
Figure 36. Microphone response in high- and low-speed operation for small and big cams.
Furthermore, it was necessary to define the input variables to correlate the estimated
responses under the dependencies of the reference variable (analyzed in previous chapters).
Figure 36II shows the input variables of the system from the data interpretation analysis,
which are given by the motor displacement in sexagesimal degrees, with the blue curve
indicating high speed and the red curve representing low speed. Note
that motor displacement causes pivot displacement, so a single sensor is enough
to obtain the angular displacement during the polynomial correlation variable analysis.
Therefore, the main variable obtained from the ST is the airflow rate, given by the
transduction operation; the other breathing variables “volume and pressure” are
predictive estimations obtained from the airflow variable by the polynomial analysis
described above.
Figure 36III–V presents the airflow rate, volume, and pressure, respectively, in which
the blue color curves are the airflow, volume, and pressure in high-frequency operation,
and the red color curves are the airflow, volume, and pressure in low-frequency operation,
which were obtained as a consequence of the predictive/adaptive algorithm; however,
it is necessary to keep in mind the importance of the previous data analysis during the
characterization and calibration of the ST to obtain the presented results.
Because the pressure difference is transduced into volume, airflow and pressure, it was
necessary to develop a multivariable control algorithm for these respiratory
variables. The algorithm controls the rotor speed of the mechanical ventilator motor,
the air pressure difference and the airflow and, implicitly, the pressure and volume.
$$PID(S) = K_{Pp} + \frac{K_{Ip}}{S} + K_{Dp}\,S \qquad (18)$$
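A time-domain counterpart of Equation (18) can be sketched as a discrete controller. The discretization (rectangular integration and a backward-difference derivative), the class name and the gain values are assumptions made for illustration; the source does not state how the controller was implemented.

```python
class DiscretePID:
    """Discrete sketch of PID(S) = KPp + KIp / S + KDp * S.

    Integral term: rectangular (Euler) accumulation of the error.
    Derivative term: backward difference of the error.
    """

    def __init__(self, kp, ki, kd):
        self.kp, self.ki, self.kd = kp, ki, kd
        self.integral = 0.0
        self.prev_error = None

    def update(self, error, dt):
        self.integral += error * dt
        # No derivative action on the first sample, to avoid a spurious kick.
        if self.prev_error is None:
            derivative = 0.0
        else:
            derivative = (error - self.prev_error) / dt
        self.prev_error = error
        return self.kp * error + self.ki * self.integral + self.kd * derivative


pid = DiscretePID(kp=2.0, ki=1.0, kd=0.5)   # hypothetical gains
outputs = [pid.update(e, dt=0.1) for e in (1.0, 1.0, 0.5)]
print(outputs)
```

Setting `kd=0.0` gives the PI controller that the text later selects for the identification of the motor parameters.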
The plant (mechanical ventilator motor) was analyzed in two sections, namely the elec-
trical section and the mechanical section which are described in the following paragraphs.
Figure 37. Block diagram for the rotor position control of the mechanical ventilator.
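Applying block-diagram algebra to the diagram depicted above, the following is
obtained:
$$\left[Inp(S) - Sens(S)\,Y(S)\right] Cont(S)\,MS(S) = Y(S) \qquad (19)$$
From which the following equation is obtained:
$$\frac{Y(S)}{Inp(S)} = \frac{Cont(S)\,MS(S)}{1 + Sens(S)\,Cont(S)\,MS(S)} \qquad (20)$$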
In Equation (20), the transfer function for the rotor position sensor is kept general;
however, the response time of the position sensor is very short in comparison to that of
the motor. Hence, its transfer function was reduced to Sens(S) = Ks. Therefore, the
characteristic equation is given by Equation (21), in which m is the mass of the motor rotor,
Ki is the electrical current coefficient of the motor and Ky is the displacement coefficient of
the motor.
$$\left(K_{Pp} + \frac{K_{Ip}}{S} + K_{Dp}\,S\right)\frac{K_i}{m S^{2} + K_y}\,K_s + 1 = 0 \qquad (21)$$
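As a worked step, using only Equation (21) and no further assumptions, multiplying the characteristic equation by S(mS² + Ky)/m clears the fractions and gives a monic cubic in S:

```latex
\left(K_{Pp} S + K_{Ip} + K_{Dp} S^{2}\right) K_i K_s + m S^{3} + K_y S = 0
\;\;\Longrightarrow\;\;
S^{3} + \frac{K_{Dp} K_i K_s}{m}\, S^{2} + \frac{K_{Pp} K_i K_s + K_y}{m}\, S + \frac{K_{Ip} K_i K_s}{m} = 0
```

The three coefficients of this cubic are the quantities that appear on the left-hand sides of Equations (25)–(27).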
$$S^{3} + \omega_0(2+\alpha)\,S^{2} + \omega_0^{2}(1+2\alpha)\,S + \alpha\,\omega_0^{3} = 0 \qquad (24)$$
Therefore, the controller parameters KPp and KIp can be obtained by comparing
the coefficients of Equations (23) and (24), from which Equations (25)–(27) are
proposed as functions of Ki, Ks, KDp, m, ω0 and α.
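$$\frac{K_{Dp} K_i K_s}{m} = \omega_0(2+\alpha) \qquad (25)$$
$$\frac{K_{Pp} K_i K_s + K_y}{m} = \omega_0^{2}(1+2\alpha) \qquad (26)$$
$$\frac{K_{Ip} K_i K_s}{m} = \alpha\,\omega_0^{3} \qquad (27)$$
Moreover, α is obtained from Equation (27) and represented by Equation (28):
$$\alpha = \frac{K_{Ip} K_i K_s}{m\,\omega_0^{3}} \qquad (28)$$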
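The coefficient matching in Equations (25)–(27) can be checked numerically with a short sketch. The plant values, ω0 and α below are hypothetical and serve only to exercise the formulas; the function names are likewise illustrative. The check relies on the fact that Equation (24) factors as (S + αω0)(S + ω0)².

```python
def pid_gains(m, k_i, k_s, k_y, omega0, alpha):
    """Solve Equations (25)-(27) for KPp, KIp and KDp."""
    k_dp = m * omega0 * (2.0 + alpha) / (k_i * k_s)
    k_pp = (m * omega0 ** 2 * (1.0 + 2.0 * alpha) - k_y) / (k_i * k_s)
    k_ip = m * alpha * omega0 ** 3 / (k_i * k_s)
    return k_pp, k_ip, k_dp


def characteristic_poly(s, m, k_i, k_s, k_y, k_pp, k_ip, k_dp):
    """Equation (21) cleared of fractions:
    m*S^3 + KDp*Ki*Ks*S^2 + (KPp*Ki*Ks + Ky)*S + KIp*Ki*Ks.
    """
    g = k_i * k_s
    return m * s ** 3 + k_dp * g * s ** 2 + (k_pp * g + k_y) * s + k_ip * g


# Hypothetical plant and design values (not taken from the paper).
m, k_i, k_s, k_y = 0.05, 2.0, 1.5, 0.8
omega0, alpha = 10.0, 3.0

k_pp, k_ip, k_dp = pid_gains(m, k_i, k_s, k_y, omega0, alpha)
print(f"KPp = {k_pp:.4f}, KIp = {k_ip:.4f}, KDp = {k_dp:.4f}")

# Both -omega0 (double root) and -alpha*omega0 should make the polynomial vanish.
residuals = [
    characteristic_poly(root, m, k_i, k_s, k_y, k_pp, k_ip, k_dp)
    for root in (-omega0, -alpha * omega0)
]
print(residuals)
```

If the residuals are zero to rounding error, the gains place the closed-loop roots where Equation (24) prescribes.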
Therefore, Equations (29) and (30) give the integral and proportional controller
parameters, which depend on the controller parameter KDp; this can be treated as a reference
parameter (chosen from the stability analysis of the dynamic system) to find the PID control
for the motor of the mechanical ventilator. However, identifying the parameters of the motor
requires a controller with slow reactions, to provide sufficient time to identify the
parameters of the system. The controller selected for the identification of the physical
parameters of the mechanical ventilator motor was therefore a PI controller; hence, the
derivative parameter KDp becomes null, and Equation (31) can be obtained from Equation (29),
thereby providing the integral parameter of the PI controller.
$$K_{Ip} = -\frac{2\,m\,\omega_0^{3}}{K_i K_s} \qquad (31)$$
Finally, Equation (32) is the proportional parameter of the PI controller obtained from
Equation (30) when the derivative parameter is null.
$$K_{Pp} = \frac{m\,\omega_0^{2}\left(1-4\right) - K_y}{K_i K_s} \qquad (32)$$
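As a check on the PI case, the following worked step uses only Equations (25)–(27) together with the condition KDp = 0, and assumes ω0 ≠ 0:

```latex
K_{Dp} = 0 \;\Rightarrow\; \omega_0 (2 + \alpha) = 0 \;\Rightarrow\; \alpha = -2
\\[4pt]
K_{Ip} = \frac{\alpha\, m\, \omega_0^{3}}{K_i K_s} = -\frac{2\, m\, \omega_0^{3}}{K_i K_s}
\\[4pt]
K_{Pp} = \frac{m\, \omega_0^{2}(1 + 2\alpha) - K_y}{K_i K_s}
       = \frac{m\, \omega_0^{2}(1 - 4) - K_y}{K_i K_s}
       = -\frac{3\, m\, \omega_0^{2} + K_y}{K_i K_s}
```

Under these conditions both PI parameters follow directly from the motor coefficients and the chosen ω0.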
Therefore, using the last equation, the optimal control response for the pressure
difference can be achieved, as well as for the rotor speed and, implicitly, the volume, airflow
and pressure. Moreover, the optimal control response can be enhanced by the specific
weight matrix “W” that must be included in Equation (A5). Equation (33) has similarities
with Equation (A9), but the difference is given by the adaptive matrix coefficient “φ”,
because for this analysis the matrix depends only on the required control strategies
while maintaining its dependence on “ϕ” (the geometrical and material coefficients of the
designed ST).
As a consequence, the main control algorithm was designed using the adaptive cascade
model depicted in Figure 38, in which “U1” is the input variable (pressure
difference measured by the designed ST) and the internal response variable is the airflow
“U2”, which in turn is the input variable used to obtain the pressure “Y1” and volume “Y2”.
Moreover, the internal control variable of the rotor speed is correlated with the desired
airflow required by the mechanical ventilator.
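One stage of this cascade, from the airflow “U2” to the volume “Y2”, can be sketched as a numerical integration of the airflow over time. The trapezoidal rule, the function name and the sample values are assumptions for illustration; the pressure stage “Y1” depends on the authors' polynomial model and is not reproduced here.

```python
def integrate_volume(flow_l_min, dt_s):
    """Cumulative volume in litres from airflow samples in L/min.

    Uses the trapezoidal rule with a fixed sample time dt_s in seconds.
    """
    volume = [0.0]
    for prev, curr in zip(flow_l_min, flow_l_min[1:]):
        mean_flow_l_s = 0.5 * (prev + curr) / 60.0   # L/min -> L/s
        volume.append(volume[-1] + mean_flow_l_s * dt_s)
    return volume


# Hypothetical constant airflow of 60 L/min sampled every 0.1 s for 1 s.
flow = [60.0] * 11
vol = integrate_volume(flow, 0.1)
print(f"delivered volume after 1 s: {vol[-1]:.3f} L")
```

In a full cascade the `flow` samples would come from the transduced pressure difference “U1” rather than from a fixed list.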
Figure 38. Adaptive cascade control model for the ventilator control system.
enhanced by nanostructures (red colour curve) were obtained with a steady state error of
1.4% of the ST and 0.8% for the ST enhanced by nanostructures.
In Figure 39II the curves of the controlled volume (implicit) and measured by the
designed ST (blue colour curve) as well as the volume measured by the designed ST
enhanced by nanostructures (red colour curve) are presented, with a steady state error of
1.5% of the ST and 0.7% of the ST enhanced by nanostructures.
Furthermore, in Figure 39III the curves of the controlled pressure (implicit) and mea-
sured by the designed ST (blue colour curve) as well as the pressure measured by the
designed ST enhanced by nanostructures (red colour curve) are shown, with a steady state
error of 1.8% for the ST and 0.8% for the ST enhanced by nanostructures.
Therefore, the designed ST of nanostructures (TiO2 nanotubes) based on Anodic Alu-
minium Oxide (AAO) provided better results in robustness, response time and resolution,
which supported measurements from the approximated range of operation between 0.985
atm (Lima) and 1 atm (Huancayo).
6. Conclusions
A sensor/transducer (ST) was designed to measure the pressure difference through an
orifice plate and to obtain the airflow rate, volume and pressure over time to register the
performance of mechanical ventilators. The geometrical characteristics of the proposed sensor
based on an orifice plate were researched in order to identify the appropriate parameters to
measure the physical variables. The diameter has an important influence on the performance
of the sensor; in this sense, a diameter of 8 mm provides the optimal response, in terms of
magnitudes and tracking, for the mechanical ventilator curves without significantly increasing
the inlet pressure for the breathing circuit, which is provided by the mechanical ventilator. A
diameter in the range of 4 mm to 6 mm can capture the pressure difference of the airflow, but with
a nonlinear response; for the range between 10 mm and 12 mm, the ST can measure the
physical variables with linear responses for breathing values (as a future application, it can be
used to measure airflow shared by several patients connected to one mechanical ventilator).
Therefore, it was suggested to use the ST with a diameter of 8 mm due to its linear response and
robustness under disturbances.
The mathematical model of the system was designed and evaluated. Consequently, an
algorithm was also designed to obtain the airflow, volume and pressure from the pressure
difference, which were obtained by the transduction process of the transducer joined to
the designed sensor. Moreover, the ST system was analyzed in dynamical and transient
behavior for ranges of work that depended on geometrical parameters and physical values
of airflow, pressure, and volume of mechanical ventilators for artificial human breathing.
Like other recent proposals [22–24], this is a novel sensor design, because it
correlates multiple variables, treats some variables (temperature and
vibration) as disturbances, and considers the effect of nanostructures.
For a given orifice diameter, a correlation was observed between the pressure dif-
ference and the airflow. This correlation was not linear, but instead showed a parabolic
tendency. Secondly, similar results and tendencies were observed between the steady-state
and transient-state simulations. Therefore, it can be concluded that a steady-state approach
(and its corresponding assumptions) can be considered to validly describe the transient
behavior of this orifice plate sensor.
For a given flow, the smaller the orifice diameter, the larger the pressure difference
induced in the orifice plate. This could happen because, for a given flow, a smaller orifice
diameter requires the flow to pass through a narrower opening. This increases the flow velocity,
but at the cost of reducing the flow pressure at the outlet, resulting in greater pressure
differences in the orifice plate. Therefore, a bigger inlet pressure would be necessary to
help the flow overcome the pressure loss in the pipes. It can also be concluded that a plate
with a smaller orifice diameter would be the best for this sensor. Taking this into account,
lower pressure drops would occur in the plate, which would not considerably affect the
flow, while at the same time being large enough to be detected by the ST and still being in
the measuring range.
The designed ST enhanced by nanostructures of anodic aluminium oxide can provide
a faster and robust response for geographic conditions where the atmospheric pressure
level is different, as was the case in this research (tests in Lima and Huancayo). The
nanostructure samples used for the designed sensors have the potential to fix adaptive
coefficients to improve the operating work and enhance the response system (breathing
variables) in case of disturbances.
Gratitude is also expressed to the researchers of the Metrology Laboratory PUCP, due to their support
in the experimental analysis, as well as acknowledgement to the Mechanical Department of PUCP
due to the support given to use measurement equipment for the experiments.
Conflicts of Interest: The authors declare no conflict of interest. The funders had no role in the design
of the study; in the collection, analyses, or interpretation of data; in the writing of the manuscript, or
in the decision to publish the results.
Appendix A
This appendix describes the special equipment designed for the validation and mathe-
matical characterization of the air flow sensor in different working conditions of air flow
and the low-cost mechanical ventilator based on cams where the air flow sensor was
installed.
Figure A1. Air flow test bench designed for model validation.
Figure A2. Mechanical ventilation system OxygenIP.PE, where the novel air flow sensor/transducer
is implemented.
Appendix B
In this research, it was necessary to work with strategies of Model Predictive Control
(MPC) to join both subsystems (mechanical and electrical) for the main algorithm, for which
an internal identification system was used, while adaptive control coefficients/weights
searched for the right control. Therefore, the nonlinear function “g” and internal variables
“z(t)” due to excitation “v(t)” as a function of time “t” were important.
$$\frac{dx(t)}{dt} = g\left(z(t), v(t), \theta\right) \qquad (A1)$$
On the other hand, “m(t)” correlated with “z(t)” and “v(t)” through a nonlinear function
“n”.
$$m(t) = n\left(z(t), v(t), \theta\right) \qquad (A2)$$
Additionally, “AS” represents the trajectory, and the input excitation “O” is included
depending on the sample time ki.
So, the cost function “J” was analyzed to achieve the optimal desired response [21].
$$J = (A_S - Y)^{T}(A_S - Y) + \Delta S^{T} A\,\Delta S \qquad (A4)$$
$$Y = L\,X(k_i) + \varphi\,\Delta S \qquad (A5)$$
Equivalently,
$$J = (A_S - L\,X(k_i))^{T}(A_S - L\,X(k_i)) - 2\,\Delta S^{T}\varphi^{T}(A_S - L\,X(k_i)) + \Delta S^{T}(\varphi^{T}\varphi + A)\,\Delta S \qquad (A6)$$
where “L” and “φ” are matrices that contain all the physical parameters of the system (as
joining matrices above for the identified result) [21].
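$$\frac{\partial J}{\partial \Delta S} = -2\,\varphi^{T}(A_S - L\,X(k_i)) + 2\,(\varphi^{T}\varphi + A)\,\Delta S \qquad (A7)$$
So
$$\frac{\partial J}{\partial \Delta S} = 0 \qquad (A8)$$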
From which the optimal excitation signal in order to find the optimal response is given
by [1,21]
$$\Delta S = (\varphi^{T}\varphi + A)^{-1}\varphi^{T}(A_S - L\,X(k_i)) \qquad (A9)$$
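Equation (A9) can be evaluated with a small, dependency-free sketch. The matrices and vectors below are hypothetical (a three-step prediction with two control increments), and the helper functions are illustrative, not part of the authors' implementation. The gradient of Equation (A7) is evaluated at the result to confirm that it vanishes.

```python
def transpose(a):
    return [list(col) for col in zip(*a)]


def matmul(a, b):
    bt = transpose(b)
    return [[sum(x * y for x, y in zip(row, col)) for col in bt] for row in a]


def solve(a, b):
    """Solve a x = b by Gaussian elimination with partial pivoting."""
    n = len(a)
    aug = [row[:] + [rhs] for row, rhs in zip(a, b)]
    for col in range(n):
        pivot = max(range(col, n), key=lambda r: abs(aug[r][col]))
        aug[col], aug[pivot] = aug[pivot], aug[col]
        for r in range(col + 1, n):
            factor = aug[r][col] / aug[col][col]
            for c in range(col, n + 1):
                aug[r][c] -= factor * aug[col][c]
    x = [0.0] * n
    for r in range(n - 1, -1, -1):
        tail = sum(aug[r][c] * x[c] for c in range(r + 1, n))
        x[r] = (aug[r][n] - tail) / aug[r][r]
    return x


def optimal_increment(phi, weight, a_s, free_response):
    """Delta S = (phi^T phi + A)^-1 phi^T (A_S - L X(k_i)), Equation (A9)."""
    phi_t = transpose(phi)
    gram = matmul(phi_t, phi)
    lhs = [[g + w for g, w in zip(g_row, w_row)] for g_row, w_row in zip(gram, weight)]
    residual = [[s - f] for s, f in zip(a_s, free_response)]
    rhs = [row[0] for row in matmul(phi_t, residual)]
    return solve(lhs, rhs), lhs, rhs


# Hypothetical values, chosen only to exercise the formula.
phi = [[1.0, 0.0], [1.5, 1.0], [1.8, 1.5]]
weight = [[0.1, 0.0], [0.0, 0.1]]        # matrix A in Equation (A4)
a_s = [1.0, 1.0, 1.0]                    # desired trajectory A_S
free_response = [0.2, 0.3, 0.35]         # L X(k_i)

delta_s, lhs, rhs = optimal_increment(phi, weight, a_s, free_response)
# Gradient (A7): -2 phi^T (A_S - L X(k_i)) + 2 (phi^T phi + A) Delta S.
gradient = [2.0 * (sum(l * d for l, d in zip(row, delta_s)) - r) for row, r in zip(lhs, rhs)]
print(delta_s, gradient)
```

Solving the linear system directly avoids forming the explicit inverse, which is the usual choice when the control horizon grows.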
References
1. Calderón Ch, J.A.; Rincón, C.; Agreda, M.; Jiménez de Cisneros, J.J. Design and Analysis of a Mechanical Ventilation System
Based on Cams. Heliyon 2021, 7, e08195. [CrossRef] [PubMed]
2. Freescale Semiconductor Literature Distribution Center, Inc. Ventilator/Respirator Hardware and Software Design Specification;
Freescale Semiconductor Literature Distribution Center, Inc.: Denver, CO, USA, 2011; pp. 11–20.
3. Brzeski, P.; Lazarek, M.; Perlikowski, P. Experimental study of the novel tuned mass damper with inerter which enables changes
of inertance. J. Sound Vib. 2017, 404, 47–57. [CrossRef]
4. Fowles, G.; Boyes, W. Measurement of Flow, 4th ed.; Elsevier: Amsterdam, The Netherlands, 2010; ISBN 978075068308.
5. Biselli, P.J.C.; Nóbrega, R.S.; Soriano, F.G. Nonlinear flow sensor calibration with an accurate syringe. Sensors 2018, 18, 2163.
[CrossRef] [PubMed]
6. OxyGEN Project. Available online: https://www.oxygen.protofy.xyz/download (accessed on 23 November 2020).
7. Chang, J.; Acosta, A.; Aspiazu, J.B.; Reategui, J.; Rojas, C.; Cook, J.; Nole, R.; Giampietri, L.; Pérez-Buitrago, S.; Casado, F.L.; et al.
Masi: A mechanical ventilator based on a manual resuscitator with telemedicine capabilities for patients with ARDS during the
COVID-19 crisis. HardwareX 2021, 9, e00187. [CrossRef] [PubMed]
8. Jamróz, P. Interaction between the Standard and the Measurement Instrument during the Flow Velocity Sensor Calibration
Process. Processes 2021, 9, 1792. [CrossRef]
9. Włodarczak, S.; Ochowiak, M.; Doligalski, M.; Kwapisz, B.; Krupińska, A.; Mrugalski, M.; Matuszak, M. Flow Rate Control by
Means of Flow Meter and PLC Controller. Sensors 2021, 21, 6153. [CrossRef] [PubMed]
10. Koirala, N.; McLennan, G. Mathematical Models for Blood Flow Quantification in Dialysis Access Using Angiography: A
Comparative Study. Diagnostics 2021, 11, 1771. [CrossRef] [PubMed]
11. Bisgaard, J.; Tajsoleiman, T.; Muldbak, M.; Rydal, T.; Rasmussen, T.; Huusom, J.K.; Gernaey, K.V. Automated Compartment Model
Development Based on Data from Flow-Following Sensor Devices. Processes 2021, 9, 1651. [CrossRef]
12. Hammer, A.; Roland, W.; Zacher, M.; Praher, B.; Hannesschläger, G.; Löw-Baselli, B.; Steinbichler, G. In Situ Detection of Interfacial
Flow Instabilities in Polymer Co-Extrusion Using Optical Coherence Tomography and Ultrasonic Techniques. Polymers 2021, 13,
2880. [CrossRef] [PubMed]
13. Landau, L.D.; Lifshitz, E.M. Fluid Mechanics, Institute of Physical Problems; U.S.S.R. Academy of Sciences: Moscow, Russia, 1987.
14. Aldoghaither, A.; Liu, D.-Y.; Laleg-Kirati, T.M. Modulating Functions Based Algorithm for the Estimation of the Coefficients and Differen-
tiation Order for a Space-Fractional Advection-Dispersion Equation; Society for Industrial and Applied Mathematics: Philadelphia, PA,
USA, 2015.
15. Bridgeman, D.; Tsow, F.; Xian, X.; Forzani, E. A new differential pressure flow meter for measurement of human breath flow:
Simulation and experimental investigation. AIChE J. 2016, 62, 956–964. [CrossRef] [PubMed]
16. Launder, B.E. MAN—ANSYS Fluent User’s Guide Release 15.0. Knowl. Creat. Diffus. Util. 2013, 15317, 724–746.
17. Paz, C.; Suárez, E.; Concheiro, M.; Porteiro, J. CFD transient simulation of a breathing cycle in an oral-nasal extrathoracic model. J.
Appl. Fluid Mech. 2017, 10, 777–784. [CrossRef]
18. CLÍNIC BARCELONA. Available online: https://www.clinicbarcelona.org/noticias/el-dispositivo-de-ventilacion-de-
emergencia-desarrollado-por-clinic-germans-trias-i-pujol-y-ub-con-protofy-xyz-recibe-la-aprobacion-de-la-aemps-para-
hacer-un-estudio-clinico (accessed on 15 May 2020).
19. Karthik, G.S.Y.; Kumar, K.J.; Seshadri, V. Prediction of Performance Characteristics of Orifice Plate Assembly for Non-Standard
Conditions Using CFD. Int. J. Eng. Tech. Res. 2015, 3, 2321–2869.
20. Funk, J.E.; Wood, D.J.; Chao, S.P. The transient response of orifices and very short lines. J. Fluids Eng. Trans. ASME 1972, 94,
483–489. [CrossRef]
21. Calderón, J.A.; Barriga, E.B.; Mas, R.; Chirinos, L.; Barrantes, E.; Alencastre, J.; Tafur, J.C.; Melgarejo, O.; Lozano, J.H.; Heinrich, B.;
et al. Magnetic Bearing Proposal Design for a General Unbalanced Rotor System enhanced because of using sensors/actuators
based in nanostructures. E3S Web Conf. 2019, 95, 01002. [CrossRef]
22. Халилов, И.А.; Керимов, С.Х.; Багирова, С.А.; Гаджиева, Ф.Ш. Синтез кулачкового механизма с учетом условий
передачи сил и контактной прочности. (Khalilov, I.A.; Kerimov, S.K.; Bagirova, S.A.; Gadzhieva, F.S. Synthesis of a Cam
Mechanism Taking into Account the Conditions for the Transfer of Forces and Contact Strength). 2017. Available online:
http://web.iyte.edu.tr/~gokhankiper/ISMMS/Khalilov.pdf (accessed on 6 April 2022).
23. Liang, L.; Qin, K.; El-Baz, A.S.; Roussel, T.J.; Sethu, P.; Giridharan, G.A.; Wang, Y. A Flow Sensor-Based Suction-Index Control
Strategy for Rotary Left Ventricular Assist Devices. Sensors 2021, 21, 6890. [CrossRef] [PubMed]
24. Algarni, M. Optimization of Nano-Additive Characteristics to Improve the Efficiency of a Shell and Tube Thermal Energy Storage
System Using a Hybrid Procedure: DOE, ANN, MCDM, MOO, and CFD Modeling. Mathematics 2021, 9, 3235. [CrossRef]
25. Melaibari, A. Free Vibration of FG-CNTRCs Nano-Plates/Shells with Temperature-Dependent Properties. Mathematics 2022, 10,
583. [CrossRef]
mathematics
Article
Mechanical Model and FEM Simulations for Efforts on Biceps
and Triceps Muscles under Vertical Load: Mathematical
Formulation of Results
Emilio Lechosa Urquijo 1 , Fernando Blaya Haro 2 , Juan David Cano-Moreno 2, *, Roberto D’Amato 2
and Juan Antonio Juanes Méndez 1
Abstract: Although isometric contractions in human muscles have been analyzed several times,
there are no FEA models that allow us to use the same modeled joint (the elbow under our case) in
different conditions. Most elbow joints use 3D elements for meshing. Representing the muscles in the
joint is quite useful when the study is focused on the muscle itself, knowing stress distribution on
muscle, and checking damage in muscle in a detailed manner (tendon–muscle insertion, for example).
However, this technique is not useful for studying muscle behavior at different positions of the joint.
This study, based on the mechanical model of the elbow joint, proposes a methodology for modelling
muscles that will be studied in different positions by meshing them with 1D elements. Furthermore,
the methodology allows us to calculate biceps and triceps efforts under load for different angles of
elbow joint aperture. The simulation results have been mathematically modelled to obtain general
formulations for these efforts, depending on the load and the aperture angle.
Keywords: FEM analysis; biomechanics; effort mathematical models; NMR reconstruction; reverse
engineering; elbow joint
Citation: Lechosa Urquijo, E.; Blaya Haro, F.; Cano-Moreno, J.D.; D’Amato, R.; Juanes Méndez, J.A.
Mechanical Model and FEM Simulations for Efforts on Biceps and Triceps Muscles under Vertical Load:
Mathematical Formulation of Results. Mathematics 2022, 10, 2441. https://doi.org/10.3390/math10142441
average of approximately 44%, relative to healthy normal discs. Samani et al. proposed a
breast biomechanical model using a FEM formulation [9], focused especially on the modeling
of breast tissue deformation, which takes place in breast imaging procedures.
The study proposed by Martinez [12] is focused on the characterization of the muscle
and its response under different conditions. Muscle length changes and the internal stress
distribution for different electrical stimulations were studied. Weiss et al. [13] described
strategies for addressing technical aspects of the 3D computational modelling of ligaments
with FEM analysis. Islan et al. [14] studied the behavior of the glenohumeral joint under
different postures that represent the routine of a violinist from an ergonomic point of view
(RULA method), using a 3D FEM model to study this movement for a high number
of cycles. Sachenkov et al. [15] studied the movement of the femur in the hip, using 1D
models for the muscles. Martins et al. [16] presented a study with FEM techniques applied
to the pelvic floor, using the Hill model for muscles as a 1D element, extrapolating it to 2D
and 3D models, and working with isometric and isotonic contractions. Tang et al. [17] studied
muscle fatigue with a 3D muscle model in different activation situations. Syomin et al. [18]
performed a numerical simulation of the contraction of the left ventricle, approximated by an
axisymmetric body.
Perreault and Heckman evaluated the ability of the Hill model for muscles to describe
muscle force responses for naturally and electrically stimulated muscles [19]. Alonso et al. [20]
proposed a static physiological optimization model instead of the dynamic optimization model.
For this purpose, they modeled the muscles as 1D elements aligned with their line
of action. Holzbaur et al. proposed a different biomechanical model representing
the upper extremity, including shoulder, forearm and arm [21]. The muscles are replaced by a
1D element with a force generation parameter based on experimental data. Park et al. [22]
proposed a torque estimation method at the joint of an index finger in the human hand while
pinching; the study is supported by electromyography and the Hill muscle model. Soechting
and Flander used a simplified muscle model [23] to predict torques during arm movement,
comparing their prediction with the EMG results measured in 20 subjects. The muscle model
is also based on the Hill muscle model. Zajac developed a mathematical model to analyze
the muscle–tendon (‘musculotendon’) actuator [24], based only on the ratio between tendon
length and muscle fiber length, both at rest.
In the study and FEM analysis of human joints, bones are considered rigid elements
whose geometry does not vary when the joint moves. The same happens with cartilages,
which can be approximated as rigid, nondeformable parts [25]. On the other hand,
when a muscle in a joint is meshed with 3D elements, the mesh is only valid for a certain
position, and if a different position of the joint needs to be studied, the muscles must be
remeshed. This problem disappears when a rod-type 1D element is used to mesh
the muscles in the joint. Ligaments can be modeled as springs, 1D elements with two nodes.
Accordingly, a model meshed with 3D elements for rigid bodies and 1D elements for
muscles can adopt different positions just by moving the elements of the rigid
bodies to the new positions.
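The idea that a two-node 1D element simply follows its nodes can be sketched in a few lines. The planar coordinates below are hypothetical and are not taken from the paper's model; the sketch only shows that moving one attachment node with its bone updates the element length without any remeshing.

```python
import math

def rotate_about(point, center, angle_deg):
    """Rotate a 2D point counterclockwise about a center by angle_deg."""
    a = math.radians(angle_deg)
    dx, dy = point[0] - center[0], point[1] - center[1]
    return (center[0] + dx * math.cos(a) - dy * math.sin(a),
            center[1] + dx * math.sin(a) + dy * math.cos(a))

def rod_length(node_a, node_b):
    """Length of a two-node 1D element defined only by its end nodes."""
    return math.dist(node_a, node_b)

# Hypothetical planar coordinates in mm (illustrative, not from the paper).
elbow_center = (0.0, 0.0)
origin_node = (0.0, 300.0)     # attachment on the fixed bone
insertion_node = (50.0, 0.0)   # attachment on the moving bone

# Moving the bone moves the node, and the 1D element follows it.
lengths = {}
for rotation_deg in (0, 30, 60):
    moved = rotate_about(insertion_node, elbow_center, rotation_deg)
    lengths[rotation_deg] = rod_length(origin_node, moved)
```

The same element definition (a pair of node identifiers) is reused in every configuration; only the node coordinates change.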
In the case of 1D elements with just two nodes, if those nodes move in space,
the element moves as well. As ligaments are defined between two nodes of two
different bones, if the bones move to another position, the ligament adopts the new
position. This allows us to reuse the same meshed model in several configurations. For this
reason, the final objective of this study is to find the equation that would allow one to
create a new finite element type that could be used in biomechanical analyses where muscle
forces are involved in “isometric contractions”. This study focuses on the analysis of human
muscle behavior in order to define a new equation that could predict the force that
would appear in a certain muscle under known conditions. The human joint under study
is the elbow. The proposed FEM analysis is focused only on the biceps and triceps
muscles, and the elbow joint muscles considered in this study are (Figure 1):
• Long biceps
• Short biceps
• Brachii
• Long supinator
• Long triceps
• Medium triceps
• Lateral triceps
Since muscles can only work under traction (they do not transmit compression loads),
different models must be proposed for a separate study of the triceps and biceps muscles.
Several analyses, at different aperture angles of the elbow, are solved to find the
effort needed in the muscles concerned (biceps or triceps) to balance 150 N applied on the wrist
in an “isometric contraction”. The scope of the study includes the following aspects:
• Analyzing the elbow under load at different angles;
• Obtaining efforts in different muscles involved in the joint for different positions;
• Defining mathematical models to predict efforts in muscles depending on:
• Joint angle or muscle length;
• Applied load on the wrist.
Figure 2. FEA model, muscles identification, flexors (left) and extensors (right).
Figure 3. FEA model, biceps study (left) and triceps study (right), 74° aperture angle.
• Contact between cartilages. It has been modeled using rigid elements (RBE2) that join the
nodes of both cartilage meshed parts involved in the joint, as explained in previous
studies [25]. In the RBE2 elements used, the rotational degrees of
freedom have been released so that only forces along the X, Y or Z axes are transmitted.
• Muscles. These are meshed using 1D elements (the type is discussed in this study).
Previous studies have considered muscles in this manner, as 1D elements acting along
the imaginary axis of the muscle. Alonso et al. [20] model the muscles of the leg in the
same manner to study human gait, with 1D elements aligned with the axis of the
muscles; Sachenkov et al. [29] use the same approximation for muscles; and Parekh [30]
approximates muscles as 1D FEM elements aligned with the muscle axis. This is a
simplification, but it is used by several authors because fusiform muscles have only
two insertion points and low pennation angles. Thus, the fibers are aligned with the
imaginary axis formed by joining both insertion points.
• Tendons. These are meshed in combination with muscles, as a single musculotendon
unit (MTU). The same 1D element is considered for tendons.
• Tendon insertion in bone. It has been modeled by adding a rigid element (RBE2)
that joins the end node of the 1D MTU element with several nodes in the bone, thus
distributing the reaction force over the area of the tendon insertion, as explained
in previous research [25]. Of the constraints that an element with these characteristics
can apply, only the displacements along the X, Y or Z axes have been applied, leaving the
rotations around them free. In this way, and since the MTU only transmits axial forces, we
eliminate any possible bending moments that may affect the insertion of the tendon in
the bone.
The RBE2 elements constitute constraint equations between the nodes to which the
element is connected. They are characterized by having one node, called independent,
whose degrees of freedom are independent. The rest of the nodes connected to the RBE2
element are called dependent, and their displacements must be the same as those of the
independent node.
That is why they are commonly known as rigid elements, since the distance between
the component nodes remains unchanged.
Figure 4 shows an example of an RBE2 element, with an independent node
(i) and four dependent nodes (j1 . . . j4).
The force balancing equation on an RBE2 is such that the reactions that appear on
dependent nodes of the element must balance the force applied in the independent node:
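As a sketch of this balance, assuming that $\mathbf{F}_i$ denotes the force applied at the independent node i and $\mathbf{R}_{j_k}$ the reaction at dependent node $j_k$ (these symbols are chosen here for illustration and are not taken from the paper), one way to write it for the four dependent nodes of Figure 4 is:

$$\mathbf{F}_i + \sum_{k=1}^{4} \mathbf{R}_{j_k} = \mathbf{0}$$

With the rotational degrees of freedom released, only the three force components along X, Y and Z enter this balance.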
In this way, using an element of this type to simulate tendon insertion in the bone, the
stress that appears in the 1D element that simulates MTU in the insertion zone is distributed,
eliminating stress concentrations.
In search of the element that best reflects the behavior of the muscle within the joint,
the ROD element, given that it only carries axial forces, is the one that best suits the
needs of the proposed mathematical study. However, this element can work under
tension and/or compression, which would alter the results. That is why two different
FEM models are generated for the study of the biceps and triceps muscles: one in which
the triceps muscles are removed, and another in which the biceps muscles are removed.
In this way, and considering the direction of the applied load, it can be ensured that the
analyzed muscles work only under traction.
Several tests have been carried out to define the characteristics of the ROD element that
will be used, with the objective of minimizing its stiffness while keeping the element
deformations at low values so as not to lose the isometric load condition in the analyses.
For these reasons, the elements used in the FEA models are defined as:
• Rod element (1D element);
• Constant cross section: 100 mm²;
• Material parameters:
• Young modulus: 2000 MPa;
• Poisson coefficient: 0.43.
This type of element transmits only axial forces. Its constitutive equation is shown below.
$$\begin{bmatrix} F \\ M \end{bmatrix} = \begin{bmatrix} K_x & 0 \\ 0 & K_T \end{bmatrix} \cdot \begin{bmatrix} \varepsilon_x \\ \alpha_x \end{bmatrix} \qquad (2)$$
where
• $F$ is the axial force in the element;
• $M$ is the moment along the axis of the element;
• $K_x$ is the axial stiffness of the element;
• $K_T$ is the rotational stiffness along the axis of the element;
• $\varepsilon_x$ is the length variation of the element;
• $\alpha_x$ is the angle rotated along the element axis.
To ensure that no torsional stresses are produced in the element, it is sufficient to
assign a value of $K_T = 0$.
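To give a feel for the magnitudes involved, the following sketch estimates the axial stiffness and elongation of one rod element. It assumes the standard linear elastic bar relation K_x = E·A/L, which the text does not state explicitly, and takes the long biceps force and length at 74° from Table 2 as the example.

```python
E_MPA = 2000.0     # Young modulus from the text (MPa = N/mm^2)
AREA_MM2 = 100.0   # constant cross section from the text (mm^2)

def axial_stiffness(length_mm):
    """Axial stiffness of a uniform bar, assuming K_x = E * A / L (N/mm)."""
    return E_MPA * AREA_MM2 / length_mm

def elongation(force_n, length_mm):
    """Length variation of the bar under an axial force (mm)."""
    return force_n / axial_stiffness(length_mm)

# Long biceps at a 74 degree aperture angle: 341 N and 391 mm (Table 2).
k_x = axial_stiffness(391.0)
delta = elongation(341.0, 391.0)
strain = delta / 391.0
```

Under these assumptions the element stretches by well under a millimetre, which is consistent with the stated aim of keeping deformations low enough to preserve the isometric condition.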
The use of an isotropic material for bone modelling is known and has previously
been applied: Teo et al. [26] meshed the head and spine using 3D elements, with bone tissue
treated as isotropic, linear, and elastic.
Figure 6. Mesh quality. Jacobian criteria for evaluating mesh quality. Value represented is
(1-measured value on element/ideal value on element). Maximum threshold defined as 0.6.
Figure 7. Boundary condition: muscles fixing the scapula (left) and constraint in the FEA model
with the scapula fixed (right).
Figure 8. Applied load (150 N) in wrist, vertical direction, biceps analysis (left) and triceps
analysis (right).
The load is applied in vertical direction, positive axis (ascending) when analyzing the
triceps and vertical direction, negative axis (descending) when analyzing the biceps. All
load cases are considered as static load cases, isometric contraction of the muscle, and, to
avoid compression loads on muscles, two different models have been developed; one of
them includes only biceps muscles, with descending load, and the other one includes only
triceps muscles, with ascending load.
In the model, gravity force has also been considered. Thus, the mass of the arm is
considered in the analysis.
To do so, the density of elements has been modified so that the values of the upper
arm and forearm mass are [32]:
• Forearm mass: 2.5 kg;
• Upper arm mass: 4.3 kg.
All load cases are considered as static load cases; due to the isometric contraction of
the muscle, there must be no displacement in the joint.
$$[R] = [K] \cdot [U] \qquad (3)$$
where
[R] = global loading matrix;
[K] = global stiffness matrix;
[U] = global displacement matrix.
Equation (3) can be solved directly using the Gauss elimination method; there is no
iterative process, and the solution is unique.
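A direct solve of a system of the form of Equation (3) can be illustrated on a toy problem. The sketch below assumes two hypothetical springs in series, fixed at one end and loaded with 150 N at the free end; the stiffness values are placeholders, and `numpy.linalg.solve` (an LU factorization, that is, Gaussian elimination with pivoting) stands in for the FEM solver.

```python
import numpy as np

# Hypothetical stiffness of two springs in series (N/mm).
k1 = k2 = 500.0

# Reduced global stiffness matrix after fixing the first node.
K = np.array([[k1 + k2, -k2],
              [-k2,      k2]])

# Global load vector: 150 N applied at the free end.
R = np.array([0.0, 150.0])

# Direct (non-iterative) solution of [R] = [K][U].
U = np.linalg.solve(K, R)
```

Because the system is linear and K is non-singular once the supports are applied, a single factorization yields the unique displacement vector.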
The resolution of the problem is carried out through several linear static analyses, each
one of them representing a different load condition in the joint, and the result of them is
the necessary effort in each muscle of the joint to balance the applied force on the wrist.
As mentioned above, there are two different models, thus guaranteeing that the
muscles work only in traction, to study the biceps and the triceps muscles.
In each of the models, the configuration of the joint will be modified by varying its
angle between 44° and 164°, at intervals of 10°.
The applied load will always be 150 N.
As a reference for the reading of the results, the fully extended arm corresponds to
164 degrees and the fully flexed arm corresponds to 44 degrees (see Figure 9).
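The resulting set of load cases can be enumerated as below. This is only a bookkeeping sketch: the tuple layout and the sign convention (+1 for ascending, -1 for descending load) are assumptions made here for illustration.

```python
# Aperture angles from 44 to 164 degrees in 10 degree steps.
ANGLES_DEG = list(range(44, 165, 10))

# Vertical load direction per model: descending for biceps, ascending for triceps.
LOAD_DIRECTION = {"biceps": -1, "triceps": +1}
LOAD_N = 150.0

# One linear static analysis per (model, angle) pair.
cases = [(model, angle, sign * LOAD_N)
         for model, sign in LOAD_DIRECTION.items()
         for angle in ANGLES_DEG]
```

With these settings there are 13 angles per model, which matches the 13 rows of Table 2.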
Figure 9. FEA model, minimum (left) and maximum (right) studied angles.
As described in a previous study [23], the results of the model with contacts and of the
model that uses rigid elements instead are similar, and therefore the use
of rigid elements improves the efficiency of the analysis.
$$\mathrm{Muscle}_{\mathrm{effort}} = \sum_{i=0}^{6} a_i \cdot \alpha^{i} \qquad (4)$$
Once the forces needed in the muscles to balance the joint have been calculated, for
different angles and with a load of 150 N applied at the wrist, a mathematical approach
is proposed for obtaining these efforts for other loads, based on the type of mechanism
considered in the elbow joint and on the analysis of the forces and torques applied in the
joint. The value of the force exponent is 1 in all cases, so there is a linear relation between
the applied force and the muscle reaction force [33]. Accordingly, the greater the force applied
to the wrist, the greater the muscle force must be, and both values are proportional.
Figure 10 and the following equations justify this simplification in the analysis, where:
• $W_s$ means the weight of the forearm;
• $W'_s$ means the equivalent force for $W_s$ applied at the wrist. The moment created by
$W_s$ or $W'_s$ about point O is the same;
• $W_q$ means the load applied in the wrist;
• α means the angle between the biceps axis and the normal to the radius longitudinal axis;
• β means the angle between the vertical and the longitudinal axis of the radius;
• a, b and c are geometrical dimensions.
The left image considers the mass of the arm itself, located at its center of gravity.
The balance equation for that force scheme, referred to point O, is:
$$F_m = \frac{(W_s \cdot a + W_q \cdot b) \cdot \sin\beta}{c \cdot \cos\alpha} \qquad (5)$$
Instead of using $W_s$, we can use $W'_s$, applied on the wrist (right image). This $W'_s$
must satisfy:
$$W_s \cdot a \cdot \sin\beta = W'_s \cdot b \cdot \sin\beta \qquad (6)$$
So, the balance equation would be the following:
$$F_m = (W_q + W'_s) \cdot \frac{b \cdot \sin\beta}{c \cdot \cos\alpha} \qquad (7)$$
As seen in Equation (7), the relation between $F_m$ and $W_q + W'_s$ is linear, with a constant
that, for a given angle, depends only on b, c and the angles α and β.
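The equivalence of Equations (5) and (7) can be checked numerically. In the sketch below the geometry (a, b, c, α, β) is hypothetical and chosen only for illustration; the forearm mass of 2.5 kg and the 150 N wrist load come from the text.

```python
import math

def fm_eq5(ws, wq, a, b, c, alpha_deg, beta_deg):
    """Muscle force with the forearm weight at its center of gravity, Eq. (5)."""
    al, be = math.radians(alpha_deg), math.radians(beta_deg)
    return (ws * a + wq * b) * math.sin(be) / (c * math.cos(al))

def fm_eq7(ws_equiv, wq, b, c, alpha_deg, beta_deg):
    """Muscle force with the equivalent weight moved to the wrist, Eq. (7)."""
    al, be = math.radians(alpha_deg), math.radians(beta_deg)
    return (wq + ws_equiv) * b * math.sin(be) / (c * math.cos(al))

# Hypothetical geometry (mm and degrees), not taken from the paper.
a, b, c = 150.0, 300.0, 40.0
alpha_deg, beta_deg = 20.0, 90.0

ws = 2.5 * 9.81        # forearm weight (N), mass from the text
wq = 150.0             # load on the wrist (N)
ws_equiv = ws * a / b  # equivalent wrist force from Eq. (6)

f5 = fm_eq5(ws, wq, a, b, c, alpha_deg, beta_deg)
f7 = fm_eq7(ws_equiv, wq, b, c, alpha_deg, beta_deg)

# Doubling the total wrist load doubles the muscle force (linearity).
f7_double = fm_eq7(2 * ws_equiv, 2 * wq, b, c, alpha_deg, beta_deg)
```

Both expressions give the same muscle force, and scaling the total wrist load scales the force by the same factor, which is the linearity used in the rest of the section.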
According to this, the forces in the muscles when the load is different from the 150 N
applied in this study (including $W'_s$ in this 150 N) can be obtained from the relation
between the external load applied and the reaction in the muscle.
$$F_m = F_m(W, \alpha) \qquad (9)$$
where:
• $F_m$ is the force in the muscle (N);
• α is the angle of the elbow;
• $W = W_q + W'_s$ is the load applied on the wrist (N).
In all mathematical models of muscle efforts, the effort can be obtained for any
angle, α, and for any vertical load, $f_v$. If the forearm mass, $m_{fa}$, is considered and g is the
gravitational acceleration (9.81 m/s²), $W'_s$ can be expressed as follows:
$$W'_s = W_s \cdot \frac{a}{b} = m_{fa} \cdot \frac{a}{b} \cdot g \qquad (10)$$
Thus, for obtaining the effort:
$$\mathrm{Effort}(f_v) = F_m\left(f_v + m_{fa} \cdot \frac{a}{b} \cdot g,\ \alpha\right) \qquad (11)$$
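A minimal sketch of Equations (10) and (11) follows, assuming that the effort at a given angle scales linearly with the total wrist load relative to the 150 N reference case. The ratio a/b and the external load f_v are hypothetical; the forearm mass and the reference effort (long biceps at 74°, Table 2) come from the text.

```python
G = 9.81                  # gravitational acceleration (m/s^2)
FOREARM_MASS_KG = 2.5     # forearm mass from the text
REFERENCE_LOAD_N = 150.0  # total wrist load used in the simulations

def equivalent_forearm_load(a_over_b):
    """Equivalent forearm weight applied at the wrist, Eq. (10)."""
    return FOREARM_MASS_KG * a_over_b * G

def scaled_effort(effort_at_reference, total_wrist_load):
    """Effort at the same angle for another total load, assuming linearity."""
    return effort_at_reference * total_wrist_load / REFERENCE_LOAD_N

a_over_b = 0.5            # hypothetical geometric ratio
f_v = 100.0               # hypothetical external vertical load (N)

total_load = f_v + equivalent_forearm_load(a_over_b)
effort = scaled_effort(341.0, total_load)  # 341 N: long biceps at 74 degrees
```

This proportional scaling is only a first check; the fitted surfaces reported in the Results section need not reproduce it exactly.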
General equations will be obtained describing the effort in each muscle depending on
the angle of the joint and the load applied. These equations and their surface representations
are the results of this research. Both the equations and the surfaces have been
obtained using the curve fitting tool from MATLAB. Note that these effort
models for the five muscles involved apply only to isometric muscular contractions,
in which muscle length does not vary.
3. Results
The model analyzed includes all muscles in the elbow joint, but the post-processing
results are only applied to the biceps and triceps muscles.
Figure 11. Different elbow angle configuration: bone Von Mises stress distribution under biceps
loading (150 N on wrist) cases. Higher-quality versions of the figures are included in Appendix A.
Similarly, Figure 12 shows the Von Mises stresses on the bones due to the FEM
simulation of 150 N of vertical load (ascendant) on the wrist. Each of the images represents
the stress map resulting from the study of an isometric contraction in the elbow joint at the
angle indicated in the image for a load of 150 N applied to the wrist, vertically and upward.
[Figure 12 panels, units: MPa. Legible panel labels: 94°, 104°, 114°, 124°, 134°.]
Figure 12. Different elbow angle configuration: bone Von Mises stress distribution under triceps
loading (150 N on wrist) cases. Higher-quality versions of the figures are included in Appendix A.
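Table 2. Axial efforts (N) in main simulated muscles, 150 N applied on wrist. Isometric contractions
at each angle value considered.

| Elbow Angle (°) | Long Biceps Force (N) | Long Biceps Length (mm) | Short Biceps Force (N) | Short Biceps Length (mm) | Lateral Triceps Force (N) | Lateral Triceps Length (mm) | Medial Triceps Force (N) | Medial Triceps Length (mm) | Long Triceps Force (N) | Long Triceps Length (mm) |
|---|---|---|---|---|---|---|---|---|---|---|
| 44 | 244 | 373 | 96 | 310 | 148 | 217 | 70 | 211 | 269 | 284 |
| 54 | 330 | 375 | 136 | 312 | 128 | 216 | 144 | 209 | 312 | 282 |
| 64 | 341 | 383 | 152 | 320 | 212 | 215 | 115 | 208 | 340 | 281 |
| 74 | 341 | 391 | 161 | 328 | 208 | 215 | 119 | 208 | 347 | 281 |
| 84 | 340 | 399 | 168 | 336 | 235 | 214 | 150 | 207 | 340 | 280 |
| 94 | 333 | 407 | 170 | 344 | 263 | 214 | 186 | 207 | 347 | 279 |
| 104 | 302 | 415 | 181 | 351 | 233 | 213 | 164 | 207 | 331 | 279 |
| 114 | 301 | 422 | 159 | 359 | 233 | 213 | 176 | 206 | 307 | 278 |
| 124 | 274 | 429 | 143 | 365 | 213 | 211 | 170 | 204 | 271 | 276 |
| 134 | 261 | 435 | 95 | 371 | 177 | 207 | 150 | 201 | 226 | 273 |
| 144 | 185 | 439 | 87 | 375 | 131 | 203 | 120 | 197 | 175 | 269 |
| 154 | 121 | 442 | 46 | 379 | 79 | 199 | 82 | 192 | 119 | 264 |
| 164 | 52 | 445 | 1 | 381 | 24 | 194 | 40 | 188 | 58 | 260 |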
Table 3. The equations, developed as 6th-degree polynomial curves based on the general formulation
of Equation (4), have the following constant values for each muscle involved.
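A fit of the form of Equation (4) can be sketched from the long biceps column of Table 2. The paper reports using the MATLAB curve fitting tool; NumPy's least-squares polynomial fit is swapped in here, so the coefficients it produces are illustrative and are not the authors' published values.

```python
import numpy as np
from numpy.polynomial import Polynomial

# Elbow angles (degrees) and long biceps forces (N) from Table 2.
angles = np.arange(44, 165, 10)
long_biceps = np.array([244, 330, 341, 341, 340, 333, 302,
                        301, 274, 261, 185, 121, 52], dtype=float)

# Least-squares fit of a 6th-degree polynomial (fit is done on a scaled domain).
fit = Polynomial.fit(angles, long_biceps, deg=6)

# Coefficients a_0 ... a_6 expressed in powers of the angle itself.
coeffs = fit.convert().coef

# Coefficient of determination of the fit.
predicted = fit(angles)
ss_res = np.sum((long_biceps - predicted) ** 2)
ss_tot = np.sum((long_biceps - long_biceps.mean()) ** 2)
r_squared = 1.0 - ss_res / ss_tot
```

Fitting on the scaled domain and converting afterwards avoids the poor conditioning that raw powers of angles up to 164 would cause.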
Analogously, Figure 14 shows the efforts of the triceps muscles (dots) and the mathematical
curves (dash-dot line) that approximate them. The effort in the muscle (y-axis in the graph)
is related to the elbow angle (x-axis in the graph). In the figure, red is associated with the
medium triceps, green with the long triceps, and blue with the lateral triceps. These curves
can be used to determine the efforts in the three muscles involved in the triceps FEM model.
Additionally, as an example, if the effort in a muscle were limited for any reason, we could
determine which positions of the elbow joint are not advisable.
Thus, for an isometric contraction with 150 N as a vertical load on the wrist (including
the forearm mass) and a maximum effort of 200 N in the lateral triceps, it is necessary to avoid
placing the elbow joint between approximately 65° and 130°.
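The same kind of check can be made directly on the simulated values. The sketch below uses the lateral triceps column of Table 2 and lists the simulated angles at which the effort exceeds the 200 N limit; it works on the discrete simulation points, not on the fitted curve.

```python
# Elbow angles (degrees) and lateral triceps forces (N) from Table 2.
angles = list(range(44, 165, 10))
lateral_triceps = [148, 128, 212, 208, 235, 263, 233, 233, 213, 177, 131, 79, 24]

LIMIT_N = 200.0  # maximum admissible effort in the example

# Simulated angles at which the lateral triceps effort exceeds the limit.
exceeding = [angle for angle, force in zip(angles, lateral_triceps)
             if force > LIMIT_N]
```

The simulated points above the limit run from 64° to 124°, in line with the range of roughly 65° to 130° read from the fitted curve.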
Figure 13. Comparison of the calculated values (dots) and the polynomial trend equation (dash-dot
line) for biceps forces (y-axis). Isometric contractions of the elbow at different positions, from
44 to 164 degrees (x-axis), with 150 N loaded on the wrist.
Figure 14. Comparison of the calculated values (continuous line) and the polynomial trend equation
(dash-dot line) for triceps forces (y-axis). Isometric contractions of the elbow at different
positions, from 44 to 164 degrees (x-axis), with 150 N loaded on the wrist.
[Figure 15, coefficient panel: fitted polynomial coefficients with index pairs 00, 10, 01, 20, 11, 30, 21, 40 and 31, their confidence bounds, and goodness-of-fit statistics; the numeric values are not recoverable here.]
Figure 15. General formulation of short biceps efforts as a function of wrist load (W) and elbow angle
(α) and surface reconstruction of the proposed polynomial.
In the range of the example of the previous figure, 150–164°, the effort in the long biceps
could be larger than in the short biceps, reaching around 600 N. Figures 16–19 collect the
results of the surface reconstruction for the rest of the studied muscles.
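The fitted surfaces in Figures 15–19 are polynomials in the elbow angle and the wrist load. A minimal sketch of how such a surface could be evaluated follows, assuming that each coefficient index pair ij multiplies the term α^i·W^j (this pairing of indices to variables is an assumption) and using placeholder coefficients that are not the paper's fitted values.

```python
import numpy as np
from numpy.polynomial.polynomial import polyval2d

# Index pairs (i, j) of the nine coefficients listed in Figures 15-19.
INDEX_PAIRS = [(0, 0), (1, 0), (0, 1), (2, 0), (1, 1),
               (3, 0), (2, 1), (4, 0), (3, 1)]

def coefficient_matrix(p):
    """Arrange coefficients so that c[i, j] multiplies alpha**i * W**j."""
    c = np.zeros((5, 2))
    for i, j in INDEX_PAIRS:
        c[i, j] = p.get((i, j), 0.0)
    return c

def muscle_effort(alpha_deg, load_n, p):
    """Evaluate the polynomial surface at one angle and one wrist load."""
    return float(polyval2d(alpha_deg, load_n, coefficient_matrix(p)))

# Placeholder coefficients for illustration only (hypothetical values).
demo_coefficients = {(0, 0): 1.0, (1, 1): 2.0}
value = muscle_effort(3.0, 4.0, demo_coefficients)  # 1 + 2 * 3 * 4
```

Replacing the placeholder dictionary with the fitted coefficients of a given muscle would give its estimated effort for any angle and load within the fitted range.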
[Figure 16, coefficient panel: fitted polynomial coefficients with index pairs 00, 10, 01, 20, 11, 30, 21, 40 and 31, their confidence bounds, and goodness-of-fit statistics; the numeric values are not recoverable here.]
Figure 16. General formulation of long biceps efforts based on wrist load (W) and elbow angle (α)
and surface reconstruction of the proposed polynomial.
[Figure 17, coefficient panel: fitted polynomial coefficients with index pairs 00, 10, 01, 20, 11, 30, 21, 40 and 31, their confidence bounds, and goodness-of-fit statistics; the numeric values are not recoverable here.]
Figure 17. General formulation of lateral triceps efforts based on wrist load (W) and
elbow angle (α) and surface reconstruction of the proposed polynomial.
[Figure 18, coefficient panel: fitted polynomial coefficients with index pairs 00, 10, 01, 20, 11, 30, 21, 40 and 31, their confidence bounds, and goodness-of-fit statistics; the numeric values are not recoverable here.]
Figure 18. General formulation of long triceps efforts based on wrist load (W) and elbow angle (α)
and surface reconstruction of the proposed polynomial.
[Figure 19, coefficient panel: fitted polynomial coefficients with index pairs 00, 10, 01, 20, 11, 30, 21, 40 and 31, their confidence bounds, and goodness-of-fit statistics; the numeric values are not recoverable here.]
Figure 19. General formulation of the medium triceps efforts based on wrist load (W) and elbow
angle (α) and surface reconstruction of the proposed polynomial.
4. Discussion
In this study, an analytical calculation method is proposed to obtain the forces in the
flexor muscles (long and short biceps) and extensors (long, lateral, and middle triceps) in
any configuration of the opening angle of the elbow joint and for any load applied to the
wrist. The final objective of this study is to allow the creation of a new type of finite element
that can be included in a musculoskeletal model that can be analyzed in different positions
and configurations without the need to repeat the model meshing process.
There are several studies along the same lines. Martinez [12] focuses
on the analysis of a certain muscle in a certain load situation, analyzing what happens
in the muscle itself. Islan et al. [14] focus on the muscles involved in the shoulder
joint under static conditions, predicting the fatigue and cumulative damage to the muscles.
Alonso et al. [20] study the forces in the muscles involved in human gait under certain
load conditions, weight and step frequency. Park et al. [22] develop a predictive method for
muscle forces based on electromyography techniques. Teo et al. [26] focus on the effect
of a collision on the neck muscles. As can be seen, none of them addresses the problem from
our perspective. The finite element method is widely recognized as a calculation
tool for structural, static, and/or dynamic problems. However, a characterization of the
muscle that makes it easy to include in an FEM model has not been addressed until now.
The main drawbacks of the studies carried out so far, with regard to including
the muscle in a finite element model that is valid for analyzing the behavior of a muscle
within a joint in various positions, are as follows:
• Studies have focused on one joint position, obtaining results related to muscle effort
that are difficult to extrapolate to different positions. Ramírez [12] obtains the results
for the anterior tibial muscle under the study conditions, but transferring the results to other
load situations would be difficult.
• The equations obtained to characterize the muscle are difficult to implement in a FEM
model. Sachenkov et al. [29] use 1D elements to simulate muscles, but there is no single
equation for any muscle that can be implemented in another analysis. The same happens with
Alonso et al. [20], where the result is based on a static and physiological optimization
that integrates the forces of any Hill MTU unit.
• Several of these studies use 3D elements to represent the muscle, which allows a
better understanding of the internal behavior of the muscle itself (distribution of
stresses and strains, deformations, etc.) but does not allow the created mesh to be
reused in other positions of the joint. Thus, Martínez [12] studies the anterior tibial
muscle in a certain position, and if the muscle length changes, a new mesh must
be developed. Another example is the research by Islan et al. [14], who
analyze the shoulder muscles of a violin player holding the instrument; if the
shoulder position changes, the mesh changes as well, and the muscles must be remeshed in
their new position.
With this work, and as a first approximation, we have obtained the equations that
govern the behavior of the flexor and extensor muscles of the elbow, only for isometric
contractions, expressing this behavior as the force provided
throughout the flexion path of the joint for any load applied to the wrist.
For example, to calculate the estimated force that must appear in each of the joint
muscles to support a load of 235 N applied to the wrist with the elbow forming an angle of
119 degrees, we need to substitute the values of α and W in Equation (12), depending on
which muscle we want to analyze.
Substituting the indicated values in Equation (12) gives the following estimated
forces in the muscles.
Vertical downward load, force in the biceps muscles:
• Long biceps: 477 N;
• Short biceps: 245 N.
Upward vertical load, force in the triceps muscles:
• Long triceps: 438 N;
• Middle triceps: 231 N;
• Lateral triceps: 304 N.
In addition, the equations obtained to characterize the response of the muscles
are easy to implement mathematically, which will help in
the future creation of a new type of finite element that responds to these
behavioral equations.
With the creation of this new type of finite element to represent muscles, a very
interesting possibility opens up: analyzing the behavior of the joint in different positions,
with variable loads, different load cycles, etc., notably reducing the study work, since
the remeshing process of the model is eliminated by reusing the mesh for the
desired positions.
Regarding the analysis of the results obtained, it is worth mentioning the following
important aspects of the study.
Applying the proposed methodology and using the finite element method, we can
obtain the forces that must appear in the muscles involved in the elbow joint to keep it in
balance, in any position between 44 and 164 degrees of opening, with a load applied to the
wrist of 150 N (Table 2, Figures 13 and 14).
Through this same analysis it is possible to obtain the stresses that appear in the
insertions of the tendons of the joint, which allows us to foresee damage under certain load
situations (Figures 11 and 12).
This same analysis allows us to obtain a map of stresses in the rigid tissues of the joint,
bones, and cartilage. This allows a detailed study of damage to the elements mentioned
(Figures 11 and 12).
No previous studies are similar to the proposed one, so a direct comparison with
earlier work is not possible. However, the trend observed in muscle effort during joint
movement is similar to what other studies have published: in [34], the shape of the elbow
torque versus elbow angle curve is consistent with our results, and in [35], the relation
between the torque and the angle of the elbow is similar to our expressions. Although these
studies measure the moment in the joint (that is, the sum of the moments generated by each
muscle) and not the independent force in each muscle, the results presented in these
investigations are similar to those presented in this study in terms of the evolution of the
moment in the joint, or of the force in the muscle in our case, along the angle of the joint.
The correlation also presents high R values, which supports the validity of the analysis
(Figures 13 and 14).
Through the application of classical mechanics and the balance of forces and moments
in the joint, we can extrapolate the results obtained for a specific load, 150 N in our case, to
different loads applied to the joint (Figures 15–19).
Finally, it should be mentioned that the equations that predict the force in each of the
muscles in the joint are obtained as a function of the opening angle of the elbow and the
applied load.
5. Conclusions
The use of a simplified model of the muscle, as a 1D element with axial forces only
(ROD type element), in an FEA model allows one to calculate the forces that must be
generated in the muscles involved in the joint.
It is also possible to predict the force required in each muscle for different load situations.
On the basis of the same analysis, the stresses that appear in other elements of the joint
(bones, tendon insertions, or cartilage) can be obtained. With the characteristic
equations obtained, different lines of research are opened, focused on a better characterization
of the muscles. It seems necessary to evolve the equations obtained in this study to obtain
those that give the force of the muscle as a function of the speed of contraction.
Alternatively, based on studies conducted to assess muscle fatigue, they could be modified to
predict it in a moving joint.
The evolution of this study should follow these stages:
• Validation of the theoretical model through testing with individuals in
whom it is possible to measure the forces under biceps and/or triceps load, so that
comparison with this study allows its validity to be assessed.
• Modification of the equations obtained in this study in order to obtain their evolution
for concentric or eccentric contractions.
• Creation of a new type of 1D element for application in finite element models that
allows the behavior of a muscle to be characterized according to the equations obtained
for any type of contraction: isometric, concentric, or eccentric.
Once these phases are completed, the work could be extended to other types of
muscle (flat or penniform) in order to characterize more muscles and thus study new
joints according to the proposed methodology.
One limitation of the presented study is the approximation of the muscle as a 1D
element. This implies that this methodology cannot provide information regarding
what happens inside the muscle.
Another is that the analysis is performed for a certain size of elbow joint, so the
FEA model created is valid only for this joint size; in any other case (children, older people,
etc.), the FEA model should be resized, and the same methodology should be applied.
Author Contributions: Conceptualization, F.B.H. and J.A.J.M.; data curation, E.L.U., J.D.C.-M. and
R.D.; formal analysis, E.L.U., F.B.H. and R.D.; investigation, E.L.U., F.B.H. and J.A.J.M.; methodology,
F.B.H. and R.D.; project administration, F.B.H.; supervision, F.B.H., R.D. and J.A.J.M.; writing—original
draft, E.L.U. and J.D.C.-M.; writing—review and editing, R.D. All authors have read and agreed to the
published version of the manuscript.
Funding: This research received no external funding.
Institutional Review Board Statement: Not applicable.
Informed Consent Statement: Not applicable.
Data Availability Statement: Not applicable.
Conflicts of Interest: The authors declare no conflict of interest.
Appendix A
This appendix reproduces the sub-figures of Figures 11 and 12 at higher quality.
[Figure A1 panels, one per elbow angle: 44°, 54°, 64°, 74°, 84°, 94°, 104°, 114°, 124°, 134°, 144°, 154°, 164°.]
Figure A1. Different Elbow Angle Configuration: Bone Von Mises Stress Distribution Triceps Loading
Case (150 N on wrist).
[Figure A2 panels, one per elbow angle: 44°, 54°, 64°, 74°, 84°, 94°, 104°, 114°, 124°, 134°, 144°.]
Figure A2. Different Elbow Angle Configuration: Bone Von Mises Stress Distribution Triceps Loading
Case (150 N on wrist).
References
1. Strouboulis, T.; Babuška, I.; Whiteman, J.R. The Finite Element Method and Its Reliability; Clarendon Press: Oxford, UK, 2001;
ISBN 978-0-19-850276-0.
2. Babuska, I.; Whiteman, J.; Strouboulis, T. Finite Elements: An Introduction to the Method and Error Estimation; OUP: Oxford, UK,
2010; ISBN 978-0-19-850669-0.
3. Pawełko, P.; Jastrzębski, D.; Parus, A.; Jastrzębska, J. A new measurement system to determine stiffness distribution in machine
tool workspace. Arch. Civ. Mech. Eng. 2021, 21, 49. [CrossRef]
4. Wang, D.; Zhang, S.; Wang, L.; Liu, Y. Developing a Ball Screw Drive System of High-Speed Machine Tool Considering Dynamics.
IEEE Trans. Ind. Electron. 2021, 69, 4966–4976. [CrossRef]
5. Li, Z.; Oger, G.; Le Touzé, D. A partitioned framework for coupling LBM and FEM through an implicit IBM allowing non-
conforming time-steps: Application to fluid-structure interaction in biomechanics. J. Comput. Phys. 2021, 449, 110786. [CrossRef]
6. Della Rosa, N.; Bertozzi, N.; Adani, R. Biomechanics of external fixator of distal radius fracture, a new approach: Mutifix Wrist.
Musculoskelet. Surg. 2020, 106, 89–97. [CrossRef] [PubMed]
7. Zhang, N.-Z.; Xiong, Q.-S.; Yao, J.; Liu, B.-L.; Zhang, M.; Cheng, C.-K. Biomechanical changes at the adjacent segments induced
by a lordotic porous interbody fusion cage. Comput. Biol. Med. 2022, 143, 105320. [CrossRef] [PubMed]
8. Denozière, G.; Ku, D.N. Biomechanical Comparison between Fusion of Two Vertebrae and Implantation of an Artificial Interver-
tebral Disc. J. Biomech. 2006, 39, 766–775. [CrossRef] [PubMed]
9. Samani, A.; Bishop, J.; Yaffe, M.J.; Plewes, D.B. Biomechanical 3-D finite element modeling of the human breast using MRI data.
IEEE Trans. Med. Imaging 2001, 20, 271–279. [CrossRef]
10. Jaecques, S.; Van Oosterwyck, H.; Muraru, L.; Van Cleynenbreugel, T.; De Smet, E.; Wevers, M.; Naert, I.; Sloten, J.V. Individualised,
micro CT-based finite element modelling as a tool for biomechanical analysis related to tissue engineering of bone. Biomaterials
2003, 25, 1683–1696. [CrossRef]
11. Renner, S.M.; Natarajan, R.N.; Patwardhan, A.G.; Havey, R.M.; Voronov, L.I.; Guo, B.Y.; Andersson, G.B.; An, H.S. Novel model to
analyze the effect of a large compressive follower pre-load on range of motions in a lumbar spine. J. Biomech. 2007, 40, 1326–1332.
[CrossRef]
12. Martínez, A.M.R. Modelado y simulación del tejido músculo-esquelético. Validación Experimental con el Músculo Tibial Anterior
de Rata. Ph.D. Thesis, Universidad de Zaragoza, Zaragoza, Spain, 2011. Available online: http://purl.org/dc/dcmitype/Text.
(accessed on 5 November 2021).
13. Weiss, J.A.; Gardiner, J.C.; Ellis, B.J.; Lujan, T.J.; Phatak, N.S. Three-dimensional finite element modeling of ligaments: Technical
aspects. Med. Eng. Phys. 2005, 27, 845–861. [CrossRef]
14. Islan, M.; Carvajal, J.; Pedro, P.S.; D’Amato, R.; Juanes, J.A.; Soriano, E. Linear Approximation of the Behavior of the Rotator
Cuff under Fatigue Conditions. Violinist Case Study. In Proceedings of the ACM 5th International Conference on Technological
Ecosystems for Enhancing Multiculturality, Cádiz, Spain, 18–20 October 2017; p. 58.
15. Sachenkov, O.A.; Hasanov, R.F.; Andreev, P.S.; Konoplev, Y.G. Numerical Study of Stress-Strain State of Pelvis at the Proximal
Femur Rotation Osteotomy. Russ. J. Biomech. 2016, 20, 220–232. [CrossRef]
16. Martins, J.A.C.; Pato, M.P.M.; Pires, E.B. A finite element model of skeletal muscles. Virtual Phys. Prototyp. 2006, 1, 159–170.
[CrossRef]
17. Tang, C.; Tsui, C.; Stojanovic, B.; Kojic, M. Finite element modelling of skeletal muscles coupled with fatigue. Int. J. Mech. Sci.
2007, 49, 1179–1191. [CrossRef]
18. Syomin, F.A.; Tsaturyan, A.K. Mechanical model of the left ventricle of the heart approximated by axisymmetric geometry. Russ.
J. Numer. Anal. Math. Model. 2017, 32, 327–337. [CrossRef]
19. Perreault, E.J.; Sandercock, T.G.; Heckman, C.J. Hill Muscle Model Performance during Natural Activation and Electrical
Stimulation. In Proceedings of the 23rd Annual International Conference of the IEEE Engineering in Medicine and Biology
Society, Istanbul, Turkey, 25–28 October 2001; Volume 2, pp. 1248–1251.
20. Alonso, F.J.; Galán-Marín, G.; Salgado, D.R.; Pàmies Vilà, R.; Font Llagunes, J.M. Cálculo de Esfuerzos Musculares en la Marcha
Humana Mediante Optimización Estática-Fisiológica. In Proceedings of the XVIII Congreso Nacional de Ingeniería Mecánica,
Ciudad Real, Spain, 3–5 November 2010; pp. 1–9.
21. Holzbaur, K.R.S.; Murray, W.M.; Delp, S.L. A Model of the Upper Extremity for Simulating Musculoskeletal Surgery and
Analyzing Neuromuscular Control. Ann. Biomed. Eng. 2005, 33, 829–840. [CrossRef]
22. Park, W.-I.; Lee, H.-D.; Kim, J. Estimation of isometric joint torque from muscle activation and length in intrinsic hand muscle.
In Proceedings of the 2008 International Conference on Control, Automation and Systems, Seoul, Korea, 14–17 October 2008;
pp. 2489–2493. [CrossRef]
23. Soechting, J.F.; Flanders, M. Evaluating an Integrated Musculoskeletal Model of the Human Arm. J. Biomech. Eng. 1997,
119, 93–102. [CrossRef]
24. Zajac, F.E. Muscle and Tendon: Properties, Models, Scaling, and Application to Biomechanics and Motor Control. Crit. Rev.
Biomed Eng. 1989, 17, 359–411.
25. Lechosa Urquijo, E.; Blaya Haro, F.; D’Amato, R.; Juanes Méndez, J.A. Finite Element Model of an Elbow under Load, Muscle
Effort Analysis When Modeled Using 1D Rod Element. In Proceedings of the Eighth International Conference on Technological
Ecosystems for Enhancing Multiculturality, Salamanca, Spain, 21–23 October 2020; Association for Computing Machinery: New
York, NY, USA, 21 October 2020; pp. 475–482.
26. Teo, E.C.; Zhang, Q.H.; Qiu, T.X. Finite Element Analysis of Head-Neck Kinematics Under Rear-End Impact Conditions. In
Proceedings of the 2006 International Conference on Biomedical and Pharmaceutical Engineering, Singapore, 11–14 December
2006; pp. 206–209.
27. Donahue, T.L.H.; Hull, M.L.; Rashid, M.M.; Jacobs, C.R. A Finite Element Model of the Human Knee Joint for the Study of
Tibio-Femoral Contact. J. Biomech. Eng. 2002, 124, 273–280. [CrossRef]
28. Abidin, N.A.Z.; Kadir, M.R.A.; Ramlee, M.H. Three Dimensional Finite Element Modelling and Analysis of Human Knee
Joint-Model Verification. J. Phys. Conf. Ser. 2019, 1372, 012068. [CrossRef]
29. Sachenkov, O.A.; Hasanov, R.; Andreev, P.; Konoplev, Y. Determination of Muscle Effort at the Proximal Femur Rotation Osteotomy.
IOP Conf. Series: Mater. Sci. Eng. 2016, 158, 012079. [CrossRef]
30. Parekh, J.N. Using Finite Element Methods to Study Anterior Cruciate Ligament Injuries: Understanding the Role of ACL
Modulus and Tibial Surface Geometry on ACL Loading. Ph.D. Thesis, The University of Michigan, Ann Arbor, MI, USA, 2013.
31. CES EduPack Bulletin: January 2017. Available online: https://www.grantadesign.com/newsletters/ces-edupack-bulletin-ces-
edupack-2017-new-products-database-symposia-deadlines-shared-resources-webinars-and-more/ (accessed on 20 June 2022).
32. Bruno, S.; José, M.; Filomena, S.; Vítor, C.; Demétrio, M.; Karolina, B. The Conceptual Design of a Mechatronic System to Handle
Bedridden Elderly Individuals. Sensors 2016, 16, 725. [CrossRef] [PubMed]
33. Arcila Arango, J.C.; Cardona Nieto, D.; Giraldo, J.C. Abordaje Físico-Matemático Del Gesto Articular. Available online: https:
//www.efdeportes.com/efd171/abordaje-fisico-matematico-del-gesto-articular.htm (accessed on 5 November 2021).
34. Loss, J.F.; Candotti, C.T. Comparative Study between Two Elbow Flexion Exercises Using the Estimated Resultant Muscle Force.
Braz. J. Phys. Ther. 2008, 12, 502–510. [CrossRef]
35. Murray, W.M.; Delp, S.L.; Buchanan, T.S. Variation of Muscle Moment Arms with Elbow and Forearm Position. J. Biomech. 1995,
28, 513–525. [CrossRef]
Article
Miura-Ori Inspired Smooth Sheet Attachments for Zipper-Coupled Tubes
Dylan C. Webb 1, Elissa Reynolds 1, Denise M. Halverson 1,* and Larry L. Howell 2
1 Department of Mathematics, Brigham Young University, Provo, UT 84602, USA; dylancw2@byu.edu (D.C.W.);
elissar2@byu.edu (E.R.)
2 Department of Mechanical Engineering, Brigham Young University, Provo, UT 84602, USA; lhowell@byu.edu
* Correspondence: halverson@math.byu.edu
Abstract: Zipper-coupled tubes are a broadly applicable, deployable mechanism with an angular
surface that can be smoothed by attaching an additional smooth sheet pattern. The existing design
for the smooth sheet attachment, however, leaves small gaps that can only be covered by adding
flaps that unfold separately, limiting applicability in situations requiring a seamless surface and
simultaneous deployment. We provide a novel construction of the smooth sheet attachment that
unfolds simultaneously with zipper-coupled tubes to cover the entire surface without requiring
additional actuation and without inhibiting the tubes' motion up to an ideal, unfolded state of
stability. Furthermore, we highlight the mathematics underlying the design and motion of the new
smooth sheet pattern, thereby demonstrating its rigid-foldability and compatibility with asymmetric
zipper-coupled tubes.
Keywords: zipper-coupled tubes; Miura-ori pattern; deployable mechanism; origami inspired design;
smooth sheet attachment
MSC: 74-10; 51E24
Citation: Webb, D.C.; Reynolds, E.; Halverson, D.M.; Howell, L.L. Miura-Ori Inspired Smooth Sheet
Attachments for Zipper-Coupled Tubes. Mathematics 2022, 10, 2643. https://doi.org/10.3390/math10152643
Academic Editors: Higinio Rubio Alonso, Alejandro Bustos, Jesus Meneses Alonso and Enrique Soriano-Heras
Received: 29 June 2022
Accepted: 24 July 2022
Published: 28 July 2022
Publisher's Note: MDPI stays neutral with regard to jurisdictional claims in published maps and
institutional affiliations.
Copyright: © 2022 by the authors. Licensee MDPI, Basel, Switzerland. This article is an open access
article distributed under the terms and conditions of the Creative Commons Attribution (CC BY)
license (https://creativecommons.org/licenses/by/4.0/).
1. Introduction
Origami is the basis for many deployable mechanisms, including self-scaling, modular
robots [1], satellite reflectarray antennas that pack efficiently [2], and multimodal biomedical
devices that actuate electromagnetically [3]. Zipper-coupled tubes are multistable
origami structures that fold up compactly and unfold bidirectionally to fill space and resist
compression [4]. An asymmetric generalization of zipper-coupled tubes with smooth sheet
attachments was introduced previously [5]. Together, these origami-based mechanisms
form a deployable device with a smooth surface that is advantageous in applications, such
as prefab architecture, when drivability and walkability are important, and in smooth
medical devices, when sharp edges could harm the body. The smooth sheet attachment we
presented previously, however, does not fully cover the surface of zipper-coupled tubes
without additional flaps that actuate separately [5]. We rectify this problem by offering an
alternative, Miura-ori inspired [6–8], construction of a smooth sheet attachment that fully
covers the surface of, and deploys simultaneously with, asymmetric zipper-coupled tubes.
In this paper we briefly review the design of asymmetric zipper-coupled tubes and
then demonstrate how to (1) construct a Miura-ori inspired smooth sheet attachment
without gaps that attaches to the mountain folds of asymmetric zipper-coupled tubes and
(2) handle design variations in the symmetric case. We present a mathematically robust
design method, decomposing the attachment into pairs of compatible, tessellating cells
inspired by the Miura-ori pattern and then defining the cells' vertices throughout the
folding motion, thereby confirming rigid-foldability of the smooth sheet attachment and
highlighting the mathematical processes involved in mechanism design. Our Miura-ori-
based smooth sheet attachment expands the utility of asymmetric zipper-coupled tubes;
combined, these origami-inspired mechanisms are ideal for applications requiring a rigidly
deployable structure with a smooth surface.
$$\mathbf{x} = x\,\hat{\mathbf{x}}$$
Thus, the notation $\hat{\mathbf{x}}$ always represents a unit-length vector associated with a vector denoted
$\mathbf{x}$, where $\mathbf{x} = x\,\hat{\mathbf{x}}$.
The building blocks for zipper-coupled tubes are the compatible degree-four vertex
cells C1 and C2 illustrated in Figures 1 and 2. When combined, they form a single tube seg-
ment [5]. To couple correctly and satisfy rigid and flat-foldability [9], the design constraints
of these degree-four vertex cells include:
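$$\alpha_3 = \pi - \alpha_1, \qquad \alpha_1 + \alpha_3 = \alpha_2 + \alpha_4, \qquad \alpha_1 < \alpha_2,$$
$$\alpha_4 = \pi - \alpha_2, \qquad d \sin\alpha_4 = b \sin\alpha_1, \qquad \alpha_1 + \alpha_2 \le \pi.$$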
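A quick numerical check of these constraints can catch an invalid parameter choice before any geometry is built. The sketch below is illustrative only: the helper name `cell_parameters` is hypothetical, and it assumes the length constraint reads d sin α4 = b sin α1, so that d follows from b and the two free angles α1 and α2.

```python
import math

def cell_parameters(alpha1, alpha2, b):
    """Derive alpha3, alpha4 and d from alpha1, alpha2 and b (hypothetical helper).

    Assumes the constraints alpha3 = pi - alpha1, alpha4 = pi - alpha2 and
    d * sin(alpha4) = b * sin(alpha1) listed in the text.
    """
    if not (0.0 < alpha1 < alpha2):
        raise ValueError("need 0 < alpha1 < alpha2")
    if alpha1 + alpha2 > math.pi + 1e-12:
        raise ValueError("need alpha1 + alpha2 <= pi")
    alpha3 = math.pi - alpha1
    alpha4 = math.pi - alpha2
    d = b * math.sin(alpha1) / math.sin(alpha4)
    return alpha3, alpha4, d

# Example angles with alpha1 < alpha2 and alpha1 + alpha2 < pi (asymmetric case).
alpha3, alpha4, d = cell_parameters(math.pi / 3, 5 * math.pi / 9, b=1.0)
```

With α3 and α4 defined this way, α1 + α3 = α2 + α4 = π holds automatically, so only the two inequalities need to be tested explicitly.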
Figure 1. (a) Vectors defining the basic asymmetric degree-four vertex cell, C1 . (b) Vectors defining
the complementary degree-four vertex cell, C2 . (Adapted from [5]).
Figure 2. (a) Vertices of the basic asymmetric degree-four vertex cell, C1 . (b) Vertices of the comple-
mentary degree-four vertex cell, C2 . The points Xi∗ identify with Xi . (Adapted from [5]).
Like a Miura-ori cell, the basic degree-four vertex cell has one degree of freedom in
its motion. Let the basic and complementary cells lie flat in the xz-plane when unfolded,
as depicted in Figures 1 and 2, and let the angles between the xz-plane and panels σ1 and σ4 ,
respectively, be equal as σ1 and σ4 fold toward each other (see Figure 1a). Call this motion
parameter γ. Then, placing F0 at the origin and fixing F1 on the negative z-axis in R3 (see
Figure 2a), the motion of the basic cell is determined by the following vector paths:
where
The same vectors define the motion of both basic and complementary cells. By combin-
ing one basic cell (C1 ) and one complementary cell (C2 ), we obtain the first tube segment in a
zipper-coupled pair, illustrated in Figure 3a. Its vertices, identified with their corresponding
position vectors, are given by:
Figure 3. (a) The vertices in the first origami tube segment in the construction of an asymmetric
zipper-coupled tube segments pair. (b) The vertices in the second origami tube segment in the
construction of an asymmetric zipper-coupled tube segments pair. (Adapted from [5]).
The second tube segment is a copy of the first, but rotated 180° about the y-axis and
then shifted by an offset vector so that it attaches to the first tube segment along the vertical
creases (illustrated in Figure 3b). Let C3 and C4 denote the copies of C1 and C2, respectively,
comprising the second tube segment. To define the motion of the second tube segment, let
x̄ denote the 180° rotation of a vector x about the y-axis and define the offset vector:
$$\mathbf{s}(\gamma) = \left(1 + \frac{b\cos\alpha_1 + d\cos\alpha_4}{2a}\right)\mathbf{a}(\gamma) + \mathbf{c}(\gamma)$$
Then the vertices of the second tube segment are defined by:
E0 (γ) = d̄(γ) + s(γ) F0 (γ) = s(γ) G0 (γ) = b̄(γ) + s(γ)
E1 (γ) = ā(γ) + d̄(γ) + s(γ) F1 (γ) = ā(γ) + s(γ) G1 (γ) = ā(γ) + b̄(γ) + s(γ)
E2 (γ) = ā(γ) + c̄(γ) + d̄(γ) + s(γ) F2 (γ) = ā(γ) + c̄(γ) + s(γ) G2 (γ) = ā(γ) + b̄(γ) + c̄(γ) + s(γ)
E3 (γ) = c̄(γ) + d̄(γ) + s(γ) F3 (γ) = c̄(γ) + s(γ) G3 (γ) = b̄(γ) + c̄(γ) + s(γ)
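The vertex formulas above are sums of rotated cell vectors plus the offset, which is easy to mirror in code. The following is a minimal sketch, assuming the caller already has the vectors a, b, c, d and the offset s at one value of γ as 3-vectors; the names `rotate` and `second_tube_vertices` are hypothetical.

```python
import numpy as np

# 180-degree rotation about the y-axis: (x, y, z) -> (-x, y, -z).
ROT_Y_180 = np.diag([-1.0, 1.0, -1.0])

def rotate(v):
    """Return the 180-degree rotation of v about the y-axis."""
    return ROT_Y_180 @ np.asarray(v, dtype=float)

def second_tube_vertices(a, b, c, d, s):
    """Vertices of the second tube segment at one value of gamma.

    a, b, c, d are the unrotated cell vectors and s is the offset vector;
    all are supplied by the caller (their formulas are not reproduced here).
    """
    ab, bb, cb, db = (rotate(v) for v in (a, b, c, d))
    s = np.asarray(s, dtype=float)
    return {
        "E0": db + s, "F0": s.copy(), "G0": bb + s,
        "E1": ab + db + s, "F1": ab + s, "G1": ab + bb + s,
        "E2": ab + cb + db + s, "F2": ab + cb + s, "G2": ab + bb + cb + s,
        "E3": cb + db + s, "F3": cb + s, "G3": bb + cb + s,
    }
```

Because a 180° rotation is its own inverse, applying `rotate` twice returns the original vector, which is why rotating a barred vector recovers the unbarred one.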
The pair of tube segments, with vertices positioned as indicated above, form a single
component in a pair of zipper-coupled tubes (Figure 3b), which will be denoted as Z0 .
The zipper-coupled tubes can be extended by taking multiple copies of Z0 and attaching
them end-to-end. In particular, for i = 1, ..., n, let
$$Z_i = Z_0 + i\,(\mathbf{d} - \mathbf{b})$$
Then
$$Z = \bigcup_{i=0}^{n} Z_i$$
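The end-to-end extension is a pure translation of Z0 by integer multiples of d − b. A minimal sketch, assuming the vertices of Z0 at a fixed γ are stored as an (m, 3) array; the function name `extend_tubes` is hypothetical.

```python
import numpy as np

def extend_tubes(z0_vertices, b, d, n):
    """Return [Z_0, Z_1, ..., Z_n], where Z_i = Z_0 + i * (d - b).

    z0_vertices is an (m, 3) array of vertex positions of Z_0 at a fixed gamma;
    b and d are the cell vectors at the same gamma.
    """
    z0 = np.asarray(z0_vertices, dtype=float)
    step = np.asarray(d, dtype=float) - np.asarray(b, dtype=float)
    return [z0 + i * step for i in range(n + 1)]
```

The full structure Z is then the union (here, the list) of these translated copies.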
The value γ0 will be the terminal value for the deployment of Z with a smooth sheet
attachment. Thus, by construction, the smooth sheet attachment lies flat on the surface of
the zipper-coupled tubes when γ = γ0 and folds up with the zipper-coupled tubes until
γ = π/2, at which point the entire structure lies in a plane and has no volume.
Figure 4. (a) The top smooth sheet attachment (S1 is lightly shaded, S4 is darkly shaded, and co-
planar ridges are bolded). (b) Adding the top and bottom smooth sheet attachments to asymmetric
zipper-coupled tubes (co-planar ridges are bolded). (c) The bottom smooth sheet attachment (S3 is
lightly shaded, S2 is darkly shaded, co-planar ridges are bolded).
3.1. Design of S1
The first smooth sheet cell, S1 , has two panels that fold toward each other as the
zipper-coupled tubes fold up. We first discuss the design of S1 when built atop C4 —the
desired relation between S1 and C4 is shown in Figure 5a. The vectors that define S1 in this
temporary configuration will be re-used to define S4 , allowing the edges of these two cells
to mesh when placed opposite each other. After construction, the smooth sheet cell S1 will
be moved to attach to C1 , as seen in Figure 5b.
In preparation of the design of S1, let the displacement between G2(γ) and E2(γ) be
described by
$$\bar{\mathbf{q}}(\gamma) = \bar{\mathbf{d}}(\gamma) - \bar{\mathbf{b}}(\gamma)$$
and
$$\bar{\mathbf{q}}_{\perp}(\gamma) = \bar{\mathbf{q}}(\gamma) - \left(\bar{\mathbf{q}}(\gamma) \cdot \hat{\bar{\mathbf{c}}}(\gamma)\right)\hat{\bar{\mathbf{c}}}(\gamma)$$
As depicted in Figure 6a,b, q̄⊥(γ) is the component vector of q̄(γ) orthogonal to c̄(γ),
the displacement between E1(γ) and E2(γ) (or equivalently, the displacement between
G1(γ) and G2(γ)). In the ideal state of the zipper-coupled tubes, the distance between the
ridges of Z on which the smooth sheet will be attached is
$$\Delta = \left\|\bar{\mathbf{q}}_{\perp}(\gamma_0)\right\|$$
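The orthogonal component is a standard vector rejection, and Δ is its length in the ideal state. A minimal sketch, assuming q̄ and c̄ are available as NumPy 3-vectors at γ0; the function name `orthogonal_component` is hypothetical.

```python
import numpy as np

def orthogonal_component(q, c):
    """Return the part of q orthogonal to c (the vector rejection of q from c)."""
    q = np.asarray(q, dtype=float)
    c = np.asarray(c, dtype=float)
    c_hat = c / np.linalg.norm(c)
    return q - (q @ c_hat) * c_hat

# Illustrative values only, not taken from the paper.
q_bar = np.array([3.0, 4.0, 0.0])
c_bar = np.array([2.0, 0.0, 0.0])
q_perp = orthogonal_component(q_bar, c_bar)
delta = np.linalg.norm(q_perp)  # ridge spacing when evaluated at gamma_0
```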
Figure 6. (a) Vectors in S1 . (b) Vector components of ū and v̄. (c) Relation between h and w in the
ideal state.
Note that, of necessity and by design, the top edges of both S1 and S4 are parallel
with q̄(γ0 ). To remove gaps between zipper-coupled tubes stacked laterally to Z0 and in
anticipation of the behavior depicted in Figure 4c, we extended the side edges of S1 in the
direction of c̄ by the length ε, as shown in Figures 5a and 6a, where
$$\varepsilon = \frac{b \sin\alpha_1 \sin\gamma_0}{2\,|c_2(\gamma_0)|}$$
The smooth sheet cell is determined by the vectors ū, v̄, and w̄ illustrated in Figure 6a.
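As highlighted in Figure 6b, let
$$u_1 = \frac{d \sin\alpha_1 - b \sin\alpha_4 + \Delta}{2} \tag{1}$$
$$v_1 = \frac{b \sin\alpha_4 - d \sin\alpha_1 + \Delta}{2} \tag{2}$$
Note that Δ = u1 + v1. Examining the components of ū and v̄ in the ideal state,
as shown in Figure 6a,b, we see that $u_2 + v_2 = \|\bar{\mathbf{q}}(\gamma_0) - \bar{\mathbf{q}}_{\perp}(\gamma_0)\|$. Since
$$\frac{u_2}{u_2 + v_2} = \frac{u_1}{u_1 + v_1} = \frac{u_1}{\Delta} \quad\text{and}\quad \frac{v_2}{v_1} = \frac{u_2}{u_1}$$
it follows that
$$u_2 = \frac{u_1}{\Delta}\,\left\|\bar{\mathbf{q}}(\gamma_0) - \bar{\mathbf{q}}_{\perp}(\gamma_0)\right\| \tag{3}$$
$$v_2 = \frac{u_2}{u_1}\,v_1 \tag{4}$$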
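Equations (1)–(4) can be evaluated directly once Δ and the length of the component of q̄(γ0) parallel to c̄ are known. The sketch below is an illustration under those assumptions; the function name `sheet_components` and the argument `q_parallel` (standing for ||q̄(γ0) − q̄⊥(γ0)||) are hypothetical, and the numbers in the example are not from the paper.

```python
import math

def sheet_components(b, d, alpha1, alpha4, delta, q_parallel):
    """Evaluate Equations (1)-(4) for u1, v1, u2, v2.

    delta      -- ridge spacing in the ideal state
    q_parallel -- length of the component of q-bar parallel to c-bar at gamma_0
    """
    u1 = (d * math.sin(alpha1) - b * math.sin(alpha4) + delta) / 2.0
    v1 = (b * math.sin(alpha4) - d * math.sin(alpha1) + delta) / 2.0
    if u1 == 0.0:
        raise ValueError("u1 must be nonzero for the ratio in Equation (4)")
    u2 = u1 / delta * q_parallel
    v2 = u2 / u1 * v1
    return u1, v1, u2, v2

u1, v1, u2, v2 = sheet_components(
    b=1.0, d=1.2, alpha1=math.pi / 3, alpha4=4 * math.pi / 9,
    delta=0.8, q_parallel=0.3,
)
```

By construction the results satisfy u1 + v1 = Δ and u2 + v2 = `q_parallel`, which gives a cheap consistency check on the inputs.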
Let
where
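$$\lambda(\gamma) = \cos^{-1}\left(\frac{\left\|\bar{\mathbf{q}}_{\perp}(\gamma)\right\|^2 + u_1^2 - v_1^2}{2\,\left\|\bar{\mathbf{q}}_{\perp}(\gamma)\right\|\,u_1}\right)$$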
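The expression for λ(γ) has the form of the law of cosines for a triangle with side lengths u1, v1 and ||q̄⊥(γ)||, with λ the angle between the sides of length ||q̄⊥(γ)|| and u1. A minimal sketch of its evaluation follows; the function name `fold_angle` is hypothetical, and the clamping step is an added numerical safeguard, not part of the source.

```python
import math

def fold_angle(q_perp_norm, u1, v1):
    """Angle lambda from the law of cosines with sides q_perp_norm, u1, v1."""
    cos_lam = (q_perp_norm**2 + u1**2 - v1**2) / (2.0 * q_perp_norm * u1)
    # Clamp to [-1, 1] so rounding error cannot push acos out of its domain.
    cos_lam = max(-1.0, min(1.0, cos_lam))
    return math.acos(cos_lam)
```

In the ideal state ||q̄⊥(γ0)|| = Δ = u1 + v1, the triangle degenerates to a line and the formula gives λ(γ0) = 0, consistent with the sheet lying flat.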
In order to satisfy flat-foldability, the direction of the vector w̄ must be the same as the
direction of −c̄. Thus, for some positive constant w:
$$\bar{\mathbf{w}}(\gamma) = -w\,\hat{\bar{\mathbf{c}}}(\gamma)$$
The primary concern in choosing the length w was to avoid intersections with the
zipper-coupled tubes during deployment. So that the smooth sheet cells S1 and S4 lined up
correctly, we defined w and h to be the lengths necessary so that in the ideal state, the tips
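of the vectors $\bar{\mathbf{w}}$ (that is, $-w\,\hat{\bar{\mathbf{c}}}$) and $h\,\hat{\mathbf{c}}$ meet, as shown in Figure 6c. In particular, we set
$$h\,\hat{\mathbf{c}}(\gamma_0) + F_3(\gamma_0) = -w\,\hat{\bar{\mathbf{c}}}(\gamma_0) + E_2(\gamma_0) + \bar{\boldsymbol{\varepsilon}}(\gamma_0) - \bar{\mathbf{u}}(\gamma_0)$$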
$$\operatorname{proj}\,[x, y, z]^T = [x, y]^T$$
Remark 1. Note that the matrix that is inverted in Equation (5) will be singular only in the
symmetric case when c̄ has the same direction as −c.
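To see why the symmetric case is singular, it helps to write the condition on h and w as a 2×2 linear system in the projected coordinates. The sketch below is an assumption-laden illustration, not Equation (5) itself: it rearranges the vector condition that the tips of h ĉ and −w of the rotated unit vector meet into h·proj(ĉ) + w·proj(rotated ĉ) = proj(target), where `target` stands for E2 + ε̄ − ū − F3 at γ0. The names `proj`, `solve_lengths` and `cond_limit` are hypothetical.

```python
import numpy as np

def proj(v):
    """Drop the z-coordinate: [x, y, z] -> [x, y]."""
    return np.asarray(v, dtype=float)[:2]

def solve_lengths(c_hat, c_bar_hat, target, cond_limit=1e8):
    """Solve h * proj(c_hat) + w * proj(c_bar_hat) = proj(target) for (h, w)."""
    m = np.column_stack([proj(c_hat), proj(c_bar_hat)])
    if np.linalg.cond(m) > cond_limit:
        # Columns are (nearly) parallel, as in the symmetric case.
        raise ValueError("projected directions are (nearly) parallel")
    h, w = np.linalg.solve(m, proj(target))
    return float(h), float(w)

# Illustrative asymmetric example: the second direction is the first one
# rotated 180 degrees about the y-axis.
h, w = solve_lengths([0.6, 0.8, 0.0], [-0.6, 0.8, 0.0], [-0.6, 4.0, 0.0])
```

When the rotated direction equals the negative of the unrotated one, the two columns are parallel and the matrix cannot be inverted, which is the situation the remark describes.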
We have now stipulated all three vectors—ū, v̄, and w̄—which define S1 when it is
attached to C4 . What remains is to move S1 so that it attaches to C1 . The desired placement
is depicted in Figure 5b, where S1 is positioned on top of C1 and bridges Z0 and Z1 ; note
that the zipper-coupled tubes have been rotated about the y-axis in this figure so that S1
lies parallel with the xy-plane (see [5] for further details). Let X [i ] denote the point in Zi
that is a copy of X in Z0, for i ≥ 1. Then, the vertex F3∗ in S1 attaches to F3 in Z0 and the
vertex F3∗[1] in S1 attaches to F3[1] in Z1. More particularly, the edge P0F3∗ attaches to F0F3
and P0[1]F3∗[1] attaches to F0[1]F3[1].
Recall that the 180° rotation of a vector x̄ about the y-axis is denoted x. Thus, the vertices
in S1 when attached to C1 as desired are:
3.2. Design of S3
The cell S3 is a rotated copy of S1 attached to C3 , so the vectors that define S3 are
rotated copies of the vectors that define S1 , shifted by s. Let the vertices on smooth sheet
cells attached to the rotated tube segment in Z0 —i.e., the vertices in S2 and S3 —be denoted
with primes. Note that the smooth sheet cell S3 bridges Z0 and Z−1 , so the smooth sheet
cell contains the vertices F3 and F3 [−1]. Thus, the vertices of S3 when attached to C3 are:
F3(γ) = c̄(γ) + s(γ)
P0(γ) = s(γ) − ε̄(γ)
O0(γ) = s(γ) − ε̄(γ) + ū(γ)
O3(γ) = s(γ) − ε̄(γ) + ū(γ) − w̄(γ)
F3[−1](γ) = c̄(γ) + s(γ) − b̄(γ) + d̄(γ)
P0[−1](γ) = s(γ) − ε̄(γ) − b̄(γ) + d̄(γ)
3.3. Design of S4
The smooth sheet cell S4 attaches on top of C4 and fits together with S1 in the ideal
state, as illustrated in Figure 7. We make the edge of S4 opposite of S1 straight in the ideal
state so that zipper-coupled tubes with smooth sheet attachments can be stacked laterally
without gaps. From another point of view, the edge is made straight in preparation of the
design of S2 , a copy of S4 attached to the bottom of Z0 —the straight edges of S2 and S3
meet in the ideal state, as illustrated in Figure 4c.
The smooth sheet cell S4 has a degree-four vertex folding pattern inspired by the
Miura-ori cell, as shown in Figure 8a. This allows the cell to close the gap on the top of
Z0 in the ideal state and fold up without intersecting the adjacent tube segment. For
flat-foldability of the cell, we require the sum of opposite angles at the interior vertex to be
180° (see the Kawasaki-Justin theorem [11,12]). The pattern in Figure 8a is described by the
previously defined vectors ū, v̄, and w̄ and the yet-to-be-defined vectors r̄, t̄, h̄, f̄, and ḡ.
In the ideal state, r̄ and t̄ are the projections of b and d, respectively, into the xy-plane;
this is necessary to ensure flat foldability. Thus (see Figures 3 and 8a),
β 2 = ∠ P1 Q1 Q2 = ∠ F3 G1 G2 and β 3 = ∠ P1 O1 O2 = ∠ F3 E1 E2
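In particular,
$$\beta_2 = \cos^{-1}\left(-\hat{\bar{\mathbf{c}}}(\gamma_0) \cdot \frac{\mathbf{d}(\gamma_0) - s_0\,\mathbf{a}(\gamma_0)}{\left\|\mathbf{d}(\gamma_0) - s_0\,\mathbf{a}(\gamma_0)\right\|}\right)$$
$$\beta_3 = \cos^{-1}\left(-\hat{\bar{\mathbf{c}}}(\gamma_0) \cdot \frac{\mathbf{b}(\gamma_0) - s_0\,\mathbf{a}(\gamma_0)}{\left\|\mathbf{b}(\gamma_0) - s_0\,\mathbf{a}(\gamma_0)\right\|}\right)$$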
Having derived these angles, we are now ready to define r̄ and t̄ as follows (see
Equations (1)–(4)):
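$$\bar{\mathbf{r}}(\gamma) = -\left(\frac{u_1}{\tan\beta_3} - u_2\right)\hat{\bar{\mathbf{c}}}(\gamma) + \bar{\mathbf{u}}(\gamma)$$
$$\bar{\mathbf{t}}(\gamma) = -\left(\frac{v_1}{\tan\beta_2} + v_2\right)\hat{\bar{\mathbf{c}}}(\gamma) + \bar{\mathbf{v}}(\gamma)$$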
Observing Figure 8a, note that r and t can be expressed simply as:
$$r = \frac{u_1}{\sin\beta_3}, \qquad t = \frac{v_1}{\sin\beta_2}$$
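These lengths are ordinary right-triangle relations: a segment inclined at angle β to the crease direction with perpendicular extent u1 has length u1/sin β and extent u1/tan β along the crease. A short sketch that evaluates the lengths and allows the Pythagorean identity to be checked numerically; the function name `crease_lengths` and the sample values are hypothetical.

```python
import math

def crease_lengths(u1, v1, beta2, beta3):
    """Return (r, t) with r = u1 / sin(beta3) and t = v1 / sin(beta2)."""
    r = u1 / math.sin(beta3)
    t = v1 / math.sin(beta2)
    return r, t

# Illustrative values only.
u1, v1 = 0.3, 0.5
beta2, beta3 = math.radians(70.0), math.radians(55.0)
r, t = crease_lengths(u1, v1, beta2, beta3)

# Perpendicular and along-crease extents recombine to the same lengths.
r_check = math.hypot(u1, u1 / math.tan(beta3))
t_check = math.hypot(v1, v1 / math.tan(beta2))
```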
In the ideal state, the crease defined by h̄ must have the same direction as −c to satisfy
flat-foldability. Moreover, so that there are no gaps when S1 and S4 come together in the
ideal state, the length of h̄ should be the value h given by Equation (5), according to the
premise upon which Equation (5) was derived (see also Figure 6c). Thus,
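$$\bar{\mathbf{h}}(\gamma_0) = -h\,\hat{\mathbf{c}}(\gamma_0)$$
For an arbitrary parameter value γ, the unit vectors adjacent to $\hat{\bar{\mathbf{h}}}(\gamma)$ that emanate from
the degree-four vertex in the interior of S4 are $\hat{\bar{\mathbf{r}}}(\gamma)$ and $\hat{\bar{\mathbf{t}}}(\gamma)$. Because opposite angles in a
degree-four vertex sum to 180° [11,12] and we require a rigid folding, h̄(γ) is determined
by the following system of equations:
$$\hat{\bar{\mathbf{h}}}(\gamma) \cdot \hat{\bar{\mathbf{r}}}(\gamma) = \cos\left(\pi - \cos^{-1}\left(\hat{\bar{\mathbf{c}}}(\gamma) \cdot \hat{\bar{\mathbf{t}}}(\gamma)\right)\right)$$
$$\hat{\bar{\mathbf{h}}}(\gamma) \cdot \hat{\bar{\mathbf{t}}}(\gamma) = \cos\left(\pi - \cos^{-1}\left(\hat{\bar{\mathbf{c}}}(\gamma) \cdot \hat{\bar{\mathbf{r}}}(\gamma)\right)\right)$$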
The first two equations are linear and the third is quadratic. Hence, there are precisely
two solutions: one corresponding to a valley fold assignment and one corresponding
to a mountain fold assignment. The solution corresponding to a mountain fold is the
correct solution.
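A system of two linear dot-product constraints plus one quadratic constraint on a 3-vector can be solved in closed form. The sketch below assumes the quadratic equation is the unit-length condition on the unknown direction, which is an assumption on our part since that equation is not reproduced above; the function name `unit_vectors_from_dots` is hypothetical. It returns both candidates and leaves the mountain/valley choice to the caller.

```python
import numpy as np

def unit_vectors_from_dots(r_hat, t_hat, dot_r, dot_t, tol=1e-12):
    """Unit vectors h with h . r_hat = dot_r and h . t_hat = dot_t.

    Returns a list of zero or two candidates (they coincide when the
    solution is unique).
    """
    r_hat = np.asarray(r_hat, dtype=float)
    t_hat = np.asarray(t_hat, dtype=float)
    normal = np.cross(r_hat, t_hat)
    normal_len = np.linalg.norm(normal)
    if normal_len < tol:
        raise ValueError("r_hat and t_hat are parallel")
    normal /= normal_len
    # Minimum-norm solution of the two linear equations; it lies in the
    # plane spanned by r_hat and t_hat, so it is orthogonal to `normal`.
    constraints = np.vstack([r_hat, t_hat])
    in_plane = np.linalg.pinv(constraints) @ np.array([dot_r, dot_t])
    remainder = 1.0 - in_plane @ in_plane
    if remainder < -tol:
        return []  # no unit vector satisfies both dot products
    offset = np.sqrt(max(remainder, 0.0))
    return [in_plane + offset * normal, in_plane - offset * normal]
```

Since cos(π − cos⁻¹ x) = −x, the right-hand sides of the two linear equations are simply the negated dot products of the rotated unit crease vector with the other two unit vectors, so they can be passed in as `dot_r` and `dot_t` directly.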
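Because ḡ corresponds to an edge of the panel defined by h̄ and t̄, we can define it in
terms of these vectors. We want the position of vertex Q4 in S4 to equal that of O3 in S1
when in the ideal state. This is equivalent to saying that $g_h\,\hat{\bar{\mathbf{h}}}$ and $-g_t\,\hat{\bar{\mathbf{t}}}$ define the same point
when the former is extended from Q1 and the latter is extended from O3 in the ideal state,
as shown in Figure 8b. The reader will recognize that this problem is formulated similarly
to that in the end of Section 3.1, where the lengths h and w were computed using a system
of equations based on two intersecting vectors. We employ the same technique, with the
following system of equations:
$$g_h\,\hat{\bar{\mathbf{h}}}(\gamma_0) + Q_1(\gamma_0) = -g_t\,\hat{\bar{\mathbf{t}}}(\gamma_0) + O_3(\gamma_0)$$
giving us
$$[g_h, g_t]^T = \left[\operatorname{proj}\hat{\bar{\mathbf{h}}}(\gamma_0),\ \operatorname{proj}\hat{\bar{\mathbf{t}}}(\gamma_0)\right]^{-1} \operatorname{proj}\left(-\boldsymbol{\varepsilon} + \mathbf{u} - \mathbf{w} - \bar{\mathbf{a}} - \bar{\mathbf{c}} - \bar{\mathbf{d}} - \mathbf{s} - \bar{\boldsymbol{\varepsilon}} + \bar{\mathbf{u}} - \bar{\mathbf{w}} - \bar{\mathbf{t}}\right)(\gamma_0)$$
We solve for $g_h$ and $g_t$ and use these components of projection to define ḡ, using $\hat{\bar{\mathbf{h}}}$ and
$\hat{\bar{\mathbf{t}}}$ as a basis:
$$\bar{\mathbf{g}}(\gamma) = g_h\,\hat{\bar{\mathbf{h}}}(\gamma) + g_t\,\hat{\bar{\mathbf{t}}}(\gamma)$$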
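The key point of this construction is that the two coefficients are fixed once, in the ideal state, and then reused with the moving unit vectors at every γ. A minimal sketch of that two-step pattern, assuming the unit vectors and the target displacement (O3 − Q1 at γ0) are supplied by the caller; the function names `basis_coefficients` and `combine` are hypothetical.

```python
import numpy as np

def basis_coefficients(e1_ideal, e2_ideal, displacement_ideal):
    """Coefficients (k1, k2) with k1*e1 + k2*e2 = displacement in xy-projection."""
    m = np.column_stack([np.asarray(e1_ideal, dtype=float)[:2],
                         np.asarray(e2_ideal, dtype=float)[:2]])
    return np.linalg.solve(m, np.asarray(displacement_ideal, dtype=float)[:2])

def combine(coeffs, e1, e2):
    """Rebuild the edge vector from fixed coefficients and current unit vectors."""
    return coeffs[0] * np.asarray(e1, dtype=float) + coeffs[1] * np.asarray(e2, dtype=float)

# Illustrative ideal-state data (flat in the xy-plane), not from the paper.
coeffs = basis_coefficients([1.0, 0.0, 0.0], [0.6, 0.8, 0.0], [2.3, 0.4, 0.0])
# The same coefficients applied to the unit vectors at some other gamma.
g_folded = combine(coeffs, [0.0, 0.0, 1.0], [0.0, 1.0, 0.0])
```

Keeping the coefficients constant is what makes the edge a fixed line on the rigid panel spanned by the two unit vectors.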
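We define f̄ similarly, solving for $f_h$ and $f_r$ via the same method and another system of
equations:
$$f_h\,\hat{\bar{\mathbf{h}}}(\gamma_0) + O_1(\gamma_0) = -f_r\,\hat{\bar{\mathbf{r}}}(\gamma_0) + O_3[-1](\gamma_0)$$
This gives us
$$[f_h, f_r]^T = \left[\operatorname{proj}\hat{\bar{\mathbf{h}}}(\gamma_0),\ \operatorname{proj}\hat{\bar{\mathbf{r}}}(\gamma_0)\right]^{-1} \operatorname{proj}\left(-\boldsymbol{\varepsilon} + \mathbf{u} - \mathbf{w} - \bar{\mathbf{a}} - \bar{\mathbf{b}} - \bar{\mathbf{c}} - \mathbf{s} - \bar{\boldsymbol{\varepsilon}} + \bar{\mathbf{u}} - \bar{\mathbf{w}} - \bar{\mathbf{r}}\right)(\gamma_0)$$
Thus, we have
$$\bar{\mathbf{f}}(\gamma) = f_h\,\hat{\bar{\mathbf{h}}}(\gamma) + f_r\,\hat{\bar{\mathbf{r}}}(\gamma)$$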
Now that we have defined all the essential vectors in S4 , we are ready to attach the
degree-four vertex to C4 in the zipper-coupled tubes structure. In the ideal state, this
smooth sheet cell matches exactly with the edges of S1 , providing a smooth surface devoid
of gaps on the top of asymmetric zipper-coupled tubes (Figure 4a). In particular, the edge
O1 O2 attaches to E1 E2 and Q1 Q2 attaches to G1 G2 . Likewise, the edges O1 O4 and Q1 Q4
align perfectly with adjacent cells in the ideal state and fold up at different rates to avoid
intersections. The vertices in S4 identified with their corresponding position vectors are:
O1 (γ) = ā(γ) + c̄(γ) + d̄(γ) + s(γ) + ε̄ε (γ) − ū(γ) + w̄(γ) + r̄(γ)
O2 (γ) = ā(γ) + c̄(γ) + d̄(γ) + s(γ) + ε̄ε (γ)
O4 (γ) = ā(γ) + c̄(γ) + d̄(γ) + s(γ) + ε̄ε (γ) − ū(γ) + w̄(γ) + r̄(γ) + f̄(γ)
P1 (γ) = ā(γ) + c̄(γ) + d̄(γ) + s(γ) + ε̄ε (γ) − ū(γ) + w̄(γ)
P2 (γ) = ā(γ) + c̄(γ) + d̄(γ) + s(γ) + ε̄ε (γ) − ū(γ)
P4 (γ) = ā(γ) + c̄(γ) + d̄(γ) + s(γ) + ε̄ε (γ) − ū(γ) + w̄(γ) + h̄(γ)
Q1 (γ) = ā(γ) + c̄(γ) + d̄(γ) + s(γ) + ε̄ε (γ) − ū(γ) + w̄(γ) + t̄(γ)
Q2 (γ) = ā(γ) + b̄(γ) + c̄(γ) + s(γ) + ε̄ε (γ)
Q4 (γ) = ā(γ) + c̄(γ) + d̄(γ) + s(γ) + ε̄ε (γ) − ū(γ) + w̄(γ) + t̄(γ) + ḡ(γ)
3.4. Design of S2
The smooth sheet cell S2 is a rotated copy of S4 that attaches to C2 . The vertices in this
smooth sheet cell are
Q1 ( γ ) = a( γ ) + c( γ ) + d( γ ) + ε ( γ ) − u( γ ) + w( γ ) + t( γ )
Q2 ( γ ) = a( γ ) + b( γ ) + c( γ ) + ε ( γ )
Q4 ( γ ) = a( γ ) + c( γ ) + d( γ ) + ε ( γ ) − u( γ ) + w( γ ) + t( γ ) + g( γ )
We have now completed the details for the smooth sheet attachment in the asymmetric
case; a summary of the edges and vertices in the attachment is given in Tables A1 and A2,
suppressing γ for concision. This attachment folds up flat and actuates with the zipper-
coupled tubes structure to form a smooth surface, leaving no gaps between the various
asymmetric cells we have described. The symmetric case merits more discussion, however,
because there are fewer constraints on the vectors in the Si , allowing for multiple rigidly
foldable patterns given specific design parameters.
The design parameters for the smooth sheet cells Si are uniquely determined for all
cases where α1 + α2 < π. When α1 + α2 = π, however, there is no longer a unique solution
to Equation (5). In particular, c̄(γ0 ) has the same direction as −c(γ0 ) in the symmetric case,
so the matrix inverted in Equation (5) is singular and the values h and w are not uniquely
defined. Similarly, β 2 and β 3 are no longer constrained, and we may define these features
of the design problem advantageously by choosing a solution that minimizes the amount
by which the smooth sheet attachment protrudes from the structure when folded.
In the design of S1 and S3 , let w∗ replace the value of w. Likewise, in the design of
S4 and S2 , let w∗∗ replace the value of w. As highlighted in Figure 9, for the symmetric
case we no longer require that w∗ = w∗∗ . In selecting a value for w∗ , we set it as large as
possible to maximize the surface area of S1 , thus minimizing the amount by which the
edges of S4 can protrude from the zipper-coupled tubes structure. Applying the analysis
given in [5] (see Section 7.1.3), the largest value for w∗ can be shown to be
$$w^{*} = c - \frac{d \sin\alpha_1 + b \sin\alpha_4 - \Delta}{2 \tan(\alpha_2 - \alpha_1)} - \varepsilon$$
The only requirements for h, w∗∗ , β 2 , and β 3 in the symmetric case are
h + w∗∗ = c − ε and β2 = β3
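In the symmetric case the designer picks h (or w∗∗) freely, subject to h + w∗∗ = c − ε, while w∗ follows from the closed-form bound. The sketch below evaluates both; it is an illustration under stated assumptions, with a hypothetical function name `symmetric_lengths`, α4 taken as π − α2, and example numbers that are not from the paper. It does not check for intersections with the structure underneath, which the text notes must still be avoided.

```python
import math

def symmetric_lengths(b, c, d, alpha1, alpha2, delta, eps, h):
    """Return (w_star, w_star_star) for a chosen crease length h.

    w_star      -- largest admissible w for S1 and S3, from the closed form
    w_star_star -- value forced by the requirement h + w_star_star = c - eps
    """
    alpha4 = math.pi - alpha2
    numerator = d * math.sin(alpha1) + b * math.sin(alpha4) - delta
    w_star = c - numerator / (2.0 * math.tan(alpha2 - alpha1)) - eps
    w_star_star = c - eps - h
    return w_star, w_star_star

# Symmetric example: alpha1 + alpha2 = pi.
w_star, w_star_star = symmetric_lengths(
    b=1.0, c=2.0, d=1.0, alpha1=math.pi / 3, alpha2=2 * math.pi / 3,
    delta=0.5, eps=0.1, h=0.7,
)
```

A numerical optimizer could wrap this function to search over h for the choice that minimizes protrusion, as the text suggests.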
Adjustments to the values h, w∗∗ , β 2 , and β 3 can also assist in minimizing the protru-
sion of S4 from the zipper-coupled tubes. Optimal values can be determined by numerical
methods according to the specific design application. However, care should be taken in
making these adjustments to avoid intersections with the structure underneath.
On a final note, although f̄ and ḡ are determined after defining the previous quantities,
there is a convenient simplification in their definition: because S4 is symmetric, the vectors f̄ and ḡ
are parallel to h̄ and have equal lengths. Moreover, in the ideal state,
Therefore,
$$g_t = f_r = 0 \quad\text{and}\quad g_h = f_h = (2c - w^{*}) - w^{**} + \frac{u_1}{\tan\beta_3}$$
5. Conclusions
We have successfully defined a smooth sheet attachment that folds up with the zipper-
coupled tubes and, without inhibiting their motion, unfolds to the ideal state to form a flat
surface without any gaps (Figure 10). This pattern is defined for both the asymmetric and
symmetric cases, and we provide access to code which the reader may use to visualize
the origami structures described and print out the corresponding fold patterns:
https://github.com/dylanwebbc/azct (accessed on 23 July 2022).
Figure 10. Model of asymmetric zipper-coupled tubes with a Miura-ori inspired smooth sheet
attachment at different stages of unfolding; the nature of the deployment of the degree-four vertex
cells is clearly observable on the bottom of the device. Parameters: α1 = π/3, α2 = 5π/9, a = b = c.
Dimensions (inches): 4.9 by 3.3 when folded, 7.4 by 3.5 by 1.5 when unfolded.
Note that the smooth sheet cells protrude from the zipper-coupled tubes structure
when folded. When gaps are tolerable and the folded state must be minimized for trans-
portation, constraining the cells to fold up within the zipper-coupled tubes while max-
imizing surface area in the ideal state results in the smooth sheet attachment described
previously [5]. Thus, the smooth sheet design can be tailored to the situation, much like
zipper-coupled tubes themselves. To inform future applications of these structures, we sug-
gest dynamic and quasi-static analyses. Constructing a device for architectural applications
will likely require the use of thick origami and compliant hinges, and remote self-actuation
via magnetism or heat could be useful in space or medical applications [13–15].
Miura-ori-inspired smooth sheet attachments enhance the utility of zipper-coupled
tubes in various situations. The tubes are useful in architecture because they pack tightly
and deploy to a rigid state [4]; our gapless smooth sheet attachments improve existing
designs by increasing drivability and walkability. If a local bridge collapses, for example,
a prefabricated bridge based on zipper-coupled tubes with smooth sheet attachments
can easily be transported on a single vehicle and swiftly deployed on-site to provide
smooth, emergency transit. Space structures are another popular application of origami-
inspired mechanisms; the Miura-ori pattern that the smooth sheet is based on is common
in deployable space array design. Accommodating material thickness, however, makes
Miura-ori sheets challenging to deploy [16]. In contrast, a thin solar array constructed
from Miura-ori inspired smooth sheet cells can deploy rigidly because it is supported by
zipper-coupled tubes.
The design of smooth sheet attachments without gaps is key to the development of
more versatile zipper-coupled tubes. We have communicated a clear design method for the
origami-based structure, examining the mathematics of its motion in detail. By elucidating
the possibility for further enhancements on the zipper-coupled tubes structure, we hope to
spur many novel and exciting applications beyond those mentioned.
Appendix A
Table A1. Defining the edges in Miura-ori based, smooth sheet attachments for a pair of asymmetric
zipper-coupled tubes.
S1 S2 S3 S4
O0 P0 O0 P0 [1] O1 P1 O2 P2 O4 P4 O0 P0 O0 P0 [−1] O1 P1 O2 P2 O4 P4
O3 F3 O3 F3 [1] P1 Q1 P2 Q2 P4 Q4 O3 F3 O3 F3 [−1] P1 Q1 P2 Q2 P4 Q4
O0 O3 P0 F3 O1 O2 O1 O4 P1 P2 O0 O3 P0 F3 O1 O2 O1 O4 P1 P2
P0 [1] F3 [1] P1 P4 Q1 Q2 Q1 Q4 P0 [−1] F3 [−1] P1 P4 Q1 Q2 Q1 Q4
Table A2. Defining the vertices in Miura-ori inspired smooth sheet attachments for a pair of asym-
metric zipper-coupled tubes.
O0        −ε + u                          O0         s − ε̄ + ū
O1        a + c + d + ε − u + w + r       O1         ā + c̄ + d̄ + s + ε̄ − ū + w̄ + r̄
O2        a + c + d + ε                   O2         ā + c̄ + d̄ + s + ε̄
O3        −ε + u − w                      O3         s − ε̄ + ū − w̄
O4        a + c + d + ε − u + w + r + f   O4         ā + c̄ + d̄ + s + ε̄ − ū + w̄ + r̄ + f̄
P0        −ε                              P0         s − ε̄
P1        a + c + d + ε − u + w           P1         ā + c̄ + d̄ + s + ε̄ − ū + w̄
P2        a + c + d + ε − u               P2         ā + c̄ + d̄ + s + ε̄ − ū
P4        a + c + d + ε − u + w + h       P4         ā + c̄ + d̄ + s + ε̄ − ū + w̄ + h̄
Q1        a + c + d + ε − u + w + t       Q1         ā + c̄ + d̄ + s + ε̄ − ū + w̄ + t̄
Q2        a + b + c + ε                   Q2         ā + b̄ + c̄ + s + ε̄
Q4        a + c + d + ε − u + w + t + g   Q4         ā + c̄ + d̄ + s + ε̄ − ū + w̄ + t̄ + ḡ
F3        c                               F3         c̄ + s
F3 [1]    c − b + d                       F3 [−1]    c̄ + s − b̄ + d̄
P0 [1]    −ε − b + d                      P0 [−1]    s − ε̄ − b̄ + d̄
References
1. Mena, L.; Muñoz, J.; Monje, C.A.; Balaguer, C. Modular and Self-Scalable Origami Robot: A First Approach. Mathematics 2021, 9,
1324. [CrossRef]
2. Rubio, A.J.; Kaddour, A.S.; Georgakopoulos, S.V.; Ynchausti, C.; Magleby, S.; Howell, L.L. A Deployable Hexagonal Reflectarray
Antenna for Space Applications. In Proceedings of the 2021 United States National Committee of URSI National Radio Science
Meeting (USNC-URSI NRSM), Boulder, CO, USA, 4–9 January 2021; pp. 136–137. [CrossRef]
3. Zhang, F.; Li, S.; Shen, Z.; Cheng, X.; Xue, Z.; Zhang, H.; Song, H.; Bai, K.; Yan, D.; Wang, H.; et al. Rapidly deployable and
morphable 3D mesostructures with applications in multimodal biomedical devices. Proc. Natl. Acad. Sci. USA 2021, 118,
e2026414118. [CrossRef] [PubMed]
4. Filipov, E.T.; Tachi, T.; Paulino, G.H. Origami tubes assembled into stiff, yet reconfigurable structures and metamaterials. Proc.
Natl. Acad. Sci. USA 2015, 112, e1509465112. [CrossRef] [PubMed]
5. Webb, D.C.; Elissa, R.; Halverson, D.M.; Howell, L.L. Deployable Space-Filling Mechanisms: Asymmetric Zipper-coupled Tubes
and Smooth Sheet Attachments. In Proceedings of the ASME 2022 International Design Engineering Technical Conferences and
Computers and Information in Engineering Conference, St. Louis, MO, USA, 14–17 August 2022; Forthcoming.
6. Miura, K. The Science of Miura-Ori: A Review. In Origami 4; Lang, R.J., Ed.; A K Peters/CRC Press: Boca Raton, FL, USA, 2009;
Chapter 4, pp. 87–99.
7. Miura, K. Method of Packaging and Deployment of Large Membranes in Space. Inst. Space Astronaut. Sci. Rep. 1985, 618, 1–9.
8. Miura, K.; Natori, M. 2-D Array Experiment on Board a Space Flyer Unit. Space Sol. Power Rev. 1985, 5, 345–356.
9. Zimmermann, L.; Stanković, T. Rigid and Flat Foldability of a Degree-Four Vertex in Origami. J. Mech. Robot. 2020, 12, 011004.
[CrossRef]
10. Rodrigues, O. Des lois géométriques qui régissent les déplacements d’un système solide dans l’espace, et de la variation des
coordonnées provenant de ces déplacements considérés indépendamment des causes qui peuvent les produire. J. Mathématiques
Pures Appliquées 1840, 5, 380–440. Available online: http://sites.mathdoc.fr/JMPA/PDF/JMPA_1840_1_5_A39_0.pdf
(accessed on 1 January 2022).
11. Kawasaki, T. On the Relation between Mountain-Creases and Valley-Creases of a Flat Origami. In Proceedings of the 1st International
Meeting on Origami Science and Technology; Huzita, H., Ed.; Universita di Padova: Padova, Italy, 1989; pp. 229–237.
12. Justin, J. Mathematics of origami, part 9. Br. Origami 1986, 118, 28–30.
13. Yellowhorse, A.; Tolman, K.; Howell, L.L. Optimization of Origami-Based Tubes for Lightweight Deployable Structures. In
Proceedings of the ASME 2017 International Design Engineering Technical Conferences and Computers and Information in
Engineering Conference, Cleveland, OH, USA, 6–9 August 2017. [CrossRef]
14. Kim, Y.; Yuk, H.; Zhao, R.; Chester, S.A.; Zhao, X. Printing ferromagnetic domains for untethered fast-transforming soft materials.
Nature 2018, 558, 274–279. [CrossRef] [PubMed]
15. Kuribayashi, K.; Tsuchiya, K.; You, Z.; Tomus, D.; Umemoto, M.; Ito, T.; Sasaki, M. Self-deployable origami stent grafts as a
biomedical application of Ni-rich TiNi shape memory alloy foil. Mater. Sci. Eng. A 2006, 419, 131–137. [CrossRef]
16. Bolanos, D.; Ynchausti, C.; Brown, N.; Pruett, H.; Hunter, J.; Clark, B.; Bateman, T.; Howell, L.L.; Magleby, S.P. Considering
thickness-accommodation, nesting, grounding and deployment in design of Miura-ori based space arrays. Mech. Mach. Theory
2022, 174, 104904. [CrossRef]
Article
Jump and Initial-Sensitive Excessive Motion of a Class of
Relative Rotation Systems and Their Control via Delayed
Feedback
Ziyin Cui and Huilin Shang *
Abstract: Jump and excessive motion are undesirable phenomena in relative rotation systems, causing
a loss of global integrity and reliability of the systems. In this work, a typical relative rotation system
is considered in which jump, excessive motion, and their suppression via delayed feedback are
investigated. The Method of Multiple Scales and the Melnikov method are applied to analyze critical
conditions for bi-stability and initial-sensitive excessive motion, respectively. By introducing the
fractal of basins of attraction and the erosion of the safe basin to depict jump and initial-sensitive
excessive motion, respectively, the point mapping approach is used to present numerical simulations
which are in agreement with the theoretical prediction, showing the validity of the analysis. It is found
that jump between bistable attractors can be due to saddle–node bifurcation, while initial-sensitive
excessive motion can be due to heteroclinic bifurcation. Under a positive coefficient of the gain,
both types of delayed feedback can be effective in reducing jump and initial-sensitive excessive
motion. The results may provide some reference for the performance improvement of rotors and
main bearings.
Keywords: relative rotation; jump; safe basin; fractal; heteroclinic bifurcation; delayed feedback
MSC: 58Z05; 37G35; 37G15
Citation: Cui, Z.; Shang, H. Jump and Initial-Sensitive Excessive Motion of a Class of Relative Rotation
Systems and Their Control via Delayed Feedback. Mathematics 2022, 10, 2676.
https://doi.org/10.3390/math10152676
power supply may result in strange attractors such as classical Lorenz and Feigenbaum
attractors. In a nonlinear relative rotation system with a triple-well Mathieu-Duffing oscilla-
tor, Liu et al. [14] obtained the threshold of chaos regarding Smale horseshoe commutation
and exhibited the erosion process of safe basins. A class of nonlinear relative rotational
systems containing two rotors was also built whose chaotic response was presented [15].
By introducing the erosion of the safe basin to describe the safe performance of the spur
gear pair, Zhu et al. [16] classified the multiple meshing states and presented the transition
process from safe to unsafe. To control the complex dynamics of the main transmission
system of a scraper conveyor, Ju et al. [17] analyzed its local bifurcations and proposed a
nonlinear state feedback controller whose effect was studied numerically. Considering the
effectiveness of time-delay feedback on controlling fractal erosion of the safe basin and
chaos in nonlinear dynamical systems [18,19], Zhao et al. [20] applied delayed displacement
feedback in a relative torsional vibration system for reducing its response amplitude. On
this basis, Shang et al. [21] discussed the effect of delayed position feedback on controlling
the erosion of the safe basin and chaos. Although there has been meaningful research on
the nonlinear vibration characteristics of the relative rotation system, the study of excessive
motion and jump among multiple attractors has mainly been carried out by numerical simulation,
and the mechanisms of these complex dynamical behaviors and their control are not
yet clear.
To this end, we select a class of nonlinear relative rotation systems composed of two
rotors and study the mechanisms behind its jump and initial-sensitive excessive motion,
as well as the effect of delayed feedback on suppressing these phenomena. The paper is
arranged as follows. In Section 2, the dynamical model of a relative rotation system is
constructed and made dimensionless. In Section 3, the mechanism of jump and excessive
motion is analyzed. In Section 4, two control strategies, namely, delayed position feedback
and delayed velocity feedback, are applied to the original system, and their control
mechanisms are discussed. Section 5 contains the discussion.
where K1 is the coefficient of linear stiffness between the two rotors; K3 is the coefficient of
nonlinear stiffness satisfying K3 < 0; c12 is the coefficient of linear damping; Te1 and Te2 are
external rotational torques loaded on the two rotors, respectively; f 12 is a class of nonlinear
stick-slip frictions [20,21] given by
$$f_{12} = c_0 + c_1(\dot{\theta}_1 - \dot{\theta}_2) + c_2(\dot{\theta}_1 - \dot{\theta}_2)^3, \tag{2}$$
in which c0 , c1 and c2 are the constant of static friction, the coefficient of linear damping
and the coefficient of cubic nonlinear damping, respectively. By denoting x = θ1 − θ2 in
Equation (1), the relative rotation system can be obtained as below:
The torques Te1 and Te2 are usually harmonic excitations, thus can be represented as
where F and Ω are the amplitude and the frequency of the excitation, respectively. Intro-
ducing the following variables,
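$$\omega_0^2 = \frac{K_1(J_1+J_2)}{J_1J_2},\quad b = -\frac{K_3}{K_1},\quad \mu = \frac{(c_{12}+c_1)(J_1+J_2)}{J_1J_2\omega_0},\quad g = \frac{c_2(J_1+J_2)\omega_0}{J_1J_2},\quad T = \omega_0 t,\quad \omega = \frac{\Omega}{\omega_0},\quad f = \frac{F}{\omega_0^2}, \tag{5}$$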
the dimensionless form of the relative rotation system (3) is obtained as
$$y = \frac{dx}{dT},\quad \frac{dy}{dT} = -\mu y - x + bx^3 - gy^3 + f\cos(\omega T), \tag{6}$$
in which the parameters μ, b, g, and ω are positive, and the position x(T) and the velocity
y(T) represent the relative rotation angle and the relative angular velocity at time T,
respectively.
Since the parameters c12, c0, c1, and c2 in the original system (1) are very small, the
terms involving μ, g and f in Equation (6) can be treated as perturbations. The
unperturbed system can be written as
$$\frac{dx}{dT} = y,\quad \frac{dy}{dT} = -x + bx^3. \tag{7}$$
There are three equilibria in the unperturbed system (7), namely, the center S1(0,0)
and two saddle points S2(√b/b, 0) and S3(−√b/b, 0). The Hamiltonian of Equation (7) is
$$H(x,y) = \frac{1}{2}y^2 + \frac{1}{2}x^2 - \frac{1}{4}bx^4. \tag{8}$$
Accordingly, the heteroclinic orbits surrounding the center S1(0,0) can be given by
$$x_\pm(t) = \pm\frac{\sqrt{b}}{b}\tanh\left(\frac{\sqrt{2}}{2}t\right),\quad y_\pm(t) = \pm\frac{\sqrt{2b}}{2b}\operatorname{sech}^2\left(\frac{\sqrt{2}}{2}t\right). \tag{9}$$
Fixing b = 0.3, the unperturbed orbits are depicted in Figure 2.
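As a quick numerical sanity check of Equations (7)–(9), the following Python sketch evaluates the heteroclinic orbit for b = 0.3 and confirms that it stays on the energy level of the saddle points, H = 1/(4b), and that dx/dt matches y. It is not part of the original analysis; the function names are illustrative.

```python
import numpy as np

def hamiltonian(x, y, b):
    """Hamiltonian (8) of the unperturbed system (7)."""
    return 0.5 * y**2 + 0.5 * x**2 - 0.25 * b * x**4

def heteroclinic_orbit(t, b, sign=1.0):
    """Heteroclinic orbit (9); sign=+1 or -1 selects the upper or lower branch."""
    s = np.sqrt(2.0) / 2.0
    x = sign * np.tanh(s * t) / np.sqrt(b)
    y = sign * (s / np.sqrt(b)) / np.cosh(s * t) ** 2
    return x, y

b = 0.3
t = np.linspace(-8.0, 8.0, 2001)
x, y = heteroclinic_orbit(t, b)

# Energy of the saddle points S2, S3 located at x = +-1/sqrt(b), y = 0
h_saddle = hamiltonian(1.0 / np.sqrt(b), 0.0, b)
energy_error = np.max(np.abs(hamiltonian(x, y, b) - h_saddle))

# A finite-difference derivative of x should reproduce y
velocity_error = np.max(np.abs(np.gradient(x, t) - y))
print(energy_error, velocity_error)
```

Both errors should be small: the first at round-off level, the second limited by the finite-difference step.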
3. Complex Dynamics
3.1. Multistability and Jump
The periodic solution near the equilibrium S1 (0,0) is considered. The Method of
Multiple Scales (MMS) [22] is employed to obtain the approximate theoretical solution of
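Equation (3). A small parameter ε (0 < ε ≪ 1) is introduced to rescale the nondimensional
parameters as follows:
$$\mu = \varepsilon\tilde{\mu},\quad b = \varepsilon\tilde{b},\quad g = \varepsilon\tilde{g},\quad f = \varepsilon\tilde{f}, \tag{10}$$
Equation (6) becomes
$$\frac{d^2x}{dT^2} + x = \varepsilon\left(-\tilde{\mu}\frac{dx}{dT} + \tilde{b}x^3 - \tilde{g}\left(\frac{dx}{dT}\right)^3 + \tilde{f}\cos(\omega T)\right). \tag{11}$$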
Further rescaling parameters and operators in Equation (11) yields
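$$\omega = 1 + \varepsilon\sigma,\quad T_i = \varepsilon^i T,\quad x(T) = x_0(T_0,T_1,\cdots) + \varepsilon x_1(T_0,T_1,\cdots) + \cdots,\quad D_i = \frac{\partial}{\partial T_i},\quad \frac{d}{dT} = \sum_{i=0}^{n}\varepsilon^i D_i \quad (i = 0, 1, 2, \cdots), \tag{12}$$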
where σ is the detuning parameter, and separating the coefficients of ε0 and ε1 of the
system (11) one finds
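$$\varepsilon^0:\quad D_0^2x_0 + x_0 = 0, \tag{13}$$
and
$$\varepsilon^1:\quad D_0^2x_1 + x_1 = -2D_1D_0x_0 - \tilde{\mu}D_0x_0 + \tilde{b}x_0^3 - \tilde{g}(D_0x_0)^3 + \tilde{f}\cos(T_0 + \sigma T_1). \tag{14}$$
The solution of Equation (13) can be written as
$$x_0(T_0,T_1) = A(T_1)e^{iT_0} + cc, \tag{15}$$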
where cc denotes the complex conjugate of the preceding terms. Substituting Equation (15)
into Equation (14) and separating the secular terms yields
$$D_1A + \frac{\tilde{\mu}A}{2} + \frac{3(\tilde{g} + i\tilde{b})A^2\bar{A}}{2} + \frac{i\tilde{f}e^{i\sigma T_1}}{4} = 0. \tag{16}$$
In addition, by defining
$$A(T_1) = \frac{a(T_1)}{2}e^{i\beta(T_1)},\quad \psi = \sigma T_1 - \beta, \tag{17}$$
separating the real and imaginary parts of Equation (16), and returning the parameters to the
nondimensional parameters of the system (3), the modulation equation can be written as
$$\mu a + \frac{3ga^3}{4} = f\sin\psi,\quad -2(\omega - 1)a - \frac{3ba^3}{4} = f\cos\psi. \tag{19}$$
Accordingly, the approximate periodic solution can be expressed as x0 = a cos(ωT − ψ). By
eliminating ψ from the relationships in Equation (19), its amplitude a can be solved from
the following equation
$$\left(\mu + \frac{3ga^2}{4}\right)^2a^2 + \left(2\omega - 2 + \frac{3}{4}ba^2\right)^2a^2 = f^2. \tag{20}$$
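Equation (20) is a cubic in a², so its real positive roots can be found directly. The sketch below does this with `numpy.roots`. The values μ = 0.02 and b = 0.3 appear elsewhere in the text, while g = 0.04 is only an assumed value for illustration, since the parameter set of Equation (23) is not reproduced here.

```python
import numpy as np

def response_amplitudes(f, omega, mu, b, g):
    """Real positive amplitudes a solving Equation (20), a cubic in a**2."""
    d = 2.0 * omega - 2.0                      # detuning combination 2*omega - 2
    coeffs = [
        9.0 / 16.0 * (g**2 + b**2),            # coefficient of (a^2)^3
        1.5 * (mu * g + d * b),                # coefficient of (a^2)^2
        mu**2 + d**2,                          # coefficient of a^2
        -f**2,                                 # constant term
    ]
    roots = np.roots(coeffs)
    real = roots[np.abs(roots.imag) < 1e-9].real
    return np.sqrt(np.sort(real[real > 0.0]))

def residual(a, f, omega, mu, b, g):
    """Left side minus right side of Equation (20) for a given amplitude a."""
    return ((mu + 0.75 * g * a**2) ** 2 * a**2
            + (2.0 * omega - 2.0 + 0.75 * b * a**2) ** 2 * a**2 - f**2)

# g = 0.04 is an assumption, not a value quoted from the paper
pars = dict(mu=0.02, b=0.3, g=0.04)
amps_near = response_amplitudes(0.13, 0.80, **pars)   # inside the bistable band
amps_far = response_amplitudes(0.13, 1.20, **pars)    # above resonance
print(amps_near, amps_far)
```

Three positive roots indicate two stable branches separated by an unstable one; a single root indicates a unique periodic response.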
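$$\left(\lambda + \frac{\mu}{2} + \frac{9ga^2}{8}\right)\left(\lambda + \frac{\mu}{2} + \frac{3ga^2}{8}\right) + \left(\omega - 1 + \frac{9}{8}ba^2\right)\left(\omega - 1 + \frac{3}{8}ba^2\right) = 0, \tag{21}$$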
meaning that for
$$\frac{27(g^2+b^2)}{16}a^4 + 3(g\mu + b(2\omega - 2))a^2 + 4(\omega - 1)^2 + \mu^2 < 0, \tag{22}$$
the periodic solution is unstable and becomes a saddle. Namely, when the inequality (22)
becomes an equality, saddle-node bifurcation will occur in the system (6). For
the response curve of Equation (20) is plotted in Figure 3 where the coexistence of two
stable periodic branches in the same range of the excitation parameters implies bi-stability.
For example, for f = 0.13 and ω within the range (0.65, 0.86), bistable periodic attractors
coexist (see Figure 3a). Fixing ω = 0.80, the increase of the amplitude f can also lead to
saddle-node bifurcation, and thus bi-stability. The numerical results agree well with the
theoretical prediction.
Figure 3. Response curves of system (2). (a) Amplitude a vs. ω when f = 0.13; (b) Amplitude a vs. f
when ω = 0.80.
For
$$\sqrt{\frac{2}{b}}\,\pi\omega f\operatorname{csch}\left(\frac{\sqrt{2}\pi\omega}{2}\right) \geq \frac{2\sqrt{2}\mu}{3b} + \frac{8\sqrt{2}g}{35b^2}, \tag{25}$$
namely,
$$f \geq f_0 = \frac{2(35\mu b + 12g)\sinh\left(\frac{\pi\omega}{\sqrt{2}}\right)}{105\pi\omega b^{3/2}}, \tag{26}$$
there will be a value of t0 satisfying M±(t0) = 0 and M′±(t0) ≠ 0, meaning that the roots
of the equation M±(t0) = 0 are simple, enabling the existence of transverse heteroclinic
orbits, and the system (6) may undergo initial-sensitive motion. According to Equation (26),
f0 is the amplitude threshold of heteroclinic bifurcation. For parameter values given by
Equation (23), the critical value for heteroclinic bifurcation is f0 = 0.09. In contrast, it follows
from Inequality (25) that the threshold of the excitation frequency ω for heteroclinic
bifurcation cannot be expressed as an explicit or monotonically increasing function of the other
parameters. For example, for f = 0.13 and the values of parameters in Equation (3), it can
be calculated from Equation (25) that for ω < 1.22, heteroclinic bifurcation will occur in the
system (6).
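The threshold in Equation (26) is straightforward to evaluate. The following sketch computes f0 and checks it against Inequality (25). The values μ = 0.02 and b = 0.3 are taken from elsewhere in the text; g = 0.04 is an assumed value chosen only for illustration, because the parameter set of Equation (23) is not reproduced here.

```python
import math

def melnikov_threshold(mu, b, g, omega):
    """Amplitude threshold f0 of Equation (26) for heteroclinic bifurcation."""
    s = math.sinh(math.pi * omega / math.sqrt(2.0))
    return 2.0 * (35.0 * mu * b + 12.0 * g) * s / (105.0 * math.pi * omega * b**1.5)

def forcing_minus_damping(f, mu, b, g, omega):
    """Left side minus right side of Inequality (25); zero exactly at f = f0."""
    arg = math.sqrt(2.0) * math.pi * omega / 2.0
    lhs = math.sqrt(2.0 / b) * math.pi * omega * f / math.sinh(arg)
    rhs = (2.0 * math.sqrt(2.0) * mu / (3.0 * b)
           + 8.0 * math.sqrt(2.0) * g / (35.0 * b**2))
    return lhs - rhs

mu, b, g, omega = 0.02, 0.3, 0.04, 0.80   # g is an assumption, not a quoted value
f0 = melnikov_threshold(mu, b, g, omega)
print(f"f0 = {f0:.4f}")
```

With these assumed values the threshold rounds to the f0 = 0.09 quoted above; other choices of g would shift it.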
In order to verify the criterion obtained in this subsection, numerical simulations are
carried out by fixing parameter values as in Equation (23). The fourth-order Runge–Kutta method
and the point-mapping method [24] are employed to describe the phenomenon of initial-sensitive
excessive motion. First, some terms such as basin of attraction and safe basin
are introduced briefly. A basin of attraction is defined as the set of initial conditions that
can lead to the same attractor [25]. If the boundary of basin of attraction of an attractor is
fractal and mixed with another, jump among multiple attractors may occur [25]. Safe basin
is defined as the union of basins of attraction for all bounded attractors [26]. Fractal erosion of the
safe basin of the system (6) induces the occurrence of initial-sensitive excessive motion. In
this paper, the basins of attraction are drawn in the sufficiently large region defined by
−3.0 ≤ x(0) ≤ 3.0, −3.0 ≤ y(0) ≤ 3.0 by generating a 600 × 600 array of initial conditions
and integrating from each of these starting points. The escaping set for infinite time is approximated with
good accuracy by integrating over 1000 excitation cycles. The time step is taken as 0.01. The
white region represents the numerical approximation to the basin of attraction of excessive
motion. The red and blue regions are the basins of attraction for attractors from the lower and
higher stable periodic branches of Figure 3, respectively. Thus, the union of the red region
and the blue is the so-called safe basin.
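The procedure described above can be sketched in a few lines of Python. This is a minimal illustration rather than the authors' code: the escape criterion |x| > 10, the value g = 0.04, and the coarse grid are all assumptions, and a pure-Python loop at the full 600 × 600 resolution with 1000 cycles would be slow.

```python
import math

def rhs(t, x, y, mu, b, g, f, omega):
    """Right-hand side of the dimensionless system (6)."""
    return y, -mu * y - x + b * x**3 - g * y**3 + f * math.cos(omega * t)

def escapes(x0, y0, mu, b, g, f, omega, cycles=50, dt=0.01, bound=10.0):
    """Integrate system (6) with classical RK4; True if |x| exceeds `bound`."""
    t, x, y = 0.0, x0, y0
    steps = int(cycles * 2.0 * math.pi / omega / dt)
    p = (mu, b, g, f, omega)
    for _ in range(steps):
        k1x, k1y = rhs(t, x, y, *p)
        k2x, k2y = rhs(t + dt / 2, x + dt / 2 * k1x, y + dt / 2 * k1y, *p)
        k3x, k3y = rhs(t + dt / 2, x + dt / 2 * k2x, y + dt / 2 * k2y, *p)
        k4x, k4y = rhs(t + dt, x + dt * k3x, y + dt * k3y, *p)
        x += dt / 6 * (k1x + 2 * k2x + 2 * k3x + k4x)
        y += dt / 6 * (k1y + 2 * k2y + 2 * k3y + k4y)
        t += dt
        if abs(x) > bound:          # assumed escape criterion
            return True
    return False

def safe_basin_grid(n, **params):
    """Coarse n x n map over [-3, 3]^2; True marks initial states that stay bounded."""
    pts = [-3.0 + 6.0 * i / (n - 1) for i in range(n)]
    return [[not escapes(x0, y0, **params) for x0 in pts] for y0 in pts]

# g = 0.04 is an assumed value; f and omega are taken from Figure 5a
params = dict(mu=0.02, b=0.3, g=0.04, f=0.03, omega=0.80, cycles=10)
origin_escapes = escapes(0.0, 0.0, **params)
far_point_escapes = escapes(3.0, 3.0, **params)
```

Distinguishing the red and blue basins would additionally require recording which periodic attractor each bounded trajectory settles on, for example by comparing its late-time amplitude with the branches of Equation (20).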
The evolution of basins of attraction with the frequency ω for f = 0.13 and with the
amplitude f for ω = 0.80 can be observed in Figures 4 and 5. In Figure 4, each safe
basin, i.e., the union of the red and blue regions, is fractally eroded when ω ranges from 0.62
to 0.87, satisfying ω < 1.22, i.e., the condition of heteroclinic bifurcation, which shows the
occurrence of initial-sensitive excessive motion, in agreement with the analytical prediction.
In Figure 4a, there is only a red region, meaning that there is only one periodic attractor
from the lower stable branch. When ω = 0.69 (see Figure 4b), the red region and the blue
one coexist, showing bistability. Specifically, on the left side of the origin, the red region
is fractal and mixed with the white region and the blue, indicating that both jump and
initial-sensitive excessive motion may occur there. As ω increases to 0.81 (see Figure 4c),
most areas of the red region will be eroded by the blue one, showing that most of the safe
initial conditions lead to the higher-amplitude attractor. For ω = 0.87 (see Figure 4d), the
red region disappears, and there is only the blue, which means that the response shifts from
the lower-amplitude branch to the higher-amplitude, in agreement with the theoretical
results of Figure 3a.
Figure 4. Evolution of basins of attraction with the increase of ω for f = 0.13. (a) ω = 0.62; (b) ω = 0.69;
(c) ω = 0.81; (d) ω = 0.87.
Figure 5. Evolution of basins of attraction with the increase of f for ω = 0.80. (a) f = 0.03; (b) f = 0.07;
(c) f = 0.12; (d) f = 0.20.
In Figure 5, the safe basin (the union of red and blue regions) is steadily eroded by the
white region with the increase of the amplitude f, implying that initial-sensitive excessive
motion can be induced by the increase of f. In Figure 5a,b, the boundary of safe basins is
smooth. When f is more than 0.09, i.e., the critical threshold solved from Equation (26),
the boundary of safe basin will become fractal (see Figure 5c,d). In Figure 5a–c, the red
region and the blue one are mixed, indicating bi-stability as well as jump. Note that for
f = 0.12, the red and blue regions are evenly mixed in the neighborhood of the origin (see
Figure 5c). For f = 0.20, only the basin of attraction of the attractors in the higher stable
branch of Figure 3b exists, which is seriously eroded by the white region, showing that the
excessive motion is easy to trigger.
Figure 6. Diagram of the relative rotating system under delayed feedback control.
Letting
$$G_p = \frac{J_1+J_2}{J_1J_2}Q,\quad G_v = \frac{J_1+J_2}{J_1J_2}Q\omega_0,\quad \tilde{\tau} = \omega_0\tau, \tag{28}$$
and based on Equations (2)–(6), we obtain the relative rotation system under delayed
position feedback control, i.e.,
$$\frac{dx}{dT} = y,\quad \frac{dy}{dT} = -\mu y - x + bx^3 - gy^3 + f\cos(\omega T) + G_p\bigl(x(T - \tilde{\tau}) - x(T)\bigr), \tag{29}$$
dT dT
and the following delayed-velocity-feedback controlled system
$$y = \frac{dx}{dT},\quad \frac{dy}{dT} = -\mu y - x + bx^3 - gy^3 + f\cos(\omega T) + G_v\bigl(y(T - \tilde{\tau}) - y\bigr). \tag{30}$$
In the delayed-feedback-controlled systems (29) and (30), Gp, Gv and τ̃ are independent
parameters. For zero gain (Gp = 0 or Gv = 0, respectively) or τ̃ = 0, the feedback terms in
Equations (29) and (30) become 0, and the delayed systems (29) and (30) become the uncontrolled
non-dimensional system (6). In this paper, considering the engineering application, we do not consider the
periodic characteristics of τ̃ but restrict it to 0 ≤ τ̃ ≤ 2π. Since there is no signal returned
to the controlled systems (29) and (30) before T = 0, it is supposed that the initial states
of the delayed system for −τ̃ ≤ T < 0 satisfy x(T) = y(T) = 0. Then, safe basins of
the delayed-feedback-controlled systems can also be projected onto the initial-state plane
x(0)–y(0), similar to the uncontrolled system (6).
$$\mu a + \frac{3ga^3}{4} + aG_p\sin\tilde{\tau} = f\sin\psi,\quad -2(\omega - 1)a - \frac{3ba^3}{4} + aG_p(1 - \cos\tilde{\tau}) = f\cos\psi. \tag{32}$$
4 4
Accordingly, the approximate periodic solution can be expressed as x0 = a cos(ωT − ψ).
Its amplitude a can be solved from the following equation
2 2
3ga2 3
(μ + G p sin τ+ + ) a2 + (2ω − 2 + ba2 + G p cos τ+ − G p ) a2 = f 2 . (33)
4 4
The stability of the solutions, thus obtained, can be ascertained by computing the
eigenvalues from the corresponding characteristic equation:
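$$\lambda^2 + \lambda\left(\mu + G_p\sin\tilde{\tau} + \frac{3ga^2}{2}\right) + \frac{27(g^2+b^2)a^4}{64} + \frac{(\mu + G_p\sin\tilde{\tau})^2}{4} + \left(\omega - 1 + \frac{G_p\cos\tilde{\tau} - G_p}{2}\right)^2 + \frac{3a^2}{4}\bigl(gG_p\sin\tilde{\tau} + bG_p\cos\tilde{\tau} + b(2\omega - 2 - G_p) + g\mu\bigr) = 0. \tag{34}$$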
Assuming λ = iv in Equation (34) and separating the real and imaginary parts,
we have
$$v\left(\mu + G_p\sin\tilde{\tau} + \frac{3ga^2}{2}\right) = 0 \tag{35}$$
and
As we know, when the delay is small, the delayed feedback can be expanded in a
Taylor series so that the controlled system (29) will become
$$\frac{dx}{dT} = y,\quad \frac{dy}{dT} = -(\mu + G_p\tilde{\tau})y - x + bx^3 - gy^3 + f\cos(\omega T). \tag{37}$$
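The small-delay reduction of system (29) can be spelled out as a first-order sketch, assuming the delay is small enough that higher-order terms of the expansion are negligible:

```latex
x(T-\tilde{\tau}) - x(T) \approx -\tilde{\tau}\,\frac{dx}{dT} = -\tilde{\tau}\,y(T)
\quad\Longrightarrow\quad
G_p\bigl(x(T-\tilde{\tau}) - x(T)\bigr) \approx -G_p\tilde{\tau}\,y(T),
```

so the feedback term simply adds G_p times the dimensionless delay to the linear damping coefficient μ.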
For Gp > 0, this acts like an increase in the damping of the system (6), thus reducing the
periodic vibration [20,27]. Letting Gp > 0, one has μ + Gp sin τ̃ + 3ga²/2 > 0 for 0 ≤ τ̃ ≤ π.
Thus, v = 0 in Equation (35); if
sequences of basin of attraction with the increase of f and τ̃ will be discussed in detail in
the next subsection.
Figure 7. Variation of the response amplitude of the system (29) with the excitation for Gp = 0.3.
(a) f = 0.13; (b) ω = 0.80.
Figure 8. Evolution of basins of attraction with the increase of τ when f = 0.13 and ω = 0.78. (a) τ = 0;
(b) τ = 0.05; (c) τ = 0.15; (d) τ = 0.35.
$$\lambda^2 + \mu\lambda + 1 + G_p - G_pe^{-\lambda\tilde{\tau}} = 0. \tag{41}$$
Substituting λ = iv into (41) and separating the imaginary and real parts yields
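For
$$G_p > \max\left\{\frac{\mu^2}{2} - 1,\ \frac{\mu^2}{2} + \mu\right\}, \tag{44}$$
there are two different positive solutions of Equation (43) expressed as Ωp1 and Ωp2
(Ωp1 > Ωp2 > 0); the critical value of the delay for stability switch of S1(0,0) is
$$\tau_0 = \frac{1}{\Omega_{p1}}\left(2\pi - \arccos\left(1 + \frac{1 - \Omega_{p1}^2}{G_p}\right)\right). \tag{45}$$
For τ̃ < τ0, the delayed position feedback can be regarded as a perturbation term.
Substituting the heteroclinic orbits (9) into the Melnikov function of the system (29) yields
$$M_\pm(t_0) = -\frac{2\sqrt{2}\mu}{3b} - \frac{8\sqrt{2}g}{35b^2} + \sqrt{\frac{2}{b}}\,\pi\omega f\operatorname{csch}\left(\frac{\pi\omega}{\sqrt{2}}\right)\cos(\omega t_0) - \frac{G_p}{b}l_1(\tilde{\tau}), \tag{46}$$
where $l_1(\tilde{\tau}) = \operatorname{csch}^2\left(\frac{\sqrt{2}}{2}\tilde{\tau}\right)\left(\sinh(\sqrt{2}\tilde{\tau}) - \sqrt{2}\tilde{\tau}\right) > 0$. The critical condition for heteroclinic
bifurcation can be expressed as
$$f > f_P(\tilde{\tau}) = f_0 + \frac{G_p\sinh\left(\frac{\pi\omega}{\sqrt{2}}\right)l_1(\tilde{\tau})}{\pi\sqrt{2b}\,\omega}, \tag{47}$$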
Accordingly, for Gp > 0, the threshold of heteroclinic bifurcation fP(τ̃) will increase
with time delay and be higher than f0. Fixing μ = 0.02 and Gp = 0.3, it can be calculated that
τ0 = 2.55.
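The quoted value of τ0 can be reproduced from Equations (41) and (45). In the sketch below, the quartic for the crossing frequency is re-derived by substituting λ = iv into Equation (41) and eliminating the delay; it is not quoted from the paper, and the function name is illustrative.

```python
import math

def critical_delay(mu, gp):
    """First stability-switch delay tau0 of the origin, following Equation (45).

    Substituting lambda = i*v into Equation (41) and eliminating the delay gives
    v**4 - (2 + 2*gp - mu**2) * v**2 + 1 + 2*gp = 0 (re-derived here).
    """
    p = 2.0 + 2.0 * gp - mu**2
    disc = p**2 - 4.0 * (1.0 + 2.0 * gp)
    if disc <= 0.0 or p <= 0.0:
        return None  # no pair of positive crossing frequencies
    omega_p1 = math.sqrt((p + math.sqrt(disc)) / 2.0)  # larger crossing frequency
    return (2.0 * math.pi - math.acos(1.0 + (1.0 - omega_p1**2) / gp)) / omega_p1

tau0 = critical_delay(mu=0.02, gp=0.3)
print(f"tau0 = {tau0:.2f}")
```

For μ = 0.02 and Gp = 0.3 this evaluates to approximately 2.55, in line with the value stated above.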
Figure 9 shows the variation of fP(τ̃) with the increase of time delay τ̃ (τ̃ ≪ τ0). The
numerical values of fP(τ̃) are taken as the values of f at which the boundary of the safe basin begins to
lose smoothness. Each numerical critical value of fP(τ̃) is rounded to two decimal places. We verified
that if f is less than the numerical value of fP(τ̃), the boundary of the safe basin is
smooth. In Figure 9, the numerical results for the critical values of f are in agreement with
the analytical values, demonstrating that the threshold of the amplitude f for heteroclinic
bifurcation will increase monotonically with the delay for Gp > 0 and τ̃ < τ0.
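The dependence of the threshold on the delay can be tabulated directly. The sketch below obtains fP by balancing the constant and oscillating parts of the Melnikov function (46); Gp = 0.3, μ = 0.02, b = 0.3 and ω = 0.80 come from the text, while g = 0.04 is an assumed value used only for illustration.

```python
import math

def f0(mu, b, g, omega):
    """Uncontrolled threshold, Equation (26)."""
    s = math.sinh(math.pi * omega / math.sqrt(2.0))
    return 2.0 * (35.0 * mu * b + 12.0 * g) * s / (105.0 * math.pi * omega * b**1.5)

def l1(tau):
    """Delay factor l1 defined below Equation (46); it tends to 0 as tau -> 0."""
    if tau == 0.0:
        return 0.0
    c = math.sqrt(2.0) * tau
    return (math.sinh(c) - c) / math.sinh(c / 2.0) ** 2

def f_p(tau, gp, mu, b, g, omega):
    """Threshold under delayed position feedback, from Melnikov function (46)."""
    s = math.sinh(math.pi * omega / math.sqrt(2.0))
    return f0(mu, b, g, omega) + gp * s * l1(tau) / (math.pi * math.sqrt(2.0 * b) * omega)

# g = 0.04 is an assumption, not a value quoted from the paper
params = dict(gp=0.3, mu=0.02, b=0.3, g=0.04, omega=0.80)
delays = [0.0, 0.05, 0.15, 0.35]
thresholds = [f_p(t, **params) for t in delays]
for t, fp in zip(delays, thresholds):
    print(f"tau = {t:.2f}  f_P = {fp:.3f}")
```

The delays are the ones used in Figure 10; for a positive gain the computed thresholds grow with the delay, which is the trend reported for Figure 9.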
Figure 9. fP (τ) of the controlled system (29) versus time delay when Gp = 0.3 and ω = 0.80.
Figure 10 depicts the evolution of the safe basin of the controlled system (29) with
the delay τ̃. In Figure 10a,e,i for τ̃ = 0, safe basins as well as basins of attraction can
be considered as those of the uncontrolled system (6), illustrating that as the amplitude f
increases, the safe basin will be eroded. Specifically, in Figure 10i for f = 0.30, only the basin of
attraction of the higher-amplitude attractor is left, which is seriously eroded by the white
region, showing the high possibility of excessive motion. For Gp = 0.3 and τ̃ = 0.11, the
safe basin is obviously expanded. Besides, comparing Figure 10a–d, one can observe that
as τ̃ increases, the safe basin changes from the union of the red and blue regions to the red
region itself, meaning that jump and initial-sensitive excessive motion are reduced by the
basin expansion of the lower-amplitude attractor. In Figure 10e–h, one can also draw this
conclusion. Note that in Figure 10e,f, the three colored regions are mixed in the neighborhood of the
origin, which indicates the high probability of jump and initial-sensitive excessive motion.
As τ̃ increases, they can be controlled. For f = 0.30 (see Figure 10i–l), even though the safe
basin is still fractal, its area becomes much larger with the increase of τ̃. This shows that
the possibility of excessive motion is reduced. It follows from Figure 9 that the delayed
position feedback can suppress the erosion of the safe basin effectively when Gp is positive.
Figure 10. Sequences of safe basin of the system (29) with the increase of f and τ̃ when Gp = 0.3.
(a) f = 0.11, τ̃ = 0; (b) f = 0.11, τ̃ = 0.05; (c) f = 0.11, τ̃ = 0.15; (d) f = 0.11, τ̃ = 0.35; (e) f = 0.15, τ̃ = 0;
(f) f = 0.15, τ̃ = 0.05; (g) f = 0.15, τ̃ = 0.15; (h) f = 0.15, τ̃ = 0.35; (i) f = 0.30, τ̃ = 0; (j) f = 0.30, τ̃ = 0.05;
(k) f = 0.30, τ̃ = 0.15; (l) f = 0.30, τ̃ = 0.35.
The amplitude and phases of the system can be obtained via letting the right side of
Equation (48) be zero, given by
$$f\sin\psi = \mu a + \frac{3g}{4}a^3 + G_v(1 - \cos\tilde{\tau})a,\quad f\cos\psi = \left(2 - 2\omega - \frac{3b}{4}a^2 - G_v\sin\tilde{\tau}\right)a. \tag{49}$$
Thus, the amplitude of the approximated periodic solution x0 = a cos(ωT − ψ) can be
solved from the following equation:
$$\left(\mu + \frac{3g}{4}a^2 + G_v(1 - \cos\tilde{\tau})\right)^2a^2 + \left(2 - 2\omega - \frac{3b}{4}a^2 - G_v\sin\tilde{\tau}\right)^2a^2 = f^2. \tag{50}$$
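According to its characteristic equation, the stability switch will occur if there exists a
real number ṽ satisfying
$$\begin{aligned} & v\left(\mu + G_v(1 - \cos\tilde{\tau}) + \frac{3ga^2}{2}\right) = 0,\\ & -v^2 + \frac{(\mu + G_v(1 - \cos\tilde{\tau}))^2}{4} + \frac{(2 - 2\omega - G_v\sin\tilde{\tau})^2}{4} + \frac{\mu}{2}G_v(1 - \cos\tilde{\tau}) + \frac{3a^2}{4}\bigl(\mu g + b(2\omega - 2 + G_v\sin\tilde{\tau})\bigr) + \frac{27(g^2+b^2)a^4}{64} = 0. \end{aligned} \tag{51}$$
For Gv > 0, in Equation (51), one has μ + 3ga²/2 + Gv(1 − cos τ̃) > 0, ṽ = 0, and
$$4\bigl(\mu g + b(2\omega - 2) + bG_v\sin\tilde{\tau}\bigr)^2 - 3(g^2+b^2)\bigl((\mu + G_v)^2 + 2\mu G_v + G_v^2 + 4(1 - \omega)^2 - 2G_v(2\mu + G_v)\cos\tilde{\tau} + 4(\omega - 1)G_v\sin\tilde{\tau}\bigr) < 0, \tag{53}$$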
there will be no positive roots for Equation (52), meaning no saddle-node bifurcation or
bi-stability. For example, given Gv = 0.3 and ω = 0.8, one can calculate from Equation (53)
that saddle-node bifurcation will disappear for τ̃ > 0.68. Similarly to Section 4.1.1, although
delayed velocity feedback can be used to reduce the occurrence of saddle-node bifurcation,
it is hard to express the range of τ̃ for SN bifurcation as a function of f. Fixing the value of f
and combining Equations (50), (52) and (53), one can obtain the condition for τ̃ to reduce
saddle-node bifurcation. For instance, given f = 0.13, one can calculate that the condition of
τ̃ to reduce saddle-node bifurcation is τ̃ > 0.77.
As in Section 4.1.1, for Gv = 0.3 and different values of time delay in
the velocity feedback, Figure 11 shows the variation of the response amplitude with the
excitation. It follows from the theoretical and numerical results depicted in Figure 11 that,
even though the response amplitude is not necessarily suppressed with the increase of
time delay, the velocity feedback can reduce saddle-node bifurcation, and thus bi-stability
and jump. The controlling effect of velocity feedback on jump can also be verified by the
evolution of basins of attraction with the increase of time delay for f = 0.13, ω = 0.78 (see
Figure 12). In contrast to the case of the delayed position feedback, as the delay
increases, the basin of attraction of the higher-amplitude attractor is expanded, and its
boundary becomes smoother and smoother. Finally, it becomes the only bounded attractor
(see Figure 12d). For ω = 0.80, it can be concluded from Figure 11b that the delayed velocity
feedback can suppress bi-stability. For the fixed value of ω, sequences of the basin of
attraction with the increase of f and τ̃ will be discussed in detail in the next subsection.
Figure 11. Variation of the response amplitude of the system (30) with the excitation for Gv = 0.3.
(a) f = 0.13; (b) ω = 0.80.
Figure 12. Evolution of basins of attraction with the increase of τ̃ for f = 0.13 and ω = 0.78. (a) τ̃ = 0;
(b) τ̃ = 0.2; (c) τ̃ = 0.5; (d) τ̃ = 0.8.
$$\lambda^2 + \mu\lambda + 1 + G_v\lambda - G_v\lambda e^{-\lambda\tilde{\tau}} = 0 \tag{54}$$
The first value of time delay for stability switch of S1 (0,0) will occur when λ = ±Ωv i.
Substituting it into Equation (54) and separating the imaginary and the real parts of
Equation (54) yields
Since μ > 0, for Gv > 0, there will be no real roots of Equation (56), meaning that the
stability of the origin will not be changed. Then the delayed velocity feedback of the delayed
system (30) can be regarded as a perturbation term. The Melnikov function of the controlled
system (30) can be expressed as
$$\begin{aligned} M_\pm(t_0) &= \int_{-\infty}^{+\infty}\bigl(-\mu y(T) - gy^3(T) + f\cos(\omega(T + t_0)) - G_vy(T) + G_vy(T - \tilde{\tau})\bigr)y(T)\,dT\\ &= -\frac{2\sqrt{2}\mu}{3b} - \frac{8\sqrt{2}g}{35b^2} + \sqrt{\frac{2}{b}}\,\pi\omega f\operatorname{csch}\left(\frac{\sqrt{2}\pi\omega}{2}\right)\cos(\omega t_0) - \frac{2G_v}{b}l_2, \end{aligned} \tag{57}$$
where $l_2(\tilde{\tau}) = \operatorname{csch}^2\left(\frac{\sqrt{2}}{2}\tilde{\tau}\right)\left(\sqrt{2} - \tilde{\tau}\coth\left(\frac{\sqrt{2}}{2}\tilde{\tau}\right)\right) > 0$. Accordingly, for
$$f > f_V(\tilde{\tau}) = f_0 + \frac{\sqrt{2b}\,G_v}{b\pi\omega}\sinh\left(\frac{\sqrt{2}\pi\omega}{2}\right)l_2(\tilde{\tau}), \tag{58}$$
heteroclinic bifurcation may occur. In Equation (58), for Gv > 0, the threshold
of heteroclinic bifurcation fV(τ̃) increases monotonically with the delay τ̃. Given Gv = 0.3,
the variation of fV(τ̃) with the increase of τ̃ is presented in Figure 13. Numerical and
analytical results both illustrate that, for Gv > 0, the delayed velocity feedback can be used
to increase the threshold of the amplitude f for heteroclinic bifurcation, thus being effective
in reducing the probability of initial-sensitive excessive motion.
Figure 13. f V (τ) of the controlled system (30) versus time delay when Gv = 0.3.
Figure 14 shows the evolution of the safe basin of the controlled system (30) with the
increase of f and τ̃. For τ̃ = 0, Figure 14a,e,i are the same as Figure 10a,e,i, depicting the safe
basin of the uncontrolled system (6). Under a positive gain Gv = 0.3, when τ̃ increases, the
area of the safe basin of the delayed-velocity-feedback-controlled system expands markedly.
In contrast to the results of Section 4.1.2, for f = 0.11 and 0.15, where two periodic
attractors coexist in the uncontrolled system, as τ̃ increases, the basin of attraction of the
higher-amplitude attractor is expanded and finally becomes the safe basin (see the blue
regions in Figure 14a–h). This illustrates that, although the delayed velocity feedback does
not necessarily reduce the response amplitude, it reduces jump and excessive motion.
Figure 14. Sequences of safe basin of the system (30) with the increase of f and τ̃ when Gv = 0.3.
(a) f = 0.11, τ̃ = 0; (b) f = 0.11, τ̃ = 0.20; (c) f = 0.11, τ̃ = 0.50; (d) f = 0.11, τ̃ = 0.80; (e) f = 0.15, τ̃ = 0;
(f) f = 0.15, τ̃ = 0.20; (g) f = 0.15, τ̃ = 0.50; (h) f = 0.15, τ̃ = 0.80; (i) f = 0.30, τ̃ = 0; (j) f = 0.30, τ̃ = 0.20;
(k) f = 0.30, τ̃ = 0.50; (l) f = 0.30, τ̃ = 0.80.
5. Discussion
In this paper, a typical relative rotation system is considered, and the phenomena
of jump and initial-sensitive excessive motion, as well as their suppression via delayed
feedback, are investigated. The Method of Multiple Scales and the Melnikov method are
applied to analyze the conditions for bi-stability and initial-sensitive excessive motion,
respectively. By introducing the basins of attraction and the safe basin to describe the extent
of jump and initial-sensitive excessive motion, respectively, the point mapping approach
is employed to present numerical results that match the theoretical predictions,
verifying the validity of the analysis. Some significant results are presented as follows.
(1) The variation of excitation may induce the coexistence of bistable periodic attractors,
which can be ascribed to saddle-node bifurcation.
(2) The increase of the excitation amplitude may cause initial-sensitive excessive motion,
which can be due to heteroclinic bifurcation.
(3) Under positive coefficients of the feedback gain, the delayed position feedback and
the delayed velocity feedback can reduce saddle-node bifurcation and heteroclinic
bifurcation so as to suppress jump and initial-sensitive excessive motion. Comparatively,
the former can also reduce the amplitude of the response, while the latter may
not; the former works well if the time delay does not exceed the first stability switch of
the trivial equilibrium, while the latter does not have that restriction.
This work presents a detailed analysis of jump and initial-sensitive excessive motion
of a typical relative rotation system, which may be beneficial for the performance
improvement of rotors and main bearings. The relevant experimental investigations will be
included in our future work.
Author Contributions: Conceptualization, H.S.; formal analysis, H.S.; funding acquisition, H.S.;
Investigation, Z.C.; methodology, H.S.; project administration, H.S.; software, Z.C.; supervision, H.S.;
validation, H.S.; visualization, Z.C.; writing—original draft, Z.C. and H.S.; writing—review & editing,
H.S. All authors have read and agreed to the published version of the manuscript.
Funding: This research was funded by the National Natural Science Foundation of China, grant
number 11472176.
References
1. Liang, Y.; Li, N. Optimal vibration control for nonlinear systems of tracked vehicle half-car suspensions. Int. J. Control. Autom.
Syst. 2017, 15, 1675–1683. [CrossRef]
2. Renson, L.; Noël, J.P. Complex dynamics of a nonlinear aerospace structure: Numerical continuation and normal modes. Nonlinear
Dyn. 2015, 79, 1293–1309. [CrossRef]
3. Shi, J.; Gou, X. Bifurcation and Erosion of Safe Basin for a Spur Gear System. Int. J. Bifurc. Chaos 2018, 28, 1830048. [CrossRef]
4. Li, J.; Wu, H. Bifurcation, chaos, and their control in a wheelset model. Math. Methods Appl. Sci. 2020, 43, 7152–7174. [CrossRef]
5. Luo, Z.; Wang, J. Research on vibration performance of the nonlinear combined support-flexible rotor system. Nonlinear Dyn.
2019, 98, 113–128. [CrossRef]
6. Li, Y.; Luo, Z. Dynamic modeling and stability analysis of a rotor-bearing system with bolted-disk joint. Mech. Syst. Signal Process.
2021, 158, 107778. [CrossRef]
7. Shi, P.; Liu, B. Chaotic motion of some relative rotation nonlinear dynamic system. Acta Phys. Sin. 2008, 57, 1321–1328.
8. Li, Z.; Gou, X. Erosion and bifurcation of safety-attraction basin for multi-state meshing gear transmission system under tooth
safety condition. J. Vibration Shock 2021, 40, 63–74.
9. Wang, K.; Guan, X. Precise periodic solutions and uniqueness of periodic solutions of some relative rotation nonlinear dynamic
system. Acta Phys. Sin. 2010, 59, 3648–3653. [CrossRef]
10. Xiao, L.; Xuan, C. The periodic solution problem of a relative rotation nonlinear dynamic system with time-varying stiffness. Acta
Phys. Sin. 2013, 62, 21–25.
11. Li, X.; Yan, J. The periodic solution problem of a relative rotation nonlinear system with nonlinear elastic force and generalized
damping force. Acta Phys. Sin. 2014, 63, 36–41.
12. Shi, P.; Han, D. Chaos and chaotic control in a relative rotation nonlinear dynamical system under parametric excitation. Chin.
Phys. B 2010, 19, 116–121.
13. Verichev, N. Chaotic torsional vibration of imbalanced shaft driven by a limited power supply. J. Sound Vib. 2012, 331, 384–393.
[CrossRef]
14. Liu, B.; Zhao, H. Bifurcation and chaos of some relative rotation system with triple-well Mathieu-Duffing oscillator. Acta Phys. Sin.
2014, 63, 174502.
15. Liu, S.; Tian, S. Chaos of a kind of nonlinear relative rotation system based on the effect of Coulomb friction. Acta Phys. Sin. 2015,
64, 247–254.
16. Zhu, L.; Li, Z. Evolutionary mechanism of safety performance for spur gear pair based on meshing safety domain. Nonlinear Dyn.
2021, 104, 215–239. [CrossRef]
17. Ju, J.; Wei, L. Dynamics and nonlinear feedback control for torsional vibration bifurcation in main transmission system of scraper
conveyor direct-driven by high-power PMSM. Nonlinear Dyn. 2018, 93, 307–321. [CrossRef]
18. Shang, H. Pull-in instability of a typical electrostatic MEMS resonator and its control by delayed feedback. Nonlinear Dyn. 2017,
90, 171–183. [CrossRef]
19. Wang, Q.; Wu, H. The effect of fractional damping and time-delayed feedback on the stochastic resonance of asymmetric SD
oscillator. Nonlinear Dyn. 2022, 107, 2099–2114. [CrossRef]
20. Zhao, Y.; Li, C. The delayed feedback control to suppress the vibration in a torsional vibrating system. Acta Phys. Sin. 2011, 60,
417–425.
21. Shang, H.; Han, Y. Suppression of chaos and basin erosion in a nonlinear relative rotation system by delayed position feedback.
Acta Phys. Sin. 2014, 63, 88–95.
22. Rezaei, M.; Khadem, E.S.; Friswell, I.M. Energy harvesting from the secondary resonances of a nonlinear piezoelectric beam
under hard harmonic excitation. Meccanica 2020, 55, 1463–1479. [CrossRef]
23. Siewe, S.M.; Hegazy, H.U. Homoclinic bifurcation and chaos control in MEMS resonators. Appl. Math. Model. 2011, 35, 5533–5552.
[CrossRef]
24. Karličić, D.; Chatterjee, T.; Cajić, M.; Adhikari, S. Parametrically amplified
Mathieu-Duffing nonlinear energy harvesters. J. Sound Vib. 2020, 488, 115677.
25. Zhu, Y.; Shang, H. Multistability of the vibrating system of a micro resonator. Fractal Fract. 2022, 6, 141. [CrossRef]
26. Rega, G.; Lenci, S. Dynamical integrity and control of nonlinear mechanical oscillators. J. Vib. Control 2008, 14, 159–179. [CrossRef]
27. Liu, C.; Yan, Y.; Wang, W. Resonances and chaos of electrostatically actuated arch micro/nanoresonators with time delay velocity
feedback. Chaos Solitons Fractals 2019, 10, 109512. [CrossRef]
mathematics
Article
The General Dispersion Relation for the Vibration Modes of
Helical Springs
Leopoldo Prieto *, Alejandro Quesada, Ana María Gómez Amador and Vicente Díaz
Department of Mechanical Engineering, Universidad Carlos III de Madrid, 28911 Leganes, Spain;
alejandro@ing.uc3m.es (A.Q.); amgomez@ing.uc3m.es (A.M.G.A.); vdiaz@ing.uc3m.es (V.D.)
* Correspondence: leprieto@ing.uc3m.es
Abstract: A system of mathematical equations was developed for the calculation of the natural
frequencies of helical springs, and its predictions were compared with finite element simulations in
ANSYS®. The authors derive the general equations governing the helical spring vibration relative to the
Frenet trihedral formed by the tangent, normal and binormal unit vectors of the spring midline.
The dispersion relation ω = f(k) has been obtained to model a wave traveling along the axis of
the wire.
Keywords: helical spring; vibration; Frenet trihedral; dispersion relation; natural frequency
MSC: 70B15
Citation: Prieto, L.; Quesada, A.; Gómez Amador, A.M.; Díaz, V. The General Dispersion Relation for the Vibration Modes of Helical Springs. Mathematics 2022, 10, 2698. https://doi.org/10.3390/math10152698
Academic Editor: Dan B. Marghitu
Received: 24 June 2022
Accepted: 28 July 2022
Published: 30 July 2022
Publisher’s Note: MDPI stays neutral with regard to jurisdictional claims in published maps and institutional affiliations.
Copyright: © 2022 by the authors. Licensee MDPI, Basel, Switzerland. This article is an open access article distributed under the terms and conditions of the Creative Commons Attribution (CC BY) license (https://creativecommons.org/licenses/by/4.0/).
1. Introduction
Helical springs are one of the most frequently used elastic elements in mechanical
engineering. They are used in the most diverse structures as elastic energy accumulators
supplementing damping devices, e.g., automobile or railroad suspensions, and in advance
and return devices, e.g., camshafts and valves of internal combustion engines (Wahl [1],
Shigley [2]; Kobelev [3]). One of the spring failure modes is caused by resonance vibration
that occurs when the spring is excited with a periodic signal of frequency equal to its
natural frequency.
Love [4] develops equations to study the static response of helical springs subjected
to large deformations. Stokes [5] investigates vibrations in the case of a spring subjected
to shock loads. Gironnet and Louradour [6] determine more precise expressions for the
calculation of the natural frequencies of the helical spring. There are several numerical
methods to determine the natural frequencies, the main ones being: the transfer matrix
method (Yildirim [7]), the dynamic stiffness formulation (Lee [8]), and the pseudo-spectral
method (Lee [9]). Becker [10] determines with the transfer matrix method the resonant
frequencies of a spring subjected to an axial compression load. Jiang [11] studies forced
vibrations and wave propagation in springs using the Laplace transform.
Wahl [1] already proposed in the middle of the 20th century the equations that are
widely used today for the design of springs. Among these equations, Equation (1), which
provides the stiffness constant of the spring subjected to an axial force, and Equation (2),
which provides the natural frequency of the spring placed between two parallel flat plates,
stand out.
$$
k_{Wahl} = \frac{G d^{4}}{8 D^{3} N} \tag{1}
$$
where kWahl is the spring axial stiffness, G is the shear modulus of elasticity, d is the wire
diameter, D is the helix diameter and N is the number of active coils.
$$
f = \frac{m}{2}\sqrt{\frac{k_{Wahl}}{W}}, \qquad m = 1, 2, 3, \ldots \tag{2}
$$
where f is the natural frequency of the spring in Hz for the harmonic m, and W is the mass of the
spring in kg. The fundamental frequency is determined for m = 1 and is usually the most
important frequency in practice. Replacing in Equation (2) the value of the axial stiffness,
we obtain Equation (3), also called Haringx’s equation [12], after the author who experimentally verified
its validity.
$$
f = \frac{d}{2\pi R^{2} N}\sqrt{\frac{G}{32\rho}} \tag{3}
$$
R being the helix radius and ρ the density of the material.
Equation (3) indicates that, for a given material, the fundamental frequency of a helical
spring is proportional to the wire diameter and inversely proportional to the product of the
square of the helix radius and the number of active coils, i.e., the resonance frequency of the spring
depends on all the spring design parameters. If a higher frequency is desired for the same
spring diameter, it is sufficient to increase the wire diameter (this increases the spring
stiffness), and vice versa. To vary the resonance frequency while keeping the
wire diameter constant, it suffices to change the spring diameter or the number of coils of the spring.
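The step from Equation (2) to Equation (3) can be checked numerically. The following is a minimal sketch, assuming the spring mass W is that of a circular wire of length πDN (helix angle neglected); the material and geometry values are illustrative assumptions, not data from the paper.

```python
import math

def wahl_stiffness(G, d, D, N):
    """Equation (1): axial stiffness k = G d^4 / (8 D^3 N)."""
    return G * d**4 / (8.0 * D**3 * N)

def spring_mass(rho, d, D, N):
    """Mass of the active wire, taking its length as pi*D*N (assumption)."""
    return rho * (math.pi * d**2 / 4.0) * (math.pi * D * N)

def frequency_eq2(G, rho, d, D, N, m=1):
    """Equation (2): f = (m / 2) * sqrt(k_Wahl / W)."""
    k = wahl_stiffness(G, d, D, N)
    W = spring_mass(rho, d, D, N)
    return 0.5 * m * math.sqrt(k / W)

def frequency_eq3(G, rho, d, D, N):
    """Equation (3), written with the helix radius R = D / 2."""
    R = D / 2.0
    return d / (2.0 * math.pi * R**2 * N) * math.sqrt(G / (32.0 * rho))

# Illustrative, assumed values in SI units (steel-like material, 9 active coils).
G, rho = 79.3e9, 7850.0
d, D, N = 0.010, 0.080, 9

print(frequency_eq2(G, rho, d, D, N), frequency_eq3(G, rho, d, D, N))
```

Under that mass assumption both expressions give the same value, and doubling d in `frequency_eq3` doubles the frequency, which is the proportionality described above.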
All the expressions shown above are formulated with respect to an inertial reference system
whose main X axis coincides with the symmetry axis of the helical spring.
By design of the machine itself, the spring is always inserted between two masses in
relative oscillatory motion and its function is to keep them away from each other ensuring
at all times that they do not come into contact. The design of the helical spring depends
directly on the inertia between the masses at any given moment. In addition to the diameter
of the wire and the spring itself, the number of coils and the pitch are responsible for
achieving non-interaction between the mechanical elements in relative motion.
It is interesting to note that the behavior of the spring changes radically when its
operation is associated with a mass much greater than that of the spring itself, for example,
in the damping systems of a railway bogie such as the one in Figure 1. In that case, the
mass of the spring is much lower than the mass of the railway car it is supporting and
Equations (2) and (3) are no longer valid. Den Hartog [13] shows that in that configuration
the natural frequency follows Equation (4).
$$
f = \frac{1}{2\pi}\sqrt{\frac{g}{\delta_{est}}} \tag{4}
$$
where g is the gravitational acceleration and δest is the static deflection of the spring under the
weight of the suspended load.
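As a quick illustration of Equation (4), the sketch below evaluates the frequency for a given static deflection. The function name and the default value of g (standard gravity) are choices made here for illustration.

```python
import math

def den_hartog_frequency(delta_st, g=9.80665):
    """Equation (4): f = (1 / (2*pi)) * sqrt(g / delta_st), in Hz.

    delta_st is the static deflection in metres; g defaults to standard gravity.
    """
    if delta_st <= 0:
        raise ValueError("static deflection must be positive")
    return math.sqrt(g / delta_st) / (2.0 * math.pi)

# Example: frequency for an assumed static deflection of 50 mm.
print(den_hartog_frequency(0.050))
```

Because the frequency scales with the inverse square root of the deflection, a spring that deflects four times as much under the load has half the natural frequency.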
Sauvage [15] and Campedelli [16] studied in detail the behavior of the springs of a
bogie and simulated in finite elements the anomaly presented by the springs during axial
compression called “effort de chasse”. This anomaly consists of a rotation and a lateral
displacement appearing on the bearing surface of the spring. The “effort de chasse”, also
called transverse flexibility of the spring, is caused by the forces and moments that appear
at the ends of the spring during its axial compression. Campedelli [16] determines that the
transverse flexibility is determined by both the geometry of the spring end and the way in
which the gap between the coils closes during axial compression. The “effort de chasse”
is a relevant anomaly in railway suspension springs since, in their displacement, the
springs could interfere with other mechanical elements of the bogie, causing interference
and breakage. To avoid this, the railway standard [17] requires a test for the springs of the
highest category according to the scheme in Figure 2a. From the test, both the direction
and the magnitude of the transverse bowing force (Φc) of the spring subjected to a defined axial load are
obtained. The direction of the transverse force is marked on the spring by a permanent
marking so that it can be taken into account when mounting the spring on the bogie. Figure 2b shows an example of
a bogie spring assembly compensating for the spring bowing forces.
Figure 2. “Effort de chasse” or transverse flexibility: (a) schematic of the test device; (b) mounting on
a spring bogie compensating the “effort de chasse”.
Kobelev [18] also studies another type of anomaly in the behavior of springs, namely
the transverse vibration of the spring when it oscillates close to its resonant
frequencies. Kobelev [18] found that the fundamental natural frequency of transverse
oscillations tends to zero when lateral buckling of the spring occurs.
Yildirim [19] demonstrates that Equation (1), which is usually used to determine
the spring stiffness, is only valid for spring angles α ≤ 10°. Figure 3a shows the spring
geometry indicating the position of the angle α.
Yildirim [19] indicates that Equations (1) and (2) only take into account the effect of
the torsion of the spring section when compressing it. However, as Yildirim [19] points out,
when the axial compression force acts, stresses due to torsion and bending moments,
as well as normal and shear forces on the section, appear. These last three effects are negligible
for spring angles α ≤ 10°, where the torsional effect dominates. Yildirim [19] proposes
a global equation to determine the deformation of the spring subjected to an axial force.
Additionally, Kato [20] recently proposed an equation valid outside the elastic range to determine
large spring deformations.
In general, the previous authors study the spring behavior using a conventional
Cartesian system in which the static or dynamic displacements are referred to the
axis of symmetry of the helical spring, X. With this methodology, the global behavior
of the spring as a whole can be known with precision, i.e., as a series of coils of a
certain diameter and a certain wire thickness, in some cases accounting for
the end effects.
This type of methodology, therefore, allows us to analyze the elastic effect of the spring
as a whole, but it is not able to explain some mechanical behaviors intrinsic to its own
constitution, i.e., it does not study the phenomenon occurring in the coil itself. In this way,
anomalous spring behaviors such as those described above cannot be fully explained.
In order to achieve a reference system that meets these needs, the Frenet trihedral is
used in this article, since it is intrinsic to the wire of the spring itself, configuring a reference
system positioned according to the neutral line of the helix of the coil as shown in Figure 3b.
In the present study, the equations of motion of the coil of a helical spring are obtained
in order to determine its modes of vibration. This is an original and novel approach since
the usual practice is to obtain the modes of vibration of the complete spring referred to a
reference system that follows its overall motion. Additionally, a sensitivity analysis has been
performed for the main geometrical parameters of the coil, i.e., the helix diameter, the wire
diameter and the pitch. The vibration modes obtained have been validated satisfactorily with
finite element calculations in ANSYS® R17 (release 17).
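Taking as the scalar variable the arc length s of the midline of the coil, we obtain
the well-known Frenet–Serret equations, Equation (5), which give the derivative with
respect to s of the vectors of the trihedral as a function of the curvature of the midline
of the coil.
$$
\begin{pmatrix} \dfrac{d\vec{t}}{ds} \\[4pt] \dfrac{d\vec{n}}{ds} \\[4pt] \dfrac{d\vec{b}}{ds} \end{pmatrix}
=
\begin{pmatrix} \vec{t}\,' \\ \vec{n}\,' \\ \vec{b}\,' \end{pmatrix}
=
\begin{pmatrix} 0 & \Omega_3 & -\Omega_2 \\ -\Omega_3 & 0 & \Omega_1 \\ \Omega_2 & -\Omega_1 & 0 \end{pmatrix}
\begin{pmatrix} \vec{t} \\ \vec{n} \\ \vec{b} \end{pmatrix}
\tag{5}
$$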
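Equation (5) can be checked numerically on a circular helix. The sketch below assumes that Ω1 is the torsion, Ω3 the curvature and Ω2 = 0 for such a helix, and uses the standard closed-form tangent, normal and binormal; the radius and pitch are illustrative values.

```python
import numpy as np

def helix_frame(s, R, c):
    """Tangent, normal and binormal of the helix (R cos(phi), R sin(phi), c*phi) at arc length s."""
    L = np.hypot(R, c)              # arc length travelled per radian of turn
    phi = s / L
    t = np.array([-R * np.sin(phi), R * np.cos(phi), c]) / L
    n = np.array([-np.cos(phi), -np.sin(phi), 0.0])
    b = np.cross(t, n)
    return t, n, b

def frenet_matrix(omega1, omega2, omega3):
    """Skew-symmetric matrix of Equation (5)."""
    return np.array([[0.0, omega3, -omega2],
                     [-omega3, 0.0, omega1],
                     [omega2, -omega1, 0.0]])

# Assumed geometry: helix radius 40 mm, pitch 30 mm (c = pitch / (2*pi)).
R, c = 0.040, 0.030 / (2.0 * np.pi)
torsion, curvature = c / (R**2 + c**2), R / (R**2 + c**2)
Omega = frenet_matrix(torsion, 0.0, curvature)

# Central finite difference of the stacked frame (rows t, n, b) versus Omega @ frame.
s, h = 0.1, 1e-6
frame = np.vstack(helix_frame(s, R, c))
d_frame = (np.vstack(helix_frame(s + h, R, c)) - np.vstack(helix_frame(s - h, R, c))) / (2.0 * h)
residual = np.max(np.abs(d_frame - Omega @ frame))
print(residual)
```

The residual is limited only by finite-difference round-off, which suggests that the sign pattern of the matrix in Equation (5) matches the usual Frenet–Serret formulas under this identification of Ω1 and Ω3.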
The variation of any vector function, for example, the rotation and translation of the
section, with respect to the arc length s follows Equation (8a), which we
will simplify using the notation of Equation (8b), where [R] denotes the tensor associated
with the operation $\vec{\Omega}\times$.
$$
\frac{d\vec{V}}{ds} = \left(\frac{d\vec{V}}{ds}\right)_{S} + \vec{\Omega}\times\vec{V} \tag{8a}
$$
$$
\left(\vec{V}\right)' = \left(\vec{V}\right)'_{S} + [R]\,\vec{V} \tag{8b}
$$
$$
[R] = \begin{pmatrix} 0 & \Omega_3 & -\Omega_2 \\ -\Omega_3 & 0 & \Omega_1 \\ \Omega_2 & -\Omega_1 & 0 \end{pmatrix} \tag{8c}
$$
where $\left(\frac{d\vec{V}}{ds}\right)_{S}$ is the derivative calculated as if the Frenet trihedron were fixed along the curve.
That is, the variation of the function along the curve has two terms: the first represents the
variation of the vector within the basis itself and the second represents the variation of the
basis itself along the curve.
Due to the shear produced by the shear forces, the i-axis is not transformed into the
i-axis1 by rotation and translation. Consequently, the tangent vector to a point on the
midline of the part will no longer coincide with the first axis of the trihedron as it did in the
initial equilibrium state but will form an angle $\vec{\beta}$, called the shear angle, proportional to
the shear stress. If we do not consider the deformation due to the shear stress, the shear
angle vanishes, so that the tangent to the midline of the part will be perpendicular to the
section or, what is the same, will coincide with the first axis of the trihedron.
The derivative of $\vec{\psi}$ with respect to s is given to first order by Equation (9a,b).
$$
\left(\vec{\psi}\right)' = [M_N]\,\vec{N} + [R]\,\vec{\psi} \tag{9a}
$$
$$
[M_N] = \begin{pmatrix} \dfrac{1}{k_1 G I_1} & 0 & 0 \\ 0 & \dfrac{1}{E I_2} & 0 \\ 0 & 0 & \dfrac{1}{E I_3} \end{pmatrix}
= \begin{pmatrix} \dfrac{1}{I_1^{*}} & 0 & 0 \\ 0 & \dfrac{1}{I_2^{*}} & 0 \\ 0 & 0 & \dfrac{1}{I_3^{*}} \end{pmatrix} \tag{9b}
$$
where [MN] is the tensor associated with the increment of the torsional and bending moments, E is the
modulus of elasticity, G is the shear modulus of elasticity, Ii is the geometric moment of inertia
and k1 is the torsion constant. $\vec{N}$ is the vector of internal moment increment. Its first component
is the torsional moment, and the other two components are the bending moments in the
normal and binormal directions. I1∗ is the torsional stiffness, I2∗ is the bending stiffness
along the normal axis, and I3∗ is the bending stiffness along the binormal axis.
where [MF] is the tensor associated with the forces and ki are the shear constants, the so-called
Timoshenko k-factors [21]; for symmetric sections, k2 = k3. $\vec{F}$ is the vector of internal
force increment. Its first component is the axial force, and the other two components are
the shear forces along the normal and binormal directions. EA is the axial stiffness and k2GA
is the shear stiffness.
Equations (9a,b) and (10a–c) constitute a system of six scalar equations for the twelve
components of $\vec{\psi}$, $\vec{\eta}$, $\vec{F}$ and $\vec{N}$. The remaining six equations are obtained from the dynamic
study of the system, for which the linear momentum and angular momentum equations
of the element of the part comprised between s and s + ds are posed.
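Vectorial Equations
$$
\left(\vec{\psi}\right)' = [M_N]\,\vec{N} + [R]\,\vec{\psi}
$$
$$
\left(\vec{\eta}\right)' = [M_F]\,\vec{F} + [S]\,\vec{\psi} + [R]\,\vec{\eta}
$$
$$
\left(\vec{F}\right)' = \lambda\,\ddot{\vec{\eta}} + [R]\,\vec{F}
$$
$$
\left(\vec{N}\right)' = \frac{\lambda}{A}\,[I]\,\ddot{\vec{\psi}} + [S]\,\vec{F} + [R]\,\vec{N}
$$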
$$
\vec{\eta}(s,t) = \mathrm{Re}\left[\vec{\eta}_0(s)\, e^{i(ks-\omega t)}\right] \tag{13b}
$$
$$
\vec{F}(s,t) = \mathrm{Re}\left[\vec{F}_0(s)\, e^{i(ks-\omega t)}\right] \tag{13c}
$$
$$
\vec{N}(s,t) = \mathrm{Re}\left[\vec{N}_0(s)\, e^{i(ks-\omega t)}\right] \tag{13d}
$$
The new parameters introduced are the wavenumber, k, and the angular frequency of
the wave, ω (rad/s). The wavenumber k is related to the wavelength λ of the vibrating
wave by Equation (14).
$$
k = \frac{2\pi}{\lambda} \tag{14}
$$
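Substituting into the system of equations in Table 1, we arrive at the following system,
Equation (15a–d).
$$
[\Lambda]\,\vec{F} - \lambda\omega^{2}\,\vec{\eta} = 0 \tag{15a}
$$
$$
[\Lambda]\,\vec{\eta} + [M_F]\,\vec{F} + [S]\,\vec{\psi} = 0 \tag{15b}
$$
$$
[\Lambda]\,\vec{\psi} + [M_N]\,\vec{N} = 0 \tag{15c}
$$
$$
[\Lambda]\,\vec{N} - \frac{\lambda\omega^{2}}{A}\,[I]\,\vec{\psi} + [S]\,\vec{F} = 0 \tag{15d}
$$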
where the matrix [Λ] follows the expression of Equation (16).
$$
[\Lambda] = \begin{pmatrix} -ik & k_0 & 0 \\ -k_0 & -ik & \tau_0 \\ 0 & -\tau_0 & -ik \end{pmatrix} \tag{16}
$$
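As a worked step, the origin of Equation (15a) can be sketched as follows. The sketch assumes that the amplitudes $\vec{F}_0$ and $\vec{\eta}_0$ can be treated as constant along s, and that $[\Lambda] = [R] - ik[I_3]$ with $\Omega_1 = \tau_0$, $\Omega_2 = 0$, $\Omega_3 = k_0$ and $[I_3]$ the 3 × 3 identity; this identification is inferred by comparing Equations (8c) and (16) and is not stated explicitly in the text. For a wave of the form $e^{i(ks-\omega t)}$,

$$
\frac{\partial}{\partial s} \rightarrow ik, \qquad \frac{\partial^{2}}{\partial t^{2}} \rightarrow -\omega^{2},
$$

so the force equation $(\vec{F})' = \lambda\,\ddot{\vec{\eta}} + [R]\,\vec{F}$ becomes

$$
ik\,\vec{F} = -\lambda\omega^{2}\,\vec{\eta} + [R]\,\vec{F}
\quad\Longrightarrow\quad
\left([R] - ik[I_3]\right)\vec{F} - \lambda\omega^{2}\,\vec{\eta} = 0,
$$

which has the form of Equation (15a). The same substitution applied to the other three vector equations gives Equations (15b–d).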
Solving for $\vec{\eta}$ in Equation (15a) and replacing it in Equation (15b), and similarly
solving for $\vec{N}$ in Equation (15c) and replacing it in Equation (15d), we obtain the following
vector Equation (17a,b):
$$
[A]\,\vec{F} + [S]\,\vec{\psi} = 0 \tag{17a}
$$
$$
[B]\,\vec{\psi} + [S]\,\vec{F} = 0 \tag{17b}
$$
where the matrices [A] and [B] are calculated with Equations (18) and (19).
$$
[A] = \frac{1}{\lambda\omega^{2}}\,[\Lambda]^{2} + [M_F] \tag{18}
$$
$$
[B] = -[\Lambda]\,[M_N]^{-1}\,[\Lambda] - \frac{\lambda\omega^{2}}{A}\,[I] \tag{19}
$$
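The elimination leading to Equation (17a,b) can be checked numerically. The sketch below builds [A] from Equation (18) and takes [B] as −[Λ][M_N]⁻¹[Λ] − (λω²/A)[I], the form that follows from eliminating N between Equations (15c) and (15d) when [M_N] is invertible. All numerical values, and the form used for [S], are hypothetical placeholders chosen only to exercise the algebra.

```python
import numpy as np

def lambda_matrix(k, k0, tau0):
    """Matrix of Equation (16)."""
    return np.array([[-1j * k, k0, 0.0],
                     [-k0, -1j * k, tau0],
                     [0.0, -tau0, -1j * k]], dtype=complex)

# Hypothetical, order-one values (not taken from the paper).
k, k0, tau0 = 0.7, 1.3, 0.4
lam, omega, area = 2.0, 1.5, 0.8
M_F = np.diag([0.5, 1.2, 1.2])       # stands in for [M_F]
M_N = np.diag([0.9, 0.6, 0.6])       # stands in for [M_N]
inertia = np.diag([2.0, 1.0, 1.0])   # stands in for [I]
S = np.array([[0.0, 0.0, 0.0],
              [0.0, 0.0, 1.0],
              [0.0, -1.0, 0.0]])     # assumed form of [S]

Lam = lambda_matrix(k, k0, tau0)
A = Lam @ Lam / (lam * omega**2) + M_F                                    # Equation (18)
B = -Lam @ np.linalg.inv(M_N) @ Lam - (lam * omega**2 / area) * inertia  # from (15c) and (15d)

# Arbitrary complex amplitudes for F and psi.
rng = np.random.default_rng(0)
F = rng.normal(size=3) + 1j * rng.normal(size=3)
psi = rng.normal(size=3) + 1j * rng.normal(size=3)

eta = Lam @ F / (lam * omega**2)           # eta solved from (15a)
N_vec = -np.linalg.inv(M_N) @ Lam @ psi    # N solved from (15c)

# Left-hand sides of (15b) and (15d) after the substitution.
lhs_15b = Lam @ eta + M_F @ F + S @ psi
lhs_15d = Lam @ N_vec - (lam * omega**2 / area) * inertia @ psi + S @ F

print(np.allclose(lhs_15b, A @ F + S @ psi), np.allclose(lhs_15d, B @ psi + S @ F))
```

The two comparisons confirm that, with these definitions, Equations (15b) and (15d) reduce to the left-hand sides of Equations (17a) and (17b) for arbitrary amplitudes.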
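Equation (17a,b) constitutes a system of six equations with six unknowns
(F1, F2, F3, ψ1, ψ2, ψ3) and with null independent terms. It is therefore a homogeneous
system and, for it to have a solution other than the trivial, null solution, its determinant
must vanish, i.e., Equation (20).
$$
\begin{vmatrix}
\lambda\omega^{2}E^{*} - (k^{2} + \beta_9) & \beta_4 - 2ik\Omega_3 & \beta_5 + 2ik\Omega_2 & 0 & 0 & 0 \\
2ik\Omega_3 + \beta_4 & \lambda\omega^{2}G_1^{*} - (k^{2} + \beta_8) & \beta_6 - 2ik\Omega_1 & 0 & 0 & \lambda\omega^{2} \\
\beta_5 - 2ik\Omega_2 & \beta_6 + 2ik\Omega_1 & \lambda\omega^{2}G_2^{*} - (k^{2} + \beta_7) & 0 & -\lambda\omega^{2} & 0 \\
0 & 0 & 0 & k^{2}I_1^{*} + \alpha_4 - \omega^{2}\lambda_1 & -\beta_4 I_3^{*} + ik\alpha_1 & -\beta_5 I_2^{*} - ik\alpha_2 \\
0 & 0 & 1 & -\beta_4 I_3^{*} - ik\alpha_1 & k^{2}I_2^{*} + \alpha_5 - \lambda_2\omega^{2} & -\beta_6 I_1^{*} + ik\alpha_3 \\
0 & 1 & 0 & \beta_5 I_2^{*} - ik\alpha_2 & \beta_6 I_1^{*} + ik\alpha_3 & -k^{2}I_3^{*} + \omega^{2}\lambda_3 - \alpha_6
\end{vmatrix} = 0 \tag{20}
$$
Expanding the determinant gives a polynomial in ω, Equation (21).
$$
m_1\omega^{12} + m_2\omega^{10} + m_3\omega^{8} + m_4\omega^{6} + m_5\omega^{4} + m_6\omega^{2} + m_7 = 0 \tag{21}
$$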
where the coefficients of the polynomial are calculated according to the expressions given
in Equation (A1) of Appendix A.
The real roots in ω 2 of Equation (21) will give us the number of branches of the
dispersion equation for each value of the wavenumber k.
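Since Equation (21) contains only even powers of ω, it can be treated as a degree-six polynomial in ω². The sketch below shows one way to extract the real branches with `numpy.roots`. The coefficients used here are hypothetical, built from chosen roots only to exercise the function; the actual m1 to m7 come from Equation (A1) of Appendix A and depend on k.

```python
import numpy as np

def dispersion_branches(coeffs, tol=1e-8):
    """Return the real, non-negative omega values solving Equation (21).

    coeffs = [m1, ..., m7]: coefficients of the polynomial in x = omega**2,
    ordered from the highest power to the constant term.
    """
    roots = np.roots(coeffs)
    # Keep roots whose imaginary part is negligible relative to their size.
    is_real = np.abs(roots.imag) <= tol * np.maximum(1.0, np.abs(roots))
    x = roots[is_real].real
    # A real omega requires omega**2 >= 0 (allowing for round-off near zero).
    x = np.clip(x[x > -tol], 0.0, None)
    return np.sqrt(np.sort(x))

# Hypothetical coefficients from assumed roots in omega**2:
# three positive, one negative, and one complex-conjugate pair.
coeffs = np.real(np.poly([1.0, 4.0, 9.0, -2.0, 1.0 + 2.0j, 1.0 - 2.0j]))
print(dispersion_branches(coeffs))
```

In this example only the three positive roots in ω² produce branches; the negative and complex roots are discarded. Repeating the call over a range of k, with the coefficients recomputed for each k, would trace the dispersion curves.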
If, in the dispersion relation obtained for the spring, we set the independent term
m7 to zero (which is equivalent to making ω → 0), we obtain the solutions for the wavenumber
shown in Equation (22a,b).
$$
k^{*} = 0 \tag{22a}
$$
$$
k^{**} = \sqrt{\Omega_1^{2} + \Omega_3^{2}} \tag{22b}
$$
That is to say, there exists a nonzero value of the wavenumber k (to which
corresponds, therefore, a finite wavelength) for which one of the real and positive roots of
the dispersion relation (degree 12) vanishes, so that the branch corresponding to this solution,
plotted against k, is tangent to the horizontal axis at k = k∗∗.
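The value k∗∗ has a simple geometric reading for a circular helix. The sketch below assumes that Ω1 is the torsion and Ω3 the curvature of a helix of mean diameter D and pitch h, using the standard closed-form expressions; the numerical values of D and h are illustrative.

```python
import math

def helix_curvatures(D, h):
    """Return (torsion, curvature) of a circular helix with mean diameter D and pitch h."""
    R = D / 2.0
    c = h / (2.0 * math.pi)        # axial advance per radian of turn
    denom = R**2 + c**2
    return c / denom, R / denom

def critical_wavenumber(D, h):
    """Equation (22b) with Omega_1 = torsion and Omega_3 = curvature (assumption)."""
    omega1, omega3 = helix_curvatures(D, h)
    return math.hypot(omega1, omega3)

# Assumed geometry: D = 80 mm, pitch h = 30 mm.
D, h = 0.080, 0.030
k_crit = critical_wavenumber(D, h)
wavelength = 2.0 * math.pi / k_crit
turn_length = math.sqrt(h**2 + (math.pi * D)**2)   # wire length of one coil
print(k_crit, wavelength, turn_length)
```

Under these assumptions the wavelength 2π/k∗∗ coincides with the wire length of one coil, so the vanishing root corresponds to a wave that fits exactly once around a turn.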
If we let k → 0, we obtain the number of branches of the dispersion relation
(which is equivalent to the number of real roots of the equation). For this case, the real
and positive branches are determined by Equation (23a–c):
$$
\omega = 0 \tag{23a}
$$
The result of the oscillation is bending accompanied by torsion which tends to make
the ends of the coil open and close in time, oscillating according to the movement. This is
composed of two modes: opening and closing of the ends, the coil remaining at all times
inscribed in the original helix of the spring (of diameter D) and the other mode consists
of the ends of the coil entering and leaving the original helix of diameter D, forming an
ellipse. This last mode responds to the reason why the springs, under certain dynamic
conditions, pull the coils out of their original helix, and may impact other elements of the
machine, thus affecting its operation. The authors consider that this can explain part of the “effort
de chasse” anomaly described in Section 1.
In this article, we consider the case of a freely oscillating coil. In the case of several coils
joined together, it would be necessary to consider the constraints at their ends, as shown in
Figure 7 for the end of a squared and ground spring, that is to say, an end with a helix angle of
zero degrees and a flat seat, which allows a better load transfer. The assembly thus defined
develops an oscillatory mode in which the spring as a whole adopts a barrel configuration.
Pearson [23] studies the important contribution of the ends to the calculation of the spring
natural frequencies.
The dispersion relation allows us to obtain the normal modes of oscillation by applying
the appropriate boundary conditions. For example, in the case of a helical spring with one
end clamped and the other free, the boundary conditions would be as follows: the rotation
vector $\vec{\psi}$ and the translation vector $\vec{\eta}$ vanish at the clamped end, and the internal torque
$\vec{N}$ and the internal force $\vec{F}$ vanish at the free end. Assuming the clamped end at s = 0 and
the free end at s = L, Equation (24) is satisfied, which implies that the wavenumber k takes
the values of Equation (25).
$$
\vec{\psi}(0,t) = \vec{\psi}(L,t) = 0 \tag{24}
$$
$$
k = \frac{n\pi}{2L} = \frac{n\pi}{2N\sqrt{h^{2} + \pi^{2}D^{2}}}, \qquad n = 1, 3, \ldots \tag{25}
$$
Entering the value of k calculated according to Equation (25) in the dispersion
Equation (21), the natural oscillation frequencies for the helical spring are obtained.
In order to validate the proposed theoretical model, a comparison with finite elements
and with the experimental tests of a helical spring is made below.
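The wavenumbers of Equation (25) are straightforward to tabulate. The sketch below assumes that h is the pitch, D the helix diameter and N the number of coils, so that the wire length is L = N√(h² + π²D²); the function names and the geometry are illustrative.

```python
import math

def wire_length(N, h, D):
    """Developed length of N coils with pitch h and helix diameter D."""
    return N * math.sqrt(h**2 + (math.pi * D)**2)

def clamped_free_wavenumbers(N, h, D, count=3):
    """First `count` wavenumbers of Equation (25): k = n*pi / (2L), n = 1, 3, 5, ..."""
    L = wire_length(N, h, D)
    return [n * math.pi / (2.0 * L) for n in range(1, 2 * count, 2)]

# Assumed geometry: 9 coils, pitch 30 mm, helix diameter 80 mm.
for k in clamped_free_wavenumbers(9, 0.030, 0.080):
    print(k)
```

Each of these values of k would then be substituted into the dispersion Equation (21) to obtain the corresponding natural frequencies.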
In finite elements, a meshing has been performed using extruded hexahedral ele-
ments of 2 mm characteristic size, which represents the circumference of the wire cross-
section (10 mm diameter) with 16 nodes in its contour. This results in 6000 nodes per turn,
54,000 nodes for the complete spring and 162,000 degrees of freedom.
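The mesh figures quoted above are mutually consistent, as the short check below shows. The number of turns is inferred here from the node counts and is not stated in the text; three translational degrees of freedom per node are assumed, as is usual for eight-node solid elements.

```python
# Node counts quoted in the text.
nodes_per_turn = 6_000
total_nodes = 54_000
dof_per_node = 3          # translations in x, y and z (assumption)

turns = total_nodes // nodes_per_turn      # inferred number of meshed turns
total_dof = total_nodes * dof_per_node     # degrees of freedom of the model

print(turns, total_dof)
```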
The meshing strategy used consists of an extrusion method from the two-dimensional
meshing (with quadrilaterals) of the cross-section, which guarantees a regular meshing
that avoids geometric distortions of the elements and achieves a low bandwidth for the
stiffness matrix, resulting in a better performance of the iterative methods of resolution.
The SOLID185 element has been selected in its homogeneous form, which is frequently
used to model three-dimensional structures in ANSYS. The element consists of eight nodes
with three degrees of freedom per node: translations in the three directions of space.
SOLID185 uses the reduced selective integration method.
The material has been modeled linearly, with steel properties considering Young’s
modulus and Poisson’s ratio.
The problem has been solved using the Lanczos method for the determination
of eigenfrequencies and eigenmodes.
In free-free solid conditions, the different natural frequencies of one spring turn
belonging to a helical spring, with different helix diameters and wire thickness have been
calculated. In all cases, the pitch of the helix has always been the same.
In all cases, the corresponding oscillation modes are studied. Table 2 shows the results.
Columns 4 to 9 show the frequency and the mode of vibration. Essentially, the first mode
corresponds to an opening between ends, such that the original helix is transformed into
an ellipsoidal helix (Figure 8a). In this case, the predominant deformation of the helix
is bending. The second mode corresponds to the relative motion between these ends
(Figure 8b) but preserves the circumference of the helix, which corresponds to a mode in
which the torsional effect of the helix is more predominant than its bending. In this second
case, the pitch of the coil would increase significantly.
Table 2. FEM calculation of the first natural frequencies (Hz) of helical spring.
From these two basic modes, the deformation profiles are reproduced with increasing
numbers of inflection points. The third (Figure 8c) and fifth (Figure 8e) modes constitute double bending
and triple bending, respectively, always without leaving the imaginary plane that would
contain the coil if it were a planar ring rather than a helix, but distorting the circumference
that defines it. The fourth (Figure 8d) and sixth (Figure 8f) modes constitute double torsion
(with one inflection point) and triple torsion (with two inflection points), respectively.
If we continue to search for natural modes at increasing frequencies, increasingly
complex deformation profiles with small displacements and numerous inflection points appear.
Figure 8. Modes of vibration of a spring: (a) Bending 1; (b) Torsional 1; (c) Bending 2; (d) Torsional 2;
(e) Bending 3; (f) Torsional 3.
With respect to the results shown in Table 2, it can be deduced that, in the case of
the first resonance mode (bending), keeping the helix diameter constant and varying the
wire thickness, the resonance frequency increases with increasing thickness. This is logical
because the stiffness of the spring increases. Figure 9 shows this trend. Moreover, as
shown in Figure 9, as the diameter of the helix increases, the resonance frequency decreases.
Despite maintaining the stiffness, the loop flexes with lower frequency values. A similar
situation occurs with the first torsional mode.
[Figure 9 plot: frequency (Hz) versus wire diameter d (mm); curves for the Bending 1 and Torsional 1 modes at D = 80, 108 and 220 mm.]
Proceeding analogously for the first bending mode, the wire thickness is now kept
constant, and the average diameter of the helix is varied (Figure 10). The frequencies
decrease as the helix diameter increases. Moreover, as the wire diameter increases, the
resonant frequency increases. The situation is repeated for the first torsional mode of
vibration. It is a situation analogous to that of the previous paragraph.
[Figure 10 plot: frequency (Hz) versus helix diameter (mm); curves for the Bending 1 and Torsional 1 modes at d = 18, 22 and 28 mm.]
Figure 10. Sensitivity of the first resonance modes to the helix diameter, D.
Analyzing the variation of the frequency as a function of the average
diameter of the helix, it is observed (Figure 10) that the frequencies of the first bending and torsional modes
of the spring decrease as the mass increases due to a larger helix diameter. This is
related to the wire diameter/helix diameter ratio, and it can be seen that there is a certain
characteristic value of this ratio that linearizes the resonant behavior.
As shown in Figure 9, the relationship between the frequencies and the wire diameter (d) is
linear. This result is consistent with Equation (3).
It has also been considered whether the pitch of the spring can be an influential design
parameter when calculating resonances. Figure 11 shows that the pitch variation does not
affect the frequencies of any of the modes obtained and discussed above.
[Plot: frequency (Hz) versus pitch (mm); series: Bending 1–3 and Torsional 1–3.]
Figure 11. Sensitivity of the first resonance modes to the pitch of the helix.
Table 3 includes a comparison for the first harmonic of the results obtained with the
finite element simulation and the numerical model presented in this paper. The results of
the numerical model are obtained by solving Equation (21) for ω, with the wavenumber k
calculated with Equation (22b) or Equation (25). In general, it is observed that the numerical
model overestimates the frequency values with an average difference of 3%.
Table 3. Comparison of the numerical model with FEM for the first harmonic (Hz).
4. Conclusions
The study of a single coil has been proposed since, in general, the authors who
propose equations to model the vibration behavior of a spring consider the spring as a
whole, including its ends, but do not analyze the behavior of the coil itself. From the
analysis of a single coil, the singular behavior of the spring known as “effort de chasse”
described in the introduction could be justified.
For the study of the vibratory behavior of a spring, an analytical model has been
developed based on a reference system consisting of a Frenet trihedron whose tangent
vector follows the midline of the coil. One of the results obtained from the model is the dispersion
relation that relates the wavenumber k to the oscillation frequency, ω = f(k). Applying the
appropriate boundary conditions, the dispersion relation allows us to calculate the normal
modes of oscillation.
Another of the results obtained is that there is a value of the wavenumber,
$k = \sqrt{\Omega_1^2 + \Omega_3^2}$, for which the roots of the dispersion equation cancel out, where Ω1
and Ω3 are the torsional curvature and the bending curvature, respectively.
Two clear modes of oscillation have been characterized, one in which the coil moves
according to the torsion of the cross-section increasing the pitch of the coil, and another
bending mode in which it moves by making an opening of its ends so that the original helix
is transformed into an ellipsoidal helix.
A spring has been modeled by finite elements with ANSYS® software in order to
contrast results with the numerical model, obtaining an average difference of 3% in the
fundamental oscillation frequency.
Author Contributions: Individual contributions are as follows: original concept: L.P., A.Q., A.M.G.A.
and V.D.; mathematical development: L.P. and A.M.G.A.; implementation, simulation and draft
preparation: L.P., A.M.G.A., A.Q. and V.D.; review, editing, validation, and formal analysis: L.P., A.Q.,
A.M.G.A. and V.D. All authors have read and agreed to the published version of the manuscript.
Funding: This research received no external funding.
Institutional Review Board Statement: Not applicable.
Informed Consent Statement: Not applicable.
Data Availability Statement: Not applicable.
Conflicts of Interest: The authors declare no conflict of interest.
Appendix A
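In this appendix, the expressions for the calculation of the coefficients of Equation (21)
are indicated. Equation (A1a–g) contains the expressions for the calculation of $m_i$:
$$m_1 = \varphi_1 \cdot \varphi_5 \tag{A1a}$$
$$\varphi_1 = \lambda_1 \lambda_2 \lambda_3 \tag{A2a}$$
$$\varphi_2 = -\left[\lambda_2 \lambda_3 a_4 + \lambda_1 \lambda_2 a_6 + \lambda_1 \lambda_3 a_5\right] \tag{A2b}$$
$$\varphi_3 = \lambda_1 (a_5 a_6 - c_3) + \lambda_2 (a_4 a_6 - c_5) + \lambda_3 (a_4 a_5 - c_4) \tag{A2c}$$
$$\varphi_4 = c_4 a_6 + c_5 a_5 - a_4 (a_5 a_6 - c_3) + b_5 \tag{A2d}$$
$$\varphi_5 = \lambda^3 G_1^* G_2^* E^* \tag{A2e}$$
$$\varphi_6 = -\lambda^2 \left\{ G_1^* G_2^* a_3 + E^* (G_1^* a_1 + G_2^* a_2) \right\} \tag{A2f}$$
$$\varphi_7 = \lambda \left\{ a_3 \left[G_1^* a_1 + G_2^* a_2\right] + E^* \left[a_2 a_1 - c_6\right] + G_2^* c_2 - G_1^* c_1 \right\} \tag{A2g}$$
$$\varphi_8 = -a_3 (a_1 a_2 - c_6) + c_1 a_2 - c_2 a_1 + b_6 \tag{A2h}$$
$$\varphi_9 = -\lambda^3 E^* \lambda_1 (\lambda_3 G_1^* + \lambda_2 G_2^*) \tag{A2i}$$
$$\varphi_{10} = \lambda^2 \left\{ \lambda_1 \lambda_3 \left(E^* a_2 + G_1^* a_3\right) + \lambda E^* G_1^* (\lambda_1 a_6 + \lambda_3 a_4) + \lambda_1 \lambda_2 (E^* a_1 + G_2^* a_3) + \lambda E^* G_2^* (\lambda_2 a_4 + \lambda_1 a_5) + \lambda \lambda_1 E^* \right\} \tag{A2j}$$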
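$$\varphi_{11} = \lambda \left\{ \lambda^2 E^* G_1^* (c_5 - a_4 a_6) + G_2^* (c_4 - a_4 a_5) - \lambda \left[ \left(E^* a_2 + G_1^* a_3\right)(\lambda_1 a_6 + \lambda_3 a_4) + (E^* a_1 + G_2^* a_3)(\lambda_2 a_4 + \lambda_1 a_5) \right] - \lambda_1 \left[ \lambda_2 (a_1 a_3 - c_1) + \lambda_3 (a_2 a_3 + c_2) \right] + \lambda \left[ \lambda_1 (E^* b_1 - a_3) - E^* a_4 \right] \right\} \tag{A2k}$$
$$\varphi_{12} = \lambda \left\{ (a_2 a_3 + c_2)(\lambda_1 a_6 + \lambda_3 a_4) + (\lambda_2 a_4 + \lambda_1 a_5)(a_1 a_3 - c_1) - \lambda_1 (b_3 + a_3 b_1) - \lambda \left[ (E^* a_1 + G_2^* a_3)(c_4 - a_4 a_5) + \left(E^* a_2 + G_1^* a_3\right)(c_5 - a_4 a_6) - \lambda a_3 a_4 + \lambda E^* (a_4 b_1 + b_2) \right] \right\} \tag{A2l}$$
$$\varphi_{13} = \lambda \left[ (a_2 a_3 + c_2)(c_5 - a_4 a_6) + (a_1 a_3 - c_1)(c_4 - a_4 a_5) + a_3 (b_1 a_4 + b_2) + a_4 b_3 + b_4 \right] \tag{A2m}$$
$$a_1 = k^2 + \beta_7 \tag{A3a}$$
$$a_2 = k^2 + \beta_8 \tag{A3b}$$
$$a_3 = k^2 + \beta_9 \tag{A3c}$$
$$a_4 = k^2 I_1^* + \alpha_4 \tag{A3d}$$
$$a_5 = k^2 I_2^* + \alpha_5 \tag{A3e}$$
$$a_6 = k^2 I_3^* + \alpha_6 \tag{A3f}$$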
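With Equation (A4a–f) for the calculation of the parameters $b_i$:
$$b_1 = 2\left[\beta_6^2 I_1^* - 2k^2 \beta_1 (I_2^* + I_3^*)\right] \tag{A4a}$$
$$b_2 = 2\beta_5 I_2^* \left[\beta_4 \beta_6 I_3^* + 2k^2 \beta_5 (I_1^* + I_2^*)\right] - 2k^2 \alpha_1 \alpha_2 \left[\beta_6 - 2\beta_4^2 I_3^* (I_1^* + I_3^*)\right] \tag{A4b}$$
$$b_3 = 2\beta_5 \left[\beta_4 \beta_6 I_1^* + 2k^2 \beta_5 (I_2^* + I_3^*)\right] - 4k^2 \left[2\beta_6^2 I_1^* - \beta_4^2 (I_2^* + I_3^*)\right] \tag{A4c}$$
$$b_4 = 2\left(\beta_4 \beta_5 I_2^* I_3^* - k^2 \alpha_1 \alpha_2\right)\left(\beta_4 \beta_5 - 4k^2 \beta_6\right) - 2k^2 \left[ \beta_5 I_2^* \left(I_1^* + I_2^*\right)(\beta_5 \beta_3 + \beta_6 \beta_4) + \beta_4 I_3^* \left(I_1^* + I_3^*\right)(\beta_5 \beta_6 + \beta_2 \beta_4) \right] \tag{A4d}$$
$$b_5 = 2\beta_5 I_2^* \beta_6 \beta_4 I_1^* I_3^* - k^2 \alpha_1 \alpha_3 - 2k^2 \alpha_2 (\alpha_1 \beta_6 I_1^* + \alpha_3 \beta_4 I_3^*) \tag{A4e}$$
$$b_6 = 2\beta_5 \beta_4 \beta_6 - 8k^2 \beta_4^2 + \beta_5^2 + \beta_6^2 \tag{A4f}$$
$$c_1 = \beta_1 \beta_3 + 4k^2 \beta_2 \tag{A5a}$$
$$c_2 = -4k^2 \beta_3 + \beta_4^2 \tag{A5b}$$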
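$$\beta_1 = \Omega_1^2 \tag{A6a}$$
$$\beta_2 = \Omega_2^2 \tag{A6b}$$
$$\beta_3 = \Omega_3^2 \tag{A6c}$$
$$\beta_4 = \Omega_1 \Omega_2 \tag{A6d}$$
$$\beta_5 = \Omega_1 \Omega_3 \tag{A6e}$$
$$\beta_6 = \Omega_2 \Omega_3 \tag{A6f}$$
$$\beta_7 = \beta_1 + \beta_2 \tag{A6g}$$
$$\beta_8 = \beta_1 + \beta_3 \tag{A6h}$$
$$\beta_9 = \beta_2 + \beta_3 \tag{A6i}$$
where Equation (A7a–f) indicates the calculation of the parameters $\alpha_i$
$$\lambda_1 = \frac{\lambda I_1}{A} \tag{A8a}$$
$$\lambda_2 = \frac{\lambda I_2}{A} \tag{A8b}$$
$$\lambda_3 = \frac{\lambda I_3}{A} \tag{A8c}$$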
References
1. Wahl, A.M. Mechanical Springs, 2nd ed.; McGraw-Hill: New York, NY, USA, 1963.
2. Shigley, J.E. Mechanical Engineering Design, 10th ed.; McGraw-Hill: New York, NY, USA, 2015.
3. Kobelev, V. Durability of Springs; Springer: Berlin/Heidelberg, Germany, 2018.
4. Love, A.E.H. A Treatise on the Mathematical Theory of Elasticity, 4th ed.; Dover Publications: Mineola, NY, USA, 1927.
5. Stokes, V.K. On the dynamic radial expansion of helical springs due to longitudinal impact. J. Sound Vib. 1974, 35, 77–99.
6. Gironnet, B.; Louradour, G. Comportement Dynamique des Ressorts; Techniques de l’Ingénieur: Paris, France, 1983.
7. Yildirim, V. An efficient numerical method for predicting the natural frequencies of cylindrical helical springs. Int. J. Mech. Sci. 1999, 41, 919–939.
8. Lee, J.; Thompson, D.J. Dynamic stiffness formulation, free vibration and wave motion of helical springs. J. Sound Vib. 2001, 239, 297–320.
9. Lee, L. Free vibration analysis of cylindrical helical springs by the pseudo spectral method. J. Sound Vib. 2007, 302, 185–196.
10. Becker, L.E.; Chassie, G.G.; Cleghorn, W.L. On the natural frequencies of helical compression springs. Int. J. Mech. Sci. 2002, 44, 825–841.
11. Jiang, W.; Wang, T.L.; Jones, W.K. The forced vibration of helical spring. Int. J. Mech. Sci. 1992, 34, 549–562.
12. Haringx, J.A. On highly compressible helical springs and rubber rods, and their application for vibration-free mountings. Philips Res. Rep. 1949, 4, 49–80.
13. Den Hartog, J.P. Mechanical Vibrations; Dover Civil and Mechanical Engineering; Courier Corporation: Chelmsford, MA, USA, 1956.
14. Díaz, V. Automóviles y Ferrocarriles; Universidad Nacional de Educación a Distancia: Madrid, Spain, 2012; p. 287, ISBN 978-84-362-6568-2.
15. Sauvage, G. Determining the Characteristics of Helical Springs: A Simplification for Application in Railway Suspensions. Veh. Syst. Dyn. Int. J. Veh. Mech. Mobil. 1984, 13, 43–59.
16. Campedelli, J. Modelisation Globale Statique des Systemes Mecaniques Hyperstatiques Pre-Charges Application a un Bogie Moteur. Ph.D. Thesis, INSA Lyon, Lyon, France, 2002.
17. UNE-EN 13298; Railway Applications. Suspension Components. Helical Suspension Springs, Steel. The European Committee for Standardisation: Brussels, Belgium, 2003.
18. Kobelev, V. Effect of static axial compression on the natural frequencies of helical springs. Multidiscip. Model. Mater. Struct. 2014, 10, 379.
19. Yildirim, V. Axial Static Load Dependence Free Vibration Analysis of Helical Springs Based on the Theory of Spatially Curved Bars. Lat. Am. J. Solids Struct. 2016, 13, 2852–2875.
20. Kato, H.; Suzuki, H. Nonlinear deflection analysis of helical spring in elastic-perfect plastic material: Application to the plastic extension of piano wire spring. Mech. Mater. 2021, 160, 103971.
21. Timoshenko, S.P. On the correction for shear of the differential equation for transverse vibrations of prismatic bars. Philos. Mag. 1921, 41, 744–746.
22. Rayleigh, J.W.S. The Theory of Sound; Dover Publications: Mineola, NY, USA, 1945.
23. Pearson, D. Modelling the ends of compression helical springs for vibration calculations. Proc. Inst. Mech. Eng. 1986, 200, 3–11.
mathematics
Article
Fault Classification in a Reciprocating Compressor and
a Centrifugal Pump Using Non-Linear Entropy Features
Ruben Medina 1,*, Mariela Cerrada 2, Shuai Yang 3, Diego Cabrera 2, Edgar Estupiñan 4 and René-Vinicio Sánchez 2,*
Abstract: This paper describes a comparison of three types of feature sets. The feature sets were
intended to classify 13 faults in a centrifugal pump (CP) and 17 valve faults in a reciprocating
compressor (RC). The first set comprised 14 non-linear entropy-based features, the second comprised
15 information-based entropy features, and the third comprised 12 statistical features. The
classification was performed using random forest (RF) models and support vector machines (SVM).
The experimental work showed that the combination of information-based features with non-linear
entropy-based features provides an accuracy that is statistically significantly higher than that provided
by the Statistical Features set. Results for classifying the 13 conditions in the CP using non-linear
entropy features showed accuracies of up to 99.50%. The same feature set provided a classification
accuracy of 97.50% for the classification of the 17 conditions in the RC.
Keywords: approximate entropy; non-linear systems; phase space reconstruction; fault classification;
random forest; support vector machines
MSC: 28D20
Citation: Medina, R.; Cerrada, M.; Yang, S.; Cabrera, D.; Estupiñan, E.; Sánchez, R.-V. Fault Classification in
a Reciprocating Compressor and a Centrifugal Pump Using Non-Linear Entropy Features.
Mathematics 2022, 10, 3033. https://doi.org/10.3390/math10173033
Academic Editor: Andrey Jivkov
Received: 30 June 2022
Accepted: 17 August 2022
Published: 23 August 2022
Publisher’s Note: MDPI stays neutral with regard to jurisdictional claims in published maps and
institutional affiliations.
Copyright: © 2022 by the authors. Licensee MDPI, Basel, Switzerland. This article is an open access article
distributed under the terms and conditions of the Creative Commons Attribution (CC BY) license
(https://creativecommons.org/licenses/by/4.0/).
1. Introduction
In industrial applications, centrifugal pumps are equipment for transferring energy to
fluids to enable pipeline transportation. Similarly, reciprocating compressors are essential
in petrochemical plants and refineries for gas and other fluid transportation. Reliable
functioning of these types of equipment is essential in industry to avoid unexpected
halting of processes and economic losses. Consequently, several condition monitoring (CM)
methods have been proposed for diagnosing faults in centrifugal pumps and reciprocating
compressors [1–3].
The CM methods are based on sensing several variables from the target equipment,
such as electrical current, vibration, sound, temperature, or acoustic emission. Some of
the most common variables are vibration signals recorded using accelerometer sensors.
Such a set of signals is processed for extracting features useful for fault detection and
classification. Traditional techniques for vibration signal analysis are divided into time,
frequency, and time-frequency methods. In general, signal processing techniques assume
that signals represent linear systems that comply with the stationarity and periodicity
conditions. Unfortunately, this assumption is not always met. In particular, in most rotating
machines, the vibration signals represent a non-linear system [4]. These constraints can be
efficiently handled using chaos theory and non-linear dynamics techniques.
In this work, we propose to use the approximate entropy (AppEn) [5,6] and variants,
such as the sample entropy (SampEn) [7,8] and fuzzy entropy (FuzzyEn) [9,10] for fault
classification. The emphasis of this research is feature extraction. The classification stage
can be performed using either classical or deep learning-based classifiers. However, as the
feature extraction is computationally expensive, selecting deep learning models would
impose an unnecessary cost. In consequence, we have selected two efficient classical models
corresponding to RF and the multi-class SVM [11,12]. The original contributions of this
research are the following:
• Extraction of a non-linear entropy-based feature set that provides high classification
accuracy using RF and SVM models. The accuracy attained by the SVM model trained
with the non-linear Entropy Features set was higher than the accuracy attained by a
CNN model trained with 2D spectrogram images for both the CP and the RC.
• Detailed comparison of three feature sets useful for classifying a large number of faults
in a CP and an RC. These are the non-linear Entropy Features, the Information Entropy,
and the Statistical Features sets. The first set is composed of the approximate entropy
and several variants, and the second set is composed of the combination of the wavelet
packet transform (WPT)-based features and the power spectrum entropy (PSE)-based
features. Finally, the third set is composed of classical time series statistical features.
• Comparison of the non-linear Entropy Features set with the All Features set, which corresponds
to the fusion of the three feature sets. These sets provided a classification accuracy of up to
99.59% for the CP and up to 97.90% for the RC. For the CP, 13 different conditions
were classified, and 17 valve conditions were classified for the RC.
The paper is organized as follows: in Section 3 the theoretical background is presented.
In Section 4, the test bed for acquiring the vibration signals from the CP and RC is presented.
In Sections 5 and 6, the methodology for extraction of the different sets of features is
described. The methodology for classifying faults using RF and SVM models based on
the extracted features is presented in Section 7. The results are described in Section 8, and
finally, the conclusion and future work are presented in Section 10.
2. Related Research
The related research is discussed in the following subsections. A compendium with
several publications is presented in Table A1 in terms of their main features. A brief review
of these publications is presented in Sections 2.1–2.3.
with the normalized permutation entropy and the Jensen–Shannon complexity of time
series. The authors showed the utility of this representation for improving the classification
accuracy of faults in roller bearing and gear applications using SVM models. A review
of entropy algorithms and their variants in RM is reported in [18,19]. The reviews briefly
introduced several entropy methods and their application in fault detection in RM.
correlation sequence. In the third stage, time, frequency, and time–frequency domain raw
hybrid features are extracted from baseline signal and vibration signals from faulty classes.
A new set of discriminant features is composed of the correlation coefficient between raw
hybrid feature pools. The combination of all extracted features is fed to an SVM model for
classification. The method is used to classify four conditions in a CP, attaining accuracy
of up to 98.4%. A method for fault classification in a CP based on vibration signals is
reported in [29]. A feature extraction pre-processing is used to obtain a time-frequency
representation using the continuous wavelet transform (CWT). The CWT is converted
to gray images fed to a CNN model with an adaptive learning rate. Accuracies of 100%
are obtained for classifying four different fault conditions. A fault classification method
of a multi-stage CP reported in [30] starts by selecting the fault-specific frequency band
and continues by extracting statistical features in time, frequency, and wavelet domains
from this band. The next step is to select a low dimensional features vector using the
informative ratio PCA. Four conditions in a multi-stage CP are then classified using KNN
models. The accuracy attained was 100%. Fault detection and classification for water
pump bearings based on features extracted from the instantaneous power spectrum is
reported in [31]. The instantaneous power spectrum (IPS) was obtained from the measured voltage
and current, and several features were extracted from the IPS for classifying
three different conditions at different load levels. The classification was performed using
an extreme gradient boosting (XGB) model [32]. The application of CNN for classifying
faults in a CP based on acoustic images is reported in [33]. In this application, the sound
signals were acquired from a CP test rig where five different health conditions were
implemented. The sound signals were converted into acoustic images using the analytical
wavelet transform (AWT). The acoustic images were fed to the CNN for classification.
The attained accuracy was 100%.
3. Theoretical Background
3.1. Phase Space Reconstruction
The phase space reconstruction method is based on the embedding theorem of Takens,
reported in [47]. The theorem postulates that we can recover the equivalent dynamics of
a non-linear system by using time delays from a recorded time-domain signal. The ap-
proximate dynamics correspond to a 1D projection of the system trajectory. The theorem is
mainly applied to univariate time series [47]; however, the multivariate version has also
been reported in [48]. In this research, we are mainly concerned with univariate embedding.
The average mutual information (AMI) enables the selection of the time delay or lag
denoted τ [49]. The AMI is plotted versus τ. According to [50], the selection of τ could be
the location of the first local minimum of the AMI. The system’s embedding dimension,
denoted as m, could be selected by applying the false nearest neighbors (FNN) method, as
reported in [51].
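The reconstruction steps described above can be sketched as follows. This is a minimal illustration under stated assumptions, not the implementation used in this research: the function names are hypothetical, the AMI is estimated with a simple equal-width histogram (the bin count is an arbitrary choice), and the FNN step for selecting m is not included.

```python
import math
from collections import Counter


def delay_embed(x, m, tau):
    """Return the delay vectors [x[i], x[i+tau], ..., x[i+(m-1)*tau]]."""
    n = len(x) - (m - 1) * tau
    if n <= 0:
        raise ValueError("series too short for this m and tau")
    return [[x[i + j * tau] for j in range(m)] for i in range(n)]


def average_mutual_information(x, tau, bins=16):
    """Histogram estimate (in nats) of the mutual information between
    x[t] and x[t + tau]; requires tau >= 1."""
    if tau < 1 or tau >= len(x):
        raise ValueError("tau must satisfy 1 <= tau < len(x)")
    lo, hi = min(x), max(x)
    width = (hi - lo) / bins or 1.0  # avoid a zero width for constant series
    idx = [min(int((v - lo) / width), bins - 1) for v in x]
    a, b = idx[:-tau], idx[tau:]
    n = len(a)
    pa, pb, pab = Counter(a), Counter(b), Counter(zip(a, b))
    return sum(c / n * math.log(c * n / (pa[i] * pb[j]))
               for (i, j), c in pab.items())


def first_local_minimum(values):
    """Index of the first local minimum of a sequence, or None."""
    for i in range(1, len(values) - 1):
        if values[i] < values[i - 1] and values[i] <= values[i + 1]:
            return i
    return None
```

A possible usage is to evaluate `average_mutual_information(x, tau)` for `tau = 1, 2, ...`, take the lag at the first local minimum, and pass it to `delay_embed` together with the embedding dimension m obtained separately (for example, with an FNN routine).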
This research used the method reported in [57] for computing the PermEn. The method
is based on pre-calculating values for successive ordinal patterns of order m. This ap-
proach allows the calculation of PermEn in large time series using overlapped successive
time windows.
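For reference, the basic ordinal-pattern definition of the permutation entropy can be sketched as below. This is the plain counting approach, not the fast method of [57] with pre-calculated successive patterns; the function name and the default parameters are illustrative assumptions.

```python
import math
from collections import Counter


def permutation_entropy(x, m=3, tau=1, normalize=True):
    """Permutation entropy from ordinal patterns of order m and lag tau.

    Each window (x[i], x[i+tau], ..., x[i+(m-1)*tau]) is mapped to the
    permutation that sorts it; the entropy of the pattern frequencies is
    returned in nats, or divided by log(m!) when normalize is True.
    """
    n = len(x) - (m - 1) * tau
    if n <= 0:
        raise ValueError("series too short for this m and tau")
    patterns = Counter(
        tuple(sorted(range(m), key=lambda j: x[i + j * tau]))
        for i in range(n)
    )
    h = -sum(c / n * math.log(c / n) for c in patterns.values())
    return h / math.log(math.factorial(m)) if normalize else h
```

A monotonic series produces a single ordinal pattern and therefore zero entropy, whereas a series in which all m! patterns are equally frequent gives a normalized value of 1.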
The entropy concept is also used in classical thermodynamics in the context of heat
transfer [68]. Gibbs and Boltzmann attained a definition of entropy with an expression
similar to Equation (2) [69]. When dealing with discrete-time signals represented as a vector,
the entropy concept has also been applied to select the best basis for orthogonal wavelet
packets [70]. The authors showed that the assumption that the Karhunen–Loeve basis is
the best, even for a single vector which enables the calculation of several entropy features.
Consider the time-series x where the vector can be arranged in decreasing order such that
$x_i = 0$ for large i. Then, $p_i = |x_i|^2 \|x\|^{-2}$. Without loss of generality, the time-series can
be normalized so $\|x\|^{-2} = 1$. The following features can be estimated: the log energy
entropy, the Shannon information entropy, the norm entropy, the threshold entropy, and the
“sure” entropy.
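The five features listed above are not defined explicitly in this section. The sketch below assumes the definitions commonly used for wavelet packet entropies (as in the legacy Matlab wavelet entropy conventions); the function names, the exponent p, and the threshold eps are illustrative assumptions, and s denotes a vector of coefficients.

```python
import math


def shannon_entropy(s):
    """Assumed form: -sum(s_i^2 * log(s_i^2)), with 0*log(0) taken as 0."""
    return -sum(v * v * math.log(v * v) for v in s if v != 0)


def log_energy_entropy(s):
    """Assumed form: sum(log(s_i^2)), with log(0) taken as 0."""
    return sum(math.log(v * v) for v in s if v != 0)


def norm_entropy(s, p=1.5):
    """Assumed form: sum(|s_i|^p) with p >= 1."""
    return sum(abs(v) ** p for v in s)


def threshold_entropy(s, eps):
    """Assumed form: number of coefficients with |s_i| > eps."""
    return sum(1 for v in s if abs(v) > eps)


def sure_entropy(s, eps):
    """Assumed form: n - #{|s_i| <= eps} + sum(min(s_i^2, eps^2))."""
    n = len(s)
    small = sum(1 for v in s if abs(v) <= eps)
    return n - small + sum(min(v * v, eps * eps) for v in s)
```

For a normalized vector, the squared coefficients play the role of the probabilities p_i, so the Shannon form above reduces to the usual sum of -p_i log p_i.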
where K is a positive constant and q denotes any real number. Tsallis showed that when
q → 1, HTq tends to the classical Boltzmann–Gibbs–Shannon entropy represented by
Equation (2). Similarly, q is used in the Rényi entropy [72] to amplify probabilities as:
$$H_{Rq} = \frac{K}{q-1} \log \sum_{i=1}^{N} p_i^{q} \tag{4}$$
The PSE is calculated for each time frame and is a vector denoted as S. When the
signal has high complexity, the spectrum of the time series tends to be uniform, and the
PSE is high. In contrast, when the spectrum has a narrow peak, the PSE feature is lower,
and the signal is less complex.
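The behavior described above can be checked with a small sketch. It assumes that the PSE is the Shannon entropy of the power spectrum normalized to unit sum, computed here for a single frame with a naive DFT so that no external library is needed; the function names and the optional normalization by the logarithm of the number of bins are illustrative choices.

```python
import cmath
import math


def power_spectrum(x):
    """One-sided periodogram of a frame via a naive O(N^2) DFT.

    The mean is removed and the zero-frequency bin is skipped.
    """
    n = len(x)
    mean = sum(x) / n
    xc = [v - mean for v in x]
    return [
        abs(sum(xc[t] * cmath.exp(-2j * math.pi * k * t / n)
                for t in range(n))) ** 2
        for k in range(1, n // 2 + 1)
    ]


def power_spectrum_entropy(x, normalize=True):
    """Shannon entropy of the normalized power spectrum of one frame."""
    psd = power_spectrum(x)
    total = sum(psd)
    if total == 0:
        return 0.0
    p = [v / total for v in psd]
    h = -sum(v * math.log(v) for v in p if v > 0)
    return h / math.log(len(p)) if normalize else h
```

With this sketch, a pure tone with an integer number of cycles in the frame gives a value close to 0, while an impulse, whose spectrum is flat, gives a normalized value close to 1.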
4. Experimental Test-Bed
In this research, two different vibration signal datasets were considered. The first was
acquired from a CP and the second from an RC.
are shown in Figure 1. The signal acquisition was performed under controlled operating
conditions concerning the room temperature, relative humidity, and environmental noise.
Figure 1. CP and sensors. (a) Multi-stage vertical CP. (b) Locations of sensors. The vibration signals
were recorded using A1, A2, A3, and A4 accelerometers.
Six experimental conditions were considered related to the discharge pressure. The con-
ditions are denoted as Ci with i ranging from 1 to 6. The pressure started at 5.5 bar,
increasing by 1 bar for each condition. C1 corresponds to a discharge pressure of 5.5 bar.
Table 1. Cont.
Figure 2. Sensors’ locations in the RC. The accelerometers are denoted as A1, A2, A3, and A4.
8. Results
8.1. Results for the CP Dataset
The set of fault conditions for the CP included several severity levels in the impeller
(see Table 1). Examples of the vibration signals for the CP dataset are shown in Figure 3.
Four conditions are shown, and in this subset, only subtle differences in signal amplitude
can be visually detected. Any fault signature could be affected by noise.
In Figure 4, the AppEn for four vibration signals of the CP dataset is shown. Each
of the vibration signals represents a fault condition. The utility of non-linear features for
detecting fault conditions in vibration signals is shown in the following paragraphs. The
feature estimation in each signal starts by performing the phase space reconstruction [47]
aimed at estimating the system dimension m and the lag τ. These parameters are used
for calculating the AppEn for several values of r. The AppEn for the CP signals increases
with r until attaining a maximum and then decreases as r increases further. In this example,
the differences between the impeller conditions are evident.
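The calculation of the AppEn for given m, τ, and r can be sketched as follows. This is a direct O(N²) implementation of the standard approximate entropy definition with the Chebyshev distance and self-matches included, not the Matlab code used in this research; the function names and default values are assumptions, and r is taken as an absolute tolerance.

```python
import math


def _phi(x, m, r, tau):
    """Average log-frequency of delay vectors within tolerance r."""
    n = len(x) - (m - 1) * tau
    if n <= 0:
        raise ValueError("series too short for this m and tau")
    vecs = [[x[i + j * tau] for j in range(m)] for i in range(n)]
    total = 0.0
    for u in vecs:
        # Count vectors (including u itself) within Chebyshev distance r.
        count = sum(
            1 for v in vecs
            if max(abs(a - b) for a, b in zip(u, v)) <= r
        )
        total += math.log(count / n)
    return total / n


def approximate_entropy(x, m=2, r=0.2, tau=1):
    """AppEn(m, r, tau) = phi_m(r) - phi_{m+1}(r)."""
    return _phi(x, m, r, tau) - _phi(x, m + 1, r, tau)
```

Curves such as those in Figure 4 could then be obtained by evaluating `approximate_entropy` over a grid of r values, with m and τ fixed by the phase space reconstruction.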
The LLE is calculated using four vibration signals extracted from the CP dataset.
The calculation procedure was explained in Section 3.9. Results concerning the LLE are
shown in Figure 5. The LLE is negative for the healthy class, class P6, and class P13.
However, it is positive for class P10. A positive LLE could be related to
the presence of chaos. When the LLE is negative, the time series
could represent a system with periodic dynamics, as suggested in [65].
A non-integer dimension usually characterizes non-linear systems that could have
a chaotic nature. Generally, when dealing with chaotic systems, the Rényi dimension is used
as a feature of strange attractors that enables the estimation of several feature dimensions.
The D0 is the fractal dimension, D1 is the information dimension, and D2 is the CD [81].
As with the AppEn estimation, the first step corresponds to the phase space reconstruction that
provides the embedding dimension m. Parameter m is the upper bound of the CD [82].
The CD feature was estimated for several vibration signals of the CP dataset. The results
are shown in Figure 6. This set of signals has estimated embedding dimensions of 5 for P1
and P6, and 4 for P10 and P13. The CD is lower than 5 and higher than 4 for the signals
from classes P1 and P6. Similarly, for classes P10 and P13, the CD values are higher than 3
and lower than 4.
Figure 3. Vibration signals from the CP dataset. (a) Signal from class P1. (b) Signal from class P6.
(c) Signal from class P10, (d) Signal from class P13.
Figure 4. AppEn for several signals from the CP dataset. (a) AppEn for a signal from class P1,
(b) AppEn for a signal from class P6, (c) AppEn for a signal from class P10, (d) AppEn for a signal from
class P13.
Figure 5. LLE for several signals from the CP dataset. (a) LLE for class P1, (b) LLE for class P6,
(c) LLE for class P10, (d) LLE for class P13.
Figure 6. CD for several signals from the CP dataset. (a) CD for a signal from class P1, (b) CD for a
signal from class P6, (c) CD for a signal from class P10, (d) CD for a signal from class P13.
The percentages of accuracy for classifying the faults in the CP dataset using
RF and SVM are presented in Table 3. The accuracy is presented for each of the models and
feature sets considered. The vibration signal channels A2, A3, and A4 were considered
for feature extraction. Vibration signal channel A1 was not considered for two reasons.
Firstly, the computational time of the feature extraction is high, and to keep the total feature
extraction time feasible with the available hardware, we only considered three channels.
Secondly, we selected the channels close to the mechanical connection with the motor,
where a roller bearing is located, because the pump stages in this neighborhood have
a higher probability of failing.
Table 3. Percentages of accuracy for fault classification using the vibration signals of the CP dataset.
Features           Model   A2      A3      A4
Statistical        RF      67.76   68.34   68.50
                   SVM     68.50   70.51   71.03
Entropy-based      RF      98.09   97.83   94.70
                   SVM     99.50   98.96   97.30
Info-Statistical   RF      84.60   84.80   93.81
                   SVM     86.26   86.96   96.32
All Features       RF      97.76   99.38   99.37
                   SVM     99.59   99.69   99.81
The lowest classification accuracy was attained using RF trained with the Statistical
Features set. In this case, the accuracy attained from channel A2 was 67.76%, for channel A3
was 68.34%, and for channel A4 was 68.50%. The accuracy attained using SVM with this
set of features was 68.5% for channel A2, 70.51% for channel A3, and 71.03% for channel
A4. The highest accuracy was 99.59% for channel A2, 99.69% for channel A3, and 99.81%
for channel A4. This accuracy was attained with the All Features set and SVM. With this
set of features, using the RF model, the accuracy was 97.76% for channel A2, 99.38% for
channel A3, and 99.37% for channel A4.
A comparison of accuracy considering both machine learning models and feature sets
is presented in Figure 7. In the box plot, a red line represents the median of the data that
divides the box or inter-quartile range in two parts. This box represents 50% of the data.
The horizontal line over the box represents the upper quartile that indicates that 75% of
the data have values below this quartile. The horizontal line below the box represents
the lower quartile that indicates that 25% of the data have values below this quartile. The
red cross below or over the box represents the outliers. The accuracy is presented for
channel A2 in box plots calculated based on the ten repetitions performed for obtaining ten
different cross-validated models. The highest accuracy (99.50% and 99.59% in Table 3) was
attained based on the Entropy Features and the complete set of features, All Features, with
the SVM model. The lower accuracies (67.76% and 68.50%) for the RF and SVM models
were attained by considering the Statistical Features set. The combination of information
entropy and statistical features (InfoStat Features) provided intermediate accuracy values
(84.60% and 86.26%) for both considered models.
Table 4 presents the results of several performance metrics [79,80,83], expressed as percentages.
The included metrics are the sensitivity, the specificity, the error, the false positive
rate, and the area under the curve (AUC). The metrics were calculated from the
cross-validation results of the SVM model trained with the All Features set extracted from vibration
signals recorded in channel A2 of the CP. The highest sensitivity of 100.00% was
attained by classes P1, P6, P12, and P13. In contrast, the lowest sensitivity of 98.67% was
attained by class P9. The highest specificity of 100% was attained by classes P1, P3, P5,
and P12. The lowest specificity of 99.87% was attained by classes P4 and P8. Concerning
the AUC [84], the highest value of 100% was attained by classes P1 and P12, and the lowest
AUC value was 99.19%.
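Per-class metrics of this kind are usually derived from a multi-class confusion matrix in a one-versus-rest fashion. The sketch below illustrates that calculation under stated assumptions: the function name is hypothetical, the per-class error is taken as the one-versus-rest misclassification rate, and the AUC is omitted because it requires classifier scores rather than hard labels.

```python
def per_class_metrics(cm):
    """One-vs-rest metrics (in percent) from a confusion matrix.

    cm[i][j] is the number of samples of true class i predicted as class j.
    Returns one dictionary per class.
    """
    n_classes = len(cm)
    total = sum(sum(row) for row in cm)
    results = []
    for k in range(n_classes):
        tp = cm[k][k]
        fn = sum(cm[k]) - tp
        fp = sum(cm[i][k] for i in range(n_classes)) - tp
        tn = total - tp - fn - fp
        results.append({
            "sensitivity": 100.0 * tp / (tp + fn) if tp + fn else 0.0,
            "specificity": 100.0 * tn / (tn + fp) if tn + fp else 0.0,
            "false_positive_rate": 100.0 * fp / (fp + tn) if fp + tn else 0.0,
            "error": 100.0 * (fp + fn) / total if total else 0.0,
        })
    return results


if __name__ == "__main__":
    # Small three-class example with 10 samples per class.
    example = [[9, 1, 0], [0, 10, 0], [1, 0, 9]]
    for label, metrics in enumerate(per_class_metrics(example)):
        print(label, metrics)
```

In a cross-validation setting, the confusion matrices of the folds can be summed before calling the function, or the metrics can be computed per fold and then averaged.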
Figure 7. Comparison of accuracy for the machine learning models tested, considering several
combinations of features from channel A2 acquired from the CP.
Table 4. Performance metrics (in percent) obtained with the SVM model and the All Features set.
The model was trained using vibration signals in channel A2 from the CP dataset.
The calculation of the LLE for the set of vibration signals selected from the RC dataset
is shown in Figure 10. Although the LLE value is different for each signal presented, all
the estimated values are positive, suggesting that the system is chaotic [65]. The CD is
presented in Figure 11 for the set of signals of the RC. The CD for these signals has different
values according to the fault type. The estimated values of CD are close to the embedding
dimension [82] (four, calculated using the phase space reconstruction method).
Figure 8. Vibration signals from the RC dataset. (a) Signal from the class P1, (b) signal from class P6,
(c) signal from class P11, (d) signal from class P16.
Figure 9. AppEn for several signals from the RC dataset. (a) AppEn for a signal from the class P1,
(b) AppEn for a signal from the class P6, (c) AppEn for a signal from the class P11, (d) AppEn for a signal
from the class P16.
Figure 10. LLE for several signals from the RC dataset. (a) LLE for class P1, (b) LLE for class P6,
(c) LLE for a signal from the class P11, (d) LLE for class P16.
Figure 11. CD for several signals from the RC dataset. (a) CD for a signal from the class P1, (b) CD for a
signal from the class P6, (c) CD for a signal from the class P11, (d) CD for a signal from the class P16.
Concerning the RC, we selected three vibration channels from the four available.
The selected channels were A1, A2, and A3. We excluded A4 to save computational time
because A4 was far from the valve’s location. The percent accuracy attained using RF and
SVM is presented in Table 5. Four feature sets are considered: the first is the Statistical
Features set, and the second is the Entropy Features. The third is the InfoStat Features set.
Finally, we have the All Features set. The lowest accuracy was attained with the Statistical
Features set. In this case, the accuracy attained by the RF model was 83.96% for vibration
signal channel A1, 74.07% for A2, and 72.33% for A3. The accuracy attained by the SVM
model was 86.91% for vibration signal channel A1, 76.91% for A2, and 75.55% for A3.
The highest accuracy was attained using the All Features set. The accuracy attained by the
RF model was 95.35% for vibration signal channel A1, 94.40% for A2, and 88.82% for A3.
The SVM model attained a classification accuracy of 97.90% for signal channel A1, 95.47%
for A2 and 93.63% for A3. The rest of the feature combinations attained intermediate
accuracy close to the optimal accuracy.
Table 5. Percentage accuracy for fault classification using the vibration signals of the RC dataset.
Features           Model   A1      A2      A3
Statistical        RF      83.96   74.07   72.33
                   SVM     86.91   76.55   75.55
Entropy-based      RF      95.35   94.12   86.21
                   SVM     97.57   94.55   91.87
Info-Statistical   RF      93.96   90.66   85.78
                   SVM     96.83   94.27   90.38
All Features       RF      95.35   94.40   88.82
                   SVM     97.90   95.47   93.63
Figure 12. Comparison of accuracy for the machine learning models tested, considering several
combinations of features extracted from channel A1 of the RC dataset. In the box plot, a red line
represents the median of the data. The horizontal line over the box represents the upper quartile. The
horizontal line below the box represents the lower quartile. The red cross, below or over the box,
represents the outliers.
Table 6. Performance metrics (in percent) obtained with the SVM model and the All Features set.
The model was trained using vibration signals in channel A1 from the RC dataset.
9. Discussion
The proposed methodology was implemented on a laptop computer equipped with an
Intel(R) Core(TM) i7-6700HQ CPU, 12 GB of RAM, and an NVIDIA GeForce GTX 950M
graphics card. Matlab was used to implement the feature extraction and classification.
Concerning the computational time, the Entropy Features set required an average of
311.85 s when extracted from the five windows (each with a length of 12,000 time samples)
of the vibration signals from the CP dataset. In contrast, the Information Entropy features
required 0.97 s, and the Statistical Features set required only 0.156 s. For the RC, when the
sampling rate and the window length are similar to those used for the CP, the computational
time is also similar.
In the case of the CP, multiple pairwise comparisons of machine learning models
are presented in Figure A1. A one-way analysis of variance by ranks was performed
using the Kruskal–Wallis approach [85]. The test showed no statistically significant
differences between the results attained by either machine learning model when trained
with the non-linear Entropy Features set or the All Features set. Results attained by the SVM
models trained with the non-linear Entropy Features set or the All Features set are statistically
significantly superior to those of RF or SVM models trained with the Statistical Features set
or the InfoStat Features set. The comparison also shows that the non-linear Entropy
Features set provides higher accuracy without being combined with the Statistical Features set.
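As an illustration of the rank-based test described above, the following minimal Python sketch computes the Kruskal–Wallis H statistic for several groups of accuracy values. It is not the Matlab implementation used in this work: the function names (average_ranks, kruskal_wallis_h, p_value_three_groups) and the example accuracies are hypothetical, the closed-form p-value is valid only for three groups (chi-square approximation with 2 degrees of freedom), and the pairwise post hoc comparison shown in Figure A1 is not covered.

```python
import math
from itertools import chain


def average_ranks(values):
    """Return 1-based ranks; tied values receive the mean of their ranks."""
    order = sorted(range(len(values)), key=lambda i: values[i])
    ranks = [0.0] * len(values)
    i = 0
    while i < len(order):
        j = i
        # Extend j over the run of values tied with position i
        while j + 1 < len(order) and values[order[j + 1]] == values[order[i]]:
            j += 1
        mean_rank = (i + j) / 2 + 1
        for k in range(i, j + 1):
            ranks[order[k]] = mean_rank
        i = j + 1
    return ranks


def kruskal_wallis_h(groups):
    """Kruskal-Wallis H statistic with the usual tie correction.

    Assumes the pooled data are not all identical (otherwise the
    tie correction factor is zero).
    """
    pooled = list(chain.from_iterable(groups))
    n = len(pooled)
    ranks = average_ranks(pooled)
    total = 0.0
    start = 0
    for g in groups:
        rank_sum = sum(ranks[start:start + len(g)])
        total += rank_sum ** 2 / len(g)
        start += len(g)
    h = 12.0 / (n * (n + 1)) * total - 3 * (n + 1)
    counts = {}
    for v in pooled:
        counts[v] = counts.get(v, 0) + 1
    correction = 1 - sum(t ** 3 - t for t in counts.values()) / (n ** 3 - n)
    return h / correction


def p_value_three_groups(h):
    """Chi-square survival function for 2 degrees of freedom (three groups)."""
    return math.exp(-h / 2)


# Hypothetical fold accuracies (percent) for three model/feature combinations
example_groups = [[97.1, 97.9, 98.3], [95.0, 95.6, 94.8], [86.2, 87.5, 85.9]]
example_h = kruskal_wallis_h(example_groups)
print(f"H = {example_h:.3f}, p = {p_value_three_groups(example_h):.4f}")
```

With only a few values per group the chi-square approximation is rough; in practice each group would hold the accuracies of all cross-validation folds.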
Table 7. Comparison of performances of the tested models obtained during the 10-fold cross-
validation between entropy features with classical models and the spectrogram with a CNN model.
Results can be compared to previous research that used non-linear entropy features
with classical models. In [27], the average accuracy for classifying five different conditions
in a CP was 94.58%. The work reported in [28] used an SVM for classifying four different
conditions in a CP using vibration signals, attaining an average accuracy of 98.4%. In [21],
the p−V diagram was used with LDA to classify six different conditions in an RC, attaining
an accuracy of 99.62%. Several papers reported CNN-based approaches for fault
classification in CPs and RCs [24,25,29,33]. These methods can classify a small set of faults
in a CP or an RC with high accuracy. In contrast, our method was able to classify 13 types of
faults in a CP and 17 different conditions in an RC with accuracies close to those of these
deep learning-based methods.
10. Conclusions
The non-linear Entropy Features set enables fault classification in a CP or an RC with
high accuracy. In this research, 13 impeller faults in a CP incorporating different degrees of
severity were classified using RF and SVM models. Additionally, a set of 17 valve faults in
an RC was classified.
The research showed that the non-linear Entropy Features set provides high accuracy
(99.50% for the CP and 97.57% for the RC) with the SVM model. The accuracy
attained using the Entropy Features set and the RF model was 98.09% for the CP and 95.35%
for the RC. Results showed no statistically significant differences between Entropy Features
and All Features. However, both feature sets provided accuracies statistically significantly
higher than the accuracy provided by the Statistical Features set. The Statistical Features set
provided the lowest accuracy in this application. In this case, the accuracy attained with
the SVM was 71.03% for the CP and 76.55% for the RC. Concerning the comparison of
machine learning models, when both models (RF and SVM) were trained with the Entropy
Features set, there were no statistically significant differences in the accuracy of results.
The results suggest that the Entropy Features set or the All Features set could be used for
fault classification in centrifugal pumps and reciprocating compressors, where multiple
fault types in impellers and valves should be diagnosed at early stages.
The novelty of the approach lies in using entropy features to classify faults in
centrifugal pumps and reciprocating compressors with high accuracy. In addition,
we showed that the entropy-based feature sets provide high classification accuracy even
for a large number of faults in both the CP and the RC. Concerning the comparison with
a deep learning model, the accuracies obtained by the SVM trained with
the Entropy Features set (99.50% for the CP and 97.57% for the RC) were superior to those
obtained by a CNN trained with the spectrogram (98.86% for the CP and 96.74% for the RC).
The limitation of the proposed approach is the high computational cost necessary for
extracting the entire set of entropy-based features.
Further research will focus on reducing the computational time required for extracting
the non-linear Entropy Features set and on exploring its implementation using multi-scale
approaches. We also plan to select some of the entropy-based features that would be useful
for anomaly detection in RM, combined with deep learning approaches, such as generative
adversarial networks and autoencoders.
Author Contributions: Conceptualization, R.-V.S., M.C. and R.M.; methodology, D.C. and M.C.;
software, R.M.; validation, R.M., E.E. and R.-V.S.; formal analysis, S.Y.; investigation, E.E.; resources,
S.Y.; data curation, D.C.; writing—original draft preparation, R.M.; writing—review and editing,
M.C. and R.-V.S.; visualization, D.C.; supervision, R.-V.S.; project administration, R.-V.S.; funding
acquisition, R.-V.S. All authors have read and agreed to the published version of the manuscript.
Funding: This research was funded by the MoST Science and Technology Partnership Program
(KY201802006) and National Research Base of Intelligent Manufacturing Service, Chongqing
Technology and Business University, and by the Universidad Politécnica Salesiana through the
GIDTEC research group.
Institutional Review Board Statement: Not applicable.
Informed Consent Statement: Not applicable.
Data Availability Statement: The data used in this research are available upon reasonable request to
the corresponding authors. We also plan to prepare a paper describing the datasets and submit it to
the MDPI journal Data to release them for public use.
Conflicts of Interest: The authors declare no conflict of interest.
Abbreviations
The following abbreviations are used in this manuscript:
CP Centrifugal Pump
RC Reciprocating Compressor
CM Condition Monitoring
CD Correlation Dimension
LLE Largest Lyapunov Exponent
LMD Local Mean Decomposition
SVM Support Vector Machine
CEEMD Complementary Ensemble Empirical Mode Decomposition
SampEn Sample Entropy
RF Random Forest
ELM Extreme Learning Machine
AMI Average Mutual Information
FNN False Nearest Neighbor
AppEn Approximate Entropy
FuzzyEn Fuzzy Entropy
ShannEn Shannon Entropy
PermEn Permutation Entropy
DFA Detrended Fluctuation Analysis
S Power Spectral Entropy
PEB Pitting at the Entrance of Impeller Blades
POB Pitting at the Output of the Impeller Blades
ICB Impeller Channel Blockage
IB Imbalance Impeller
ROC Receiver Operating Characteristic
AUC Area Under the Curve
HP Horse Power
RMS Root Mean Square
RPM Revolutions per minute
GPM Gallons per minute
RM Rotating Machinery
CNN Convolutional Neural Network
SGWT Spectral Graph Wavelet Transform
CECP Complexity-Entropy Causality Plane
MSE Multi-Scale Entropy
PCA Principal Component Analysis
LDA Linear Discriminant Analysis
BMFO Binary Moth Flame Optimization
SE Squeeze and Excitation
HYP Hydraulic Pump
RSDD Resonance-based sparse signal decomposition
MHAAPE Multi-scale Hierarchical Amplitude Aware Permutation Entropy
CWT Continuous Wavelet Transform
XBG Extreme Gradient Boosting
AWT Analytical Wavelet Transform
Feature Equation
ApEn [5,6]    $\mathrm{ApEn}(m, k, \tau, N) = \phi^{m}(r) - \phi^{m+1}(r), \quad k = 0.06$    (A1)
Feature Equation
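SampEn [7]    $\mathrm{SampEn}(m, r) = -\ln \dfrac{A^{m}(r)}{B^{m}(r)}, \quad k = 0.06$    (A5)
SampEn [7]    $\mathrm{SampEn}(m, r) = -\ln \dfrac{A^{m}(r)}{B^{m}(r)}, \quad k = 0.21$    (A6)
ShannonEn [54]    $\mathrm{ShanEn}(m) = -\sum_{m} p_m \log p_m$    (A7)
PermEn [57]    $\mathrm{PermEn}(m) = \ln M - \dfrac{1}{M} \sum_{j=0}^{(m+1)!-1} q_j \ln q_j$    (A10)
CorDim [59]    $\mathrm{CD}(m) = \lim_{r \to 0} \dfrac{\partial \ln C_m(r)}{\partial \ln r}$    (A11)
LLE [66]    $\mathrm{LLE} = \lim_{t \to \infty} \dfrac{1}{t} \ln \dfrac{\delta Z(t)}{\delta Z_0}$    (A12)
DFA1($\alpha_1$) [66]    $\alpha_1 = \tau(1), \quad \tau(1) = h(1) - 1$    (A13)
DFA2($\alpha_2$) [66]    $\alpha_2 = \tau(2), \quad \tau(2) = 2h(2) - 1$    (A14)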
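Equations (A5) and (A6) can be illustrated with a short Python sketch of sample entropy. This is only a sketch under stated assumptions, not the Matlab code used in this work: the tolerance is assumed to be r = k·σ (k times the standard deviation of the window), the embedding dimension default m = 2 is an assumption, templates are compared with the Chebyshev (maximum) distance using "less than or equal to r", and the function name sample_entropy is hypothetical.

```python
import math


def sample_entropy(x, m=2, k=0.06):
    """Sample entropy -ln(A/B) of the sequence x.

    B counts pairs of templates of length m within tolerance r,
    A counts pairs of templates of length m + 1 within tolerance r.
    Assumption: r = k * (population standard deviation of x).
    """
    n = len(x)
    mean = sum(x) / n
    sigma = math.sqrt(sum((v - mean) ** 2 for v in x) / n)
    r = k * sigma

    def count_matches(length):
        # The same n - m start positions are used for both template lengths,
        # and self-matches (i == j) are excluded.
        count = 0
        for i in range(n - m):
            for j in range(i + 1, n - m):
                dist = max(abs(x[i + t] - x[j + t]) for t in range(length))
                if dist <= r:
                    count += 1
        return count

    b = count_matches(m)
    a = count_matches(m + 1)
    if a == 0 or b == 0:
        # No matches: the estimate is undefined; report it as infinite.
        return float("inf")
    return -math.log(a / b)


# A perfectly periodic signal is fully predictable, so its sample entropy is 0.
periodic = [0.0, 1.0] * 10
print(sample_entropy(periodic, m=2, k=0.21))
```

The double loop is quadratic in the window length, so for long windows a vectorized or tree-based neighbour search would be preferable.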
Feature Equation
WPT-Shannon [70,86]    $\mathrm{WPT\text{-}Shannon}(s) = -\sum_{i} s_i^{2} \log\left(s_i^{2}\right)$    (A15)
WPT-Norm [70,86]    $\mathrm{WPT\text{-}Norm}(s) = -\sum_{i} |s_i|^{p}, \quad p \geq 1$    (A16)
Feature Equation
PSE-shape [73,74]    $\mathrm{PSE\text{-}shape} = \dfrac{\mathrm{rms}(S)}{\frac{1}{N} \sum_{i} |s_i|}$    (A25)
PSE-MaxToRms [73,74]    $\mathrm{PSE\text{-}MaxToRms} = \max(S) / \mathrm{rms}(S)$    (A26)
Feature Equation
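Mean    $\mu = \dfrac{1}{N} \sum_{i=1}^{N} x_i$    (A30)
Root Mean Square (RMS)    $\mathrm{RMS} = \sqrt{\dfrac{1}{N} \sum_{i=1}^{N} x_i^{2}}$    (A31)
Standard deviation    $\sigma = \sqrt{\dfrac{1}{N} \sum_{i=1}^{N} (x_i - \mu)^{2}}$    (A32)
Kurtosis    $\mathrm{Kurtosis} = \dfrac{N \sum_{i=1}^{N} (x_i - \mu)^{4}}{\left( \sum_{i=1}^{N} (x_i - \mu)^{2} \right)^{2}}$    (A33)
Maximum value    $\mathrm{Max} = \max(x_n)$    (A34)
Crest factor    $\mathrm{CrestFact} = \dfrac{\max(x_n)}{\mathrm{rms}(x_n)}$    (A35)
Rectified mean value    $\mathrm{RMV} = \dfrac{1}{N} \sum_{i=1}^{N} |x_i|$    (A36)
Shape factor    $\mathrm{ShapeFact} = \dfrac{\mathrm{rms}(x_n)}{\frac{1}{N} \sum_{i=1}^{N} |x_i|}$    (A37)
Impulse factor    $\mathrm{ImpulseFact} = \dfrac{\max(x_n)}{\frac{1}{N} \sum_{i=1}^{N} |x_i|}$    (A38)
Variance    $\mathrm{Var} = \dfrac{1}{N} \sum_{i=1}^{N} (x_i - \mu)^{2}$    (A39)
Minimum value    $\mathrm{Min} = \min(x_n)$    (A40)
Skewness    $\mathrm{Skewness} = \dfrac{N \sum_{i=1}^{N} (x_i - \mu)^{3}}{\sigma^{3}}$    (A41)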
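Most of the statistical features in Equations (A30) to (A40) can be computed in a few lines. The following Python sketch is an illustration only, not the Matlab code used in this work; the function name statistical_features and the dictionary keys are hypothetical, and skewness (A41) is left out of the sketch. It assumes a non-empty window that is not constant and not identically zero, so that no denominator vanishes.

```python
import math


def statistical_features(x):
    """Time-domain statistical features of one signal window x."""
    n = len(x)
    mean = sum(x) / n                                   # (A30)
    sq_dev = sum((v - mean) ** 2 for v in x)            # sum of squared deviations
    fourth_dev = sum((v - mean) ** 4 for v in x)
    variance = sq_dev / n                               # (A39)
    rms = math.sqrt(sum(v * v for v in x) / n)          # (A31)
    rectified_mean = sum(abs(v) for v in x) / n         # (A36)
    peak = max(x)                                       # (A34)
    return {
        "mean": mean,
        "rms": rms,
        "std": math.sqrt(variance),                     # (A32)
        "kurtosis": n * fourth_dev / sq_dev ** 2,       # (A33)
        "max": peak,
        "crest_factor": peak / rms,                     # (A35)
        "rectified_mean": rectified_mean,
        "shape_factor": rms / rectified_mean,           # (A37)
        "impulse_factor": peak / rectified_mean,        # (A38)
        "variance": variance,
        "min": min(x),                                  # (A40)
    }


# Square-wave-like toy window: every feature can be checked by hand.
features = statistical_features([1.0, -1.0, 1.0, -1.0])
for name, value in features.items():
    print(f"{name}: {value}")
```

For the toy window every ratio feature equals 1, which makes it a convenient sanity check before applying the function to real vibration windows.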
Figure A1. Kruskal–Wallis comparison between the machine learning models tested. The plot shows
several combinations of features calculated from channel A2 recorded from the CP. There are
statistically significant differences between the groups labeled in red and the group labeled in
blue, and between the groups SVM-Entropy and SVM-AllFeat and the groups labeled in blue and gray.
In general, there are statistically significant differences when the projection of the horizontal
line of a group onto the horizontal axis does not overlap the projection of another group.
Figure A2. Kruskal–Wallis comparison between the machine learning models tested, considering
several combinations of features extracted from channel A1 of the RC dataset. There are
statistically significant differences between the groups labeled in red and the group labeled in
blue. In general, there are statistically significant differences when the projection of the
horizontal line of a group onto the horizontal axis does not overlap the projection of another group.
References
1. Ali, S.M.; Hui, K.; Hee, L.; Leong, M.S. Automated valve fault detection based on acoustic emission parameters and support
vector machine. Alex. Eng. J. 2018, 57, 491–498. [CrossRef]
2. Cerrada, M.; Macancela, J.C.; Cabrera, D.; Estupiñan, E.; Sánchez, R.V.; Medina, R. Reciprocating compressor multi-fault
classification using symbolic dynamics and complex correlation measure. Appl. Sci. 2020, 10, 2512. [CrossRef]
3. Sharma, V.; Parey, A. Performance evaluation of decomposition methods to diagnose leakage in a reciprocating compressor
under limited speed variation. Mech. Syst. Signal Process. 2019, 125, 275–287. [CrossRef]
4. Soleimani, A.; Khadem, S. Early fault detection of rotating machinery through chaotic vibration feature extraction of experimental
data sets. Chaos Solitons Fractals 2015, 78, 61–75. [CrossRef]
5. Pincus, S.M. Approximate entropy as a measure of system complexity. Proc. Natl. Acad. Sci. USA 1991, 88, 2297–2301. [CrossRef]
[PubMed]
6. Pincus, S.M.; Huang, W.M. Approximate entropy: Statistical properties and applications. Commun. Stat. Theory Methods 1992,
21, 3061–3077. [CrossRef]
7. Richman, J.S.; Moorman, J.R. Physiological time-series analysis using approximate entropy and sample entropy. Am. J. Physiol.
Heart Circ. Physiol. 2000, 278, H2039–H2049. [CrossRef]
8. Weippert, M.; Behrens, M.; Rieger, A.; Behrens, K. Sample entropy and traditional measures of heart rate dynamics reveal
different modes of cardiovascular control during low intensity exercise. Entropy 2014, 16, 5698–5711. [CrossRef]
9. Zhao, L.; Wei, S.; Zhang, C.; Zhang, Y.; Jiang, X.; Liu, F.; Liu, C. Determination of sample entropy and fuzzy measure entropy
parameters for distinguishing congestive heart failure from normal sinus rhythm subjects. Entropy 2015, 17, 6270–6288. [CrossRef]
10. Chen, W.; Wang, Z.; Xie, H.; Yu, W. Characterization of surface EMG signal based on fuzzy entropy. IEEE Trans. Neural Syst.
Rehabil. Eng. 2007, 15, 266–272. [CrossRef]
11. Wang, Z.; Xue, X. Multi-class support vector machine. In Support Vector Machines Applications; Springer: Berlin/Heidelberg,
Germany, 2014; pp. 23–48. [CrossRef]
12. Escalera, S.; Pujol, O.; Radeva, P. On the decoding process in ternary error-correcting output codes. IEEE Trans. Pattern Anal.
Mach. Intell. 2010, 32, 120–134. [CrossRef] [PubMed]
13. Janjarasjitt, S.; Ocak, H.; Loparo, K. Bearing condition diagnosis and prognosis using applied nonlinear dynamical analysis of
machine vibration signal. J. Sound Vib. 2008, 317, 112–126. [CrossRef]
14. Sun, Y. Fault Detection in Dynamic Systems Using the Largest Lyapunov Exponent. Ph.D. Thesis, Texas A & M University,
College Station, TX, USA, 2012.
15. Medina, R.; Macancela, J.C.; Lucero, P.; Cabrera, D.; Sánchez, R.V.; Cerrada, M. Gear and bearing fault classification under
different load and speed by using Poincaré plot features and SVM. J. Intell. Manuf. 2020, 33, 1031–1055. [CrossRef]
16. Dong, X.; Li, G.; Jia, Y.; Li, B.; He, K. Non-iterative denoising algorithm for mechanical vibration signal using spectral graph
wavelet transform and detrended fluctuation analysis. Mech. Syst. Signal Process. 2021, 149, 107202. [CrossRef]
17. Radhakrishnan, S.; Lee, Y.T.T.; Rachuri, S.; Kamarthi, S. Complexity and entropy representation for machine component
diagnostics. PLoS ONE 2019, 14, e0217919. [CrossRef]
18. Li, Y.; Wang, X.; Liu, Z.; Liang, X.; Si, S. The entropy algorithm and its variants in the fault diagnosis of rotating machinery:
A review. IEEE Access 2018, 6, 66723–66741. [CrossRef]
19. Huo, Z.; Martínez-García, M.; Zhang, Y.; Yan, R.; Shu, L. Entropy measures in machine fault diagnosis: Insights and applications.
IEEE Trans. Instrum. Meas. 2020, 69, 2607–2620. [CrossRef]
20. Zhao, H.-Y.; Wang, J.-D.; Xing, J.-J.; Gao, Y.-Q. A feature extraction method based on LMD and MSE and its application for fault
diagnosis of reciprocating compressor. J. Vibroeng. 2015, 17, 3515–3526.
21. Li, X.; Ren, P.; Zhang, Z.; Jia, X.; Peng, X. A p−V Diagram Based Fault Identification for Compressor Valve by Means of
Linear Discrimination Analysis. Machines 2022, 10, 53. [CrossRef]
22. Lv, Q.; Cai, L.; Yu, X.; Ma, H.; Li, Y.; Shu, Y. An Automatic Fault Diagnosis Method for the Reciprocating Compressor Based on
HMT and ANN. Appl. Sci. 2022, 12, 5182. [CrossRef]
23. Patil, A.; Soni, G.; Prakash, A. A BMFO-KNN based intelligent fault detection approach for reciprocating compressor. Int. J. Syst.
Assur. Eng. Manag. 2021, 1–13. [CrossRef]
24. Zhao, D.; Liu, S.; Zhang, H.; Sun, X.; Wang, L.; Wei, Y. Intelligent fault diagnosis of reciprocating compressor based on attention
mechanism assisted convolutional neural network via vibration signal rearrangement. Arab. J. Sci. Eng. 2021, 46, 7827–7840.
[CrossRef]
25. Xiao, S.; Nie, A.; Zhang, Z.; Liu, S.; Song, M.; Zhang, H. Fault Diagnosis of a Reciprocating Compressor Air Valve Based on Deep
Learning. Appl. Sci. 2020, 10, 6596. [CrossRef]
26. Zhou, F.; Liu, W.; Yang, X.; Shen, J.; Gong, P. A new method of health condition detection for hydraulic pump using enhanced
whale optimization-resonance-based sparse signal decomposition and modified hierarchical amplitude-aware permutation
entropy. Trans. Inst. Meas. Control 2021, 43, 3360–3376. [CrossRef]
27. Wang, Y.; Lu, C.; Liu, H.; Wang, Y. Fault diagnosis for centrifugal pumps based on complementary ensemble empirical mode
decomposition, sample entropy and random forest. In Proceedings of the 2016 12th World Congress on Intelligent Control and
Automation (WCICA), Guilin, China, 12–15 June 2016; IEEE: Piscataway, NJ, USA, 2016; pp. 1317–1320. [CrossRef]
28. Ahmad, Z.; Rai, A.; Maliuk, A.S.; Kim, J.M. Discriminant feature extraction for centrifugal pump fault diagnosis. IEEE Access
2020, 8, 165512–165528. [CrossRef]
29. Hasan, M.J.; Rai, A.; Ahmad, Z.; Kim, J.M. A fault diagnosis framework for centrifugal pumps by scalogram-based imaging and
deep learning. IEEE Access 2021, 9, 58052–58066. [CrossRef]
30. Ahmad, Z.; Nguyen, T.K.; Ahmad, S.; Nguyen, C.D.; Kim, J.M. Multistage centrifugal pump fault diagnosis using informative
ratio principal component analysis. Sensors 2021, 22, 179. [CrossRef]
31. Irfan, M.; Alwadie, A.S.; Glowacz, A.; Awais, M.; Rahman, S.; Khan, M.K.A.; Jalalah, M.; Alshorman, O.; Caesarendra, W. A novel
feature extraction and fault detection technique for the intelligent fault identification of water pump bearings. Sensors 2021,
21, 4225. [CrossRef]
32. Chen, T.; Guestrin, C. Xgboost: A scalable tree boosting system. In Proceedings of the 22nd ACM Sigkdd International Conference
on Knowledge Discovery and Data Mining, San Francisco, CA, USA, 13–17 August 2016; pp. 785–794. [CrossRef]
33. Kumar, A.; Gandhi, C.; Zhou, Y.; Kumar, R.; Xiang, J. Improved deep convolution neural network (CNN) for the identification of defects
in the centrifugal pump using acoustic images. Appl. Acoust. 2020, 167, 107399. [CrossRef]
34. Howedi, A.; Lotfi, A.; Pourabdollah, A. An entropy-based approach for anomaly detection in activities of daily living in the
presence of a visitor. Entropy 2020, 22, 845. [CrossRef]
35. Callegari, C.; Giordano, S.; Pagano, M. Entropy-based network anomaly detection. In Proceedings of the 2017 International
Conference on Computing, Networking and Communications (ICNC), Santa Clara, CA, USA, 26–29 January 2017; IEEE:
Piscataway, NJ, USA, 2017; pp. 334–340. [CrossRef]
36. Palmieri, F. Network anomaly detection based on logistic regression of nonlinear chaotic invariants. J. Netw. Comput. Appl. 2019,
148, 102460. [CrossRef]
37. Wu, S.; Moore, B.E.; Shah, M. Chaotic invariants of lagrangian particle trajectories for anomaly detection in crowded scenes. In
Proceedings of the 2010 IEEE Computer Society Conference on Computer Vision and Pattern Recognition, San Francisco, CA,
USA, 13–18 June 2010; IEEE: Piscataway, NJ, USA, 2010; pp. 2054–2060. [CrossRef]
38. Garland, J.; Jones, T.R.; Neuder, M.; Morris, V.; White, J.W.; Bradley, E. Anomaly detection in paleoclimate records using
permutation entropy. Entropy 2018, 20, 931. [CrossRef] [PubMed]
39. Wang, S.; Lu, M.; Kong, S.; Ai, J. A Dynamic Anomaly Detection Approach Based on Permutation Entropy for Predicting
Aging-Related Failures. Entropy 2020, 22, 1225. [CrossRef] [PubMed]
40. Ahmad, S.; Styp-Rekowski, K.; Nedelkoski, S.; Kao, O. Autoencoder-based condition monitoring and anomaly detection method
for rotating machines. In Proceedings of the 2020 IEEE International Conference on Big Data (Big Data), Atlanta, GA, USA,
10–13 December 2020; IEEE: Piscataway, NJ, USA, 2020; pp. 4093–4102. [CrossRef]
41. de Paula Monteiro, R.; Lozada, M.C.; Mendieta, D.R.C.; Loja, R.V.S.; Bastos Filho, C.J.A. A hybrid prototype selection-based deep
learning approach for anomaly detection in industrial machines. Expert Syst. Appl. 2022, 204, 117528. [CrossRef]
42. Li, D.; Chen, D.; Jin, B.; Shi, L.; Goh, J.; Ng, S.K. MAD-GAN: Multivariate anomaly detection for time series data with generative
adversarial networks. In Proceedings of the International Conference on Artificial Neural Networks, Munich, Germany,
17–19 September 2019; Springer: Berlin/Heidelberg, Germany, 2019; pp. 703–716. [CrossRef]
43. Kumar, S.R.; Iniyal, U.; Harshitha, V.; Abinaya, M.; Janani, J.; Jayaprasanth, D. Anomaly Detection in Centrifugal Pumps Using
Model Based Approach. In Proceedings of the 2022 8th International Conference on Advanced Computing and Communication
Systems (ICACCS), Coimbatore, India, 25–26 March 2022; IEEE: Piscataway, NJ, USA, 2022; Volume 1, pp. 427–433. [CrossRef]
44. Dutta, N.; Kaliannan, P.; Subramaniam, U. Application of machine learning algorithm for anomaly detection for industrial pumps.
In Machine Learning Algorithms for Industrial Applications; Springer: Berlin/Heidelberg, Germany, 2021; pp. 237–263. [CrossRef]
45. Dutta, A.; Karimi, I.A.; Farooq, S. PROAD (Process Advisor): A Health Monitoring Framework for Centrifugal Pumps. Comput.
Chem. Eng. 2022, 163, 107825. [CrossRef]
46. Charoenchitt, C.; Tangamchit, P. Anomaly Detection of a Reciprocating Compressor using Autoencoders. In Proceedings of the
2021 Second International Symposium on Instrumentation, Control, Artificial Intelligence, and Robotics (ICA-SYMP), Bangkok,
Thailand, 20–22 January 2021; IEEE: Piscataway, NJ, USA, 2021; pp. 1–4. [CrossRef]
47. Rand, D.; Young, L.S. Dynamical Systems and Turbulence. In Lecture Notes in Mathematics; Springer: Cham, Switzerland, 1981;
Volume 898. [CrossRef]
48. Vlachos, I.; Kugiumtzis, D. State space reconstruction for multivariate time series prediction. arXiv 2008, arXiv:0809.2220.
https://doi.org/10.48550/arXiv.0809.2220.
49. Abarbanel, H.D.; Brown, R.; Sidorowich, J.J.; Tsimring, L.S. The analysis of observed chaotic data in physical systems. Rev. Mod.
Phys. 1993, 65, 1331. [CrossRef]
50. Fraser, A.M.; Swinney, H.L. Independent coordinates for strange attractors from mutual information. Phys. Rev. A 1986, 33, 1134.
[CrossRef]
51. Kennel, M.B.; Brown, R.; Abarbanel, H.D. Determining embedding dimension for phase-space reconstruction using a geometrical
construction. Phys. Rev. A 1992, 45, 3403. [CrossRef]
52. Ribeiro, M.; Henriques, T.; Castro, L.; Souto, A.; Antunes, L.; Costa-Santos, C.; Teixeira, A. The entropy universe. Entropy 2021,
23, 222. [CrossRef] [PubMed]
53. Azami, H.; Escudero, J. Amplitude- and fluctuation-based dispersion entropy. Entropy 2018, 20, 210.
[CrossRef] [PubMed]
54. Porta, A.; Baselli, G.; Liberati, D.; Montano, N.; Cogliati, C.; Gnecchi-Ruscone, T.; Malliani, A.; Cerutti, S. Measuring regularity by
means of a corrected conditional entropy in sympathetic outflow. Biol. Cybern. 1998, 78, 71–78. [CrossRef] [PubMed]
55. Feutrill, A.; Roughan, M. A Review of Shannon and Differential Entropy Rate Estimation. Entropy 2021, 23, 1046. [CrossRef]
56. Bandt, C.; Pompe, B. Permutation entropy: A natural complexity measure for time series. Phys. Rev. Lett. 2002, 88, 174102.
[CrossRef]
57. Unakafova, V.A.; Keller, K. Efficiently measuring complexity on the basis of real-world data. Entropy 2013, 15, 4392–4415.
[CrossRef]
58. Steven, V. Heart rate variability linear and nonlinear analysis with applications in human physiology. Diss. Abstr. Int. 2010, 71.
59. Lu, C.; Sun, Q.; Tao, L.; Liu, H.; Lu, C. Bearing health assessment based on chaotic characteristics. Shock Vib. 2013, 20, 519–530.
[CrossRef]
60. Rolo-Naranjo, A.; Montesino-Otero, M.E. A method for the correlation dimension estimation for on-line condition monitoring of
large rotating machinery. Mech. Syst. Signal Process. 2005, 19, 939–954. [CrossRef]
61. Ihlen, E.A. Introduction to multifractal detrended fluctuation analysis in Matlab. Front. Physiol. 2012, 3, 141. [CrossRef]
62. JiaQing, W.; Han, X.; Yong, L.; Tao, W.; Zengbing, X. Detrended Fluctuation Analysis and Hough Transform Based Self-Adaptation
Double-Scale Feature Extraction of Gear Vibration Signals. Shock Vib. 2016, 2016, 3409897. [CrossRef]
63. Golińska, A.K. Detrended fluctuation analysis (DFA) in biomedical signal processing: Selected examples. Stud. Log. Gramm.
Rhetor. 2012, 29, 107–115.
64. Henry, B.; Lovell, N.; Camacho, F. Nonlinear dynamics time series analysis. In Nonlinear Biomedical Signal Processing: Dynamic
Analysis and Modeling; Wiley: Hoboken, NJ, USA, 2012; Volume 2, pp. 1–39. [CrossRef]
65. Wolf, A.; Swift, J.B.; Swinney, H.L.; Vastano, J.A. Determining Lyapunov exponents from a time series. Phys. D Nonlinear Phenom.
1985, 16, 285–317. [CrossRef]
66. Rosenstein, M.T.; Collins, J.J.; De Luca, C.J. A practical method for calculating largest Lyapunov exponents from small data sets.
Phys. D Nonlinear Phenom. 1993, 65, 117–134. [CrossRef]
67. Shannon, C.E. A mathematical theory of communication. ACM SIGMOBILE Mob. Comput. Commun. Rev. 2001, 5, 3–55. [CrossRef]
68. Gao, J.; Liu, F.; Zhang, J.; Hu, J.; Cao, Y. Information entropy as a basic building block of complexity theory. Entropy 2013,
15, 3396–3418. [CrossRef]
69. Goldstein, S.; Lebowitz, J.L.; Tumulka, R.; Zanghì, N. Gibbs and Boltzmann entropy in classical and quantum mechanics. arXiv
2019, arXiv:1903.11870.
70. Coifman, R.R.; Wickerhauser, M.V. Entropy-based algorithms for best basis selection. IEEE Trans. Inf. Theory 1992, 38, 713–718.
[CrossRef]
71. Tsallis, C. Possible generalization of Boltzmann-Gibbs statistics. J. Stat. Phys. 1988, 52, 479–487. [CrossRef]
72. Rényi, A. On measures of entropy and information. In Contributions to the Theory of Statistics, Proceedings of the Fourth Berkeley
Symposium on Mathematical Statistics and Probability, Davis, CA, USA, 20–30 June 1960; The Regents of the University of California:
Davis, CA, USA, 1961; Volume 1.
73. Sharma, V.; Parey, A. A review of gear fault diagnosis using various condition indicators. Procedia Eng. 2016, 144, 253–263.
[CrossRef]
74. Pan, Y.; Chen, J.; Li, X. Spectral entropy: A complementary index for rolling element bearing performance degradation assessment.
Proc. Inst. Mech. Eng. Part C J. Mech. Eng. Sci. 2009, 223, 1223–1231. [CrossRef]
75. Breiman, L. Random forests. Mach. Learn. 2001, 45, 5–32. [CrossRef]
76. Efron, B.; Tibshirani, R.J. An Introduction to the Bootstrap; CRC Press: Boca Raton, FL, USA, 1994.
77. Vapnik, V.N. An overview of statistical learning theory. IEEE Trans. Neural Netw. 1999, 10, 988–999. [CrossRef]
78. Scholkopf, B. Support vector machines: A practical consequence of learning theory. IEEE Intell. Syst. 1998, 13, 4. [CrossRef]
79. Hossin, M.; Sulaiman, M. A review on evaluation metrics for data classification evaluations. Int. J. Data Min. Knowl. Manag.
Process 2015, 5, 1. [CrossRef]
80. Sokolova, M.; Lapalme, G. A systematic analysis of performance measures for classification tasks. Inf. Process. Manag. 2009,
45, 427–437. [CrossRef]
81. Wang, W.; Chen, J.; Wu, Z. The application of a correlation dimension in large rotating machinery fault diagnosis. Proc. Inst.
Mech. Eng. Part C J. Mech. Eng. Sci. 2000, 214, 921–930. [CrossRef]
82. Boon, M.Y.; Henry, B.I.; Suttle, C.M.; Dain, S.J. The correlation dimension: A useful objective measure of the transient visual
evoked potential? J. Vis. 2008, 8, 6. [CrossRef] [PubMed]
83. Brown, J. Classifiers and their metrics quantified. Mol. Inform. 2018, 37, 1700127. [CrossRef] [PubMed]
84. Bradley, A.P. The use of the area under the ROC curve in the evaluation of machine learning algorithms. Pattern Recognit. 1997,
30, 1145–1159. [CrossRef]
85. Pohlert, T. The pairwise multiple comparison of mean ranks package (PMCMR). R Package 2014, 27, 9.
86. Leite, G.d.N.P.; Araújo, A.M.; Rosas, P.A.C.; Stosic, T.; Stosic, B. Entropy measures for early detection of bearing faults. Phys. A
Stat. Mech. Its Appl. 2019, 514, 458–472. [CrossRef]
mathematics
Article
Lateral Dynamic Simulation of a Bus under Variable Conditions
of Camber and Curvature Radius
Ester Olmeda 1, *, Enrique Roberto Carrillo Li 2 , Jorge Rodríguez Hernández 2 and Vicente Díaz 1
Abstract: The objective of this paper is to describe a model for simulating the lateral dynamics
of a vehicle, specifically a bus, under variable trajectory conditions, such as camber and radius
of curvature; in addition, a variable speed is added as a simulation parameter. The aim of
this study is the prevention of vehicle rollover and sideslip. An eight-degree-of-freedom model
was developed, considering a front and a rear section of the bus, each with its respective
suspension system; both sections are connected by a torsion spring that emulates the torsional
stiffness of the vehicle chassis. A Panhard bar is also added at the rear as an additional
suspension element, and the behavior of the bus with this bar is analyzed. The model also allows
the force on each suspension component to be evaluated, which enables future control of active
suspension components. The results show the dynamic behavior of the vehicle, and some indicators
are introduced to show possible sideslip or rollover. In conclusion, the influence of the road
parameters and the effect of the Panhard bar on the dynamic behavior of the bus are pointed out.
Keywords: lateral dynamics; track parameters; vehicle mathematical modeling; safety vehicle index
MSC: 70E50
Citation: Olmeda, E.; Carrillo Li, E.R.; Rodríguez Hernández, J.; Díaz, V. Lateral Dynamic
Simulation of a Bus under Variable Conditions of Camber and Curvature Radius. Mathematics
2022, 10, 3081. https://doi.org/10.3390/math10173081
Academic Editor: Armin Fügenschuh
Received: 30 June 2022
Accepted: 18 August 2022
Published: 26 August 2022
Publisher’s Note: MDPI stays neutral with regard to jurisdictional claims in published maps and
institutional affiliations.
Copyright: © 2022 by the authors. Licensee MDPI, Basel, Switzerland. This article is an open access
article distributed under the terms and conditions of the Creative Commons Attribution (CC BY)
license (https://creativecommons.org/licenses/by/4.0/).
1. Introduction
Vehicle dynamics deals with the mechanical modeling of vehicles as well as the
mathematical description and analysis of vehicles.
Vehicle research models have undergone major changes in recent decades, from the
traditional lumped parameter model to the finite element model, the dynamic substructure
model and the dynamic multibody model; and from linear models to nonlinear models
with nonlinear stiffness or damping. The responses of these models can be obtained
theoretically or numerically depending on the complexity of the resulting mathematical
models. This complexity is closely linked to the number of degrees of freedom considered
for the respective modeling [1].
According to [2–4], among others, there are two ways to perform modeling of mechanical
systems: an empirical approach and an axiomatic or theoretical approach. In this work,
the axiomatic modeling method has been used for a vehicular system. The mathematical
or virtual model is obtained through the application of the Newton-D’Alembert principle,
obtaining a system of ordinary differential equations.
In general, the analytical modeling of any vehicle involves analyzing the interrelationships
between a large number of components, as well as their response to different
external disturbances and the effects of these on the rest of the components. Because of this,
different models must be proposed to predict the behavior of the vehicle in the presence of
a set of external stimuli in order to determine a specific type of response. Such modeling
implies leaving aside some components and disturbances with little or no relevance to the
phenomenon we are trying to understand.
In the present work, the objective is to analyze the lateral dynamics of a vehicle, that is,
to determine its vertical and lateral response, as well as the vertical and lateral movements
of its components, under vertical disturbances and transverse loads; longitudinal
disturbances are therefore left aside. In addition, the kinematics of the vehicle
along the longitudinal axis and its rotation around the z-axis will not be determined.
In recent years, different vehicle models have been proposed for the analysis of lateral
dynamics and of accident consequences in case of rollover or sideslip. In [5], a restricted lateral
dynamics model of articulated vehicles and an algorithm for the estimation of sideslip
angle and cornering stiffness were proposed. The articulated vehicle was modeled by using
the bicycle model, the linear tire model, and the modified Dugoff model. Ref. [6] presents
an application of combined dynamic and finite element (FE) simulations to evaluate the
rollover crash of a bus. A nonlinear mathematical model with six degrees of freedom is
shown in [7], from which the effect of different factors (braking angle, vehicle speed,
acceleration, and so on) on the stability of the vehicle on a curved trajectory is studied. In [8], a
model is proposed that simulates the lateral rollover test, including the effect of a variable
position of the center of gravity, and calculates the maximum speed at which the bus can
travel in a curve without rolling over. The influence of several parameters (e.g., height
of the center of gravity, weight distribution between axles, chassis torsional stiffness)
on bus roll stability during a lane change maneuver has been analyzed in [9] by means of a
beam finite element model with six degrees of freedom.
The aim of lateral dynamics is to describe the behavior of a vehicle subjected
to accelerations and/or forces in its lateral direction, in order to prevent a
possible rollover or sideslip. These loads, whose magnitudes are a
function of speed, commonly appear when the vehicle travels on a curved track. Different
studies show that a high number of traffic accidents take place on horizontal curves, where
the risk of accidents is significantly higher than on other road sections [10–12]. According
to several recent studies, the radius of curvature is the most important factor in the
geometric design of horizontal curves, and the accident rate is directly related to it [13,14].
The most significant parameters of a curved path for this study are the radius of curvature
and the camber, since these geometrical characteristics influence the lateral
behavior. Because of these loads, driving on a curved path can be difficult or even
dangerous. The vehicle is subjected to a lateral force caused by the
centripetal acceleration acting on the center of gravity of each of the vehicle’s components.
The adhesion forces and the normal forces between the road and the tires balance this
lateral force and prevent the vehicle from rolling over or sideslipping. However, the
adhesion forces have a limit given by Coulomb’s dry friction law, and a second limit is
reached when the normal force on any wheel drops to zero, the moment at which the vehicle
rolls over. These limits are directly related to the geometrical factors of the curve and to
the environmental conditions that may cause them to vary.
Therefore, the study of track characteristics is also of great relevance in lateral vehicle
dynamics. The safety of horizontal road curves was evaluated in [15] according to the
American Association of State Highway and Transportation Officials (AASHTO) standard. Different
parameters, such as vehicle weight, vehicle dimensions, longitudinal slopes, and vehicle
velocity, were evaluated in the geometric design of horizontal curves using a multi-body
dynamic simulation process. A combination of simple circular and clothoid transition
curves with several longitudinal upslopes and downslopes was designed.
Ref. [16] studied the radius of curvature of the real vehicle trajectory under the driver’s
instantaneous emergency steering maneuvers, calculated on the basis of the bicycle model.
The possibility of rollover was analyzed taking into account the curve entry velocity and camber.
Mathematics 2022, 10, 3081
In this study, a trajectory composed of three paths (a straight path, a path of variable
radius of curvature and a path of constant curvature) is modeled. For simulation purposes,
only geometric variations are considered in order to obtain the values of the adhesion and
normal forces while a vehicle follows a simulation trajectory composed of these
three paths with different radii of curvature. This design fits realistic transport routes,
since it has been calculated and parameterized with the transition curves used in road design
(trajectories that connect a straight and a circular path, starting with an infinite radius
of curvature and ending with a constant radius of curvature).
A bus model is built using rigid multibody systems, from which a virtual or
mathematical model (a system of ordinary differential equations) has been generated. The
passenger bus is modeled in two sections, front and rear. The front part comprises
several subsystems, such as the unsprung masses, which are connected by
a stability or torsion bar, and the flexible wheels. These parts are represented by a set of
inertia, stiffness, and damping elements. The rear part has been modeled in an analogous
way although, unlike the front, it has only one unsprung mass, and a Panhard
bar is used to improve its vertical and lateral dynamic behavior. For this model, the
suspension system and tire stiffness as well as the damping of both bodies have been
modeled as linear components.
The Panhard bar is a suspension element that prevents lateral displacements of an axle
and also restricts the relative vertical movement between the suspended and unsuspended
mass. Independent rear suspension systems do not require a Panhard bar. The bar is
attached at two points that allow only the up and down movement of its ends, so that it
can only move in the vertical plane. The forces that restrict the movement between the
connected elements, and the moments they produce on both the suspended and the
unsuspended mass, take part in the dynamics of these elements. Some previous studies have
incorporated a Panhard bar in their vehicle
models. In [17], a roll plane model of a road vehicle is provided incorporating the kinematic
constraint imposed by a Panhard bar. The results show that the location and orientation of
the Panhard bar significantly influence the kinetostatic roll properties of the suspension
when the vehicle is subjected to vertical and lateral forces. In [18], a common front axle
suspension is designed for four different tractor models, in which a Panhard bar is used to
improve safety and driving stability. Also concerning the Panhard bar, [19] compares the
structural durability behavior of commercial vehicle Panhard bars made of a ferritic cast nodular
iron with the behavior of rods of an austempered.
Both sections of the bus are connected by a torsional spring (defined by its torsional
stiffness) to emulate the dynamic influence of the vehicle chassis. Due to the different
longitudinal position of the two sections and, consequently, the different roll angle, the
chassis exerts a torsional moment on both sections, which influences the lateral stability.
This bus model travels along the simulated trajectory and allows the simulation of the
lateral dynamics of a bus when it travels on a cambered curve.
These dimensions are subsequently applied to modeling of the road, with the radius
of curvature ρ and camber angle α as geometric parameters defining the dynamic behavior
of the vehicle on a curved path. The track’s width Bt and roadway width Bw define the
maximum value of the camber height and the vehicle width, respectively; the track width
also defines the number of lanes on the road. In order to simplify the simulation, it has
been assumed that the vehicle is driving in the middle roadway lane.
$$a_y = \frac{v^2}{\rho} \tag{1}$$
This acceleration, proportional to the square of the speed v and inversely proportional to
the radius of curvature ρ, gives rise to an inertial force opposite in sense to the normal
acceleration and oriented horizontally in the plane in which it acts. This inertial
force is commonly known as centrifugal force, F_i:
$$F_i = m_T \frac{v^2}{\rho} \tag{2}$$
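As a quick numerical illustration of Equations (1) and (2), the following sketch evaluates both at the average speed and final radius used later for the simulation path (120 km/h, 400 m). The total mass `m_total` is an assumption made here: it is simply the sum of the five masses in the parameter list, which the text does not state explicitly.

```python
def lateral_acceleration(v, rho):
    """Normal (lateral) acceleration a_y = v**2 / rho, Equation (1)."""
    return v ** 2 / rho


def centrifugal_force(m_total, v, rho):
    """Inertial (centrifugal) force F_i = m_T * v**2 / rho, Equation (2)."""
    return m_total * lateral_acceleration(v, rho)


v = 120.0 / 3.6      # average speed: 120 km/h converted to m/s
rho = 400.0          # radius of curvature of the circular path, m
m_total = 16000.0    # assumed total mass, kg (sum of the listed masses)

a_y = lateral_acceleration(v, rho)
F_i = centrifugal_force(m_total, v, rho)
print(f"a_y = {a_y:.3f} m/s^2, F_i = {F_i:.0f} N")
```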
Skidding occurs when the component of the resultant force R parallel to the road surface
exceeds the adhesion force given by Coulomb’s dry friction law. When a rollover occurs,
the vehicle turns toward the outer side of the curve, pivoting about one or more wheels
on the outer side of the vehicle.
Consequently, rollover occurs, or is at least imminent, when the normal force on one or
more wheels on the inner side drops to zero. To analyze the vehicle’s
rollover, a fixed reference point is required; the roll center is introduced for this purpose,
as a point that remains static in the vehicle’s frontal plane despite the deformation of the
suspension. All moments are calculated about this point, so it is important to highlight
that the higher the vehicle’s mass center, the greater the risk of rollover.
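The Coulomb limit mentioned above can be illustrated with the textbook point-mass balance on a banked curve. This is a simplified sketch, not the multibody model of this work: the vehicle is reduced to a single particle, the camber angle `alpha` is assumed to tilt the road toward the inside of the curve, and the function name is hypothetical.

```python
import math


def skid_limit_speed(rho, alpha, mu, g=9.81):
    """Highest steady speed (m/s) before a point mass slides outward.

    From the force balance on a banked curve with friction F <= mu * N:
        v**2 / (rho * g) = (mu + tan(alpha)) / (1 - mu * tan(alpha))
    """
    t = math.tan(alpha)
    if mu * t >= 1.0:
        return math.inf          # banking alone holds the mass at any speed
    return math.sqrt(rho * g * (mu + t) / (1.0 - mu * t))


# Flat versus cambered curve of 400 m radius with a friction coefficient of 0.4
v_flat = skid_limit_speed(400.0, 0.0, 0.4)
v_banked = skid_limit_speed(400.0, 0.09, 0.4)   # 0.09 rad camber, illustrative
print(f"flat: {v_flat * 3.6:.1f} km/h, cambered: {v_banked * 3.6:.1f} km/h")
```

The camber raises the sideslip limit, which is the qualitative effect the simulation quantifies with the full model.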
Transition curves enhance the safety of horizontal curves, as they help to apply camber
and centrifugal force gradually. Since the transition path is the only one whose parameters
vary, it is the only one that requires parameter variation equations. Such a path
connects a straight and a circular path, so that it begins with an infinite curvature radius and
ends at a constant curvature radius; furthermore, the camber angle increases or decreases
along it. A variety of mathematical curves are used to describe these paths
and determine their points in space. Among them, the best known are Bernoulli’s
lemniscate and the clothoid, which is the most widely applied [15]. The use of a clothoid
transition curve between the tangent and a circular curve has been suggested to increase the
margin of safety against sideslip, especially when the camber exceeds 12% [20]. The
clothoid has therefore been used in this study.
A clothoid is defined by the following equation:
$$\rho(s) \cdot s = A^2 \tag{3}$$
where s is the distance traveled along the curve and ρ(s) is the curvature radius as a function
of that distance; that is, the curvature radius is inversely proportional to the distance
traveled, with constant A². In other words, as the vehicle moves forward along the path, the
curvature radius decreases (or increases, if the clothoid is traversed toward the straight
section). A clothoid curve is depicted in Figure 3.
The position vector is described by the following equation which is expressed in terms
of Fresnel functions:
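$$\vec{r}(s) = [x(s);\ y(s)] = A\sqrt{\pi}\left[\operatorname{FresnelC}\!\left(\frac{s}{A\sqrt{\pi}}\right);\ \operatorname{FresnelS}\!\left(\frac{s}{A\sqrt{\pi}}\right)\right] \tag{4}$$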
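The position vector of Equation (4) can be checked without a Fresnel library. Equation (3) gives the curvature s/A², so the tangent angle is θ(s) = s²/(2A²) and the coordinates are the integrals of cos θ and sin θ; this matches Equation (4) when FresnelC and FresnelS are the normalized Fresnel integrals (with argument πu²/2), which is assumed here. The sketch below integrates numerically with Simpson's rule; the function name and the sample values of `s` and `A` are illustrative only.

```python
import math


def clothoid_point(s, A, n=2000):
    """Return (x, y) on the clothoid rho * s = A**2 at arc length s.

    Integrates cos(theta) and sin(theta), theta(u) = u**2 / (2 A**2),
    from 0 to s with the composite Simpson rule (n must be even).
    """
    if s == 0.0:
        return 0.0, 0.0

    def theta(u):
        return u * u / (2.0 * A * A)

    h = s / n
    x = math.cos(theta(0.0)) + math.cos(theta(s))
    y = math.sin(theta(0.0)) + math.sin(theta(s))
    for i in range(1, n):
        w = 4.0 if i % 2 else 2.0      # Simpson weights 4, 2, 4, ...
        u = i * h
        x += w * math.cos(theta(u))
        y += w * math.sin(theta(u))
    return x * h / 3.0, y * h / 3.0


x, y = clothoid_point(100.0, 200.0)
print(f"x = {x:.4f} m, y = {y:.4f} m")
```

For short arcs the result agrees with the series x ≈ s − s⁵/(40A⁴), y ≈ s³/(6A²), which is a convenient sanity check.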
One of the most important functions of the transition curve is to avoid the sudden
appearance of uncompensated acceleration; it is therefore necessary to evaluate the magnitude
of the over acceleration (jerk), which is the rate of change of the acceleration. According to
[21], for railway vehicles, it is recommended that this value not exceed 0.2 m/s³; for a road
vehicle, however, this value is governed by the driver’s steering input. For this
simulation, the value is set as an input parameter, since the maneuvering of the
vehicle is treated as a random parameter and is not simulated.
$$\rho(t) = \frac{A^2}{v(t)\,t} \tag{5}$$
$$z(t) = \frac{z_b}{s_b}\, v(t)\, t \tag{6}$$
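Equations (5) and (6) can be evaluated directly as functions of time. The sketch below assumes constant speed, uses the final camber and radius of the simulation path, and picks a hypothetical clothoid length `s_b` of 200 m purely for illustration; the clothoid constant is taken as A = sqrt(s_b ρ_b), following Equation (8).

```python
import math


def curvature_radius(t, v, A):
    """rho(t) = A**2 / (v t), Equation (5); infinite at the clothoid start."""
    s = v * t
    return float("inf") if s == 0.0 else A * A / s


def camber_height(t, v, z_b, s_b):
    """z(t) = (z_b / s_b) v t, Equation (6): linear growth with distance."""
    return z_b / s_b * v * t


v = 120.0 / 3.6              # constant speed, m/s
rho_b, z_b = 400.0, 0.25     # final radius (m) and camber (m) of the circular path
s_b = 200.0                  # hypothetical clothoid length, m
A = math.sqrt(s_b * rho_b)   # clothoid constant, Equation (8)
t_b = s_b / v                # time needed to traverse the clothoid

for t in (0.0, t_b / 2.0, t_b):
    print(t, curvature_radius(t, v, A), camber_height(t, v, z_b, s_b))
```

At the end of the clothoid both functions reach the constant values of the circular path, so the two path segments join continuously.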
In the previous expressions, z_b is the constant camber value on the circular path and
s_b is the total length of the clothoid. The velocity is taken as constant
for the purpose of generating the simulation path. Only a transition path and a circular
path are required to study the vehicle’s behavior, since on a straight path there is no
lateral acceleration and hence no risk of rollover.
From Equation (6), a condition for the minimum clothoid length s_b,min
can be derived. According to [21], the maximum uncompensated lateral acceleration
should be 1 m/s² for comfort and safety reasons. From the given conditions, the following
expression can be developed:
$$\frac{\left(\dfrac{9.81\, z_b}{B_w} - \dfrac{v^2 \sqrt{B_w^2 - z_b^2}}{\rho_{min}\, B_w}\right) v_D}{0.2} < s_{b,min} \tag{7}$$
From this expression, the clothoid’s constant can be determined:
$$A = \sqrt{s_{b,min}\, \rho_b} \tag{8}$$
Once the transition path is defined, a circular path can be added giving as a result the
whole simulation path.
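The sizing of the clothoid can be sketched as follows. This is one possible reading of Equation (7), not a definitive implementation: the jerk is approximated as the uncompensated lateral acceleration divided by the time spent on the clothoid, the absolute value is an assumption added so the length is always positive, and the roadway width `B_w` of 2.7 m is a hypothetical value.

```python
import math


def min_clothoid_length(v, rho_min, z_b, B_w, jerk_max, g=9.81):
    """Shortest clothoid length that keeps the jerk below jerk_max.

    sin(alpha) = z_b / B_w gives the camber angle; the uncompensated
    lateral acceleration is v**2 / rho * cos(alpha) - g * sin(alpha).
    """
    sin_a = z_b / B_w
    cos_a = math.sqrt(B_w ** 2 - z_b ** 2) / B_w
    a_nc = abs(v ** 2 / rho_min * cos_a - g * sin_a)
    return a_nc * v / jerk_max       # jerk = a_nc / (s_b / v) <= jerk_max


def clothoid_constant(s_b_min, rho_b):
    """A = sqrt(s_b_min * rho_b), Equation (8)."""
    return math.sqrt(s_b_min * rho_b)


s_min = min_clothoid_length(v=120.0 / 3.6, rho_min=400.0, z_b=0.25,
                            B_w=2.7, jerk_max=0.15)
A = clothoid_constant(s_min, 400.0)
print(f"s_b,min = {s_min:.1f} m, A = {A:.1f} m")
```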
For this work, a simulation path was developed considering the following parameters:
• Camber over the circular path (Final value): 0.25 m
• Curvature radius over the circular path (Final value): 400 m
• Track’s width: 0.9 m
• Number of tracks: 3
• Maximum over acceleration: 0.15 m/s3
• Path’s total length: 1500 m
• Average Speed: 120 km/h
Figure 4 shows a plan view of the track and the center of the circular path:
Taking the average speed as a reference, the camber angle and the curvature
radius can be expressed as functions of time (Figure 5).
Figure 5. (a) Curvature radius vs. time and (b) camber angle vs. time for simulation.
3. Vehicle Modeling
As described above, to simplify the vehicle analysis, the bus has been divided
into two sections, front and rear, joined by a torsional spring that emulates the
stiffness of the bus structure.
The proposed model is based on [22,23]. It considers not only the stiffness of
the wheels but also their damping, which yields a more realistic
mechanical behavior of the bus. Additionally, a sway bar is included as a passive element whose
function is to link the vertical displacements of the unsprung masses. It does so
through the lever arms at its ends: a difference in displacement between them produces
a torsional moment along the bar that restricts the relative movement of the two unsprung masses.
This is desirable to avoid excessive vibrations, as well as a large displacement of the roll center.
The camber angle and curvature radius at the rear section differ from those at the
front section, since the two sections are at different positions along the path; for that
reason, a time delay is introduced to simulate that effect.
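One simple way to express this delay is to evaluate the front-section path functions at an earlier time for the rear section. In the sketch below the delay is assumed to be the axle spacing divided by the speed; the `wheelbase` of 6 m and the linear camber ramp are hypothetical, and the helper name `delayed` is introduced only for illustration.

```python
def delayed(signal, delay):
    """Wrap a function of time so it is evaluated `delay` seconds earlier.

    Before the delay has elapsed the value at t = 0 is held.
    """
    def shifted(t):
        return signal(max(t - delay, 0.0))
    return shifted


v = 120.0 / 3.6          # constant speed, m/s
wheelbase = 6.0          # hypothetical distance between sections, m
delay = wheelbase / v    # time for the rear section to reach the same point


def front_camber(t):
    """Hypothetical camber angle ramp seen by the front section, rad."""
    return 0.01 * t


rear_camber = delayed(front_camber, delay)
print(delay, front_camber(1.0), rear_camber(1.0))
```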
Figure 9. Acceleration decomposition.
The rotation of the suspended mass, its angular speed and its vertical displacement determine
the forces on the components of the suspension system, each of which is also linked to the
displacement of the corresponding unsprung mass. From those forces
and the location of the center of gravity, the moments around the roll center can be
calculated. Figure 10 shows the free body diagram of the front section for dynamic equilibrium.
Figure 10. Front cabin’s free body diagram for dynamic equilibrium.
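From the free body diagram, the following differential equations are established. For the
vertical displacement z1:
$$
\begin{aligned}
m_1 \ddot{z}_1 ={}& -\left[ k_{s1}(z_1 + b_1\phi_1 - z_2) + c_1(\dot{z}_1 + b_1\dot{\phi}_1 - \dot{z}_2) \right] \\
& -\left[ k_{s2}(z_1 - b_2\phi_1 - z_3) + c_2(\dot{z}_1 - b_2\dot{\phi}_1 - \dot{z}_3) \right] \\
& + \frac{m_1 v^2}{\rho_1(t)} \sin[\alpha_1(t) + \phi_1(t)]
\end{aligned} \tag{9}
$$
For rotation φ1:
$$
\begin{aligned}
\left[ I_1 + m_1 (h_{G1} - h_{B1})^2 \right] \ddot{\phi}_1 ={}& b_2 \left[ k_{s2}(z_1 - b_2\phi_1 - z_3) + c_2(\dot{z}_1 - b_2\dot{\phi}_1 - \dot{z}_3) \right] \\
& - b_1 \left[ k_{s1}(z_1 + b_1\phi_1 - z_2) + c_1(\dot{z}_1 + b_1\dot{\phi}_1 - \dot{z}_2) \right] \\
& - \left[ m_1 \frac{v^2}{\rho_1(t)} \cos(\alpha_1(t) + \phi_1(t)) \right] (h_{G1} - h_{B1}) \\
& + \left[ m_1 g \sin(\alpha_1(t) + \phi_1(t)) \right] (h_{G1} - h_{B1}) + T_{est}
\end{aligned} \tag{10}
$$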
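Equations (9) and (10) translate almost line by line into a function that returns the two accelerations of the front suspended mass. The sketch below is an illustration under stated assumptions: the state ordering, the dictionary keys and the function name are hypothetical, and the numerical values are the front-section parameters listed later in the text.

```python
import math


def front_sprung_accelerations(state, p, v, rho1, alpha1, T_est=0.0, g=9.81):
    """Return (z1'', phi1'') from Equations (9) and (10).

    state = (z1, dz1, phi1, dphi1, z2, dz2, z3, dz3)
    """
    z1, dz1, phi1, dphi1, z2, dz2, z3, dz3 = state
    # Spring plus damper force of the left and right suspension elements
    F_left = (p["ks1"] * (z1 + p["b1"] * phi1 - z2)
              + p["c1"] * (dz1 + p["b1"] * dphi1 - dz2))
    F_right = (p["ks2"] * (z1 - p["b2"] * phi1 - z3)
               + p["c2"] * (dz1 - p["b2"] * dphi1 - dz3))
    arm = p["hG1"] - p["hB1"]            # mass center height above roll center
    angle = alpha1 + phi1
    lateral = p["m1"] * v ** 2 / rho1    # inertial force m1 v^2 / rho1
    ddz1 = (-F_left - F_right + lateral * math.sin(angle)) / p["m1"]
    I_roll = p["I1"] + p["m1"] * arm ** 2
    ddphi1 = (p["b2"] * F_right - p["b1"] * F_left
              - lateral * math.cos(angle) * arm
              + p["m1"] * g * math.sin(angle) * arm
              + T_est) / I_roll
    return ddz1, ddphi1


p = {"m1": 7000.0, "I1": 125000.0, "ks1": 2.0e5, "ks2": 2.0e5,
     "c1": 600.0, "c2": 600.0, "b1": 0.8, "b2": 0.8,
     "hG1": 1.55, "hB1": 0.75}
rest = (0.0,) * 8
print(front_sprung_accelerations(rest, p, v=120.0 / 3.6, rho1=400.0, alpha1=0.0))
```

Passing `rho1=float("inf")` represents the straight path, where the lateral inertial term vanishes.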
This equation includes a resistant torque caused by the torsional stiffness
of the vehicle’s structure; this moment is denoted by T_est and will be determined later.
For the unsprung masses, the analysis is similar to that carried out for the suspended
mass, except for the intervention of the torsion bar, which links the movements
of both unsprung masses. Figure 11 shows the free body diagram and equivalent system
of both unsprung masses.
Figure 11. Unsprung frontal masses free body diagram.
Figure 11 shows a force F_TB produced by the sway bar, which acts in opposite
directions on the two unsprung masses. The bar exploits the lever arms
connected to each unsprung mass: the displacement difference between the
ends of the bar generates a torsion moment.
The displacement can be calculated as follows:
$$z_2 = \frac{F_1\, l_b^3}{3 E_b I_b} \tag{11}$$
where F_1 is the force applied by the left unsprung mass, l_b is the length of the bar’s arm,
E_b is the Young’s modulus of the bar material and I_b is the bar’s moment of inertia. The
force applied at each end of the bar can be determined by the following equation:
$$F_1(z_2) = \frac{3 z_2 E_b I_b}{l_b^3} \tag{12}$$
From Equation (12), a correlation between the unsprung mass displacements and the force in
the bar can be obtained. Figure 12 shows the free body diagram of the sway bar.
From the force calculated in (12), the torsion moment over the bar is calculated:
$$M_{t1} = F_1 l_b = \frac{3 z_2 E_b I_b}{l_b^2} \tag{13}$$
In a similar way, it is possible to calculate the moment generated by the right unsprung
mass as follows:
$$M_{t2} = F_2 l_b = \frac{3 z_3 E_b I_b}{l_b^2} \tag{14}$$
Because the moments exerted by the two arms are opposite, the resultant torsion moment
is calculated from the difference between them:
$$M_{TB} = M_{t1} - M_{t2} = \frac{3 E_b I_b}{l_b^2}\,(z_2 - z_3) \tag{15}$$
$$\Phi_B = \frac{M_{TB}\, l_T}{G J_0} \tag{16}$$
$$z_\Delta = \Phi_B\, l_b = \frac{3 E_b I_b\, l_T}{l_b\, G J_0}\,(z_2 - z_3) \tag{17}$$
From Equation (17), and because z_Δ depends linearly on
the unsprung mass displacements, an elastic constant is determined:
$$k_{tb} = \frac{9 E_b^2 I_b^2\, l_T}{l_b^4\, G J_0} \tag{18}$$
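The chain from Equation (12) to Equation (18) can be verified numerically. The sketch below computes the equivalent stiffness for a solid circular bar; all dimensions and material values are hypothetical, since the text gives only the resulting stiffness coefficient of the sway bar, and the section formulas I = πd⁴/64 and J_0 = πd⁴/32 are the standard ones for a solid round section.

```python
import math


def sway_bar_stiffness(E_b, I_b, l_b, l_T, G, J_0):
    """Equivalent stiffness k_tb = 9 E_b^2 I_b^2 l_T / (l_b^4 G J_0), Eq. (18)."""
    return 9.0 * E_b ** 2 * I_b ** 2 * l_T / (l_b ** 4 * G * J_0)


# Hypothetical steel bar of 30 mm diameter
d = 0.030                        # bar diameter, m
E_b, G = 2.1e11, 8.0e10          # Young's and shear modulus, Pa
I_b = math.pi * d ** 4 / 64.0    # second moment of area of the arm, m^4
J_0 = math.pi * d ** 4 / 32.0    # polar moment of the torsion section, m^4
l_b, l_T = 0.30, 1.00            # arm length and torsion length, m

k_tb = sway_bar_stiffness(E_b, I_b, l_b, l_T, G, J_0)

# Cross-check by following Equations (15), (16), (17) and (12) step by step
dz = 0.01                                    # z2 - z3, m
M_TB = 3.0 * E_b * I_b / l_b ** 2 * dz       # Equation (15)
Phi_B = M_TB * l_T / (G * J_0)               # Equation (16)
z_delta = Phi_B * l_b                        # Equation (17)
F = 3.0 * z_delta * E_b * I_b / l_b ** 3     # Equation (12) applied to z_delta
print(k_tb, F / dz)
```

Both printed numbers coincide, which confirms that Equation (18) is the product of the coefficients in Equations (12) and (17).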
Considering the influence of the sway bar, applying the acceleration decomposition,
and moving the inertial terms to the left-hand side of the equation (dynamic equilibrium), the
following equations of motion are obtained:
Unsprung left mass:
$$
\begin{aligned}
m_2 \ddot{z}_2 - \frac{m_2 v^2}{\rho_1(t)} \sin[\alpha_1(t) + \phi_1(t)] ={}& k_{s1}(z_1 + b_1\phi_1 - z_2) + c_1(\dot{z}_1 + b_1\dot{\phi}_1 - \dot{z}_2) \\
& - k_{TB}(z_2 - z_3) - k_{w1} z_2 - c_{w1} \dot{z}_2
\end{aligned} \tag{19}
$$
Figure 13. Rear cabin’s free body diagram for dynamic equilibrium.
In comparison with the front section, here the effect of the Panhard bar is noticeable;
it is represented by the forces F_Ph and F_Pv, the horizontal and vertical
components of the force along the bar. Additionally, the acceleration decomposition
must take into account the rotation angles of both masses, unsprung and suspended,
as well as the torsional torque produced by the structure. From this and the
free body diagram, the following equations of motion are obtained:
For the displacement z4:
$$
\begin{aligned}
m_4 \ddot{z}_4 - \frac{m_4 v^2}{\rho_2(t)} \sin[\alpha_2(t) + \phi_4(t) + \phi_5(t)] ={}& -k_{s3}(z_4 + b_3\phi_4 - z_5 - b_5\phi_5) \\
& - k_{s4}(z_4 - b_4\phi_4 - z_5 + b_6\phi_5) \\
& - c_{s3}(\dot{z}_4 + b_3\dot{\phi}_4 - \dot{z}_5 - b_5\dot{\phi}_5) \\
& - c_{s4}(\dot{z}_4 - b_4\dot{\phi}_4 - \dot{z}_5 + b_6\dot{\phi}_5) + F_{Pv}
\end{aligned} \tag{21}
$$
Figure 14. Rear unsprung mass free body diagram for dynamic equilibrium.
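From the free body diagram, the following motion equations are obtained:
For the displacement z5:
$$
\begin{aligned}
m_5 \ddot{z}_5 ={}& -k_{w3}(z_5 + b_5\phi_5) - c_{w3}(\dot{z}_5 + b_5\dot{\phi}_5) - k_{w4}(z_5 - b_6\phi_5) \\
& - c_{w4}(\dot{z}_5 - b_6\dot{\phi}_5) + k_{s3}(z_4 + b_3\phi_4 - z_5 - b_5\phi_5) \\
& + k_{s4}(z_4 - b_4\phi_4 - z_5 + b_6\phi_5) + c_{s3}(\dot{z}_4 + b_3\dot{\phi}_4 - \dot{z}_5 - b_5\dot{\phi}_5) \\
& + c_{s4}(\dot{z}_4 - b_4\dot{\phi}_4 - \dot{z}_5 + b_6\dot{\phi}_5) - F_{Pv} + \frac{m_5 v^2}{\rho_2(t)} \sin(\alpha_2(t) + \phi_5(t))
\end{aligned} \tag{23}
$$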
The forces produced by the Panhard bar require a kinematic analysis of both of its ends.
The bar can be modeled as a bar element whose internal force acts along
its own axis, so the direction of the force is known.
Considering only the vertical displacement at the ends of the bar, Figure 15 shows the
displacement of each point connected to the Panhard bar:
Figure 15. Rear masses displacements at the ends connected to the Panhard Bar.
From that displacement, the bar vertical displacement can be analyzed as it is shown
in Figure 16.
From this analysis, the vertical dimension of the bar can be calculated as follows:
$$h = h_0 + z_5 - b_6\phi_5 - z_4 - b_3\phi_4 \tag{25}$$
With the vertical dimension determined, the bar’s length can be calculated directly:
$$P_D = \sqrt{h^2 + B_t^2} \tag{26}$$
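Equations (25) and (26) can be evaluated together at every time step. The sketch below also returns the inclination of the bar, computed as atan2(h, B_t); that angle definition is an assumption made here, since the corresponding expression is not reproduced in this section. The function name and the sample values of `h0` and `B_t` are illustrative.

```python
import math


def panhard_bar_geometry(z4, phi4, z5, phi5, h0, b3, b6, B_t):
    """Return (h, P_D, angle) of the Panhard bar.

    h     : vertical dimension of the bar, Equation (25)
    P_D   : bar length, Equation (26)
    angle : inclination with respect to the horizontal (assumed definition)
    """
    h = h0 + z5 - b6 * phi5 - z4 - b3 * phi4
    length = math.hypot(h, B_t)
    angle = math.atan2(h, B_t)
    return h, length, angle


# Undeformed configuration with an illustrative initial height difference
h, P_D, angle = panhard_bar_geometry(z4=0.0, phi4=0.0, z5=0.0, phi5=0.0,
                                     h0=0.1, b3=0.8, b6=0.8, B_t=1.0)
print(h, P_D, angle)
```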
$$
\begin{aligned}
\dot{x} &= A x + B u \\
y &= C x + D u
\end{aligned} \tag{30}
$$
Because of its dimensions and simplicity, the state matrix, a 16 × 16 square
matrix, is not shown in this document; the input matrix is presented in its
non-linear formulation:
$$
\begin{pmatrix}
0 \\
\dfrac{v^2}{\rho(t)} \sin(\alpha(t) + \phi_1) \\
0 \\
\dfrac{v^2}{\rho(t)} \sin(\alpha(t) + \phi_1) \\
0 \\
\dfrac{v^2}{\rho(t)} \sin(\alpha(t) + \phi_1) \\
0 \\
\dfrac{-\left[ m_1 \dfrac{v^2}{\rho(t)} \cos(\alpha(t) + \phi_1) \right](h_{G1} - h_{B1}) + \left( m_1 g \sin(\alpha(t) + \phi_1) \right)(h_{G1} - h_{B1})}{I_{1B}} \\
0 \\
\dfrac{v^2}{\rho(t-1)} \sin(\alpha(t-1) + \phi_4 + \phi_5) + \dfrac{F_{Pv}}{m_4} \\
0 \\
\dfrac{v^2}{\rho(t-1)} \sin(\alpha(t-1) + \phi_5) - \dfrac{F_{Pv}}{m_5} \\
0 \\
\dfrac{\left( -m_4 \dfrac{v^2}{\rho(t-1)} \cos(\alpha(t-1) + \phi_4 + \phi_5) + m_4 g \sin(\alpha(t-1) + \phi_4 + \phi_5) \right)(h_{G2} - h_{B2}) - F_{Ph}(h_{S2}) + F_{Pv} b_3}{I_{4B}} \\
0 \\
\dfrac{\left( m_5 g \sin(\alpha(t-1) + \phi_5) \right) - \left( m_5 \dfrac{v^2}{\rho(t-1)} \cos(\alpha(t-1) + \phi_5) \right) + F_{Ph} h_{E2} + F_{Pv} b_6}{I_{5B}}
\end{pmatrix} \tag{32}
$$
As the entries of this matrix show, it depends strongly on the state vector, so
matrix B is a function of x. Although the formulation includes a control vector u, an output
matrix C and a feedforward matrix D, these are replaced by an identity matrix and a row
vector of ones, since the main purpose of the system is simulation. In future
work, they can be replaced to obtain a control system. All equations for the path
and for other elements, such as the Panhard bar, are implemented in a Simulink® model.
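Outside Simulink, a system of the form dx/dt = A x + b(t, x), with a state-dependent input term, can be integrated with any standard ODE scheme. The sketch below uses a fixed-step fourth-order Runge-Kutta method as a stand-in for whichever solver the Simulink model uses, and applies it to a toy two-state system; the matrix entries, the forcing term and the function names are illustrative and are not the 16-state bus model.

```python
import math


def make_rhs(A, b):
    """Build f(t, x) = A x + b(t, x) for a list-of-lists matrix A."""
    def f(t, x):
        Ax = [sum(a_ij * x_j for a_ij, x_j in zip(row, x)) for row in A]
        return [p + q for p, q in zip(Ax, b(t, x))]
    return f


def rk4_step(f, t, x, dt):
    """Advance the state x by one classical Runge-Kutta step of size dt."""
    k1 = f(t, x)
    k2 = f(t + dt / 2, [xi + dt / 2 * k for xi, k in zip(x, k1)])
    k3 = f(t + dt / 2, [xi + dt / 2 * k for xi, k in zip(x, k2)])
    k4 = f(t + dt, [xi + dt * k for xi, k in zip(x, k3)])
    return [xi + dt / 6 * (a + 2 * b_ + 2 * c + d)
            for xi, a, b_, c, d in zip(x, k1, k2, k3, k4)]


def simulate(f, x0, t_end, dt):
    """Integrate from t = 0 to t_end and return the final state."""
    x, t = list(x0), 0.0
    for _ in range(round(t_end / dt)):
        x = rk4_step(f, t, x, dt)
        t += dt
    return x


# Toy roll oscillator: x = [angle, angular speed], forcing depends on the angle
A_toy = [[0.0, 1.0], [-4.0, -0.8]]
b_toy = lambda t, x: [0.0, 2.0 * math.sin(0.05 + x[0])]
print(simulate(make_rhs(A_toy, b_toy), [0.0, 0.0], t_end=20.0, dt=0.01))
```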
Once the differential equations are expressed in state-space form, all the mechanical
parameters must be assigned numerical values. These values, taken from [24], are
given in the following list:
m_1 = 7000 kg: front suspended mass
m_2 = 500 kg: front left unsprung mass
m_3 = 500 kg: front right unsprung mass
I_1 = 125,000 kg·m²: front suspended mass inertia with respect to its mass center
c_1 = 600 N·s/m: damping coefficient of the left front suspension element
c_2 = 600 N·s/m: damping coefficient of the right front suspension element
b_1 = 0.8 m: distance from the suspended mass center to the left front suspension
b_2 = 0.8 m: distance from the suspended mass center to the right front suspension
k_s1 = 200,000 N/m: stiffness coefficient of the left front suspension element
k_s2 = 200,000 N/m: stiffness coefficient of the right front suspension element
h_B1 = 0.75 m: roll center height of the front suspended mass with respect to the path’s plane
h_G1 = 1.55 m: mass center height of the front suspended mass
k_TB = 1,000,000 N/m: stiffness coefficient of the sway bar
k_w1 = 921,607 N/m: stiffness coefficient of the front left tire
k_w2 = 921,607 N/m: stiffness coefficient of the front right tire
c_w1 = 800 N·s/m: damping coefficient of the front left tire
c_w2 = 800 N·s/m: damping coefficient of the front right tire
k_E = 600,000,000 N·m/rad: torsional stiffness coefficient of the vehicle structure
m_4 = 7000 kg: rear suspended mass
m_5 = 1000 kg: rear unsprung mass
I_4 = 125,000 kg·m²: rear suspended mass inertia with respect to its mass center
I_5 = 1000 kg·m²: rear unsprung mass inertia with respect to its mass center
c_3 = 600 N·s/m: damping coefficient of the left rear suspension element
c_4 = 600 N·s/m: damping coefficient of the right rear suspension element
b_3 = 0.8 m: distance from the suspended mass center to the left rear suspension
b_4 = 0.8 m: distance from the suspended mass center to the right rear suspension
b_5 = 0.8 m: distance from the unsprung mass center to the left rear tire
b_6 = 0.8 m: distance from the unsprung mass center to the right rear tire
k_s3 = 200,000 N/m: stiffness coefficient of the left rear suspension element
k_s4 = 200,000 N/m: stiffness coefficient of the right rear suspension element
h_B2 = 0.75 m: roll center height of the rear suspended mass with respect to the path’s plane
h_G2 = 1.55 m: mass center height of the rear suspended mass
h_E2 = 0.5 m: mass center height of the rear unsprung mass
k_w3 = 921,607 N/m: stiffness coefficient of the rear left tire
k_w4 = 921,607 N/m: stiffness coefficient of the rear right tire
c_w3 = 800 N·s/m: damping coefficient of the rear left tire
c_w4 = 800 N·s/m: damping coefficient of the rear right tire
Also, for the Panhard bar, the following parameters are taken:
E = 2.1 × 10⁵ N/mm²: Young’s modulus of the Panhard bar material
d_P = 30 mm: Panhard bar diameter
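Two derived quantities follow directly from these values. The sketch below assumes that the inertia I_1B appearing in the input matrix is the bracketed term of Equation (10), that is, the inertia of the front suspended mass transferred to the roll center by the parallel-axis theorem; the text does not state this identification explicitly. The cross-sectional area of the Panhard bar assumes a solid circular section.

```python
import math

# Values taken from the parameter list above (SI units)
m_1, I_1 = 7000.0, 125000.0     # kg, kg m^2
h_G1, h_B1 = 1.55, 0.75         # m
d_P = 0.030                     # Panhard bar diameter, m
E = 2.1e11                      # Young's modulus, Pa (2.1e5 N/mm^2)

# Roll inertia about the roll center, bracketed term of Equation (10)
I_1B = I_1 + m_1 * (h_G1 - h_B1) ** 2

# Cross-section area of a solid round bar and its axial rigidity E * area
area_P = math.pi * d_P ** 2 / 4.0
axial_rigidity = E * area_P

print(f"I_1B = {I_1B:.0f} kg m^2, area = {area_P * 1e6:.1f} mm^2")
```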
From the constants given above, the following results are obtained from the simulation
of the vehicle’s travel over a transition and circular path, as previously given (Figure 17).
Furthermore, the instantaneous force and angle of the Panhard bar are also obtained from
the simulation (Figure 18).
These results show the dynamic behavior of the bus while traveling over a transition and
circular curve. While the bus is on the clothoid, its position and
rotation state variables increase steadily; when it reaches the circular path, they
begin to oscillate. Consequently, the Panhard bar length and angle show the same behavior.
Figure 19. Simulation results without Panhard bar—displacement and rotation angles.
The positive values of the structural moment in the simulation with the
Panhard bar indicate better stability, because this moment opposes the rollover of the
vehicle; on the other hand, the absence of a Panhard bar
produces a negative structural moment, which contributes to an increased risk of rollover.
Figure 21. Free Body Diagram for front section dynamic equilibrium.
In the diagram shown, the inertial forces, obtained from the acceleration
decomposition used for the equations of motion, and the weight of each body are already decomposed.
The inertia forces in the vertical direction are defined using the state derivatives;
the lateral inertia forces and the weight decomposition are calculated as follows:
$$F_{l1i} = \frac{m_1 v^2}{\rho_1(t)} \cos[\alpha_1(t) + \phi_1(t)] \tag{33}$$
$$F_{l2i} = \frac{m_2 v^2}{\rho_1(t)} \cos[\alpha_1(t)] \tag{34}$$
$$F_{l3i} = \frac{m_3 v^2}{\rho_1(t)} \cos[\alpha_1(t)] \tag{35}$$
Analyzing the equilibrium on the lateral direction:
Furthermore, the inertial forces and torques must be evaluated from the state equations
to calculate the normal force at each tire. Taking moments about
the left front wheel and solving for N_2:
$$
N_2 = \frac{\begin{aligned}
& [W_{l2} - F_{l2i}](h_{E1} - z_2) + [W_{l3} - F_{l3i}](h_{E1} - z_3) - I_{1B}\ddot{\phi}_1 \\
& + [W_{l1} - F_{l1i}](h_{G1} - z_1) + [m_1\ddot{z}_1 - W_{v1}](B_{via}/2) + [m_3\ddot{z}_3 - W_{v3}](a_{via})
\end{aligned}}{B_t} \tag{37}
$$
The magnitudes of the normal forces indicate whether the vehicle rolls over: rollover occurs
when any normal force becomes zero. In the lateral direction, on the other hand, there are two
unknowns, the friction forces, and only one equation, so the individual
friction forces cannot be determined. Therefore, slipping is evaluated with a
condition on the sum of both forces:
$$F_{f1} + F_{f2} \le F_{f1,max} + F_{f2,max} \tag{39}$$
Equivalently:
$$0 \le N_1 \mu_s + N_2 \mu_s - F_{f1} - F_{f2} \tag{40}$$
The maximum values of the friction force are determined using Coulomb’s
dry friction law. For that evaluation, it is assumed that neither tire is skidding, so
the friction coefficient is the static friction coefficient. When the right-hand side of
Equation (40) becomes negative, the vehicle is slipping. This analysis is also
applied to the rear section, with a similar free body diagram (Figure 22).
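The two stability checks described above reduce to simple tests on the computed forces. The sketch below evaluates the right-hand side of Equation (40) as a friction reserve and flags rollover when any normal force is zero or negative; the function names and the sample force values are hypothetical.

```python
def friction_reserve(normals, frictions, mu_s):
    """Right-hand side of Equation (40): mu_s * sum(N) - sum(F_f), in N."""
    return mu_s * sum(normals) - sum(frictions)


def is_slipping(normals, frictions, mu_s):
    """The section slips when the friction reserve becomes negative."""
    return friction_reserve(normals, frictions, mu_s) < 0.0


def is_rolling_over(normals):
    """Rollover occurs, or is imminent, when any normal force reaches zero."""
    return any(n <= 0.0 for n in normals)


# Illustrative forces on the two tires of one section, in N
N = [30000.0, 40000.0]
F_f = [10000.0, 12000.0]
print(friction_reserve(N, F_f, mu_s=0.4), is_slipping(N, F_f, 0.4), is_rolling_over(N))
```

Because only the sum of the friction forces is known, the check is applied per section rather than per tire, as in Equation (39).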
Figure 22. Free Body Diagram for rear section dynamic equilibrium.
$$F_{l4i} = \frac{m_4 v^2}{\rho_2(t)} \cos[\alpha_2(t) + \phi_4(t) + \phi_5(t)] \tag{41}$$
$$F_{l5i} = \frac{m_5 v^2}{\rho_2(t)} \cos[\alpha_2(t) + \phi_5(t)] \tag{42}$$
Analyzing the equilibrium on the lateral direction:
As for the front section, taking moments about the left
wheel and solving for N_4:
$$
N_4 = -\frac{\begin{aligned}
& [W_{l5} - F_{l5i}](h_{E2} - z_5) - I_{B4}\ddot{\phi}_4 - I_{B5}\ddot{\phi}_5 + [W_{l4} - F_{l4i}](h_{G2} - z_4) \\
& + [m_4\ddot{z}_4 - W_{v4}](B_{via}/2) + [m_5\ddot{z}_5 - W_{v5}](B_t/2)
\end{aligned}}{B_t} \tag{44}
$$
In other terms:
$$0 \le N_3 \mu_s + N_4 \mu_s - F_{f3} - F_{f4} \tag{47}$$
One more parameter is necessary for the simulation:
μ_s = 0.4: static friction coefficient between the tire and the path.
The results for the normal forces at each tire, while the bus is traveling over the
simulation path, are shown on the following graphics (Figure 23).
Regarding the friction force, the skidding condition gives the remaining friction force, so
that the vehicle slips when it becomes negative. Figure 24 shows the remaining friction
force at each section while the bus travels over the simulation path:
A comparison of the normal force can be made using a model without a Panhard bar
to analyze the effect of this element in the lateral stability. The normal force values of the
bus without Panhard bar are presented in Figure 25.
Figure 25 shows that the normal forces on the left side reach zero and
even become negative, which implies that the vehicle loses adhesion with the path. This
indicates that the Panhard bar improves the adhesion between the path and the tires.
6. Conclusions
A model of a bus is presented, elaborated by means of rigid multibody systems with
which a virtual or mathematical model (system of ordinary differential equations) has been
generated. This model allows the simulation of the lateral dynamics of a bus when it travels
on a cambered curve.
Based on the proposed model, a simulation algorithm has been designed to analyze
the dynamic behavior of a bus traveling on a cambered curve. The model fits
realistic transport routes, since the path has been calculated and parameterized with the
transition curves used in road design (trajectories that connect a straight and a circular
path, starting with an infinite radius of curvature and ending with a constant radius
of curvature).
The total system representing the bus has been divided into two fundamental parts.
The front part comprises several subsystems, such as the two unsprung masses,
connected by a stability or sway bar, and the flexible wheels. The rear part has
been modeled in an analogous way although, unlike the front part, it has
only one unsprung mass, and a Panhard bar is used to improve its vertical and lateral
dynamic behavior.
Therefore, the model of the bus dynamic behavior integrates two modifiable sections
that allow new models for the calculation of the stiffness of the suspension system, of the
tires, etc., to be tested. For this first approximation, the suspension system and tire
stiffness, as well as the damping of both, have been modeled as linear components. This is
responsible for the quasi-linear correlation between the geometric parameters and the
tendency shown in the results: the state variables increase as the geometric parameters of
the trajectory increase, and they oscillate toward a steady state while the trajectory
parameters are constant.
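The behavior described above (oscillation toward a steady state that scales with the input) can be illustrated with a single-degree-of-freedom linear spring-damper system. This is only a sketch under stated assumptions: the one-mass model, the integrator, and all parameter values are hypothetical and much simpler than the multibody bus model.

```python
def simulate_linear_sdof(mass, damping, stiffness, force, dt=1e-3, t_end=10.0):
    """Integrate m*x'' + c*x' + k*x = F from rest with semi-implicit Euler.

    Returns the list of displacements, one entry per time step.
    """
    x, v = 0.0, 0.0
    history = [x]
    steps = int(round(t_end / dt))
    for _ in range(steps):
        a = (force - damping * v - stiffness * x) / mass
        v += a * dt  # update the velocity first (semi-implicit Euler)
        x += v * dt  # then the position, using the new velocity
        history.append(x)
    return history


if __name__ == "__main__":
    # Hypothetical parameters: 1000 kg, 4000 N s/m, 40000 N/m.
    for force in (2000.0, 4000.0):
        x = simulate_linear_sdof(1000.0, 4000.0, 40000.0, force)
        print(f"F = {force:.0f} N  peak = {max(x):.4f} m  final = {x[-1]:.4f} m")
```

With a constant input, the response overshoots and then settles at F/k; doubling the input doubles the whole response, which is the linear tendency referred to in the text.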
According to the comparison between the results with and without a Panhard bar, it
can be stated that the Panhard bar has a stabilization function, which reduces the oscillations
of each state variable. Additionally, the parameters that modify the Panhard bar stiffness
can be treated as control variables in an active suspension system.
The effect of the Panhard bar on the rotation angles also shows that this element
reduces the rotation angles of the mass linked to it. This improves the lateral stability of
the vehicle by increasing the normal forces between the tires and the road, as can be
concluded by comparing Figures 23 and 25.
The torsional structural stiffness is a parameter that must be considered when the
stability of the vehicle is analyzed, as presented in Figure 20, where the direction of the
structural moments affects the vehicle lateral stability.
Due to the weak effect of the non-linearities, some state variables exhibit disturbances
that can be treated as noise.
Author Contributions: Conceptualization, E.O., J.R.H. and V.D.; methodology, E.O., E.R.C.L., J.R.H.
and V.D.; software, E.R.C.L.; formal analysis, E.O., E.R.C.L., J.R.H. and V.D.; investigation, E.O.,
E.R.C.L., J.R.H. and V.D.; resources, E.O., E.R.C.L., J.R.H. and V.D.; data curation, E.O., E.R.C.L., J.R.H.
and V.D.; writing—original draft preparation, E.O. and E.R.C.L.; writing—review and editing, E.O.,
E.R.C.L., J.R.H. and V.D.; visualization, E.O., E.R.C.L., J.R.H. and V.D.; supervision, E.O. and V.D.;
funding acquisition, V.D. All authors have read and agreed to the published version of the manuscript.
Funding: This research received no external funding.
Institutional Review Board Statement: Not applicable.
Informed Consent Statement: Not applicable.
Data Availability Statement: Not applicable.
Conflicts of Interest: The authors declare no conflict of interest.
References
1. Yang, S.; Chen, L.; Li, S. Dynamics of Vehicle-Road Coupled System; Springer: Berlin/Heidelberg, Germany, 2015.
2. Popp, K.; Schiehlen, W. Ground Vehicle Dynamics; Springer: Berlin/Heidelberg, Germany, 2010.
3. Rill, G.; Schaeffer, T. Grundlagen und Methodik der Mehrkörpersimulation—Vertieft in Matlab-Beispielen, Übungen und Anwendungen;
Springer: Berlin/Heidelberg, Germany, 2014.
4. Schramm, D.; Hiller, M.; Bardini, R. Vehicle Dynamics—Modelling and Simulation; Springer: Berlin/Heidelberg, Germany, 2014.
5. Jeong, D.; Ko, G.; Choi, S.B. Estimation of sideslip angle and cornering stiffness of an articulated vehicle using a constrained
lateral dynamics model. Mechatronics 2022, 85, 102810.
6. Seyedi, M.; Jung, S.; Wekezer, J. A comprehensive assessment of bus rollover crashes: Integration of multibody dynamic and
finite element simulation methods. Int. J. Crashworthiness 2020, 27, 273–288.
7. Niculescu-Faida, O.-C.; Niculescu-Faida, A. Vehicle dynamics modeling during moving along a curved path. Mathematical
model usage on studying the robust stability. UPB Sci. Bull. 2008, 4, 49–60.
8. Gauchia, A.; Olmeda, E.; Aparicio, F.; Díaz, V. Bus mathematical model of acceleration threshold limit estimation in lateral
rollover test. Veh. Syst. Dyn. 2011, 49, 1695–1707.
9. Gauchia, A.; Díaz, V.; Boada, M.J.L.; Olatunbosun, O.; Boada, B.L. Bus Structure Behaviour under Driving Manoeuvring and
Evaluation of the Effect of an Active Roll System. Int. J. Veh. Struct. Syst. 2010, 2, 14–19.
10. Eftekharzadeh, S.F.; Khodabakhshi, A. Safety evaluation of highway geometric design criteria in horizontal curves at downgrades.
Int. J. Civ. Eng. 2014, 12, 326–332.
11. Yin, Y.; Wen, H.; Sun, L.; Hou, W. The Influence of Road Geometry on Vehicle Rollover and Skidding. Int. J. Environ. Res. Public
Health 2020, 17, 1648.
12. Wang, B.; Hallmark, S.; Savolainen, P.; Dong, J. Crashes and near-crashes on horizontal curves along rural two-lane highways:
Analysis of naturalistic driving data. J. Saf. Res. 2017, 63, 163–169.
13. Geedipally, S.R.; Pratt, M.P.; Lord, D. Effects of geometry and pavement friction on horizontal curve crash frequency. J. Transp.
Saf. Secur. 2019, 11, 167–188.
14. Bogenreif, C.; Souleyrette, R.R.; Hans, Z. Identifying and measuring horizontal curves and related effects on highway safety. J.
Transp. Saf. Secur. 2012, 4, 179–192.
15. Abdollahzadeh Nasiri, A.S.; Rahmani, O.; Abdi Kordani, A.; Karballaeezadeh, N.; Mosavi, A. Evaluation of Safety in Horizontal
Curves of Roads Using a Multi-Body Dynamic Simulation Process. Int. J. Environ. Res. Public Health 2020, 17, 5975.
16. Xu, J.; Xin, T.; Gao, C.; Sun, Z. Study on the Maximum Safe Instantaneous Input of the Steering Wheel against Rollover for Trucks
on Horizontal Curves. Int. J. Environ. Res. Public Health 2022, 19, 2025.
17. Yin, Q.; Stiharu, I.; Rakheja, S. Kinetostatic analysis of a beam-axle suspension with Panhard rod restraining linkage. Int. J. Veh.
Des. 1998, 19, 108–123.
18. Mayr, S.; Wagner, W. Development of a front axle suspension for special purpose tractors. In Proceedings of the Conference:
Agricultural Engineering 2004, Dresden, Germany, 7–8 October 2004.
19. Sonsino, C.M.; Streicher, M. Optimization of cast iron safety components of commercial vehicles by material selection and
geometry under consideration of service loadings. Mater. Test.-Mater. Compon. Technol. Appl. 2009, 51, 428–436.
20. Torbic, D.J.; O’Laughlin, M.K.; Harwood, D.W.; Bauer, K.M.; Bokenkroger, C.D.; Lucas, L.M.; Ronchetto, J.R.; Brennan, S.; Donnell,
E.; Brown, A.; et al. NCHRP 774 Report: Superelevation Criteria for Sharp Horizontal Curves on Steep Grades; The National Academies
Press: Washington, DC, USA, 2014.
21. Esveld, C. Modern Railway Track; Delft University, MRT—Productions: Zaltbommel, The Netherlands, 2001.
22. Rajamani, R. Vehicle Dynamics and Control; Mechanical Engineering Series; Springer: Berlin/Heidelberg, Germany, 2012.
23. Aparicio, F.; Vera, C.; Díaz, V. Teoría de los Vehículos Automóviles; Sección de Publicaciones, ETSII, Universidad Politécnica de
Madrid: Madrid, Spain, 1995.
24. Mustafa Siddiqui, O. Dynamic Analysis of a Modern Urban Bus for Assessment of Ride Quality and Dynamic Wheel Loads.
Master’s Thesis, Concordia University, Montreal, QC, Canada, 2000.