Reliability Comparative Evaluation of Active Redun
Reliability Comparative Evaluation of Active Redun
James Li
Abstract
Redundancy is a commonly applied reliability improvement technique to enhance the system reliability and availability
of safety critical systems, or operational impact systems in the railroad and mass transit industry. In this paper, two very
basic but different types of parallel redundancy, namely active redundancy and standby redundancy are introduced and
studied according to the mechanism structure built in a system. The pros and cons of the active redundancy and standby
redundancy are also discussed. The Markov model technique is utilized to illustrate the Mean Time Between Failure
(MTBF) calculation for the active and standby redundancy for the purpose of reliability evaluation. The comparison is
also undertaken for the active redundancy versus standby redundancy from a reliability point of view.
1. Introduction
In the railroad and mass transit industry, the safety critical functions or operational impact systems
require redundancy implementation to enhance the system safety and strengthen systems reliability
and availability. Redundancy is defined as the existence of more than one means for accomplishing
a given task or function in a system.
One thing that should be revealed is that the application of redundancy is not without penalties.
Despite reducing system mission failures, redundancy increases system logistics failures. It will
also increase weight, space requirements, complexity, cost, and time to design. The increase in
complexity results in an increase in unscheduled maintenance. Thus, system safety and mission
reliability is gained at the expense of adding an item(s) in the unscheduled maintenance chain. The
increase in unscheduled maintenance may be counteracted by reliability improvement techniques
such as design simplification, derating, and the use of more reliable components.
The two basic types of commonly applied redundancy are active redundancy and standby
redundancy. Active redundancy does not require the external components or devices to perform the
function of detection, decision and switching when an element or path in the redundant structure
fails. The redundant elements are always in operation to share the load of the system, and
automatically pick up the load for a failed element. Active redundancy is also called Full-on
122
International Journal of Mathematical, Engineering and Management Sciences
Vol. 1, No. 3, 122–129, 2016
ISSN: 2455-7749
redundancy or Load-sharing redundancy in other papers. Fig.1 shows an active redundant system
configuration.
Standby redundancy is defined as the redundancy that requires the external elements or devices to
detect, make a decision and switch to another element or path as a replacement for a failed element
or path. Standby units can be operating (hot standby) or inactive (cold standby). Hot standby and
active redundancy can be considered identical if the switching device is perfect. Fig. 2 shows a
standby redundant system configuration.
In reference Military Standard (2005), the concept of active redundancy and standby redundancy
were introduced. In reference Mok et al. (2013), types of redundancies including active redundancy
and standby redundancy were presented. In reference Mohammad et al. (2013), a load-sharing
systems using k-out-of-n structure were presented. Active redundancy is a 1-out-of-2 load-sharing
system.
In the reliability engineering practice, when we make a decision to use redundant design techniques
to improve system reliability and availability. We usually confront a fundamental question: what
type of redundancy is more appropriate to achieve required system reliability and availability?
Active redundancy or standby redundancy. In this paper, we will perform a reliability analysis to
compare the active redundancy against the standby redundancy by utilizing Markov model
technique. The conclusion will be summarized at the end of this paper.
2. Markov Model
The term “Markov model” is named after the Russian mathematician Andrei Markov, originally
referred to mathematical models in which the future state of a system depends only on its current
state, and not on its past history. That is the memory less characteristic, which is the main Markov
property. The other characteristic of Markov model is stationary. A stationary system is one in
which the probabilities which govern the transitions from state to state remain constant with time
(i.e. constant failure rate or repair rate). For any given system, a Markov model consists of a list of
123
International Journal of Mathematical, Engineering and Management Sciences
Vol. 1, No. 3, 122–129, 2016
ISSN: 2455-7749
the possible states of that system, the possible transition paths between those states, and the rate
parameters of those transitions.
Markov model is a very useful and powerful reliability analysis tool to evaluate the redundant
systems which have the constant failure rate and repair rate. In reference Klion (1977), Markov
approaches for full on operation and stand-by operation were introduced. In reference Military
Standard (2005), Markov theory was introduced. In reference Jackson (2013), Markov analysis
with non-constant hazard rates was presented. In reference Dakic (2015), Markov model was
presented as one of the deductive methods of reliability quantification methods and techniques. In
this paper, we will utilize Markov model to measure the reliability parameter Mean Time Between
Failure (MTBF) for the active redundant system and standby redundant system respectively. The
comparison will be undertaken between the active redundant system and the standby redundant
system based on the reliability parameter evaluation.
In the above state transition diagram, state one is the initial state where unit A and unit B are both
operating properly. State two is the state where one unit has failed, the remaining unit is still
working to keep the system operational (success). System only fails if both unit A and unit B fail
to meet the system operational requirement. State three is reached when unit A and B have both
failed. An assumption used in developing the state transition diagram is that unit A and unit B
cannot change states simultaneously. In Fig. 3 λ is the unit failure rate and μ is the unit repair rate.
P1 (t t ) P1 (t ) (1 2 t ) t P2 (t ) (1)
124
International Journal of Mathematical, Engineering and Management Sciences
Vol. 1, No. 3, 122–129, 2016
ISSN: 2455-7749
Rearranging by moving P1(t) from the right-hand side to left-hand side, and dividing ∆t on the both
sides of equation (1) to obtain equation (2)
P1 (t t ) P1 (t ) dP1 (t )
2 P1 (t ) P2 (t ) (2)
t dt
0
dP1 (t ) 2 P1 (t )dt P2 (t )
0 0
(3)
P1 () P1 (0) 2 T1 T2 (4)
Note that the boundary condition is equal to one at the state of P1 (0) or P3 (∞), and zero at all other
states.
T1 is defined as the expected time in state one; T2 is defined as the expected time in state two.
1 2 T1 T2 (5)
P2 (t t ) P1 (t ) 2 t P2 (t ) 1 ( ) t (6)
Rearranging by moving P2 (t) from the right-hand side to left-hand side, and dividing Δt on the both
side of equation (6) to obtain equation (7)
P2 (t t ) P2 (t ) dP2 (t )
2 P1 (t ) ( ) P2 (t ) (7)
t dt
0
dP2 (t ) 2 P1 (t )dt ( ) P2 (t )dt
0 0
(8)
P2 () P2 (0) 2 T1 ( ) T2 (9)
T1 T2 (10)
2
125
International Journal of Mathematical, Engineering and Management Sciences
Vol. 1, No. 3, 122–129, 2016
ISSN: 2455-7749
1
T2 , T1 (11)
22
Here, the success of the system is defined by state one and state two. State three is the failed
condition. Consequently, we can write the MTBF. The MTBF would be defined as the sum of the
expected time in state one and state two. Mathematically, this can be written as
3
MTBF T1 T2 (12)
22
If the system is not maintained or non-repairable, then removing μ from the equation (12) and is
simplified as
3
MTBF (13)
2
Considering the complexity introduced by the switching device, in this paper we assume that the
switching device is always operating until the system fails. In other words, the failure of the
switching device is not taken into account in the reliability analysis performed below.
In order to utilize the Markov model to analyze a standby redundant system, again, a state transition
diagram is illustrate in Fig. 4.
126
International Journal of Mathematical, Engineering and Management Sciences
Vol. 1, No. 3, 122–129, 2016
ISSN: 2455-7749
In the above state transition diagram, state one is the initial state where unit A is operating as a
primary unit and unit B is not operating as standby. State two is the state where unit A has failed,
the switching device detects the failure of primary unit A, and turn on the standby unit B to keep
the system operational (success). State three is the state when the primary unit A and the standby
unit B have both failed. In Fig. 4 λ is the unit failure rate and μ is the unit repair rate.
P1 (t t ) P1 (t ) (1 t ) t P2 (t ) (14)
Rearranging by moving P1(t) from the right-hand side to left-hand side, and dividing ∆t on the both
sides of equation (14) to obtain
P1 (t t ) P1 (t ) dP1 (t )
P1 (t ) P2 (t ) (15)
t dt
0
dP1 (t ) P1 (t )dt P2 (t )
0 0
(16)
P1 () P1 (0) T1 T2 (17)
Note that the boundary condition is equal to one at the state of P1(0) or P3(∞), and zero at all other
states.
127
International Journal of Mathematical, Engineering and Management Sciences
Vol. 1, No. 3, 122–129, 2016
ISSN: 2455-7749
T1 is defined as the expected time in state one; T2 is defined as the expected time in state two.
1 T1 T2 (18)
P2 (t t ) P1 (t ) t P2 (t ) 1 ( ) t (19)
Rearranging by moving P2(t) from the right-hand side to left-hand side, and dividing Δt on both
sides of equation (19) to obtain equation (20)
P2 (t t ) P2 (t ) dP2 (t )
P1 (t ) ( ) P2 (t ) (20)
t dt
0
dP2 (t ) P1 (t )dt ( ) P2 (t )dt
0 0
(21)
P2 () P2 (0) T1 ( ) T2 (22)
T1 T2 (23)
1
T2 , T1 (24)
2
Here, the success of the system is defined by state one and state two. State three is the failed
condition. Consequently, we can write the MTBF. The MTBF would be defined as the sum of the
expected time in state one and state two. Mathematically, this can be written as
2
MTBF T1 T2 (25)
2
If the system is not maintained or non-repairable, then the equation (25) is simplified as
2
MTBF (26)
128
International Journal of Mathematical, Engineering and Management Sciences
Vol. 1, No. 3, 122–129, 2016
ISSN: 2455-7749
References
Dakic, V. (2015). FTA vs. RBD – Differences and similarities for system modeling. In Proceeding Annual
Reliability and Maintainability Symposium (RAMS), 2015, Florida, USA
Dersin, P. & Valenzuela, R. (2015). Engineering availability in systems-of-systems. In Proceeding Annual
Reliability and Maintainability Symposium (RAMS), 2015, Florida, USA
Jackson, A. (2013, January). Reliability modeling that combines Markov analysis and Weibull distributions.
In Reliability and Maintainability Symposium (RAMS), 2013 Proceedings-Annual (pp. 1-6). IEEE.
Klion, J. (1977). System periodically maintained. In A Redundancy Notebook (pp. 29-38). Rome Air
Development Center Publishing.
Military Standard (2005). MIL-HDBK-338B Military Handbook Electronic Reliability Handbook, Notice 2
(pp. 344-350). Air Force Research Laboratory Information Publishing.
Mohammad, R., Kalam, A., & Amari, S. V. (2013, January). Reliability of load-sharing systems subject to
proportional hazards model. In Reliability and Maintainability Symposium (RAMS), 2013 Proceedings-
Annual (pp. 1-5). IEEE.
Mok, Y. L., Goh, C. H., & Segaran, R. C. (2013, January). Redundancy modeling for the X-Sat microsatellite
system. In Reliability and Maintainability Symposium (RAMS), 2013 Proceedings-Annual (pp. 1-6).
IEEE.
Nowizki, N., Zeiler, P., & Bertsche, B. (2016, January). Traversal time modeling of safety critical in-vehicle
communication using reliability modeling techniques. In 2016 Annual Reliability and Maintainability
Symposium (RAMS) (pp. 1-7). IEEE.
Reliability Information Analysis Center (2005). Hardware reliability modeling. In System Reliability Toolkit
(p. 394). Reliability Information Analysis Center (RIAC) and Data Analysis Center for Software (DACS)
Publishing.
Velasco, D. S., Kuhn, T., & Kemmann, S. (2013, January). Reliability analysis in model-driven development
of embedded systems. In Reliability and Maintainability Symposium (RAMS), 2013 Proceedings-
Annual (pp. 1-7). IEEE
129