Reliability of Parallel Redundant UPS
Reliability of Parallel Redundant UPS
A parallel redundant UPS configuration with decentralised static bypass switches provides the highest availability for mission critical applications
The primary objective in the implementation of a UPS system is to improve power reliability to the limits of technical capability, the ultimate aim being to totally eliminate the possibility of any power disturbance or downtime. When they first appeared in the 60s, static UPS systems comprised a rectifier, battery and inverter, and were used to stabilise the output power and to continue to support the load, without break, in the event of a supply failure. The reliability of this simple UPS chain depended predominantly on the inverter reliability. An inverter failure meant an immediate load crash. In the early 70s the static bypass switch was introduced to enable an interruption-free load transfer to the standby mains in the event of an inverter failure or overload. The standby mains, although far less reliable than the UPS, serves as a reserve power supply in the event of an inverter failure, enabling continuation of the power supply to the load while the inverter is being repaired. This new architecture substantially improved the overall reliability, which no longer depended predominantly on the inverter reliability. The reliability of the new UPS with static bypass depended on the quality of the mains (MTBFMAINS), the time-to-repair of the UPS (MTTRUPS), and on the reliability of the static switch. However, dependence upon computer-controlled real-time information systems has grown exponentially in recent years and the highest reliability UPS configurations have become an everyday requirement. Very critical loads cannot rely on a power supply configuration of a single UPS with static bypass system; the need for (n+1) redundant parallel UPS configurations is becoming standard. A comparison of the reliability for various UPS configurations is based on the reliability figures presented in MIL-HDBK-217 F (Not.2, 1995). The following calculations were implemented on the PowerWAVE Series UPS and have been confirmed by field statistics.
Electrical block diagram Mains RECT Reliability block diagram (series diagram)
Rectifier
Battery
BATT
Inverter
INV
Load
Figure 1 Single UPS without static bypass - electrical block diagram and reliability diagram Single UPS without static bypass switch (SBS) The reliability of a single UPS without bypass depends on the reliability of the rectifier, battery and inverter (see Figure 1). For example, in the event of an inverter fault, the load would crash. UPS 081-01-00
Calculation of MTBF (MRBFSU) (MTBFSU is the mean time between failures of single unit without static bypass UPS is the failure rate of single unit without static bypass switch RECT is the failure rate of the rectifier BATT is the failure rate of the battery INV is the failure rate of the inverter) MTBFUPS = 1/UPS UPS = RECT + BATT + INV Figures from statistical failure analysis show RECT = 20 per million hours, BATT = 10 per million hours, INV = 20 per million hours. If these figures are applied in the equation, MTBFUPS for a UPS system without static bypass switch will be 20 000 hours. Single UPS with static bypass switch The reliability of a single UPS can be increased significantly by introducing a redundant mains power source and linking it to the main UPS supply source by means of a static bypass transfer switch (Figure 2). For example, in the event of an inverter fault the load will not crash, but will transferred to mains without interruption.
Electrical block diagram Mains M Rectifier RECT Reliability block diagram (series diagram)
Battery
BATT
Inverter
INV
Static bypass
SBS
Load
Figure 2 Single UPS with static bypass - electrical block diagram and reliability diagram Calculation of MTBF (MTBFSU+SBS) (For all subsequent calculations): MTBFUPS+SBS is the mean time between failures of single unit with static bypass switch MTBFM is the mean time between failures of the mains UPS+SBS is the failure rate of a single unit with static bypass switch SBS is the failure rate of the static bypass switch with control circuit PBUS is the failure rate of parallel bus (only for parallel systems) M is the failure rate of the mains SU is the repair rate of the static bypass switch (SU = 1/MTTRUPS) M is the repair rate of mains (M = 1/MTTRM) MTTRSBS is the mean time to repair of static bypass switch MTTRM is the mean time to repair of the mains) UPS 081-01-00
MTBFUPS+SBS = 1/UPS+SBS UPS+SBS = UPS//M+SBS UPS/M = UPS M (UPS + M + UPS+M) UPS M +UPS M +(UPS+M) (UPS+M)+2UPS+2M = UPS M (UPS+M) = 6 per million hours UPS M Note that all calculations are performed using the following constants: MTBFM = 50 hours, this figure represents a good quality mains MTTRUPS = 6 hours MTTRM = 0.1 hours. Furthermore, from statistical failure analysis, the figures for the failure rates of the static bypass switch for the power part and the control electronics part give SBS = 2 per million hours. Using these results: UPS+SBS = UPS//M + SBS = 6 + 2 per million hours = 8 per million hours, or MTBFUPS+SBS = 125 000 hours. In the above formula it can be seen that the reliability of the UPS with static bypass switch (MTBFUPS+SBS) depends largely on three parameters: the reliability of the mains, the MTTR of the UPS and the reliability of the static bypass switch. This dependence is illustrated in Figure 3.
Figure 3 Graph showing the dependence of the MTBFUPS+SBS on MTBFMAINS and MTTRUPS
UPS 081-01-00
Parallel redundant UPS with static bypass switch The reliability of a single UPS can be increased significantly by introducing a redundant parallel configuration (Figure 4).
Electrical block diagram Reliability block diagram
UPS 1
UPS 2
UPS n+1
Rectifier
Rectifier
Rectifier
RECT
RECT
RECT
Battery
Battery
Battery
BATT
BATT
BATT
Inverter
Inverter
Inverter
INV
INV
INV
Static bypass
Static bypass
Static bypass
PBUS
Parallel bus
Parallel bus
PBUS
PBUS
SBS
SBS
SBS
Load
Figure 4 (n=1) parallel redundant UPS with static bypass electrical and reliability block diagram Calculation of MTBF for an (n+1) redundant parallel UPS system (MTBF(n+1)UPS+SBS) Failure rate, (n+1)UPS+SBS = (UPS1//UPS2..//UPS(n+1)) + (n+1)PBUS + (SBS1//SBS2..//SBS(n+1)) ~ (n+1)PBUS Reliability, MTBF(n+1)UPS+SBS = 1/(n+1)UPS+SBS Availability A(n+1)UPS+SBS = MTBF(n+1)UPS+SBS MTBF(n+1)UPS+SBS + MTTRUPS The following constants are used in the calculations (Figure 5): MTBFM = 50 hours, this figure represents a good quality mains PBUS = 0.4 per million hours.
UPS 081-01-00
The reliability of an (n+1) parallel redundant system depends largely on the failure rate of the parallel bus, which is the only single point of failure. The UPS parallel redundant chains, the static bypass switches and their control electronics, as well as the mains power lines are all redundant and have therefore minor or even negligible impact on the overall reliability.
Reliability (MTBF) hours 1 250 000 830 000 650 000 500 000 420 000 Failure rate () per million hours ~ 0.8 ~1.2 ~1.6 ~2.0 ~2.4
Redundant parallel configuration (1+1) redundant configuration (2+1) redundant configuration (3+1) redundant configuration (4+1) redundant configuration (5+1) redundant configuration
Figure 5 MTBF and failure rate for (n+1) redundant configurations The following constants are used in the calculations (Figure 5): MTBFM = 50 hours, this figure represents a good quality mains PBUS = 0.4 per million hours. Comparison of UPS configurations In the single UPS chain (rectifier, battery and inverter), the reliability of the UPS largely depends on the reliability of the inverter. By introducing the static bypass switch, which provides a reserve mains power supply, the reliability will increase by a factor of six if the mains MTBF is 50 hours (good quality) and the MTTR of the UPS is six hours. This reliability level is unfortunately not sufficient, because it still depends substantially on the reliability of the raw mains and on the quality of the UPS aftersales organization (response time, travelling time, repair time, etcetera). Modern critical loads cannot rely on the mains quality and on longer repair times. To overcome the dependance on the raw mains, n+1 redundant parallel UPS configurations are recommended. The disadvantage of traditional standalone n+1 redundant configuration is the relatively long repair time of the UPS (typically six to 12 hours). By implementing modular, hot-swappable, n+1 redundant parallel systems based on modular PowerWAVE technology, the critical load will be completely mains independent. A faulty UPS module may be replaced without the need to transfer the healthy UPS modules to raw mains. Furthermore the replacement of the modules takes at most 0.5 hours, which dramatically decreases the time-to-repair in comparison with traditional parallel systems. The following example shows the impact on the reliability and availability of the choice of UPS configuration, comparing a traditional (1+1) redundant UPS configuration and a modular (4+1) redundant UPS configuration.
UPS 081-01-00
UPS 1 30 kVA
UPS 2 30 kVA
UPS 3 30 kVA
UPS 4 30 kVA
UPS 5 30 kVA
Availability
Availability
Load 120kVA
Load 120kVA
Figure 6 Block diagram of two redundant UPS configurations Figure 6 shows a block diagram of two redundant UPS configurations. The system on the left represents a (1+1) redundant configuration with traditional standalone UPSs, whereas the system on the right side represents a (4+1) redundant configuration with modular hot-swappable UPSs. Availability (A) is an important parameter when evaluating the reliability of UPS configurations. A is defined as: A= MTBFUPS MTBFUPS + MTTRUPS Figure 7 compares the availability of the configurations shown in Figure 6. Note that the MTBFUPS figures are taken from Figure 5. Two cases are considered: Case 1: both UPS configurations have the same MTTRUPS, six hours Case 2: the traditional standalone UPS configurations has MTTRUPS of six hours, whereas the modular UPS configuration with hot-swappable modules has MTTRUPS of 0.5 hours.
UPS 081-01-00
Redundant configuration (1+1) Case 1 MTBF MTTR Availability Case 2 MTBF MTTR Availability stand-alone 1 250 000 hours 6 hours 0.9999952 (5 nines) stand-alone 1 250 000 hours 6 hours 0.9999952 (5 nines)
Redundant configuration (4+1) stand-alone 500 000 hours 6 hours 0.9999888 (4 nines) modular, hot swappable 500 000 hours 0.5 hours 0.9999990 (6 nines)
Figure 7 Comparison of availabilities of 1+1 and 4+1 configurations In Case 1, the availability of the (1+1) redundant configuration is higher than the availability of the (4+1) redundant configuration if the MTTR is the same for both configurations. This is due to the fact that the MTBF of a (1+1) redundant configuration is higher than the MTBF of a (4+1) redundant configuration. In Case 2, the availability of a (1+1) redundant configuration with longer MTTR is lower than the availability of a (4+1) redundant configuration with a shorter MTTR. Conclusion These cases show how important MTTR is for reaching high availabilities. If in one of the above redundant configurations one of the UPS is faulty, there will be no redundancy left (low-availability regime) and the faulty part/module must be repaired or replaced as quickly as possible in order to restore redundancy (high-availability regime). With PowerWAVE modular UPS, the shortest MTTRs are achieved and, consequently, the highest availabilities, even if a larger number of modules are paralleled. Contact Uninterruptible Power Supplies Ltd Woodgate Bartley Wood Business Park Hook Hampshire RG27 9XA Phone: Email: Web: 01256 386700 sales@upspower.co.uk www.upspower.co.uk
UPS 081-01-00