Risk in the Battery Room - The Need for a New Battery Management Program

 Thomas Leonard and Gene Bohensky, BTECH, Inc.


Introduction

During the great power blackout of the summer of 2003, many data centers experienced significant system failures that theoretically should not have happened. In too many cases, the battery backup systems that should have provided adequate power protection for the Uninterruptible Power Supply (UPS) failed in the moment of need. The cause: inadequate monitoring and maintenance of the batteries.

The UPS is your data insurance policy and business lifeline. The justification to spend between $50,000 to $200,000 or more on the UPS is to assure critical power continuity during the 15 to 45 second lapse between main power outage and the start-up of the backup power generator. When quantifying the risk of loss from a UPS failure, the decision to maintain the UPS at its highest state of readiness is critical - as is making sure the batteries will perform. When the battery is compromised within the UPS, your data insurance policy disappears.

The US power grid is said to be 99.9 percent availability, which means the average consumer across the US experiences 8 ¾ hours of outages per year. That 0.1 percent risk factor represents $125 billion per year in direct losses. Indirect losses for recovery can easily double that figure. Unplanned, catastrophic UPS failures play a contributing role in that dollar figure, with power quality outages estimated to cost $25 billion to the US economy per year. When the UPS fails, facility engineers are fired, unrecoverable business losses are suffered, and in worst-case scenarios, lives are jeopardized. This risk need not exist, since meaningful, real-time battery performance data can now be obtained 24/7. With access to this data, users are able to accurately pinpoint and replace weak batteries before they pose a risk to the power backup system.

UPS Failure Modes are 98% due to batteries

A typical UPS module consists of a two components: the electronic power management and the backup battery power. The common knowledge failure rates stated within the backup power industry show that the battery is the root of failure 98 percent of the time (see Figure 1). However, the industry is caught in an apparent catch-22: unless current battery management practices change, the 98% won't change either. The data proves new battery management techniques are required if these risks are to be eliminated.


Figure 1: Batteries fail more often than electronics. UPS suppliers and data from a battery monitoring center indicate that the cause of UPS failure is battery failure 98 percent of the time, and electronics failure responsible for just 2 percent.

The Unpredictable Nature of Battery Failure

Early battery failure can occur for a variety of reasons, and seems to be more common among VRLA (Valve Regulated Lead Acid), commonly known as sealed or "maintenance-free" batteries. These batteries are being specified more often than wet cell types due to their much lower cost, smaller footprint and apparent ease of maintenance. Trend data collected from continuous remote monitoring of such battery systems has shown that individual battery jars can fail within a two week period (see Figure 2) - and it takes just one failed jar to bring down the entire string when under load! Some of the most common reasons for failure include:

1) Cracks in battery jar cases, cell dry-out, post seal leaks, stuck valves, plate sulfation, dendritic
shorts, cracked plates or separators and poor plate welds
2) Environmental factors, especially temperature - in one case monitoring information exposed administrative staff that shut off the air conditioning on weekends, resulting in temperatures exceeding 100 F in the battery room
3) Indefinite length of time sitting without a charge between the battery manufacture date and UPS system installation and commissioning
4) Battery manufacturing defects - monitoring data has shown up to 3% of batteries to be defective "out of the box" and others failing shortly after the warranty period
5) Frequent cycling
6) Inappropriate UPS float charge and equalization settings
7) Battery interconnections not properly torqued by maintenance personnel

Battery health has traditionally been difficult to detect. Standard methods - often included in many advanced UPS systems - use total system voltage or discharge cycle measurement to find bad batteries. However, these methods are unable to detect individual battery jar weakness in advance. By the time a fault is detected, the battery system has already been compromised for an indefinite length of time.

Current battery management practices supported by major UPS manufacturers and the IEEE recommend quarterly, semi-annual or annual battery maintenance, including visual inspections, voltage checks, specific gravity readings, rewatering, resistance tests, retorquing connections and a general cleaning. Battery strings are replaced well in advance of the end battery service life to minimize risk. These services are done by the UPS manufacturer's service department or an independent battery service provider. As Figure 1 illustrates, it is obvious that these practices do not assure adequate protection - there is simply too much time between service intervals when the batteries could have posed a risk.

An advanced battery management system using battery impedance measurement data as the principal method of battery failure prediction is the only way battery users can be sure their systems will work. When this data is collected and used for trend analysis, individual problem jars can be identified and swapped out before the system is at risk. The results of this data can be used to develop a new battery management program that can reduce or eliminate risk due to battery failure.



The graph illustrates a 2-week, on-line battery monitor reading that documents an increasingly common failure trend from monitoring data in VRLA batteries. Yellow diamonds document weekly impedance readings of a 12-volt VRLA jar in a cabinet of 40 jars supporting a UPS. Within a two week period, the jar went from a stable 2.95 milli ohm reading, to a 189 percent rise in the first week and a continued rise to 322 percent in the second week before finally being replaced. The graph shows a return to acceptable levels upon replacement.

A New Battery Management Recommendation for Data Centers

A revised battery management program recommendation has been developed using continuous remote battery monitoring as its cornerstone. By implementing this program, risk due to battery failure can be eliminated at the same time as battery maintenance and replacement costs are reduced. Batteries are managed by data rather than a set schedule. The features of such a program follow:

1) Continuous, remote monitoring of battery parameters including battery temperature, voltage, and impedance - in real time
2) Trained personnel analyze the data from a central control room, at least on a weekly basis, and contact the facilities engineer or local battery service provider when a problem occurs
3) Problem batteries are replaced or serviced as needed
4) Physical inspection of batteries reduced to once per year
5) Battery string replacement extended to the point where total string impedance has increased 10%

Summary

The quest in search of 99.99999 percent power reliability realizes great equipment costs associated with designing systems that reduce the risk of battery failure. The number of redundant battery strings and UPS systems designed into today's mission critical backup power systems clearly illustrate this. Trend data obtained from online battery monitoring supports industry claims that the battery, by an overwhelming margin, should be the ultimate concern in today's UPS systems - especially since just one rogue battery is able to cause a complete system failure. As a result, UPS systems stand at risk of failure a greater portion of time than is realized.

This insight justifies a shift in the way batteries are monitored and maintained. By moving to a battery management program based on real-time, data-based, continuous impedance measurement, risk due to battery failure can be virtually eliminated while lowering maintenance costs. Redundant UPS and battery systems may no longer be required as the risk of premature battery failure disappears. It is time to change the way batteries are managed.

BTECH has just completed a white paper entitled "Remote Monitoring of VRLA Batteries - Uncovering a Hidden Risk". Contact BTECH for a copy of the paper.

Thomas Leonard, Vice President, COO is the managing partner and Gene Bohensky the Sales Manager of BTECH, Inc.

Contact BTECH, Inc. at 973-683-9950 or visit www.btechinc.com

 
Privacy Policy | Terms of Use
© Copyright 2003 Comm Solutions Company. All rights reserved. Tel: 1.800.795.7559 Fax: 610.889.0484Email: info@commsolution.com