Data center power efficiency increases, but so do power outages
An Uptime Institute survey finds the power usage effectiveness of data centers is better than ever. However, power outages have increased significantly.
A survey from the Uptime Institute found that while data centers are getting better at managing power than ever before, the rate of failures has also increased — and there is a causal relationship, reports Network World.
The Global Data Center Survey report from Uptime Institute gathered responses from nearly 900 data center operators and IT practitioners, both from major data center providers and from private, company-owned data centers.
According to Network World, the report found that the power usage effectiveness (PUE) of data centers has hit an all-time low of 1.58. By way of contrast, the average PUE in 2007 was 2.5, then dropped to 1.98 in 2011, and to 1.65 in the 2013 survey.
PUE is a measure of the power needed to operate and cool a data center. A PUE of 2 means for every watt of power to run the data center, another watt is needed to cool it. A PUE of 1.5 means for every watt into the IT systems, a half of a watt is needed for cooling. So, lowering PUE is something of an obsession among data center operators.
However, Uptime also found a negative trend: The number of infrastructure outages and “severe service degradation” incidents increased to 31 percent of those surveyed, that’s up 6 percentage points over last year’s 25 percent. Over the past three years, nearly half had experienced an outage at their own site or a service provider’s site, wrote Network World.
Most downtime incidents lasted one to four hours. Uptime asked people who suffered an outage what they estimated the cost to be, but 43 percent didn’t calculate the cost of an outage. That’s because far too many factors in determining the cause were outside that person’s specialty. Half of those who did make an estimate put the cost were less than $100,000, but 3 percent said costs were over $10 million.
What causes data center outages?
The leading causes of data center outages are power outages (33 percent), network failures (30 percent), IT staff or software errors (28 percent), on-premises non-power failure (12 percent), and third-party service provider outages (31 percent).
To err is human, and this survey showed it. Nearly 80 percent said their most recent outage could have been prevented. Another cause of failures is there is a trend toward data center consolidation, with firms moving workloads from secondary data centers to primary ones. This takes time, and since the secondary is being decommissioned, the owner doesn’t invest in it. So wear and neglect creeps into a doomed data center, making it more likely to fail, the website writes.
Another cause for problems is the cascading effect of one data center taking down others. That could be either two private data centers or a hybrid situation where an on-premises center is connected to a third-party provider such as Amazon or Microsoft. If one goes down, it has a greater chance of taking down the other(s).
Uptime found 24 percent of those surveyed said they were impacted by outages across multiple data centers, Network World reported.
Preventive maintenance plans are no small feat. If you’re accustomed to reacting to unexpected breakdowns and critical emergency repairs, a preventive maintenance plan will force you to think months, even years ahead. This may take you out of your comfort zone, but it will instill peace of mind knowing that your critical assets are covered.
If you are considering to employ RCM analysis at your facility, it means you have recognized the need for a change in your maintenance strategies. Reliability-centered maintenance is an excellent way to keep your plant or machinery up and running by helping you choose the optimal maintenance strategy for all of your important assets.