Home > Data Center Tips > > Reduce chances of hardware failure with preventive server maintenance
Data Center Tips:
EMAIL THIS
 TIPS & NEWSLETTERS TOPICS 


Reduce chances of hardware failure with preventive server maintenance


Logan G. Harbaugh, Contributor
09.16.2009
Rating: -1.50- (out of 5)


IT infrastructure news
Digg This!    StumbleUpon Toolbar StumbleUpon    Bookmark with Delicious Del.icio.us    Add to Google


Servers as hardware tend to get attention only when it's time to upgrade or when something goes wrong. The most common types of server hardware failures are failures of hard drives, power supplies, RAID adapters, motherboards, RAM or CPU. There are a number of preventative measures you can take to minimize the likelihood of a failure and to have the best chance of a quick recovery incase of a failure.

Use good equipment and care for it well
First, use good hardware and protect it well. While it is certainly possible to use a standard PC as a server, it's a recipe for disaster in any production server. Dedicated servers use boards that are intended to run 24/7 and are generally better engineered to make failures less likely.

At the high end, servers from makers like Hewlett-Packard Co. and Dell may even include features such as dual power supplies, hot-swap capability for PCI slots and fault-tolerant RAM that will continue to function even if one RAM module fails. In a similar vein, having a RAID 5 or 10 array is basic, but using enterprise-class drives is also important. Look for Raid Edition or Enterprise designations to ensure the drives are engineered to run in a 24/7 duty cycle.

Second, protect the hardware. Ensure that servers are plugged into high-quality (not $7.99) surge protectors or uninterruptible power supplies (UPSes). Servers -- and their air inlet filters -- should be cleaned regularly. The metallic dust that often accumulates in industrial buildings may contain metallic particles that can not only cause a server to overheat, but short it out as well. Don't leave slots uncovered if boards are removed -- insects and mice can enter through uncovered holes and wreak havoc on the system.

Make sure that the systems are well ventilated. If they're mounted in racks, ventilation holes shouldn't be covered, but equipment stacked in a wiring closet can easily end up with obstructed ventilation outlets, which is a sure...


BROWSE BY TAG
Blade server trends and advice,   Server hardware,   Information systems management,   Hardware and performance monitoring,   Data center operations management,   VIEW ALL TAGS

Digg This!    StumbleUpon Toolbar StumbleUpon    Bookmark with Delicious Del.icio.us    Add to Google



RELATED CONTENT
Blade server trends and advice
HP's buyout of 3Com continues IT convergence push
Users buying, configuring servers for virtualization
Data center purchasing survey 2009: Budgets flatten, clampdown on costs
New eBay data center director dishes
Blade server popularity cools
Tutor Perini: First Cisco UCS production customer
Vendors pledge joint support for Cisco's Unified Computing System
Roadmap for Sun Microsystems customers after the Oracle acquisition
Data center managers plan for power density jumps
Cisco's hopes its Extend Memory technology will boost UCS

Hardware and performance monitoring
APC adds monitoring, efficiency features to UPS line
Application performance monitoring firm targets cloud computing
Mission impossible: Data center asset management
Check server specs before upgrading your operating system
Sys admins: Are your config files and scripts hacks or products?
Systems management tools: Microsoft takes aim at the Big Four
HP downsizes data center cooling monitor: News in brief
Zenoss upgrades IT monitoring software to vie with Big Four
Indemnification, support woes plague open source systems management
Capacity planning tools tutorial for Linux and Unix

RELATED GLOSSARY TERMS
Terms from Whatis.com − the technology online dictionary
blade server  (SearchDataCenter.com)
brick server  (SearchDataCenter.com)
Cisco Catalyst Blade Switch 3020  (SearchDataCenter.com)
compaction  (SearchDataCenter.com)
cook-off test  (SearchDataCenter.com)
server blade  (SearchDataCenter.com)
unified computing system (UCS)  (SearchDataCenter.com)

RELATED RESOURCES
2020software.com, trial software downloads for accounting software, ERP software, CRM software and business software systems
Search Bitpipe.com for the latest white papers and business webcasts
Whatis.com, the online computer dictionary


way to overheat and shorten the life of a server. Adequate cooling for the number of servers is also a necessity. The server room should be at no more than 70 degrees Fahrenheit, and the cooler the equipment stays, the longer it should last.

What to do if hardware fails anyway
If after all the precautions, equipment fails anyway (which it will, on occasion), you can do one of two things: pay more for overnight, on-site service and trust that the manufacturer will live up to the terms of the agreement, or keep spares on hand. If all your servers were purchased from the same manufacturer at the same time, it will be relatively cheap to keep at least some spares on hand -- one power supply, one motherboard, an extra CPU or two, enough RAM for one system, and two or three extra drives.

If you didn't purchase your systems at the same time, it gets more complex. Even if they're all from the same manufacturer, motherboards may not be the same even if the model number is, and the same model can have different CPUs and memory as well. It may be simpler to keep an extra server ready to go and move the disks from an existing server that breaks to the backup system.

Replacing components is generally straightforward once you figure out what's causing the problem. Servers are modular, and it's a straight remove-and-replace job. Dedicated servers can even make this part easier, with trouble lights that show exactly where a fault is located or which part has failed. The only critical thing to watch for is electrostatic discharge -- make sure to ground yourself by touching the power supply before handling any components.

ABOUT THE AUTHOR: Logan Harbaugh is a freelance reviewer, network systems analyst and consultant, specializing in reviews of network hardware and software, including network operating systems, clustering, load balancing, network-attached storage and storage area networks, traffic simulation, network management and server hardware.

What did you think of this feature? Write to SearchDataCenter.com's Matt Stansberry about your data center concerns at mstansberry@techtarget.com.

Rate this Tip
To rate tips, you must be a member of SearchDataCenter.com.
Register now to start rating these tips. Log in if you are already a member.




DISCLAIMER: Our Tips Exchange is a forum for you to share technical advice and expertise with your peers and to learn from other enterprise IT professionals. TechTarget provides the infrastructure to facilitate this sharing of information. However, we cannot guarantee the accuracy or validity of the material submitted. You agree that your use of the Ask The Expert services and your reliance on any questions, answers, information or other materials received through this Web site is at your own risk.



White Papers - Data Center Networking

The Intel IT Technology Center - Power, Performance and Mobility Solutions

HomeNewsTopicsITKnowledge ExchangeTipsBlogsMultimediaWhite PapersEvents
About Us  |  Contact Us  |  For Advertisers  |  For Business Partners  |  Site Index  |  RSS
SEARCH 
TechTarget provides technology professionals with the information they need to perform their jobs - from developing strategy, to making cost-effective purchase decisions and managing their organizations' technology projects - with its network of technology-specific websites, events and online magazines.

TechTarget Corporate Web Site  |  Media Kits  |  Site Map




All Rights Reserved, Copyright 2005 - 2009, TechTarget | Read our Privacy Policy
  TechTarget - The IT Media ROI Experts