Home > Data Center Tips > Data Center Management Advisory Newsletter > Top items in a capacity planning and performance monitoring program
Data Center Tips:
EMAIL THIS
 TIPS & NEWSLETTERS TOPICS 

DATA CENTER MANAGEMENT ADVISORY NEWSLETTER

Top items in a capacity planning and performance monitoring program


Kackie Cohen
04.26.2006
Rating: -4.25- (out of 5)


Digg This!    StumbleUpon Toolbar StumbleUpon    Bookmark with Delicious Del.icio.us   


More on this topic:

Top 10 data center operating procedures

Top 10 considerations for application hosting

Regardless of the size or complexity of your organization, a capacity planning and performance monitoring program is essential for proactive planning and management of your data center. Monitoring technical aspects of your environment, such as CPU, disk, memory and network utilization are important, but the ideal capacity planning and performance monitoring program goes beyond these types of measurements and includes both technical operations and business concerns. This article lists some important items to include in your data center's capacity planning and performance monitoring program.

  1. Set up and follow a regular schedule for review, analysis and presentation of the information you gather. The most important element of any planning and monitoring program is making it a routine event within your organization. Quarterly review and analysis is a good interval to plan for, but it may make more sense for your organization to do this review more or less frequently. Present your findings to stakeholders, along with next steps for addressing problem areas.
  2. Success and failure rates of backups. Regularly pulling tapes from on- and off-site storage facilities, and restoring files to ensure data is being backed up successfully and is accessible within the timeframe set forth by your organization, is part of standard operating procedures for many data centers. Analyzing the timing and success/failure rates over time is a good thing to include in your stakeholder presentations. Many stakeholders aren't interested in the mechanics of "how" the process works, but they do want to know that it works and how long it takes to restore a file in the event of a catastrophe.
  3. Put together a lifecycle roadmap of standard hardware and software supported in your data center. Get in touch with the product marketing team for the hardware and software you have installed in your organization. Find out the three-to-five year plans for specific features and components of interest to your organization. Keeping up with this information on a regular basis will help you better match up corporate rollout and upgrade plans with the products that will have the longest support window.
  4. Determine which reports your organization really needs. A daily dashboard approach for outages and basic utilization levels is great, and creating trend reports based on the standard information that comes from your monitoring system is essential. However, your organization may really be most interested in reports that demonstrate cause and effect relationships between seemingly disparate activities, such as the recent implementation of a policy limiting the size of mailboxes and an increase in the amount of space utilized on file servers, as users begin to archive old mail elsewhere in the network.
  5. Hardware support and repair. Examine how your current repair process works over time, paying particular attention to speed of resolution and cost of repairs. Your organization may have an extensive hot or cold spare inventory, but buying and deploying redundant hardware that the staff isn't trained on can result in a longer downtime scenario because it takes the staff longer to diagnose and fix the problem. It may be more cost effective to purchase a premium four hour on-site support contract with a vendor to repair/replace/troubleshoot an issue.
  6. Determine which response, resolution or service restoration phase is where critical time is lost during an outage. Track how long each of these repair phases lasts. Use this information to determine where improvements can be made to shorten the duration of the outage. By looking at the tasks performed and how long they take during the initial response -- the troubleshooting and resolution phase, and finally service restoration phases -- an organization can determine if changes need to be made in notification and escalation processes, if additional staff training is required or a platform change for a specific application would be the most resource and cost-effective way to reduce the impact of outages on your organization.
  7. Utilization levels of staff. It's common to determine how many transactions per hour or minute a single server can perform, but typically staffing is reviewed in terms of how many servers can a single administrator support. Complex environments, such as multiple virtual machines on a single server, clusters or multiple servers in a load-balanced configuration to increase the availability of the environment, increase the workload on the administrator. Practical experience shows that organizations with lower administration and engineering skill sets should stick with running multiple servers at lower utilization rates. Lack of skilled staff, combined with running clustered servers in any OS, can actually reduce availability rather than increase it. It may be more cost-effective to run more hardware in a simple configuration than running less hardware in a more complex configuration with more skilled, and expensive, staff over time.
  8. Infrastructure. Don't forget to review the physical facilities in addition to the computational gear. Electrical and cooling needs should be matched up with the roadmap mentioned in item No. 3 above and planned accordingly. Conversely, if your data center has limited capacity for expansion of power or cooling, selecting server and network equipment with lesser power and cooling requirements is essential. Don't forget to evaluate infrastructure services like Netegrity, Active Directory, domain name server and Dynamic Host Configuration Protocol, as well as racking, cabling and the number of available ports on LAN/WAN/SAN gear with regards to planned upgrades, rollouts or migrations.
  9. Staging requirements. The amount of space and resources an organization devotes to staging equipment for preproduction application testing is an important consideration. Work with development teams to determine the needs for services, such as Active Directory authentication, database space, availability and the hours the environment must be available for use. Include these plans and findings in the final analysis of the data center as a whole. It may be possible to identify ways to repurpose staff or equipment from other uses for support of the staging environment on an ongoing basis, limiting the total capital outlay but increasing support services provided for development teams.

Rate this Tip
To rate tips, you must be a member of SearchDataCenter.com.
Register now to start rating these tips. Log in if you are already a member.




Digg This!    StumbleUpon Toolbar StumbleUpon    Bookmark with Delicious Del.icio.us   


RELATED CONTENT
Data Center Management Advisory Newsletter
Negotiating a successful data center outsourcing arrangement
Top tactics for patch management strategy
ASHRAE data center infrastructure book eases bad vibrations
Green data centers are about money, not the environment
Data center facility control systems
APC launches data center infrastructure monitoring software
Google's Iowa data center smart move, experts say
Data center humidity levels source of debate
ITIL v3 launches with focus on IT service management
AFCOM special report

Capacity planning
BMC tool optimizes zIIP and zAAP use by IBM mainframe
IBM tool improves mainframe capacity planning
Cheap commodity servers can turn into expensive investments
Capacity planning for virtual servers: New risks, new tools
Time is ripe for data center infrastructure databases
Aperture unveils new capacity planning tool
Mainframe capacity planning: More than MIPS
Data center consolidation, virtualization: Ultra-dense server deployments
CA zaps mainframe costs with zIIP specialty engine
Data centers handle holiday rush

What is capacity planning?
Data centers handle holiday rush
Nine tips for nifty capacity planning
Capacity planning no longer a guessing game

RELATED GLOSSARY TERMS
Terms from Whatis.com − the technology online dictionary
capacity on demand  (SearchDataCenter.com)
capacity planning  (SearchDataCenter.com)
MIS  (SearchDataCenter.com)
server consolidation  (SearchDataCenter.com)
server sprawl  (SearchDataCenter.com)
Uptime Institute, Inc.  (SearchDataCenter.com)

RELATED RESOURCES
2020software.com, trial software downloads for accounting software, ERP software, CRM software and business software systems
Search Bitpipe.com for the latest white papers and business webcasts
Whatis.com, the online computer dictionary

DISCLAIMER: Our Tips Exchange is a forum for you to share technical advice and expertise with your peers and to learn from other enterprise IT professionals. TechTarget provides the infrastructure to facilitate this sharing of information. However, we cannot guarantee the accuracy or validity of the material submitted. You agree that your use of the Ask The Expert services and your reliance on any questions, answers, information or other materials received through this Web site is at your own risk.

HomeNewsTopicsITKnowledge ExchangeTipsBlogsMultimediaWhite PapersEvents
About Us  |  Contact Us  |  For Advertisers  |  For Business Partners  |  Site Index  |  RSS
SEARCH 
TechTarget provides enterprise IT professionals with the information they need to perform their jobs - from developing strategy, to making cost-effective IT purchase decisions and managing their organizations' IT projects - with its network of technology-specific Web sites, events and magazines.

TechTarget Corporate Web Site  |  Media Kits  |  Reprints  |  Site Map




All Rights Reserved, Copyright 2005 - 2008, TechTarget | Read our Privacy Policy
  TechTarget - The IT Media ROI Experts