Home > Data Center Tips > Data Center Management Advisory Newsletter > Data growth: The worst enemy of your recovery time objective
Data Center Tips:
EMAIL THIS
 TIPS & NEWSLETTERS TOPICS 

DATA CENTER MANAGEMENT ADVISORY NEWSLETTER

Data growth: The worst enemy of your recovery time objective


Richard Jones, VP and Service Director for Data Center Strategies
08.01.2008
Rating: --- (out of 5)


IT infrastructure news
Digg This!    StumbleUpon Toolbar StumbleUpon    Bookmark with Delicious Del.icio.us    Add to Google


What do you do if your recovery data set has grown so large that traditional methods of disaster recovery (DR) won't meet your objectives? New technologies such as deduplication, storage tiering and data management policies can alleviate the high costs of a DR plan while also achieving the desired recovery time objective (RTO).

In a previous tip, I told the story of a company that neglected to keep its DR plan up-to-date. What hit it was the curse of ever-expanding data storage. Data growth over a four-year period has rendered its tape restore process completely inadequate for meeting its RTO. Once discovered, its solution was to change recovery technology from linear tape to asynchronously replicated storage co-located over 400 miles from its primary data center -- a solution that costs a great deal more money.

In today's economy, many IT shops do not have the budget to make this kind of change. Also, tape restores have always been slower than backups. The industry has been so focused on tape backup time windows that restore times tend to get lost. When was the last time you saw a vendor publish its technology's restore time?

There are a number of relatively new technologies on the market that, when coupled with some discipline, can help stave off the need to acquire a more expensive solution in order to meet RTOs. Let's look at each of these:

Data deduplication has been bantered around by vendors recently. If you have not done so yet, you should take a look at this technology. Data deduplication is a higher form of compression that seeks out duplicate files within a file system. This technology can be implemented at the data block level -- seeking duplicate data blocks within a disk volume. What does this give you? It finds and removes duplicate files or blocks from your storage volume and replaces them with small pointers to one "master" data file or block. The stored data size can be significantly reduced. Another advantage is that it can be


BROWSE BY TAG
Data center disaster recovery planning,   Data center operations management,   Storage backup - data center,   Data center storage,   Data Center Management Advisory Newsletter,   VIEW ALL TAGS

Digg This!    StumbleUpon Toolbar StumbleUpon    Bookmark with Delicious Del.icio.us    Add to Google


RELATED CONTENT
Data center disaster recovery planning
Lower disaster recovery costs with open source replication tools
Apple updates Xserve, Symantec ships DR monitoring software: News in brief
Data center disaster recovery planning
Google opts for battery backup over UPS: News in brief
Avoiding disaster recovery pitfalls in VMware and Linux: Rejecting default settings
Make IT change management part of your disaster recovery plan
Using carrier colocation in your data center disaster recovery plan
Gmail outage caused by data center rolling blackout
SunGard opens new data center, and other facility news in brief
Disaster recovery planning tools: Are they worth it?
Data center disaster recovery planning Research

Storage backup - data center
Data backup software: Selection, implementation and optimization advice for Indian businesses
Lower disaster recovery costs with open source replication tools
Avoiding disaster recovery pitfalls in VMware and Linux: Rejecting default settings
Sun Microsystems provides storage, hard drive wiping services
Offsite data storage advice
Tapeless backup gains favor among storage architects
Deduping: an essential backup tool in the data center?
Tiered storage shifts focus away from disk
CDW improves backup server speeds by 1,000%
Sun props up storage line in quarterly update
Storage backup - data center Research

Data Center Management Advisory Newsletter
Where and how to build your next data center in a down economy
Avoid data center relocation failure: Top five pitfalls
Understanding green IT and data center energy-efficiency regulations
Preventing a data center power crisis
Using chargeback to reduce data center power consumption: Five steps
Using Linux in a data center consolidation management strategy
Packaged chiller plants increase data center cooling efficiency
Chiller optimization for data center infrastructure
How many data centers are sufficient for disaster recovery? Two, three, or more?
Negotiating a successful data center outsourcing arrangement

RELATED GLOSSARY TERMS
Terms from Whatis.com − the technology online dictionary
Business Continuity and Disaster Recovery (BCDR)  (SearchStorage.com)
high availability  (SearchDataCenter.com)
RAIN  (SearchDataCenter.com)
uninterruptible power supply  (SearchDataCenter.com)
Uptime Institute, Inc.  (SearchDataCenter.com)

RELATED RESOURCES
2020software.com, trial software downloads for accounting software, ERP software, CRM software and business software systems
Search Bitpipe.com for the latest white papers and business webcasts
Whatis.com, the online computer dictionary


used in conjunction with traditional file compression so that the "master" file is also smaller as a result of the compression algorithms.

The question to ask is: "Does data deduplication help me with all of the data in my data center?" The answer is no. This technology is especially helpful with unstructured data -- those file servers that hold employee office productivity files. Just think of the case when the human resources department sends out information on benefits open enrollment to all employees. Most employees are going to save that information -- and now you have nearly as many copies of the same information as you have employees. And worse, you are forced to restore all those copies in the case of a disaster. But what about your structured data? Those databases consist of only a handful of large files, so file-level data deduplication is not going to help you. Block-level data deduplication can, however. The savings may not be as great, but in today's world, every little bit helps. Where similar database records yield identical data blocks, those can be deduplicated. However, databases with little repetitive information will not yield noticeable savings.

Most backup and restore vendors' latest products include data deduplication. You should consider upgrading to leverage these features, especially for your unstructured data volumes. Smaller data sets will always result in shorter restore times in a DR situation.

Storage tiering may offer relief to data growth as well. Older data is moved to secondary tiers of storage. Vendor products that can split off the secondary storage tier can help you meet your RTOs. The process involves a level of data categorization, meaning that the data on the primary tier is most important and is what the RTO applies to. The data on the secondary tier is not as important and can be recovered later. Older data that is not required for day-to-day business operations but must be available for regulatory or other reasons can be restored later. Let's look at an example. A data volume holding unstructured data typically has only up to 20% of its data in active use. Eighty percent of the data has not been accessed for six months or longer. Applying storage tiering where the 20% is maintained on the primary tier and the older 80% is maintained on the secondary tier can speed restore times of the primary data by five times. RTOs for the business can easily be met. However, don't forget to designate RTOs for the secondary data. It will eventually need to be recovered.

Unfortunately, the number of data tiering solutions that allow for split recovery of primary and secondary tiers is limited. Make sure you check with storage tiering vendors to ensure that the primary can be restored without the secondary tier. Hopefully more solutions will appear in the future.

Data management policies within your organization can also help get a handle on your RTOs. Similar in concept to storage tiering, this includes corporate data deletion policies of older data after it has been archived to offline media (tape, DVD, etc.). Such policies groom the active data by removing older data to keep the size of your active data volumes in check. This is accomplished by automatically purging database records and deleting files that are older than a particular date. Policies need to be determined per data type and importance. While many financial records must be kept permanently, most do not need to be kept on active storage for longer than three years. A good archive of the data is sufficient. Often times end-user office productivity data doesn't need to be kept on active storage for more than 18 months. These policies depend greatly on the line of business and other regulatory requirements, so these approaches may not work in all cases.

The drawback with data management policies is they require some level of effort to manage and audit. Deleting data that has not been archived could spell regulatory problems for a company. The recent Federal Rules of Civil Procedure changes will overrule deletion policies if a legal hold is declared in your organization. Additionally, search-engine technologies can make all data appear as if it has been recently accessed. Search engines must open and read the entire file to create content indexes. Make sure your search technology keeps a record of those files that have already been indexed, otherwise an automated deletion system will never find any files that have not been recently accessed.

If your data growth rate is such that these measures will only buy you a short amount of time, you still need to implement alternative technologies to ensure you can continue to meet your RTOs.

Let us know what you think about this tip. Was it useful? Email Site Editor, Matt Stansberry.

Rate this Tip
To rate tips, you must be a member of SearchDataCenter.com.
Register now to start rating these tips. Log in if you are already a member.




DISCLAIMER: Our Tips Exchange is a forum for you to share technical advice and expertise with your peers and to learn from other enterprise IT professionals. TechTarget provides the infrastructure to facilitate this sharing of information. However, we cannot guarantee the accuracy or validity of the material submitted. You agree that your use of the Ask The Expert services and your reliance on any questions, answers, information or other materials received through this Web site is at your own risk.



White Papers - Data Center Networking

The Intel IT Technology Center - Power, Performance and Mobility Solutions

HomeNewsTopicsITKnowledge ExchangeTipsBlogsMultimediaWhite PapersEvents
About Us  |  Contact Us  |  For Advertisers  |  For Business Partners  |  Site Index  |  RSS
SEARCH 
TechTarget provides technology professionals with the information they need to perform their jobs - from developing strategy, to making cost-effective purchase decisions and managing their organizations' technology projects - with its network of technology-specific websites, events and online magazines.

TechTarget Corporate Web Site  |  Media Kits  |  Site Map




All Rights Reserved, Copyright 2005 - 2009, TechTarget | Read our Privacy Policy
  TechTarget - The IT Media ROI Experts