Home > Data Center Tips > Infrastructure Management Tips > Compression, deduplication and encryption: What's the difference?
Data Center Tips:
EMAIL THIS
 TIPS & NEWSLETTERS TOPICS 

INFRASTRUCTURE MANAGEMENT TIPS

Compression, deduplication and encryption: What's the difference?


Stephen J. Bigelow
07.13.2006
Rating: --- (out of 5)


Digg This!    StumbleUpon Toolbar StumbleUpon    Bookmark with Delicious Del.icio.us   


What you will learn from this tip: Learn the distinction between compression, deduplication and encryption as these concepts are gaining importance in everyday storage.

Hard disks now rule the storage industry, and traditional storage arrays are routinely flanked by other disk-based systems like virtual tape libraries (VTL) and replication platforms. As more corporate data is relegated to spinning disk, storage administrators must implement, configure and manage this escalating capacity -- stretching disk space to the limit while protecting important data against loss or theft. Compression, deduplication and encryption are emerging in disk storage, and it's important to understand the role of each.

Compression

Compression is actually a decades-old idea, but it's making a renewed appearance in storage systems like VTL. Compression basically attempts to reduce the size of a file by removing redundant data within the file. By making files smaller, less disk space is consumed, and more files can be stored on disk. For example, a 100 KB text file might be compressed to 52 KB by removing extra spaces or replacing long character strings with short representations. An algorithm recreates the original data when the file is read. Picture files are also usually compressed. For example, the .jpg image file format uses compression to eliminate redundant pixel data.

Virtually any file can be compressed, though files with non-redundant data may compress little (if at all), so compression ratios are a guideline -- not a rule. For example, a 2:1 compression ratio can ideally allow 400 GB worth of files on a 200 GB disk (or 200 GB worth of files would only take 100 GB on the disk). It's very difficult to determine exactly how much a file can be compressed until a compression algorithm is applied.

Deduplication

A typical data center may be storing many copies of the same file. File deduplication (sometimes called data reduction or commonality factoring) is another space-saving technology intended to eliminate redundant (duplicate) files on a storage system. By saving only one instance of a file, disk space can be significantly reduced.

For example, suppose the same 10 MB PowerPoint presentation is stored in 10 folders for each sales associate or department. That's 100 MB of disk space consumed to maintain the same 10 MB file. File deduplication ensures that only one complete copy is saved to disk. Subsequent iterations of the file are only saved as references that point to the saved copy, so end-users still see their own files in place. Similarly, a storage system may retain 200 e-mails, each with a 1 MB attachment. With deduplication, the 200 MB needed to store each 1 MB attachment is reduced to just 1 MB for one iteration of the file.

Deduplication can also provide more granular control, removing redundant portions of files -- potentially down to the block level. This is common in content-addressed storage (CAS) systems like Avamar Technology Inc.'s Axion product. When evaluating a de-duplication product, it's important to understand the granularity offered by their platform.

Encryption

With the increasing prominence in government regulations and corporate litigation, storage managers acknowledge the role of security in enterprise storage. Encryption is used to protect data, preventing unauthorized users from accessing information even if files are hacked and stolen. You can find encryption in secure systems like Nexsan Technologies' Assureon CAS appliance.

Encryption uses a mathematical algorithm with a unique key to encode a file into a form that cannot be read. No one else can access or use the encrypted file until it is unencrypted again using the identical key. Of course, if the encryption key is lost or forgotten, any data encrypted with that key will be rendered inaccessible.

Do you know….

How to troubleshoot your NAS system?

How to implement SAS storage?


Rate this Tip
To rate tips, you must be a member of SearchDataCenter.com.
Register now to start rating these tips. Log in if you are already a member.




BROWSE BY TAG
Infrastructure Management Tips,   Data center disaster recovery planning,   Data center operations management,   Mainframe management,   Chapter 2: Mainframe security and disaster recovery,   Encryption,   VIEW ALL TAGS

Digg This!    StumbleUpon Toolbar StumbleUpon    Bookmark with Delicious Del.icio.us   



RELATED CONTENT
Infrastructure Management Tips
Grow a green business: Cut costs and improve energy efficiency with green IT
Closing the green gap: Expanding data centers with environmental benefits
Green data center site selection: Cost versus sustainability
Improving data center cooling capacity with chilled water plants
How to prepare for remote data center maintenance trips
DC Pro: a breakdown of a data center efficiency tool
Selecting a general contractor: Data center construction runbook, Chapter 3
Selecting a winning data center design team: Data center construction runbook, Chapter 2
Protecting your data center from real show-stoppers: Preparing a disaster recovery plan
Economizer performance: Applying CFD modeling to the data center's exterior

Data center disaster recovery planning
Data center pros must prepare for increased space weather
Are you wasting disaster recovery budget to avoid employee inconvenience?
Four ways to extend data center UPS battery life
Why SunGard uses flywheel UPSes in its data centers
Lower disaster recovery costs with open source replication tools
Apple updates Xserve, Symantec ships DR monitoring software: News in brief
Data center disaster recovery planning
Google opts for battery backup over UPS: News in brief
Avoiding disaster recovery pitfalls in VMware and Linux: Rejecting default settings
Make IT change management part of your disaster recovery plan
Data center disaster recovery planning Research

Encryption
Mainframe shops scramble to encrypt tapes
CA offers mainframe tape encryption
IBM mainframe encryption: The gold standard for security does an upgrade
IBM steps up mainframe encryption
Privacy breaches: How to avoid making headlines
Storage encryption prevents identity theft

RELATED GLOSSARY TERMS
Terms from Whatis.com − the technology online dictionary
Business Continuity and Disaster Recovery (BCDR)  (SearchStorage.com)
high availability  (SearchDataCenter.com)
RAIN  (SearchDataCenter.com)
uninterruptible power supply  (SearchDataCenter.com)
Uptime Institute, Inc.  (SearchDataCenter.com)

RELATED RESOURCES
2020software.com, trial software downloads for accounting software, ERP software, CRM software and business software systems
Search Bitpipe.com for the latest white papers and business webcasts
Whatis.com, the online computer dictionary

DISCLAIMER: Our Tips Exchange is a forum for you to share technical advice and expertise with your peers and to learn from other enterprise IT professionals. TechTarget provides the infrastructure to facilitate this sharing of information. However, we cannot guarantee the accuracy or validity of the material submitted. You agree that your use of the Ask The Expert services and your reliance on any questions, answers, information or other materials received through this Web site is at your own risk.



White Papers - Data Center Networking

The Intel IT Technology Center - Power, Performance and Mobility Solutions

HomeNewsTopicsITKnowledge ExchangeTipsBlogsMultimediaWhite PapersEvents
About Us  |  Contact Us  |  For Advertisers  |  For Business Partners  |  Site Index  |  RSS
SEARCH 
TechTarget provides technology professionals with the information they need to perform their jobs - from developing strategy, to making cost-effective purchase decisions and managing their organizations' technology projects - with its network of technology-specific websites, events and online magazines.

TechTarget Corporate Web Site  |  Media Kits  |  Site Map




All Rights Reserved, Copyright 2005 - 2009, TechTarget | Read our Privacy Policy
  TechTarget - The IT Media ROI Experts