Data Backup and Storage
Our lives seem to revolve around computers more and more everyday, making us more and more vulnerable to the imperfections of technology. This article deals with threats surrounding our information and the measures we can take to prevent its loss.
History of Information Storage
Data Loss
• Hard Disk Failure
• Avoiding Data Loss
What is a Backup?
• Full Backup
• Incremental Backup
• Differential Backup
Which Files Should I Back Up?
How Often Should I Back Up My Data?
Types of Storage Devices
• External Hard Disks
• USB Flash Drives
• Manufacturers
• CD Technology
• DVD Technology
• Manufacturers
• Online Storage
• Services
Which Backup Storage Media is Right for Me?
What About the Future?
History of Information Storage
Before the appearance of written records, individuals relied solely on oral tradition to
preserve knowledge and historical accounts for the future generations. History reveals that humans
experimented with other methods of passing on information like painting on cave walls and creating
depictions on pottery. Today people associate information storage with quite a recent development —
the computer. Though this technology has existed for only short time, the evolution of information
preservation has long been in progress and of high priority for civilization.
As
early as 3,000 BC the Egyptians used one of the first mediums for keeping reliable written records —
papyrus. Papyrus, paper's predecessor, was made from the Cyprus papyrus plant and widely used in the
Mediterranean and Middle East, which both share a rather dry climate. Papyrus was preserved exceptionally
well in the dry conditions, making it an ideal form of paper and means of storing information. Parts of
the Dead Sea Scrolls, written before 100 AD and discovered in the mid-20th century, were written on papyrus,
enduring almost two millenniums. Papyrus remained widespread until the appearance of two other mediums —
parchment and paper.
It is believed that in the first few years of the second century AD the Chinese invented paper.
The exact year of its creation is not known. They made paper out of a pulp consisting of water and the bark
of a mulberry tree. Paper became superior to papyrus due to its lightweight and ability to hold up in moist
climates. Paper moved west quite quickly and reached Europe by the eighth century AD making way for the next
major innovation in the world of information preservation — the printing press.
Though
primitive devices for mass printing had appeared between 700 and 800 AD, Johann Gutenberg changed the
world of information preservation in 1440, when he invented the printing press. The printing press utilized
moveable metal type. These metal pieces could be used many times, simplifying and speeding up the printing
process. Gutenberg's press put knowledge into the hands of the masses very quickly and at a low price,
triggering the spread of literacy throughout Europe.
After the invention of the printing press, time brought with it many new forms of spreading
and preserving information, including the phonograph, punch card, and magnetic tape. But the next notable
and significant innovation in the information world came in 1956, when IBM released the 350 disk storage
unit, the grandfather of modern hard drives. This early hard drive stored a bit over four megabytes. Today's
hard drives store as much as up to 1.5 terabytes.
Since then many other mediums of information storage have appeared, including floppy and optical
discs, multimedia cards, USB flash drives and many others. The demand for new, more efficient information
storage devices grows everyday as civilization acquires more and more knowledge. In addition to providing
new ways of storing this knowledge, the ease of using this technology has given way to an abundance of
useless information — saving outdated files, unwatched films, and other useless things has become
cheap. However, this ability to store information and knowledge gives people a sense of security and hope
for tomorrow and generations to come.
Data Loss
Though the world now possesses an abundance of information storage devices, we still remain
at risk of data loss. Data loss is defined as the unintentional loss of data caused by file corruption,
hardware failure, human error or site-related events, such as fire or theft. Keeping this in mind, a report
on data loss done in 2003 shows that the leading cause of data loss is hardware failure, accounting for
40% of data loss incidents. Coming in second is human error, accounting for 29%. Human error includes any
loss of data caused by the user's actions such as the accidental deletion or overwriting of files. Keeping
all of this in mind, one of the most common types of hardware failure is the dreaded hard drive crash.
Read more about data loss, its causes and the price of such an event here — The Cost of Data Loss.
Hard Disk Failure
Many
people know that hard drives crash but very few understand why. In general, there are two types of hard drive
failure — logical and physical.
Logical hard disk failure typically occurs due to system-file corruption, which can be caused
by viruses or the accidental deletion of key-files. Though rare, logical failure can also happen as a result
of electro-static discharge. In the event of logical failure, the actual hard drive is healthy and recognized
by the computer but appears inaccessible due to the corrupt or fragmented files. If you experience logical hard
drive failure, it is recommended not to use repair utilities, as this can worsen the problem. You should contact
a hard disk data recovery specialist as soon as possible.
Physical hard disk failure occurs when the storage media has incurred some kind of physical
damage. Distinguishing between physical and logical disk failure can be quite easy. Unlike logical hard disk
failure, the drive is usually not recognized by the computer. Generally, this type of hard disk failure can
be grouped into two primary categories — mechanical and electronic. The most common reason for
mechanical failure is a head crash, which occurs when the read-write heads come into contact with the platters
of the hard disk. Another common form of mechanical failure is spindle motor malfunction. Mechanical failure
is typically accompanied by grinding or clunking sounds, or by a loud high-pitched whine.
Electronic failure most often occurs due to power surges or faulty circuit board components.
Another issue that can play a role in electronic failure is excessive heat and poor ventilation. Like logical
hard disk failure, physical hard disk failure is usually beyond the average computer user's abilities and should
be handled by a hard disk data recovery specialist.
Avoiding Data Loss
The majority of disasters are unforeseen and rarely can the user prevent such events as hard drive
failure or site related disasters. Nevertheless, the user does have a limited bit of control over data
loss. In fact, data loss can often be prevented if one takes the proper measures. The trick is taking action
before a disaster occurs, because once data is lost — it can't be recovered. The most
reliable and efficient way of protecting your data is quite a simple process of which most users have
heard — backup.
What is a Backup?
Backup is a term used to denote the process of creating copies of data that can later be restored
in the event of data loss. A backup can be as simple or complicated as you want to make it. Saving a copy of
a file in a different location on the same hard disk could be an example of a very simple backup, whereas
more complex backups may include servers, off-site storage, and even paid guards!
Backups
prove very useful and important in our everyday lives, considering the fact that our computers often hold
gigabytes of irreplaceable information, such as personal and confidential information, financial records,
years of memories in the form of pictures or videos, etc. When it comes to business, data loss can potentially
cost a company hundreds of thousands of dollars or more. Keeping this in mind, it's best to keep yourself
protected by backing up data on a regular basis.
Though backing up your data is necessary and useful, one has to keep in mind that creating backups
has its minuses. The first and most relevant disadvantage is that backing up data takes up quite a bit of time
and has the potential to use a lot of space and money. Another snag is the delusion of being completely
protected against data loss. This misconception can be quite dangerous for businesses and average users
because although there are new available resources for protecting data, in the end, no one will be saved.
So when it comes to backup, one must hope for the best and prepare for the worst.
Though there are many terms to denote various types of backup, there exist three basic types
of backup — full, incremental, and differential, all of which may prove useful in different situations.
Full Backup
Every type of backup starts with a full backup. Therefore, full backups are typically part of an
overall backup plan. A full backup creates copies of all data a user has selected for backup. Unless the user
chooses to use compression, the backup file size will be equal to that of the original data. The advantage to
this type of backup is that it allows you to easily and quickly restore your data. The down side is that
creating full backups tends to be a long process, requiring the user to have a large backup window (the amount
of time reserved for backup). In addition, creating a series of full backups requires a storage device with a
significant amount of disk space, which can be expensive.
Incremental Backup
An incremental backup starts with a full backup. Incremental backups are generally done periodically
according to a schedule. This type backs up only those files that have changed since the last backup of any
type, be it full or incremental. The advantage to incremental backups is that it takes very little time to back
up your files after the initial full backup, requiring a smaller backup window. Concerning the archive size,
incremental backups are optimal, as they require less disk space. However, restoring files which have been
backed up incrementally can be quite a time-consuming task, as you first have to restore the last full backup
and then the following incremental backups in the correct order.
Differential Backup
A differential backup is similar to an incremental backup. They both back up only data that has
been changed. The difference is that a differential backup backs up only those files that have been changed
after the last full backup. This offers many users a happy medium between full and incremental backup. Differential
backup offers the user both a reasonably short backup and restoration time. Nevertheless, this option requires
more disk space than incremental backups, but less than full backups.
Learn more about backups in Wikipedia.
Which Files Should I Back Up?
This
question arises every time a user wants to back up his or her precious files. The first step to answering this
question is thinking about how long it took to gather the data. Things like financial records, address books,
digital photographs, and Internet browser bookmarks are accumulated over long periods of time and should have
high priority. In addition to this, you should consider whether or not files are of any financial importance.
This answers the question of what you should back up. But what about those files which you should not back up?
Users tend to save a lot of unnecessary files “just in case”. Though there is nothing
wrong with this in principle, backing up data costs money and it may not be profitable or reasonable to keep
files that you no longer need. Some examples might be files from previous jobs, rough drafts of already completed
projects, or files compatible only with discontinued software. If you nevertheless wish to back up files
compatible with discontinued or rather unpopular software, you should either save the files in a more modern
and widely used format or back up the program installation files as well. Otherwise, you may not be able to open
the files in the future.
How Often Should I Back Up My Data?
The
answer to this question differs for each person. One of the main factors is how often and to what extent your
files change during a certain period of time. A user who spends eight to twelve hours per day working at his
computer should ideally back up his files more often than someone who works at his or her computer five to ten
hours per week. Another important question is how much time you can devote to the backup process. The ideal time
to back up data is when a computer is not in use. If you have a small backup window, you may not be able to back
up your data very often. You must weight out the need to back up data frequently, the risk of data loss, the
availability of time and decide from there. In general, it is recommended that average computer users back up
their data once per week. The day and time depends on you and your needs. As a general rule, you should back
up your files at the end of the last workday of the week. In the end, it all comes down to one question —
how much are you willing to lose?
Types of Storage Devices
Aside from choosing the right backup type and schedule, it is also important to put some thought
into the storage device where the data will be saved. A backup is only as good as the device on which it is
stored, so below is some information concerning the most modern storage mediums on the market.
External Hard Disks
An
external hard disk is a hard disk separate from your system which more often than not connects to your computer
using the USB or FireWire port. The storage capacity of modern external hard drives can be as high as one
terabyte. With a low cost per gigabyte, external hard drives are a great choice for large backup archives.
In addition, these hard drives are quite compact and portable, making it quite simple to transport to and
from work, keeping backed up data in a separate place from the source.
Though external hard drives are quite durable, they can still be dropped and damaged. An external
hard drive has moving parts, making it susceptible to much of the same physical malfunctions as internal hard
drives. Despite a low cost per gigabyte, external hard drives may not be optimal for smaller backups.
USB Flash Drives
USB flash drives are quite new to the market and offer customers a very convenient and portable
way of transferring and storing information at a relatively low price, explaining their growing popularity.
There are two main types of flash drives — USB 1.0 (1.1) and 2.0. The difference lies in both
storage capacity and speed. USB 2.0 holds a significantly larger amount of data and higher speed. Flash
drives can generally hold up to 64 gigabytes.
Flash drives tend to be expensive if one considers the cost per unit, making it a pricey choice
for larger backups. Due to the fact that USB flash drives are compact and very convenient, they can be lost
very easily.
Hard Drive and USB Flash Drive Manufacturers
Below is a list of the most popular external hard drive and USB drive manufacturers. You can
buy their products on their web sites, as well as in many online and retail stores.
Western Digital
A major player in the data storage media market. Offers a line of external hard drives for both Macintosh users. Online store.
Seagate
A popular manufacturer of hard drives and other storage devices. Offers a line of external hard drives for
Macintosh users. Online store.
Iomega
A manufacturer of consumer external, portable and network storage hardware. Online store.
CD Technology
CDs
are optical discs used for data storage. Discs that can be written on once are known as CD-Rs. Those that
can be written to multiple times are called CD-RWs. The typical storage capacity for these discs is 700MB.
CDs are lightweight and portable, allowing for the convenient movement of backup archives. Concerning the
cost of using CDs as storage media, they have a relatively low cost per unit and therefore are ideal for
both small and large backups. In addition, most modern computers come with CD-R/RW drives.
If a computer doesn't have a CD-R/RW drive you'll need to buy one, which can be a hefty
investment. One disadvantage of CD-RW discs is that the human error factor still plays a role as files can
be deleted and written over. Moreover, CD-RWs become less and less reliable with each use. CDs are failure-prone
due to degradation caused by external factors such as handling the discs and environmental factors. Just like USB
flash drives and external hard drives, CDs have inherent risk due to portability.
DVD Technology
Like CDs, DVDs are optical discs used for data storage. One main difference, however, is that
DVD discs hold a significantly larger amount of data than CDs. Single-sided single-layer DVDs hold up to 4.7
gigabytes. A single-sided double layer DVD holds up to 8.50 gigabytes. DVDs also come in both recordable
(DVD-R) format and rewritable (DVD-RW). DVDs are lightweight and convenient due to their portability.
Not all modern computers have DVD writing drives which means that you may need to buy one.
DVD-RWs are not recommended for archival use since they become less and less reliable with each recording.
Like CDs, DVDs are failure-prone due to degradation caused by external factors. Due to their small size
and portability, DVDs can easily be lost.
CD and DVD Manufacturers
Below is a list of the most popular optical disc manufacturers. You can buy their discs on
their web sites, as well as in many online and retail stores.
Sony
A multinational global leader in electronics. First introduced the compact disc to the world. Online store.
Philips
A major market leader in the world of electronics and healthcare technology. Introduced the compact disc
together with Sony. Online store.
Memorex
A large and popular American manufacturer of blank CD and DVD media. Online store.
Maxell
A Japanese consumer electronics manufacturer.
Online Storage
Many companies offer users the opportunity to back up their data in their data centers via
Internet. Many times, their services include backups as small as 250 megabytes and extend up to hundreds of
gigabytes. These archives are stored offsite, protecting you from disasters in your own home or office.
One of the most attractive advantages is that as a rule, you can access your data from multiple locations.
One of the most uninviting aspects of storing backups online is the uncertainty of your data's
safety. If the company's servers go down you may not be able to access your information, which may cost
you a lot of time or money in a concurrent data loss event. Perhaps more frightening than this might be
hacking. If the company were to be hacked, your information could be stolen, possibly making you a victim
of identity theft or giving away the secrets of your company's business plan. In addition, if the data
storage company goes out of business, you'll lose your backup archive and be forced to start from zero
again. Apart from this, the monthly fees for using such services can be sky high.
Online Data Backup Services
IDrive
Offers backup storage for both Windows and Macintosh users. Includes backup software.
IBackup
Offers backup storage for both Windows and Macintosh users. Includes backup software. Access archives from handheld devices.
Mozy
Offers backup storage for both Windows and Macintosh users. Includes backup software.
Compare various online storage services here — Online Storage Services Review 2009.
Which Backup Storage Media is Right for Me?
Remember those absurdly large round, plastic things called records? What about 8-track cartridges?
8-inch floppy disks? Though you can easily find this media, it can be difficult to find players and drives in
working condition. Users are faced with this problem when they back up their data, as there is no use in
storing your archives on media that will not be used in five to ten years. However, it's impossible to avoid
the freight train advancement of technology. Concerning more short-term plans, you should consider how much
data you need to store, how much money you can afford to invest in your backup, the maximum acceptable restoration
time, the reliability of the device, and the possibility to store it offsite.
One question that everyone needs to ask when choosing a backup storage device is how long the device stores data. The answer to this question is not a simple one. If hard drives really retained data and worked properly as long as the manufacturers said, users would invest money into calendars instead of backup strategies. The fact of the matter is that data retention depends on a number of factors. One of the main factors is the quality of the product itself. As they say, you often pay for what you get. If you buy your backup media from a dependable, high-quality manufacturer, you'll maximize the retention of your data on that device. Another factor of equal importance is the conditions in which you keep the storage media. When it comes to CDs and DVDs, for example, it is recommended that you store them away from bright light sources, as they are quite vulnerable to light, especially sunlight. In addition, you should always store them in their cases to avoid scratching and chipping. Another important factor is dust and water, so you should store them in a closed box or cabinet. You should certainly store any storage media at room temperature, avoiding extremities of hot and cold. If you take such measures to protect your media, it will retain data and function properly much longer.
Taking all of these factors into consideration, you shouldn't keep all of your eggs in one basket.
The safest answer may be that it's best to use a number of different storage devices. However, that may not be
realistic, which is why a good option for the average user is to use an external hard drive with a minimum of
320GB, which connects using either FireWire or USB 2.0, to minimize the transfer time.
What About the Future?
Perhaps in the coming years we will see innovative storage devices offering even more reliable
methods of keeping information safe. Maybe computers will have a longer lifespan than humans. Ideally, there
will come a day when we will not have to think about backing up data. However, considering the variety of
incidents against which we try to insure ourselves by creating backups, backup will always be a necessary,
and at the same time, futile procedure, as the problem of human error and natural disaster cannot be solved
as easily as faulty hard drives and fragile discs.
Ray East III
Last reviewed: February 2009
|