Jump to content

Digital permanence: Difference between revisions

From Wikipedia, the free encyclopedia
Content deleted Content added
Reverted 1 edit by HabibzadehParham (talk). (TW)
mNo edit summary
 
(30 intermediate revisions by 22 users not shown)
Line 1: Line 1:
{{Short description|Archival of data}}
'''Digital permanence''' addresses the history and development of digital storage techniques specifically quantifying the expected lifetime of data stored on various digital media and the factors which influence the ''permanence'' of digital data. It is often a mix of ensuring the data itself can be retained on a particular form of media and that the technology remains viable. Where possible, as well as describing expected lifetimes, factors affecting data retention will be detailed including potential technology issues.
'''Digital permanence''' addresses the history and development of [[digital storage]] techniques, specifically quantifying the expected [[lifetime of data]] stored on various [[digital media]] and the factors which influence the ''permanence'' of digital data. It is often a mix of ensuring the data itself can be retained on a particular form of media and that the technology remains viable. Where possible, as well as describing expected lifetimes, factors affecting [[data retention]] will be detailed, including potential technology issues.


Since the inception of computers, a key concept differentiating computers from other calculating machines has been their ability to store information. Over the years, various hardware device have been designed to store ever larger quantities of data. With the development of the Internet the quantity of information available appears to continue to grow at an ever increasing rate often characterised as an [[Information explosion|Information Explosion]]. As information stored on traditional media such as hand-written documents, printed books, photographic images and the likes is being replaced by digital files so our social and cultural legacy to future generations will depend more and more on the permanence of digital information.
Since the inception of automatic computers, a key difference between them and other calculating machines has been their ability to store information. Over the years, various hardware devices have been designed to store ever larger quantities of data. With the development of the Internet the quantity of information available appears to continue to grow at an ever-increasing rate often characterised as an [[information explosion]]. As information is increasingly being stored on electronic media as opposed to traditional media such as hand-written documents, printed books, and photographic images, humanity's social and cultural legacy to future generations will depend increasingly on the permanence of these new media.


However, not all this information is worth saving for any length of time; sometimes its value can be very short-lived. Other data, such as the contents of the Wikipedia we might like to keep around ... well, forever. This article describes how reliable different types of storage media are at storing data over time and factors affecting this reliability.
However, not all of this information is worth saving; sometimes its value can be short-lived. Other data, such as legal contracts, literature, scientific studies, are expected to last for centuries. This article describes how reliable different types of storage media are at storing data over time and factors affecting this reliability.


Librarians and Archivists responsible for large repositories of information take a deeper view of electronic archives.
[[Librarian]]s and [[archivist]]s responsible for large repositories of information take a deeper view of [[electronic archive]]s.
{{block indent | em = 1.5 | text =<nowiki/>
:; Data format : Data must be stored in a format which can be meaningfully accessed now and in the future.
; [[Recording format|Data format]]: Data must be stored in a format which can be meaningfully accessed now and in the future.
:; Technology reliance : If data requires a special program to view it, say, as an image, then software must also be available to both interpret the basic data file and also render it appropriately. In some cases, this might also require special hardware.
; Technology reliance: If data requires a special program to view it, say, as an image, then software must also be available to both interpret the basic data file and also render it appropriately. In some cases, this might also require special hardware.
:; Archival strategy : Data must remain available in the long term.
; Archival strategy: Data must remain available in the long term.
:: At present a growing problem is the time taken to reproduce an archive, for instance following a hardware or system upgrade. Since the sheer volume of archive data continues to grow, new hardware is always required to maintain the archive and so regular migration of data to a new system must be performed on a regular basis. The time taken to migrate data is starting to approach the frequency of system upgrade such that archive transfer will become a continuous, never-ending process.<ref name="Burk">Burk, Alan; James Kerr; and Andy Pope. "The Credibility of Electronic Publishing". Available at [http://web.mala.bc.ca/hssfc/Final/Archiving.htm web.mala.bc.ca]</ref>
: At present, a growing problem is the time taken to reproduce an archive, for instance following a hardware or system upgrade. Since the sheer volume of archive data continues to grow, new hardware is always required to maintain the archive and so regular migration of data to a new system must be performed on a regular basis. The time taken to migrate data is starting to approach the frequency of system upgrade, such that archive transfer will become a continuous, never-ending process.<ref name="Burk">Burk, Alan; James Kerr; and Andy Pope. "The Credibility of Electronic Publishing". Available at [http://web.mala.bc.ca/hssfc/Final/Archiving.htm web.mala.bc.ca]</ref>
:; Digital rights management : Maintaining digital information in an accurate and accessible format over an extended retention period also must address the requirements of the authors' digital rights.
:: In many cases the data may include proprietary information that should not be accessible to all, but only to a defined group of users who understand or have legally agreed to only utilize the information in limited ways so as to protect the proprietary rights of the original authoring team. Maintaining this requirement over decades can be a challenge that requires processes and tools to ensure total compliance.
; [[Digital rights management]]: Maintaining digital information in an accurate and accessible format over an extended retention period also must address the requirements of the authors' digital rights.
: In many cases, the data may include proprietary information that should not be accessible to all, but only to a defined group of users who understand or have legally agreed to only utilize the information in limited ways so as to protect the proprietary rights of the original authoring team. Maintaining this requirement over decades can be a challenge that requires processes and tools to ensure total compliance.
:; Reproducibility : Digital information must be able to be reproduced as originally intended or available.
; [[Reproducibility]]: Digital information must be able to be reproduced as originally intended or available.
:: This is significant especially where the original data was produced on technology at a lower level than currently possible. For example, archivists try to maintain the distinction between listening to gramophone record played on a gramophone as opposed to a digitally cleaned version of the same recording though a modern hi-fi system.
: This is significant especially where the original data was produced on technology at a lower level than currently possible. For example, archivists try to maintain the distinction between listening to a gramophone record played on a gramophone as opposed to a digitally cleaned version of the same recording through a modern hi-fi system.
}}

Given that individuals' personal data also seems to be growing at an alarming rate,<ref name="Sweeney">Sweeny, Latanya. "Information Explosion. Available at [http://privacy.cs.cmu.edu/people/sweeney/explosion.html privacy.cs.cmu.edu]</ref> these archiving issues affecting professional repositories will soon be manifest in small organisations and even the home.
Given that individuals' personal data has been growing at a rapid rate in the 21st century,<ref name="Sweeney">Sweeny, Latanya. "Information Explosion. Available at [http://privacy.cs.cmu.edu/people/sweeney/explosion.html privacy.cs.cmu.edu] {{webarchive|url=https://web.archive.org/web/20100118090334/http://privacy.cs.cmu.edu/people/sweeney/explosion.html |date=2010-01-18 }}</ref> these archiving issues affecting professional repositories will soon be manifest in small organisations and even the home.


==Types of storage==
==Types of storage==


===Solid-state memory devices===
===Solid-state memory devices===
Digital computers, in particular, make use of two forms of [[computer memory|memory]] known as [[RAM]] or [[Read-only memory|ROM]] and although the most common form today is RAM, designed to retain data while the computer is powered on, this was not always the case. Nor is ''active'' memory the only form used; ''passive'' memory devices are now in common use in digital cameras.
Digital computers, in particular, make use of two forms of [[computer memory|memory]] known as [[random-access memory|RAM]] or [[read-only memory|ROM]] and although the most common form today is RAM, designed to retain data while the computer is powered on, this was not always the case. Nor is ''active'' memory the only form used; ''passive'' memory devices are now in common use in digital cameras.


* [[Magnetic core memory|Magnetic]], or ferrite core, data retention is dependent on the magnetic properties of iron and its compounds.
* [[Magneti-core memory|Magnetic]], or ferrite core, data retention is dependent on the magnetic properties of iron and its compounds.
* [[PROM]], or ''programmable read-only memory'', stores data in a fixed form during the manufacturing process, with data retention dependent on the life expectancy of the device itself.
* [[Programmable read-only memory|PROM]], or ''programmable read-only memory'', stores data in a fixed form during the manufacturing process, with data retention dependent on the life expectancy of the device itself.
* [[EPROM]], or ''erasable programmable read-only memory'', is similar to PROM but can be cleared by exposure to [[ultraviolet]] light.
* [[EPROM]], or ''erasable programmable read-only memory'', is similar to PROM but can be cleared by exposure to [[ultraviolet]] light.
* [[EEPROM]], or ''electrically erasable programmable read-only memory'', is the format used by [[flash memory]] devices and can be erased and rewritten electronically. These devices tend to be extraordinarily resilient; in a 2005 destructive test, a USB key survived boiling in a custard pie, being run-over by a truck and fired from a mortar at a brick wall.<ref name="USB">[http://www.skyone.co.uk/programmes/brainiac/ Sky One]</ref> Although physically damaged after the final test, some deft soldering restored the device and data was successfully retrieved.
* [[EEPROM]], or ''electrically erasable programmable read-only memory'', is the format used by [[flash memory]] devices and can be erased and rewritten electronically.


===Magnetic media===
===Magnetic media===
[[File:magtape1.jpg|thumb|{{convert|7|in|cm|adj=on|0}} reel of {{convert|1/4|in|mm|0|adj=mid|wide}} audio recording tape, typical of consumer use in the 1950s–70s]]


[[Magnetic tape]]s consist of narrow bands of a magnetic medium bonded in paper or plastic. The magnetic medium passes across a semi-fixed head which reads or writes data. Typically magnetic media has maximum lifetime of about 50 years<ref name="Adelstein">Adelstein, Peter Z. "Permanence of Digital Information". Available at [http://www.ica.org/citra/citra.budapest.1999.eng/adelstein.rtf www.ica.org].</ref> although this assumes optimal storage conditions; life expectancy can decrease rapidly depending on storage conditions and the resilience and reliability of hardware components.
[[Magnetic tape]]s consist of narrow bands of a magnetic medium bonded in paper or plastic. The magnetic medium passes across a semi-fixed head which reads or writes data. Typically, magnetic media has a maximum lifetime of about 50 years<ref name="Adelstein">Adelstein, Peter Z. "Permanence of Digital Information". Available at [http://www.ica.org/citra/citra.budapest.1999.eng/adelstein.rtf www.ica.org] {{Webarchive|url=https://web.archive.org/web/20040926035349/http://www.ica.org/citra/citra.budapest.1999.eng/adelstein.rtf |date=2004-09-26 }}.</ref> although this assumes optimal storage conditions; life expectancy can decrease rapidly depending on storage conditions and the resilience and reliability of hardware components.
* magnetic tape reels
* magnetic tape reels
* magnetic stripe cards
* magnetic stripe cards
Line 45: Line 48:
* [[punched tape|punched paper-tape]]
* [[punched tape|punched paper-tape]]
* [[punched cards]]
* [[punched cards]]
* optical media (rotating media combined with a moveable read/write head comprising a laser), such as:
* [[optical media]] (rotating media combined with a movable read/write head comprising a laser), such as:
** pressed [[CD-ROM]]s
** pressed [[CD-ROM]]s and [[DVD-ROM]]s
** [[Write once read many]] (WORM) media such as [[CD-R]], [[DVD±R]], [[BD-R]].
** WORMs, CD-R
** Rewriteable media such as [[CD-RW]], [[DVD±RW]], [[BD-RE]].
** [[DVD]]s
** Some disc types can have multiple data layers for greater storage capacity.
** multi-layer DVDs


===Printing technology===
===Printing technology===
Although not a digital storage medium in itself, printing hard-copies of documents and images remains a popular means of representing digital data and possibly acquires the qualities associated with original documents especially their potential for endurance. More recent advances in printer technology have raised the quality of photographic images in particular. Unfortunately the permanence of printed documents cannot be easily discerned from the documents themselves.
Printing hard-copies of documents and images is a popular means of representing digital data and possibly acquires the qualities associated with original documents, especially their potential for endurance. More recent advances in printer technology have raised the quality of photographic images in particular. Unfortunately, the permanence of printed documents cannot be easily discerned from the documents themselves.
* wet-ribbon inked printers
* wet-ribbon inked printers
* heat sensitive papers, such as FAX rolls
* [[heat-sensitive paper]]s, such as FAX rolls
* NCR and other ''carbon'' technologies
* NCR and other ''carbon'' technologies
* ink-jet printers
* [[ink-jet printer]]s
** wax-based inks e.g. DataProducts SI810
** wax-based inks e.g. DataProducts SI810
** water-based inks
** water-based inks
** other bases
** other bases
* mono laser printers
* mono [[laser printer]]s
* colour laser printers
* colour laser printers


===Financial Driven Resources===
===Financial Driven Resources===
A way of preserving digital content through means of financial trusts. The data is driven with financial investments typically assigned to a Trust Company which pay traditional storage providers to house data for long periods of time with the interest gained on the principle. In 2008 a series of companies such as LivingStory.com and Orysa.com started offering these services to store point in time accounting data and provide consumer archive services.
A way of preserving digital content through means of [[financial trust]]s. The data is driven with financial investments typically assigned to a Trust Company which pay traditional storage providers to house data for long periods of time with the interest gained on the principal. In 2008 a series of companies such as LivingStory.com and Orysa.com started offering these services to store point in time accounting data and provide consumer archive services.


===Soft storage technology===
===Soft storage technology===
The short-comings of some storage media is already well recognised and various attempts have been made to supplement the permanence of an under-lying technology. These "soft storage technologies" enhance their base technology by applying software or system techniques often within quite narrow fields of data storage and not always with the explicit intention of improving digital permanence.
The short-comings of some storage media is already well recognised and various attempts have been made to supplement the permanence of an under-lying technology. These "soft storage technologies" enhance their base technology by applying software or system techniques often within quite narrow fields of data storage and not always with the explicit intention of improving digital permanence.
* RAID systems
* [[RAID]] systems
* Distributed systems, such as [[BitTorrent (protocol)|bitTorrent]]
* Distributed systems, such as [[BitTorrent (protocol)|BitTorrent]]
* networked backup services
* [[networked backup]] services
* public archive repositories
* public archive repositories
* web-site archives
* web-site archives
Line 76: Line 79:


==See also==
==See also==
* [[Preservation (library and archival science)]]
* [http://www.nationalarchives.gov.uk/information-management/policies/survival-of-records.htm UK Digital Archive Strategy]
* [[Print permanence]]


==References==
==References==
<references/>
<references/>

==External links==
* [http://www.nationalarchives.gov.uk/information-management/policies/survival-of-records.htm UK Digital Archive Strategy]



{{DEFAULTSORT:Digital Permanence}}
{{DEFAULTSORT:Digital Permanence}}

Latest revision as of 17:14, 29 November 2023

Digital permanence addresses the history and development of digital storage techniques, specifically quantifying the expected lifetime of data stored on various digital media and the factors which influence the permanence of digital data. It is often a mix of ensuring the data itself can be retained on a particular form of media and that the technology remains viable. Where possible, as well as describing expected lifetimes, factors affecting data retention will be detailed, including potential technology issues.

Since the inception of automatic computers, a key difference between them and other calculating machines has been their ability to store information. Over the years, various hardware devices have been designed to store ever larger quantities of data. With the development of the Internet the quantity of information available appears to continue to grow at an ever-increasing rate often characterised as an information explosion. As information is increasingly being stored on electronic media as opposed to traditional media such as hand-written documents, printed books, and photographic images, humanity's social and cultural legacy to future generations will depend increasingly on the permanence of these new media.

However, not all of this information is worth saving; sometimes its value can be short-lived. Other data, such as legal contracts, literature, scientific studies, are expected to last for centuries. This article describes how reliable different types of storage media are at storing data over time and factors affecting this reliability.

Librarians and archivists responsible for large repositories of information take a deeper view of electronic archives.

Data format
Data must be stored in a format which can be meaningfully accessed now and in the future.
Technology reliance
If data requires a special program to view it, say, as an image, then software must also be available to both interpret the basic data file and also render it appropriately. In some cases, this might also require special hardware.
Archival strategy
Data must remain available in the long term.
At present, a growing problem is the time taken to reproduce an archive, for instance following a hardware or system upgrade. Since the sheer volume of archive data continues to grow, new hardware is always required to maintain the archive and so regular migration of data to a new system must be performed on a regular basis. The time taken to migrate data is starting to approach the frequency of system upgrade, such that archive transfer will become a continuous, never-ending process.[1]
Digital rights management
Maintaining digital information in an accurate and accessible format over an extended retention period also must address the requirements of the authors' digital rights.
In many cases, the data may include proprietary information that should not be accessible to all, but only to a defined group of users who understand or have legally agreed to only utilize the information in limited ways so as to protect the proprietary rights of the original authoring team. Maintaining this requirement over decades can be a challenge that requires processes and tools to ensure total compliance.
Reproducibility
Digital information must be able to be reproduced as originally intended or available.
This is significant especially where the original data was produced on technology at a lower level than currently possible. For example, archivists try to maintain the distinction between listening to a gramophone record played on a gramophone as opposed to a digitally cleaned version of the same recording through a modern hi-fi system.

Given that individuals' personal data has been growing at a rapid rate in the 21st century,[2] these archiving issues affecting professional repositories will soon be manifest in small organisations and even the home.

Types of storage

[edit]

Solid-state memory devices

[edit]

Digital computers, in particular, make use of two forms of memory known as RAM or ROM and although the most common form today is RAM, designed to retain data while the computer is powered on, this was not always the case. Nor is active memory the only form used; passive memory devices are now in common use in digital cameras.

  • Magnetic, or ferrite core, data retention is dependent on the magnetic properties of iron and its compounds.
  • PROM, or programmable read-only memory, stores data in a fixed form during the manufacturing process, with data retention dependent on the life expectancy of the device itself.
  • EPROM, or erasable programmable read-only memory, is similar to PROM but can be cleared by exposure to ultraviolet light.
  • EEPROM, or electrically erasable programmable read-only memory, is the format used by flash memory devices and can be erased and rewritten electronically.

Magnetic media

[edit]
7-inch (18 cm) reel of 14-inch wide (6 mm) audio recording tape, typical of consumer use in the 1950s–70s

Magnetic tapes consist of narrow bands of a magnetic medium bonded in paper or plastic. The magnetic medium passes across a semi-fixed head which reads or writes data. Typically, magnetic media has a maximum lifetime of about 50 years[3] although this assumes optimal storage conditions; life expectancy can decrease rapidly depending on storage conditions and the resilience and reliability of hardware components.

Magnetic disks and drums include a rotating magnetic medium combined with a movable read/write head.

Non-magnetic media

[edit]

Printing technology

[edit]

Printing hard-copies of documents and images is a popular means of representing digital data and possibly acquires the qualities associated with original documents, especially their potential for endurance. More recent advances in printer technology have raised the quality of photographic images in particular. Unfortunately, the permanence of printed documents cannot be easily discerned from the documents themselves.

Financial Driven Resources

[edit]

A way of preserving digital content through means of financial trusts. The data is driven with financial investments typically assigned to a Trust Company which pay traditional storage providers to house data for long periods of time with the interest gained on the principal. In 2008 a series of companies such as LivingStory.com and Orysa.com started offering these services to store point in time accounting data and provide consumer archive services.

Soft storage technology

[edit]

The short-comings of some storage media is already well recognised and various attempts have been made to supplement the permanence of an under-lying technology. These "soft storage technologies" enhance their base technology by applying software or system techniques often within quite narrow fields of data storage and not always with the explicit intention of improving digital permanence.

  • RAID systems
  • Distributed systems, such as BitTorrent
  • networked backup services
  • public archive repositories
  • web-site archives
  • financial trust resources

See also

[edit]

References

[edit]
  1. ^ Burk, Alan; James Kerr; and Andy Pope. "The Credibility of Electronic Publishing". Available at web.mala.bc.ca
  2. ^ Sweeny, Latanya. "Information Explosion. Available at privacy.cs.cmu.edu Archived 2010-01-18 at the Wayback Machine
  3. ^ Adelstein, Peter Z. "Permanence of Digital Information". Available at www.ica.org Archived 2004-09-26 at the Wayback Machine.
[edit]