Report backs PDF variant for long-term archiving

By David Meyer, ZDNet UK
Friday, April 25, 2008 12:33 PM

The United Kingdom's National Archives has welcomed a report that backs a variant of Adobe's portable document format standard as a reliable way of preserving documents for future use.

However, the organization has warned that other file formats will need to still be monitored and considered, as the PDF/Archive (PDF/A) standard can only be one part of a long-term archiving policy.

On Thursday the Digital Preservation Coalition (DPC), a U.K.-based not-for-profit organization with the National Archives and the British Library among the membership, issued a report in which it called for PDF/A to be employed by organizations wanting to "be sure that their documents will be preserved for the long term".

Although PDF is a de facto standard, it is not a recognized ISO standard. PDF/A was therefore created and standardized in 2005 as ISO 19005-1--it is effectively a stripped-down version of PDF in which fonts have to be embedded and audio, video and JavaScript are not allowed.

In the DPC's Preserving the Data Explosion: Using PDF report, author Betsy Fanning wrote that "the development of PDF/A for long-term preservation of electronic documents is a logical use of the file format".

"When PDF/A is combined with a comprehensive records management program and formally established records policies and procedures, an organization can be sure that their electronic documents will be preserved," she added.

"We are only at the beginning of the implementation and adoption of PDF/A as an electronic preservation file format, however, it is hoped that PDF/A will be widely adopted as the long-term preservation file format for the future," Fanning wrote.

However, Fanning warned: "While PDF/A may be a suitable file format today for long-term preservation of electronic documents, it should be noted that there may be other file formats introduced in the future that may better serve the needs of an organization. Therefore, organizations should be continually reviewing the available file formats to ensure they have selected the best format for their purposes."

Adrian Brown, head of digital preservation at the National Archives said: "This report highlights the challenges we all face in a digital age. Using PDF/A as a standard will help information officers ensure that key business data survives. But it should never be viewed as the Holy Grail. It is merely a tool in the armory of a well-thought-out records-management policy."

Although the report addresses many rival document formats, including the OpenDocument Format (ODF), it highlights two Microsoft formats in its conclusion as being worth monitoring in the future: the XML Paper Specification (XPS) and Office Open XML (OOXML).

The National Archives has been working with Microsoft since mid-2007 to tackle its many legacy Microsoft-formated documents that can no longer be read by current Microsoft software.


WORTHWHILE?

0

0 votes
Blog

Talkback 0 comments

There are currently no comments for this post.


Tech Jobs Now!

Search for your ideal tech job:

Hands-on programming: Extract plain text from documents with Syncfusion's components

Web Development

Justin James recently tried Syncfusion's Essential DocIO and Essential PDF to help him extract text from documents he downloaded from the Internet. Here's the code he wrote to get the plain text.


Read more »



Will technology divide us further?

Blog thumbnail

So I finally watched 2012 over the weekend, but the film left me feeling extremely agitated.

The possibility that the world may meet its watery end in three years didn't..... by Eileen Yu

Read more »

Tags

  1. antivirus
  2. apple ipod
  3. cnet networks inc.
  4. desktop
  5. e - mail
  6. hard drive
  7. intuit inc.
  8. mcafee inc.
  9. microsoft corp.
  10. microsoft windows
  11. microsoft windows vista
  12. microsoft windows xp
  13. norton co.
  14. pc
  15. performance
  16. security
  17. software
  18. tool
  19. web
  20. web site