Web Data Connections for Archived Records
In the digital age, preserving the integrity and accessibility of digital records, including web records, has become a crucial task. One such initiative is the Web Records Preservation Plan, a managed digital preservation activity by various organisations, with the National Records of Scotland (NRS) Web Continuity Service being a notable example[1].
The Web Records Preservation Plan focuses on ensuring the ongoing accessibility and usability of digital records that have enduring value. Central to this effort is the understanding and preservation of significant properties for web records. These properties, documented in the Web Records Preservation Plan, relate to essential attributes that must be preserved to keep digital objects accessible, meaningful, and trustworthy over time.
The significant properties of digital records typically include the appearance, behaviour, quality, and usability. Appearance refers to how the record looks, including formatting, layout, and visual elements. Behaviour encompasses how interactive or dynamic elements function. Quality pertains to the integrity and completeness of the digital content. Usability ensures that the record remains functional and can be used as originally intended[3].
Preserving these significant properties is critical for maintaining authenticity, ensuring the record is free from tampering or corruption, and supporting appraisal, selection, metadata development, and future preservation strategies[3].
The National Archives and Records Administration (NARA) is another organisation contributing to this digital preservation effort. NARA makes its Linked Open Data available for public access on its website www.our website.gov. This data includes the Digital Preservation Framework, which is available in RDF Turtle (.ttl) format, the same as the version on GitHub[2].
Web records are sent from a server to a browser via Hypertext Transfer Protocol (HTTP). File extensions for related technologies include .xhtm or .xhtml for Extensible Hypertext Markup Language, .css for Cascading Style Sheets, and .xfdl for XFDL. These files are categorised under Software and Code, with some also categorised under Structured Data and Textual and Word Processing[2].
The Digital Preservation Framework, as Linked Open Data, includes the same elements as the version on GitHub. The CDX Internet Archive Index file is also categorised under Web Records. RDF Turtle (.ttl) files, including those containing the Digital Preservation Framework, can be opened in any text editor[2].
In conclusion, the Web Records Preservation Plan aims to preserve these significant properties so that digital records, including archived websites, remain authentic, reliable, and usable for current and future generations[1][3]. This collaborative effort is a significant step towards ensuring the longevity and integrity of digital records, contributing to our collective digital heritage.
[1] National Records of Scotland. (2017). Web Continuity Service. Retrieved from https://www.nrscotland.gov.uk/research-and-publications/research/web-continuity-service [2] National Archives and Records Administration. (n.d.). NARA Linked Open Data. Retrieved from https://www.our website.gov/records-management/guides/electronic-records/naras-linked-open-data [3] National Archives and Records Administration. (n.d.). Web Records Preservation Plan. Retrieved from https://www.our website.gov/records-management/guides/electronic-records/web-records-preservation-plan
Data-and-cloud-computing technology plays a significant role in the Web Records Preservation Plan, as it facilitates the management and preservation of digital records, ensuring their ongoing accessibility and usability. This technology is employed to document and preserve the significant properties of digital records, such as appearance, behavior, quality, and usability, which are essential for maintaining authenticity and reliability.