Harness the Power of Data in the Cloud — Unveil the Future of Tech

AWS registers open data from the National Archives Catalog

NARA, in conjunction with AWS, publicly releases the National Archives Catalog dataset on the AWS Registry of Open Data. This guide assists users in navigating to access theavailable data.

, and Administrator

2025 September 28 . 3:58 AM

2 min read

AWS-Registered Data Catalogue from National Archives Open Data Index

AWS registers open data from the National Archives Catalog

The National Archives and Records Administration (NARA) has made a significant portion of its archival descriptions and authority records available to the public through the AWS Registry of Open Data. This dataset, known as the National Archives Catalog, totals over 261 gigabytes of data and is organized according to records groups and collections.

Users can access this comprehensive dataset using the AWS Command Line Interface (CLI). To pull the full dataset, simply use the command . Here, can be replaced with either or , depending on whether you're interested in descriptions or authority records.

The dataset is structured in a way that each record group or collection directory contains a sequence of JSON files, each representing data for up to 10,000 descriptions or authority records. For instance, the files in record group or collection directories follow the pattern .

If you're interested in specific collections, record groups, or descriptions, you can use more specific commands. To pull descriptions for a specific collection, use . Similarly, to pull descriptions for a specific record group, use . For authority records, the commands are analogous, with the directory replacing the directory in the commands above.

The parent/child relationship of series to file units/items is conveyed for each record through the parentSeries, parentFileUnit, etc. elements within the JSON. This structure provides a clear and organized way to navigate the vast amount of data in the National Archives Catalog.

In addition to the JSON files, the dataset contains URLs for over 148 million digital objects and data from citizen archivist contributions. Users can download the dataset as zip files from specific locations, or they can use the AWS CLI commands to list the full dataset.

Lastly, the dataset can be accessed with a specific ARN, providing a unique identifier for the dataset. This makes it easy to reference and share the National Archives Catalog dataset with others.

Latest

This is the aerial view of a city. in this we can see buildings, towers, motor vehicles,...

Lifestyle

Romania's IPTV: The Future of Viewing Experiences

IPTV is revolutionizing Romania's content consumption. Engage with live polls, AR, and personalized content on your mobile devices. The future is here.

, and Administrator

2025 October 9

In the picture we can see a car engine with pipes, battery in it.

Climate-change

China Boosts EV Safety from 2026 with Mandatory Impact Tests and 'Battery Bazooka'

China's new EV safety rules promise tougher testing. The 'battery bazooka' could revolutionize fire prevention worldwide.

, and Administrator

2025 October 9

This is a paper. On this something is written.

War-and-conflicts

EU Committee Visits Taiwan Amid Rising Hybrid Threats and China Tensions

EU committee visits Taiwan to align against hybrid threats. President Lai Ching-te warns of increasing threats to both Taiwan and the EU.

, and Administrator

2025 October 9

In this image we can see there is a tool box with so many tools in it.

Stay Safe Online with Wise Learner Hub

CyberCX Speeds Up Essential Eight Compliance with New Solution

CyberCX's new solution cuts Essential Eight compliance time from months to days. It's a game-changer for organisations looking to bolster their cybersecurity fundamentals.

, and Administrator

2025 October 9

AWS registers open data from the National Archives Catalog

AWS registers open data from the National Archives Catalog

Read also:

Related

Latest