Versions Compared


  • This line was added.
  • This line was removed.
  • Formatting was changed.

All DPLA data in the DPLA repository is available for download as zipped JSON and parquet files on Amazon Simple Storage Service (S3) in the bucket named s3://dpla-provider-export.

For more details about how to access and download these files from S3, see the S3 documentation.

Current JSON format

Files are formatted asĀ JSONL, and have the following structure. Every line is a JSON object.


This is a straight dump of an Elasticsearch index and has some fields outside of "_source" that you can ignore.

Former JSON file formats

Before August 2018 the file format was as follows. Note that this is a JSON array.