Sharing via OAI-PMH

Sharing via OAI-PMH: Tips for Partners

Introduction

OAI-PMH is a protocol for harvesting the metadata descriptions of objects from an archive. It is the most common way that Plains to Peaks Collective (PPC) partners share their metadata with us, though we also accept metadata shared via spreadsheet, CSV file, and API.

In this guide you will find tips for making the OAI-PMH sharing process smoother between your institution and the PPC. While we have included links to OAI-PMH resources for some of the more common digital repositories (below), we are not experts in how to configure your specific OAI-PMH settings. Each system is different, so you will need to consult your user manual for information about making configuration changes.

OAI-PMH resources for common digital repositories

Islandora
https://github.com/Islandora/islandora_oai

Luna
https://doc.lunaimaging.com/display/V73D/OAI-PMH+in+LUNA

Contentdm
https://www.oclc.org/support/services/contentdm/help/server-admin-help/oai-support.en.html

Metadata schemes

Most institutions create metadata that complies with one of several possible metadata schemes. MODS, Dublin Core (DC), and Qualified Dublin Core (QDC) are the most common schemes that our partners use, though your institution might use a different scheme like METS or MARC, or even a locally created scheme.

Enabling your metadata scheme

It is important that your OAI-PMH feed is configured to share your institution’s optimal metadata scheme. For example, if your institution uses MODS, then you need to enable MODS in your OAI-PMH configuration. Otherwise your OAI-PMH feed will default to Dublin Core and your metadata won’t come through as granular or specific as you intended.

Tip: Be sure to double-check your OAI-PMH feed to make sure that it is outputting your metadata in the scheme you want.

Collections

The PPC harvests metadata at the collection level. We do not have the ability to harvest individual items or to omit specific items from a collection. In other words, we can’t cherry pick items from your collections.

Prior to each quarterly ingest, you will send us a list of collections (also called sets) that you want us to harvest. Ideally, you will also send us the collection name and the number of objects we can expect to find in each collection.

Here is an example of what to send:

Collection Name

Preferred

Collection Number

Required. The number or string should be formatted how it appears in OAI-PMH.

Number of Objects

Preferred

S.N. Leek papers #03138 uwydbuwy~74~74 2473
Victor Gruen papers #05809 uwydbuwy~109~109 64
Estelle Ishigo photographs #10368 uwydbuwy~42~42 116

Other tips

Turn off compound objects

You will need to disable compound objects in your OAI-PMH feed. For example, you might have a book cataloged as a compound object in your collection, which results in a separate metadata record for each page. By turning off compound objects, you will only be sending one record for the book as a whole.

Double check your OAI-PMH feed

Finally, always verify that you are actually sharing all of the fields that you intend. The PPC can only work with the fields that appear in your OAI-PMH feed, so if a field is missing or not mapped correctly it will not show up in the final aggregated metadata.