Harvesting the OKR

Structured metadata for OKR content is exposed according to the OAI-PMH (Open Archives Initiative Protocol for Metadata Harvesting) protocol. This enables anyone to import the metadata for the entire OKR, a collection, or for a specific publication.

By following the steps below, repositories around the world that comply with OAI-PMH standards can harvest metadata from content in the OKR. Once metadata from the OKR is ingested into other repositories, users of those repositories are able to easily search, discover, and access World Bank publications.

Virtual sets

Any OAI-PMH-compliant repository can harvest specific publications or thematic sets of content across multiple collections within the OKR. This is done through “virtual sets” based on pre-defined search queries and/or filters. The virtual sets not only save repository managers time in harvesting the OKR content, but also save time spent on curation by ensuring that the items harvested meet their desired criteria.

Queries for virtual sets in the OKR may be as broad as “Health in Africa” or “Climate Change” or as narrow as “Agribusiness in Tanzania” and “Microfinance in Bangladesh”. The following six sample virtual sets have been created and are now available for other repositories to begin ingesting:

To request a new virtual set, repository managers may contact okr@worldbank.org

Syntax

Repository managers have the following options for harvesting metadata from the OKR. Note that there is a limit of 100 results per query. In order to harvest results exceeding this, make use of the resumption token.

1. HARVESTING METADATA OF A SINGLE PUBLICATION

Append the handle of the publication (the number in the URL following the format 10986/####) to a GetRecord query.

Example:
https://openknowledge.worldbank.org/oai/request?verb=GetRecord&metadataPrefix=oai_dc&identifier=oai:openknowledge.worldbank.org:10986/17584

2. HARVESTING METADATA OF PUBLICATIONS WITHIN A COMMUNITY

Reformat and append the handle of the community (the number in the URL following the format 10986/####) to a GetRecord query.

Example:
https://openknowledge.worldbank.org/oai/request?verb=ListRecords&metadataPrefix=oai_dc&set=com_10986_2123

3. HARVESTING METADATA OF PUBLICATIONS WITHIN A COLLECTION

Reformat and append the handle of the collection (the number in the URL following the format 10986/####) to a GetRecord query.

Example:
https://openknowledge.worldbank.org/oai/request?verb=ListRecords&metadataPrefix=oai_dc&set=col_10986_2124