Harvesting the OKR
Structured metadata for OKR content is exposed according to the OAI-PMH (Open Archives Initiative Protocol for Metadata Harvesting) protocol. This enables anyone to import the metadata for the entire OKR, a collection, or for a specific publication.
By following the steps below, repositories around the world that comply with OAI-PMH standards can harvest metadata from content in the OKR. Once metadata from the OKR is ingested into other repositories, users of those repositories are able to easily search, discover, and access World Bank publications.
Any OAI-PMH-compliant repository can harvest specific publications or thematic sets of content across multiple collections within the OKR. This is done through “virtual sets” based on pre-defined search queries and/or filters. The virtual sets not only save repository managers time in harvesting the OKR content, but also save time spent on curation by ensuring that the items harvested meet their desired criteria.
Queries for virtual sets in the OKR may be as broad as “Health in Africa” or “Climate Change” or as narrow as “Agribusiness in Tanzania” and “Microfinance in Bangladesh”. The following six sample virtual sets have been created and are now available for other repositories to begin ingesting:
- Health in Africa
- Research relating to China
- Energy in India
- Microfinance in Bangladesh
- Agribusiness in Tanzania
- Climate Change
To request a new virtual set, repository managers may contact firstname.lastname@example.org
Repository managers have the following options for harvesting metadata from the OKR. Note that there is a limit of 100 results per query. In order to harvest results exceeding this, make use of the resumption token.
1. HARVESTING METADATA OF A SINGLE PUBLICATION
Append the handle of the publication (the number in the URL following the format 10986/####) to a GetRecord query.
2. HARVESTING METADATA OF PUBLICATIONS WITHIN A COMMUNITY
Reformat and append the handle of the community (the number in the URL following the format 10986/####) to a GetRecord query.
3. HARVESTING METADATA OF PUBLICATIONS WITHIN A COLLECTION
Reformat and append the handle of the collection (the number in the URL following the format 10986/####) to a GetRecord query.