Publication: A Metadata Schema for Data from Experiments in the Social Sciences
Loading...
Files
703 downloads
20 downloads
Date
2023-02
ISSN
Published
2023-02
Editor(s)
Abstract
The use of randomized controlled trials (RCTs) in the social sciences has greatly expanded, resulting in newly abundant, high-quality data that can be reused to perform methods research in program evaluation, to systematize evidence for policymakers, and for replication and training purposes. However, potential users of RCT data often face significant barriers to discovery and reuse. This paper proposes a metadata schema that standardizes RCT data documentation and can serve as the basis for one—or many, interoperable —data catalogs that make such data easily findable, searchable, and comparable, and thus more readily reusable for secondary research. The schema is designed to document the unique properties of RCT data. Its set of fields and associated encoding schemes (acceptable formats and values) can be used to describe any dataset associated with a social science RCT. The paper also makes recommendations for implementing a catalog or database based on this metadata schema.
Link to Data Set
Citation
“Cavanagh, Jack; Fliegner, Jasmin Claire; Kopper, Sarah; Sautmann, Anja. 2023. A Metadata Schema for Data from Experiments in the Social Sciences. Policy Research Working Papers;10296. © World Bank. http://hdl.handle.net/10986/39414 License: CC BY 3.0 IGO.”
Associated URLs
Associated content
Other publications in this report series
Journal
Journal Volume
Journal Issue
Collections
Related items
Showing items related by metadata.
Publication Using Locational Data from Mobile Phones to Enhance the Science of Delivery(World Bank, Washington, DC, 2014-06)The objective of this report is to examine the potential of locational data for the 'science of delivery' in the field of development. The 'science of delivery' is a term popularized by the World Bank President, Jim Yong Kim, and refers to using evidence-based experimentation to improve development outcomes (Walji, 2013). In this context, locational data is a new tool that is starting to be used in a variety of development fields including health, education, disaster risk management, traffic planning etc. this broad introduction to the topic in chapter one, the next chapter explores the technology behind locational data. Chapter three presents the methodology followed in this research and chapter four, which is the heart of this report, then presents a series of mini case studies of how it is actually being used in a representative sample of different development fields. This is the 'evidence-based experimentation' which can be harnessed to improve the 'science of delivery', and examples of both active and passive collection of locational data are presented. Finally, chapter five examines, in broader terms, the longer term potential of locational data as a development tool, once smartphone ownership becomes more widespread.Publication Open Data Challenges and Opportunities for National Statistical Offices(Washington, DC, 2014-07-01)Open Data initiatives are transforming how governments and other public institutions interact and provide services to their constituents. They increase transparency and value to citizens, reduce inefficiencies and barriers to information, enable data-driven applications that improve public service delivery, and provide public data that can stimulate innovative business opportunities. As the gatekeepers of official statistics, National Statistics Offices (NSOs) produce many datasets that could typically comprise the foundation of an Open Data program. They may also have relationships with other data producing agencies in the national statistical system and have expertise in dealing with the many technical and data quality issues attendant in publishing data. In short, they are extremely well placed to make a valuable contribution to Open Data initiatives. Despite these advantages, NSOs do not always feature prominently in government-sponsored Open Data programs and they may be missing an important opportunity to expand the use and re-use of the data they produce. The goal of this working paper is to better understand the opportunities and challenges that Open Data presents to NSOs and to identify what steps and solutions are needed to enable NSOs to play a valuable role in national or sub-national Open Data initiatives.Publication Central America : Big Data in Action for Development(Washington, DC, 2014)This report stemmed from a World Bank pilot activity to explore the potential of big data to address development challenges in Central American countries. As part of this activity we collected and analyzed a number of examples of leveraging big data for development. Because of the growing interest in this topic this report makes available to a broader audience those examples as well as the underlying conceptual framework to think about big data for development. To make effective use of big data, many practitioners emphasize the importance of beginning with a question instead of the data itself. A question clarifies the purpose of utilizing big data, whether it is for awareness, understanding, and/or forecasting. In addition, a question suggests the kinds of real-world behaviors or conditions that are of interest. These behaviors are encoded into data through some generating process which includes the media through which behavior is captured. Then various data sources are accessed, prepared, consolidated and analyzed. This ultimately gives rise to insights into the question of interest, which are implemented to effect changes in the relevant behaviors. Utilizing big data for any given endeavor requires a host of capabilities. Hardware and software capabilities are needed for interaction of data from a variety of sources in a way which is efficient and scalable. Human capabilities are needed not only to make sense of data but to ensure a question-centered approach, so that insights are actionable and relevant. To this end, cooperation between development experts as well as social scientists and computer scientists is extremely important.Publication The Entry of Randomized Assignment into the Social Sciences(World Bank, Washington, DC, 2017-05)Although the concept of randomized assignment to control for extraneous factors reaches back hundreds of years, the first empirical use appears to have been in an 1835 trial of homeopathic medicine. Throughout the 19th century, there was primarily a growing awareness of the need for careful comparison groups, albeit often without the realization that randomization could be a particularly clean method to achieve that goal. In the second and more crucial phase of this history, four separate but related disciplines introduced randomized control trials within a few years of one another in the 1920s: agricultural science, clinical medicine, educational psychology, and social policy (specifically political science). Randomized control trials brought more rigor to fields that were in the process of expanding their purviews and focusing more on causal relationships. In the third phase, the 1950s through the 1970s saw a surge of interest in more applied randomized experiments in economics and elsewhere, in the lab and especially in the field.Publication Cash or Condition? Evidence from a Cash Transfer Experiment(2010-03-01)Conditional Cash Transfer programs are "...the world's favorite new anti-poverty device," (The Economist, July 29 2010) yet little is known about the specific role of the conditions in driving their success. In this paper, we evaluate a unique cash transfer experiment targeted at adolescent girls in Malawi that featured both a conditional (CCT) and an unconditional (UCT) treatment arm. We find that while there was a modest improvement in school enrollment in the UCT arm in comparison to the control group, this increase is only 43 percent as large as the CCT arm. The CCT arm also outperformed the UCT arm in tests of English reading comprehension. The schooling condition, however, proved costly for important non-schooling outcomes: teenage pregnancy and marriage rates were substantially higher in the CCT than the UCT arm. Our findings suggest that a CCT program for early adolescents that transitions into a UCT for older teenagers would minimize this trade-off by improving schooling outcomes while avoiding the adverse impacts of conditionality on teenage pregnancy and marriage.
Users also downloaded
Showing related downloaded files
Publication The Container Port Performance Index 2023(Washington, DC: World Bank, 2024-07-18)The Container Port Performance Index (CPPI) measures the time container ships spend in port, making it an important point of reference for stakeholders in the global economy. These stakeholders include port authorities and operators, national governments, supranational organizations, development agencies, and other public and private players in trade and logistics. The index highlights where vessel time in container ports could be improved. Streamlining these processes would benefit all parties involved, including shipping lines, national governments, and consumers. This fourth edition of the CPPI relies on data from 405 container ports with at least 24 container ship port calls in the calendar year 2023. As in earlier editions of the CPPI, the ranking employs two different methodological approaches: an administrative (technical) approach and a statistical approach (using matrix factorization). Combining these two approaches ensures that the overall ranking of container ports reflects actual port performance as closely as possible while also being statistically robust. The CPPI methodology assesses the sequential steps of a container ship port call. ‘Total port hours’ refers to the total time elapsed from the moment a ship arrives at the port until the vessel leaves the berth after completing its cargo operations. The CPPI uses time as an indicator because time is very important to shipping lines, ports, and the entire logistics chain. However, time, as captured by the CPPI, is not the only way to measure port efficiency, so it does not tell the entire story of a port’s performance. Factors that can influence the time vessels spend in ports can be location-specific and under the port’s control (endogenous) or external and beyond the control of the port (exogenous). The CPPI measures time spent in container ports, strictly based on quantitative data only, which do not reveal the underlying factors or root causes of extended port times. A detailed port-specific diagnostic would be required to assess the contribution of underlying factors to the time a vessel spends in port. A very low ranking or a significant change in ranking may warrant special attention, for which the World Bank generally recommends a detailed diagnostic.Publication Global Economic Prospects, January 2025(Washington, DC: World Bank, 2025-01-16)Global growth is expected to hold steady at 2.7 percent in 2025-26. However, the global economy appears to be settling at a low growth rate that will be insufficient to foster sustained economic development—with the possibility of further headwinds from heightened policy uncertainty and adverse trade policy shifts, geopolitical tensions, persistent inflation, and climate-related natural disasters. Against this backdrop, emerging market and developing economies are set to enter the second quarter of the twenty-first century with per capita incomes on a trajectory that implies substantially slower catch-up toward advanced-economy living standards than they previously experienced. Without course corrections, most low-income countries are unlikely to graduate to middle-income status by the middle of the century. Policy action at both global and national levels is needed to foster a more favorable external environment, enhance macroeconomic stability, reduce structural constraints, address the effects of climate change, and thus accelerate long-term growth and development.Publication Unlocking the Power of Healthy Longevity(Washington, DC: World Bank, 2024-09-12)Noncommunicable diseases (NCDs) are among the major health and development challenges of our time. Every year, about 41 million people die due to NCDs. This makes up about 74 percent of all deaths globally, the majority of which are in low- and middle-income countries (LMICs). Countless more people live with NCDs every day. Yet, NCDs are largely treatable and preventable. The risk of developing NCDs and deaths from them can both be lowered with appropriate attention to prevention and treatment. However, weak health systems and limited access to affordable care and information, especially in LMICs, contribute to lapses in seeking and receiving appropriate and timely care. This compendium is a compilation of 18 chapters, each exploring a different but related topic in the nexus of NCDs, human capital, and productivity. It is based on a series of analytical work taken up by the World Bank to support the Healthy Longevity Initiative (HLI) - a collaborative effort between the World Bank, the University of Toronto, and key academic and development partners including the Harvard University and the University of Washington. The HLI presents one of a growing set of efforts to increase the urgency of policy response to NCDs across the world.Publication Finance and Prosperity 2024(Washington, DC: World Bank, 2024-08-29)While financial sector risks in the larger and higher per capita countries are moderate, half of lower-income countries face significant risks over the next 12 months. Nearly 70 percent of countries facing high financial sector risks are currently not adequately prepared to handle financial stress. The report also identifies a particular risk facing financial sectors in several countries: a large and growing exposure to sovereign debt. This exposure surged to its highest level in the past decade. Finally, the report looks at how countries can enable more climate finance through the banking sector without compromising on the important goals of financial sector stability and inclusion for underserved people.Publication World Development Report 2004(World Bank, 2003)Too often, services fail poor people in access, in quality, and in affordability. But the fact that there are striking examples where basic services such as water, sanitation, health, education, and electricity do work for poor people means that governments and citizens can do a better job of providing them. Learning from success and understanding the sources of failure, this year’s World Development Report, argues that services can be improved by putting poor people at the center of service provision. How? By enabling the poor to monitor and discipline service providers, by amplifying their voice in policymaking, and by strengthening the incentives for providers to serve the poor. Freedom from illness and freedom from illiteracy are two of the most important ways poor people can escape from poverty. To achieve these goals, economic growth and financial resources are of course necessary, but they are not enough. The World Development Report provides a practical framework for making the services that contribute to human development work for poor people. With this framework, citizens, governments, and donors can take action and accelerate progress toward the common objective of poverty reduction, as specified in the Millennium Development Goals.