Publication: Combining Survey and Geospatial Data Can Significantly Improve Gender-Disaggregated Estimates of Labor Market Outcomes
Date
2022-06
Published
2022-06
Abstract
Better understanding the geography of women’s labor market outcomes within countries is important to inform targeted efforts to increase women’s economic empowerment. This paper assesses the extent to which a method that combines simulated survey data from urban areas in Mexico with broadly available geospatial indicators from Google Earth Engine and OpenStreetMap can significantly improve estimates of labor force participation and unemployment rates. Incorporating geospatial information substantially increases the accuracy of male and female labor force participation and unemployment rates at the state level, reducing mean absolute deviation by 50 to 62 percent for labor force participation and 25 to 52 percent for unemployment. Small area estimation using a nested error conditional random effect model also greatly improves municipal estimates of labor force participation, as the mean absolute error falls by approximately half, while the mean squared error falls by almost 75 percent when holding coverage rates constant. In contrast, the results for municipal unemployment rate estimates are not reliable because values of unemployment rates are low and therefore poorly suited for linear models. The municipal results hold in repeated simulations of alternative samples. Models utilizing Basic Geo-Statistical Area (AGEB)–level auxiliary information generate more accurate predictions than area-level models specified using the same auxiliary data. Overall, integrating survey data and publicly available geospatial indicators is feasible and can greatly improve state-level estimates of male and female labor force participation and unemployment rates, as well as municipal estimates of male and female labor force participation.
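The accuracy comparison described in the abstract can be illustrated with a minimal sketch. It is not the authors' code: the benchmark, direct survey estimates, and model-based estimates below are hypothetical numbers chosen only to show how mean absolute error, mean squared error, and coverage rates are computed against a benchmark. The function names are likewise assumptions made for illustration.

```python
from statistics import mean


def mean_absolute_error(estimates, benchmark):
    """Average absolute gap between estimates and benchmark values."""
    return mean(abs(e - b) for e, b in zip(estimates, benchmark))


def mean_squared_error(estimates, benchmark):
    """Average squared gap between estimates and benchmark values."""
    return mean((e - b) ** 2 for e, b in zip(estimates, benchmark))


def percent_reduction(baseline, improved):
    """Percentage by which `improved` falls below `baseline`."""
    return 100.0 * (baseline - improved) / baseline


def coverage_rate(lower, upper, benchmark):
    """Share of areas whose interval [lower, upper] contains the benchmark."""
    hits = [lo <= b <= hi for lo, hi, b in zip(lower, upper, benchmark)]
    return sum(hits) / len(hits)


# Hypothetical labor force participation rates for four areas (not from the paper).
benchmark = [0.60, 0.50, 0.40, 0.70]   # e.g. census-based "truth"
direct = [0.68, 0.42, 0.48, 0.62]      # survey-only estimates
model = [0.64, 0.46, 0.44, 0.66]       # estimates using auxiliary data

mae_cut = percent_reduction(
    mean_absolute_error(direct, benchmark),
    mean_absolute_error(model, benchmark),
)
mse_cut = percent_reduction(
    mean_squared_error(direct, benchmark),
    mean_squared_error(model, benchmark),
)

# Hypothetical symmetric intervals of +/- 0.05 around the model estimates.
lower = [m - 0.05 for m in model]
upper = [m + 0.05 for m in model]
coverage = coverage_rate(lower, upper, benchmark)

print(f"MAE reduction: {mae_cut:.1f}%")
print(f"MSE reduction: {mse_cut:.1f}%")
print(f"Coverage rate: {coverage:.2f}")
```

In this toy example every error is exactly halved, so the mean absolute error falls by about 50 percent and the mean squared error by about 75 percent, because squaring a halved error leaves one quarter of the original. The pattern resembles the municipal results reported above, although real estimation errors are not reduced uniformly across areas.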
Citation
“Merfeld, Joshua D.; Newhouse, David; Weber, Michael; Lahiri, Partha. 2022. Combining Survey and Geospatial Data Can Significantly Improve Gender-Disaggregated Estimates of Labor Market Outcomes. Policy Research Working Papers;10077. © World Bank. http://hdl.handle.net/10986/37519 License: CC BY 3.0 IGO.”