Publication:
When Aggregation Misleads: Bias in Unit-Level Small Area Estimates of Poverty with Aggregate Data

Loading...
Thumbnail Image
Files in English
English PDF (1 MB)
16 downloads
English Text (49.31 KB)
4 downloads
Date
2025-05-01
ISSN
Published
2025-05-01
Author(s)
Editor(s)
Abstract
This paper explores why small area poverty estimates from models at the household level that only use aggregate data as covariates, exhibit systematic bias. The analysis demonstrates that this bias stems from the model’s inability to capture the complete between-household variation in welfare, as they rely solely on covariates aggregated at geographic levels. Through model-based simulations, the paper shows that the bias in these models is minimized when the empirical variability of simulated welfare based on the model is closest to the true empirical variance of welfare at the area level. This finding also has implications for bias in unit-level models.
Link to Data Set
Citation
Corral, Paul. 2025. When Aggregation Misleads: Bias in Unit-Level Small Area Estimates of Poverty with Aggregate Data. Public Research Working Paper; 11110. © World Bank. http://hdl.handle.net/10986/43150 License: CC BY 3.0 IGO.
Associated URLs
Associated content
Report Series
Report Series
Other publications in this report series
  • Publication
    Geopolitics and the World Trading System
    (Washington, DC: World Bank, 2024-12-23) Mattoo, Aaditya; Ruta, Michele; Staige, Robert W.
    Until the beginning of this century, the GATT/WTO system worked. Economic research provided a compelling explanation. It showed that if governments maximize the well-being of their own countries broadly defined, GATT/WTO principles would facilitate mutually beneficial cooperation over their trade policy choices. Now heightened geopolitical rivalry seems to have undermined the WTO. A simple transposition of the previous rationalization suggests that geopolitics and trade cooperation are not compatible. The paper shows that this is only true if rivalry eclipses any consideration of own-country well-being. In all other circumstances, there are gains from trade cooperation even with geopolitics. Furthermore, the WTO’s relevance is in question only if it adheres too rigidly to its existing rules and norms. Through measured adaptation to the geopolitical imperative, the WTO can continue to thrive as a forum for multilateral trade cooperation in the age of geopolitics.
  • Publication
    Innovative Financial Instruments and Their Role in the Development of Jurisdictional REDD+
    (Washington, DC: World Bank, 2025-05-08) Golub, Alexander; Hanusch, Marek; Bardal, Diogo; Keith, Bruce Ian; Simon, Daniel Navia; Fleischhaker, Cornelius
    Achieving global net zero carbon emissions requires stopping deforestation and making full use of tropical forests as carbon sinks. Market instruments for the sale and purchase of emission outcomes coming from Reducing Emissions from Deforestation and Forest Degradation framework programs could play a very significant role in achieving this goal. The development of these markets has been insufficient so far: their scale as of today is much lower than what would be required to generate meaningful resources for the countries that host tropical forests, and the quality of existing instruments is generally insufficient to allow a scaling up in demand. However, efforts to improve the transparency and integrity of these instruments are accelerating, particularly around jurisdictional Reducing Emissions from Deforestation and Forest Degradation framework programs. In parallel with these efforts, innovations in financial instruments suited for the framework’s carbon markets are also taking place, but their scale is limited so far. This paper looks beyond the current state of the framework’s carbon markets to consider a set of innovative financial instruments that would allow completing the infrastructure of emissions trading, enhancing its utility for both issuers and buyers of carbon credits in the framework’s jurisdictional programs. The paper shows how a combination of forest carbon bonds, where countries sell forward (or commit) their emission reduction outcomes, as well as call and put options can be used to de-risk and encourage early investment in jurisdictional Reducing Emissions from Deforestation and Forest Degradation framework programs. To quantify the value of these innovations, the paper evaluates the potential scale of these instruments for the case of Brazil. The estimates suggest that the amounts that could be mobilized would represent a critical contribution to effective forest conservation. The proposed instruments and methods can be used by other tropical nations that are prepared to implement a large-scale jurisdictional program. Although the paper acknowledges that the current state of carbon markets would still not allow their deployment in the short term, the conclusion is that these instruments have significant potential, and their future development could be an important contribution to the establishment of successful markets for the conservation of tropical forests.
  • Publication
    Disentangling the Key Economic Channels through Which Infrastructure Affects Jobs
    (Washington, DC: World Bank, 2025-04-03) Vagliasindi, Maria; Gorgulu, Nisan
    This paper takes stock of the literature on infrastructure and jobs published since the early 2000s, using a conceptual framework to identify the key channels through which different types of infrastructure impact jobs. Where relevant, it highlights the different approaches and findings in the cases of energy, digital, and transport infrastructure. Overall, the literature review provides strong evidence of infrastructure’s positive impact on employment, particularly for women. In the case of electricity, this impact arises from freeing time that would otherwise be spent on household tasks. Similarly, digital infrastructure, particularly mobile phone coverage, has demonstrated positive labor market effects, often driven by private sector investments rather than large public expenditures, which are typically required for other large-scale infrastructure projects. The evidence on structural transformation is also positive, with some notable exceptions, such as studies that find no significant impact on structural transformation in rural India in the cases of electricity and roads. Even with better market connections, remote areas may continue to lack economic opportunities, due to the absence of agglomeration economies and complementary inputs such as human capital. Accordingly, reducing transport costs alone may not be sufficient to drive economic transformation in rural areas. The spatial dimension of transformation is particularly relevant for transport, both internationally—by enhancing trade integration—and within countries, where economic development tends to drive firms and jobs toward urban centers, benefitting from economies scale and network effects. Turning to organizational transformation, evidence on skill bias in developing countries is more mixed than in developed countries and may vary considerably by context. Further research, especially on the possible reasons explaining the differences between developed and developing economies, is needed.
  • Publication
    Economic Consequences of Trade and Global Value Chain Integration
    (World Bank, Washington, DC, 2025-04-04) Borin, Alessandro; Mancini, Michele; Taglioni, Daria
    This paper introduces a new approach to measuring Global Value Chains (GVC), crucial for informed policy-making. It features a tripartite classification (backward, forward, and two-sided) covering trade and production data. The findings indicate that traditional trade-based GVC metrics significantly underestimate global GVC activity, especially in sectors like services and upstream manufacturing, and overstate risks in early trade liberalization stages. Additionally, conventional backward-forward classifications over-estimate backward linkages. The paper further applies these measures empirically to assess how GVC participation mediates the impact of demand shocks on domestic output, highlighting both the exposure and stabilizing potential of GVC integration. These new measures are comprehensively available on the World Bank’s WITS Platform, providing a key resource for GVC analysis.
  • Publication
    Labor Market Scarring in a Developing Economy
    (Washington, DC: World Bank, 2025-05-08) Arias, Francisco J.; Lederman, Daniel
    This paper estimates the magnitude of labor market scarring in a developing economy, a setting that has been understudied by the labor scarring literature dominated by advanced economies. The paper assesses the contributions of “stigma” versus “lost human capital,” which cause earnings losses among displaced workers relative to non-displaced workers. The findings indicate that job separations caused by plant closings result in sizable and long-lasting reductions in earnings, with an average decline of 7.5 percent in hourly wages over a nine-year period. The estimate for one year after a plant closing is larger, at a decline of 10.8 percent. In a common sample, after controlling for unobserved, time-invariant individual characteristics, the impact of a plant closing declines from 11.9 to 8.2 percent. These results imply that stigma in the labor market due to imperfect information about workers (captured by unobservable worker characteristics) accounts for 30.8 percent of the average earnings losses, whereas lost employer-specific human capital explains the remaining 69.2 percent. The paper explores the effects of job separations due to plant closings on other labor market outcomes, including hours worked and informality, and provides estimates across genders and levels of education.
Journal
Journal Volume
Journal Issue

Related items

Showing items related by metadata.

  • Publication
    Poverty Mapping in the Age of Machine Learning
    (World Bank, Washington, DC, 2023-05-04) Corral, Paul; Segovia, Sandra
    Recent years have witnessed considerable methodological advances in poverty mapping, much of which has focused on the application of modern machine-learning approaches to remotely sensed data. Poverty maps produced with these methods generally share a common validation procedure, which assesses model performance by comparing subnational machine-learning-based poverty estimates with survey-based, direct estimates. Although unbiased, survey-based estimates at a granular level can be imprecise measures of true poverty rates, meaning that it is unclear whether the validation procedures used in machine-learning approaches are informative of actual model performance. This paper examines the credibility of existing approaches to model validation by constructing a pseudo-census from the Mexican Intercensal Survey of 2015, which is used to conduct several design-based simulation experiments. The findings show that the validation procedure often used for machine-learning approaches can be misleading in terms of model assessment since it yields incorrect information for choosing what may be the best set of estimates across different methods and scenarios. Using alternative validation methods, the paper shows that machine-learning-based estimates can rival traditional, more data intensive poverty mapping approaches. Further, the closest approximation to existing machine-learning approaches, using publicly available geo-referenced data, performs poorly when evaluated against “true” poverty rates and fails to outperform traditional poverty mapping methods in targeting simulations.
  • Publication
    Migration, Remittances and Forests : Disentangling the Impact of Population and Economic Growth on Forests
    (2011-12-01) Bhattarai, Keshav; Tiwari, Sailesh
    International migration has increased rapidly in recent decades and this has been accompanied by a remarkable increase in transfers made by migrants to their home countries. This paper investigates the effect of the rural economic growth brought about by migration and remittances on Nepal's Himalayan forests. The authors assemble a unique village-panel dataset combining remote sensing data on land use and forest cover change with data from the census and multiple rounds of living standards surveys to test various inter-relationships between population, economic growth and forests. The results suggest that rural economic growth spurred by remittances has had an overall positive impact on forests. The paper also finds that remittances caused an increase in rural wages and an increase in income, but a decrease in land prices. Considered together, however, the relationship between forests and remittances is driven largely through the income channel, indicating that the demand for amenities provided by forests in the rural Nepali setting may have been more important than factor prices in influencing land use changes for the period of the study.
  • Publication
    Estimating Small Area Population Density Using Survey Data and Satellite Imagery
    (World Bank, Washington, DC, 2019-03) Engstrom, Ryan; Newhouse, David; Soundararajan, Vidhya
    Country-level census data are typically collected once every 10 years. However, conflict, migration, urbanization, and natural disasters can cause rapid shifts in local population patterns. This study uses Sri Lankan data to demonstrate the feasibility of a bottom-up method that combines household survey data with contemporaneous satellite imagery to track frequent changes in local population density. A Poisson regression model based on indicators derived from satellite data, selected using the least absolute shrinkage and selection operator, accurately predicts village-level population density. The model is estimated in villages sampled in the 2012/13 Household Income and Expenditure Survey to obtain out-of-sample density predictions in the nonsurveyed villages. The predictions approximate the 2012 census density well and are more accurate than other bottom-up studies based on lower-resolution satellite data. The predictions are also more accurate than most publicly available population products, which rely on areal interpolation of census data to redistribute population at the local level. The accuracies are similar when estimated using a random forest model, and when density estimates are expressed in terms of population counts. The collective evidence suggests that combining surveys with satellite data is a cost-effective method to track local population changes at more frequent intervals.
  • Publication
    Guidelines to Small Area Estimation for Poverty Mapping
    (Washington, DC : World Bank, 2022-06-16) Corral, Paul; Cojocaru, Alexandru; Segovia, Sandra; Molina, Isabel
    The eradication of poverty, which was the first of the millennium development goals (MDG) established by the United Nations and followed by the sustainable development goals (SDG), requires knowing where the poor are located. Traditionally, household surveys are considered the best source of information on the living standards of a country’s population. Data from these surveys typically provide a sufficiently accurate direct estimate of household expenditures or income and thus estimates of poverty at the national level and larger international regions. However, when one starts to disaggregate data by local areas or population subgroups, the quality of these direct estimates diminishes. Consequently, national statistical offices (NSOs) cannot provide reliable wellbeing statistical figures at a local level. For example, the module of socioeconomic conditions of the Mexican national survey of household income and expenditure (ENIGH) is designed to produce estimates of poverty and inequality at the national level and for the 32 federate entities (31 states and Mexico City) with disaggregation by rural and urban zones, every two years, but there is a mandate to produce estimates by municipality every five years, and the ENIGH alone cannot provide estimates for all municipalities with adequate precision. This makes monitoring progress toward the sustainable development goals more difficult.
  • Publication
    Estimating Small Area Poverty and Welfare Indicators in Timor-Leste Using Satellite Imagery Data
    (World Bank, Washington, DC, 2020-09-28) Purnamasari, Ririn; Wirapati, Bagus Arya; Alatas, Hamidah; Nasiir, Mercoledi
    This report is structured as follows: an in-depth explanation of the FHSAE method is presented in section two. Section three reviews the sub-district level data used in this study, which includes imprecise TL-SLS and DHS direct estimates, as well as satellite imagery data used in this study. The variable selection method used for the FHSAE model in this model is explained in section four. Section five provides the results of the FHSAE exercise on poverty estimates, average real per capita consumption and welfare index, presenting them in the graphical maps. Section six concludes.

Users also downloaded

Showing related downloaded files

  • Publication
    Infrastructure Monitor 2024
    (Washington, DC: World Bank, 2025-04-28) World Bank
    The Infrastructure Monitor report covers global trends in private investment in infrastructure to inform investors, policy-makers and other practitioners. The objective is to deliver global insights on global infrastructure trends across key topics such as investment volumes, performance, blended finance, and ESG drivers, facilitating the monitoring of private infrastructure investment and its performance. These insights aim to support policymakers, investors, and other stakeholders in developing sustainable, resilient, and inclusive infrastructure while fostering effective partnerships with the private sector. Acknowledging the significant infrastructure data gap — with notable variations in coverage, quality across countries and income groups, and differences in the availability of regional breakdowns — our approach leverages the best available aggregated data from leading infrastructure databases to generate market insights while also providing context on its limitations. 2025 will be the fifth version of the report, the first under the World Bank.
  • Publication
    Cities’ Partnership Initiative
    (Washington, DC: World Bank, 2025-04-24) World Bank
    Sustainable urban development is one of the key areas of development policy in Poland, which is in line with global trends. Sustainable urban development requires an integrated approach that takes into account the complexity and dynamics of phenomena and processes taking place in the urban environment. Meeting the challenges of urban development requires, on the one hand, a steady increase in the capacity of cities to plan and implement development projects, and on the other hand, a favorable regulatory and financial framework and support instruments that are an adequate response to the needs of urban centers. The Cities’ Partnership Initiative (CPI) is a flagship project of the Ministry of Development Funds and Regional Policy of Poland (MDFRP) aimed at supporting sustainable urban development. This final report is the third product of the Reimbursable Advisory Service Agreement on Sustainable Urban Development - Cities’ Partnership Initiative concluded between the MDFRP and the World Bank on January 28, 2022. The report summarizes the project work, including the results of the work of 30 CPI-participating cities, and presents conclusions and recommendations on the three thematic networks and the CPI formula itself.
  • Publication
    Commodity Markets Outlook, April 2025
    (Washington, DC: World Bank, 2025-04-29) World Bank
    Commodity prices are set to fall sharply this year, by about 12 percent overall, as weakening global economic growth weighs on demand. In 2026, commodity prices are projected to reach a six-year low. Oil prices are expected to exert substantial downward pressure on the aggregate commodity index in 2025, as a marked slowdown in global oil consumption coincides with expanding supply. The anticipated commodity price softening is broad-based, however, with more than half of the commodities in the forecast set to decrease this year, many by more than 10 percent. The latest shocks to hit commodity markets extend a so far tumultuous decade, marked by the highest level of commodity price volatility in at least half a century. Between 2020 and 2024, commodity price swings were frequent and sharp, with knock-on consequences for economic activity and inflation. In the next two years, commodity prices are expected to put downward pressure on global inflation. Risks to the commodity price projections are tilted to the downside. A sharper-than-expected slowdown in global growth—driven by worsening trade relations or a prolonged tightening of financial conditions—could further depress commodity demand, especially for industrial products. In addition, if OPEC+ fully unwinds its voluntary supply cuts, oil production will far exceed projected consumption. There are also important upside risks to commodity prices—for instance, if geopolitical tensions worsen, threatening oil and gas supplies, or if extreme weather events lead to agricultural and energy price spikes.
  • Publication
    State of Social Protection Report 2025
    (Washington, DC: World Bank, 2025-04-07) World Bank
    Social protection goes well beyond cash transfers; it includes policies and programs that bridge skill, financial, and information gaps, aiding people in securing better jobs. The three pillars of social protection—social assistance, social insurance, and labor market programs—support households and workers in handling crises, escaping poverty, facing transitions, and seizing employment opportunities. But despite a substantial expansion over the past decade, 2 billion people remain uncovered or inadequately covered across low- and middle-income countries. Drawing from administrative and household survey data from the World Bank’s Atlas of Social Protection Indicators of Resilience and Equity (ASPIRE), the "State of Social Protection Report 2025: The 2-Billion-Person Challenge" documents advances and challenges to strengthening social protection and labor systems across low- and middle-income countries, analyzing the evolution of expenditure, coverage, and adequacy of support. This report details four policy action areas governments can embrace to maximize the benefits of adequate social protection for all: extending social protection to those in need; strengthening the adequacy of social protection support; building shock-proof social protection systems; and optimizing social protection financing. The report discusses how the path of reforms will depend on country context, capacity, and fiscal space. The rising frequency of shocks and crises calls for major investments in the adaptability and preparedness of social protection and labor systems. Amid a world in transition, social protection is more important and necessary than ever.
  • Publication
    Air Quality Management in Central Asia
    (Washington, DC: World Bank, 2025-05-02) World Bank
    This report aims to enhance the understanding of the priorities, needs, and solutions for improving air quality (AQ) in Central Asia (CA) through local action and regional collaboration. It focuses on key components of holistic air quality management (AQM): evidence-based analytics to identify the main sources of air pollution in CA, application of modern tools to assess the impact of cost-effective measures to improve AQ, assessment of the institutional and governance setup for AQM in CA with recommendations to strengthen it, and approaches to financing AQ improvement. Given the lack of comprehensive systematic and validated emission inventories of all PM2.5 precursor emissions, the technical assessment employs the regional emission inventory of the Greenhouse Gas - Air Pollution Interactions and Synergies (GAINS) model. Input data were updated for this study based on recent energy statistics and relevant national surveys. This report addresses emissions and the regional transboundary flows of pollution between Kazakhstan, the Kyrgyz Republic, Tajikistan, Turkmenistan, and Uzbekistan. Subsequently, the resulting PM2.5 concentrations in ambient air throughout CA were computed with the atmospheric chemistry and transport calculations of the GAINS model. Employing the source apportionment results of the GAINS model, the analysis then examines the contributions to PM2.5 population exposure. The report also presents source apportionment analyses for important air pollution hot spots in CA: Dushanbe (Tajikistan), Bishkek (the Kyrgyz Republic), Tashkent (Uzbekistan), Samarkand (Uzbekistan), Astana, and Almaty (Kazakhstan).