Publication:
Pull Your Small Area Estimates Up by the Bootstraps

Files in English
Authors' accepted manuscript (4.25 MB)
129 downloads
Published
2021-05-08
ISSN
0094-9655
Date
2022-01-14
Author(s)
Molina, Isabel
Nguyen, Minh
Abstract
This paper presents a methodological update to the World Bank's toolkit for small area estimation. The paper reviews the computational procedures of the current methods used by the institution: the traditional ELL approach and the Empirical Best (EB) addition introduced to imitate the original EB procedure of Molina and Rao [Small area estimation of poverty indicators. Canadian J Stat. 2010;38(3):369–385], including heteroscedasticity and survey weights, but using a different bootstrap approach, here referred to as clustered bootstrap. Simulation experiments provide empirical evidence of the shortcomings of the clustered bootstrap approach, which yields biased and noisier point estimates. The document presents an update to the World Bank’s EB implementation by considering the original EB procedures for point and noise estimation, extended for complex designs and heteroscedasticity. Simulation experiments illustrate that the revised methods yield considerably less biased and more efficient estimators than those obtained from the clustered bootstrap approach.

Related items

Showing items related by metadata.

  • Publication
    Pull Your Small Area Estimates Up by the Bootstraps
    (World Bank, Washington, DC, 2020-05) Molina, Isabel; Corral, Paul; Nguyen, Minh
    After almost two decades of poverty maps produced by the World Bank and multiple advances in the literature, this paper presents a methodological update to the World Bank's toolkit for small area estimation. The paper reviews the computational procedures of the current methods used by the World Bank: the traditional approach by Elbers, Lanjouw and Lanjouw (2003) and the Empirical Best/Bayes (EB) addition introduced by Van der Weide (2014). The addition extends the EB procedure of Molina and Rao (2010) by considering heteroscedasticity and includes survey weights, but uses a different bootstrap approach, here referred to as clustered bootstrap. Simulation experiments comparing these methods to the original EB approach of Molina and Rao (2010) provide empirical evidence of the shortcomings of the clustered bootstrap approach, which yields biased point estimates. The main contributions of this paper are then two: 1) to adapt the original Monte Carlo simulation procedure of Molina and Rao (2010) for the approximation of the extended EB estimators that include heteroscedasticity and survey weights as in Van der Weide (2014); and 2) to adapt the parametric bootstrap approach for mean squared error (MSE) estimation considered by Molina and Rao (2010), and proposed originally by González-Manteiga et al. (2008), to these extended EB estimators. Simulation experiments illustrate that the revised Monte Carlo simulation method yields estimators that are considerably less biased and more efficient in terms of MSE than those obtained from the clustered bootstrap approach, and that the parametric bootstrap MSE estimators are in line with the true MSEs under realistic scenarios.
  • Publication
    A Map of the Poor or a Poor Map?
    (World Bank, Washington, DC, 2021-04) Himelein, Kristen; Corral, Paul; McGee, Kevin; Molina, Isabel
    This paper evaluates the performance of different small area estimation methods using model and design-based simulation experiments. Design-based simulation experiments are carried out using the Mexican Intra Censal survey as a census of roughly 3.9 million households from which 500 samples are drawn using a two-stage selection procedure similar to that of Living Standards Measurement Study surveys. Several unit-level methods are considered as well as a method that combines unit and area level information, which has been proposed as an alternative when the available census data is outdated. The findings show the importance of selecting a proper model and data transformation so that the model assumptions hold. A proper data transformation can lead to a considerable improvement in mean squared errors. The results from design-based validation show that all small area estimation methods represent an improvement, in terms of mean squared errors, over direct estimates. However, methods that model unit level welfare using only area level information suffer from considerable bias. Because the magnitude and direction of the bias are unknown ex ante, methods that rely only on aggregated covariates should be used with caution, but they may be an alternative to traditional area level models when these are not applicable.
  • Publication
    How Good a Map? Putting Small Area Estimation to the Test
    (World Bank, Washington, DC, 2007-03) Demombynes, Gabriel; Elbers, Chris; Lanjouw, Jean O.; Lanjouw, Peter
    The authors examine the performance of small area welfare estimation. The method combines census and survey data to produce spatially disaggregated poverty and inequality estimates. To test the method, they compare predicted welfare indicators for a set of target populations with their true values. They construct target populations using actual data from a census of households in a set of rural Mexican communities. They examine estimates along three criteria: accuracy of confidence intervals, bias, and correlation with true values. The authors find that while point estimates are very stable, the precision of the estimates varies with alternative simulation methods. While the original approach of numerical gradient estimation yields standard errors that seem appropriate, some computationally less-intensive simulation procedures yield confidence intervals that are slightly too narrow. The precision of estimates is shown to diminish markedly if unobserved location effects at the village level are not well captured in underlying consumption models. With well-specified models there is only slight evidence of bias, but the authors show that bias increases if underlying models fail to capture latent location effects. Correlations between estimated and true welfare at the local level are highest for mean expenditure and poverty measures and lower for inequality measures.
  • Publication
    SAE - A Stata Package for Unit Level Small Area Estimation
    (World Bank, Washington, DC, 2018-10) Nguyen, Minh Cong; Corral, Paul; Azevedo, Joao Pedro; Zhao, Qinghua
    This paper presents a new family of Stata functions devoted to small area estimation. Small area methods attempt to address the low representativeness of surveys within areas, or the lack of data for specific areas/sub-populations. This is accomplished by incorporating information from outside sources. Such target data sets are becoming increasingly available and can take the form of a traditional population census, but also large-scale administrative records from tax administrations, or geospatial information produced using remote sensing. The strength of these target data sets is their granularity on the subpopulations of interest; however, in many cases they lack analytically relevant variables such as welfare or caloric intake. The family of functions introduced follows a modular design so that it can be expanded in the future, either by the authors or by other collaborators from the Stata community. Thus far, a major limitation of such analysis in Stata has been the large size of target data sets. The package introduces new Mata functions and a plugin used to circumvent memory limitations that inevitably arise when working with big data. From an estimation perspective, the paper starts by implementing a methodology that has been widely used for the production of several poverty maps.
  • Publication
    Frontiers in Small Area Estimation Research
    (Washington, DC: World Bank, 2024-06-28) Molina, Isabel
    This paper reviews the main methods for small area estimation of welfare indicators. It begins by discussing the importance of small area estimation methods for producing reliable disaggregated estimates. It mentions the baseline papers and describes the contents of the different sections. Basic direct estimators obtained from area-specific survey data are described first, followed by simple indirect methods, which include synthetic procedures that do not account for the area effects and composite estimators obtained as a composition (or weighted average) of a synthetic and a direct estimator. The previous estimators are design-based, meaning that their properties are assessed under the sampling replication mechanism, without assuming any model to be true. The paper then turns to proper model-based estimators that assume an explicit model. These models allow obtaining optimal small area estimators when the assumed model holds. The first type of models, referred to as area-level models, use only aggregated data at the area level to fit the model. However, unit-level survey data were previously used to calculate the direct estimators, which act as response variables in the most common area-level models. The paper then switches to unit-level models, describing first the usual estimators for area means, and then moving to general area indicators. Semi-parametric, non-parametric, and machine learning procedures are described in a separate section, although many of the procedures are applicable only to area means. Based on the previous material, the paper identifies gaps or potential limitations in existing procedures from a practitioner’s perspective, which could potentially be addressed through research over the next three to five years.

Users also downloaded

Showing related downloaded files

  • Publication
    World Development Report 2017
    (Washington, DC: World Bank, 2017-01-30) World Bank Group
    Why are carefully designed, sensible policies too often not adopted or implemented? When they are, why do they often fail to generate development outcomes such as security, growth, and equity? And why do some bad policies endure? This book addresses these fundamental questions, which are at the heart of development. Policy making and policy implementation do not occur in a vacuum. Rather, they take place in complex political and social settings, in which individuals and groups with unequal power interact within changing rules as they pursue conflicting interests. The process of these interactions is what this Report calls governance, and the space in which these interactions take place, the policy arena. The capacity of actors to commit and their willingness to cooperate and coordinate to achieve socially desirable goals are what matter for effectiveness. However, who bargains, who is excluded, and what barriers block entry to the policy arena determine the selection and implementation of policies and, consequently, their impact on development outcomes. Exclusion, capture, and clientelism are manifestations of power asymmetries that lead to failures to achieve security, growth, and equity. The distribution of power in society is partly determined by history. Yet, there is room for positive change. This Report reveals that governance can mitigate, even overcome, power asymmetries to bring about more effective policy interventions that achieve sustainable improvements in security, growth, and equity. This happens by shifting the incentives of those with power, reshaping their preferences in favor of good outcomes, and taking into account the interests of previously excluded participants. These changes can come about through bargains among elites and greater citizen engagement, as well as by international actors supporting rules that strengthen coalitions for reform.
  • Publication
    Digital Africa
    (Washington, DC: World Bank, 2023-03-13) Begazo, Tania; Dutz, Mark Andrew; Blimpo, Moussa
    All African countries need better and more jobs for their growing populations. "Digital Africa: Technological Transformation for Jobs" shows that broader use of productivity-enhancing, digital technologies by enterprises and households is imperative to generate such jobs, including for lower-skilled people. At the same time, it can support not only countries’ short-term objective of postpandemic economic recovery but also their vision of economic transformation with more inclusive growth. These outcomes are not automatic, however. Mobile internet availability has increased throughout the continent in recent years, but Africa’s uptake gap is the highest in the world. Areas with at least 3G mobile internet service now cover 84 percent of Africa’s population, but only 22 percent uses such services. And the average African business lags in the use of smartphones and computers as well as more sophisticated digital technologies that catalyze further productivity gains. Two issues explain the usage gap: affordability of these new technologies and willingness to use them. For the 40 percent of Africans below the extreme poverty line, mobile data plans alone would cost one-third of their incomes—in addition to the price of access devices, apps, and electricity. Data plans for small- and medium-size businesses are also more expensive than in other regions. Moreover, shortcomings in the quality of internet services—and in the supply of attractive, skills-appropriate apps that promote entrepreneurship and raise earnings—dampen people’s willingness to use them. For those countries already using these technologies, the development payoffs are significant. New empirical studies for this report add to the rapidly growing evidence that mobile internet availability directly raises enterprise productivity, increases jobs, and reduces poverty throughout Africa. 
    To realize these and other benefits more widely, Africa’s countries must implement complementary and mutually reinforcing policies to strengthen both consumers’ ability to pay and willingness to use digital technologies. These interventions must prioritize productive use to generate large numbers of inclusive jobs in a region poised to benefit from a massive, youthful workforce—one projected to become the world’s largest by the end of this century.
  • Publication
    2022 Mini Grids for Half a Billion People
    (Washington, DC: World Bank, 2022-09-22) ESMAP (Energy Sector Management Assistance Program)
    This book is packed with actionable information for decision-makers, and it is the World Bank’s most comprehensive and authoritative publication on mini grids to date. The objective of this comprehensive knowledge package is to present road-tested options and examples from the leading edge of mini grid development. Decision-makers can draw on these options and examples to scale up mini grid deployment in their own contexts. By acknowledging different national approaches to mini grids and providing context-specific considerations for implementation, this suite of knowledge products offers an adaptive approach to helping countries achieve their electrification targets. The book is structured as follows. The overview presents a global market outlook for mini grids and introduces the 10 building blocks that need to be in place if mini grids are to be scaled up in any country. These building blocks also represent the 10 frontiers for innovation for the sector, where, with disruptive digital solutions across all 10 frontiers, the services offered to end users can be raised to a level substantially better than what would be possible with alternatives. In the Handbook, the terms “building blocks” and “frontiers” are used interchangeably. Chapters 1–10 present the 10 building blocks in detail and answer the question: how do we scale up mini grid deployment to connect half a billion people by 2030? Chapter 11 is our call to action.
  • Publication
    Commodity Markets Outlook, April 2022
    (Washington, DC: World Bank, 2022-04-26) World Bank Group
    The war in Ukraine has caused major supply disruptions and led to historically higher prices for a number of commodities. Most commodity prices are now expected to see sharp increases in 2022 and remain high in the medium term. The price of Brent crude oil is projected to average $100/bbl in 2022, a 40 percent increase from 2021. Non-energy prices are expected to rise by about 20 percent in 2022, with the largest increases in commodities where Russia or Ukraine are key exporters. Wheat prices in particular are forecast to increase more than 40 percent this year. While price pressures are expected to ease in 2023, commodity prices will remain much higher than previously expected. The outlook depends on the duration of the war and the severity of disruptions to commodity flows. A Special Focus section investigates the impact of the war on commodity markets and compares the current episode with previous price spikes. It finds that previous oil price spikes led to the emergence of new sources of supplies and reduced demand in response to efficiency improvements and substitution to other commodities. In the case of food, new land was made available for food production. For policymakers, a short-term priority is providing targeted support to poorer households facing higher food and energy prices. For longer-lasting solutions, they should facilitate investment in new sources of zero-carbon energy.
  • Publication
    World Bank Annual Report 2024
    (Washington, DC: World Bank, 2024-10-25) World Bank
    This annual report, which covers the period from July 1, 2023, to June 30, 2024, has been prepared by the Executive Directors of both the International Bank for Reconstruction and Development (IBRD) and the International Development Association (IDA)—collectively known as the World Bank—in accordance with the respective bylaws of the two institutions. Ajay Banga, President of the World Bank Group and Chairman of the Board of Executive Directors, has submitted this report, together with the accompanying administrative budgets and audited financial statements, to the Board of Governors.