Publication: Is Random Forest a Superior Methodology for Predicting Poverty?: An Empirical Assessment
Published
2016-03
Date
2016-04-26
Abstract
Random forest is a common method for data-driven prediction in many fields of research, but it is rarely used in economics or in the prediction of poverty. A comparison of out-of-sample predictions within same-year surveys in six countries shows that random forest is often more accurate than current common practice (multiple imputation with variables selected by stepwise selection and Lasso), suggesting that the method could contribute to better poverty predictions. However, none of the methods consistently provides accurate predictions of poverty over time, which highlights that technical model fitting within a single year, by any method, is not always sufficient by itself for accurate predictions of poverty over time.
Citation
Sohnesen, Thomas Pave; Stender, Niels. 2016. Is Random Forest a Superior Methodology for Predicting Poverty?: An Empirical Assessment. Policy Research Working Paper; No. 7612. © World Bank. http://hdl.handle.net/10986/24154 License: CC BY 3.0 IGO.