Publication:
Where Are All the Jobs ?: A Machine Learning Approach for High Resolution Urban Employment Prediction in Developing Countries

Published
2022-03
Date
2022-03-23
Author(s)
Barzin, Samira
O’Clery, Neave
Abstract
Globally, both people and economic activity are increasingly concentrated in urban areas. Yet, for the vast majority of developing country cities, little is known about the granular spatial organization of such activity despite its key importance to policy and urban planning. This paper adapts a machine-learning-based algorithm to predict the spatial distribution of employment using input data from open access sources such as Open Street Map and Google Earth Engine. The algorithm is trained on 14 test cities, ranging from Buenos Aires in Argentina to Dakar in Senegal. A spatial adaptation of the random forest algorithm is used to predict within-city cells in the 14 test cities with extremely high accuracy (R-squared greater than 95 percent), and cells in out-of-sample "unseen" cities with high accuracy (mean R-squared of 63 percent). This approach uses open data to produce high resolution estimates of the distribution of urban employment for cities where such information does not exist, making evidence-based planning more accessible than ever before.
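The distinction the abstract draws between within-city accuracy and accuracy on "unseen" cities can be illustrated with a small sketch. This is not the paper's method: the paper uses a spatial adaptation of the random forest algorithm, whereas the code below swaps in a one-feature least-squares line so that it runs on the Python standard library alone. The leave-one-city-out scheme, the synthetic data, and all names (`r_squared`, `fit_line`, `leave_one_city_out`, `city_a`, and so on) are assumptions made for illustration only.

```python
import random
from statistics import mean


def r_squared(y_true, y_pred):
    """Coefficient of determination: 1 - SS_res / SS_tot."""
    y_bar = mean(y_true)
    ss_res = sum((t - p) ** 2 for t, p in zip(y_true, y_pred))
    ss_tot = sum((t - y_bar) ** 2 for t in y_true)
    return 1.0 - ss_res / ss_tot


def fit_line(xs, ys):
    """One-feature ordinary least squares.

    Stand-in model only; the paper itself uses a spatial random forest.
    """
    x_bar, y_bar = mean(xs), mean(ys)
    sxy = sum((x - x_bar) * (y - y_bar) for x, y in zip(xs, ys))
    sxx = sum((x - x_bar) ** 2 for x in xs)
    slope = sxy / sxx
    intercept = y_bar - slope * x_bar
    return lambda x: slope * x + intercept


def leave_one_city_out(cells):
    """Score each city with a model trained on all the other cities.

    cells maps a city name to a list of (feature, jobs) pairs, one per grid cell.
    Returns a dict mapping each city to its held-out R-squared.
    """
    scores = {}
    for held_out, test_rows in cells.items():
        train_rows = [row for city, rows in cells.items()
                      if city != held_out for row in rows]
        model = fit_line([x for x, _ in train_rows], [y for _, y in train_rows])
        scores[held_out] = r_squared([y for _, y in test_rows],
                                     [model(x) for x, _ in test_rows])
    return scores


# Synthetic data (hypothetical): each city has a slightly different
# relationship between the feature and the number of jobs per cell.
rng = random.Random(0)
cells = {}
for city, slope in [("city_a", 2.0), ("city_b", 2.2), ("city_c", 1.8)]:
    rows = []
    for _ in range(200):
        x = rng.uniform(0, 10)
        rows.append((x, slope * x + rng.gauss(0, 1)))
    cells[city] = rows

scores = leave_one_city_out(cells)
for city, score in scores.items():
    print(f"{city}: held-out R-squared = {score:.2f}")
```

Because each city is scored by a model that never saw its cells, the held-out scores reflect how well relationships learned elsewhere transfer, which is the harder setting in which the abstract reports a mean R-squared of 63 percent.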
Citation
Barzin, Samira; Rentschler, Jun; O’Clery, Neave; Avner, Paolo. 2022. Where Are All the Jobs ?: A Machine Learning Approach for High Resolution Urban Employment Prediction in Developing Countries. Policy Research Working Paper;9979. © World Bank. http://hdl.handle.net/10986/37195 License: CC BY 3.0 IGO.
Report Series
Other publications in this report series
  • Publication
    Climate and Social Sustainability in Fragility, Conflict, and Violence Contexts
    (Washington, DC: World Bank, 2026-01-07) Cuesta Leiva, Jose Antonio; Huff, Connor
    Climate change is widely recognized as a driver of violent conflict, but its broader social effects remain less understood. Ignoring these dimensions risks a vicious cycle where climate policies might undermine socially just adaptation. Evidence is still limited on how climate shocks influence political participation, trust, or migration. This paper helps fill that gap by examining links between climate change, conflict, and social sustainability, with a focus on inclusion, resilience, cohesion, and legitimacy. Using secondary data from 2019–24, the study applies simple correlation-based methods to test three hypotheses on the nature, severity, and composition of these associations. The analysis combines multiple climate impact measures, new conflict classifications, recent social sustainability frameworks, and controls for population and geography. The results reveal strong correlations—not causation—between climate events and contexts of fragility, conflict, and violence. Climate impacts are most pronounced in both national and subnational conflict settings. The study also finds robust links between fragility, conflict, and violence and low levels of social sustainability, reflecting its role as both a driver and consequence of conflict. Some dimensions—such as violent events and insecurity—appear weaker in areas most affected by climate shocks. Two of the hypotheses are supported, and one remains inconclusive.
  • Publication
    The Macroeconomic Implications of Climate Change Impacts and Adaptation Options
    (Washington, DC: World Bank, 2025-05-29) Abalo, Kodzovi; Boehlert, Brent; Bui, Thanh; Burns, Andrew; Castillo, Diego; Chewpreecha, Unnada; Haider, Alexander; Hallegatte, Stephane; Jooste, Charl; McIsaac, Florent; Ruberl, Heather; Smet, Kim; Strzepek, Ken
    Estimating the macroeconomic implications of climate change impacts and adaptation options is a topic of intense research. This paper presents a framework in the World Bank's macrostructural model to assess climate-related damages. This approach has been used in many Country Climate and Development Reports, a World Bank diagnostic that identifies priorities to ensure continued development in spite of climate change and climate policy objectives. The methodology captures a set of impact channels through which climate change affects the economy by (1) connecting a set of biophysical models to the macroeconomic model and (2) exploring a set of development and climate scenarios. The paper summarizes the results for five countries, highlighting the sources and magnitudes of their vulnerability, with estimated gross domestic product losses in 2050 exceeding 10 percent of gross domestic product in some countries and scenarios, although only a small set of impact channels is included. The paper also presents estimates of the macroeconomic gains from sector-level adaptation interventions, considering their upfront costs and avoided climate impacts and finding significant net gross domestic product gains from adaptation opportunities identified in the Country Climate and Development Reports. Finally, the paper discusses the limits of current modeling approaches, and their complementarity with empirical approaches based on historical data series. The integrated modeling approach proposed in this paper can inform policymakers as they make proactive decisions on climate change adaptation and resilience.
  • Publication
    Institutional Capacity for Policy Implementation: An Analytical Framework
    (Washington, DC: World Bank, 2026-01-07) Kim, Galileu; Kumar, Tanu; Ramalho, Rita; Russell, Stuart
    State capacity is an important prerequisite for policy implementation, yet at the country level it is difficult to measure, assess, and reform. This paper proposes a focus on institutional capacity: the ability of public institutions to implement the specific policy mandates for which they are responsible. Based on a review of existing literature, the paper defines the different dimensions that compose institutional capacity and groups them into two cross-cutting categories: organizational dimensions (personnel, financial resources, information systems, and management practices) and governance dimensions (transparency, independence, and accountability). The paper proposes measures for organizational and governance dimensions using existing data, shows intra-institutional variation of these measures within countries, and discusses how new data could be collected for better measurement of these concepts. Finally, the paper illustrates how the framework can be used to diagnose the sources of common problems related to weak policy implementation.
  • Publication
    South Africa’s Fragmented Cities: The Unequal Burden of Labor Market Frictions
    (Washington, DC: World Bank, 2026-01-08) Baez, Javier E.; Kshirsagar, Varun
    Using high-resolution administrative, census, and satellite data, this paper shows that South African cities are characterized by spatial mismatches between where people live and where jobs are located, relative to 20 global peers. Areas within 5 kilometers of commercial centers have 9,300 fewer residents per square kilometer than expected, which is 60 percent below the global median. Poor, dense neighborhoods are most affected. In Johannesburg, a 10-percentile increase in distance from the nearest business hub corresponds to a 3.7-percentile drop in asset wealth (a proxy of household wellbeing) and 4.9-percentile drop in employment. In Cape Town, the declines are 4.0 and 3.7 percentiles, respectively. Employment is 87 percent lower in the poorest decile than the richest in Johannesburg and 61 percent lower in Cape Town. These findings suggest that South Africa’s spatial organization of people and economic activity constrains agglomeration and reinforces inequality. This methodology provides a scalable and standardized data-driven framework to analyze spatial accessibility and agglomeration frictions in complex, data-constrained urban systems.
  • Publication
    Investment in Emerging and Developing Economies
    (Washington, DC: World Bank, 2026-01-07) Adarov, Amat; Kose, M. Ayhan; Vorisek, Dana
    The world faces a pressing challenge to meet key development objectives amid slowing growth and rising macroeconomic and geopolitical risks. With the number of job seekers rising rapidly, infrastructure shortfalls continuing to be large, and climate costs mounting, the case for a significant investment push has never been stronger. Yet the capacity to respond in many emerging markets and developing economies has eroded. Since the global financial crisis, investment growth has slowed to about half its pace in the 2000s, with both public and private investment weakening. Foreign direct investment inflows—a critical source of capital, technology, and managerial know-how—have also fallen sharply and become increasingly concentrated, leaving low-income countries with only a marginal share. The risks of further retrenchment are significant, as trade tensions, policy uncertainty, and elevated debt levels continue to weigh on investment. Reigniting momentum will require ambitious domestic reforms to strengthen institutions, rebuild macro-fiscal stability, and deepen trade and investment integration—the foundations of a supportive business climate. At the same time, international cooperation is indispensable. A renewed commitment to a predictable system of cross-border trade and investment flows, combined with scaled-up financial support and sustained technical assistance, is essential to help emerging markets and developing economies—especially low-income countries and economies in fragile and conflict situations—bridge financing gaps and implement the domestic reforms needed to restore investment as an engine of growth, jobs, and development.

Related items

Showing items related by metadata.

  • Publication
    Predicting Urban Employment Distributions
    (Washington, DC, 2022-06) Maruyama Rentschler, Jun Erik; Barzin, Samira; O’Clery, Neave; Avner, Paolo
    Cities are intricately interconnected socioeconomic systems, with transport networks connecting people to their jobs, health, and education facilities, and ensuring the smooth functioning of supply chains. When floods happen, they isolate people and firms from these vital networks, causing cascading disruptions and losses. Such floods are not limited to rare and extreme events. Especially in developing country cities, the lack of resilient infrastructure systems means that even regular rainfall events, for example, during rainy seasons, can cause havoc. Attention is often biased towards direct asset losses from floods, rather than the wider economic costs of disrupted networks. This is due primarily to the complex dynamics of economic and infrastructure networks. But public transport and road usage data are also often limited, especially when the predominant modes of transport are informal and walking. So how can we identify and prioritize cost-effective measures for urban resilience? This note describes an analytical approach that can help prioritize investments in urban transport resilience and public transport, while also strengthening the economic case for such investments.
  • Publication
    Central America : Big Data in Action for Development
    (Washington, DC, 2014) World Bank
    This report stemmed from a World Bank pilot activity to explore the potential of big data to address development challenges in Central American countries. As part of this activity we collected and analyzed a number of examples of leveraging big data for development. Because of the growing interest in this topic this report makes available to a broader audience those examples as well as the underlying conceptual framework to think about big data for development. To make effective use of big data, many practitioners emphasize the importance of beginning with a question instead of the data itself. A question clarifies the purpose of utilizing big data, whether it is for awareness, understanding, and/or forecasting. In addition, a question suggests the kinds of real-world behaviors or conditions that are of interest. These behaviors are encoded into data through some generating process which includes the media through which behavior is captured. Then various data sources are accessed, prepared, consolidated and analyzed. This ultimately gives rise to insights into the question of interest, which are implemented to effect changes in the relevant behaviors. Utilizing big data for any given endeavor requires a host of capabilities. Hardware and software capabilities are needed for interaction of data from a variety of sources in a way which is efficient and scalable. Human capabilities are needed not only to make sense of data but to ensure a question-centered approach, so that insights are actionable and relevant. To this end, cooperation between development experts as well as social scientists and computer scientists is extremely important.
  • Publication
    Three Feet Under
    (World Bank, Washington, DC, 2019-06) Braese, Johannes; Rentschler, Jun; Jones, Nick; Avner, Paolo
    This paper analyses the degree to which infrastructure reliability and urban economic activity in several African cities are impacted by flooding. It combines firm-level micro data, flood maps, and several spatial data layers across cities through a harmonized geospatial network analysis. The analysis shows that a significant share of jobs in cities is directly affected by floods. It further details how transport infrastructure is subjected to significant flood risk that disproportionally affects main roads in many cities. While direct flood effects are revealed to be significant, this work further shows how knock-on implications for the entire urban economy might be even larger. Regardless of the direct flood exposure of firms, flooded transport networks mean that disruptions propagate across the city and drastically reduce the connectivity between firms. Access to hospitals is also found to be reduced significantly, even during relatively light flooding events: from a third of locations in Kampala, floods mean that people would no longer be able to reach hospitals within the "golden hour", a rule of thumb referring to the window of time that maximizes the likelihood of survival after a severe medical incident. Overall, this study showcases the use of high-detail city-level analyses to better understand the localized impacts of natural hazards on urban infrastructure networks.
  • Publication
    Carbon Price Efficiency : Lock-in and Path Dependence in Urban Forms and Transport Infrastructure
    (World Bank, Washington, DC, 2014-06) Rentschler, Jun; Hallegatte, Stéphane; Avner, Paolo
    This paper investigates the effect of carbon or gasoline taxes on commuting-related CO2 emissions in an urban context. To assess the impact of public transport on the efficiency of the tax, the paper investigates two exogenous scenarios using a dynamic urban model (NEDUM-2D) calibrated for the urban area of Paris: (i) a scenario with the current dense public transport infrastructure, and (ii) a scenario without. It is shown that the price elasticity of CO2 emissions is twice as high in the short run if public transport options exist. Reducing commuting-related emissions thus requires lower (and more acceptable) tax levels in the presence of dense public transportation. If the goal of a carbon or gasoline tax is to change behaviors and reduce energy consumption and CO2 emissions (not to raise revenues), then there is an incentive to increase the price elasticity through complementary policies such as public transport development. The emission elasticity also depends on the baseline scenario and is larger when population growth and income growth are high. In the longer run, elasticities are higher and similar in the scenarios with and without public transport, because of larger urban reconfiguration in the latter scenario. These results are policy relevant, especially for fast-growing cities in developing countries. Even for cities where emission reductions are not a priority today, there is an option value attached to a dense public transport network, since it makes it possible to reduce emissions at a lower cost in the future.
  • Publication
    Floods and Urban Connectivity
    (Washington, DC, 2022-06) Maruyama Rentschler, Jun Erik; He, Yiyi; Thies, Stephan Fabian; Nell, Andrew David; Avner, Paolo
    Cities are intricately interconnected socioeconomic systems, with transport networks connecting people to their jobs, health, and education facilities, and ensuring the smooth functioning of supply chains. When floods happen, they isolate people and firms from these vital networks, causing cascading disruptions and losses. Such floods are not limited to rare and extreme events. Especially in developing country cities, the lack of resilient infrastructure systems means that even regular rainfall events, for example, during rainy seasons, can cause havoc. Attention is often biased towards direct asset losses from floods, rather than the wider economic costs of disrupted networks. This is due primarily to the complex dynamics of economic and infrastructure networks. But public transport and road usage data are also often limited, especially when the predominant modes of transport are informal and walking. So how can we identify and prioritize cost-effective measures for urban resilience? This note describes an analytical approach that can help prioritize investments in urban transport resilience and public transport, while also strengthening the economic case for such investments.

Users also downloaded

Showing related downloaded files

  • Publication
    Digital Africa
    (Washington, DC: World Bank, 2023-03-13) Begazo, Tania; Dutz, Mark Andrew; Blimpo, Moussa
    All African countries need better and more jobs for their growing populations. "Digital Africa: Technological Transformation for Jobs" shows that broader use of productivity-enhancing, digital technologies by enterprises and households is imperative to generate such jobs, including for lower-skilled people. At the same time, it can support not only countries’ short-term objective of postpandemic economic recovery but also their vision of economic transformation with more inclusive growth. These outcomes are not automatic, however. Mobile internet availability has increased throughout the continent in recent years, but Africa’s uptake gap is the highest in the world. Areas with at least 3G mobile internet service now cover 84 percent of Africa’s population, but only 22 percent uses such services. And the average African business lags in the use of smartphones and computers as well as more sophisticated digital technologies that catalyze further productivity gains. Two issues explain the usage gap: affordability of these new technologies and willingness to use them. For the 40 percent of Africans below the extreme poverty line, mobile data plans alone would cost one-third of their incomes—in addition to the price of access devices, apps, and electricity. Data plans for small- and medium-size businesses are also more expensive than in other regions. Moreover, shortcomings in the quality of internet services—and in the supply of attractive, skills-appropriate apps that promote entrepreneurship and raise earnings—dampen people’s willingness to use them. For those countries already using these technologies, the development payoffs are significant. New empirical studies for this report add to the rapidly growing evidence that mobile internet availability directly raises enterprise productivity, increases jobs, and reduces poverty throughout Africa. 
    To realize these and other benefits more widely, Africa’s countries must implement complementary and mutually reinforcing policies to strengthen both consumers’ ability to pay and willingness to use digital technologies. These interventions must prioritize productive use to generate large numbers of inclusive jobs in a region poised to benefit from a massive, youthful workforce—one projected to become the world’s largest by the end of this century.
  • Publication
    Predicting Urban Employment Distributions
    (Washington, DC, 2022-06) Maruyama Rentschler, Jun Erik; Barzin, Samira; O’Clery, Neave; Avner, Paolo
    Cities are intricately interconnected socioeconomic systems, with transport networks connecting people to their jobs, health, and education facilities, and ensuring the smooth functioning of supply chains. When floods happen, they isolate people and firms from these vital networks, causing cascading disruptions and losses. Such floods are not limited to rare and extreme events. Especially in developing country cities, the lack of resilient infrastructure systems means that even regular rainfall events, for example, during rainy seasons, can cause havoc. Attention is often biased towards direct asset losses from floods, rather than the wider economic costs of disrupted networks. This is due primarily to the complex dynamics of economic and infrastructure networks. But public transport and road usage data are also often limited, especially when the predominant modes of transport are informal and walking. So how can we identify and prioritize cost-effective measures for urban resilience? This note describes an analytical approach that can help prioritize investments in urban transport resilience and public transport, while also strengthening the economic case for such investments.
  • Publication
    Reclaiming the Lost Century of Growth: Building Learning Economies in Latin America and the Caribbean
    (Washington, DC: World Bank, 2025-06-06) Maloney, William F.; Cirera, Xavier; Ferreyra, Maria Marta
    Update: The Spanish version of the full book was published on September 9, 2025. Latin America and the Caribbean has lost not decades but a century of growth due to its inability to learn—to identify, adapt, and implement the new technologies emerging since the Second Industrial Revolution. Superstars like Argentina, Chile, and Uruguay fell behind peers like France and Germany, while the entire region retrogressed in industries it once dominated and was unable to take advantage of new opportunities that propelled similarly lagging countries to high-income status. The report shows that this remains the case today as the region’s firms continue to lag in assimilating new technologies. However, it argues that Latin America and the Caribbean can reclaim the lost century by building learning economies, creating the human capital, institutions, and incentives needed to increase the demand for knowledge, facilitate the flow of new ideas, and foment the process of experimentation.
  • Publication
    Peru Country Climate and Development Report
    (World Bank, Washington, DC, 2022-11) World Bank Group
    The Peru Country Climate and Development Report (CCDR) provides analysis and recommendations on integrating the country’s efforts to achieve economic development with the pursuit of emission reduction and climate resilience. The CCDR explores opportunities and trade-offs for aligning Peru’s development path with its recent commitments on climate change. Peru is highly vulnerable to climate change and needs urgent adaptation action. Peru can benefit from decarbonization policies, thanks to its mining, forestry and agriculture, and renewable energy resources. Peru has many opportunities to develop and implement comprehensive climate policies that also increase productivity and reduce poverty. A low-carbon, resilient development for Peru would require substantial institutional reforms, in addition to public and private investments.
  • Publication
    Sustainable Urban Transport Financing from the Sidewalk to the Subway
    (Washington, DC: World Bank, 2016) Ardila-Gomez, Arturo; Ortegon-Sanchez, Adriana
    Urban transport systems are essential for economic development and improving citizens' quality of life. To establish high-quality and affordable transport systems, cities must ensure their financial sustainability to fund new investments in infrastructure while also funding maintenance and operation of existing facilities and services. However, many cities in developing countries are stuck in an "underfunding trap" for urban transport, in which large up-front investments are needed for new transport infrastructure that will improve the still small-scale, and perhaps, poor-quality systems, but revenue is insufficient to cover maintenance and operation expenses, let alone new investment projects. The urban transport financing gap in these cities is further widened by the implicit subsidies for the use of private cars, which represent a minority of trips but contribute huge costs in terms of congestion, sprawl, accidents, and pollution. Using an analytical framework based on the concept of "Who Benefits Pays," 24 types of financing instruments are assessed in terms of their social, economic and environmental impacts and their ability to fund urban transport capital investments, operational expenses, and maintenance. Urban transport financing needs to be based on an appropriate mix of complementary financing instruments. In particular for capital investments, a combination of grants (from multiple levels of government) and loans together with investments through public private partnerships could finance large projects that benefit society. Moreover, the property tax emerges as a key financing instrument for capital, operation, and maintenance expenses.
    By choosing the most appropriate mix of financing instruments and focusing on wise investments, cities can design comprehensive financing for all types of urban transport projects, using multi-level innovative revenue sources that promote efficient pricing schemes, increase overall revenue, strengthen sustainable transport, and cover capital investments, operation, and maintenance for all parts of a public transport system, "from the sidewalk to the subway."