Publication:
Where Are All the Jobs? A Machine Learning Approach for High Resolution Urban Employment Prediction in Developing Countries

Files in English
English PDF (8.59 MB)
English Text (114.72 KB)
Date
2022-03
Published
2022-03
Author(s)
Barzin, Samira
O’Clery, Neave
Abstract
Globally, both people and economic activity are increasingly concentrated in urban areas. Yet, for the vast majority of developing country cities, little is known about the granular spatial organization of such activity despite its key importance to policy and urban planning. This paper adapts a machine learning based algorithm to predict the spatial distribution of employment using input data from open access sources such as Open Street Map and Google Earth Engine. The algorithm is trained on 14 test cities, ranging from Buenos Aires in Argentina to Dakar in Senegal. A spatial adaptation of the random forest algorithm is used to predict within-city cells in the 14 test cities with extremely high accuracy (R-squared greater than 95 percent), and cells in out-of-sample "unseen" cities with high accuracy (mean R-squared of 63 percent). This approach uses open data to produce high resolution estimates of the distribution of urban employment for cities where such information does not exist, making evidence-based planning more accessible than ever before.
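The abstract reports two kinds of accuracy: R-squared for cells within the training cities, and a mean R-squared across "unseen" cities. The sketch below illustrates the second set-up under stated assumptions: it holds out every cell of one city at a time, fits on the remaining cities, and averages the per-city R-squared. It is not the paper's implementation. The paper uses a spatial adaptation of random forest; here a one-variable least-squares line stands in for that model so the example stays dependency-free. The field names (`city`, `feature`, `jobs`), the helper functions, and the toy numbers are all hypothetical.

```python
from collections import defaultdict
from statistics import mean


def r_squared(y_true, y_pred):
    """Coefficient of determination: 1 - SS_res / SS_tot."""
    y_bar = mean(y_true)
    ss_res = sum((t - p) ** 2 for t, p in zip(y_true, y_pred))
    ss_tot = sum((t - y_bar) ** 2 for t in y_true)
    return 1.0 - ss_res / ss_tot


def leave_one_city_out(cells):
    """Yield (city, train, test), holding out all cells of one city at a time."""
    by_city = defaultdict(list)
    for cell in cells:
        by_city[cell["city"]].append(cell)
    for city, held_out in by_city.items():
        train = [c for other, group in by_city.items() if other != city for c in group]
        yield city, train, held_out


def fit_line(cells):
    """Least-squares line of jobs on one feature (a stand-in, NOT a random forest)."""
    xs = [c["feature"] for c in cells]
    ys = [c["jobs"] for c in cells]
    x_bar, y_bar = mean(xs), mean(ys)
    slope = (sum((x - x_bar) * (y - y_bar) for x, y in zip(xs, ys))
             / sum((x - x_bar) ** 2 for x in xs))
    return lambda x: y_bar + slope * (x - x_bar)


# Hypothetical toy grid cells: one made-up feature and a made-up job count per cell.
cells = [
    {"city": "A", "feature": 1.0, "jobs": 10.0},
    {"city": "A", "feature": 2.0, "jobs": 21.0},
    {"city": "A", "feature": 3.0, "jobs": 29.0},
    {"city": "B", "feature": 1.0, "jobs": 12.0},
    {"city": "B", "feature": 2.0, "jobs": 19.0},
    {"city": "B", "feature": 3.0, "jobs": 33.0},
    {"city": "C", "feature": 2.0, "jobs": 22.0},
    {"city": "C", "feature": 4.0, "jobs": 41.0},
    {"city": "C", "feature": 6.0, "jobs": 58.0},
]

# Score each held-out city with a model that never saw any of its cells.
scores = {}
for city, train, test in leave_one_city_out(cells):
    predict = fit_line(train)
    y_true = [c["jobs"] for c in test]
    y_pred = [predict(c["feature"]) for c in test]
    scores[city] = r_squared(y_true, y_pred)

mean_score = mean(scores.values())
print(scores, mean_score)
```

Splitting by city rather than by cell is the point of the design: a random split over cells would let the model see neighbours of every test cell, which measures within-city accuracy rather than transfer to a city with no employment data.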
Citation
Barzin, Samira; Rentschler, Jun; O’Clery, Neave; Avner, Paolo. 2022. Where Are All the Jobs ?: A Machine Learning Approach for High Resolution Urban Employment Prediction in Developing Countries. Policy Research Working Paper;9979. © World Bank. http://hdl.handle.net/10986/37195 License: CC BY 3.0 IGO.
Report Series
Other publications in this report series
  • Publication
    Intergenerational Income Mobility around the World
    (Washington, DC: World Bank, 2025-07-09) Munoz, Ercio; Van der Weide, Roy
    This paper introduces a new global database with estimates of intergenerational income mobility for 87 countries, covering 84 percent of the world’s population. This marks a notable expansion of the cross-country evidence base on income mobility, particularly among low- and middle-income countries. The estimates indicate that the negative association between income mobility and inequality (known as the Great Gatsby Curve) continues to hold across this wider range of countries. The database also reveals a positive association between income mobility and national income per capita, suggesting that countries achieve higher levels of intergenerational mobility as they grow richer.
  • Publication
    The Impact of Trade Promotion Organizations on Exports
    (Washington, DC: World Bank, 2025-08-13) Choi, Yewon; Fernandes, Ana Margarida; Grover, Arti; Iacovone, Leonardo; Olarreaga, Marcelo
    This paper examines the impact of trade promotion organizations on exports during the COVID-19 pandemic using a World Bank survey. The results suggest that increased trade promotion organization budgets significantly boosted exports during downturns but had no effect during the recovery phase. Interestingly, e-commerce programs adopted by trade promotion organizations negatively affected exports during downturns as they diverted resources away from productive support, especially for sectors not intensive in online trade. These findings suggest that countercyclical trade promotion organizations budgets may enhance trade resilience during similar global shocks.
  • Publication
    The Future of Poverty
    (Washington, DC: World Bank, 2025-07-15) Fajardo-Gonzalez, Johanna; Nguyen, Minh C.; Corral, Paul
    Climate change is increasingly acknowledged as a critical issue with far-reaching socioeconomic implications that extend well beyond environmental concerns. Among the most pressing challenges is its impact on global poverty. This paper projects the potential impacts of unmitigated climate change on global poverty rates between 2023 and 2050. Building on a study that provided a detailed analysis of how temperature changes affect economic productivity, this paper integrates those findings with binned data from 217 countries, sourced from the World Bank’s Poverty and Inequality Platform. By simulating poverty rates and the number of poor under two climate change scenarios, the paper uncovers some alarming trends. One of the primary findings is that the number of people living in extreme poverty worldwide could be nearly doubled due to climate change. In all scenarios, Sub-Saharan Africa is projected to bear the brunt, contributing the largest number of poor people, with estimates ranging between 40.5 million and 73.5 million by 2050. Another significant finding is the disproportionate impact of inequality on poverty. Even small increases in inequality can lead to substantial rises in poverty levels. For instance, if every country’s Gini coefficient increases by just 1 percent between 2022 and 2050, an additional 8.8 million people could be pushed below the international poverty line by 2050. In a more extreme scenario, where every country’s Gini coefficient increases by 10 percent between 2022 and 2050, the number of people falling into poverty could rise by an additional 148.8 million relative to the baseline scenario. These findings underscore the urgent need for comprehensive climate policies that not only mitigate environmental impacts but also address socioeconomic vulnerabilities.
  • Publication
    The Macroeconomic Implications of Climate Change Impacts and Adaptation Options
    (Washington, DC: World Bank, 2025-05-29) Abalo, Kodzovi; Boehlert, Brent; Bui, Thanh; Burns, Andrew; Castillo, Diego; Chewpreecha, Unnada; Haider, Alexander; Hallegatte, Stephane; Jooste, Charl; McIsaac, Florent; Ruberl, Heather; Smet, Kim; Strzepek, Ken
    Estimating the macroeconomic implications of climate change impacts and adaptation options is a topic of intense research. This paper presents a framework in the World Bank's macrostructural model to assess climate-related damages. This approach has been used in many Country Climate and Development Reports, a World Bank diagnostic that identifies priorities to ensure continued development in spite of climate change and climate policy objectives. The methodology captures a set of impact channels through which climate change affects the economy by (1) connecting a set of biophysical models to the macroeconomic model and (2) exploring a set of development and climate scenarios. The paper summarizes the results for five countries, highlighting the sources and magnitudes of their vulnerability --- with estimated gross domestic product losses in 2050 exceeding 10 percent of gross domestic product in some countries and scenarios, although only a small set of impact channels is included. The paper also presents estimates of the macroeconomic gains from sector-level adaptation interventions, considering their upfront costs and avoided climate impacts and finding significant net gross domestic product gains from adaptation opportunities identified in the Country Climate and Development Reports. Finally, the paper discusses the limits of current modeling approaches, and their complementarity with empirical approaches based on historical data series. The integrated modeling approach proposed in this paper can inform policymakers as they make proactive decisions on climate change adaptation and resilience.
  • Publication
    Climate Vulnerability and Job Accessibility
    (Washington, DC: World Bank, 2025-08-11) Iimi, Atsushi
    Many developing cities are facing rapid population growth and extreme climate events. This paper examines the link between job accessibility and climate vulnerability, using data from Antananarivo, Madagascar, which frequently experiences flooding. As in other countries, the analysis finds that men’s commutes are longer than women’s, who tend to walk to work or use public transport. Even after controlling for observables and the potential endogeneity bias associated with commute time, the findings show that climate vulnerability negatively impacts wages, as people avoid commuting long to work due to anticipated potential climate risks. Building climate resilience into urban transport is therefore essential. As predicted by theory, the evidence also shows that the value of commuting is positive, and walking is disadvantageous. Motorized commuting yields higher returns, which could lead to overuse of private cars and taxis, posing decarbonization challenges.

Related items

Showing items related by metadata.

  • Publication
    Predicting Urban Employment Distributions
    (Washington, DC, 2022-06) Maruyama Rentschler, Jun Erik; Barzin, Samira; O’Clery, Neave; Avner, Paolo
    Cities are intricately interconnected socioeconomic systems, with transport networks connecting people to their jobs, health, and education facilities, and ensuring the smooth functioning of supply chains. When floods happen, they isolate people and firms from these vital networks, causing cascading disruptions and losses. Such floods are not limited to rare and extreme events. Especially in developing country cities, the lack of resilient infrastructure systems means that even regular rainfall events, for example, during rainy seasons, can cause havoc. Attention is often biased towards direct asset losses from floods, rather than the wider economic costs of disrupted networks. This is due primarily to the complex dynamics of economic and infrastructure networks. But public transport and road usage data are also often limited, especially when the predominant modes of transport are informal and walking. So how can we identify and prioritize cost-effective measures for urban resilience? This note describes an analytical approach that can help prioritize investments in urban transport resilience and public transport, while also strengthening the economic case for such investments.
  • Publication
    Central America : Big Data in Action for Development
    (Washington, DC, 2014) World Bank
    This report stemmed from a World Bank pilot activity to explore the potential of big data to address development challenges in Central American countries. As part of this activity we collected and analyzed a number of examples of leveraging big data for development. Because of the growing interest in this topic this report makes available to a broader audience those examples as well as the underlying conceptual framework to think about big data for development. To make effective use of big data, many practitioners emphasize the importance of beginning with a question instead of the data itself. A question clarifies the purpose of utilizing big data, whether it is for awareness, understanding, and/or forecasting. In addition, a question suggests the kinds of real-world behaviors or conditions that are of interest. These behaviors are encoded into data through some generating process which includes the media through which behavior is captured. Then various data sources are accessed, prepared, consolidated and analyzed. This ultimately gives rise to insights into the question of interest, which are implemented to effect changes in the relevant behaviors. Utilizing big data for any given endeavor requires a host of capabilities. Hardware and software capabilities are needed for interaction of data from a variety of sources in a way which is efficient and scalable. Human capabilities are needed not only to make sense of data but to ensure a question-centered approach, so that insights are actionable and relevant. To this end, cooperation between development experts as well as social scientists and computer scientists is extremely important.
  • Publication
    Three Feet Under
    (World Bank, Washington, DC, 2019-06) Braese, Johannes; Rentschler, Jun; Jones, Nick; Avner, Paolo
    This paper analyses the degree to which infrastructure reliability and urban economic activity in several African cities is impacted by flooding. It combines firm-level micro data, flood maps, and several spatial data layers across cities through a harmonized geospatial network analysis. The analysis shows that a significant share of jobs in cities is directly affected by floods. It further details how transport infrastructure is subjected to significant flood risk that disproportionally affects main roads in many cities. While direct flood effects are revealed to be significant, this work further shows how knock-on implications for the entire urban economy might be even larger. Regardless of the direct flood exposure of firms, flooded transport networks mean that disruptions propagate across the city and drastically reduce the connectivity between firms. Access to hospitals is also found to be reduced significantly -- even during relatively light flooding events: From a third of locations in Kampala, floods mean that people would no longer be able to reach hospitals within the "golden hour" -- a rule of thumb referring to the window of time that maximizes the likelihood of survival after a severe medical incident. Overall, this study showcases the use of high-detail city-level analyses to better understand the localized impacts of natural hazards on urban infrastructure networks.
  • Publication
    Carbon Price Efficiency : Lock-in and Path Dependence in Urban Forms and Transport Infrastructure
    (World Bank, Washington, DC, 2014-06) Rentschler, Jun; Hallegatte, Stéphane; Avner, Paolo
    This paper investigates the effect of carbon or gasoline taxes on commuting-related CO2 emissions in an urban context. To assess the impact of public transport on the efficiency of the tax, the paper investigates two exogenous scenarios using a dynamic urban model (NEDUM-2D) calibrated for the urban area of Paris: (i) a scenario with the current dense public transport infrastructure, and (ii) a scenario without. It is shown that the price elasticity of CO2 emissions is twice as high in the short run if public transport options exist. Reducing commuting-related emissions thus requires lower (and more acceptable) tax levels in the presence of dense public transportation. If the goal of a carbon or gasoline tax is to change behaviors and reduce energy consumption and CO2 emissions (not to raise revenues), then there is an incentive to increase the price elasticity through complementary policies such as public transport development. The emission elasticity also depends on the baseline scenario and is larger when population growth and income growth are high. In the longer run, elasticities are higher and similar in the scenarios with and without public transport, because of larger urban reconfiguration in the latter scenario. These results are policy relevant, especially for fast-growing cities in developing countries. Even for cities where emission reductions are not a priority today, there is an option value attached to a dense public transport network, since it makes it possible to reduce emissions at a lower cost in the future.
  • Publication
    Floods and Urban Connectivity
    (Washington, DC, 2022-06) Maruyama Rentschler, Jun Erik; He, Yiyi; Thies, Stephan Fabian; Nell, Andrew David; Avner, Paolo
    Cities are intricately interconnected socioeconomic systems, with transport networks connecting people to their jobs, health, and education facilities, and ensuring the smooth functioning of supply chains. When floods happen, they isolate people and firms from these vital networks, causing cascading disruptions and losses. Such floods are not limited to rare and extreme events. Especially in developing country cities, the lack of resilient infrastructure systems means that even regular rainfall events, for example, during rainy seasons, can cause havoc. Attention is often biased towards direct asset losses from floods, rather than the wider economic costs of disrupted networks. This is due primarily to the complex dynamics of economic and infrastructure networks. But public transport and road usage data are also often limited, especially when the predominant modes of transport are informal and walking. So how can we identify and prioritize cost-effective measures for urban resilience? This note describes an analytical approach that can help prioritize investments in urban transport resilience and public transport, while also strengthening the economic case for such investments.

Users also downloaded

Showing related downloaded files

  • Publication
    Global Economic Prospects, January 2025
    (Washington, DC: World Bank, 2025-01-16) World Bank
    Global growth is expected to hold steady at 2.7 percent in 2025-26. However, the global economy appears to be settling at a low growth rate that will be insufficient to foster sustained economic development—with the possibility of further headwinds from heightened policy uncertainty and adverse trade policy shifts, geopolitical tensions, persistent inflation, and climate-related natural disasters. Against this backdrop, emerging market and developing economies are set to enter the second quarter of the twenty-first century with per capita incomes on a trajectory that implies substantially slower catch-up toward advanced-economy living standards than they previously experienced. Without course corrections, most low-income countries are unlikely to graduate to middle-income status by the middle of the century. Policy action at both global and national levels is needed to foster a more favorable external environment, enhance macroeconomic stability, reduce structural constraints, address the effects of climate change, and thus accelerate long-term growth and development.
  • Publication
    Digital Progress and Trends Report 2023
    (Washington, DC: World Bank, 2024-03-05) World Bank
    Digitalization is the transformational opportunity of our time. The digital sector has become a powerhouse of innovation, economic growth, and job creation. Value added in the IT services sector grew at 8 percent annually during 2000–22, nearly twice as fast as the global economy. Employment growth in IT services reached 7 percent annually, six times higher than total employment growth. The diffusion and adoption of digital technologies are just as critical as their invention. Digital uptake has accelerated since the COVID-19 pandemic, with 1.5 billion new internet users added from 2018 to 2022. The share of firms investing in digital solutions around the world has more than doubled from 2020 to 2022. Low-income countries, vulnerable populations, and small firms, however, have been falling behind, while transformative digital innovations such as artificial intelligence (AI) have been accelerating in higher-income countries. Although more than 90 percent of the population in high-income countries was online in 2022, only one in four people in low-income countries used the internet, and the speed of their connection was typically only a small fraction of that in wealthier countries. As businesses in technologically advanced countries integrate generative AI into their products and services, less than half of the businesses in many low- and middle-income countries have an internet connection. The growing digital divide is exacerbating the poverty and productivity gaps between richer and poorer economies. The Digital Progress and Trends Report series will track global digitalization progress and highlight policy trends, debates, and implications for low- and middle-income countries. The series adds to the global efforts to study the progress and trends of digitalization in two main ways: (1) by compiling, curating, and analyzing data from diverse sources to present a comprehensive picture of digitalization in low- and middle-income countries, including in-depth analyses on understudied topics; and (2) by developing insights on policy opportunities, challenges, and debates and reflecting the perspectives of various stakeholders and the World Bank’s operational experiences. This report, the first in the series, aims to inform evidence-based policy making and motivate action among internal and external audiences and stakeholders. The report will bring global attention to high-performing countries that have valuable experience to share as well as to areas where efforts will need to be redoubled.
  • Publication
    Global Economic Prospects, June 2025
    (Washington, DC: World Bank, 2025-06-10) World Bank
    The global economy is facing another substantial headwind, emanating largely from an increase in trade tensions and heightened global policy uncertainty. For emerging market and developing economies (EMDEs), the ability to boost job creation and reduce extreme poverty has declined. Key downside risks include a further escalation of trade barriers and continued policy uncertainty. These challenges are exacerbated by subdued foreign direct investment into EMDEs. Global cooperation is needed to restore a more stable international trade environment and scale up support for vulnerable countries grappling with conflict, debt burdens, and climate change. Domestic policy action is also critical to contain inflation risks and strengthen fiscal resilience. To accelerate job creation and long-term growth, structural reforms must focus on raising institutional quality, attracting private investment, and strengthening human capital and labor markets. Countries in fragile and conflict situations face daunting development challenges that will require tailored domestic policy reforms and well-coordinated multilateral support.
  • Publication
    Business Ready 2024
    (Washington, DC: World Bank, 2024-10-03) World Bank
    Business Ready (B-READY) is a new World Bank Group corporate flagship report that evaluates the business and investment climate worldwide. It replaces and improves upon the Doing Business project. B-READY provides a comprehensive data set and description of the factors that strengthen the private sector, not only by advancing the interests of individual firms but also by elevating the interests of workers, consumers, potential new enterprises, and the natural environment. This 2024 report introduces a new analytical framework that benchmarks economies based on three pillars: Regulatory Framework, Public Services, and Operational Efficiency. The analysis centers on 10 topics essential for private sector development that correspond to various stages of the life cycle of a firm. The report also offers insights into three cross-cutting themes that are relevant for modern economies: digital adoption, environmental sustainability, and gender. B-READY draws on a robust data collection process that includes specially tailored expert questionnaires and firm-level surveys. The 2024 report, which covers 50 economies, serves as the first in a series that will expand in geographical coverage and refine its methodology over time, supporting reform advocacy, policy guidance, and further analysis and research.
  • Publication
    The Container Port Performance Index 2023
    (Washington, DC: World Bank, 2024-07-18) World Bank
    The Container Port Performance Index (CPPI) measures the time container ships spend in port, making it an important point of reference for stakeholders in the global economy. These stakeholders include port authorities and operators, national governments, supranational organizations, development agencies, and other public and private players in trade and logistics. The index highlights where vessel time in container ports could be improved. Streamlining these processes would benefit all parties involved, including shipping lines, national governments, and consumers. This fourth edition of the CPPI relies on data from 405 container ports with at least 24 container ship port calls in the calendar year 2023. As in earlier editions of the CPPI, the ranking employs two different methodological approaches: an administrative (technical) approach and a statistical approach (using matrix factorization). Combining these two approaches ensures that the overall ranking of container ports reflects actual port performance as closely as possible while also being statistically robust. The CPPI methodology assesses the sequential steps of a container ship port call. ‘Total port hours’ refers to the total time elapsed from the moment a ship arrives at the port until the vessel leaves the berth after completing its cargo operations. The CPPI uses time as an indicator because time is very important to shipping lines, ports, and the entire logistics chain. However, time, as captured by the CPPI, is not the only way to measure port efficiency, so it does not tell the entire story of a port’s performance. Factors that can influence the time vessels spend in ports can be location-specific and under the port’s control (endogenous) or external and beyond the control of the port (exogenous). The CPPI measures time spent in container ports, strictly based on quantitative data only, which do not reveal the underlying factors or root causes of extended port times. A detailed port-specific diagnostic would be required to assess the contribution of underlying factors to the time a vessel spends in port. A very low ranking or a significant change in ranking may warrant special attention, for which the World Bank generally recommends a detailed diagnostic.