Publication: Identifying Urban Areas by Combining Data from the Ground and from Outer Space: An Application to India
Loading...
Published
2018-10
ISSN
Date
2018-11-01
Author(s)
Editor(s)
Abstract
This paper develops a tractable method to identify urban areas and applies it to India, where urbanization is messy. Google Earth images are assessed subjectively to determine whether a stratified large sample of Indian cities, towns and villages, as officially defined, are urban or rural in practice. Based on these assessments, a regression analysis combines two sources of information—data from georeferenced population censuses and data from satellite imagery—to identify the correlates of units in the sample being urban. The resulting model is used to predict whether the other units in the country are urban or rural in practice. Contrary to frequent claims, India is not substantially more urban than implied by census data. And the speed of urbanization is only marginally higher than official statistics suggest. But a considerable number of locations are misclassified in the midrange between villages and state capitals. The results confirm the value of combining subjective assessments with data from these different sources.
Link to Data Set
Citation
“Galdo, Virgilio; Li, Yue; Rama, Martin. 2018. Identifying Urban Areas by Combining Data from the Ground and from Outer Space: An Application to India. Policy Research Working Paper;No. 8628. © World Bank. http://hdl.handle.net/10986/30648 License: CC BY 3.0 IGO.”
Associated URLs
Associated content
Other publications in this report series
Publication Gender Gaps in the Performance of Small Firms: Evidence from Urban Peru(Washington, DC: World Bank, 2025-09-23)This paper estimates the gender gap in the performance of firms in Peru using representative data on both formal and informal firms. On average, informal female-led firms have lower sales, labor productivity, and profits compared to their male-led counterparts, with differences more pronounced when controlling for observable determinants of firm performance. However, gender gaps are only significant at the bottom of the performance distribution of informal firms, and these gaps disappear at the top of the distribution of informal firms and for formal firms. Possible explanations for the performance gaps at the bottom of the distribution include the higher likelihood of small, female-led firms being home-based, which is linked to lower profits, and their concentration in less profitable sectors. The paper provides suggestive evidence that household responsibilities play a key role in explaining the gender gap in firm performance among informal firms. Therefore, policies that promote access to care services or foster a more equal distribution of household activities may reduce gender productivity gaps and allow for a more efficient allocation of resources.Publication Global Poverty Revisited Using 2021 PPPs and New Data on Consumption(Washington, DC: World Bank, 2025-06-05)Recent improvements in survey methodologies have increased measured consumption in many low- and lower-middle-income countries that now collect a more comprehensive measure of household consumption. Faced with such methodological changes, countries have frequently revised upward their national poverty lines to make them appropriate for the new measures of consumption. This in turn affects the World Bank’s global poverty lines when they are periodically revised. The international poverty line, which is based on the typical poverty line in low-income countries, increases by around 40 percent to $3.00 when the more recent national poverty lines as well as the 2021 purchasing power parities are incorporated. The net impact of the changes in international prices, the poverty line, and new survey data (including new data for India) is an increase in global extreme poverty by some 125 million people in 2022, and a significant shift of poverty away from South Asia and toward Sub-Saharan Africa. The changes at higher poverty lines, which are more relevant to middle-income countries, are mixed.Publication Intergenerational Income Mobility around the World(Washington, DC: World Bank, 2025-07-09)This paper introduces a new global database with estimates of intergenerational income mobility for 87 countries, covering 84 percent of the world’s population. This marks a notable expansion of the cross-country evidence base on income mobility, particularly among low- and middle-income countries. The estimates indicate that the negative association between income mobility and inequality (known as the Great Gatsby Curve) continues to hold across this wider range of countries. The database also reveals a positive association between income mobility and national income per capita, suggesting that countries achieve higher levels of intergenerational mobility as they grow richer.Publication The Macroeconomic Implications of Climate Change Impacts and Adaptation Options(Washington, DC: World Bank, 2025-05-29)Estimating the macroeconomic implications of climate change impacts and adaptation options is a topic of intense research. This paper presents a framework in the World Bank's macrostructural model to assess climate-related damages. This approach has been used in many Country Climate and Development Reports, a World Bank diagnostic that identifies priorities to ensure continued development in spite of climate change and climate policy objectives. The methodology captures a set of impact channels through which climate change affects the economy by (1) connecting a set of biophysical models to the macroeconomic model and (2) exploring a set of development and climate scenarios. The paper summarizes the results for five countries, highlighting the sources and magnitudes of their vulnerability --- with estimated gross domestic product losses in 2050 exceeding 10 percent of gross domestic product in some countries and scenarios, although only a small set of impact channels is included. The paper also presents estimates of the macroeconomic gains from sector-level adaptation interventions, considering their upfront costs and avoided climate impacts and finding significant net gross domestic product gains from adaptation opportunities identified in the Country Climate and Development Reports. Finally, the paper discusses the limits of current modeling approaches, and their complementarity with empirical approaches based on historical data series. The integrated modeling approach proposed in this paper can inform policymakers as they make proactive decisions on climate change adaptation and resilience.Publication The Impact of Atlantic Hurricanes on Business Activity(Washington, DC: World Bank, 2025-09-22)This paper quantifies the short-run economic impact of 21 Atlantic hurricanes on U.S. local business activity from 2017 to 2024 using anonymized Mastercard transaction data aggregated by ZIP code. On average, hurricanes reduce merchant sales by 12.4 percent during the preparation, impact, and recovery phases—an estimated US$1.38 billion in lost revenue per storm. Substitution in spending across nearby areas or large online platforms is limited, indicating widespread local consumption declines. Economic disruption varies more by industry than storm intensity, with independent stores hit harder than chains. Local businesses with larger online presence face smaller, shorter sales declines, showing greater resilience.
Journal
Journal Volume
Journal Issue
Collections
Related items
Showing items related by metadata.
Publication Identifying Urban Areas by Combining Human Judgment and Machine Learning(World Bank, Washington, DC, 2020-02)This paper proposes a methodology for identifying urban areas that combines subjective assessments with machine learning, and applies it to India, a country where several studies see the official urbanization rate as an under-estimate. For a representative sample of cities, towns and villages, as administratively defined, human judgment of Google images is used to determine whether they are urban or rural in practice. Judgments are collected across four groups of assessors, differing in their familiarity with India and with urban issues, following two different protocols. The judgment-based classification is then combined with data from the population census and from satellite imagery to predict the urban status of the sample. The Logit model, and LASSO and random forests methods, are applied. These approaches are then used to decide whether each of the out-of-sample administrative units in India is urban or rural in practice. The analysis does not find that India is substantially more urban than officially claimed. However, there are important differences at more disaggregated levels, with “other towns” and “census towns” being more rural, and some southern states more urban, than is officially claimed. The consistency of human judgment across assessors and protocols, the easy availability of crowd-sourcing, and the stability of predictions across approaches, suggest that the proposed methodology is a promising avenue for studying urban issues.Publication Measuring Districts' Monthly Economic Activity from Outer Space(World Bank, Washington, DC, 2018-07)Evening-hour luminosity observed using satellites is a good proxy for economic activity. The strengths of measuring economic activity using nightlight measurements include that the data capture informal activity, are available in near real-time, are cheap to obtain, and can be used to conduct very spatially granular analysis. This paper presents a measure of monthly economic activity at the district level based on cleaned Visible Infrared Imaging Radiometer Suite nightlight and rural population. The paper demonstrates that this new method can shed light on recent episodes in South Asia: first, the 2015 earthquake in Nepal; second, demonetization in India; and, third, violent conflict outbreaks in Afghanistan.Publication Estimating Small Area Poverty and Welfare Indicators in Timor-Leste Using Satellite Imagery Data(World Bank, Washington, DC, 2020-09-28)This report is structured as follows: an in-depth explanation of the FHSAE method is presented in section two. Section three reviews the sub-district level data used in this study, which includes imprecise TL-SLS and DHS direct estimates, as well as satellite imagery data used in this study. The variable selection method used for the FHSAE model in this model is explained in section four. Section five provides the results of the FHSAE exercise on poverty estimates, average real per capita consumption and welfare index, presenting them in the graphical maps. Section six concludes.Publication Big Data and Thriving Cities(World Bank, Washington, DC, 2017-03)The recent global diffusion of new technologies, combined with the use of big data analytics, can help policymakers promote the effective development of future cities that provide living and work environments in which citizens can thrive. In particular, innovative applications of geospatial and sensing technologies and the penetration of mobile phone technology are providing unprecedented data collection This data can be analyzed for many purposes, including tracking population and mobility, private sector investment, and transparency in federal and local government. To help development practitioners within and beyond the World Bank take advantage of these trends, this brief profiles a sample of big data applications to support improved urban development in low- and middle-income countries. It also cites potential opportunities for big data analytics to help developing nations achieve sustainable urban growth, while reducing the economic differential with high-income countries.Publication Conflict and the Composition of Economic Activity in Afghanistan(World Bank, Washington, DC, 2020-03)Despite informality being the norm in conflict-affected countries, most estimates of the impact of conflict on economic activity rely on formal sector data. Using high-frequency data from Afghanistan, this paper assesses how surges in conflict intensity affect not only the formal sector, but also informal and illicit activities. Nighttime light provides a proxy for aggregate economic activity, mobile phone traffic by registered firms captures fluctuations in formal sector output, and the land surface devoted to poppy cultivation gives a measure of illicit production. The unit of observation is the district and the period of reference is 2012-16. The same dynamic specification and controls are used for the estimation in the three cases, making the results comparable across sectors. Controls include the presence of combat troops and the level of foreign aid at the local level, which both influence local living standards in Afghanistan. The results show that an increase in conflict-related casualties has a strong negative impact on formal economic activity in the following quarter and a positive effect on illicit activity after two quarters. The impact on aggregate economic activity is negative, but more muted.
Users also downloaded
Showing related downloaded files
Publication World Development Report 2006(Washington, DC, 2005)This year’s Word Development Report (WDR), the twenty-eighth, looks at the role of equity in the development process. It defines equity in terms of two basic principles. The first is equal opportunities: that a person’s chances in life should be determined by his or her talents and efforts, rather than by pre-determined circumstances such as race, gender, social or family background. The second principle is the avoidance of extreme deprivation in outcomes, particularly in health, education and consumption levels. This principle thus includes the objective of poverty reduction. The report’s main message is that, in the long run, the pursuit of equity and the pursuit of economic prosperity are complementary. In addition to detailed chapters exploring these and related issues, the Report contains selected data from the World Development Indicators 2005‹an appendix of economic and social data for over 200 countries. This Report offers practical insights for policymakers, executives, scholars, and all those with an interest in economic development.Publication Argentina Country Climate and Development Report(World Bank, Washington, DC, 2022-11)The Argentina Country Climate and Development Report (CCDR) explores opportunities and identifies trade-offs for aligning Argentina’s growth and poverty reduction policies with its commitments on, and its ability to withstand, climate change. It assesses how the country can: reduce its vulnerability to climate shocks through targeted public and private investments and adequation of social protection. The report also shows how Argentina can seize the benefits of a global decarbonization path to sustain a more robust economic growth through further development of Argentina’s potential for renewable energy, energy efficiency actions, the lithium value chain, as well as climate-smart agriculture (and land use) options. Given Argentina’s context, this CCDR focuses on win-win policies and investments, which have large co-benefits or can contribute to raising the country’s growth while helping to adapt the economy, also considering how human capital actions can accompany a just transition.Publication The Journey Ahead(Washington, DC: World Bank, 2024-10-31)The Journey Ahead: Supporting Successful Migration in Europe and Central Asia provides an in-depth analysis of international migration in Europe and Central Asia (ECA) and the implications for policy making. By identifying challenges and opportunities associated with migration in the region, it aims to inform a more nuanced, evidencebased debate on the costs and benefits of cross-border mobility. Using data-driven insights and new analysis, the report shows that migration has been an engine of prosperity and has helped address some of ECA’s demographic and socioeconomic disparities. Yet, migration’s full economic potential remains untapped. The report identifies multiple barriers keeping migration from achieving its full potential. Crucially, it argues that policies in both origin and destination countries can help maximize the development impacts of migration and effectively manage the economic, social, and political costs. Drawing from a wide range of literature, country experiences, and novel analysis, The Journey Ahead presents actionable policy options to enhance the benefits of migration for destination and origin countries and migrants themselves. Some measures can be taken unilaterally by countries, whereas others require close bilateral or regional coordination. The recommendations are tailored to different types of migration— forced displacement as well as high-skilled and low-skilled economic migration—and from the perspectives of both sending and receiving countries. This report serves as a comprehensive resource for governments, development partners, and other stakeholders throughout Europe and Central Asia, where the richness and diversity of migration experiences provide valuable insights for policy makers in other regions of the world.Publication Classroom Assessment to Support Foundational Literacy(Washington, DC: World Bank, 2025-03-21)This document focuses primarily on how classroom assessment activities can measure students’ literacy skills as they progress along a learning trajectory towards reading fluently and with comprehension by the end of primary school grades. The document addresses considerations regarding the design and implementation of early grade reading classroom assessment, provides examples of assessment activities from a variety of countries and contexts, and discusses the importance of incorporating classroom assessment practices into teacher training and professional development opportunities for teachers. The structure of the document is as follows. The first section presents definitions and addresses basic questions on classroom assessment. Section 2 covers the intersection between assessment and early grade reading by discussing how learning assessment can measure early grade reading skills following the reading learning trajectory. Section 3 compares some of the most common early grade literacy assessment tools with respect to the early grade reading skills and developmental phases. Section 4 of the document addresses teacher training considerations in developing, scoring, and using early grade reading assessment. Additional issues in assessing reading skills in the classroom and using assessment results to improve teaching and learning are reviewed in section 5. Throughout the document, country cases are presented to demonstrate how assessment activities can be implemented in the classroom in different contexts.Publication Lebanon Economic Monitor, Fall 2022(Washington, DC, 2022-11)The economy continues to contract, albeit at a somewhat slower pace. Public finances improved in 2021, but only because spending collapsed faster than revenue generation. Testament to the continued atrophy of Lebanon’s economy, the Lebanese Pound continues to depreciate sharply. The sharp deterioration in the currency continues to drive surging inflation, in triple digits since July 2020, impacting the poor and vulnerable the most. An unprecedented institutional vacuum will likely further delay any agreement on crisis resolution and much needed reforms; this includes prior actions as part of the April 2022 International Monetary Fund (IMF) staff-level agreement (SLA). Divergent views among key stakeholders on how to distribute the financial losses remains the main bottleneck for reaching an agreement on a comprehensive reform agenda. Lebanon needs to urgently adopt a domestic, equitable, and comprehensive solution that is predicated on: (i) addressing upfront the balance sheet impairments, (ii) restoring liquidity, and (iii) adhering to sound global practices of bail-in solutions based on a hierarchy of creditors (starting with banks’ shareholders) that protects small depositors.