Policy Research Working Paper 9888 How Much Does Physical Infrastructure Contribute to Economic Growth? An Empirical Analysis Govinda Timilsina David I. Stern Debasish K. Das Development Economics Development Research Group December 2021 Policy Research Working Paper 9888 Abstract Existing literature on the relationship between infrastruc- pooled mean group estimator, which tests for the weak ture and economic growth is inconclusive. This study exogeneity of the infrastructure variables, is employed. The evaluates the contributions to economic growth of three key finding of the study is that an increase in infrastructure, main categories of infrastructure—transport, electricity, especially electricity generation capacity and telecommu- and telecommunications—using data from 87 countries nications, has significant positive effects on gross domestic over 1992–2017. Compared with existing studies, this product. Infrastructure has a larger effect in more recent study uses more recent data, includes new types of infra- years (1992–2017) than in earlier years (1970–1991), and structure such as mobile phones, and provides separate the effects of infrastructure are higher in developing econ- estimates for developing and developed countries. The omies than in industrialized economies. This paper is a product of the Development Research Group, Development Economics. It is part of a larger effort by the World Bank to provide open access to its research and make a contribution to development policy discussions around the world. Policy Research Working Papers are also posted on the Web at http://www.worldbank.org/prwp. The authors may be contacted at gtimilsina@worldbank.org. The Policy Research Working Paper Series disseminates the findings of work in progress to encourage the exchange of ideas about development issues. An objective of the series is to get the findings out quickly, even if the presentations are less than fully polished. The papers carry the names of the authors and should be cited accordingly. The findings, interpretations, and conclusions expressed in this paper are entirely those of the authors. They do not necessarily represent the views of the International Bank for Reconstruction and Development/World Bank and its affiliated organizations, or those of the Executive Directors of the World Bank or the governments they represent. Produced by the Research Support Team How Much Does Physical Infrastructure Contribute to Economic Growth? An Empirical Analysis1 Govinda Timilsina, David I. Stern, and Debasish K. Das2 JEL Codes: O18, O47 1 The authors would like to thank Gal Hochman, Pravakar Sahoo, and Fan Zhang for their valuable comments and suggestions. The views and interpretations are of authors and should not be attributed to the World Bank Group and the organizations they are affiliated with. We acknowledge World Bank’s Research Support Grant (RSB) for financial support. 2 Govinda Timilsina (gtimilsina@worldbank.org) is a senior economist at the Development Research Group, World Bank, Washington, DC; David Stern (david.stern@anu.edu.au) is a professor in the Arndt-Corden Department of Economics in the Crawford School of Public Policy at the Australian National University, Canberra, Australia. Debasish Das (debasish.das@anu.edu.au) is an assistant professor, Department of Economics, Khulna University, Bangladesh and a Ph.D. candidate at the Arndt-Corden Department of Economics. 2 1. Introduction Policy makers, particularly in developing countries, often face a critical question: how much they should invest in physical infrastructure from their scarce financial resources, both resources generated domestically and provided by foreign sources such as bilateral and multilateral donors and direct foreign investment. This question is also important for international development institutions such as the World Bank Group and regional development banks that wish to maximize their impact on economic development and social welfare in recipient economies. Timilsina et al. (2020) review a large body of literature and conclude that no consensus exists on the impacts of infrastructure investment on economic growth. Some existing studies show a strong positive relationship between infrastructure development and economic growth, whereas others find a mildly positive relationship or no relationship. Many factors are responsible for these varying results, such as differences in methods, differing approaches to measuring infrastructure development, the varying development stages of countries included in the sample, varying time periods, and geographical factors such as high or low population density (Timilsina et al. 2020; Elburz et al. 2017 3). To further illuminate the role of infrastructure in economic growth and development, this study uses a dynamic panel data model to evaluate the contributions of three main categories of infrastructure: transport, electricity, and telecommunications, to growth in a panel of 87 countries over the period 1992 to 2017. Our main estimate uses the pooled mean group estimator (Pesaran et al., 1999) to estimate the effects of these and other inputs on growth. The best previous research on this topic (see Burke et al., 2018), such as Caldéron et al. (2015), is based on data that ends in 2000 and does not include new types of infrastructure such as mobile phones and internet connections. Our study makes four main contributions. First, we expand the types of infrastructure considered to include these new categories, specifically mobile phones. Second, we update the analysis to include the most recent available data (2017). Third, we provide separate estimates for developed and developing countries. Finally, we estimate short- and long-run elasticities of GDP with respect to infrastructure. Identifying the effect of infrastructure on economic growth and development is challenging because it is likely to be built in the expectation that there will be demand for it, creating a reverse causality challenge (Cook, 2011). There are also likely to be lags of varying 3 Elburz et al. (2017) conducted a meta-analysis of 42 empirical studies published between 1995 and 2014 highlighting the sensitivity of results to various factors. 3 lengths before the full extent of economic benefits is received from infrastructure. Identifying the effects of specific types of infrastructure will also be challenging because provision of electricity, transport, and telecommunications infrastructures are likely to be positively correlated. For example, World Bank (2018) finds that the impact of improved access to electricity in rural areas is positively related to road access. One way to address this issue, that has been used in some previous research (e.g. Calderon et al., 2015) is to use a dynamic model, which allows us to test the effect of past changes in infrastructure on growth, while testing for the weak exogeneity of infrastructure inputs. Although several studies show a strong positive relationship between infrastructure and economic growth in less developed countries deprived of adequate infrastructure (Calderón and Servén, 2010; Kodongo and Ojah, 2016; Chakamera and Alagidede, 2018), whether this finding holds for industrialized economies remains an open question. Is there a threshold level of economic development (measured in terms of per capita GDP or human development indicator) below which the relationship between the infrastructure and economic growth is stronger, whereas the relationship is weak or absent above the threshold? An investigation of this relationship by separating the countries into different groups by income, using the World Bank classification of countries into high-income, middle-income, and low-income countries, can help answer this question. Therefore, we also provide separate estimates for a group of low- and middle-income countries and a panel of high-income economies. Our study finds larger effects of infrastructure on economic output than found by the previous best studies. We also find that the effects of infrastructure are a higher after 1991 than before that date. The infrastructure has larger effects in developing economies as compared to those in industrialized economies. The paper is organized as follows. Section 2 presents a review of previous research on the relationship between infrastructure and economic growth. The next two sections introduce the data and econometric methods used. Section 5 presents and interprets the results. Section 6 draws key conclusions and policy insights. 2. Previous Research Several early studies (e.g., Aschauer, 1989; Munnell, 1990; Duffy-Deno and Randall 1991; Garcia-Milla 1992; Rives and Micheal 1995; Wylie, 1996; Morrison and Schwartz, 1996.) investigate the role of public infrastructure in economic growth. For example, using state-level data in the United States, Munnell (1990) finds that public capital has a significant, 4 positive impact on output. Similarly, using data of 48 U.S. states from 1969 to 1983, Garcia- Milla et al. (1992) find a positive relationship between public infrastructure on education and highways and gross state products. Studies such as Rives and Heaney (1995) show that the impacts of infrastructure are higher in the local economy as compared to in the national economy. Wylie (1996) reports higher output elasticities of infrastructure investment in Canada than in the United States. These studies, however, suffer from methodological limitations, such endogeneity between the public capital stock and economic performance, common trends inducing spurious correlation, reverse causality where the causation also runs from the economic measures to public infrastructure investment, measurement errors, and data availability. Some of these problems have been addressed in later research. However, later studies report that the relationship between infrastructure investment and economic growth is either very weak or absent at least in the United States. Using a meta-analysis, Elburz et al. (2017) conclude that there exists no relationship between infrastructure investment and economic output. Timilsina et al. (2020) find that existing studies do not agree on the relationship between infrastructure and economic growth. The early studies mentioned above focused on industrialized economies, particularly those in North America. The subsequent literature, however, extends the research frontier to cover both industrialized and developing countries. Many of these studies use the stocks of physical infrastructures (electricity generation capacity, roads, landline telephone system) instead of public expenditure on infrastructure. Calderón and Servén (2010) use indices infrastructure quantity and quality index to aggregate heterogeneous infrastructure assets. The dependent variable is non-overlapping five-year average GDP growth rates for the 1960-2005 period. They applied the system GMM developed by Arellano and Bond (1991) and Arellano and Bover (1995) to a dynamic panel to address the reverse causality problem, employing both internal instrumental variables and external instruments (demographic variables). They find positive and significant effects on economic growth for both variables and argue that infrastructure development raised the growth rate globally by 1.6 percentage points per annum in 2001-2005 relative to 1991-1995. Following Calderón and Servén’s approach, Kodongo and Ojah (2016) and Chakamera and Alagidede (2018) both also used system GMM to estimate the relationship between changes in an infrastructure index and economic growth in Sub- Saharan Africa. Chakamera and Alagidede (2018) also find Granger causality from infrastructure to growth but not the reverse. It should be noted that difference and system generalized method of moments (GMM), techniques may suffer from problems caused by weak internal instruments (Bun and Windmeijer, 2010; Bazzi and Clemens, 2013). 5 Using a synthetic infrastructure index composed using principal component analysis from electricity generation capacity, total road length, and the number of fixed telephone lines, Calderón et al. (2015) conducted a dynamic panel analysis for a sample of 88 countries over the period 1960–2000. They use the pooled mean group estimator (Pesaran et al., 1999) to estimate the model focusing on a long-run production function relationship between infrastructure variables, other manufactured capital, human capital, and GDP. They found a positive and significant long-run effect of infrastructure on GDP with an elasticity of 0.07 to 0.1 depending on specification. The authors could not reject the null that infrastructure is weakly exogenous, which helps to assuage reverse causality concerns. Thus, their study provides macro-level evidence that infrastructure capital can deliver economic dividends. They find no evidence that the effects of infrastructure on GDP vary across countries at different development levels. Some studies focus their analysis at the regional level, such as Sub-Saharan Africa (Estache, et al. 2005; Nketiah-Amponsah, 2009; Kodongo and Ojah, 2016; and Chakamera, and Alagidede, 2018). Reviewing the literature focused on Sub-Saharan Africa, Ajakaiye and Ncube (2010) report that most studies conducted for this region show a strong relationship between infrastructure investment and economic growth. Several studies have been conducted at the country level. These include Lewis (1998) for Ghana, Mostert and Heerden (2015) for South Africa, Lall (1999), Dash and Sahoo (2010), Sahoo and Dash (2009), Srinivasu and Rao (2013), and Roy et al. (2014) for India, and Demurger (2001), Fan and Zhang (2004), Sahoo and Dash (2012), and Shi et al. (2017) for China. While the studies for African countries converge on the finding that infrastructure development has a strong positive effect on economic growth and reducing income inequality poverty, findings for Asian countries vary across studies. While Lall (1999), Roy et al. (2014), and Shi et al. (2017) find ambiguous or heterogeneous links between physical infrastructure investment and regional economic growth, the other studies find a positive linkage between infrastructure investment and economic development. A few studies have been conducted for countries in South America, such as German-Soto and Bustillos (2014) for Mexico and Urrunaga and Aparicio (2012) for Peru. These studies find that the regional disparity in economic development within a country can be explained through the level of infrastructure investment in its regions. There are also many micro-level studies of the effects of increasing access to electricity or other forms of infrastructure. While these micro studies typically suggest positive impacts of electrification on income and other development outcomes, more recent quasi-experimental approaches such as randomized controlled trials typically find a smaller impact for 6 electrification than earlier studies did (Lee et al., 2020). This could be a result of a limited time window for assessing the outcomes of interventions as it takes time to invest in the complementary inputs that are needed to increase income. 3. Data 3.1. Overview We estimate the impact of infrastructure stocks on economic growth using a large panel data set. Following Calderon et al. (2015), we estimate an infrastructure augmented aggregate output function using the PMG estimator. Aggregate output (i.e. real GDP) is a function of physical capital, human capital, and infrastructure variables. Initially, we assembled a panel data set for 189 countries over the period 1992 to 2017. Due to missing data and in order to ensure a balanced panel for the PMG estimation, the sample was reduced to 87 countries. Sample countries are listed in the Appendix. We also carry out the analysis for developing and developed countries separately. There are 48 low- and middle-income countries, as classified by the World Bank, 4 in the sample. Data sources and processing are described in the Appendix. 3.2. Principal Components To reduce the dimension of the data for the PMG estimation we follow the approach of Sanchez-Robles (1998) and Caldéron et al. (2015) by constructing composite infrastructure indices using principal components analysis. Principal component analysis derives a set of new orthogonal latent variables that are linear functions of the original variables so that the first principal component explains the maximal amount of the variance in the original variables. The second principal component then explains the maximal amount of the remaining variance and so forth. We use the five variables for which we have time series for 1992 to 2017 for the 87 countries. They are the natural logarithms of the following variables per 1,000 workers: the total length of road networks in kilometers, the total length of railways in kilometers, electric power generation capacity in megawatts, number of mobile phone subscribers, and the number of land-line telephone subscribers. We removed country means from each of these variables so that we only use the within variation. By only using variation within countries, we avoid the issue of different definitions of roads in different countries. Then we standardized the variables 4 https://datahelpdesk.worldbank.org/knowledgebase/articles/906519-world-bank-country-and-lending-groups 7 by dividing them by their standard errors. We computed the eigenvalues and eigenvectors of the correlation matrix, which are presented in the following tables. Table 1. Principal Components Analysis a. Eigenvalues and variance explained by the five principal components Eigenvalues 1.9388 1.1058 0.8468 0.6761 0.4325 % Variance Explained 38.8% 22.1% 16.9% 13.5% 8.6% b. Eigenvectors of five principal components and standard deviation of variables Variables Eigenvectors Standard Deviation 1st 2nd 3rd 4th 5th Roads 0.1002 0.7936 0.5471 -0.1334 0.2074 0.2358 Railways -0.2582 0.5647 -0.7789 0.0648 0.0602 0.5301 Electricity 0.5056 0.0144 -0.2389 -0.8072 -0.1887 0.2391 Mobile phones 0.5938 -0.0946 -0.1877 0.2542 0.7339 3.1440 Telephones 0.5613 0.2053 -0.0423 0.5117 -0.6158 0.3428 Table 1a gives the eigenvalues of the five orthogonal principal components and the percentage of the variance of the original infrastructure variables each explains. We select the first two principal components as they have eigenvalues greater than one. Together, they explain 61% of the variation. This share is less than in some existing studies, such as Caldéron et al. (2015), because we took out country fixed effects. The eigenvectors in Table 1b give the coefficients, which will be used to derive the respective principal components as functions of the original variables, as explained below. The first principal component loads strongly on electricity, mobile phones, and land-line telephones, hence we interpret it as a telecommunications and electrification factor. The second principal component loads strongly on roads and railways and so we interpret it as a transport factor. To aid the interpretability of our econometric model we normalize the principal components so that the coefficients of the non-standardized infrastructure variables in each principal component sum to one. The two principal components are given by: 5 � � �, =� , = 1,2 (1) 5 , =1 ∑=1 � � where the are the coefficients of the eigenvectors in Table 1 and the are the standard deviations of the original infrastructure variables, , , expressed in logs of quantities per 8 worker with country means subtracted. The coefficients of each of these variables in the principal components are, therefore: Table 2. Coefficients of Principal Components 1st PC 2nd PC Roads 0.1095 0.6652 Railways -0.1256 0.2105 Electricity 0.5452 0.0119 Mobile phones 0.0487 -0.0060 Telephones 0.4222 0.1180 3.3. Summary Statistics Table 3 presents summary statistics for the variables in their original units at the beginning and end of our sample period, which also illustrates the relative growth in the variables over the period. All variables enter our models in logarithmic form and the panel regression models also include country fixed effects or individual country intercepts, which together reduce the variation in magnitude and variance seen in Table 3. Focusing on infrastructure, with the exception of mobile phones, all forms of infrastructure roughly kept pace with the increase in the number of workers. Mobile phones increased immensely and must, therefore, largely drive the increase in the first of the two infrastructure principal components. By 2017, even the poorest countries had plenty of basic mobile phones with the minimum being 1.074 per worker in Niger or about one for every three people. Electricity generation capacity per worker also increased almost everywhere and on average by 33% from 1992 to 2017. In 2017, Niger was also the country with the least generation capacity per worker at only 36 watts. The variation across countries in most types of infrastructure is similar to the variation in GDP per worker and physical capital. The mean and standard deviations are roughly equal for each of these. In 2017 though, the standard deviation of mobile phones per worker is much lower than the mean. Figure 1 plots the relationship between real GDP per worker with different infrastructure indicators in the 87-country panel. It indicates that across countries higher infrastructure per worker is associated with higher real GDP per worker. This relationship is particularly strong for electricity generation capacity (r = 0.77) and the telecommunications variables (r = 0.52 and r = 0.67 for mobile and fixed line telephones, respectively). 9 Table 3. Summary Statistics 1992 2017 Variable Mean SD Min Max Mean SD Min Max Units Real GDP 42,112 37,690 1,031 190,070 57,951 41,742 2,975 194,314 2017 US dollars Physical capital 210,129 192,972 1,192 757,195 281,972 234,701 8,976 988,081 2017 US dollars Human capital 2.308 0.656 1.037 3.476 2.807 0.676 1.211 3.974 PWT 10 index Electricity 2.259 2.358 0.020 13.335 2.999 2.716 0.036 14.665 Kilowatts Roads 0.021 0.026 0.001 0.166 0.019 0.021 0.001 0.090 Kilometres Railways 0.001 0.001 0.000 0.006 0.001 0.001 0.000 0.004 Kilometres Number of Mobile phones 0.019 0.034 0.000 0.177 2.733 0.690 1.074 4.702 subscriptions Telephones 0.448 0.419 0.003 1.275 0.449 0.348 0.003 1.399 Number of lines PC1 -0.617 0.425 -2.034 0.835 0.203 0.324 -0.630 1.079 Log index PC2 0.023 0.268 -0.949 0.573 -0.104 0.258 -1.130 0.456 Log index Note: All variables are expressed in per worker terms. 10 Figure 1. Real GDP per worker and various infrastructure variables, 1992-2017 country means 11 4. Econometric Methods 4.1 Background Economic growth is an economy-wide, dynamic process with effects that cannot be fully captured by micro studies, and, therefore, macroeconomic analysis is important. It is likely to be easier to find evidence for causal effects using disaggregated micro-level data rather than using macro-level data (Lee et al. 2020; Asher and Novosad, 2018). This is because some variables may more easily be considered exogenous at the micro-level, and randomized trials and other field experiments are possible. But it does not seem very plausible that we can find exogenous factors that are correlated with infrastructure development across many countries but do not have direct effects on the economy to use as instrumental variables, though natural experiments might be available for specific countries. So, identifying exogenous shocks using time series or panel data methods is probably the only viable approach. Ideally, we would identify exogenous shocks to infrastructure using a structural panel vector autoregression (Pedroni, 2013). There are three main challenges to using such an approach. First, both common and country-specific shocks can be formulated. With a large cross-sectional dimension, a very large number of parameters and impulse response functions will be estimated. If we are interested in the typical effects of infrastructure on development, this information will need to be summarized in some way. The best way to summarize the uncertainty in these summary measures is not clear. Second, the VAR approach requires us to model each of the dependent variables when we are only really interested in the GDP equation. Third, we need to formulate identifying restrictions. Methods to empirically identify a panel SVAR using independent component analysis (see Maxand, 2020) have not yet been developed. Therefore, potentially questionable theoretical restrictions must be used. The current study can be seen as a step towards the long-term goal of developing such a model. Therefore, our estimation follows Calderon et al. (2015) in using a dynamic estimator, which allows us to test for the weak exogeneity of the infrastructure variables. Our contribution is to extend the study beyond 2000 and to investigate the importance of new infrastructure types. Calderon et al. (2015) resorted to using the first principal component of three infrastructure measures and Calderón and Servén (2010) also used the first principal component of infrastructure quality variables. In this paper, we use two such composite infrastructure variables, but we also present the results for the disaggregated variables. 12 4.2. Econometric Techniques Our main analysis uses an aggregate production function as the long-run relationship embedded in an ARDL model. We impose constant returns to scale: = � ( )1−−∑ , > 0, > 0 (1) =1 where is real GDP in the th country at time , is total factor productivity, is physical capital, is human capital per worker, and L is labor. The n are different types of infrastructure capital. As infrastructure is part of the overall capital stock, K, the coefficient vector will reflect the effect of allocating some of the capital stock to infrastructure rather than other forms of capital such as private structures and equipment. This model implies that though GDP is defined as the income of capital and labor, if capital and labor both increase by the same percentage, but infrastructure remains constant, then, if the are greater than zero, GDP increases by less than that percentage so that there are decreasing returns. Taking logs, subtracting the log of aggregate human capital from both sides, and introducing an error term, , which is likely to be serially correlated, we have: − ℎ = + ( − ℎ ) + � �, − ℎ � + (2) =1 where is ln , y is ln(/), k is ln( /), h is lnH and the are ln� /�. As described above, we use the first two principal components of the infrastructure variables in our estimation. The resulting model is: − ℎ = + ( − ℎ ) + �1 � �1, − ℎ � + �2 � �2, − ℎ � + (3) �1, and Where �2, are the first two principal components. Then the elasticity of GDP with respect to form j of infrastructure is: = �1 1 + �2 2 (4) �1 and where �2 are the two regression parameters of the principal components in (3) and 1 and 2 are coefficients of the respective infrastructure types in the first and second columns of Table 2. Our main objective is to examine the long-run relationship between infrastructure and economic growth. Pesaran (2006) pointed out that three specification issues need to be addressed when estimating the long-run parameters of (3). First, the long run relationship (3) 13 should cointegrate and all the variables should be unit-root in levels and stationary in first differences. Hence, we perform both the IPS (Im et al. 2003) and cross-sectionally augmented panel unit-root (CIPS) tests (Pesaran, 2007) to find the order of integration of variables considered in the model. We use the Kao (1999), Pedroni (2004), and Westerlund (2005) cointegration tests. The first two tests are based on modified Dickey-Fuller and modified Phillips-Perron test statistics and Westerlund is based on variance ratio for the residuals. The alternative hypothesis for each test is that the time series in each and every country are cointegrated. In order to treat the estimated parameters as the causal effect of infrastructure on growth, we need to test for the weak exogeneity of the input variables. We do this using the test proposed by Johansen (1992) as laid out in Calderon et al. (2015). This estimates a VAR in first differences of the input variables, adding first differences of output and the error correction terms from the PMG model. If the error correction terms are jointly insignificant, then the input variables are weakly exogenous. Second, there is the issue of cross-sectional dependence. If cross-sectional dependence in the residuals is not modeled this can seriously affect inference and even result in bias in the estimates of the regression coefficients (Söderbom et al., 2014). To deal with cross-sectional dependence, we removed the cross-sectional mean from the data before performing the unit- root tests but we also employ the Pesaran (2021) CD test to check the cross-sectional independence of the residuals. Third, as countries vary regarding their income level, resources, geographic locations etc., we should take cross-country parameter heterogeneity into consideration when estimating Equation (3) (Pesaran et al., 1999). We use the Pesaran et al. (1999) Pooled Mean Group (PMG) estimator that is embedded in an autoregressive distributed lag (ARDL) framework and models heterogenous short-run dynamics: −1 −1 �−1 − −1 ′) + � Δ � = ( Δ �− + � − ′ + + + (5) =1 =0 Where � = − ℎ , ′ = [ − ℎ , �2, − ℎ ], is a country fixed effect, �1, − ℎ , is a time effect common across countries, and is an independent and identically distributed random error term. The first term on the RHS models adjustment towards the long-run equilibrium, while the second and third terms model short-run dynamics. We used the replication file provided by RATS for Pesaran et al. (1999) to develop our code using static fixed effects as the initial coefficient vector. 14 We also provide static fixed effects, dynamic fixed effects (DFE), and mean-group (MG) estimates. These estimation techniques consider different homogeneity restrictions on the coefficients. The mean-group estimation permits the coefficients to differ in both the short- run dynamics and long-run equilibrium relationship, whereas the dynamic fixed-effects model imposes homogeneity of all parameters. PMG allows short-run dynamics but not the long-run relationship to vary across countries. All these estimates control to a certain degree for confounding variables that vary across countries. We also estimate the model separately for developing and developed countries. To test whether the effect of infrastructure has changed over time we also estimated the model for a 1970-1991 sample. Finally, to examine the robustness of our results, we estimate the model using the World Bank PPP GDP series. 5. Results and Discussion 5.1. Panel Unit Root and Cointegration Tests Table 4 presents the result of IPS and Pesaran (2007) CIPS tests. The number of mobile phones is trend stationary according to both the IPS and Pesaran CIPS test. Both tests suggest that all other individual variables contain a unit root and become stationary after differencing. Therefore, we find that all variables except mobile phones are integrated of order one. According to the IPS test the second principal component is trend stationary. But the CIPS test finds that it is only stationary in levels. Table 5 presents the results of the cointegration tests. As mentioned above, the alternative hypothesis in each case is that the time series cointegrate in every country. All three test statistics reject the null hypothesis of no cointegration. 15 Table 4. Panel unit root tests Variable IPS (2003) Pesaran CIPS (2007) Level First difference Level First difference GDP 2.928 -7.035*** -2.479 -4.334*** Physical capital 2.130 -5.008*** -2.575 -3.357*** Human capital 2.541 -4.394*** -1.622 -3.531*** Electricity generation -0.390 -7.320*** -2.358 -4.375*** Roads 0.238 -7.322*** -2.051 -4.029*** Railways 0.371 -6.136*** -1.482 -3.126*** Mobile phones -7.835*** -16.863*** -3.973*** -4.690*** Telephones 4.359 -3.055*** -1.797 -3.870*** 1st PC -1.457 -7.734*** -2.378 -4.214*** 2 PC nd -2.925*** -9.609*** -2.365 -3.915*** Note: All variables are expressed in logs (or IHS) and per worker. For the IPS (2003) and Pesaran (2007) CIPS test we include a trend and two lags. The null hypothesis of both tests is the existence of unit roots. *** significant at 1%. Table 5. Panel cointegration tests Test Test statistic Kao (1999) Modified Dickey -Fuller 2.929*** Pedroni (2004) Modified Phillips-Perron 3.737*** Westerlund (2005) Variance ratio 3.251*** Note: *** indicates the 1% and ** the 5% significance levels. Kao and Pedroni tests use two lags and the Westerlund test includes a trend and cross-sectionally demeans the data. 5.2. Coefficient Estimates We consider maximum values for p and q in (5) of three (two lags of the first differences). We use the Akaike Information Criterion based on the likelihood function in Pesaran (1999) to select the optimal lag length. We do not choose different lag lengths for each country. Effectively, we are choosing a maximum lag length for all countries. Excess lag coefficients can be estimated as zero. Table 6 shows that the optimal lag length is two lags of both the dependent variable and the independent variables (one lagged first differences). The parameter estimates are quite similar for all models with ≥ 2. The infrastructure elasticities are smaller for models with = 1. 16 Table 6. Lag Length Selection Using Akaike Information Criterion p=1 p=2 p=3 q=1 -9444 -9425 -9332 q=2 -9516 -9524 -9474 q=3 -9404 -9402 -9464 Table 7. Pooled Mean Group and Alternative Specifications Static fixed Dynamic Mean group Pooled mean group effects fixed (MG) (PMG) (FE) effects (DFE) Physical capital 0.542*** 0.259*** 0.422*** 0.313*** (0.015) (0.093) (0.0549) (0.018) 1st PC 0.030*** 0.260*** 0.021 0.199*** (0.011) (0.067) (0.031) (0.011) 2nd PC 0.085*** 0.021 0.136** 0.104*** (0.012) (0.071 (0.059) (0.012) Human capital 0.342*** 0.460*** 0.421*** 0.383*** (0.016) (0.091) (0.054) (0.021) Adjustment Coefficient -0.050*** -0.419*** -0.126*** (0.006) (0.021) (0.018) CD test p-value 0.000 0.037 0.810 0.121 Observations 2,088 2,088 2,088 2,088 Notes: The table reports the long-run elasticities for each of the inputs and the adjustment coefficient . For MG and PMG the adjustment coefficient is the mean across all countries and for MG the elasticities are also sample means. All estimations use individual and annual fixed effects, the latter by cross-sectionally demeaning the data. All dynamic models use one lag of first differences. The CD test null hypothesis indicates cross-section independence. Standard errors are in parentheses. ***, ** and * significant at the 1%, 5% and 10% level respectively. 17 Table 7 reports static fixed effects (FE), dynamic fixed effects (DFE), mean group (MG), and pooled mean group (PMG) estimates of the long-run parameters and the rate of adjustment coefficient. We chose the same lag length for the DFE and MG estimates as for the PMG estimates. The most flexible model is the mean group estimator and the least static fixed effects. The elasticity of human capital is most similar across the four estimators, while the elasticity of physical capital varies more. The elasticities of the infrastructure variables vary the most. Both are positive and highly significant for the FE and PMG estimates. The PMG estimates are larger. However, for DFE and MG only one of the elasticities is statistically significant – the first for DFE and the second for MG. The magnitude of the DFE elasticities is somewhat similar to the PMG estimates. In general, the elasticities are very large compared to previous estimates such as Calderon et al. (2015). The relatively low value of the adjustment coefficient for PMG and DFE shows that GDP responds slowly to changes in infrastructure as we would expect. Focusing on the PMG estimates, the first principal component, which we interpret as mainly an electricity and telecommunications factor has double the effect on GDP than the second principal component, which we interpret as mainly a transport factor. Together they have about the same effect on GDP as physical capital in general. As described in (4), we can also compute long-run elasticities of GDP with respect to the individual types of infrastructure. Table 8 reports these for the FE and PMG estimates. Roads, railways, and telephones have similar large elasticities and mobile phones and railways small elasticities. Given the restrictions imposed by the principal components analysis the relative sizes of the effects of each type of infrastructure are similar across the estimators but all are smaller for the FE estimate. Table 8. Elasticities of Infrastructure Variables FE PMG Roads 0.060 0.091 Railways 0.014 -0.003 Electricity 0.017 0.110 Mobile phones 0.001 0.009 Telephones 0.023 0.096 Among the telecommunications infrastructure types, we estimate a larger elasticity for fixed line phones than for mobile phones. However, mobile phones have grown in number 18 much more rapidly and so the small elasticity does not reflect a necessarily small contribution to growth. While the mean number of telephone lines per worker was unchanged from 1992 to 2017, the mean number of mobile phones increased by a factor of 144 or in natural logarithms almost 5 (Table 3). Microeconomic research has identified positive impacts of income from mobile and smart phones (Hübler and Hartje, 2016). Though we have assumed that elasticities are constant, in reality they likely depend on the level of the inputs. Table 7 includes the Pesaran (2021) CD test. This shows that the residuals are cross- sectionally independent for the MG and PMG estimates if we use a 5% significance level. We also carry out the test for weak exogeneity. The chi-square test statistic is 9.84 with three degrees of freedom and, therefore, a p-value of 0.02. Hence, we cannot reject the null hypothesis of weak exogeneity at the 1% level, and we can interpret the PMG elasticities as causal effects. As a robustness test, we also estimate the PMG model using GDP in 2017 PPP dollars from the World Bank Development Indicators (Table 10). In the latter data set some countries have missing data in the first few years. We used the growth rates of RGDPO from PWT 10 to fill in these missing observations. We impose the same lag length as in our main estimate. For the World Bank GDP data, the coefficient estimates are very similar to our main estimate. However, we can reject the weak exogeneity hypothesis for this estimate at the 5% level. 5.3. Development Status Table 10 also reports results for the PMG estimator separately for developing and developed countries. Tables 9a and 9b report the AIC for alternative lag lengths for these sub- samples. As a result, we chose p=1, q=3 for the developing country sample and p=3, q=1 for the developed country sample. The infrastructure coefficients are smaller in the developed country sample than in the developing country sample. Additionally, we can reject weak exogeneity of the inputs in the developing country sample. This seems to be the result of choosing the shortest lag length for the dependent variable for this sample as we see the same phenomenon for the 1970-91 sample. Table 9a. Lag Length Selection Using Akaike Information Criterion – Developing Countries p=1 p=2 p=3 q=1 -3615 -3407 -4734 q=2 -3533 -3402 -4835 q=3 -4847 -4761 -4714 19 Table 9b. Lag Length Selection Using Akaike Information Criterion – Developed Countries p=1 p=2 p=3 q=1 -4403 -4425 -4719 q=2 -4351 -4353 -4679 q=3 -4663 -4672 -4690 Table 9c. Lag Length Selection Using Akaike Information Criterion – 1970-91 p=1 p=2 p=3 q=1 -4496 -4374 -4441 q=2 -4337 -4279 -4277 q=3 -4273 -4352 -4388 Table 10. Alternative Data, Development Status, and Earlier Decades World Bank Data Developing Developed 1970-91 Countries Countries Physical capital 0.313*** 0.574*** 0.296*** 0.500*** (0.018) (0.023) (0.031) (0.015) 1st PC 0.199*** 0.201*** 0.136*** 0.205*** (0.011) (0.010) (0.014) (0.018) 2nd PC 0.104*** 0.109*** 0.093*** 0.017*** (0.012) (0.015) (0.013) (0.006) Human capital 0.383*** 0.115*** 0.474*** 0.279*** (0.021) (0.025) (0.030) (0.014) Adj. Coefficient -0.126*** -0.096*** -0.127*** -0.104*** (0.018) (0.023) (0.031) (0.015) CD test p-value 0.380 0.214 0.040 0.000 Weak exogeneity 0.020 0.000 0.125 0.000 p-value p 2 1 3 1 q 2 3 1 1 Observations 1104 897 1580 20 Table 11 presents estimates of the elasticities of GDP with respect to the individual forms of infrastructure in the two development sub-samples. Based on this, the elasticities in the developing country sample are about the same size as in the full sample, while in developed economies they are somewhat smaller. Railways essentially have a zero effect in these split samples as they did in the full sample. Table 11. Elasticities of Infrastructure Variables Developing Developed 1970-91 Roads 0.095 0.077 0.085 Railways -0.002 0.002 0.016 Electricity 0.111 0.075 0.062 Mobile phones 0.009 0.006 0.008 Telephones 0.098 0.068 0.050 5.4. Earlier Decades Table 10 reports results using a sample for the years 1970 to 1991. 5 Data is available for 79 countries for this period (Table A2). Many countries were added as well as dropped. In total, 21 countries appear in this sample that do not appear in the later sample, 58 countries appear in both samples, and 29 do not appear in this earlier sample but are in the later sample. We recomputed the principal components, again selecting the first two principal components. Table 9c shows that the optimal lag lengths are p=q=1. The estimated parameters are somewhat similar to those for the developing country sample, with a large elasticity of physical capital compared to human capital in this sample. Comparing the coefficients of individual infrastructure variables in Table 11 to those for the post 1991 period in Table 8, we find that they are of similar magnitude but are, on the whole, a little smaller especially for electricity and phones than in the earlier decades. As we mentioned above, our main estimates find a much larger effect for infrastructure on growth than the best previous studies. This could partly be explained by an increase in the size of the effect in the three recent decades compared to the previous two. 5 Examining even shorter periods, such as after the 2008-09 global financial crisis, would result in too short a time series in each country for reliable estimation. 21 5.5. Long-Run versus Short-Run Effects As we have estimated a dynamic panel model, we can compute both short-run and long-run elasticities. We recover the average short-run parameters across the sample by estimating a mean-group model imposing the cointegrating vector estimated by PMG. Figure 2 reports the impulse response curves of GDP to changes in the two principal components for our main estimates: Figure 2. Response of GDP to increase in infrastructure Figure 2 shows that in the full sample, not only does transport infrastructure, which is mainly associated with the second principal component, have a smaller long-run effect than electricity and telecommunications, in the short run its effect is negative. However, even for electricity and telecommunications, which are mainly associated with the first principal component, the short-run effect is only about a quarter of the final long-run effect. This suggests that studies that find small effects of infrastructure provision (e.g. Lee et al., 2020) need to adopt a much longer-run time frame for program evaluation. Similarly, we see that the static fixed effects estimates, which tend to converge to short-run effects (Stern, 2010) are much smaller than our long-run estimates. 5.6. Rate of Economic Growth We also estimated cross-section growth models of the following form: 22 − −5 = 0 + ′ −5 + (6) where −5 is a vector of the initial values of a set of variables including infrastructure variables and initial GDP per-capita. The latter variable controls for many unobserved determinants of growth. These regressions allow us to use a wider range of infrastructure variables that are not available as extensive time series, such as airports. On the whole, the coefficients of the infrastructure variables were statistically insignificant in these regressions, and we do not report them in this paper. This implies that higher levels of infrastructure are not associated with more rapid economic growth. 6. Conclusions and Policy Implications We extended previous research on the role of infrastructure in economic growth to recent decades (1992-2017) and to also including new types of infrastructure, such as mobile telephones. Our study shows two major insights. First, we find larger effects of infrastructure on growth than the previous best studies found. Second, we find that infrastructure has greater impact in developing economies than in developed economies. Both of these findings can be explained through the use of recent data sets covering developing countries where lack of infrastructure is a major development bottleneck. These findings suggest that access to infrastructure plays an important role in unraveling the barriers created by lack of infrastructure to economic growth. The higher impact in the more recent period can also be attributed to the rapid expansion telephone services (through fixed line or mobile phone services). Inputs that are limiting factors or “binding constraints” (McCulloch and Zileviciute, 2017; Burke et al., 2018) are likely to make more difference to output if their quantity increases. Infrastructure stocks do not seem to increase the rate of economic growth. The findings of the study imply important policy considerations. The higher impacts of telecommunication indicate that economic growth can be stimulated through better information that enhances the market access to products, potential productivity gains through increased substitution possibilities facilitated by information services or through better telecommunication access. Providing telecommunication services is cheaper than providing transportation and electricity services although there would be very little substitution between them. Another policy implication is that infrastructure whose unavailability has already created barriers to economic development has to be addressed first. In other words, an optimal investment in infrastructure is critical. Instead of providing ‘too much’ road infrastructure like 23 in the United States, an optimal road infrastructure where use of roads (services obtained from road infrastructure) instead of availability of roads (i.e., just increasing the stock of road infrastructures) can be maximized. We collected data on a wide range of other infrastructure variables, such as the number of airports or the percentage of roads that are paved, but these were very limited in time dimension or the number of countries for which data was available. Future research on the effects of infrastructure on growth would benefit from extending collection or estimation of these variables, particularly in the time dimension. References Ajakaiye, Olu, and Mthuli Ncube. 2010. "Infrastructure and Economic Development in Africa: An Overview." Journal of African Economies, 19(suppl_1), i3–12. Arellano, Manuel, and Stephen Bond. 1991. "Some Tests of Specification for Panel Data: Monte Carlo Evidence and an Application to Employment Equations." Review of Economic Studies, 58(2), 277–97. Arellano, Manuel, and Olympia Bover. 1995. "Another Look at the Instrumental Variable Estimation of Error-Components Models." Journal of Econometrics, 68(1), 29–51. Aschauer, David A. 1989. "Is Public Expenditure Productive?" Journal of Monetary Economics, 23(2), 177–200. Asher, Sam, and Paul Novosad. 2020. "Rural Roads and Local Economic Development." American Economic Review, 110(3), 797–823. Bazzi, Samuel, and Michael A. Clemens. 2013. "Blunt Instruments: Avoiding Common Pitfalls in Identifying the Causes of Economic Growth." American Economic Journal: Macroeconomics, 5(2), 152–86. Bun, Maurice J. G., and Frank Windmeijer. 2010. "The Weak Instrument Problem of the System GMM Estimator in Dynamic Panel Data Models." Econometrics Journal, 13(1), 95–126. Burke, Paul J., David I. Stern, and Stephan B. Bruns. 2018. "The Impact of Electricity on Economic Development: A Macroeconomic Perspective." International Review of Environmental and Resource Economics, 12(1), 85–127. Calderón, César, Enrique Moral‐Benitom, and Luis Servén. 2015. "Is Infrastructure Capital Productive? A Dynamic Heterogeneous Approach." Journal of Applied Econometrics, 30(2), 177–98. Calderón, César, and Luis Servén. 2010. "Infrastructure and Economic Development in Sub- Saharan Africa." Journal of African Economies, 19(suppl_1), i13–i87. 24 Chakamera, Chengete, and Paul Alagidede. 2018. "The Nexus between Infrastructure (Quantity and Quality) and Economic Growth in Sub Saharan Africa." International Review of Applied Economics, 32(5), 641–72. Cook, Paul. 2011. "Infrastructure, Rural Electrification and Development." Energy for Sustainable Development, 15(3), 304–13. Dash, Ranjan Ku, and Pravakar Sahoo. 2010. "Economic Growth in India: The Role of Physical and Social Infrastructure." Journal of Economic Policy Reform, 13(4), 373–85. Démurger, Sylvie. 2001. "Infrastructure Development and Economic Growth: An Explanation for Regional Disparities in China?" Journal of Comparative Economics, 29(1), 95–117. Duffy-Deno, Kevin T., and Randall W. Eberts. 1991. "Public Infrastructure and Regional Economic Development: A Simultaneous Equations Approach." Journal of Urban Economics, 30(3), 329–43. Elburz, Zeynep, Peter Nijkamp, and Eric Pels. 2017. "Public Infrastructure and Regional Growth: Lessons from Meta-Analysis." Journal of Transport Geography, 58, 1–8. Estache, Antonio, Biagio Speciale, and David Veredas. 2005. "How Much Does Infrastructure Matter to Growth in Sub-Saharan Africa." unpublished, World Bank (June 2005). Feenstra, Robert C., Robert Inklaar, and Marcel P. Timmer. 2015. "The Next Generation of the Penn World Table." American Economic Review, 105(10), 3150–82. Garcia-Mila, Teresa, and Therese J. McGuire. 1992. "The Contribution of Publicly Provided Inputs to States' Economies." Regional Science and Urban Economics, 22(2), 229–41. German-Soto, Vicente, and Héctor A. Barajas Bustillos. 2014. "The Nexus between Infrastructure Investment and Economic Growth in the Mexican Urban Areas." Modern Economy, 5(13), 1208–20. Hübler, Michael, and Rebecca Hartje. 2016. "Are Smartphones Smart for Economic Development?" Economics Letters, 141, 130–33. Inklaar, Robert, and Marcel P. Timmer. 2013. “Capital, labor, and TFP in PWT8.0”, Groningen Growth and Development Centre, University of Groningen Im, Kyung So, M. Hashem Pesaran, and Yongcheol Shin. 2003. "Testing for Unit Roots in Heterogeneous Panels." Journal of Econometrics, 115(1), 53–74. Johansen, Søren. 1992. "Cointegration in Partial Systems and the Efficiency of Single-Equation Analysis." Journal of Econometrics, 52(3), 389–402. Kao, Chihwa. 1999. "Spurious Regression and Residual-Based Tests for Cointegration in Panel Data." Journal of Econometrics, 90(1), 1–44. Kodongo, Odongo, and Kalu Ojah. 2016. "Does Infrastructure Really Explain Economic Growth in Sub-Saharan Africa?" Review of Development Finance, 6(2), 105–25. Lall, Chander M. 1999. "India Spruces up Its IP Regime and Its Enforcement." Journal of World Intellectual Property, 2(6), 961–68. 25 Lee, Kenneth, Edward Miguel, and Catherine Wolfram. 2020. "Does Household Electrification Supercharge Economic Development?" Journal of Economic Perspectives, 34(1), 122–44. Lewis, Blane D. 1998. "The Impact of Public Infrastructure on Municipal Economic Development: Empirical Results from Kenya." Review of Urban & Regional Development Studies, 10(2), 142–56. Maxand, Simone. 2020. "Identification of Independent Structural Shocks in the Presence of Multiple Gaussian Components." Econometrics and Statistics, 16, 55–68. McCulloch, Neil, and Dalia Zileviciute. "Is Electricity Supply a Binding Constraint to Economic Growth in Developing Countries?" EEG State-of-Knowledge Paper 1.3. Morrison, Catherine J., and Amy E. Schwartz. 1996. "State Infrastructure and Productive Performance." American Economic Review, 1095–111. Mostert, J. W., and J. H. Van Heerden. 2015. "A Computable General Equilibrium (CGE) Analysis of the Expenditure on Infrastructure in the Limpopo Economy in South Africa." International Advances in Economic Research, 21(2), 227–36. Munnell, A. H. 1990. "How Does Public Infrastructure Affect Regional Economic Performance?" New England Economic Review, September, 11–33. Nketiah-Amponsah, Edward. 2009. "Public Spending and Economic Growth: Evidence from Ghana (1970–2004)." Development Southern Africa, 26(3), 477–97. Pedroni, Peter. 2004. "Panel Cointegration: Asymptotic and Finite Sample Properties of Pooled Time Series Tests with an Application to the PPP Hypothesis." Econometric Theory, 597– 625. Pedroni, Peter. 2013. "Structural Panel Vars." Econometrics, 1(2), 180–206. Pesaran, M. Hashem. 2006. "Estimation and Inference in Large Heterogeneous Panels with a Multifactor Error Structure." Econometrica, 74(4), 967–1012. Pesaran, M. Hashem. 2021. "General Diagnostic Tests for Cross-Sectional Dependence in Panels." Empirical Economics, 60, 13–50. Pesaran, M. Hashem. 2007. "A Simple Panel Unit Root Test in the Presence of Cross‐Section Dependence." Journal of Applied Econometrics, 22(2), 265–312. Pesaran, M. Hashem, Yongcheol Shin, and Ron P. Smith. 1999. "Pooled Mean Group Estimation of Dynamic Heterogeneous Panels." Journal of the American Statistical Association, 94(446), 621–34. Rives, Janet M., and Michael T. Heaney. 1995. "Infrastructure and Local Economic Development." Journal of Regional Analysis and Policy, 25(1), 58–73. Roy, Bimal Chandra, Satyaki Sarkar, Nikhil R. Mandal, and Shashikant Pandey. 2014. "Impact of Infrastructure Availability on Level of Industrial Development in Jharkhand, India: A District Level Analysis." International Journal of Technological Learning, Innovation and Development, 7(2), 93–123. 26 Sahoo, Pravakar, and Ranjan K. Dash. 2012. "Economic Growth in South Asia: Role of Infrastructure." The Journal of International Trade & Economic Development, 21(2), 217–52. Sahoo, Pravakar, and Ranjan K. Dash. 2009. "Infrastructure Development and Economic Growth in India." Journal of the Asia Pacific Economy, 14(4), 351–65. Sanchez‐Robles, Blanca. 1998. "Infrastructure Investment and Growth: Some Empirical Evidence." Contemporary Economic Policy, 16(1), 98–108. Shenggen, Fan, and Xiaobo Zhang. 2004. "Infrastructure and Regional Economic Development in Rural China." China Economic Review, 15(2), 203–14. Shi, Yingying, Shen Guo, and Puyang Sun. 2017. "The Role of Infrastructure in China’s Regional Economic Growth." Journal of Asian Economics, 49, 26–41. Söderbom, Måns, Francis Teal, Markus Eberhardt, Simon Quinn, and Andrew Zeitlin. 2014. Empirical Development Economics. Routledge. Srinivasu, B., and P. Srinivasa Rao. 2013. "Infrastructure Development and Economic Growth: Prospects and Perspective." Journal of Business Management and Social Sciences Research, 2(1), 81–91. Stern, David I. 2010. "Between Estimates of the Emissions-Income Elasticity." Ecological Economics, 69(11), 2173–82. Timilsina, Govinda R., Gal Hochman, and Ze Song. 2020. "Infrastructure, Economic Growth, and Poverty: A Review." World Bank Policy Research Working Paper, (9258). Urrunaga, Roberto, and Carlos Aparicio. 2012. "Infrastructure and Economic Growth in Peru." Cepal Review, 107, 145–63. Westerlund, Joakim. 2005. "New Simple Tests for Panel Cointegration." Econometric Reviews, 24(3), 297–316. World Bank. 2018. “Access to Electricity in Sub-Saharan Africa.” Africa Pulse Report, 17. World Bank. Washington, DC. Wylie, Peter J. 1996. "Infrastructure and Canadian Economic Growth 1946-1991." Canadian Journal of Economics/Revue Canadienne d'Economique, 29, S350–55. 27 APPENDIX We extracted real GDP, capital stock, human capital, and employment data from Penn World Table 10 (Feenstra et al. 2015). We use the “rgdpna” series which measures real GDP in constant 2017 US PPP dollars, using the growth rates from countries’ national accounts. We use “rnna” capital stock series, which is also in constant 2017 US PPP dollars, using the growth rates from countries’ national accounts. We refer to this in the following as “physical capital” to distinguish it from human capital. To test the robustness of our analysis to data sources we also used World Bank data on PPP adjusted GDP (World Development Indicators). The human capital stock is measured by the “hc” series which is an index based on years of schooling and returns to education (Inklaar and Timmer, 2013). Lastly, we use the “emp” series which captures number of persons engaged in work force to measure labor input. We divide GDP and capital stock by labor and take natural logarithms of these variables. 3.2. Infrastructure Data For infrastructure, we assembled data for both quantity and quality of infrastructure variables. We prepared data in three broad categories: transport, telecommunications, and energy as follows. Transport Road quantity: We use two editions of World Road Statistics from the International Road Federations for the periods 1990-2007 and 2000-2017 to prepare the length of road network (in kilometers). We also received data from Cesar Caldéron that is derived from similar sources. We merged the data sets as follows: i. Where the values for 2000 in the two WRS datasets matched we just merged the two series. ii. Where they did not, but the Calderon data did match the 2000-2017 WRS data we used the Calderon data for years before 2000. iii. Where neither matched exactly we used the implied growth rates in either earlier data set to project back from 2000 to 1990. iv. We linearly interpolated missing values. v. Additionally, we deleted the data point for UAE in 2000 for being anomalously low. We removed the first 1 from the numbers for Botswana for 1992-4. We deleted the datapoint for Bulgaria 2005. 28 vi. Finally, 11 countries had missing data for 2017, in that case we extrapolated the missing 2017 data by keeping same value as stated in 2016. If there was no or little growth in the road network prior to 2016. There were six countries where that was possible: Belgium, Lao PDR, New Caledonia, Papua New Guinea, Thailand and Vietnam. Railways: The length of railways in km is obtained from the World Bank’s Development Indicators Database, Knoema (2020) database and complemented by data from national sources referenced in Wikipedia. For some countries, we drop apparently bad data and replaced it with data from official government statistics. For instance, for Australia we obtained data from the Bureau of Infrastructure and Transport Research (2020). We also linearly interpolated some missing or obviously incorrect datapoints. Telecommunications Telecommunications: Number of mobile phones subscribers and number of telephone lines data are taken from the International Telecommunication Union’s (2020) World Telecommunication /ICT Indicators database. Energy Electricity generation capacity data (in megawatts) extracted from the UN Energy Statistics Database (2020). All variables are converted to per worker values with the exception of human capital which is already in per worker form. All variables apart from mobile phones, and railways are converted to natural logarithms. As there are many 0 values for the number of mobile phones and the length of railways, we use the inverse hyperbolic sine (IHS) transformation for these variables given by � + √ 2 + 1� instead of logarithms. 29 Table A1: List of economies, main sample Argentina Costa Rica Indonesia Moldova Portugal Iran, Islamic Armenia Cyprus Rep. Mongolia Qatar Australia Denmark Ireland Morocco Russian Federation Austria Dominican Republic Israel Mozambique Saudi Arabia Bahrain Ecuador Jamaica Myanmar Senegal Bangladesh Egypt, Arab Rep. Japan Namibia Singapore Belgium Estonia Jordan Nepal Slovenia Bolivia Finland Kazakhstan Netherlands Spain Botswana France Kenya New Zealand Switzerland Brazil Gambia, The Korea, Rep. Nicaragua Tanzania Brunei Darussalam Germany Kuwait Niger Tunisia Bulgaria Greece Latvia Norway Turkey Burkina Faso Guatemala Lithuania Pakistan Ukraine Cameroon Honduras Luxembourg Panama United Kingdom Hong Kong SAR, Canada China Malaysia Paraguay United States Chile Hungary Mali Peru China Iceland Mauritius Philippines Colombia India Mexico Poland Note: High income countries (World Bank) are written in italics. 30 Table A2: List of economies, 1970-91 sample Egypt, Arab Algeria Rep. Jordan Paraguay Tanzania Argentina Ethiopia Kenya Peru Uganda Australia Finland Korea, Rep. Philippines United Kingdom Austria France Madagascar Portugal United States Belgium Gabon Malawi Romania Uruguay Bolivia Ghana Malaysia Rwanda Venezuela, RB Brazil Greece Mali Senegal Zimbabwe Burkina Faso Guatemala Mauritius Singapore Cameroon Honduras Mexico Spain Canada Iceland Morocco Sri Lanka Chile India Mozambique Sweden China Indonesia Netherlands Switzerland Iran, Islamic Syrian Arab Colombia Rep. New Zealand Republic Costa Rica Ireland Niger Thailand Côte d'Ivoire Israel Nigeria Trinidad and Tobago Denmark Italy Norway Tunisia Dominican Republic Jamaica Pakistan Turkey Ecuador Japan Panama South Africa