Policy Research Working Paper                       9222




Estimating Poverty among Refugee Populations
          A Cross-Survey Imputation Exercise for Chad

                                 Theresa Beltramo
                                 Hai-Anh H. Dang
                                  Ibrahima Sarr
                                   Paolo Verme




Fragility, Conflict and Violence Global Theme
  &
Development Data Group
April 2020
Policy Research Working Paper 9222


  Abstract
 Household consumption surveys do not typically cover                               administrative data that fall within a 95 percent margin of
 refugee populations, and poverty estimates for refugees                            poverty estimates based on survey consumption data. This
 are rare. This paper tests the performance of cross-survey                         result is robust to different poverty lines, sets of regressors,
 imputation methods to estimate poverty for a sample of                             and modeling assumptions of the error term. The method
 refugees in Chad, by combining United Nations High                                 outperforms common targeting methods, such as proxy
 Commissioner for Refugees survey and administrative data.                          means tests and the targeting method currently used by
 The proposed method offers poverty estimates based on                              humanitarian organizations in Chad.




 This paper is a product of the Fragility, Conflict and Violence Global Theme and the Development Data Group. It is part
 of a larger effort by the World Bank to provide open access to its research and make a contribution to development policy
 discussions around the world. Policy Research Working Papers are also posted on the Web at http://www.worldbank.org/
 prwp. The authors may be contacted at pverme@worldbank.org.




         The Policy Research Working Paper Series disseminates the findings of work in progress to encourage the exchange of ideas about development
         issues. An objective of the series is to get the findings out quickly, even if the presentations are less than fully polished. The papers carry the
         names of the authors and should be cited accordingly. The findings, interpretations, and conclusions expressed in this paper are entirely those
         of the authors. They do not necessarily represent the views of the International Bank for Reconstruction and Development/World Bank and
         its affiliated organizations, or those of the Executive Directors of the World Bank or the governments they represent.


                                                       Produced by the Research Support Team
                        Estimating Poverty among Refugee Populations:
                          A Cross-Survey Imputation Exercise for Chad


                Theresa Beltramo Hai-Anh H. Dang                      Ibrahima Sarr         Paolo Verme 1




JEL classifications: C15, F22, I32, O15, O20.


Keywords: Refugees; Forced displacement; Targeting; Poverty; Chad.



1 * Theresa Beltramo: United Nations High Commissioner for Refugees, Rue de Montbrilliant 92, Geneva 1201, Switzerland (e-

mail: beltramo@unhcr.org); Hai-Anh Dang: DECAT- Analytics and Tools Team-The World Bank, 1818 H St. N.W. Washington,
D.C. 20433 (e-mail: hdang@worldbank.org); Ibrahima Sarr: United Nations High Commissioner for Refugees (e-mail:
sarri@unhcr.org); Paolo Verme: Fragility, Conflict, Violence - The World Bank, 1818 H St. N.W. Washington, D.C. 20433 (e-
mail: pverme@worldbank.org). The authors are grateful to Andrew Dabalen, Giacomo De Giorgi, Kristen Himelein, Aly Sanoh,
and participants to the American Economic Associationâ€™s session on â€œComparing Poverty of Refugees and Their Hostsâ€ (San
Diego, January 5th, 2020) for helpful comments on previous versions. This work is part of the program ``Building the Evidence on
Protracted Forced Displacement: A Multi-Stakeholder Partnership''. The program is funded by UK aid from the United Kingdom's
Department for International Development (DFID), it is managed by the World Bank Group (WBG) and was established in
partnership with the United Nations High Commissioner for Refugees (UNHCR). The scope of the program is to expand the global
knowledge on forced displacement by funding quality research and disseminating results for the use of practitioners and policy
makers. We further thank DFID for additional funding through its Knowledge for Change (KCP) program. This work does not
necessarily reflect the views of DFID, the WBG or UNHCR.
     I. Introduction


UN General Assembly Sustainable Development Goal 1â€”End poverty in all its forms by 2030â€”
explicitly pledges that â€œno one will be left behind.â€ To achieve this goal, the availability of high-
quality household consumption surveys is essential, and it is equally important for these surveys
to be inclusive and cover marginal populations, such as refugees and internally displaced persons
(IDPs). Unfortunately, household consumption surveys rarely include forcibly displaced
populations, despite that these populations are among the most vulnerable and deprived. They lack
fundamental rights such as freedom of movement and the right to work, have eroded human and
physical capital, and face more frequent shocks than surrounding host communities do.


This is a significant and growing challenge, particularly in Sub-Saharan Africa. The global number
of forcibly displaced persons grew from 43.3 million in 2009 to 70.8 million in 2018. Among
them, there are 25.9 million refugees, 3.5 million asylum seekers, and 40.3 million IDPs. Almost
four out of five refugees live in a country neighboring their home country, and some 84% of them
live in developing countries. Sub-Saharan Africa hosts around one-third of the worldâ€™s refugee
population and witnessed an increase of 1.1 million refugees in 2017, which represents a 22%
increase from 2016. In 2018, Sub-Saharan African countries represented half of the 10 countries
with the highest refugee population relative to the national population and six of the 10 countries
with the highest numbers of IDPs. The region also has the highest poverty rates in the world, and
data for the region are typically scarce or of low quality. Therefore, measuring poverty among
displaced populations in Sub-Saharan Africa is a particularly important task that is severely
hampered by missing household consumption data. 2


There is an established literature in statistics that has developed methods to impute missing data
(see Little and Rubin (2002) for a review of these methods). Survey-to-survey imputation methods
have been widely employed in economics to estimate household welfare trends across time periods
in the context of repeated cross-section or panel data (e.g., Dang and Lanjouw 2018), across
geographical areas in the context of poverty mapping exercises (e.g., Elbers et al. 2003), or across

2Missing data issues are not a problem limited to displaced populations, but can emerge because of lack of survey data on a
particular topic of interest, population group, or time period. These issues can also be caused by sampling errors, incomplete data
due to unit or item nonresponse, data input errors, or post-survey data manipulations such as top-coding or censoring.


                                                                                                                                 2
types of surveys such as consumption and labor force surveys (e.g., Douidich et al. 2016).
Newhouse et al. (2014) and Dang et al. (2019) offer recent summaries of previous imputation
studies that highlight the main advantages, debate different approaches, and provide useful insights
about welfare imputation practices. More recently, Dang and Verme (2019) is the first study to
propose the use of a cross-survey imputation method in the context of the Syrian refugees living
in Jordan, using survey and administrative data provided by the United Nations High
Commissioner for Refugees (UNHCR). Their findings suggest that in the absence of actual
household consumption survey data, their cross-survey imputation method can provide a second-
best alternative to address the data challenge.


In this paper, we make several new contributions to the nascent literature on poverty measurement
for refugees. First, we apply the cross-survey imputation method to a sample of refugees based in
Chad. Since this country is in Sub-Saharan Africa and one of the poorest countries in the world,
its differences from Jordan in both geographical location and income levels represent a new setting
to apply and further validate this imputation method. Second, we exploit a richer and more diverse
set of data than the Jordan study, which includes registration data, census-type targeting data, and
a household consumption survey. We also rigorously examine the imputation method against
different poverty lines, including the food poverty line, the national poverty line, the international
poverty line, and various other simulated values. Finally, in addition to the poverty imputation
exercise, we test the performance of the proposed method for targeting purposes. In particular, we
compare the targeting performance of our method with the targeting method currently used in Chad
to administer cash assistance to refugees as well as with the global experience.


The estimation results indicate that the limited set of variables available in the UNHCR registration
data predict household consumption (welfare) reasonably well. Estimates from the three sets of
data available for the analysis produce similar welfare figures. The current targeting strategy in
Chad, which is used jointly by the National Commission on the Welcoming and Resettlement of
Refugees (CNARR), UNHCR, and World Food Programme (WFP), is accurate in predicting
household welfare. However, our results suggest that this targeting strategy could be further
improved by reducing the inclusion and exclusion errors. If these encouraging results are replicated




                                                                                                    3
in other contexts, poverty predictions for refugees can be expanded at scale, with good prospects
for the improvement of targeted programs.


The poverty estimates used in this paper do not reflect the official poverty estimates monitored by
the government and the international community. The interest of this paper is to test a cross-survey
imputation methodology, and, for this purpose, we use a subsample of UNHCR refugee data that
are not nationally representative of the refugee population in Chad. By contrast, the official poverty
statistics require national consumption surveys conducted by the national statistical office with
samples that are nationally representative. At the time of writing this paper, these national data
were not available, but they are expected to be published before the end of 2020. This will provide
another opportunity to validate the cross-survey imputation method proposed in this paper.


The paper is organized as follows. Section II outlines the country context. Section III presents the
data and analytical framework. The estimation results are presented in section IV, and section V
evaluates the targeting strategy used in Chad and our targeting method in light of the global
experience. Section VI discusses the limitations of the study, and section VII concludes.



   II. Country Context


Chad is one of the poorest countries in the world. According to the latest national consumption
survey, which was administered in 2011, 29% of the population falls below the food poverty line
and 47% falls below the national poverty line (World Bank 2018). The past decade has been a
decade of instability for Chad, with negative consequences on household well-being. Per capita
gross domestic product decreased by 15% between 2015 and 2017, from an average of US$963 in
2015 to US$823 in 2017 (in 2010 purchasing power parity (PPP)). In terms of overall development,
Chad is ranked 187 of 189 countries on the Human Development Index (World Bank 2019). Due
to these challenges, Chad struggled to meet many of the Millennium Development Goals in 2015,
and barring unforeseen economic growth or great increases in official development assistance, the
country is not likely to meet many of the Sustainable Development Goals objectives set for 2030.




                                                                                                    4
Despite the current negative economic downturn, Chad continues to host a high number of refugees
and is among the countries that top the worldâ€™s list in this respect (Table A1). Chad is the 10th
largest host country for refugees in the world and the fifth largest host in the Africa region after
Ethiopia, Kenya, Uganda, and the Demographic Republic of Congo. Chadâ€™s refugee population
represents a significant portion of the national populationâ€”about 3%. The number of forcibly
displaced persons increased from 474,478 in 2015 to 667,586 as of March 2019, of which about
69% were refugees or asylum seekers. 3 Of the 459,809 current refugees and asylum seekers, the
majority are Sudanese refugees (74%) living in the eastern part of Chad; 21% are Central African
Republic refugees living in southern Chad; a much smaller number of Nigerian refugees (about
2%) are living in the Lake Chad Basin. The situation is further complicated by the large population
of IDPs in the Lake Chad region, estimated at 165,313 at end of 2018 (UNHCR 2018). Map 1
shows the locations of the refugee camps in Chad.



  III. Methodology and Data


Methodology
The methodology used in this paper relies on the cross-survey imputation framework, which was
first introduced by Elbers et al. (2003). 4 Most recently, Dang et al. (2017) build on this literature
to propose a model that imposes fewer restrictive assumptions and offers an explicit formula for
estimating the poverty rate and its variance. Three advantages of the modifications introduced by
Dang et al. (2017) are (i) the variance formula, which is simple and in line with the recent statistical
literature; (ii) it can accommodate complex design sampling; and (iii) the framework remains
applicable to two surveys with different designs. Finally, the approach allows for different
modeling methods, including the standard linear regression model, its variant with a flexible
specification of the empirical distribution of error terms, a logit model, and/or a probit model.




3 UNHCR uses the term â€œpeople of concernâ€ to describe those who are forcibly uprooted from their homes, including asylum-

seekers, refugees, stateless persons, the internally displaced, and returnees.
4 See also Tarozzi (2007) and Mathiassen (2009) for further improvements and adaptation of this approach (e.g., by estimating the

standard errors in a different way).


                                                                                                                               5
Let xj be a vector of characteristics that are commonly observed between the two surveys, where
j indicates survey type, with 1 the base survey and 2 the target survey. Let us assume that the
welfare indicator is a function of household and individual characteristics (í µí±¥í µí±¥í µí±—í µí±— ):


                                                 yj = Î²j xj + Î¼cj + Îµj


where í µí±¦í µí±¦í µí±—í µí±— is the welfare indicator (consumption per capita per month), í µí»½í µí»½í µí±—í µí±— is a vector of parameters,
í µí¼‡í µí¼‡í µí±í µí±í µí±í µí± is cluster random effects, and í µí¼€í µí¼€í µí±—í µí±— is the idiosyncratic error term.


The framework proposed by Dang et al. (2017) is based on two assumptions. The first assumption
(Assumption 1), which is critical for poverty imputation, states that measurement of household
characteristics in each sample of data is a consistent measure of the characteristics of the whole
population. In other words, it stipulates that the two surveys are representative of the same target
population. In our context, the two surveys represent the same population of refugees, and they
were conducted approximatively at the same time. Therefore, we do not expect major issues with
this first assumption. However, we will conduct means difference tests on the observed
overlapping variables between the target data and base data to ensure that this is the case. The
second assumption states that changes in í µí±¥í µí±¥í µí±—í µí±— between the data collection periods of the two data
sets can capture the change in welfare over the period (Assumption 2). Since data collection for
the two data sets we use refers to the same year, there is no need to test Assumption 2. Under these
two assumptions, the imputed welfare is


                                1    â€²
                               y2 = Î²1 x2 + Ï…1 + Îµ1 .                                   (1)


Dang et al. (2017) propose different imputation methods for parameter estimation. The first
method relies on the assumption of the normal distribution for the two error terms (Î¼cj and Îµj are
uncorrelated and Î¼cj âˆ• xj ~í µí²©í µí²© (0, ÏƒÎ¼cj ) and Îµj âˆ• xj ~í µí²©í µí²© (0, ÏƒÎµj )). Hereafter, this method is referred to
as the normal linear regression model. An alternative method proposed is the empirical error
method, which assumes no functional form for these error terms and uses instead the empirical
distribution to estimate the parameters. Dang et al. (2017) also propose two other alternative



                                                                                                             6
methodsâ€”the Probit Model and the Logit Modelâ€” which are more restrictive and model poverty
status (poor and nonpoor) instead of consumption expenditure.


Once the parameters are estimated, the welfare indicator, which is household consumption per
capita, is obtained as follows:


                                  ï¿½2
                                  y 1     ï¿½â€²      ï¿½ 1,s + Îµ
                                     ,s = Î²1 x2 + Ï…
                                                  ï¿½       ï¿½1,s .
                                                          ï¿½                           (2)


The imputed poverty rate and its variance are then estimated as:


                                           1
                                  i) ï¿½
                                     P2 = S âˆ‘S     ï¿½2
                                             s=1 P(y
                                                     1
                                                      ,s â‰¤ z1 )                       (3)

                                        ï¿½2 ) = 1 âˆ‘S
                                  ii) V(P              ï¿½2,s |x2 ) + V ï¿½1 âˆ‘S
                                                     V(P                    ï¿½
                                                                            P |x ï¿½. (4)
                                               S s=1                   S s=1 2,s 2



These poverty estimates are unbiased estimates of the parameters of interest (Dang et al., 2017)
and outperform in terms of prediction accuracy those proposed by the proxy means testing
literature, which typically omits the error terms Ï…1 + Îµ1 , leading to biased estimates of the welfare
indicator.


Data
In its mandate to protect displaced persons in host countries, the UNHCR collects data to track
refugees and other populations of interest, better monitor these populations, and deliver assistance
and services. In the framework of this study, we use three sets of data collected by the UNHCR
(Table A2). The first one is the ProGres data set, which is the UNHCRâ€™s registration system
covering all refugees or asylum seekers requiring assistance. The ProGres data set is a live
instrument that is continuously updated as new refugees/asylum seekers arrive or existing refugees
contact the UNHCR. The data we use were extracted at the end of December 2017. This set of data
contains socioeconomic variables (such as household size, marital status, gender, age, country of
origin, and region of residence) but no consumption or expenditure data. This data set can be
considered as the â€œcensusâ€ of refugees.




                                                                                                    7
The second set of data, the Targeting data set, is also a census-like data set for refugees living in
Chad. The main aims of this census are to fill knowledge gaps on refugee livelihoods and the levels
and differences of vulnerability in refugee households, and to categorize refugees into wealth
groups for cash, food, and livelihood assistance. In addition to categorizing refugees, the Targeting
data set aims to identify factors that can enable self-reliance. This data set is based on a mixed
methods approach, including qualitative and quantitative methods. The first step is the use of focus
groups with refugee leaders, womenâ€™s organizations, and youth associations, to identify the wealth
characteristics and key challenges specific to age and gender. Next, a sample survey is carried out
across camps to confirm the wealth characteristics that were identified by refugees in the first step.
Based on the outcomes of the first two steps, a quantitative survey that is designed to capture
wealth characteristics is administered to all the refugee households. The Targeting data set includes
all Sudanese, Central African, and Nigerian refugees living in Chad. The data were collected
between June 17, 2017 and July 15, 2017 and cover 19 refugee sites and refugees living in nine
host villages. After the data collection, a statistical model, which takes into account household
welfare, is used to classify households into four socioeconomic groups (very poor, poor, average,
and better off). For the variables that are relevant for this study, this data set contains demographic
variables (household size, gender, age, country of origin, and region of residence), variables for
asset and animal ownership, and variables reflecting coping strategies. As for the ProGres data set,
the Targeting data set does not contain information on consumption or expenditure. However, it
does contain information on wealth.


The last set of data is the Post-Distribution Monitoring (PDM) data set, which is from a sample
survey that covers similar themes as the Targeting data set. The PDM data set, which was collected
in 2017 by the WFP, aims to provide better understanding of how refugees use food assistance and
contains data on consumption and expenditure. The sampling design is a two-stage stratified
random sample, where the first stage includes the selection of camps and the second stage the
selection of households. Beforehand, the different camps are stratified in three zones: (i) North
East (Ourecassoni, Amnaback, Iridimi Touloum), (ii) Centre-East (Goz Amir, Djabal, Gaga,
Teguine, Bredjing, Farchana), and (iii) South (Amboko, Dossey, GondjÃ©, Belom, Moyo) (Map 1).
In addition, the sampling takes into account the kind of humanitarian assistance that is provided to
refugees (in-kind, food voucher, or cash). The survey includes two consumption aggregates



                                                                                                     8
measuring monthly total consumption and monthly food consumption, using retrospective
questions with varying recall periods depending on the item considered (from seven days to one
year). The consumption aggregate is compiled by aggregating the different food and non-food
items, including expenditures on education, health, durable assets, and rent. In the framework of
this study, we consider two welfare indicators from the PDM data set. The first is the household
total consumption expenditure per capita per month, and the second is the household food
consumption per capita per month. 5


For poverty imputation purposes, three data sets are constructed from the ProGres, Targeting, and
PDM data sets. The first, which we refer to as â€œProGres 2,â€ is obtained by appending the ProGres
data to the end of the PDM data. As the ProGres and PDM data share only demographic variables,
ProGres 2 contains demographic variables for all observations, although only the observations
from the PDM data have consumption expenditure. The second set of data constructed, â€œTargeting
2,â€ comes from appending the Targeting data to the end of the PDM data. Therefore, Targeting 2
contains demographic variables, asset and animal ownership, and coping strategies variables as
well as consumption data. The last set of data, â€œProGres Targeting,â€ is obtained by first merging
the ProGres and Targeting data (matching 72% of the observations) and then appending these data
to the end of PDM data. This set of data is the more complete data set in terms of variables. The
motivation of the construction of these three sets of data is to check whether the different sources
of data as well as different sets of variables generate different poverty figures, consequently
determining the set that best predicts poverty. To ensure comparability across the three data sets,
we restrict the analysis to 16 of the 19 refugee sites, because the PDM data cover only 16 sites.
Consequently, this study covers only refugees from the Central African Republic and Sudan, and
not the Nigerian refugees, and all the estimates provided in the paper are not representative of all
the refugees living in the country.




5 The aim of the paper is not to measure consumption accurately or estimate nationally representative poverty figures for refugees

in Chad. The purpose of the paper is only to test the cross-survey imputation methodology using a sample of refugee data. In this
respect, our only concern is that the poverty predictions are close to the poverty rates calculated with the consumption data. Whether
our consumption data are accurate or not, this is less relevant for us. We should expect the cross-survey methodology to produce
even better results if the quality of the consumption aggregate improves.


                                                                                                                                    9
  IV.   Estimation Results


As a first step, we check whether our data sets are representative of the same underlying population
(Assumption 1) by performing means difference tests across key predictors. Given that the PDM
is a subsample of the Targeting or ProGres data sets, we need a test that is suitable for partially
overlapping samples. Here we use the same test proposed by Verme at al. (2016, 58) applied to
refugee ProGres data in Jordan. Table 1 provides the results. It shows that all the variables are not
significantly different in terms of means, indicating that the two samples are representative of the
same population.


To evaluate the performance of the welfare estimation model, we consider three models. Model 1
includes demographic and geographic variables (region of residence and country of origin). This
is the most parsimonious model and uses the variables that are most readily available in the
ProGres data set. Model 2 adds to Model 1 variables related to animal and asset ownership. Model
2 is richer than Model 1, but it is more demanding in terms of the control variables, which may
also be less reliable or more likely to be missing in the census data. Model 3 adds to Model 2
variables measuring coping strategies. To test for multicollinearity, Table 2 reports the variance
inflation factor (VIF) for the different models. It shows that no variable has a VIF that is over 5,
and the mean VIF is smaller than this threshold. We conclude that multicollinearity is not an issue
for any of the models considered.


Next, we test the out-of-sample performance and possible overfitting of the three models, using
PDM data and the root mean square error (RMSE) and mean absolute error (MAE) as performance
functions. To do so, the data set is split into five equal folds. In the first iteration, the first fold is
used to test the model, and the rest are used to train the model. In the second iteration, the second
fold is used as the testing set, while the rest serve as the training set. This process is repeated until
each of the five folds has been used as the testing set. The performance function is obtained as the
mean across the five iterations.


For the food consumption aggregate, the three models have similar measures of goodness-of-fit
for both indicators. Model 1â€™s RMSE is 0.55, Models 2 and 3â€™s RMSE is 0.54. For the MAE,



                                                                                                        10
Models 1 and 3 have a value of 0.42, whereas Model 2 has an RMSE of 0.41. When we turn to the
overall consumption aggregate, we note a small difference between the three models. The RMSE
values range from 0.53 to 0.58, with Model 3 and Model 1 having the smallest and highest RMSE,
respectively. The MAE is quite similar across the three models, within a range from 0.39 (Model
3) to 0.41 (Model 2). These results suggest that no model consistently outperforms the other
models.


Tables 3 and 4 apply the model to the three data sets described in the data section (ProGres 2,
Targeting 2, and ProGres Targeting data), using the normal linear regression model and the
Empirical Error Model and three poverty lines: (i) a poverty line of US$1.9 a day (PPP), which
represents the international poverty line for extreme poverty (panel A); (ii) the national poverty
line, which corresponds to around US$2.6 (World Bank 2013), represented in panel A; and (iii)
the national food poverty line of US$1.8 (PPP), shown in Table 3. These three poverty lines are
among the set of the arbitrary poverty lines considered in the general simulation above to evaluate
the quality of the prediction. 6 With one exception, the predicted poverty rates are not statistically
different from the poverty rates obtained from survey consumption data (henceforth, â€œsurvey
estimatesâ€). For the case of the food and international poverty lines, this is partly due to the large
standard errors of the prediction estimates, but these findings hold for the national poverty line
where the standard errors of the predicted values are much smaller. 7


Figure 1 repeats the exercise of Table 5 for the ProGres Targeting data and all the poverty lines
between the 66th and 99th quantiles of consumption. Panels A and B, respectively, are the normal
linear model and empirical error model. The results suggest that Models 1 and 2 predict the poverty
rates for different poverty lines well. The predicted poverty rates are within the 95% confidence
interval for all the arbitrary poverty lines considered. The predictions are also very similar across
the normal and empirical error models. However, Model 3 overestimates poverty, and the
predicted poverty rates provided are outside the 95% confidence interval of the survey-based rates
for the set of different poverty lines considered. As Model 3 adds variables related to coping

6 See also Table A3 in the annex for the full base model.
7
  The national and international poverty lines are used for illustrative purposes only. As the consumption aggregate used in this
paper is not comparable to a full consumption aggregate as usually derived from nationally representative household surveys, the
resulting poverty statistics are not comparable to national or international poverty statistics. In our case, these lines are simply
alternative thresholds used to test the sensitivity of poverty estimates to different poverty lines.


                                                                                                                                11
strategies, it might be that households are not accurate in reporting these strategies, for example,
by overestimating the frequency of using these strategies to receive more assistance from
humanitarian organizations.


Figure 2 shows the predicted welfare rates for the set of different poverty lines for all three models,
but this time with a focus on food security. Welfare based on food security is defined in
humanitarian settings as the inability to afford the minimum expenditure basket required to
purchase a food basket based on basic needs. The minimum expenditure basket is defined by the
WFP â€œas what a household requires in order to meet their essential needs, on a regular or seasonal
basis, and its average costâ€ (WFP 2018). The results are very similar to the overall welfare results
displayed in Figure 1. The results indicate that Models 1 and 2 predict the actual welfare rates well
based on food security for different poverty lines and are within the 95% confidence interval for
all the arbitrary poverty lines considered. The predictions are also very similar for the two different
estimation models of error terms, the normal linear model and empirical error model. Again, Model
3 overestimates the poverty rates, as the predicted welfare rates are outside the confidence interval
of the survey-based rates. 8


In summary, Figures 1 and 2, corresponding to Models 1 and 2, underestimate welfare for low
poverty lines and overestimate for high poverty lines but are within the confidence intervals. Model
3 always overestimates poverty for smaller poverty lines and its predictions are outside the
confidence interval. In general, Models 1 and 2 predict poverty and food security poverty well for
different arbitrary poverty lines. Based on these results, we conclude that these two models provide
fairly accurate aggregate welfare estimations of refugees living in Chad, and that the variables
currently available in the ProGres UNHCR registration system can be combined with other survey
data to predict the aggregate welfare of refugees efficiently.


The cross-survey imputation literature 9 stresses the importance of selecting a few key predictors,
and our results from Model 1, which contains only demographic variables, are in line with this


8 To check for possible heterogeneity, we split the sample with respect to country of origin. The results were similar except the

larger estimate variances (less precision), which might be due to sample size for refugees from the Central African Republic.
9 See, for example, Dang et al. (2017a); Dang and Lanjouw (2018); Dang and Verme (2019). See also Luca et al. (2018) for a

related study on variable selection with linear models.


                                                                                                                             12
empirical evidence. Previous empirical studies also highlight that adding household assets helps
to improve on poverty estimates, and Model 2, which adds asset and animal ownership to Model
1, is consistent with this evidence. However, adding more variables may lead to overfitting,
resulting in less accurate welfare estimates. The results of Model 3 could be placed in this context.




      V. Targeting Performance


The imputed welfare estimates can be useful in evaluating ex post the inclusion/exclusion errors
of the food assistance programs administered by the government and humanitarian organizations
during 2016/17. The targeting strategy for food assistance was agreed to and implemented by the
UNHCR, WFP, and the government agency in Chad responsible for refugees, the CNARR. We
perform an analysis to show how accurately the current targeting strategy identifies poor
households in terms of inclusion (leakage) and exclusion (undercoverage) errors. Both error types
are important but from different perspectives. The inclusion error matters primarily from a budget
perspective, as it represents a waste of resources. The exclusion error summarizes the programâ€™s
failure to cover households in need.


The current UNHCR/WFP/CNARR targeting approach relies on the Food Consumption Score
(FCS) generated by the WFPâ€™s PDM surveys, which is a composite score based on dietary
diversity, food consumption frequency, and the relative nutritional importance of different food
items. As is the case for any index, the FCS is contingent on the selection of the food group weights
as well as the food item thresholds, which are based on inherently subjective choices. Survey-to-
survey methods have been shown to outperform these types of index approaches, whereas the
Dang et al. (2017) cross-imputation method has been shown to perform better than the proxy means
testing also in refugee contexts (Dang and Verme 2019). 10


In light of the previous findings, we empirically evaluate how the UNHCR/WFP/CNARR
targeting strategy performs relative to the targeting method based on predicted consumption and


10   On optimal targeting in humanitarian contexts, see also Verme and Gigliarano (2019).


                                                                                                  13
relative to the available international evidence. Table 6 shows the undercoverage and leakage rates
for the different approaches. The method we propose (panel B) outperforms the targeting method
currently used in Chad (panel A) for all the poverty lines except the 25th percentile poverty line.
The errors are not low overall, with the UNHCR/WFP/CNARR undercoverage rates ranging from
9% to 32% and the leakage rates from 12% to 36%, and our model-based undercoverage rates
from 6% to 40% and the leakage rates from 9% to 41%. However, these methods perform relatively
well when compared with international evidence. For example, Skoufias et al. (2001) find that the
undercoverage and leakage rates for the PROGRESA program in Mexico were 7% and 70%,
respectively, for a poverty rate of 25%, a better performance on the undercoverage rate but a much
worse performance on the leakage rate compared with the other two programs.


The estimated targeting rates for Chad are also better than the median performance of similar
scores for programs across the world (see Table 7). Coady et al. (2004) report an index of targeting
performance obtained by dividing the proportion of beneficiaries falling within the target
population by the proportion of beneficiaries that would result from a random allocation. For
example, if the bottom 40% of the income distribution receives 60% of the funding, the
performance indicator is 1.5 (60/40). The higher the indicator is, the greater is the performance of
the targeting strategy. Table 6 reports this indicator for the 85 programs considered by Coady et
al. (2004) (A), the UNHCR/WFP/CNARR targeting program (B), and our proposed methodology
(C). Our methodology outperforms the UNHCR/WFP/CNARR targeting program and the median
value of the programs covered by Coady et al. (2004), and the UNHCR/WFP/CNARR program
does not perform poorly when compared with the international evidence.



  VI. Limitations


The objective of this work was to test how cross-survey imputation methods perform in estimating
poverty for refugee populations, using Chad as a case study. Although the results of our cross-
survey imputation exercise show that key demographic variables from ProGres predict well the
welfare measure captured in the PDM at the aggregate level, additional work is needed to assess
how well this methodology performs in refugee contexts, particularly in poor countries and data
scarce environments. To do this, data sets should ideally contain more detailed information on


                                                                                                 14
consumption, and they should be matched by individual household, using the IDs available in the
ProGres registration data.


Further, the PDM data measure consumption using relatively fewer variables than those in round
4 of the Chadian Household Consumption and Informal Sector Surveys (ECOSIT4). As such, the
poverty estimations from the PDM data could be improved. The poverty estimates used in this
paper do not reflect the official poverty estimates monitored by the government and the
international community. The interest of this paper was to test a cross-survey methodology, and,
for this purpose, we used a subsample of UNHCR refugee data that are not nationally
representative of the refugee population in Chad. By contrast, official poverty statistics require
national consumption surveys conducted by the national statistical office, with samples that are
nationally representative. The UNHCR and the World Bank are working closely to improve and
increase comparable refugee data to nationals. Thanks to these recent efforts, in 2018/19, refugees
were included for the first time in the ECOSIT4. When these data become available, it will be
possible to run a similar analysis to assess how cross-survey imputation fares using nationally
representative consumption measures for poverty, and to understand how cross-survey imputation
can predict household poverty outcomes for refugees and host populations alike with comparable
data.


Although the work presented in this paper remains a valid experiment for cross-survey
imputations, the data did not cover the entirety of refugees in Chad, including some refugees who
live outside camps. As the latter live in different environments, predicting their welfare may
require different sets of variables. And measuring consumption among refugees who rely on a
combination of handouts and informal incomes is a relatively new science. Existing survey
instruments may need to be adapted, and the meanings of concepts such as utility and capabilities
among refugees need to be reconsidered.

 VII. Conclusion


UN General Assembly Sustainable Development Goal 1â€”End poverty in all its forms by 2030â€”
explicitly pledges that â€œno one will be left behind.â€ Tracking the progress made toward this
objective requires the availability of high-quality household consumption surveys. However, the


                                                                                                15
majority of countries across the world, especially developing countries, face challenges in
collecting poverty data. High-quality consumption surveys that are comparable for forcibly
displaced persons and their hosts are and will remain in limited supply, given the cost and
challenges associated with these types of surveys. In the meantime, cross-survey imputation
methods can provide a second-best alternative that can potentially save time and money.


This study combined survey and census-type data on refugees to estimate welfare for refugees in
Chad. We showed how different sets of variables as well as different sources of data fare in the
identification of poor households, in particular, how well the set of variables available in the
ProGres database can predict poverty. In a second step, the paper estimated the accuracy of the
current UNHCR/WFP/CNARR targeting strategy and compared it with the targeting strategy
based on imputed consumption in the light of international evidence.


The results suggest that the set of variables available in ProGres accurately predicts the welfare
rates for different poverty lines. Adding variables related to asset and animal ownership provides
predictions that are very close to the ones with only the variables available in the ProGres data set.
These results are especially promising, as the UNHCR ProGres data are available in most refugee
locations where the UNHCR runs the registration system, and thus these methods are replicable in
many settings of forcibly displaced persons.


The current targeting strategy that is used for food, livelihoods, and cash-based assistance, despite
its simplicity, is rather accurate when compared with the existing international evidence. The
targeting errors resulting from the current UNHCR/WFP/CNARR targeting strategy for a poverty
rate of 25% are in the same error range as other targeting methods around the world, as reported
in Coady et al. (2004). We also showed that the existing targeting method can be improved by
imputing consumption using the methodology proposed in this paper.




                                                                                                   16
References

Altman, D.G., 1991. Mathematics for kappa. Pract. Stat. Med. Res. 406â€“407.
Barker, D.J.P., 2007. The origins of the developmental origins theory. J. Intern. Med. 261, 412â€“
       417. https://doi.org/10.1111/j.1365-2796.2007.01809.x
Borjas, G.J., Monras, J., 2017. The labour market consequences of refugee supply shocks. Econ.
       Policy 32, 361â€“413. https://doi.org/10.1093/epolic/eix007
Coady, D., Grosh, M., Hoddinott, J., 2004. Targeting of Transfers in Developing Countries:
       Review of Lessons and Experience.
Dang, H.-A., Jolliffe, D., & Carletto, C. (2019). Data gaps, data incomparability, and data
       imputation: A review of poverty measurement methods for dataâ€scarce
       environments. Journal of Economic Surveys, 33, 757-797.
Dang, H.-A.H., Lanjouw, P.F., 2018. Poverty Dynamics in India between 2004 and 2012:
       Insights from Longitudinal Analysis Using Synthetic Panel Data. Economic Development
       and Cultural Change, 67(1), 131-170
Dang, H.-A.H., Lanjouw, P.F., Serajuddin, U., 2017. Updating poverty estimates in the absence
       of regular and comparable consumption data: methods and illustration with reference to a
       middle-income country. Oxf. Econ. Pap. 69, 939â€“962.
Dang, H.-A.H., Verme, P., 2019. Estimating Poverty for Refugee Populations Can Cross-Survey
       Imputation Methods Substitute for Data Scarcity? World Bank Policy research Working
       Paper No. 9076
Del Carpio, X.V., Wagner, M., 2015. The Impact of Syrians Refugees on the Turkish Labor
       Market, Policy Research Working Papers. The World Bank.
       https://doi.org/10.1596/1813-9450-7402
Derrick, B., Russ, B., Toher, D., White, P., 2017. Test Statistics for the Comparison of Means for
       Two Samples That Include Both Paired and Independent Observations. J. Mod. Appl.
       Stat. Methods 16, 9.
Elbers, C., Lanjouw, J.O., Lanjouw, P., 2003. Microâ€“Level Estimation of Poverty and Inequality.
       Econometrica 71, 355â€“364. https://doi.org/10.1111/1468-0262.00399




                                                                                               17
Douidich, M, Ezzrari, A., van der Weide, R and Verme, P. 2016 Estimating Quarterly Poverty
       Rates Using Labor Force Surveys: A Primer, World Bank Economic Review, Vol. 30,
       Issue 3, pages 475-500,
Gluckman, P.D., Hanson, M.A., Pinal, C., 2005. The developmental origins of adult disease.
       Matern. Child. Nutr. 1, 130â€“141. https://doi.org/10.1111/j.1740-8709.2005.00020.x
Gwet, K.L., 2012. Handbook of inter-rater reliability: the definitive guide to measuring the
       extent of agreement among ratersâ€¯; [a handbook for researchers, practitioners, teachers &
       students], 3. ed. ed. Advanced Analytics, Gaithersburg, MD.
Jyoti, D.F., Frongillo, E.A., Jones, S.J., 2005. Food Insecurity Affects School Childrenâ€™s
       Academic Performance, Weight Gain, and Social Skills. J. Nutr. 135, 2831â€“2839.
       https://doi.org/10.1093/jn/135.12.2831
Little, R, J. A. and Rubin, D. B. (2002) Statistical Analysis with Missing Data. John Wiley and
       Sons, second edition
Luca, G.D., Magnus, J.R., Peracchi, F., 2018. Balanced Variable Addition in Linear Models. J.
       Econ. Surv. 32, 1183â€“1200. https://doi.org/10.1111/joes.12245
Lumey, L.H., Stein, A.D., Susser, E., 2011. Prenatal Famine and Adult Health. Annu. Rev.
       Public Health 32, 237â€“262. https://doi.org/10.1146/annurev-publhealth-031210-101230
Mathiassen, A., 2009. A model based approach for predicting annual poverty rates without
       expenditure data. J. Econ. Inequal. 7, 117â€“135. https://doi.org/10.1007/s10888-007-9059-
       7
Newhouse, D., Shivakumaran, S., Takamatsu, S., Yoshida, N., 2014. How survey-to-survey
       imputation can fail. The World Bank.
Skoufias, E., Davis, B., de la Vega, S., 2001. Targeting the Poor in Mexico: An Evaluation of the
       Selection of Households into PROGRESA. World Dev. 29, 1769â€“1784.
       https://doi.org/10.1016/S0305-750X(01)00060-2
UNHCR, 2019. GlobalTrends: Forced Displacement in 2018.
UNHCR, 2018. Figures at a Glance [WWW Document]. UNHCR. URL
       http://www.unhcr.org/figures-at-a-glance.html (accessed 11.9.18).
van den Berg, G.J., Pinger, P.R., Schoch, J., 2016. Instrumental Variable Estimation of the
       Causal Effect of Hunger Early in Life on Health Later in Life. Econ. J. 126, 465â€“506.
       https://doi.org/10.1111/ecoj.12250



                                                                                                  18
Verme, P., Gigliarano, C., Wieser, C., Hedlund, K., Petzoldt, M., Santacroce, M., 2016. The
       Welfare of Syrian Refugees: Evidence from Jordan and Lebanon. The World Bank.
       https://doi.org/10.1596/978-1-4648-0770-1
Verme, P. Gigliarano, C. 2019 Optimal Targeting under Budget Constraints in a Humanitarian
       Context, World Development, vol. 119(C), pages 224-233.
World Bank, 2013. Dynamics of Poverty and Inequality following the Rise of the Oil Sector.
World Food Programme, 2018. Minimum Expenditure Basket: Interim Guidance Note. WFP.
       https://docs.wfp.org/api/documents/WFP-0000074198/download/




                                                                                              19
Table 1 : Means Difference Tests

                                                         PDM                       ProGres Targeting            Two-sided p-value

                                                 Mean    Std. Dev.    N     Mean       Std. Dev.   N Obs.   t-test for overlapping groups
                                                                     Obs.

        Demographics and geographical variable
        HH size                                  4.76      2.96      1440   4.11         2.53      65943                0.83
        Gender                                   0.65      0.48      1440   0.69         0.46      65943                0.96
        Age of HH head                           42.34    14.01      1441   42.19        14.70     65943                0.97
        Education
         No Education                            0.63      0.48      1440   0.55         0.50      56838                0.81
         Koranic School                          0.15      0.36      1440   0.19         0.39      56838                0.99
         Primary                                 0.12      0.33      1440   0.15         0.36      56838                0.90
         Secondary                               0.09      0.28      1440   0.10         0.30      56838                0.92
         Higher                                  0.01      0.07      1440   0.01         0.08      56838                0.99
        Marital status
         Married                                 0.08      0.28      1440   0.10         0.30      65934                0.91
         Divorced                                0.08      0.28      1440   0.10         0.30      65934                0.90
         Widowed                                 0.18      0.38      1440   0.11         0.31      65934                0.96
         Single                                  0.05      0.22      1440   0.08         0.27      65934                0.90
        Occupation is agriculture                0.49      0.50      1439   0.789        0.408     65943                0.99
        Origin                                   0.467    0.499      1441   2.79         0.41      65943                0.61
        Asset and animal ownership
         HH has phone                            0.15      0.36      1440   0.17         0.38      65943                0.99
         HH has carts                            0.02      0.14      1440   0.03         0.18      65943                0.92
         HH has bike                             0.05      0.23      1440   0.02         0.15      65943                0.98
         HH has moto                             0.02      0.12      1440   0.02         0.13      65943                0.98
         HH has radio                            0.06      0.23      1440   0.08         0.27      65943                0.91
         HH has cattle                           0.02      0.15      1441   0.02         0.13      65943                0.98
         HH has donkeys                          0.07      0.25      1441   0.44         0.50      65942                0.87
         HH has sheep                            0.04      0.19      1441   0.09         0.29      65943                0.16
         HH Has goats                            0.06      0.24      1441   0.14         0.35      65943                0.18
         HH Has horses                           0.06      0.23      1441   0.04         0.19      65942                0.87
         HH Has poultry                          0.09      0.28      1441   0.17         0.38      65943                0.19
        Coping strategies
         Consume seeds                           0.17      0.38      1104   0.17         0.38      65943                0.92
         Sell assets                             0.01      0.07      1104   0.06         0.24      65943                0.80
         Send children for Begging               0.03      0.16      1104   0.05         0.21      65943                0.99
         Sell last breeding Female               0.01      0.10      1104   0.05         0.22      65943                0.87
        Region of residence                      4.74      2.19      1441   5.08         2.02      65943                0.97
           Region 1                              0.13      0.32      1441   0.101        0.30      65943                0.95
           Region 2                              0.19      0.39      1441   0.12         0.32                           0.88
           Region 3                              0.18      0.38      1441   0.07         0.25      65943                0.80
           Region 4                              0.09      0.30      1441   0.18         0.48      65993                0.61
           Region 5                              0.16      0.37      1441   0.03         0.17      65943                0.74
           Region 6                              0.09      0.29      1441   0.18         0.38      65943                0.86
           Region 7                              0.15      0.36      1441   0.13         0.338     65943                0.96

Source: Authorsâ€™ calculations.




                                                                                                                                            20
Table 2: Collinearity Tests
                              Model 1           Model 2           Model 3
                              VIF       1/VIF   VIF       1/VIF   VIF       1/VIF
 HH size                      1.15      0.87
 Age of head of HH            1.29      0.77    1.33      0.75    1.29      0.77
 HH is Farming                1.37      0.73    1.4       0.71    1.34      0.75
 Head of HH has primary       1.25      0.80    1.27      0.79    1.27      0.78
 education
 Head of HH attended          1.14      0.87    1.2       0.83    1.23      0.81
 Islamic School
 Head of HH has               1.16      0.86    1.19      0.84    1.36      0.74
 Secondary education
 Head of HH has Higher        1.02      0.98    1.03      0.97    1.06      0.95
 education
 HH is Female                 1.41      0.71    1.5       0.66    1.42      0.70
 Head of HH is divorced       1.16      0.86    1.19      0.84    1.14      0.88
 Head of HH is widowed        1.35      0.74    1.4       0.71    1.49      0.67
 Head of HH is single         1.14      0.87    1.19      0.84    1.14      0.88
 Country origin is            3.71      0.27    4.94      0.20    3.46      0.29
 Soudan
 Region 2                     2.12      0.47    2.69      0.37    1.91      0.52
 Region 3                     2.3       0.43    3.01      0.33    2.3       0.43
 Region 4                     1.9       0.53    1.96      0.51    1.5       0.66
 Region 5                     2.08      0.48
 Region 6                     2.07      0.48    2.21      0.45    1.71      0.59
 Region 7                     1.79      0.56    1.81      0.55    1.66      0.60
 HH has Phone                                   1.21      0.83    1.23      0.81
 HH has Carts                                   1.15      0.87    1.14      0.87
 HH has Bikes                                   1.21      0.83    1.4       0.71
 HH has Moto                                    1.05      0.95    1.08      0.93
 HH has Radio                                   1.17      0.85    1.22      0.82
 HH has Cattle                                  1.06      0.94    1.07      0.94
 HH has Horses                                  1.30      0.77    1.31      0.76
 HH consumes seeds as                                             1.15      0.87
 coping strategies




                                                                                    21
Table 3: Predicted Total and Food Poverty Rates Compared to the International and National Poverty Lines*

                                                      ProGres                   Targeting 2              ProGres Targeting
                                                         2
                                                      Model 1       Model         Model       Model   Model    Model      Model
                                                        (1)           1             2           3       1        2          3
                                                                     (2)           (3)         (4)     (5)      (6)        (7)
 Panel A: Poverty rates at international
 standard
  Normal linear regression model                        80.9         76.4         76.0        75.4    78.1      79.7       79.8
                                                        (4.2)       (3.42         (3.4)       (4.2)   (3.5)     (3.6)      (4.2)
  Empirical error model                                 81.5         77.0         76.55       75.5    79.0      80.5       80.4
                                                        (4.2)       (3.6)         (3.6)       (4.4)   (3.5)     (3.6)      (4.3)
  Survey Poverty Rate
                                                                78.8
                                                                (1.9)
 Panel B: Poverty rates at national standard
  Normal linear regression model                        90.9            88.0       87.5       87.0    88.6      90.0       90.0
                                                        (2.4)           (2.1)      (2.7)      (2.7)   (2.2)     (2.1)      (2.6)
   Empirical error model                                91.6            89.0       88.4       87.4    89.7      90.9       90.1
                                                        (2.1)           (21)       (21)       (2.7)   (2.1)     (2.0)      (2.5)
   Survey Poverty Rate
                                                                89.7
                                                                (1.5)
 Control Variables
  Demographics & Employment                              Y            Y             Y           Y       Y        Y          Y
  Asset and animal ownership                             N            N             Y           Y       N        Y          Y
  Coping Strategies                                      N            N             N           Y       N        N          Y
 R2 adjusted                                            0.57         0.55          0.57        0.66    0.52     0.55       0.62
 Observations (N)                                      65242        82468         82467       82467   56830    56829      56829

Note:*The international total poverty line is $1.88 PPP per person per day while the most recent national total (Food) poverty line
in Chad is $2.60 ($1.88) per person per day. Robust standard errors in parentheses are clustered at the camp level. We use 1,000
simulations for each model run.
Source: Authorsâ€™ calculations.




                                                                                                                               22
Table 4: Food Poverty
                                                                                        ProGres Targeting
                                                                   Model 1              Model 2                         Model 3
                                                                    (1)                   (3)                            (3)
  Food Poverty rates
  Normal linear regression model                                     82.1                 83.4                           81.6
                                                                     (3.5)                (3.2)                          (4.0)
  Empirical error model                                              82.8                 83.2                           82.0
                                                                     (3.3)                (3.3)                          (4.2)
  Survey poverty                                                                                  80.1
                                                                                                  (1.9)




Table 5: Models Out of Sample Performance, Individual Level

                                  Food Consumption aggregate                                    Overall Consumption aggregate
                    Model 1            Model 2                 Model 3                     Model 1            Model 2            Model 3
 RMSE                  0.5                0.5                     0.5                           0.6               0.5              0.5
 MAE                   0.4                0.4                     0.4                           0.4               0.4              0.4
Note:* The sample size of PDM dataset that 1441 is divided into five parts. Performances functions (RMSE and MAE) are obtained
as the mean across the five iterations.
Source: Authorsâ€™ calculations.


Table 3: Comparison of Coverage and Leakage Rates (%)
                                                                                            Poverty lines
                                                                  25th                   50th                80th              90th
                                                                  Percentile         Percentile           Percentile        Percentile
 A. Current UNHCR/WFP/CNARR targeting strategy
 approach
  Undercoverage Rate                                                     32              32                  19                    9
  Leakage Rate                                                           36              36                  22                   12
 B. Our predicted consumption-based targeting
  Undercoverage Rate                                                     40              26                  12                    6
  Leakage Rate                                                           41              28                  14                    9
 C. PROGRESAâ€™s method targeting
  Undercoverage Rate                                                     7               10                  16
  Leakage Rate                                                           70              43                  16
Source: Authorsâ€™ calculations for UNHCR/WFP/CNARR Targeting Strategy and Skoufias et al. (2001).




                                                                                                                                         23
Table 4: Targeting Performance of sample of programs, current UNHCR/WFP/CNARR, and our imputed
consumption-based Targeting

                                                                                   Poverty lines
                                                     10th Percentile           20th Percentile              40th Percentile
                                                  Median      Min      Max    Median      Min      Max    Median       Min    Max

 A. All 85 programs in Coady et al. (2004).         2.8       0.8      7.5      2.2        0.7     4.3       1.5       1.0    2.1

 B. UHNCR Targeting                                  4                          3.1                          1.6
 C. Imputed Consumption based Targeting             5.5                         3.3                          1.9

Source: Authorsâ€™ calculations for UNHCR/WFP/CNARR Targeting Strategy and compilations based on data in Coady et al. (2004).




                                                                                                                                24
     Figure 1: Predicted welfare and Survey based welfare for different poverty lines, ProGres Targeting




Note: The blue dashed curve presents the actual poverty rates derived from the PDM observations in the ProGres Targeting,
meaning that the blue dashed curve presents poverty rates derived from observed consumption of the PDM. The green solid curve
with circle symbol represents the predicted poverty rates from Model 1 with observations from Merged ProGres Targeting. The
indigo solid curve with symbol â€œxâ€ represents the predicted poverty rates from Model 2 with the Merged ProGres Targeting
observations while the orange solid curve with the triangle symbol represents the predicted poverty rates from Model 3 with the
Merged ProGres Targeting observations.




                                                                                                                           25
Figure 2 : Predicted welfare and survey based welfare based on food security for different poverty lines,
                                          ProGres Targeting




                                                                                                            26
Map 1: Campsâ€™ map




Source: UNHCR, 2018




                      27
Annex:

Table A1: Distribution of persons of concern by Group in Chad

 Type                                                           Number   Proportion
 Refugee and Asylum seeker                                      459809      68.9
 Returnees                                                       5746       0.9
 IDPs                                                           165313      24.8
 Chadian Returnees from CAR                                     16718       2.5
 Others                                                         20000       3.0
 Total                                                          667586     100.0
Source: Authorsâ€™ calculations, ProGres.




                                                                                      28
Table A2: Summary of data

 Number   Dataset              Overview              Date       Implementing     Existence of   Relevant Variables to poverty
                                                                Agency           Consumption    imputation available
                                                                                 expenditure
                                                                                 information
                                                    Panel A: Data available

 1        UNHCR                Census for all        June       UNHCR                No              1.   Demographics
          Registration Data    refugee households    2017
          (ProGres)-
 2        Targeting Database   Census for all        June-      UNHCR/WFP            No              1.   Demographics
          2017- All Chad       refugee households    July       and CNARR                            2.   Asset and animal
                                                     2017                                                 ownership
                                                                                                     3.   Coping strategies



 3        Post Distribution    Sub-Sample of         2017       WFP                  Yes             1.   Demographics
          Monitoring 2017-     refugees                                                              2.   Asset and animal
                                                                                                          ownership
                                                                                                     3.   Coping strategies
                                                                                                     4.   Consumption
                                                                                                          expenditure

                                      Panel B: Data constructed for poverty imputation

 1        ProGres 2            CAR and Sudanese          -      Constructed by           -           1.   Demographics
                               refugees living in               authors                              2.   Consumption
                               regions covered by                                                         expenditure for
                               PDM                                                                        observations from
                                                                                                          PDM



 2        Targeting 2          CAR and Sudanese          -      Constructed by           -           1.   Demographics
                               refugees living in               authors                              2.   Asset and animal
                               regions covered by                                                         ownership
                               PDM                                                                   3.   Coping strategies
                                                                                                     4.   Consumption for
                                                                                                          observations from
                                                                                                          PDM



 3        ProGres Targeting    CAR and Sudanese          -      Constructed by           -           1.   Demographics
                               refugees living in               authors                              2.   Asset and animal
                               regions covered by                                                         ownership
                               PDM                                                                   3.   Coping strategies
                                                                                                     4.   Consumption for
                                                                                                          observations from
                                                                                                          PDM


Source: Authors calculations




                                                                                                                              29
Table A3: Estimation Model
                                                   (1)                                    (2)                                     (3)
                                                Model 1                                Model 2                                 Model 3
 HH size                                        -0.16***                               -0.15***                                -0.14***
                                                 (0.01)                                 (0.01)                                  (0.01)

 Age of head of HH                                -0.00                                  0.00                                    -0.00
                                                 (0.00)                                 (0.00)                                  (0.00)

 HH is Farming                                    0.08*                                 0.11**                                 0.18***
                                                 (0.05)                                 (0.04)                                 (0.05)

 Head of HH has primary education                 -0.06                                  -0.01                                   -0.06
                                                 (0.06)                                 (0.05)                                  (0.07)

 Head of HH attended Islamic School               -0.00                                  -0.09                                 -0.18**
                                                 (0.08)                                 (0.07)                                 (0.09)

 Head of HH has Secondary education               0.07                                   -0.06                                   -0.11
                                                 (0.09)                                 (0.09)                                  (0.10)

 Head of HH has Higher education                  -0.23                                  -0.25                                  -0.55*
                                                 (0.33)                                 (0.31)                                  (0.30)

 HH is Female                                    -0.18***                               -0.12**                                -0.21***
                                                  (0.05)                                (0.05)                                  (0.06)

 Head of HH is divorced                           0.00                                   0.01                                    0.11
                                                 (0.09)                                 (0.08)                                  (0.13)

 Head of HH is widowed                           -0.19***                               -0.13*                                 -0.25***
                                                  (0.07)                                (0.07)                                  (0.08)

 Head of HH is single                             0.04                                   0.05                                    0.11
                                                 (0.12)                                 (0.10)                                  (0.16)

 Country origin is Soudan                        0.42***                                0.59***                                0.84***
                                                 (0.10)                                 (0.11)                                 (0.11)

 Region 2                                        -0.60***                               -0.30**                                -0.45***
                                                  (0.11)                                (0.12)                                  (0.14)

 Region 3                                        -1.13***                               -0.94***                               -0.77***
                                                  (0.12)                                 (0.12)                                 (0.14)

 Region 4                                        -0.44***                               -0.51***                               -0.66***
                                                  (0.07)                                 (0.07)                                 (0.09)

 Region 5                                         0.00                                   0.00                                   0.00
                                                   (.)                                    (.)                                    (.)

 Region 6                                        -0.38***                               -0.45***                               -0.63***
                                                  (0.08)                                 (0.07)                                 (0.10)

 Region 7                                         0.03                                   0.01                                    -0.03
                                                 (0.06)                                 (0.06)                                  (0.06)

 HH has Phone                                                                            0.07                                    0.09
                                                                                        (0.06)                                  (0.07)

 HH has Carts                                                                           0.33**                                   0.39*
                                                                                        (0.14)                                  (0.20)

 HH has Bikes                                                                            0.11                                    0.15
                                                                                        (0.13)                                  (0.16)

 HH has Moto                                                                             0.35*                                   0.32
                                                                                        (0.21)                                  (0.21)

 HH has Radio                                                                           0.20**                                   0.15
                                                                                        (0.09)                                  (0.11)

 HH has Cattle                                                                           -0.05                                   -0.11
                                                                                        (0.12)                                  (0.14)

 HH has Horses                                                                           0.08                                    0.09
                                                                                        (0.07)                                  (0.11)

 HH consumes seeds as coping strategies                                                                                          0.01
                                                                                                                                (0.07)

 _cons                                           6.43***                                6.06***                                5.99***
                                                 (0.13)                                 (0.13)                                 (0.15)
 N                                                803                                    803                                    503
 R2 adjusted                                      0.52                                   0.55                                   0.62


Source: Authorsâ€™ calculations, PDM survey.
Note: Standard errors in parentheses, * p < 0.10, ** p < 0.05, *** p < 0.01. The dependent variable is the log of household expenditure per capita
and results obtained from the PDM survey alone.




                                                                                                                                                 30