Issues in the construction of wealth indices for the measurement of socioeconomic position in lowincome countries
 Laura D Howe^{1}Email author,
 James R Hargreaves^{1} and
 Sharon RA Huttly^{1}
DOI: 10.1186/1742762253
© Howe et al; licensee BioMed Central Ltd. 2008
Received: 30 July 2007
Accepted: 30 January 2008
Published: 30 January 2008
Abstract
Background
Epidemiological studies often require measures of socioeconomic position (SEP). The application of principal components analysis (PCA) to data on assetownership is one popular approach to household SEP measurement. Proponents suggest that the approach provides a rational method for weighting asset data in a single indicator, captures the most important aspect of SEP for health studies, and is based on data that are readily available and/or simple to collect. However, the use of PCA on asset data may not be the best approach to SEP measurement. There remains concern that this approach can obscure the meaning of the final index and is statistically inappropriate for use with discrete data. In addition, the choice of assets to include and the level of agreement between wealth indices and more conventional measures of SEP such as consumption expenditure remain unclear. We discuss these issues, illustrating our examples with data from the Malawi Integrated Household Survey 2004–5.
Methods
Wealth indices were constructed using the assets on which data are collected within Demographic and Health Surveys. Indices were constructed using five weighting methods: PCA, PCA using dichotomised versions of categorical variables, equal weights, weights equal to the inverse of the proportion of households owning the item, and Multiple Correspondence Analysis. Agreement between indices was assessed. Indices were compared with per capita consumption expenditure, and the difference in agreement assessed when different methods were used to adjust consumption expenditure for household size and composition.
Results
All indices demonstrated similarly modest agreement with consumption expenditure. The indices constructed using dichotomised data showed strong agreement with each other, as did the indices constructed using categorical data. Agreement was lower between indices using data coded in different ways. The level of agreement between wealth indices and consumption expenditure did not differ when different consumption equivalence scales were applied.
Conclusion
This study questions the appropriateness of wealth indices as proxies for consumption expenditure. The choice of data included had a greater influence on the wealth index than the method used to weight the data. Despite the limitations of PCA, alternative methods also all had disadvantages.
Introduction
Defining and measuring socioeconomic position
Socioeconomic position (SEP) is a concept widely used in epidemiological research. Definitions vary, but commonly incorporate physical resources, social resources, and status within a social hierarchy[1]. Measurement of SEP is crucial not only for studies focusing on the social determinants of health, but also for the vast majority of observational health research, since SEP is likely to confound many relationships.
Traditionally, indicators of SEP have tended to be monetary measures such as income or consumption expenditure, based on the assumption that material living standards largely determine wellbeing[2]. Whilst it is now widely recognised that monetary measures of SEP fail to capture all of the diverse aspects of wellbeing, their use remains widespread, partially due to difficulties in measuring more complex conceptualisations of SEP, and because monetary measures may have clearer policy implications. There is longstanding debate about whether income or consumption expenditure is a better measure of SEP. Income is generally more variable than consumption; Friedman's permanent income hypothesis states that households are likely to base their consumption decisions on more than just their current income – people tend to 'smooth' their consumption in times of income fluctuation, for example by borrowing or drawing on savings in times of low income[3]. It is therefore widely asserted that consumption expenditure is a better marker of longterm SEP than income. This argument holds particularly strongly in lowincome countries, where income may come from a variety of sources and may vary dramatically across seasons. Longerterm aspects of SEP are thought to be most relevant to many health outcomes, adding to the reasons for choosing consumption expenditure over income.
In lowincome countries, measurement of consumption expenditure is fraught with difficulties. There are problems with recall and reluctance to divulge information. Additionally, prices are likely to differ substantially across times and areas, necessitating complex adjustment of expenditure figures to reflect these price differences[4]. Furthermore, collecting consumption expenditure data requires lengthy questionnaires that must be completed by skilled and trained interviewers. There are therefore both reliability and cost/time reasons why epidemiologists conducting health research in lowincome countries may wish to use an alternative measure of SEP. Additionally there are existing datasets rich in health data, such as the Demographic and Health Surveys (DHS), which lack information on income or consumption expenditure.
The assetbased approach to measuring socioeconomic position
An assetbased approach to measuring household SEP is one alternative to income and consumption expenditure. This approach has arisen from demographic studies such as the DHS, which although lacking data on income or consumption expenditure, collect information on ownership of a range of durable assets (e.g. car, refrigerator, television), housing characteristics (e.g. material of dwelling floor and roof, toilet facilities), and access to basic services (e.g. electricity supply, source of drinking water). These items were all originally included in the surveys for their direct influences on health; for instance, television and radio ownership was of interest to identify households receiving public health messages. Researchers began to see that these assets could be used as indicators of living standards and have sought to construct wealth indices for that purpose[2, 5]. Wealth indices measure SEP at the household level and can only be used to assess relative SEP within a population.
Collection of asset data has been claimed to be more reliable than income or consumption expenditure, since it uses simple questions or direct observation by the interviewer and should therefore suffer from less recall or social desirability bias[6]. This claim has, however, been questioned by a recent study which demonstrated at best moderate interobserver and betweentest reliability for asset data collection[7].
An assetbased wealth index could be theorised to represent longterm SEP in a similar way to consumption expenditure; asset ownership is likely to be based at least partially on economic wealth and household assets are unlikely to change in response to shortterm economic shocks. There is, however, continuing debate about the appropriateness of considering a wealth index as a proxy for consumption expenditure. Two separate studies have demonstrated weak correlation between consumption expenditure and wealth indices: a study in Mozambique showed a Spearman's rank correlation coefficient of 0.37[8], and a study using multiple datasets producing R^{2} values from regressions of consumption expenditure on a wealth index of ≤ 0.23[9]. A study using Indonesian data found that there was considerable reranking of households between a wealth index and consumption expenditure, with approximately 50% of households being misclassified when the population was split into the bottom 30%, middle 40% and top 30%[10]. Other studies have demonstrated considerable variation in the correlation across countries, with Spearman's rank correlation coefficients between 0.43–0.64 in one study and 0.39–0.71 in another[6, 11]. It could be argued that a wealth index captures a longerterm state of wealth than consumption expenditure; in times of economic shock, selling assets is likely to come subsequent to reductions in consumption expenditure. As both measures attempt to measure longterm SEP, and since it is useful to have a standard against which to judge wealth indices, we will consider consumption expenditure as a gold standard measure of longterm SEP, and explore the extent to which wealth indices agree with consumption expenditure.
Weighting the items in a wealth index
When constructing a wealth index from a set of variables, a decision must be made about the weights to assign to each indicator. Principal Components Analysis (PCA) was recommended as a method for determining weights for components of a wealth index by Filmer and Pritchett[11]. Guidelines for the use of PCA for wealth indices were published by Vyas and Kumaranayake[12].
PCA is a 'data reduction' procedure. It involves replacing a set of correlated variables with a set of uncorrelated 'principal components' which represent unobserved characteristics of the population. The principal components are linear combinations of the original variables; the weights are derived from the correlation matrix of the data or the covariance matrix if the data have been standardised prior to PCA. The first principal component explains the largest proportion of the total variance. If the first few principal components explain a substantial proportion of the total variance, they can be used to represent the original items, thus reducing the number of variables required in models[13].
For constructing a wealth index, the first principal component is taken to represent the household's wealth[14]. The weights for each indicator from this first principal component are used to generate a household score. Assets that are more unequally distributed across the sample will have a higher weight in the first principal component[12]. The relative rank of households using the score generated from the first principal component is then used as a measure of relative SEP, enabling calculation of a single estimate of the effect of wealth[15]. The use of a single principal component in this way could be questioned, since the first principal component from PCA of a set of assets frequently explains a low proportion of the total variation in those assets (often less than 20%)[11, 12, 16]. It could be the case that the theoretical 'wealth' construct is multidimensional, with the first few principal components each capturing a specific aspect of wealth. Using only the first principal component would, in this case, not capture the entire wealth effect. However, the aim of using PCA to generate a wealth index is to define a single indicator of SEP, and using multiple principal components would not be compatible with this. If the first principal component explains a small proportion of the total variance, each subsequent higher order component will explain a smaller proportion still, so using two or three principal components may not drastically improve the proportion of the total variance explained. It is also not generally straightforward to identify which aspects of wealth higher order principal components might represent, since there is not usually a clear pattern of which assets are assigned positive/negative or higher/lower weights. Furthermore, there is some evidence that utilising higher order principal components is unnecessary. McKenzie demonstrated that the standard deviation of higher order components was not associated with consumption expenditure, whereas that of the first principal component was[16]. Filmer and Pritchett noted that multivariate analyses of the association between the wealth index and school enrollment were robust to the inclusion of higher order components[11].
After the paper by Filmer and Pritchett, the use of PCA for wealth index construction was quickly adopted by the World Bank and Macro International Inc. for analysis of inequalities within DHS datasets[5, 17–19]. The approach is now also more widely used. Nevertheless, this application of PCA is not fully justified and requires further investigation. PCA is designed for use with continuous, normallydistributed data. Its application to the predominantly discrete data in a wealth index is therefore inappropriate. The use of binary dummy variables for each category of categorical variables (as recommended by Filmer and Pritchett[11]) is particularly problematic. The linear dependence between the dummy variables may lead to incorrect estimates of the wealth index; the PCA method is affected by collinearity, with variation in the data arising both from the underlying concept of wealth and from the linear dependence between dummy variables of categorical variables. This approach has been shown to be inferior to several alternative methods of dealing with categorical data[20]. The alternative methods explored were using ordinal variables, using group means, and using polychoric correlations. These methods, whilst being preferable in terms of the data assumptions of PCA, do require strong assumptions about the ordinal nature of the data. It is not necessarily straightforward, for instance, to rank different sources of drinking water, and to assume that they are equally spaced from each other in terms of their relationship with SEP.
The limitations of PCA for the construction of wealth indices are thus twofold: i) PCA is problematic with the discrete data commonly included in a wealth index, and ii) the first principal component frequently explains only a low proportion of the total variation in asset data. Furthermore, PCA is a fairly complex method. It is likely to be unfamiliar and poorly understood by less technical readers of papers. It could therefore be argued that simpler, more familiar and easily understood methods for weighting the items in a wealth index would be preferable. Using an equal weights approach (simple sum) was used in several early studies using wealth indices[21, 22]. Although simple, this approach could be criticised for being arbitrary and simplistic, since different assets are unlikely to have equal meaning in terms of SEP. The literature comparing indices constructed using PCA and using an equal weights approach is not consistent. There is some evidence that PCA performs no better as a proxy for consumption expenditure than an equal weights approach[23]. In contrast, Bollen et al. showed that a PCAbased wealth index and an equal weights index had considerably different regression coefficients with consumption expenditure[24]; another study also demonstrated that a PCAbased wealth index had a stronger relationship than an equal weights index with a latent variable of permanent income (planned and anticipated income, a longterm concept of SEP that both consumption expenditure and wealth indices have been claimed to be measuring)[25].
Another potentially simpler and more easily understood alternative to PCA is to use the inverse of the proportion of households that own an asset as its weight. This is based on a method originally suggested by Townsend[26]. The underlying assumption is that assets owned by a smaller proportion of households are indicative of higher household wealth and are therefore assigned a higher weight[27]. A problem with methods using inverse proportion weights is that not all assets show a linear relationship with living standards, e.g. ownership of a motorbike may tend to increase up to a certain income and subsequently decrease in richer households[5]. A similar method was applied by Morris et al., who calculated weights by using the inverse of the proportion of households that owned each item, multiplying that by the number of units of asset owned by the household, and summing this quantity for all assets[28]. Both the equal weights and the inverse proportion weighting methods can only be applied to binary data.
Multiple Correspondence Analysis (MCA) is analogous to PCA, but is for discrete data[29]. Whilst this method does not remove the complexity and unfamiliarity of PCA, nor the problems of the first dimension explaining a small proportion of the total variance, it is appropriate for the analysis of the categorical data commonly collected on most assets[30]. Booysen et al. utilised MCA to construct wealth indices for seven subSaharan African countries. They found that the index was very highly correlated with one constructed using PCA, and that although households were not always in the same quintile by the two indices, movement was in most cases limited to one quintile in either direction. They also showed that the weights assigned to index items were generally similar by the two methods[30].
Other methods for weighting items in a wealth index do exist, but in general offer neither more simplicity than PCA, nor more suitability for discrete data. For instance, latent variable approaches have been proposed[31, 32]. In his 2005 paper, Montgomery constructs a wealth index using a latent variable approach called MIMIC; this model specifies which variables are determinants of living standards (e.g. education and occupation) and which are indicators of living standards (e.g. consumer durables). In other methods of wealth index construction, both determinants and indicators of the underlying socioeconomic construct may be included without distinction. For instance, producer durables such as farm equipment are sometimes included in a wealth index in the same way as consumer durables, whereas these should in fact be considered as determinants of the socioeconomic construct and not treated in the same way as indicator variables[31]. Latent variable methods, despite offering some theoretical advantages over PCA, are far more complex and arguably even less easily understood by a wide readership than PCA. A further option could be to assign weights based on the price of an item, but this requires detailed information allowing for date of purchase, area of purchase, and current condition of the item. There is also some evidence that pricebased indices are less reliable than alternatives; one study showed a pricebased index to have implausible relationships with health outcomes[33] and a further study demonstrated that two price methods had weaker relationships with a permanent income latent variable than alternative weighting methods[25]. In contrast, however, Morris et al. showed high correlation between wealth indices constructed using the inverse proportion method and weights based on the current value of each item[28]. The issue of prices is a crucial one. Consumption expenditure measures are adjusted for the variability of prices across regions. In contrast, the variability in prices is generally ignored when pooling data across regions to construct a wealth index. The methods currently used in the literature to incorporate prices into weights for wealth index indicators (typically relying on selfreported current sale value) do not, however, appear to be appropriate, and more complex methods involving regional price data calculation similar to the approach used for consumption expenditure data would probably be too costly for the majority of epidemiological studies.
Which concept of longterm SEP does a wealth index represent?
Both consumption expenditure and wealth indices are measured using householdlevel data. Equivalence scales are generally applied to consumption expenditure data in order to allow for household size and composition. The most frequently used equivalence scales are per capita (i.e. divided by the total number of household members), per adult or per adult equivalent (where each child is considered to require a predetermined proportion of the consumption of one adult). Wealth indices, however, are not generally adjusted for household size or composition. There is some evidence that adjusting a wealth index for household size results in implausible relationships with health outcomes[5]. It has also been argued that while consumption needs and patterns will obviously be strongly affected by household size and composition, the benefits of most items included in a wealth index are at the household level[5]. It has, however, been demonstrated that wealth indices and per capita expenditures produce very different patterns in household size; in 11 lowincome countries, the poorrich difference in average household size was consistently greater when using per capita expenditures compared with a wealth index[34]. This indicates that households with a greater number of members, a factor often associated with poverty, would not always end up in the lower quintiles of a wealth index.
In considering the appropriateness of a wealth index as a proxy for consumption expenditure, it has been suggested that the choice of equivalence scale may have a substantial impact on the observed relationship. Sahn and Stifel suggested that the correlation of a wealth index would be highest when total household expenditures were considered, intermediate when a per adult equivalence scale is used, and lowest when per capita consumption expenditure is used[6]. There is, however, no evidence of this presented in the current body of literature.
Aim
The aim of these analyses is to compare wealth indices constructed using different weighting methods to identify whether PCA offers an advantage over either simpler, more transparent methods (equal weights and inverse of the proportion of the population owning the asset) or methods more appropriate for discrete data (MCA). Furthermore, the agreement of a wealth index with consumption expenditure measures adjusted for household size and composition in different ways will be examined to identify which aspect of longterm SEP a wealth index best represents.
Methods
To illustrate our exploration of wealth indices, we analysed data from the Malawi Integrated Household Survey 2004–5 (IHS2)[35]. This national survey of 11,280 households collected data on the socioeconomic living conditions in Malawi. It contained both asset data and a measure of consumption expenditure. The measure of consumption expenditure was calculated using annualised figures for consumption expenditure across categories of food and nonfood consumption according to the UN classification system 'Classification of Individual Consumption According to Purpose'. A price index was used to adjust for differences in prices across areas and times. The Malawi National Statistical Office evaluated equivalence scales for the consumption expenditure aggregate, and found the poverty profile to be remarkably similar when a per capita or a per adult equivalent scale was used[36]. For these analyses, a per capita equivalence scale was used, i.e. total household consumption expenditure was divided by the number of household members. The assets used to construct the wealth indices were those used in analyses by the World Bank of the 2000 Malawi DHS (toilet facility, main cooking fuel, main drinking water source, floor material of main dwelling, whether there is electricity in the home, owns radio, owns television/VCR, owns bicycle, owns car, owns motorbike/scooter, owns agricultural land, and presence of a domestic servant)[19]. All data cleaning and analyses were performed in Stata version 9[37].
 1.
Using PCA including all categories of categorical variables
 2.
Using PCA but with dichotomised versions of all categorical variables
 3.
Applying equal weights to binary variables
 4.
Weighting binary variables by the inverse of the proportion of the population which owns that item
 5.
Using MCA including all categories of categorical variables
Following recommended practice, for index 1 dummy binary variables were created for each category of categorical variable for inclusion in the PCA; for example a fourcategory variable would have been converted into four separate yes/no variables; for each household one of these would be coded 'yes' the other three 'no'[12]. Alternative ways of using categorical variables in PCA were not used because they require imposing an ordinal structure on the categories.
Applying equal weights and using the inverse of the proportion of the population that owns the item can only be carried out using binary variables. Therefore, for the purposes of creating indices 3 and 4, each categorical variable was collapsed to a binary variable based on a subjective assessment of the most appropriate dichotomisation, resulting in an appropriate distribution of ownership and meaningful categories. The detailed entries for observations coded as 'other' were examined in order to determine the most appropriate way to classify the 'other' group. The dichotomisations are detailed below:
Details of dichotomisation of categorical variables
Floor material

Lower SEP group: sand, smoothed mud

Higher SEP group: smooth cement, tile, other
Cooking fuel

Lower SEP group: firewood, crop residue, other

Higher SEP group: paraffin, electricity, charcoal
Water supply

Lower SEP group: personal open unprotected well, communal open unprotected well, river, spring, lake, reservoir, other

Higher SEP group: piped into dwelling, piped outside dwelling, communal standpipe, personal handpump, communal handpump, protected spring
Toilet facility

Lower SEP group: no toilet facility, other

Higher SEP group: flush toilet, VIP latrine, traditional latrine with roof, latrine without roof
In addition to using these binary variables for indices 3 and 4, index 2 was created in order to explore its agreement with index 1, and to facilitate a more direct comparison of the PCA approach with the simpler weighting methods used in indices 3 and 4.
Indices were standardised to give a mean of zero and a variance of one. Survey analysis was used for descriptive analyses to adjust for the complex sampling used in IHS2. Sampling weights cannot be applied during MCA and PCA; therefore, in order to facilitate comparisons, sampling weights were not used when calculating the weights for any index, but they were used for generating quintiles, as in previous studies[19, 38].
The PCAbased indices utilised the weights from the first principal component to ascertain the weights.
A Stata macro for MCA was downloaded from the EconPapers website[39]. In a similar manner to PCA, the weights used are those identified from the first dimension of the MCA. However, unlike PCA, the MCA command is not compatible with postestimation commands in Stata. Thus, in order to apply the weights, a score variable was manually generated applying the appropriate weight from the MCA to each indicator.
The distribution of each index was examined graphically to assess the extent of skewness and clumping. Clumping is a problem commonly found in wealth indices whereby a large proportion of households have the same (usually low) score, because a large number of households have similar (low) access to public services and ownership of consumer durables.
Indices were compared with each other in terms of scatter diagrams and misclassification of households between quintiles of indices. Kappa statistics were calculated in order to assess the agreement of classification between indices. The Kappa statistic is a measure of reliability that takes into account the agreement expected on the basis of chance. A Kappa statistic of one indicates perfect agreement and a value of zero indicates no agreement better than chance. There are no universal rules for interpreting Kappa statistics, but in general a value of less than 0.5 would indicate poor agreement. Misclassification between quintiles was chosen as the measure of agreement since almost all epidemiological studies using a wealth index will use quintiles of the index in analyses. Although previous studies have often used correlation coefficients to compare indices, this can be misleading since correlation can hide a systematic bias and does not necessarily imply agreement. Graphs were also constructed to compare indices; scatter plots were used for comparing two indices both using categorical data, and boxplots were used when one or both of the indices used binary variables.
In addition to comparisons between the indices, each index was compared with per capita consumption expenditure, which despite having its own limitations and reliability issues was taken as a gold standard measure of SEP.
In order to assess which aspect of longterm SEP a wealth index best represents, consumption expenditure measures were constructed adjusted in the following ways: i) no adjustment, i.e. total household expenditures, ii) per adult expenditures and iii) per capita expenditures. The agreement of each consumption expenditure measure with a wealth index was calculated. The wealth index was constructed from the same asset indicators as above, using PCA.
Results
Missing data levels were very low. Complete data were available on 11,243 of 11,280 households (99.7%).
Distribution of Indices
Weights assigned to index components
Weights assigned to each indicator in indices using categorical variables:
Item  Item weight  

PCA  MCA  
Toilet facility:  
Flush toilet  0.2760  2.081  
VIP latrine  0.0894  0.515  
Traditional latrine with roof  0.0015  0.019  
Latrine no roof  0.0613  0.125  
None or other  0.0923  0.197  
Water source:  
Piped inside dwelling  0.2762  2.428  
Piped outside dwelling  0.1631  0.857  
Communal standpipe  0.1251  0.161  
Personal handpump or well  0.0154  0.011  
Communal handpump or well  0.2270  0.138  
River, lake, spring, reservoir, or other  0.0433  0.179  
Cooking fuel:  
Collected firewood  0.3049  0.153  
Purchased firewood  0.1252  0.176  
Paraffin, gas or charcoal  0.2196  0.721  
Electricity  0.2451  2.537  
Crop residue, saw dust, or other  0.0043  0.084  
Floor material:  
Sand  0.0078  0.168  
Smoothed mud or other  0.3113  0.154  
Smooth cement, wood, or tiles  0.3310  0.613  
Electricity:  Yes  0.3427  1.6 
No    0.1  
Radio:  Yes  0.0193  0.007 
No    0.009  
TV:  Yes  0.2836  1.726 
No    0.070  
Bike:  Yes  0.0025  0.002 
No    0.001  
Car:  Yes  0.1885  2.247 
No    0.028  
Motorbike:  Yes  0.0432  0.869 
No    0.003  
Domestic servant:  Yes  0.1426  1.32 
No    0.025  
Agricultural land:  Yes  0.2280  0.081 
No    0.589 
Weights assigned to each indicator in indices using binary variables:
Item  Item weight  

PCA  Equal weights  Inverse proportion  
Toilet facility:  
some toilet facility  0.1429  1  1.2 
Water source:  
protected source  0.1703  1  1.5 
Cooking fuel:  
more likely to have been purchased  0.4320  1  11.8 
Floor material:  
modern  0.4084  1  5.0 
Electricity:  0.4600  1  17.1 
Radio:  0.0225  1  1.8 
TV:  0.4012  1  25.7 
Bike:  0.0014  1  2.8 
Car:  0.2766  1  82.3 
Motorbike:  0.0725  1  275.1 
Domestic servant:  0.2190  1  53.4 
Agricultural land:  0.3072  1  1.1 
Agreement of the indices with consumption
Movement of households between quintiles of wealth indices and per capita consumption expenditure
% Households moving between quintiles of the wealth index and quintiles of per capita consumption expenditure  1. PCA index  2. PCA index using binary variables  3. Equal weights index  4. Inverse proportion index  5. MCA index 

Same quintile  28.9  28.0  26.6  28.2  29.2 
Move one quintile  34.8  36.0  37.8  33.6  34.3 
Move two quintiles  21.5  20.6  22.3  22.5  22.1 
Move three quintiles  11.6  12.2  10.5  11.3  11.4 
Move four quintiles  2.9  3.1  2.8  4.4  3.0 
Kappa  0.11*  0.10*  0.082*  0.10*  0.12* 
Comparing the indices
Percentage of households in the same quintile and Kappa statistics of agreement between pairs of indices
1. PCA  2. PCA (binary)  3. Equal weights  4. Inverse proportion  5. MCA  

1. PCA    
2. PCA (binary)  41.9% κ = 0.27*    
3. Equal weights  35.9% κ = 0.20*  73.6% κ = 0.67*    
4. Inverse proportion  39.3% κ = 0.24*  69.5% κ = 0.62*  67.7% κ = 0.60*    
5. MCA  75.6% κ = 0.69*  51.5% κ = 039*  40.6% κ = 0.26*  43.4% κ = 0.29*   
Movement of households between quintiles of the indices
Wealth indices being compared  % Households moving between quintiles  

Same quintile  Move 1 quintile  Move 2 quintiles  Move 3 quintiles  Move 4 quintiles  
Index 1 (PCA all categories) and Index 2 (PCA binary variables)  41.9  41.3  13.3  4.5  0.4 
Index 1 (PCA all categories) and Index 3 (Equal weights)  35.9  38.5  18.8  7.1  1.1 
Index 1 (PCA all categories) and Index 4 (Inverse proportion)  39.3  39.2  13.3  8.6  0.98 
Index 1 (PCA all categories) and Index 5 (MCA)  75.6  18.9  5.8  0.65  0.33 
Index 2 (PCA binary variables) and Index 3 (Equal weights)  73.6  18.7  4.5  4.0  0.5 
Index 2 (PCA binary variables) and Index 4 (Inverse Proportion)  69.5  23.1  5.6  2.7  0.33 
Index 2 (PCA binary variables) and Index 5 (MCA)  51.5  36.3  11.6  1.5  0.36 
Index 3 (Equal weights) and Index 4 (Inverse proportion)  67.7  28.8  3.5  0.91  0.37 
Index 3 (Equal weights) and Index 5 (MCA)  40.6  38.4  16.4  4.9  1.0 
Index 4 (Inverse proportion) and Index 5 (MCA)  43.4  39.8  10.5  6.7  0.90 
Comparing Index 1 (PCA) and Index 5 (MCA), which both used categorical variables, approximately 75% of households were in the same quintile in the two indices, with a Kappa statistic of 0.69. For households in different quintiles, movement was generally limited to one quintile, with less than 5% of households moving two or more quintiles.
Agreement between pairs of indices using binary variables (Indices 2, 3 and 4) was also reasonably high, with approximately 70% of households being in the same quintile between two indices and Kappa statistics of approximately 0.6.
When comparisons were made between an index using categorical variables and an index using binary variables, agreement was weaker. Here, approximately 35–50% of households were in the same quintile between pairs of indices, with Kappa statistics of 0.2–0.4.
Figure 2D demonstrates that Index 4 (Inverse proportion) created a group of outliers; households which were ranked substantially higher by the inverse proportion index than by the PCA index. This pattern was present in comparisons of the inverse proportion index with all other indices. Closer examination of this group of households reveals that they have a significantly higher prevalence of motorbike ownership; 52.6% of households with a score of > 9 on the inverse proportion index own a motorbike, compared with 0.36% in the whole population. This demonstrates that when items of very low prevalence are included in an index constructed using the inverse proportion weighting method, the resultant very high weight they are assigned can produce some strange classifications of households.
Agreement of the wealth index with different measures of consumption expenditure
% Households moving between quintiles of wealth index and per capita consumption expenditure  

Consumption equivalence scale  Same quintile  Move 1 quintiles  Move 2 quintiles  Move 3 quintiles  Move 4 quintiles  Kappa (SE) 
Total consumption expenditure  28.8  34.7  21.7  12.1  2.7  0.10 (0.005) 
Per adult consumption expenditure  27.3  35.7  21.1  12.8  3.0  0.090 (0.005) 
Per capita consumption expenditure  28.9  34.8  21.5  11.6  2.9  0.11 (0.005) 
Discussion
The use of PCA to assign weights to assets included in a wealth index has gained popularity in recent years. Despite this popularity, this application of PCA remains novel; it is statistically unsuitable for use with the categorical data frequently included in wealth indices, and has not been fully investigated. Simpler, more familiar and easily understood methods for weighting a wealth index could include assigning equal weights to all items, or using weights equal to the inverse of the proportion of households owning the item.
We have shown that within this context, the way data are coded is far more important than the weighting method used to construct the index. Indices using data coded in the same way demonstrated high agreement with each other. Agreement was considerably lower between wealth indices constructed using data coded in different ways, i.e. indices using categorical variables compared with indices using binary variables. This suggests that the indicators used in a wealth index are of great importance, although further work attempting to replicate this finding in other settings would be beneficial. Whilst these analyses have used only the assets collected by DHS, further work investigating the effects of using a wider/different set of assets is recommended. Bollen et al. showed that within the Ghana 1998/9 Living Standards Measurement Study (LSMS), a wealth index constructed using a wider set of indicators had a stronger relationship with a permanent income latent variable than a wealth index constructed using only the core set of assets included in the DHS; in the Peru 1985 LSMS, however, the difference was small[25]. Researchers are urged to remember that this set of core assets was not originally included in the DHS for SEP measurement; the assets predictive of wealth may vary substantially between settings and over time and if the wealth index approach to SEP measurement is used in new data collection, it would seem unwise to rely on this set of assets without further exploration of the important indicators of SEP in a particular context.
The fact that the core set of assets in the DHS were originally included in the surveys for their direct effects on health has additional implications. Depending on the outcome of interest, many indicators commonly included in a wealth index potentially have direct effects on health. It may be the case, therefore, that variables are 'double counted' if included both in a wealth index and as separate indicators in a model, making interpretation of coefficients unclear. Houweling et al. demonstrated that excluding from the wealth index variables thought to have the strongest direct effects on child health did affect the magnitude and even direction of inequalities in child health, but the effect was not consistent across countries[38]. One approach to disentangle the effects of education on child health has been to include the education of the household head in the wealth index, and use the education of the child's parents as separate variables[31].
In analyses such as ours, which use large existing datasets, application of an inverse proportion approach can lead to items that are meaningless in a given context being assigned a large weight. This is demonstrated in our analyses by the fact that ownership of a motorbike was assigned a very high weight in the inverse proportion index, far higher than car ownership. In the other indices, car ownership is assigned a higher weight than motorbike ownership, as would probably be expected. This resulted in a subset of households being ranked far higher by the inverse proportion index than by the other indices. We would therefore suggest that using the inverse proportion weighting method is only suitable when data collection has been informed by formative research.
The indices all had similarly modest agreement with consumption expenditure. Within this setting, neither the weighting method used to construct the index nor the difference between using categorical and binary variables has a strong impact on the ability of a wealth index to proxy consumption expenditure. The modest agreement with consumption expenditure brings into question the use of a wealth index as a proxy for consumption, and raises the question of what a wealth index should be considered to be measuring. Despite its use in this and other studies as a goldstandard measure of SEP, consumption expenditure itself has considerable limitations and reliability issues. The lengthy questionnaires requiring accurate details of expenditures on many items over varying periods mean that the variable is at risk from substantial measurement error. Furthermore, the adjustments required for price differences across regions and imputations for rental value of housing and usevalue of durable goods require considerable assumptions and therefore introduce the possibility of bias. Consumption expenditure itself could be viewed as a proxy for some underlying socioeconomic concept, such as Friedman's notion of permanent income – planned and anticipated income, as opposed to current income[3]. The wealth index may therefore be measuring a different aspect of this underlying socioeconomic concept than consumption expenditure, or it may be measuring something else entirely. Some have claimed that a wealth index measures a longerterm economic status than consumption expenditure, since households are more likely to alter consumption in response to an economic shock than they are to sell assets or alter housing characteristics or access to public services[11]. In this context, the agreement of the wealth index with consumption expenditure did not differ between total, per adult and per capita consumption expenditure, meaning that this study was unable to shed further light on which aspect of longterm SEP a wealth index may be measuring.
The appropriateness of the wealth index as a measure of SEP may differ between subgroups of the population; different household economic strategies may affect the proportion of income that is spent on consumer durables. For instance, city slumdwellers may be at risk of frequent relocation and theft, and may therefore choose not to invest in durable goods, perhaps resulting in a lower wealth index score than may be appropriate. In addition, because prices are not generally taken into consideration in wealth index construction, the appropriateness of the wealth index may differ between urban and rural areas, and between regions. Further research into the extent of these differences and strategies to overcome them is warranted.
In terms of the ability of a wealth index to proxy consumption expenditure, PCA appears to offer little advantage over the simpler, more easily understood methods, nor over the more statistically appropriate method of MCA. However, agreement between the indices using the categorical variables and the indices using the binary variables was modest, suggesting that the data included in the wealth index does impact on the final index. While it is not possible to judge whether the indices using categorical data or the indices using binary data are more appropriate based on the agreement with consumption expenditure, other features of the data can be used to make this assessment. There will inevitably be some loss of information between categorical and binary variables, and few would disagree that more detailed information is generally preferable. Decisions regarding the dichotomisation of variables will inevitably be subjective to a large degree, and may therefore be inappropriate or suboptimal. Furthermore, the indices using categorical variables demonstrated considerably less clumping than the indices using binary variables, making it easier to generate quintiles of even size and improving differentiation between households. It could therefore be argued that PCA and MCA may be preferable over equal weights or inverse proportion approaches, despite the simple interpretation and ease of understanding for a wide audience of the latter two.
A further issue with PCA is its inappropriateness with discrete data. MCA is one possible solution to this. The indices generated by PCA and MCA demonstrated high agreement, and had a very similar agreement with consumption expenditure. It therefore appears that, despite concerns over the violation of assumptions underlying PCA, using discrete data in a PCAbased wealth index is of limited cause for concern. Due to the advantages of PCA in terms of computational simplicity, we would not advocate the use of MCA in preference over PCA. Furthermore, continuous variables such as number of people per sleeping room or area of land owned cannot be included in MCA.
Despite the fact that PCA is unfamiliar to many readers of epidemiological research papers and that it could be accused of obscuring the process of constructing a wealth index, there seems to be little reason to adopt any of the alternatives explored in this analysis. Within the current study setting, the simpler methods resulted in indices with more clumping, and the inverse proportion method is unsuitable unless data collection has been preceded by substantial formative research. MCA is no simpler to implement or understand than PCA, cannot be used with a mixture of discrete and continuous variables, and results in an index with very high agreement with a PCA index. We would therefore recommend that having made the decision to construct a wealth index, PCA is a suitable tool for assigning weights to the indicators. Researchers are urged, however, to be clear about the concept of SEP they wish to measure, and to give careful consideration to the feasibility and appropriateness of alternative indicators such as consumption expenditure. The data used to construct a wealth index have a far stronger impact on the final wealth index than the method used to weight the items. Researchers planning data collection for a wealth index are therefore encouraged to carefully consider the data they collect rather than simply collecting data on the set of assets in DHS questionnaires. Formative research may help to identify assets that are strong predictors of SEP in a particular context, increasing the appropriateness of the wealth index as a measure of SEP. A further possibility for selecting assets for data collection is to identify assets which are highly correlated with consumption expenditure[40]. This approach requires full data on consumption expenditure and assets from a recent existing study in the same setting.
The difficulties of collecting income and consumption expenditure data for health research in lowincome countries remain, and further alternatives to the wealth index approach are limited. Qualitative methods such as Participatory Wealth Ranking (PWR) have also been suggested as an alternative way of collecting SEP data, but such methods are probably only practical in small geographical areas[41–43]. This work has reviewed some of the issues with the wealth index approach to SEP measurement and has provided evidence that the data included in the index are more important than the method of index construction. We have also provided doubt that such an approach should be considered as a proxy for consumption expenditure, at least when using the standard set of assets collected by the DHS. This study, however, has been limited to a single dataset; further work to verify the generalisability of the findings in other contexts is recommended. In particular, results may differ in settings at varying stages of economic development. Furthermore, additional work on the consequences of using different sets of assets is recommended, as is an exploration of alternative methods to allow for price and other differences between urban and rural areas and between regions.
Declarations
Acknowledgements
The Malawi National Statistical Office kindly provided the IHS2 dataset. The authors would like to thank Paul Clarke and Bianca De Stavola for statistical advice and helpful suggestions following an early draft of the paper, and the two anonymous reviewers for important suggestions and improvements. LH is supported by an ESRC/MRC Interdisciplinary PhD studentship.
Authors’ Affiliations
References
 Krieger N: A glossary for social epidemiology. Journal of Epidemiology and Community Health. 2001, 55: 693700. 10.1136/jech.55.10.693. 10.1136/jech.55.10.693PubMed CentralView ArticlePubMedGoogle Scholar
 Falkingham J, Namazie C: Measuring health and poverty: a review of approaches to identifying the poor. DFID Health Systems Resource Centre; 2002.
 Friedman M: A theory of the consumption function. Princeton, New Jersey , Princeton University Press; 1957.Google Scholar
 Deaton A, Zaidi S: Guidelines for constructing consumption aggregates for welfare analysis. Washington DC , World Bank; 1999.Google Scholar
 Rutstein SO, Johnson K: DHS Comparative Reports 6: The DHS Wealth Index. Calverton, Maryland, USA , ORC Macro; MEASURE DHS; 2004.Google Scholar
 Sahn D, Stifel D: Exploring alternative measures of welfare in the absence of expenditure data. Review of Income and Wealth. 2003, 49 (4): 463489. 10.1111/j.00346586.2003.00100.x. 10.1111/j.00346586.2003.00100.xView ArticleGoogle Scholar
 Onwujekwe O, Hanson K, FoxRushby J: Some indicators of socioeconomic status may not be reliable and use of indices with these data could worsen equity. Health Economics. 2006, 15 (6): 639644. 10.1002/hec.1071View ArticlePubMedGoogle Scholar
 Lindelow M: Sometimes more equal than others: how health inequalities depend on the choice of welfare indicator. Health Economics. 2006, 15 (3): 263279. 10.1002/hec.1058View ArticlePubMedGoogle Scholar
 Montgomery MR, Gragnolati M, Burke KA, Paredes E: Measuring living standards with proxy variables. Demography. 2000, 37 (2): 155174. 10.2307/2648118View ArticlePubMedGoogle Scholar
 Sumarto S, Suryadarma D, Suryahadi A: Predicting consumption poverty using nonconsumption indicators: experiments using Indonesian data. SMERU Research Institute; 2006.
 Filmer D, Pritchett LH: Estimating wealth effects without expenditure data  or tears: an application to educational enrollments in states of India. Demography. 2001, 38: 115132.PubMedGoogle Scholar
 Vyas S, Kumaranayake L: Constructing socioeconomic status indices: how to use principal components analysis. Health Policy Plan. 2006, 21 (6): 459468. 10.1093/heapol/czl029View ArticlePubMedGoogle Scholar
 Bartholomew DJ, Steele F, Moustaki I, Galbraith JI: Chapter 5: Principal Components Analysis. The analysis and interpretation of multivariate data for social scientists. Chapman & Hall/CRC; 2002, 115142.Google Scholar
 CGAP: Assessing the relative poverty of microfinance clients: A CGAP operational tool.
 Abeyasekera S: Chapter 18: Multivariate methods for index construction. Household surveys in developing and transition countries: design, implementation and analysis. United Nations Statistics Division; 2003.Google Scholar
 McKenzie DJ: BREAD working paper No. 042: Measuring inequality with asset indicators. Bureau for Research in Economic Analysis of Development, 2003.
 Gwatkin DR, Rutstein S, Johnson K, Pande RP, Wagstaff A: Socioeconomic differences in health, nutrition, and population in Ghana. HNP/Poverty Thematic Group of the World Bank, 2000.Google Scholar
 Gwatkin DR, Rutstein S, Johnson K, Pande RP, Wagstaff A: Socioeconomic differences in health, nutrition, and population in Vietnam. HNP/Poverty Thematic Group of the World Bank, 2000.Google Scholar
 Gwatkin DR, Rutstein S, Johnson K, Suliman E, Wagstaff A, Amouzou A: Socioeconomic differences in health, nutrition and population: Malawi 1992, 2000. HNP/Poverty Thematic Group of the World Bank; 2000.Google Scholar
 Kolenikov S, Angeles G: The use of discrete data in PCA: theory, simulations, and applications to socioeconomic indices. University of North Carolina, 2004.Google Scholar
 Razzaque A, Alum N, LWai L, Foster A: Sustained effects of the 19745 famine on infant and child mortality in a rural are of Bangladesh. Population Studies. 1990, 44 (1): 145154. 10.1080/0032472031000144426View ArticlePubMedGoogle Scholar
 Guiley D, Jayne S: Fertility transition in Zimbabwe: determinants of contraceptive use and method choice. Population Studies. 1997, 51 (2): 173190. 10.1080/0032472031000149896. 10.1080/0032472031000149896View ArticleGoogle Scholar
 Setel P, Abeyasekera S, Ward P, Hemed Y, Whiting D, Mswia R, Antoninis M, Kitange H: Development, validation, and performance of a rapid consumption expenditure proxy for measuring income poverty in Tanzania: experience from AMMP Demographic Surveillance Sites. 2003.
 Bollen KA, Guilkey DK, Mroz TA: Binary Outcomes and Endogenous Explanatory Variables: Tests and Solutions with an Application to the Demand for Contraceptive Use in Tunisia. Demography. 1995, 32 (1): 111131. 10.2307/2061900View ArticlePubMedGoogle Scholar
 Bollen KA, Glanville JL, Stecklov G: Socioeconomic status, permanent income, and fertility: A latentvariable approach. Population Studies. 2007, 61 (1): 1534. 10.1080/00324720601103866View ArticlePubMedGoogle Scholar
 Townsend P: Poverty in the United Kingdom. Allen Lane and Penguin Books, Harmondsworth, Middlesborough and Berkley, University of California Press; 1979.Google Scholar
 Layte R, Nolan B, Whelan CT: Persistent and consistent poverty in the 1994 and 1995 waves of the European Community Household Panel Study. Working Paper. Dublin, Ireland , The Economic and Social Research Institute; 2002.Google Scholar
 Morris SS, Carletto C, Hoddinott J, Christiaensen LJM: Validity of rapid estimates of household wealth and income for health surveys in rural Africa. Journal of Epidemiology and Community Health. 2000, 54: 381387. 10.1136/jech.54.5.381. 10.1136/jech.54.5.381PubMed CentralView ArticlePubMedGoogle Scholar
 Bartholomew DJ, Steele F, Moustaki I, Galbraith JI: Chapter 4: Correspondence analysis. The analysis and interpretation of multivariate data for social scientists Chapman & Hall/CRC; 2002:81114.
 Booysen F, van der Berg S, Burger R, von Maltitz M, du Rand G: Using an asset index to assess trends in poverty in seven SubSaharan African countries: Brasilia, Brazil. 2005.
 Montgomery MR, Hewett PC: Urban poverty and health in developing countries: household and neighborhood effects. Demography. 2005, 42 (3): 397425. 10.1353/dem.2005.0020View ArticlePubMedGoogle Scholar
 Ferguson B, Tandon A, Gakidou E, Murray CJL: Estimating permanent income using indicator variables. Evidence and Information for Policy Cluster; World Health Organization, Geneva, Switzerland; 2002.Google Scholar
 Bollen KA, Glanville JL, Stecklov G: Socioeconomic status and class in studies of fertility and health in developing countries. Measure Evaluation, Carolina Population Center, University of North Carolina; 1999.Google Scholar
 Filmer D, Scott K: Assessing Asset Indices. The World Bank; 2007.Google Scholar
 Malawi Second Integrated Household Survey (IHS2) 20052005: Basic Information Document. Zomba , National Statistical Office of Malawi; 2005.z
 Note on construction of expenditure aggregate and poverty lines for IHS2. National Statistical Office of Malawi.
 Stata 9.2. Texas , StataCorp; 2006.
 Houweling TAJ, Kunst AE, Mackenbach JP: Measuring health inequality among children in developing countries: does the choice of indicator of economic status matter?. International Journal for Equity in Health. 2003, 2: 819. 10.1186/1475927628PubMed CentralView ArticlePubMedGoogle Scholar
 Econpapers: MCA: Stata module to perform multiple correspondence analysis http://econpapers.repec.org/software/bocbocode/s335503.htm
 Hanson K, McPake B, Nakamba P, Archard L: Preferences for hospital quality in Zambia: results from a discrete choice experiment. Health Economics. 2005, 14 (7): 687701. 10.1002/hec.959View ArticlePubMedGoogle Scholar
 Chambers R: The origins and practice of participatory rural appraisal. World Development. 1994, 27 (7): 953969. 10.1016/0305750X(94)901414. 10.1016/0305750X(94)901414View ArticleGoogle Scholar
 Hargreaves JR, Morison LA, Gear JSS, Kim JC, Makhubele MB, Porter JDH, Watts C, Pronyk PM: Assessing household wealth in health studies in developing countries: a comparison of participatory wealth ranking and survey techniques from rural South Africa. Emerging Themes in Epidemiology. 2007, 4 (1): [Epub ahead of print] 10.1186/1742762244.
 Hargreaves JR, Morison LA, Gear JSS, Porter JDH, Makhubele MB, Kim JC, Busza J, Watts C, Pronyk PM: "Hearing the voices of the poor": Assigning poverty lines on the basis of local perceptions of poverty; a quantitative analysis of qualitative data from participatory wealth ranking in rural South Africa. World Development. 2007
Copyright
This article is published under license to BioMed Central Ltd. This is an Open Access article distributed under the terms of the Creative Commons Attribution License (http://creativecommons.org/licenses/by/2.0), which permits unrestricted use, distribution, and reproduction in any medium, provided the original work is properly cited.